This volume was born from the experience of the authors as researchers and educators,whichsuggeststhatmanystudentsofdataminingarehandicapped in their research by the lack of a formal, systematic education in its mat- matics. The data mining literature contains many excellent titles that address the needs of users with a variety of interests ranging from decision making to p- tern investigation in biological data. However, these books do not deal with the mathematical tools that are currently needed by data mining researchers and doctoral students. We felt it timely to produce a book that integrates the mathematics of data mining with its applications. We emphasize that this book is about mathematical tools for data mining and not about data mining itself; despite this, a substantial amount of applications of mathematical c- cepts in data mining are presented. The book is intended as a reference for the working data miner. In our opinion, three areas of mathematics are vital for data mining: set theory,includingpartially orderedsetsandcombinatorics;linear algebra,with its many applications in principal component analysis and neural networks; and probability theory, which plays a foundational role in statistics, machine learning and data mining. Thisvolumeisdedicatedtothestudyofset-theoreticalfoundationsofdata mining. Two further volumes are contemplated that will cover linear algebra and probability theory. The ?rst part of this book, dedicated to set theory, begins with a study of functionsandrelations.Applicationsofthesefundamentalconceptstosuch- sues as equivalences and partitions are discussed. Also, we prepare the ground for the following volumes by discussing indicator functions, ?elds and?-?elds, and other concepts.
From the book reviews:
"This textbook is appropriate for an advanced undergraduate or graduate mathematics elective class. All theorems are proved, notation is standard, and ample exercise sets are included at the end of every chapter. ... Mathematical Tools for Data Mining: Set Theory, Partial Orders, Combinatorics is more than just the data-mining reference book. It is highly readable textbook that successfully connects classic, theoretical mathematics to an enormously popular current application in modern society." (Susan D'Agostino, MAA Reviews, March, 2015)
"The goal of this book is to present the basic mathematical theory and principles used in data mining tools and techniques. ... Graduate or advanced undergraduate students with prior coursework in mathematics will find this book a useful collection of the fundamental mathematical ideas ... . The exposition of concepts is clear and readable. Comfort with mathematical notation is necessary, since the book makes significant use of such notation. Several exercises are included, with solutions being provided in outline." (R. M. Malyankar, Computing Reviews, September, 2014)
"This textbook is appropriate for an advanced undergraduate or graduate mathematics elective class. All theorems are proved, notation is standard, and ample exercise sets are included at the end of every chapter. ... Mathematical Tools for Data Mining: Set Theory, Partial Orders, Combinatorics is more than just the data-mining reference book. It is highly readable textbook that successfully connects classic, theoretical mathematics to an enormously popular current application in modern society." (Susan D'Agostino, MAA Reviews, March, 2015)
"The goal of this book is to present the basic mathematical theory and principles used in data mining tools and techniques. ... Graduate or advanced undergraduate students with prior coursework in mathematics will find this book a useful collection of the fundamental mathematical ideas ... . The exposition of concepts is clear and readable. Comfort with mathematical notation is necessary, since the book makes significant use of such notation. Several exercises are included, with solutions being provided in outline." (R. M. Malyankar, Computing Reviews, September, 2014)