Main description:
Our ability to generate and collect data has been increasing rapidly. Not only are all of our business, scientific, and government transactions now computerized, but the widespread use of digital cameras, publication tools, and bar codes also generate data. On the collection side, scanned text and image platforms, satellite remote sensing systems, and the World Wide Web have flooded us with a tremendous amount of data. This explosive growth has generated an even more urgent need for new techniques and automated tools that can help us transform this data into useful information and knowledge.
Like the first edition, voted the most popular data mining book by KD Nuggets readers, this book explores concepts and techniques for the discovery of patterns hidden in large data sets, focusing on issues relating to their feasibility, usefulness, effectiveness, and scalability. However, since the publication of the first edition, great progress has been made in the development of new data mining methods, systems, and applications. This new edition substantially enhances the first edition, and new chapters have been added to address recent developments on mining complex types of data including stream data, sequence data, graph structured data, social network data, and multi-relational data.
Whether you are a seasoned professional or a new student of data mining, this book has much to offer you:
- A comprehensive, practical look at the concepts and techniques you need to know to get the most out of real business data.
- Updates that incorporate input from readers, changes in the field, and more material on statistics and machine learning.
- Dozens of algorithms and implementation examples, all in easily understood pseudo-code and suitable for use in real-world, large-scale data mining projects.
- Complete classroom support for instructors at www.mkp.com/datamining2e companion site.
Review quote:
‘The second edition of Han and Kamber Data Mining: Concepts and Techniques updates and improves the already comprehensive coverage of the first edition and adds coverage of new and important topics, such as mining stream data, mining social networks, and mining spatial, multi-media and other complex data. This book will be an excellent textbook for courses on Data Mining and Knowledge Discovery.’
Gregory Piatetsky-Shapiro, President, KDnuggets
‘The second edition is the most complete and up-to-date presentation on this topic. Compared to the already comprehensive and thorough coverage of the first edition it adds the state-of-the-art research results in new topics such as mining stream, time-series and sequence data as well as mining spatial, multimedia, text and web data. This book is a "must have" for all instructors, researchers, developers and users in the area of data mining and knowledge discovery.’ Hans-Peter Kriegel, University of Munich, Germany
Table of contents:
Chapter 1: Introduction
Chapter 2: Data Warehouse and OLAP Technology for Data Mining
Chapter 3: Data Preprocessing
Chapter 4: Data Mining Primitives, Languages, and System Architectures
Chapter 5: Concept Description: Characterization and Comparison
Chapter 6: Mining Association Rules in Large Databases
Chapter 7: Classification and Prediction
Chapter 8: Cluster Analysis
Chapter 9: Mining Time-Series, Sequence, and Stream Data
Chapter 10: Mining Spatial, Multimedia, and Biological Databases
Chapter 11: Text Mining and Web Mining
Chapter 12: Visual and Audio Data Mining
Chapter 13: Data Mining Applications and Trends in Data Mining
Bibliography
Our ability to generate and collect data has been increasing rapidly. Not only are all of our business, scientific, and government transactions now computerized, but the widespread use of digital cameras, publication tools, and bar codes also generate data. On the collection side, scanned text and image platforms, satellite remote sensing systems, and the World Wide Web have flooded us with a tremendous amount of data. This explosive growth has generated an even more urgent need for new techniques and automated tools that can help us transform this data into useful information and knowledge.
Like the first edition, voted the most popular data mining book by KD Nuggets readers, this book explores concepts and techniques for the discovery of patterns hidden in large data sets, focusing on issues relating to their feasibility, usefulness, effectiveness, and scalability. However, since the publication of the first edition, great progress has been made in the development of new data mining methods, systems, and applications. This new edition substantially enhances the first edition, and new chapters have been added to address recent developments on mining complex types of data including stream data, sequence data, graph structured data, social network data, and multi-relational data.
Whether you are a seasoned professional or a new student of data mining, this book has much to offer you:
- A comprehensive, practical look at the concepts and techniques you need to know to get the most out of real business data.
- Updates that incorporate input from readers, changes in the field, and more material on statistics and machine learning.
- Dozens of algorithms and implementation examples, all in easily understood pseudo-code and suitable for use in real-world, large-scale data mining projects.
- Complete classroom support for instructors at www.mkp.com/datamining2e companion site.
Review quote:
‘The second edition of Han and Kamber Data Mining: Concepts and Techniques updates and improves the already comprehensive coverage of the first edition and adds coverage of new and important topics, such as mining stream data, mining social networks, and mining spatial, multi-media and other complex data. This book will be an excellent textbook for courses on Data Mining and Knowledge Discovery.’
Gregory Piatetsky-Shapiro, President, KDnuggets
‘The second edition is the most complete and up-to-date presentation on this topic. Compared to the already comprehensive and thorough coverage of the first edition it adds the state-of-the-art research results in new topics such as mining stream, time-series and sequence data as well as mining spatial, multimedia, text and web data. This book is a "must have" for all instructors, researchers, developers and users in the area of data mining and knowledge discovery.’ Hans-Peter Kriegel, University of Munich, Germany
Table of contents:
Chapter 1: Introduction
Chapter 2: Data Warehouse and OLAP Technology for Data Mining
Chapter 3: Data Preprocessing
Chapter 4: Data Mining Primitives, Languages, and System Architectures
Chapter 5: Concept Description: Characterization and Comparison
Chapter 6: Mining Association Rules in Large Databases
Chapter 7: Classification and Prediction
Chapter 8: Cluster Analysis
Chapter 9: Mining Time-Series, Sequence, and Stream Data
Chapter 10: Mining Spatial, Multimedia, and Biological Databases
Chapter 11: Text Mining and Web Mining
Chapter 12: Visual and Audio Data Mining
Chapter 13: Data Mining Applications and Trends in Data Mining
Bibliography