Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Abstract: Data Mining: Practical Machine Learning Tools and Techniques offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining. Thorough updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including new material on Data Transformations, Ensemble Learning, Massive Data Sets, and Multi-instance Learning, plus a new version of the popular Weka machine learning software developed by the authors. Witten, Frank, and Hall include both tried-and-true techniques of today and methods at the leading edge of contemporary research.
• Provides a thorough grounding in machine learning concepts as well as practical advice on applying the tools and techniques to your data mining projects.
• Offers concrete tips and techniques for performance improvement that work by transforming the input or output of machine learning methods.
• Includes the downloadable Weka software toolkit, a collection of machine learning algorithms for data mining tasks, in an updated, interactive interface. Algorithms in the toolkit cover data pre-processing, classification, regression, clustering, association rules, and visualization.
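To make the toolkit description concrete, here is a minimal, hedged sketch of driving Weka from its Java API: load an ARFF dataset, cross-validate a decision tree learner, and print the model. The file name weather.arff and the choice of the J48 learner are illustrative assumptions, not something the book's blurb prescribes.

```java
// Minimal sketch: training and evaluating a Weka classifier on an ARFF file.
// Assumes weka.jar is on the classpath and "weather.arff" is a local dataset
// path supplied by the reader (both are illustrative assumptions).
import weka.classifiers.Evaluation;
import weka.classifiers.trees.J48;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

import java.util.Random;

public class WekaQuickStart {
    public static void main(String[] args) throws Exception {
        // Load a dataset in Weka's ARFF format.
        Instances data = DataSource.read("weather.arff");
        // By convention the last attribute is the class to predict.
        data.setClassIndex(data.numAttributes() - 1);

        // J48 is Weka's implementation of the C4.5 decision tree learner.
        J48 tree = new J48();

        // Estimate accuracy with 10-fold cross-validation.
        Evaluation eval = new Evaluation(data);
        eval.crossValidateModel(tree, data, 10, new Random(1));
        System.out.println(eval.toSummaryString("\n=== 10-fold CV ===\n", false));

        // Build a final model on all the data and print the tree.
        tree.buildClassifier(data);
        System.out.println(tree);
    }
}
```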
Citations
Book
08 Sep 2000
TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Abstract: The increasing volume of data in modern business and science calls for more complex and sophisticated tools. Although advances in data mining technology have made extensive data collection much easier, the field is still evolving and there is a constant need for new techniques and tools that can help us transform this data into useful information and knowledge. Since the previous edition's publication, great advances have been made in the field of data mining. Not only does the third edition of Data Mining: Concepts and Techniques continue the tradition of equipping you with an understanding and application of the theory and practice of discovering patterns hidden in large data sets, it also focuses on new, important topics in the field: data warehouses and data cube technology, mining stream data, mining social networks, and mining spatial, multimedia and other complex data. Each chapter is a stand-alone guide to a critical topic, presenting proven algorithms and sound implementations ready to be used directly or with strategic modification against live data. This is the resource you need if you want to apply today's most powerful data mining techniques to meet real business challenges.
• Presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects.
• Addresses advanced topics such as mining object-relational databases, spatial databases, multimedia databases, time-series databases, text databases, the World Wide Web, and applications in several fields.
• Provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.

23,600 citations

Journal ArticleDOI
TL;DR: This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.
Abstract: More than twelve years have elapsed since the first public release of WEKA. In that time, the software has been rewritten entirely from scratch, evolved substantially and now accompanies a text on data mining [35]. These days, WEKA enjoys widespread acceptance in both academia and business, has an active community, and has been downloaded more than 1.4 million times since being placed on SourceForge in April 2000. This paper provides an introduction to the WEKA workbench, reviews the history of the project, and, in light of the recent 3.6 stable release, briefly discusses what has been added since the last stable version (Weka 3.4) released in 2003.

19,603 citations

Journal ArticleDOI
TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weaknesses in two examples.
Abstract: Classification and regression trees are machine-learning methods for constructing prediction models from data. The models are obtained by recursively partitioning the data space and fitting a simple prediction model within each partition. As a result, the partitioning can be represented graphically as a decision tree. Classification trees are designed for dependent variables that take a finite number of unordered values, with prediction error measured in terms of misclassification cost. Regression trees are for dependent variables that take continuous or ordered discrete values, with prediction error typically measured by the squared difference between the observed and predicted values. This article gives an introduction to the subject by reviewing some widely available algorithms and comparing their capabilities, strengths, and weaknesses in two examples. © 2011 John Wiley & Sons, Inc. WIREs Data Mining Knowl Discov 2011, 1: 14-23. DOI: 10.1002/widm.8
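As a concrete illustration of the classification/regression split described above, the hedged sketch below cross-validates a classification tree on a nominal-class dataset and a regression tree on a numeric-class dataset using Weka. The article itself is tool-agnostic; the dataset file names and the choice of the J48 and REPTree learners are assumptions made purely for illustration.

```java
// Hedged sketch: classification trees are scored by error rate (a proxy for
// misclassification cost), regression trees by squared error (RMSE).
// The ARFF file names below are placeholders for the reader's own data.
import weka.classifiers.Evaluation;
import weka.classifiers.trees.J48;
import weka.classifiers.trees.REPTree;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

import java.util.Random;

public class TreeFlavours {
    public static void main(String[] args) throws Exception {
        // Classification tree: nominal class, evaluated by error rate.
        Instances iris = DataSource.read("iris.arff");
        iris.setClassIndex(iris.numAttributes() - 1);
        Evaluation clsEval = new Evaluation(iris);
        clsEval.crossValidateModel(new J48(), iris, 10, new Random(1));
        System.out.printf("classification error: %.3f%n", clsEval.errorRate());

        // Regression tree: numeric class, evaluated by squared error (RMSE).
        Instances housing = DataSource.read("housing.arff");
        housing.setClassIndex(housing.numAttributes() - 1);
        Evaluation regEval = new Evaluation(housing);
        regEval.crossValidateModel(new REPTree(), housing, 10, new Random(1));
        System.out.printf("regression RMSE: %.3f%n", regEval.rootMeanSquaredError());
    }
}
```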

16,974 citations

Journal ArticleDOI
TL;DR: A basic taxonomy of feature selection techniques is provided, providing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.
Abstract: Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques. In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications. Contact: yvan.saeys@psb.ugent.be Supplementary information: http://bioinformatics.psb.ugent.be/supplementary_data/yvsae/fsreview
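Because WEKA appears in the review's table of general-purpose feature selection software (quoted further below), a small filter-style example is sketched here: attributes are ranked by information gain with respect to the class and the top-ranked ones are kept. The dataset name and the cutoff of 20 attributes are illustrative assumptions; the review itself surveys many other filter, wrapper, and embedded techniques.

```java
// Hedged sketch of a univariate filter method in Weka: rank attributes by
// information gain and keep the best 20. "microarray.arff" is a placeholder.
import weka.attributeSelection.AttributeSelection;
import weka.attributeSelection.InfoGainAttributeEval;
import weka.attributeSelection.Ranker;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

public class FilterFS {
    public static void main(String[] args) throws Exception {
        Instances data = DataSource.read("microarray.arff");
        data.setClassIndex(data.numAttributes() - 1);

        // Evaluator scores each attribute against the class; Ranker sorts them.
        AttributeSelection selector = new AttributeSelection();
        InfoGainAttributeEval evaluator = new InfoGainAttributeEval();
        Ranker ranker = new Ranker();
        ranker.setNumToSelect(20);          // keep the 20 highest-ranked features
        selector.setEvaluator(evaluator);
        selector.setSearch(ranker);
        selector.SelectAttributes(data);

        // Print the names of the selected attributes
        // (Weka appends the class index to this array).
        for (int idx : selector.selectedAttributes()) {
            System.out.println(data.attribute(idx).name());
        }
    }
}
```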

4,706 citations


Cites methods from "Data Mining: Practical Machine Lear..."

  • ...On this website, the publications are indexed according to the FS technique used, a number of keywords accompanying each reference to understand its FS methodological contributions....

    [...]

  • ...Software for feature selection:
    General purpose FS software
      WEKA (Java), Witten and Frank (2005): http://www.cs.waikato.ac.nz/ml/weka
      Fast Correlation Based Filter (Java), Yu and Liu (2004): http://www.public.asu.edu/~huanliu/FCBF/FCBFsoftware.html
      Feature Selection Book (Ansi C), Liu and Motoda (1998): http://www.public.asu.edu/~huanliu/Fsbook
      MLC++ (C++), Kohavi et al. (1996): http://www.sgi.com/tech/mlc
      Spider (Matlab): http://www.kyb.tuebingen.mpg.de/bs/people/spider
      SVM and Kernel Methods Matlab Toolbox (Matlab), Canu et al. (2003): http://asi.insa-rouen.fr/~arakotom/toolbox/index
    Microarray analysis FS software
      SAM (R, Excel), Tusher et al. (2001): http://www-stat.stanford.edu/~tibs/SAM/
      GALGO (R), Trevino and Falciani (2006): http://www.bip.bham.ac.uk/bioinf/galgo.html
      PCP (C, C++), Buturovic (2005): http://pcp.sourceforge.net
      GA-KNN (C), Li et al. (2001): http://dir.niehs.nih.gov/microarray/datamining/
      Rankgene (C), Su et al. (2003): http://genomics10.bu.edu/yangsu/rankgene/
      EDGE (R), Leek et al. (2006): http://www.biostat.washington.edu/software/jstorey/edge/
      GEPAS-Prophet (Perl, C), Medina et al. (2007): http://prophet.bioinfo.cipf.es/
      DEDS (Bioconductor, R), Yang et al. (2005): http://www.bioconductor.org/
      RankProd (Bioconductor, R), Breitling et al. (2004): http://www.bioconductor.org/
      Limma (Bioconductor, R), Smyth (2004): http://www.bioconductor.org/
      Multtest (Bioconductor, R), Dudoit et al. (2003): http://www.bioconductor.org/
      Nudge (Bioconductor, R), Dean and Raftery (2005): http://www.bioconductor.org/
      Qvalue (Bioconductor, R), Storey (2002): http://www.bioconductor.org/
      twilight (Bioconductor, R), Scheid and Spang (2005): http://www.bioconductor.org/
      ComparativeMarkerSelection (GenePattern) (JAVA, R), Gould et al. (2006): http://www.broad.mit.edu/genepattern
    Mass spectra analysis FS software
      GA-KNN (C), Li et al. (2004): http://dir.niehs.nih.gov/microarray/datamining/
      R-SVM (R, C, C++), Zhang et al. (2006): http://www.hsph.harvard.edu/bioinfocore/RSVMhome/R-SVM.html
    SNP analysis FS software
      CHOISS (C++, Perl), Lee and Kang (2004): http://biochem.kaist.ac.kr/choiss.htm
      MLR-tagging (C), He and Zelikovsky (2006): http://alla.cs.gsu.ed/ software/tagging/tagging.html
      WCLUSTAG (JAVA), Sham et al. (2007): http://bioinfo.hku.hk/wclustag
    ... discusses the use of feature selection for a document classification task....

    [...]

Book ChapterDOI
21 Apr 2004
TL;DR: This is the first work to investigate performance of recognition algorithms with multiple, wire-free accelerometers on 20 activities using datasets annotated by the subjects themselves, and suggests that multiple accelerometers aid in recognition.
Abstract: In this work, algorithms are developed and evaluated to detect physical activities from data acquired using five small biaxial accelerometers worn simultaneously on different parts of the body. Acceleration data was collected from 20 subjects without researcher supervision or observation. Subjects were asked to perform a sequence of everyday tasks but not told specifically where or how to do them. Mean, energy, frequency-domain entropy, and correlation of acceleration data was calculated and several classifiers using these features were tested. Decision tree classifiers showed the best performance recognizing everyday activities with an overall accuracy rate of 84%. The results show that although some activities are recognized well with subject-independent training data, others appear to require subject-specific training data. The results suggest that multiple accelerometers aid in recognition because conjunctions in acceleration feature values can effectively discriminate many activities. With just two biaxial accelerometers - thigh and wrist - the recognition performance dropped only slightly. This is the first work to investigate performance of recognition algorithms with multiple, wire-free accelerometers on 20 activities using datasets annotated by the subjects themselves.
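A hedged sketch of the four window features named in the abstract is given below. The window length, normalisation, and exact feature definitions used in the paper may differ, so treat this as an approximation of the idea rather than the authors' code.

```java
// Hedged sketch of the window features named in the abstract (mean, energy,
// frequency-domain entropy, correlation), computed with a naive DFT.
// Window length and normalisation are assumptions, not the paper's settings.
public class AccelFeatures {

    static double mean(double[] w) {
        double s = 0;
        for (double v : w) s += v;
        return s / w.length;
    }

    // Magnitudes of the DFT coefficients, skipping the DC component.
    static double[] dftMagnitudes(double[] w) {
        int n = w.length;
        double[] mag = new double[n / 2];
        for (int k = 1; k <= n / 2; k++) {
            double re = 0, im = 0;
            for (int t = 0; t < n; t++) {
                double ang = 2 * Math.PI * k * t / n;
                re += w[t] * Math.cos(ang);
                im -= w[t] * Math.sin(ang);
            }
            mag[k - 1] = Math.sqrt(re * re + im * im);
        }
        return mag;
    }

    // Energy: average squared DFT magnitude over the window.
    static double energy(double[] w) {
        double s = 0;
        for (double m : dftMagnitudes(w)) s += m * m;
        return s / w.length;
    }

    // Frequency-domain entropy: entropy of the normalised magnitude spectrum.
    static double spectralEntropy(double[] w) {
        double[] mag = dftMagnitudes(w);
        double total = 0;
        for (double m : mag) total += m;
        double h = 0;
        for (double m : mag) {
            if (m > 0) {
                double p = m / total;
                h -= p * Math.log(p);
            }
        }
        return h;
    }

    // Pearson correlation between two axes, e.g. thigh-x vs wrist-x.
    static double correlation(double[] a, double[] b) {
        double ma = mean(a), mb = mean(b), num = 0, da = 0, db = 0;
        for (int i = 0; i < a.length; i++) {
            num += (a[i] - ma) * (b[i] - mb);
            da += (a[i] - ma) * (a[i] - ma);
            db += (b[i] - mb) * (b[i] - mb);
        }
        return num / Math.sqrt(da * db);
    }
}
```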

3,223 citations

References
Book
01 Sep 1988
TL;DR: In this book, the author brings together the computer techniques, mathematical tools, and research results that enable both students and practitioners to apply genetic algorithms to problems in many fields, assuming only a minimum of computer programming and mathematics background.
Abstract: From the Publisher: This book brings together - in an informal and tutorial fashion - the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields. Major concepts are illustrated with running examples, and major algorithms are illustrated by Pascal computer programs. No prior knowledge of GAs or genetics is assumed, and only a minimum of computer programming and mathematics background is required.
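The blurb mentions Pascal programs but does not reproduce them. Purely as an illustration of the selection, crossover, and mutation loop a genetic algorithm runs, here is a minimal Java sketch on a toy bit-string ("OneMax") problem; the population size, rates, and fitness function are arbitrary choices, not taken from the book.

```java
import java.util.Random;

// Hedged sketch of a canonical generational GA on the OneMax toy problem:
// maximise the number of 1-bits in a fixed-length bit string.
public class SimpleGA {
    static final int LEN = 50, POP = 40, GENS = 100;
    static final double CROSS = 0.8, MUT = 1.0 / LEN;   // illustrative settings
    static final Random rnd = new Random(1);

    // Fitness: number of 1-bits (to be maximised).
    static int fitness(boolean[] g) {
        int f = 0;
        for (boolean b : g) if (b) f++;
        return f;
    }

    // Binary tournament selection.
    static boolean[] select(boolean[][] pop) {
        boolean[] a = pop[rnd.nextInt(POP)], b = pop[rnd.nextInt(POP)];
        return fitness(a) >= fitness(b) ? a : b;
    }

    public static void main(String[] args) {
        boolean[][] pop = new boolean[POP][LEN];
        for (boolean[] g : pop)
            for (int i = 0; i < LEN; i++) g[i] = rnd.nextBoolean();

        for (int gen = 0; gen < GENS; gen++) {
            boolean[][] next = new boolean[POP][];
            for (int i = 0; i < POP; i++) {
                boolean[] p1 = select(pop), p2 = select(pop);
                boolean[] child = p1.clone();
                if (rnd.nextDouble() < CROSS) {          // one-point crossover
                    int cut = rnd.nextInt(LEN);
                    for (int j = cut; j < LEN; j++) child[j] = p2[j];
                }
                for (int j = 0; j < LEN; j++)            // bit-flip mutation
                    if (rnd.nextDouble() < MUT) child[j] = !child[j];
                next[i] = child;
            }
            pop = next;
        }

        int best = 0;
        for (boolean[] g : pop) best = Math.max(best, fitness(g));
        System.out.println("best fitness after " + GENS + " generations: " + best);
    }
}
```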

52,797 citations

Book
Vladimir Vapnik1
01 Jan 1995
TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?
Abstract: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

40,147 citations

Journal ArticleDOI
TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
Abstract: The support-vector network is a new learning machine for two-group classification problems. The machine conceptually implements the following idea: input vectors are non-linearly mapped to a very high-dimension feature space. In this feature space a linear decision surface is constructed. Special properties of the decision surface ensure high generalization ability of the learning machine. The idea behind the support-vector network was previously implemented for the restricted case where the training data can be separated without errors. We here extend this result to non-separable training data. High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated. We also compare the performance of the support-vector network to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.
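As a rough illustration of the ideas in the abstract, the sketch below trains Weka's SMO support vector classifier with a polynomial kernel: the kernel plays the role of the "polynomial input transformation" and the C parameter handles non-separable data. The dataset name, kernel degree, and C value are assumptions, and SMO is not the training algorithm used in the original paper.

```java
// Hedged sketch: a soft-margin SVM with a degree-3 polynomial kernel in Weka.
// "digits.arff" is a placeholder for a two-class (or multi-class) dataset.
import weka.classifiers.Evaluation;
import weka.classifiers.functions.SMO;
import weka.classifiers.functions.supportVector.PolyKernel;
import weka.core.Instances;
import weka.core.converters.ConverterUtils.DataSource;

import java.util.Random;

public class PolySvm {
    public static void main(String[] args) throws Exception {
        Instances data = DataSource.read("digits.arff");
        data.setClassIndex(data.numAttributes() - 1);

        // Polynomial kernel: implicit non-linear mapping of the inputs.
        PolyKernel kernel = new PolyKernel();
        kernel.setExponent(3.0);

        SMO svm = new SMO();
        svm.setKernel(kernel);
        svm.setC(1.0);   // soft-margin constant: tolerance for non-separable data

        Evaluation eval = new Evaluation(data);
        eval.crossValidateModel(svm, data, 10, new Random(1));
        System.out.println(eval.toSummaryString());
    }
}
```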

37,861 citations

Book
01 Jan 1993
TL;DR: This article presents bootstrap methods for estimation, using simple arguments, along with Minitab macros for implementing these methods and examples of how they can be used.
Abstract: This article presents bootstrap methods for estimation, using simple arguments. Minitab macros for implementing these methods are given.
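The abstract does not show the procedure itself, so here is a minimal sketch of the basic nonparametric bootstrap it refers to, estimating the standard error of a sample mean. The data values and the number of replications are made up for illustration; the Minitab macros mentioned above are not reproduced here.

```java
import java.util.Random;

// Hedged sketch of the nonparametric bootstrap: resample the data with
// replacement many times and use the spread of the resampled statistic as an
// estimate of its standard error. Sample values and B = 2000 are illustrative.
public class Bootstrap {
    public static void main(String[] args) {
        double[] x = {4.2, 5.1, 3.9, 6.3, 5.5, 4.8, 7.0, 5.9, 4.4, 6.1};
        int B = 2000;
        Random rnd = new Random(1);

        double[] thetaStar = new double[B];
        for (int b = 0; b < B; b++) {
            // Draw a resample of the same size, with replacement.
            double sum = 0;
            for (int i = 0; i < x.length; i++) sum += x[rnd.nextInt(x.length)];
            thetaStar[b] = sum / x.length;          // statistic: the mean
        }

        // Bootstrap estimate of the standard error = std. dev. of the replicates.
        double m = 0;
        for (double t : thetaStar) m += t;
        m /= B;
        double v = 0;
        for (double t : thetaStar) v += (t - m) * (t - m);
        System.out.printf("bootstrap SE of the mean: %.4f%n", Math.sqrt(v / (B - 1)));
    }
}
```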

37,183 citations

Journal ArticleDOI
Jacob Cohen1
TL;DR: In this article, the author presents a procedure for having two or more judges independently categorize a sample of units and for determining the degree and significance of their agreement, so that the extent to which such judgments are reproducible, i.e., reliable, can be assessed.
Abstract: Consider Table 1. It represents in its formal characteristics a situation which arises in the clinical-social-personality areas of psychology, where it frequently occurs that the only useful level of measurement obtainable is nominal scaling (Stevens, 1951, pp. 25-26), i.e. placement in a set of k unordered categories. Because the categorizing of the units is a consequence of some complex judgment process performed by a "two-legged meter" (Stevens, 1958), it becomes important to determine the extent to which these judgments are reproducible, i.e., reliable. The procedure which suggests itself is that of having two (or more) judges independently categorize a sample of units and determine the degree, significance, and
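This is the paper that introduces Cohen's kappa coefficient of agreement. As a hedged illustration of the statistic, the sketch below computes kappa from a k x k table in which rows are judge A's categories and columns are judge B's; the example counts are invented.

```java
// Hedged sketch: Cohen's kappa from a k x k contingency table of two judges'
// category assignments. The counts in main() are made up for illustration.
public class CohensKappa {

    static double kappa(int[][] table) {
        int k = table.length;
        double n = 0, agree = 0;
        double[] rowTot = new double[k], colTot = new double[k];
        for (int i = 0; i < k; i++) {
            for (int j = 0; j < k; j++) {
                n += table[i][j];
                rowTot[i] += table[i][j];
                colTot[j] += table[i][j];
                if (i == j) agree += table[i][j];
            }
        }
        double po = agree / n;                 // observed proportion of agreement
        double pe = 0;                         // agreement expected by chance
        for (int i = 0; i < k; i++) pe += (rowTot[i] / n) * (colTot[i] / n);
        return (po - pe) / (1 - pe);
    }

    public static void main(String[] args) {
        int[][] judgments = {
            {25, 3, 2},
            {4, 30, 1},
            {1, 2, 32}
        };
        System.out.printf("kappa = %.3f%n", kappa(judgments));
    }
}
```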

34,965 citations