scispace - formally typeset
Open AccessJournal ArticleDOI

Interactive machine learning: letting users build classifiers

Reads0
Chats0
TLDR
It is shown that appropriate techniques can empower users to create models that compete with classifiers built by state-of-the-art learning algorithms, and that small expert-defined models offer the additional advantage that they will generally be more intelligible than those generated by automatic techniques.
Abstract
According to standard procedure, building a classifier using machine learning is a fully automated process that follows the preparation of training data by a domain expert. In contrast, interactive machine learning engages users in actually generating the classifier themselves. This offers a natural way of integrating background knowledge into the modelling stage—as long as interactive tools can be designed that support efficient and effective communication. This paper shows that appropriate techniques can empower users to create models that compete with classifiers built by state-of-the-art learning algorithms. It demonstrates that users—even users who are not domain experts—can often construct good classifiers, without any help from a learning algorithm, using a simple two-dimensional visual interface. Experiments on real data demonstrate that, not surprisingly, success hinges on the domain: if a few attributes can support good predictions, users generate accurate classifiers, whereas domains with many high-order attribute interactions favour standard machine learning techniques. We also present an artificial example where domain knowledge allows an “expert user” to create a much more accurate model than automatic learning algorithms. These results indicate that our system has the potential to produce highly accurate classifiers in the hands of a domain expert who has a strong interest in the domain and therefore some insights into how to partition the data. Moreover, small expert-defined models offer the additional advantage that they will generally be more intelligible than those generated by automatic techniques.

read more

Content maybe subject to copyright    Report






Citations
More filters
Journal ArticleDOI

Power to the People: The Role of Humans in Interactive Machine Learning

TL;DR: It is argued that the design process for interactive machine learning systems should involve users at all stages: explorations that reveal human interaction patterns and inspire novel interaction methods, as well as refinement stages to tune details of the interface and choose among alternatives.
Journal ArticleDOI

Decision trees: a recent overview

TL;DR: Basic decision tree issues and current research points are described, guiding the researcher in interesting research directions and suggesting possible bias combinations that have yet to be explored.
Journal ArticleDOI

From visual data exploration to visual data mining: a survey

TL;DR: This work surveys work on the different uses of graphical mapping and interaction techniques for visual data mining of large data sets represented as table data and reviews recent innovative approaches that attempt to integrate visualization into the DM/KDD process, using it to enhance user interaction and comprehension.
Proceedings ArticleDOI

EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers

TL;DR: EnsembleMatrix is an interactive visualization system that presents a graphical view of confusion matrices to help users understand relative merits of various classifiers and allows users to directly interact with the visualizations in order to explore and build combination models.
Journal ArticleDOI

Online learning: A comprehensive survey

TL;DR: Online learning as mentioned in this paper is a family of machine learning methods, where a learner attempts to tackle some predictive (or any type of decision-making) task by learning from a sequence of data instances one by one at each time.
References
More filters
Book

C4.5: Programs for Machine Learning

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
Journal ArticleDOI

A system for induction of oblique decision trees

TL;DR: In this paper, a new system for induction of oblique decision trees, OC1, combines deterministic hill-climbing with two forms of randomization to find a good oblique split (in the form of a hyperplane) at each node of a decision tree.
Journal ArticleDOI

XGobi: Interactive Dynamic Data Visualization in the X Window System

TL;DR: XGobi is a data visualization system with state-of-the-art interactive and dynamic methods for the manipulation of views of data that implements 2-D displays of projections of points and lines in high-dimensional spaces, as well as parallel coordinate displays and textual views thereof.
Journal ArticleDOI

Induction of ripple-down rules applied to modeling large databases

TL;DR: A methodology for the modeling of large data sets is described which results in rule sets having minimal inter-rule interactions, and being simply maintained.