Neural Network Ensembles, Cross Validation, and Active Learning

Open AccessProceedings Article

Neural Network Ensembles, Cross Validation, and Active Learning

- Vol. 7, pp 231-238

TLDR

It is shown how to estimate the optimal weights of the ensemble members using unlabeled data and how the ambiguity can be used to select new training data to be labeled in an active learning scheme.

Abstract:

Learning of continuous valued functions using neural network ensembles (committees) can give improved accuracy, reliable estimation of the generalization error, and active learning. The ambiguity is defined as the variation of the output of ensemble members averaged over unlabeled data, so it quantifies the disagreement among the networks. It is discussed how to use the ambiguity in combination with cross-validation to give a reliable estimate of the ensemble generalization error, and how this type of ensemble cross-validation can sometimes improve performance. It is shown how to estimate the optimal weights of the ensemble members using unlabeled data. By a generalization of query by committee, it is finally shown how the ambiguity can be used to select new training data to be labeled in an active learning scheme.

Citations

PDF

Open Access

More filters

Book

Neural networks for pattern recognition

Christopher M. Bishop

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.

...read moreread less

Journal ArticleDOI

Wrappers for feature subset selection

Ron Kohavi, +1 more

- 01 Dec 1997 -

Artificial Intelligence

TL;DR: The wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain and compares the wrapper approach to induction without feature subset selection and to Relief, a filter approach tofeature subset selection.

...read moreread less

Journal ArticleDOI

Statistical pattern recognition: a review

Anil K. Jain, +2 more

- 01 Jan 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The objective of this review paper is to summarize and compare some of the well-known methods used in various stages of a pattern recognition system and identify research topics and applications which are at the forefront of this exciting and challenging field.

...read moreread less

Journal ArticleDOI

On combining classifiers

Josef Kittler, +3 more

- 01 Mar 1998 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A common theoretical framework for combining classifiers which use distinct pattern representations is developed and it is shown that many existing schemes can be considered as special cases of compound classification where all the pattern representations are used jointly to make a decision.

...read moreread less

Book

Pattern recognition and neural networks

Brian D. Ripley, +1 more

TL;DR: Professor Ripley brings together two crucial ideas in pattern recognition; statistical methods and machine learning via neural networks in this self-contained account.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Original Contribution: Stacked generalization

David H. Wolpert

- 05 Feb 1992 -

Neural Networks

TL;DR: The conclusion is that for almost any real-world generalization problem one should use some version of stacked generalization to minimize the generalization error rate.

...read moreread less

Journal ArticleDOI

Neural network ensembles

Lars Kai Hansen, +1 more

- 01 Oct 1990 -

IEEE Transactions on Pattern Analysis an...

TL;DR: It is shown that the remaining residual generalization error can be reduced by invoking ensembles of similar networks, which helps improve the performance and training of neural networks for classification.

...read moreread less

Journal ArticleDOI

Neural networks and the bias/variance dilemma

Stuart Geman, +2 more

- 03 Jan 1992 -

Neural Computation

TL;DR: It is suggested that current-generation feedforward neural networks are largely inadequate for difficult problems in machine perception and machine learning, regardless of parallel-versus-serial hardware or other implementation issues.

...read moreread less

Proceedings ArticleDOI

Query by committee

H. S. Seung, +2 more

TL;DR: It is suggested that asymptotically finite information gain may be an important characteristic of good query algorithms, in which a committee of students is trained on the same data set.

...read moreread less

Proceedings Article

Information, Prediction, and Query by Committee

Yoav Freund, +3 more

TL;DR: It is shown that if the two-member committee algorithm achieves information gain with positive lower bound, then the prediction error decreases exponentially with the number of queries, and this exponential decrease holds for query learning of thresholded smooth functions.

...read moreread less

Neural Network Ensembles, Cross Validation, and Active Learning

Citations

Neural networks for pattern recognition

Wrappers for feature subset selection

Statistical pattern recognition: a review

On combining classifiers

Pattern recognition and neural networks

References

Original Contribution: Stacked generalization

Neural network ensembles

Neural networks and the bias/variance dilemma

Query by committee

Information, Prediction, and Query by Committee

Related Papers (5)

Bagging predictors

Experiments with a new boosting algorithm

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Ensemble Methods in Machine Learning

Random Forests