Journal ArticleDOI

An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes

01 Aug 2011-Pattern Recognition (Elsevier Science Inc.)-Vol. 44, Iss: 8, pp 1761-1776
TL;DR: This work develops a double study, using different base classifiers to observe the suitability and potential of each combination within each classifier, and compares the performance of these ensemble techniques with that of the classifiers themselves.
About: This article was published in Pattern Recognition on 2011-08-01. It has received 653 citations to date. The article focuses on the topics: Cascading classifiers & Random subspace method.
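
To make the two decomposition schemes concrete, here is a minimal sketch (not taken from the paper) of one-vs-one (OVO) and one-vs-all (OVA) multi-class decomposition using scikit-learn's stock wrappers; the base learner and the toy dataset are illustrative choices only.

```python
# Illustrative OVO vs. OVA decomposition; dataset and base learner are arbitrary choices.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.multiclass import OneVsOneClassifier, OneVsRestClassifier

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# One-vs-One: one binary classifier per pair of classes, combined by voting.
ovo = OneVsOneClassifier(LogisticRegression(max_iter=1000)).fit(X_tr, y_tr)
# One-vs-All (one-vs-rest): one binary classifier per class against all remaining classes.
ova = OneVsRestClassifier(LogisticRegression(max_iter=1000)).fit(X_tr, y_tr)

print("OVO accuracy:", ovo.score(X_te, y_te))
print("OVA accuracy:", ova.score(X_te, y_te))
```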
Citations
Journal ArticleDOI
TL;DR: This study shows how to design and train convolutional neural networks to decode task‐related information from the raw EEG without handcrafted features and highlights the potential of deep ConvNets combined with advanced visualization techniques for EEG‐based brain mapping.
Abstract: Deep learning with convolutional neural networks (deep ConvNets) has revolutionized computer vision through end-to-end learning, that is, learning from the raw data. There is increasing interest in using deep ConvNets for end-to-end EEG analysis, but a better understanding of how to design and train ConvNets for end-to-end EEG decoding and how to visualize the informative EEG features the ConvNets learn is still needed. Here, we studied deep ConvNets with a range of different architectures, designed for decoding imagined or executed tasks from raw EEG. Our results show that recent advances from the machine learning field, including batch normalization and exponential linear units, together with a cropped training strategy, boosted the deep ConvNets decoding performance, reaching at least as good performance as the widely used filter bank common spatial patterns (FBCSP) algorithm (mean decoding accuracies 82.1% FBCSP, 84.0% deep ConvNets). While FBCSP is designed to use spectral power modulations, the features used by ConvNets are not fixed a priori. Our novel methods for visualizing the learned features demonstrated that ConvNets indeed learned to use spectral power modulations in the alpha, beta, and high gamma frequencies, and proved useful for spatially mapping the learned features by revealing the topography of the causal contributions of features in different frequency bands to the decoding decision. Our study thus shows how to design and train ConvNets to decode task-related information from the raw EEG without handcrafted features and highlights the potential of deep ConvNets combined with advanced visualization techniques for EEG-based brain mapping. Hum Brain Mapp 38:5391-5420, 2017. © 2017 Wiley Periodicals, Inc.
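
As a rough illustration of the cropped training strategy mentioned above (a sketch under my own assumptions, not the authors' code), the snippet below cuts many overlapping windows from a single raw EEG trial so that each crop becomes an additional training example carrying the trial's label; channel count, trial length, and crop parameters are made up.

```python
# Sketch of cropped training data generation for EEG trials (illustrative parameters).
import numpy as np

def make_crops(trial, crop_len, stride):
    """Return overlapping crops of shape (n_crops, n_channels, crop_len)."""
    n_channels, n_samples = trial.shape
    starts = range(0, n_samples - crop_len + 1, stride)
    return np.stack([trial[:, s:s + crop_len] for s in starts])

trial = np.random.randn(22, 1000)                  # toy trial: 22 channels, 1000 samples
crops = make_crops(trial, crop_len=500, stride=50)
print(crops.shape)                                 # (11, 22, 500): 11 examples from one trial
```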

1,675 citations


Cites background from "An overview of ensemble methods for..."

  • ...…to investigate the potential of ConvNets for brain-signal decoding [Antoniades et al., 2016; Bashivan et al., 2016; Cecotti and Graser, 2011; Hajinoroozi et al., 2016; Lawhern et al., 2016; Liang et al., 2016; Manor et al., 2016; Manor and Geva, 2015; Page et al., 2016; Ren and Wu, 2014;…...

    [...]

Journal ArticleDOI
TL;DR: The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is considered the "de facto" standard in the framework of learning from imbalanced data because of the simplicity of its design, as well as its robustness when applied to different types of problems.
Abstract: The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is considered the "de facto" standard in the framework of learning from imbalanced data. This is due to the simplicity of its design, as well as its robustness when applied to different types of problems. Since its publication in 2002, SMOTE has proven successful in a variety of applications from several different domains. SMOTE has also inspired several approaches to counter the issue of class imbalance, and has significantly contributed to new supervised learning paradigms, including multilabel classification, incremental learning, semi-supervised learning, and multi-instance learning, among others. It is a standard benchmark for learning from imbalanced data and is featured in a number of different software packages, from open source to commercial. In this paper, marking the fifteen-year anniversary of SMOTE, we reflect on the SMOTE journey, discuss the current state of affairs with SMOTE and its applications, and identify the next set of challenges to extend SMOTE for Big Data problems.
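
The core interpolation step of SMOTE can be sketched in a few lines. The following is a simplified illustration of the idea described above (not the reference implementation): each synthetic minority point is placed on the segment between a minority sample and one of its k nearest minority-class neighbours.

```python
# Simplified SMOTE-style oversampling: interpolate between minority samples
# and their nearest minority-class neighbours (illustration only).
import numpy as np
from sklearn.neighbors import NearestNeighbors

def smote_like(X_min, n_new, k=5, seed=0):
    rng = np.random.default_rng(seed)
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)  # +1: each point is its own neighbour
    _, idx = nn.kneighbors(X_min)
    synthetic = []
    for _ in range(n_new):
        i = rng.integers(len(X_min))          # random minority sample
        j = rng.choice(idx[i][1:])            # one of its k minority neighbours
        lam = rng.random()                    # interpolation factor in [0, 1)
        synthetic.append(X_min[i] + lam * (X_min[j] - X_min[i]))
    return np.asarray(synthetic)

X_minority = np.random.randn(20, 3)           # toy minority class
print(smote_like(X_minority, n_new=40).shape) # (40, 3)
```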

905 citations

Journal ArticleDOI
TL;DR: A hybrid model in which an unsupervised DBN is trained to extract generic underlying features and a one-class SVM is trained on the features learned by the DBN; the model delivers accuracy comparable to a deep autoencoder while being scalable and computationally efficient.
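
The hybrid idea in this TL;DR (unsupervised feature learning followed by a one-class SVM) can be sketched roughly as below; PCA stands in for the unsupervised DBN feature extractor purely for brevity, so this is an assumption-laden illustration rather than the paper's model.

```python
# Rough sketch: unsupervised feature extractor (PCA as a stand-in for the DBN)
# followed by a one-class SVM for anomaly detection.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)
X_normal = rng.normal(0, 1, size=(500, 20))                 # toy "normal" training data
X_test = np.vstack([rng.normal(0, 1, size=(50, 20)),        # mostly normal...
                    rng.normal(6, 1, size=(5, 20))])        # ...plus a few anomalies

features = PCA(n_components=5).fit(X_normal)                # unsupervised feature learning
ocsvm = OneClassSVM(nu=0.05).fit(features.transform(X_normal))

pred = ocsvm.predict(features.transform(X_test))            # +1 = inlier, -1 = anomaly
print(int((pred == -1).sum()), "test points flagged as anomalous")
```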

876 citations

Journal ArticleDOI
TL;DR: An up-to-date survey on multiple classifier systems (MCS) from the point of view of Hybrid Intelligent Systems is presented, providing a vision of the spectrum of applications that are currently being developed.

856 citations


Cites background from "An overview of ensemble methods for..."

  • ...A comprehensive recent survey of binary classifier ensembles is [124]....

    [...]

Journal ArticleDOI
TL;DR: The human factors literature on intelligent systems was reviewed, and two key human performance issues related to human-agent (H-A) teaming for multirobot control, together with some promising user interface design solutions to address these issues, were discussed.
Abstract: The human factors literature on intelligent systems was reviewed in relation to the following: efficient human supervision of multiple robots, appropriate human trust in the automated systems, maintenance of human operator's situation awareness, individual differences in human-agent (H-A) interaction, and retention of human decision authority. A number of approaches-from flexible automation to autonomous agents-were reviewed, and their advantages and disadvantages were discussed. In addition, two key human performance issues (trust and situation awareness) related to H-A teaming for multirobot control and some promising user interface design solutions to address these issues were discussed. Some major individual differences factors (operator spatial ability, attentional control ability, and gaming experience) were identified that may impact H-A teaming in the context of robotics control.

354 citations


Cites background from "An overview of ensemble methods for..."

  • ...Nevertheless, the OVO strategy makes the size of the training samples even smaller, whereas the OVA strategy causes class imbalance in the training samples [13]....

    [...]

References
Book
01 Jan 1993
TL;DR: This article presents bootstrap methods for estimation using simple arguments, together with Minitab macros for implementing these methods and examples of their use.
Abstract: This article presents bootstrap methods for estimation, using simple arguments. Minitab macros for implementing these methods are given.
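
As a generic illustration of the bootstrap estimation idea (my own minimal sketch, not the book's Minitab macros), the snippet below resamples a toy dataset with replacement and uses the spread of the resampled statistic as its standard error.

```python
# Minimal bootstrap sketch: resample with replacement, recompute the statistic,
# and read its variability off the bootstrap distribution.
import numpy as np

rng = np.random.default_rng(0)
data = rng.exponential(scale=2.0, size=100)   # toy sample

boot_means = np.array([
    rng.choice(data, size=data.size, replace=True).mean()
    for _ in range(2000)
])
print("sample mean:", data.mean())
print("bootstrap standard error of the mean:", boot_means.std(ddof=1))
```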

37,183 citations

Journal ArticleDOI
Jacob Cohen1
TL;DR: In this article, the author presents a procedure for having two or more judges independently categorize a sample of units and for determining the degree and significance of their agreement, i.e., the extent to which these judgments are reproducible or reliable.
Abstract: CONSIDER Table 1. It represents in its formal characteristics a situation which arises in the clinical-social-personality areas of psychology, where it frequently occurs that the only useful level of measurement obtainable is nominal scaling (Stevens, 1951, pp. 25-26), i.e. placement in a set of k unordered categories. Because the categorizing of the units is a consequence of some complex judgment process performed by a "two-legged meter" (Stevens, 1958), it becomes important to determine the extent to which these judgments are reproducible, i.e., reliable. The procedure which suggests itself is that of having two (or more) judges independently categorize a sample of units and determine the degree, significance, and
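
The statistic this reference introduces is Cohen's kappa, the chance-corrected agreement between two raters: kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. Below is a small sketch with made-up labels, cross-checked against scikit-learn's cohen_kappa_score.

```python
# Cohen's kappa from scratch on toy ratings, verified against sklearn.
import numpy as np
from sklearn.metrics import cohen_kappa_score

rater_a = np.array([0, 0, 1, 1, 2, 2, 0, 1, 2, 0])
rater_b = np.array([0, 0, 1, 2, 2, 2, 0, 1, 1, 0])

p_o = (rater_a == rater_b).mean()                          # observed agreement
p_e = sum((rater_a == k).mean() * (rater_b == k).mean()    # chance agreement
          for k in np.unique(np.concatenate([rater_a, rater_b])))
kappa = (p_o - p_e) / (1 - p_e)

print(kappa, cohen_kappa_score(rater_a, rater_b))          # the two values should match
```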

34,965 citations



01 Jan 1998
TL;DR: Presenting a method for determining the necessary and sufficient conditions for the consistency of the learning process, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.
Abstract: A comprehensive look at learning and generalization theory. The statistical theory of learning and generalization concerns the problem of choosing desired functions on the basis of empirical data. Highly applicable to a variety of computer science and robotics fields, this book offers lucid coverage of the theory as a whole. Presenting a method for determining the necessary and sufficient conditions for the consistency of the learning process, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.

26,531 citations


"An overview of ensemble methods for..." refers methods in this paper

  • ...Although the whole data set is used to train each classifier, which prevents unseen instances from being submitted to the classifiers at testing time, it may also lead to more complex classifiers than the OVO scheme, with higher training times....

    [...]


Book
15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.
Abstract: From the Publisher: Classifier systems play a major role in machine learning and knowledge-based systems, and Ross Quinlan's work on ID3 and C4.5 is widely acknowledged to have made some of the most significant contributions to their development. This book is a complete guide to the C4.5 system as implemented in C for the UNIX environment. It contains a comprehensive guide to the system's use, the source code (about 8,800 lines), and implementation notes. The source code and sample datasets are also available on a 3.5-inch floppy diskette for a Sun workstation. C4.5 starts with large sets of cases belonging to known classes. The cases, described by any mixture of nominal and numeric properties, are scrutinized for patterns that allow the classes to be reliably discriminated. These patterns are then expressed as models, in the form of decision trees or sets of if-then rules, that can be used to classify new cases, with emphasis on making the models understandable as well as accurate. The system has been applied successfully to tasks involving tens of thousands of cases described by hundreds of properties. The book starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting. Advantages and disadvantages of the C4.5 approach are discussed and illustrated with several case studies. This book and software should be of interest to developers of classification-based intelligent systems and to students in machine learning and expert systems courses.
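
For a flavour of the kind of model C4.5 produces, the sketch below uses scikit-learn's CART-style DecisionTreeClassifier with an entropy criterion as a stand-in (an assumption for illustration; it is not Quinlan's C4.5) and prints the learned tree as human-readable if-then style rules.

```python
# Entropy-based decision tree as a rough stand-in for C4.5's information-gain splits.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0).fit(X, y)

# Print the learned tree as readable rules, mirroring C4.5's emphasis on understandable models.
print(export_text(tree, feature_names=load_iris().feature_names))
```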

21,674 citations

Book
01 Jan 1973

20,541 citations


"An overview of ensemble methods for..." refers background in this paper

  • ...In the former case, it is a regression problem, while in the latter it is a classification problem [22]....

    [...]