Topic

Overfitting

About: Overfitting is a research topic. Over its lifetime, 11,696 publications have been published within this topic, receiving 441,877 citations.


Papers
Book Chapter
21 Jun 2000
TL;DR: Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.
Abstract: Ensemble methods are learning algorithms that construct a set of classifiers and then classify new data points by taking a (weighted) vote of their predictions. The original ensemble method is Bayesian averaging, but more recent algorithms include error-correcting output coding, Bagging, and boosting. This paper reviews these methods and explains why ensembles can often perform better than any single classifier. Some previous studies comparing ensemble methods are reviewed, and some new experiments are presented to uncover the reasons that Adaboost does not overfit rapidly.

5,679 citations
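The abstract above describes combining classifiers by a (weighted) vote, as in Bagging. Below is a minimal sketch of bagging with an unweighted majority vote, assuming scikit-learn and NumPy are available; the synthetic dataset, the decision-tree base learner and the ensemble size are illustrative choices, not taken from the paper.

```python
# A minimal sketch of bagging with majority voting (assumes scikit-learn
# and NumPy; the synthetic dataset is purely illustrative).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

n_members = 25
members = []
for _ in range(n_members):
    # Bootstrap sample: draw training points with replacement.
    idx = rng.integers(0, len(X_train), size=len(X_train))
    tree = DecisionTreeClassifier()  # unpruned tree = high-variance base learner
    tree.fit(X_train[idx], y_train[idx])
    members.append(tree)

# Unweighted majority vote over the members' predictions.
votes = np.stack([m.predict(X_test) for m in members])   # shape: (n_members, n_test)
ensemble_pred = (votes.mean(axis=0) >= 0.5).astype(int)

print("single tree accuracy:", members[0].score(X_test, y_test))
print("ensemble accuracy:   ", (ensemble_pred == y_test).mean())
```

Because each tree sees a different bootstrap sample, the vote averages away much of the variance of the individual trees, which is one reason ensembles often outperform any single classifier.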

Posted Content
TL;DR: The authors argue that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature, which is supported by new quantitative results while giving the first explanation of the most intriguing fact about adversarial examples: their generalization across architectures and training sets.
Abstract: Several machine learning models, including neural networks, consistently misclassify adversarial examples---inputs formed by applying small but intentionally worst-case perturbations to examples from the dataset, such that the perturbed input results in the model outputting an incorrect answer with high confidence. Early attempts at explaining this phenomenon focused on nonlinearity and overfitting. We argue instead that the primary cause of neural networks' vulnerability to adversarial perturbation is their linear nature. This explanation is supported by new quantitative results while giving the first explanation of the most intriguing fact about them: their generalization across architectures and training sets. Moreover, this view yields a simple and fast method of generating adversarial examples. Using this approach to provide examples for adversarial training, we reduce the test set error of a maxout network on the MNIST dataset.

4,967 citations
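The "simple and fast method of generating adversarial examples" mentioned in the abstract is commonly known as the fast gradient sign method (FGSM): perturb the input by epsilon times the sign of the gradient of the loss with respect to the input. Below is a minimal NumPy sketch on a toy logistic (linear) model; the weights, input and epsilon value are illustrative assumptions, not from the paper.

```python
# A minimal sketch of FGSM: x_adv = x + eps * sign(grad_x loss).
# The logistic model and random weights below are illustrative only.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm_perturb(x, y, w, b, eps):
    """Perturb one input x (true label y in {0, 1}) for a logistic model."""
    p = sigmoid(w @ x + b)
    # Gradient of the cross-entropy loss with respect to the input x.
    grad_x = (p - y) * w
    return x + eps * np.sign(grad_x)

rng = np.random.default_rng(0)
w = rng.normal(size=100)           # toy weight vector
b = 0.0
x = rng.normal(size=100)           # a "clean" input
y = 1.0 if sigmoid(w @ x + b) >= 0.5 else 0.0

x_adv = fgsm_perturb(x, y, w, b, eps=0.1)
print("clean score:      ", sigmoid(w @ x + b))
print("adversarial score:", sigmoid(w @ x_adv + b))
```

Because every input component is nudged in the direction that increases the loss, even a small epsilon can shift the linear score by a large amount in high dimensions, which is exactly the linearity argument the abstract makes.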

Journal Article
TL;DR: This paper describes developments that reduce the computational costs of the underlying maximum a posteriori (MAP) algorithm, as well as statistical considerations that yield new insights into the accuracy with which the relative orientations of individual particles may be determined.

4,554 citations
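The TL;DR refers to a maximum a posteriori (MAP) algorithm. As a generic, hedged illustration of how a MAP estimate regularizes a fit and curbs overfitting relative to plain maximum likelihood, the sketch below uses the standard equivalence between a zero-mean Gaussian prior on the weights and ridge (L2-regularized) regression; it is not the cryo-EM refinement algorithm from the paper, and all numbers are illustrative.

```python
# Generic illustration (not the paper's algorithm): a MAP estimate with a
# zero-mean Gaussian prior on the weights is ridge regression, which
# shrinks the coefficients and typically overfits less than the
# maximum-likelihood (ordinary least-squares) fit.
import numpy as np

rng = np.random.default_rng(0)
n, d = 30, 25                        # few samples, many features: easy to overfit
X = rng.normal(size=(n, d))
w_true = np.zeros(d)
w_true[:3] = [2.0, -1.0, 0.5]
y = X @ w_true + 0.5 * rng.normal(size=n)

def fit(X, y, lam):
    # lam = 0  -> maximum likelihood (least squares)
    # lam > 0  -> MAP with a zero-mean Gaussian prior on the weights
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

X_test = rng.normal(size=(1000, d))
y_test = X_test @ w_true + 0.5 * rng.normal(size=1000)

for lam in [0.0, 10.0]:
    w_hat = fit(X, y, lam)
    mse = np.mean((X_test @ w_hat - y_test) ** 2)
    print(f"lambda={lam:5.1f}  test MSE={mse:.3f}")
```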

Posted Content
TL;DR: With enhanced local modeling via the micro network, the proposed deep network structure NIN is able to utilize global average pooling over feature maps in the classification layer, which is easier to interpret and less prone to overfitting than traditional fully connected layers.
Abstract: We propose a novel deep network structure called "Network In Network" (NIN) to enhance model discriminability for local patches within the receptive field. The conventional convolutional layer uses linear filters followed by a nonlinear activation function to scan the input. Instead, we build micro neural networks with more complex structures to abstract the data within the receptive field. We instantiate the micro neural network with a multilayer perceptron, which is a potent function approximator. The feature maps are obtained by sliding the micro networks over the input in a similar manner to CNN; they are then fed into the next layer. Deep NIN can be implemented by stacking multiple of the structures described above. With enhanced local modeling via the micro network, we are able to utilize global average pooling over feature maps in the classification layer, which is easier to interpret and less prone to overfitting than traditional fully connected layers. We demonstrate state-of-the-art classification performance with NIN on CIFAR-10 and CIFAR-100, and reasonable performance on the SVHN and MNIST datasets.

3,905 citations
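The NIN abstract contrasts global average pooling with fully connected classification layers. Here is a minimal NumPy sketch of global average pooling, in which each class is assigned one feature map whose spatial average serves directly as its score; the shapes and random feature maps are illustrative assumptions.

```python
# A minimal sketch of global average pooling (GAP) used in place of a
# fully connected classification layer: one feature map per class, and
# its spatial average is the class score. Shapes are illustrative.
import numpy as np

rng = np.random.default_rng(0)
n_classes, height, width = 10, 8, 8

# Output of the last conv layer: one feature map per class.
feature_maps = rng.normal(size=(n_classes, height, width))

# Global average pooling: average each map over its spatial dimensions.
class_scores = feature_maps.mean(axis=(1, 2))     # shape: (n_classes,)

# Softmax over the pooled scores gives class probabilities; there are
# no extra fully connected parameters to overfit.
probs = np.exp(class_scores - class_scores.max())
probs /= probs.sum()
print("predicted class:", int(np.argmax(probs)))
```

Because the pooled scores come straight from the feature maps, this layer adds no trainable parameters, which is why the abstract calls it less prone to overfitting than fully connected layers.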

Journal Article
30 Jan 1992 - Nature
TL;DR: In this article, a statistical quantity (R_free) is defined to measure the agreement between observed and computed structure factor amplitudes for a 'test' set of reflections that is omitted in the modelling and refinement process.
Abstract: The determination of macromolecular structure by crystallography involves fitting atomic models to the observed diffraction data [1]. The traditional measure of the quality of this fit, and presumably the accuracy of the model, is the R value. Despite stereochemical restraints [2], it is possible to overfit or 'misfit' the diffraction data: an incorrect model can be refined to fairly good R values, as several recent examples have shown [3]. Here I propose a reliable and unbiased indicator of the accuracy of such models. By analogy with the cross-validation method [4,5] of testing statistical models, I define a statistical quantity (R_free) that measures the agreement between observed and computed structure factor amplitudes for a 'test' set of reflections that is omitted in the modelling and refinement process. As examples show, there is a high correlation between R_free and the accuracy of the atomic model phases. This is useful because experimental phase information is usually inaccurate, incomplete or unavailable. I expect that R_free will provide a measure of the information content of recently proposed models of thermal motion and disorder [6-8], time-averaging [9] and bulk solvent [10].

3,714 citations
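The abstract defines a cross-validation statistic computed on a 'test' set of reflections excluded from refinement. Below is a small sketch of how such an R value could be computed from observed and calculated structure-factor amplitudes; the synthetic amplitudes, the function name and the 5% test fraction are illustrative assumptions, not values from the paper.

```python
# A minimal sketch of the free R value idea: flag a random "test" set of
# reflections, exclude it from refinement, and compute the R value on
# that set only. The synthetic amplitudes below are illustrative.
import numpy as np

def r_value(f_obs, f_calc):
    """Crystallographic R value for a set of reflections (amplitudes > 0)."""
    return np.sum(np.abs(f_obs - f_calc)) / np.sum(f_obs)

rng = np.random.default_rng(0)
n_reflections = 10_000
f_obs = rng.uniform(1.0, 100.0, size=n_reflections)          # observed amplitudes
f_calc = f_obs * rng.normal(1.0, 0.05, size=n_reflections)   # model amplitudes

# Randomly flag ~5% of reflections as the held-out ("free") test set.
is_free = rng.random(n_reflections) < 0.05

r_work = r_value(f_obs[~is_free], f_calc[~is_free])   # refined-against reflections
r_free = r_value(f_obs[is_free], f_calc[is_free])     # held-out reflections
print(f"R_work = {r_work:.3f}   R_free = {r_free:.3f}")
```

In an actual refinement the model would be fitted only against the working reflections, so a large gap between the free and working R values signals overfitting; here both sets are synthetic, so the two values are similar by construction.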


Network Information
Related Topics (5)
Deep learning: 79.8K papers, 2.1M citations, 92% related
Support vector machine: 73.6K papers, 1.7M citations, 92% related
Convolutional neural network: 74.7K papers, 2M citations, 91% related
Artificial neural network: 207K papers, 4.5M citations, 91% related
Feature extraction: 111.8K papers, 2.1M citations, 89% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    2,065
2022    3,968
2021    2,035
2020    1,973
2019    1,503
2018    1,120