Journal ArticleDOI

Least Median of Squares Regression

01 Dec 1984 - Journal of the American Statistical Association (Taylor & Francis Group) - Vol. 79, Iss. 388, pp. 871-880
TL;DR: In this paper, the sum of squared residuals in least squares regression is replaced by their median, yielding an estimator that can resist the effect of nearly 50% of contamination in the data; in the special case of simple regression, it corresponds to finding the narrowest strip covering half of the observations.
Abstract: Classical least squares regression consists of minimizing the sum of the squared residuals. Many authors have produced more robust versions of this estimator by replacing the square by something else, such as the absolute value. In this article a different approach is introduced in which the sum is replaced by the median of the squared residuals. The resulting estimator can resist the effect of nearly 50% of contamination in the data. In the special case of simple regression, it corresponds to finding the narrowest strip covering half of the observations. Generalizations are possible to multivariate location, orthogonal regression, and hypothesis testing in linear models.
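
To make the estimator concrete, here is a minimal sketch of the usual resampling approximation for simple regression: fit an exact line through random pairs of points and keep the candidate whose squared residuals have the smallest median. The function name lms_line, the subset count, and the synthetic data are illustrative choices, not part of the paper.

```python
import numpy as np

def lms_line(x, y, n_subsets=1000, rng=None):
    """Approximate least-median-of-squares fit of y = a + b*x.

    Repeatedly fits an exact line through a random pair of points and keeps
    the candidate whose squared residuals have the smallest median (the
    resampling idea commonly used to approximate the LMS estimator).
    """
    rng = np.random.default_rng(rng)
    x, y = np.asarray(x, float), np.asarray(y, float)
    best = (np.inf, 0.0, 0.0)
    for _ in range(n_subsets):
        i, j = rng.choice(len(x), size=2, replace=False)
        if x[i] == x[j]:              # vertical pair, no finite slope
            continue
        b = (y[j] - y[i]) / (x[j] - x[i])
        a = y[i] - b * x[i]
        crit = np.median((y - a - b * x) ** 2)
        if crit < best[0]:
            best = (crit, a, b)
    return best[1], best[2]           # intercept, slope

# Example: 30% of the points are gross outliers, yet the LMS line stays
# close to the true relationship y = 1 + 2x.
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 100)
y = 1 + 2 * x + rng.normal(0, 0.5, 100)
y[:30] += 40                           # contaminate 30% of the data
print(lms_line(x, y, rng=1))
```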
Citations
Book
23 Nov 2005
TL;DR: The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics, and deals with the supervised learning problem for both regression and classification.
Abstract: A comprehensive and self-contained introduction to Gaussian processes, which provide a principled, practical, probabilistic approach to learning in kernel machines. Gaussian processes (GPs) provide a principled, practical, probabilistic approach to learning in kernel machines. GPs have received increased attention in the machine-learning community over the past decade, and this book provides a long-needed systematic and unified treatment of theoretical and practical aspects of GPs in machine learning. The treatment is comprehensive and self-contained, targeted at researchers and students in machine learning and applied statistics. The book deals with the supervised-learning problem for both regression and classification, and includes detailed algorithms. A wide variety of covariance (kernel) functions are presented and their properties discussed. Model selection is discussed both from a Bayesian and a classical perspective. Many connections to other well-known techniques from machine learning and statistics are discussed, including support-vector machines, neural networks, splines, regularization networks, relevance vector machines and others. Theoretical issues including learning curves and the PAC-Bayesian framework are treated, and several approximation methods for learning with large datasets are discussed. The book contains illustrative examples and exercises, and code and datasets are available on the Web. Appendixes provide mathematical background and a discussion of Gaussian Markov processes.
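
As a rough illustration of the regression case treated in the book, the sketch below computes the Gaussian-process posterior mean and variance under a squared-exponential covariance with fixed hyperparameters; the helper names and parameter values are assumptions made for this example, not code from the book.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0, variance=1.0):
    """Squared-exponential (RBF) covariance between two sets of 1-D inputs."""
    d = A[:, None] - B[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def gp_predict(x_train, y_train, x_test, noise=0.1, lengthscale=1.0):
    """GP regression posterior mean and marginal variance at x_test (zero prior mean)."""
    K = rbf_kernel(x_train, x_train, lengthscale) + noise**2 * np.eye(len(x_train))
    Ks = rbf_kernel(x_train, x_test, lengthscale)
    Kss = rbf_kernel(x_test, x_test, lengthscale)
    L = np.linalg.cholesky(K)                        # K = L L^T
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = Ks.T @ alpha
    v = np.linalg.solve(L, Ks)
    var = np.diag(Kss) - np.sum(v**2, axis=0)        # posterior marginal variance
    return mean, var

x = np.linspace(0, 5, 20)
y = np.sin(x) + 0.1 * np.random.default_rng(0).normal(size=20)
xs = np.linspace(0, 5, 50)
mu, var = gp_predict(x, y, xs)
```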

11,357 citations

Journal ArticleDOI
TL;DR: Acemoglu, Johnson, and Robinson used estimates of potential European settler mortality as an instrument for institutional variation in former European colonies today, following the lead of Curtin, who compiled data on the death rates faced by European soldiers in various overseas postings.
Abstract: In Acemoglu, Johnson, and Robinson, henceforth AJR, (2001), we advanced the hypothesis that the mortality rates faced by Europeans in different parts of the world after 1500 affected their willingness to establish settlements and choice of colonization strategy. Places that were relatively healthy (for Europeans) were—when they fell under European control—more likely to receive better economic and political institutions. In contrast, places where European settlers were less likely to go were more likely to have “extractive” institutions imposed. We also posited that this early pattern of institutions has persisted over time and influences the extent and nature of institutions in the modern world. On this basis, we proposed using estimates of potential European settler mortality as an instrument for institutional variation in former European colonies today. Data on settlers themselves are unfortunately patchy—particularly because not many went to places they believed, with good reason, to be most unhealthy. We therefore followed the lead of Curtin (1989 and 1998) who compiled data on the death rates faced by European soldiers in various overseas postings. 1 Curtin’s data were based on pathbreaking data collection and statistical work initiated by the British military in the mid-nineteenth century. These data became part of the foundation of both contemporary thinking about public health (for soldiers and for civilians) and the life insurance industry (as actuaries and executives considered the
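
The instrumental-variables strategy described in this abstract can be illustrated with a generic two-stage least squares routine on synthetic data; the variable names and numbers below are hypothetical stand-ins and are not drawn from the AJR dataset.

```python
import numpy as np

def two_stage_least_squares(y, x, z):
    """2SLS with a single endogenous regressor x and a single instrument z."""
    Z = np.column_stack([np.ones_like(z), z])
    # First stage: project the endogenous regressor onto the instrument.
    gamma, *_ = np.linalg.lstsq(Z, x, rcond=None)
    x_hat = Z @ gamma
    # Second stage: regress the outcome on the fitted values.
    Xh = np.column_stack([np.ones_like(x_hat), x_hat])
    beta, *_ = np.linalg.lstsq(Xh, y, rcond=None)
    return beta                                      # [intercept, slope]

# Synthetic illustration (not the AJR data): z plays the role of a settler
# mortality proxy, x an institutions index, y log income per capita.
rng = np.random.default_rng(0)
n = 200
z = rng.normal(size=n)
u = rng.normal(size=n)                               # unobserved confounder
x = -0.8 * z + u + rng.normal(scale=0.5, size=n)
y = 1.0 + 0.9 * x - u + rng.normal(scale=0.5, size=n)
print(two_stage_least_squares(y, x, z))              # slope estimate near 0.9
```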

6,495 citations

Book
30 Sep 2010
TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.
Abstract: Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year-old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of recipes, this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques. Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/. Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

4,146 citations


Cites methods from "Least Median of Squares Regression"

  • ...Two widely used approaches to this problem are called RANdom SAmple Consensus, or RANSAC for short (Fischler and Bolles 1981), and least median of squares (LMS) (Rousseeuw 1984)....

    [...]
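
For comparison with the least-median-of-squares approach, here is a minimal sketch of the RANSAC alternative mentioned in the snippet above: sample point pairs, count inliers within a user-chosen tolerance, and refit on the largest consensus set. The inlier tolerance is the key practical difference from LMS, which needs no such threshold; names and defaults here are illustrative, not from the cited text.

```python
import numpy as np

def ransac_line(x, y, n_iters=1000, inlier_tol=1.0, rng=None):
    """Basic RANSAC line fit: sample point pairs, keep the line with the most
    inliers, then refit by least squares on that consensus set."""
    rng = np.random.default_rng(rng)
    best_inliers = None
    for _ in range(n_iters):
        i, j = rng.choice(len(x), size=2, replace=False)
        if x[i] == x[j]:                    # vertical pair, skip
            continue
        b = (y[j] - y[i]) / (x[j] - x[i])
        a = y[i] - b * x[i]
        inliers = np.abs(y - a - b * x) < inlier_tol
        if best_inliers is None or inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # Final least-squares refit on the consensus set.
    A = np.column_stack([np.ones(best_inliers.sum()), x[best_inliers]])
    coef, *_ = np.linalg.lstsq(A, y[best_inliers], rcond=None)
    return coef                              # [intercept, slope]
```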

01 Jan 2002

2,894 citations


Cites background from "Least Median of Squares Regression"

  • ...mcd (Rousseeuw, 1984; Rousseeuw and Leroy, 1987) and covRob in library section robust....

    [...]

  • ...due to Scott (1979) and Freedman and Diaconis (1981), respectively....

    [...]

Journal Article
TL;DR: In this paper, the authors describe the EM algorithm for finding the parameters of a mixture of Gaussian densities and a hidden Markov model (HMM) for both discrete and Gaussian mixture observation models.
Abstract: We describe the maximum-likelihood parameter estimation problem and how the Expectation-Maximization (EM) algorithm can be used for its solution. We first describe the abstract form of the EM algorithm as it is often given in the literature. We then develop the EM parameter estimation procedure for two applications: 1) finding the parameters of a mixture of Gaussian densities, and 2) finding the parameters of a hidden Markov model (HMM) (i.e., the Baum-Welch algorithm) for both discrete and Gaussian mixture observation models. We derive the update equations in fairly explicit detail but we do not prove any convergence properties. We try to emphasize intuition rather than mathematical rigor.
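
As an illustration of the first application described in the abstract, the sketch below runs EM for a two-component one-dimensional Gaussian mixture; the initialisation and iteration count are arbitrary choices made for the example.

```python
import numpy as np

def gaussian_pdf(x, mu, var):
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

def em_two_gaussians(x, n_iters=100):
    """EM for a two-component 1-D Gaussian mixture (weights, means, variances)."""
    w = np.array([0.5, 0.5])
    mu = np.array([x.min(), x.max()])        # crude initialisation
    var = np.array([x.var(), x.var()])
    for _ in range(n_iters):
        # E-step: posterior responsibility of each component for each point.
        dens = np.stack([w[k] * gaussian_pdf(x, mu[k], var[k]) for k in range(2)])
        resp = dens / dens.sum(axis=0)
        # M-step: re-estimate parameters from the responsibility-weighted data.
        nk = resp.sum(axis=1)
        w = nk / len(x)
        mu = (resp @ x) / nk
        var = np.array([(resp[k] * (x - mu[k]) ** 2).sum() / nk[k] for k in range(2)])
    return w, mu, var

rng = np.random.default_rng(0)
data = np.concatenate([rng.normal(-2, 1, 300), rng.normal(3, 0.5, 200)])
print(em_two_gaussians(data))
```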

2,455 citations

References
Book
01 Jan 1966
TL;DR: This book covers fitting a straight line by least squares and checking the fit, including the Durbin-Watson test for serial correlation in the residuals, before treating the general regression situation and a range of special topics.
Abstract: Basic Prerequisite Knowledge. Fitting a Straight Line by Least Squares. Checking the Straight Line Fit. Fitting Straight Lines: Special Topics. Regression in Matrix Terms: Straight Line Case. The General Regression Situation. Extra Sums of Squares and Tests for Several Parameters Being Zero. Serial Correlation in the Residuals and the Durbin-Watson Test. More of Checking Fitted Models. Multiple Regression: Special Topics. Bias in Regression Estimates, and Expected Values of Mean Squares and Sums of Squares. On Worthwhile Regressions, Big F's, and R². Models Containing Functions of the Predictors, Including Polynomial Models. Transformation of the Response Variable. "Dummy" Variables. Selecting the "Best" Regression Equation. Ill-Conditioning in Regression Data. Ridge Regression. Generalized Linear Models (GLIM). Mixture Ingredients as Predictor Variables. The Geometry of Least Squares. More Geometry of Least Squares. Orthogonal Polynomials and Summary Data. Multiple Regression Applied to Analysis of Variance Problems. An Introduction to Nonlinear Estimation. Robust Regression. Resampling Procedures (Bootstrapping). Bibliography. True/False Questions. Answers to Exercises. Tables. Indexes.
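
Two of the book's early topics, fitting a straight line by least squares and the Durbin-Watson check for serial correlation in the residuals, can be illustrated in a few lines; the helper names and test data below are illustrative choices, not taken from the book.

```python
import numpy as np

def fit_line(x, y):
    """Ordinary least-squares fit of y = b0 + b1*x via the normal equations."""
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta, y - X @ beta                 # coefficients, residuals

def durbin_watson(residuals):
    """Durbin-Watson statistic for serial correlation in the residuals
    (values near 2 suggest little first-order autocorrelation)."""
    d = np.diff(residuals)
    return (d @ d) / (residuals @ residuals)

x = np.arange(50, dtype=float)
y = 3.0 + 0.5 * x + np.random.default_rng(0).normal(scale=2.0, size=50)
beta, resid = fit_line(x, y)
print(beta, durbin_watson(resid))
```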

18,952 citations

Journal ArticleDOI
TL;DR: In this article, a simple and robust estimator of the regression coefficient β based on Kendall's rank correlation tau is studied; the point estimator is the median of the set of slopes (Yj - Yi)/(tj - ti) joining pairs of points with ti ≠ tj, and is unbiased.
Abstract: The least squares estimator of a regression coefficient β is vulnerable to gross errors and the associated confidence interval is, in addition, sensitive to non-normality of the parent distribution. In this paper, a simple and robust (point as well as interval) estimator of β based on Kendall's [6] rank correlation tau is studied. The point estimator is the median of the set of slopes (Yj - Yi)/(tj - ti) joining pairs of points with ti ≠ tj, and is unbiased. The confidence interval is also determined by two order statistics of this set of slopes. Various properties of these estimators are studied and compared with those of the least squares and some other nonparametric estimators.
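
A minimal sketch of the point estimator described here: take the median of the pairwise slopes (Yj - Yi)/(tj - ti). The intercept below is taken as the median of y - slope*t, a common convention that this abstract does not itself specify.

```python
import numpy as np
from itertools import combinations

def theil_sen(t, y):
    """Slope estimate as the median of pairwise slopes (Y_j - Y_i)/(t_j - t_i)
    over all pairs with t_i != t_j; intercept taken as median(y - slope*t)."""
    slopes = [(y[j] - y[i]) / (t[j] - t[i])
              for i, j in combinations(range(len(t)), 2) if t[i] != t[j]]
    slope = np.median(slopes)
    intercept = np.median(y - slope * t)      # convention, not from the abstract
    return intercept, slope

t = np.array([1., 2., 3., 4., 5., 6.])
y = np.array([1.9, 4.1, 6.0, 8.2, 9.9, 30.0])  # last point is a gross error
print(theil_sen(t, y))                          # slope stays close to 2
```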

8,409 citations

Journal ArticleDOI
TL;DR: In this paper, maximum likelihood type robust regression estimates are studied in a setting where the number of parameters is allowed to grow with the number of observations; the initial terms of a formal power series expansion (essentially in powers of p/n) agree closely with Monte Carlo results, in most cases down to 4 observations per parameter.
Abstract: Maximum likelihood type robust estimates of regression are defined and their asymptotic properties are investigated both theoretically and empirically. Perhaps the most important new feature is that the number p of parameters is allowed to increase with the number n of observations. The initial terms of a formal power series expansion (essentially in powers of p/n) show an excellent agreement with Monte Carlo results, in most cases down to 4 observations per parameter.
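
Maximum likelihood type (M-) estimates of this kind are commonly computed by iteratively reweighted least squares; the sketch below uses Huber weights with a MAD-based scale estimate as one standard choice, not the specific procedure of the paper.

```python
import numpy as np

def huber_regression(x, y, c=1.345, n_iters=50):
    """Huber-type M-estimate of a straight-line fit via iteratively reweighted
    least squares; c is the usual tuning constant in units of the residual
    scale, which is re-estimated each iteration from the MAD."""
    X = np.column_stack([np.ones(len(x)), x])
    beta = np.linalg.lstsq(X, y, rcond=None)[0]      # start from ordinary LS
    for _ in range(n_iters):
        r = y - X @ beta
        scale = 1.4826 * np.median(np.abs(r - np.median(r))) + 1e-12
        w = np.minimum(1.0, c / (np.abs(r) / scale + 1e-12))   # Huber weights
        sw = np.sqrt(w)
        beta = np.linalg.lstsq(sw[:, None] * X, sw * y, rcond=None)[0]
    return beta
```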

2,221 citations

Book ChapterDOI
01 Jan 1992
TL;DR: Regression analysis is usually carried out under the assumption that one of the variables is normally distributed with constant variance, its mean being a function of the other variables; this assumption is not always satisfied and is in most cases difficult to ascertain.
Abstract: Regression analysis is usually carried out under the hypothesis that one of the variables is normally distributed with constant variance, its mean being a function of the other variables. This assumption is not always satisfied, and in most cases difficult to ascertain.

1,968 citations

Journal ArticleDOI
TL;DR: An algorithm for the analysis of multivariate data is presented and discussed in terms of specific examples; it seeks one- and two-dimensional linear projections of the data that are relatively highly revealing.
Abstract: An algorithm for the analysis of multivariate data is presented and is discussed in terms of specific examples. The algorithm seeks to find one- and two-dimensional linear projections of multivariate data that are relatively highly revealing.
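
A crude sketch of the idea: search random unit directions and keep the projection that scores highest on an "interestingness" index. Absolute excess kurtosis stands in here as a simple non-Gaussianity proxy, so this illustrates only the projection search, not the Friedman-Tukey index or optimizer itself.

```python
import numpy as np

def projection_pursuit_1d(X, n_candidates=2000, rng=None):
    """One-dimensional projection pursuit by random search: draw random unit
    directions and keep the one whose projection scores highest on a simple
    non-Gaussianity proxy (absolute excess kurtosis)."""
    rng = np.random.default_rng(rng)
    Xc = X - X.mean(axis=0)
    best_dir, best_score = None, -np.inf
    for _ in range(n_candidates):
        a = rng.normal(size=X.shape[1])
        a /= np.linalg.norm(a)
        p = Xc @ a
        p = (p - p.mean()) / p.std()
        score = abs(np.mean(p**4) - 3.0)     # deviation from Gaussian kurtosis
        if score > best_score:
            best_dir, best_score = a, score
    return best_dir, best_score

# Example: only the first coordinate is bimodal, so the chosen direction
# should point roughly along it.
rng = np.random.default_rng(0)
X = np.column_stack([np.concatenate([rng.normal(-3, 1, 200), rng.normal(3, 1, 200)]),
                     rng.normal(size=400), rng.normal(size=400)])
direction, score = projection_pursuit_1d(X, rng=1)
print(direction, score)
```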

1,635 citations