Learnability and the Vapnik-Chervonenkis dimension

doi:10.1145/76359.76371

Journal ArticleDOI

Learnability and the Vapnik-Chervonenkis dimension

Anselm Blumer, +3 more

- 01 Oct 1989 -

Journal of the ACM

- Vol. 36, Iss: 4, pp 929-965

Chats0

TLDR

This paper shows that the essential condition for distribution-free learnability is finiteness of the Vapnik-Chervonenkis dimension, a simple combinatorial parameter of the class of concepts to be learned.

Abstract:

Valiant's learnability model is extended to learning classes of concepts defined by regions in Euclidean space En. The methods in this paper lead to a unified treatment of some of Valiant's results, along with previous results on distribution-free convergence of certain pattern recognition algorithms. It is shown that the essential condition for distribution-free learnability is finiteness of the Vapnik-Chervonenkis dimension, a simple combinatorial parameter of the class of concepts to be learned. Using this parameter, the complexity and closure properties of learnable classes are analyzed, and the necessary and sufficient conditions are provided for feasible learnability.

Citations

PDF

Open Access

More filters

Proceedings Article

An Almost Optimal PAC Algorithm

Hans Ulrich Simon

TL;DR: It is shown that every consistent algorithm L (even a provably suboptimal one) induces a family of PAC algorithms which come very close to optimality: the number of labeled examples needed by LK exceeds the general lower bound only by factor ‘K(1= ) where ’K denotes (a truncated version of) the K-times iterated logarithm.

...read moreread less

Proceedings ArticleDOI

Compressing and Teaching for Low VC-Dimension

Shay Moran, +3 more

TL;DR: This work shows that given an arbitrary set of labeled examples from an unknown concept in C, one can retain only a subset of exp(d) of them, in a way that allows to recover the labels of all other examples in the set, using additional exp( d) information bits.

...read moreread less

Posted Content

Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?

Zhiyuan Li, +2 more

- 16 Oct 2020 -

arXiv: Learning

TL;DR: This work describes a natural task on which a provable sample complexity gap can be shown, for standard training algorithms, and demonstrates a single target function, learning which on all possible distributions leads to an $O(1)$ vs $Omega(d^2/\varepsilon)$ gap.

...read moreread less

Proceedings Article

Algorithmic stability and uniform generalization

Ibrahim M. Alabdulmohsin

TL;DR: This paper proves that algorithmic stability in the inference process is equivalent to uniform generalization across all parametric loss functions, and establishes a relationship between algorithmic Stability and the size of the observation space, which provides a formal justification for dimensionality reduction methods.

...read moreread less

Posted Content

A Theory of Universal Learning.

Olivier Bousquet, +4 more

- 09 Nov 2020 -

arXiv: Learning

TL;DR: There are only three possible rates of universal learning, which aims to understand the performance of learning algorithms on every data distribution, but without requiring uniformity over the distribution: exponential, linear, or arbitrarily slow rates.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Johnson: Computers and Intractability-A Guide to the Theory of NP-Completeness

Michael Randolph Garey

Book

Computers and Intractability: A Guide to the Theory of NP-Completeness

Michael Randolph Garey, +1 more

TL;DR: The second edition of a quarterly column as discussed by the authors provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book "Computers and Intractability: A Guide to the Theory of NP-Completeness,” W. H. Freeman & Co., San Francisco, 1979.

...read moreread less

Book

The Art of Computer Programming

Donald Ervin Knuth

TL;DR: The arrangement of this invention provides a strong vibration free hold-down mechanism while avoiding a large pressure drop to the flow of coolant fluid.

...read moreread less

Journal ArticleDOI

Pattern Classification and Scene Analysis.

Ulf Grenander, +2 more

- 01 Sep 1974 -

Journal of the American Statistical Asso...

Book

Pattern classification and scene analysis

Richard O. Duda, +1 more

TL;DR: In this article, a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition is provided, including Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprosessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.

...read moreread less

Collapse

Learnability and the Vapnik-Chervonenkis dimension

Citations

An Almost Optimal PAC Algorithm

Compressing and Teaching for Low VC-Dimension

Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets?

Algorithmic stability and uniform generalization

A Theory of Universal Learning.

References

Johnson: Computers and Intractability-A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness

The Art of Computer Programming

Pattern Classification and Scene Analysis.

Pattern classification and scene analysis

Related Papers (5)

A theory of the learnable

On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities

An Introduction to Computational Learning Theory

Queries and Concept Learning

Estimation of Dependences Based on Empirical Data