Distance Metric Learning for Large Margin Nearest Neighbor Classification

doi:10.5555/1577069.1577078

Open AccessJournal ArticleDOI

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Kilian Q. Weinberger, +1 more

- 01 Dec 2009 -

Journal of Machine Learning Research

- Vol. 10, Iss: 9, pp 207-244

TLDR

This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples in a globally integrated manner and finds that metrics trained in this way lead to significant improvements in kNN Classification.

Abstract:

The accuracy of k-nearest neighbor (kNN) classification depends significantly on the metric used to compute distances between different examples. In this paper, we show how to learn a Mahalanobis distance metric for kNN classification from labeled examples. The Mahalanobis metric can equivalently be viewed as a global linear transformation of the input space that precedes kNN classification using Euclidean distances. In our approach, the metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. As in support vector machines (SVMs), the margin criterion leads to a convex optimization based on the hinge loss. Unlike learning in SVMs, however, our approach requires no modification or extension for problems in multiway (as opposed to binary) classification. In our framework, the Mahalanobis distance metric is obtained as the solution to a semidefinite program. On several data sets of varying size and difficulty, we find that metrics trained in this way lead to significant improvements in kNN classification. Sometimes these results can be further improved by clustering the training examples and learning an individual metric within each cluster. We show how to learn and combine these local metrics in a globally integrated manner.

Citations

PDF

Open Access

More filters

Posted Content

Representation Learning with Contrastive Predictive Coding

Aaron van den Oord, +2 more

- 10 Jul 2018 -

arXiv: Learning

TL;DR: This work proposes a universal unsupervised learning approach to extract useful representations from high-dimensional data, which it calls Contrastive Predictive Coding, and demonstrates that the approach is able to learn useful representations achieving strong performance on four distinct domains: speech, images, text and reinforcement learning in 3D environments.

...read moreread less

Proceedings ArticleDOI

FaceNet: A Unified Embedding for Face Recognition and Clustering

Florian Schroff, +2 more

- 12 Mar 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: FaceNet as discussed by the authors uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches, and achieves state-of-the-art face recognition performance using only 128 bytes per face.

...read moreread less

Proceedings Article

Matching networks for one shot learning

Oriol Vinyals, +4 more

TL;DR: In this paper, a network that maps a small labeled support set and an unlabeled example to its label obviates the need for fine-tuning to adapt to new class types.

...read moreread less

Journal ArticleDOI

Integrated analysis of multimodal single-cell data

Yuhan Hao, +24 more

- 24 Jun 2021 -

Cell

TL;DR: Weighted-nearest neighbor analysis as mentioned in this paper is an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities.

...read moreread less

Journal ArticleDOI

A survey of transfer learning

Karl R. Weiss, +2 more

- 28 May 2016 -

Journal of Big Data

TL;DR: This survey paper formally defines transfer learning, presents information on current solutions, and reviews applications applied toTransfer learning, which can be applied to big data environments.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Convex Optimization

Stephen Boyd, +1 more

TL;DR: In this article, the focus is on recognizing convex optimization problems and then finding the most appropriate technique for solving them, and a comprehensive introduction to the subject is given. But the focus of this book is not on the optimization problem itself, but on the problem of finding the appropriate technique to solve it.

...read moreread less

Book

Principal Component Analysis

Ian T. Jolliffe

TL;DR: In this article, the authors present a graphical representation of data using Principal Component Analysis (PCA) for time series and other non-independent data, as well as a generalization and adaptation of principal component analysis.

...read moreread less

Journal ArticleDOI

Eigenfaces for recognition

Matthew Turk, +1 more

- 01 Jan 1991 -

Journal of Cognitive Neuroscience

TL;DR: A near-real-time computer system that can locate and track a subject's head, and then recognize the person by comparing characteristics of the face to those of known individuals, and that is easy to implement using a neural network architecture.

...read moreread less

Journal ArticleDOI

Normalized cuts and image segmentation

Jianbo Shi, +1 more

- 01 Aug 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.

...read moreread less

Journal ArticleDOI

Nearest neighbor pattern classification

Thomas M. Cover, +1 more

- 01 Jan 1967 -

IEEE Transactions on Information Theory

TL;DR: The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points, so it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.

...read moreread less

Collapse

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Citations

Representation Learning with Contrastive Predictive Coding

FaceNet: A Unified Embedding for Face Recognition and Clustering

Matching networks for one shot learning

Integrated analysis of multimodal single-cell data

A survey of transfer learning

References

Convex Optimization

Principal Component Analysis

Eigenfaces for recognition

Normalized cuts and image segmentation

Nearest neighbor pattern classification

Related Papers (5)

FaceNet: A unified embedding for face recognition and clustering

Deep Residual Learning for Image Recognition

Learning a similarity metric discriminatively, with application to face verification

ImageNet Classification with Deep Convolutional Neural Networks

Dimensionality Reduction by Learning an Invariant Mapping