Quad-networks: unsupervised learning to rank for interest point detection

Open AccessPosted Content

Quad-networks: unsupervised learning to rank for interest point detection

Nikolay Savinov, +4 more

- 22 Nov 2016 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

This paper is the first to propose such a formulation: training a neural network to rank points in a transformation-invariant manner, and shows that this unsupervised method performs better or on-par with baselines on two tasks.

Abstract:

Several machine learning tasks require to represent the data using only a sparse set of interest points. An ideal detector is able to find the corresponding interest points even if the data undergo a transformation typical for a given domain. Since the task is of high practical interest in computer vision, many hand-crafted solutions were proposed. In this paper, we ask a fundamental question: can we learn such detectors from scratch? Since it is often unclear what points are "interesting", human labelling cannot be used to find a truly unbiased solution. Therefore, the task requires an unsupervised formulation. We are the first to propose such a formulation: training a neural network to rank points in a transformation-invariant manner. Interest points are then extracted from the top/bottom quantiles of this ranking. We validate our approach on two tasks: standard RGB image interest point detection and challenging cross-modal interest point detection between RGB and depth images. We quantitatively show that our unsupervised method performs better or on-par with baselines.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

Mihai Dusmanu, +6 more

TL;DR: This work proposes an approach where a single convolutional neural network plays a dual role: It is simultaneously a dense feature descriptor and a feature detector, and shows that this model can be trained using pixel correspondences extracted from readily available large-scale SfM reconstructions, without any further annotations.

...read moreread less

Journal ArticleDOI

Image Matching from Handcrafted to Deep Features: A Survey

Jiayi Ma, +4 more

- 01 Jan 2021 -

International Journal of Computer Vision

TL;DR: This survey introduces feature detection, description, and matching techniques from handcrafted methods to trainable ones and provides an analysis of the development of these methods in theory and practice, and briefly introduces several typical image matching-based applications.

...read moreread less

Posted Content

R2D2: Repeatable and Reliable Detector and Descriptor.

Jerome Revaud, +6 more

- 14 Jun 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work argues that salient regions are not necessarily discriminative, and therefore can harm the performance of the description, and proposes to jointly learn keypoint detection and description together with a predictor of the local descriptor discriminativeness.

...read moreread less

Posted Content

LF-Net: Learning Local Features from Images.

Yuki Ono, +3 more

- 24 May 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A novel deep architecture and a training strategy to learn a local feature pipeline from scratch, using collections of images without the need for human supervision, and shows that it can optimize the network in a two-branch setup by confining it to one branch, while preserving differentiability in the other.

...read moreread less

Proceedings Article

D2-Net: A Trainable CNN for Joint Detection and Description of Local Features.

Mihai Dusmanu, +6 more

TL;DR: In this paper, a single CNN is simultaneously a dense feature descriptor and a feature detector, and the obtained keypoints are more stable than their traditional counterparts based on early detection of low-level structures.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Journal Article

Visualizing Data using t-SNE

Laurens van der Maaten, +1 more

- 01 Jan 2008 -

Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

...read moreread less

Collapse

arXiv: Computer Vision and Pattern Recog...

Unsupervised Feature Learning via Non-parametric Instance Discrimination

Zhirong Wu, +3 more

Quad-networks: unsupervised learning to rank for interest point detection

Citations

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

Image Matching from Handcrafted to Deep Features: A Survey

R2D2: Repeatable and Reliable Detector and Descriptor.

LF-Net: Learning Local Features from Images.

D2-Net: A Trainable CNN for Joint Detection and Description of Local Features.

References

Distinctive Image Features from Scale-Invariant Keypoints

Generative Adversarial Nets

ImageNet Large Scale Visual Recognition Challenge

Microsoft COCO: Common Objects in Context

Visualizing Data using t-SNE

Related Papers (5)

Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection

SIPS: Unsupervised Succinct Interest Points.

An information-theoretic unsupervised learning algorithm for neural networks

A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking

Unsupervised Feature Learning via Non-parametric Instance Discrimination