Deep Semantic Feature Matching

doi:10.1109/CVPR.2017.628

Proceedings ArticleDOI

Deep Semantic Feature Matching

Nikolai Ufer, +1 more

- pp 5929-5938

Chats0

TLDR

A novel method for semantic matching with pre-trained CNN features which is based on convolutional feature pyramids and activation guided feature selection and can be transformed into a dense correspondence field.

Abstract:

Estimating dense visual correspondences between objects with intra-class variation, deformations and background clutter remains a challenging problem. Thanks to the breakthrough of CNNs there are new powerful features available. Despite their easy accessibility and great success, existing semantic flow methods could not significantly benefit from these without extensive additional training. We introduce a novel method for semantic matching with pre-trained CNN features which is based on convolutional feature pyramids and activation guided feature selection. For the final matching we propose a sparse graph matching framework where each salient feature selects among a small subset of nearest neighbors in the target image. To improve our method in the unconstrained setting without bounding box annotations we introduce novel object proposal based matching constraints. Furthermore, we show that the sparse matching can be transformed into a dense correspondence field. Extensive experimental evaluations on benchmark datasets show that our method significantly outperforms existing semantic matching methods.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Image Matching from Handcrafted to Deep Features: A Survey

Jiayi Ma, +4 more

- 01 Jan 2021 -

International Journal of Computer Vision

TL;DR: This survey introduces feature detection, description, and matching techniques from handcrafted methods to trainable ones and provides an analysis of the development of these methods in theory and practice, and briefly introduces several typical image matching-based applications.

...read moreread less

Proceedings ArticleDOI

Learning Correspondence From the Cycle-Consistency of Time

Xiaolong Wang, +2 more

TL;DR: A self-supervised method to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch and demonstrates the generalizability of the representation -- without finetuning -- across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow.

...read moreread less

Proceedings ArticleDOI

Unsupervised Part-Based Disentangling of Object Shape and Appearance

Dominik Lorenz, +3 more

TL;DR: In this paper, an unsupervised approach for disentangling appearance and shape by learning parts consistently over all instances of a category is presented, which can be applied to a wide range of object categories and diverse tasks including pose prediction, image synthesis, and video-to-video translation.

...read moreread less

Proceedings ArticleDOI

End-to-End Weakly-Supervised Semantic Alignment

Ignacio Rocco, +2 more

TL;DR: In this article, a differentiable soft inlier scoring module is proposed to compute the quality of the alignment based on geometrically consistent correspondences, which reduces the effect of background clutter.

...read moreread less

Proceedings ArticleDOI

Robust Point Cloud Registration Framework Based on Deep Graph Matching

Kexue Fu, +3 more

TL;DR: Wu et al. as discussed by the authors proposed a novel deep graph matching-based framework for point cloud registration, where they first transform point clouds into graphs and extract deep features for each point, then they develop a module based on Deep Graph Matching (DGM) to calculate a soft correspondence matrix.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Journal ArticleDOI

A mathematical theory of communication

Claude E. Shannon

- 01 Jul 1948 -

Bell System Technical Journal

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.

...read moreread less

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less