scispace - formally typeset
Proceedings ArticleDOI

Deep Semantic Feature Matching

Nikolai Ufer, +1 more
- pp 5929-5938
Reads0
Chats0
TLDR
A novel method for semantic matching with pre-trained CNN features which is based on convolutional feature pyramids and activation guided feature selection and can be transformed into a dense correspondence field.
Abstract
Estimating dense visual correspondences between objects with intra-class variation, deformations and background clutter remains a challenging problem. Thanks to the breakthrough of CNNs there are new powerful features available. Despite their easy accessibility and great success, existing semantic flow methods could not significantly benefit from these without extensive additional training. We introduce a novel method for semantic matching with pre-trained CNN features which is based on convolutional feature pyramids and activation guided feature selection. For the final matching we propose a sparse graph matching framework where each salient feature selects among a small subset of nearest neighbors in the target image. To improve our method in the unconstrained setting without bounding box annotations we introduce novel object proposal based matching constraints. Furthermore, we show that the sparse matching can be transformed into a dense correspondence field. Extensive experimental evaluations on benchmark datasets show that our method significantly outperforms existing semantic matching methods.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Image Matching from Handcrafted to Deep Features: A Survey

TL;DR: This survey introduces feature detection, description, and matching techniques from handcrafted methods to trainable ones and provides an analysis of the development of these methods in theory and practice, and briefly introduces several typical image matching-based applications.
Proceedings ArticleDOI

Learning Correspondence From the Cycle-Consistency of Time

TL;DR: A self-supervised method to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch and demonstrates the generalizability of the representation -- without finetuning -- across a range of visual correspondence tasks, including video object segmentation, keypoint tracking, and optical flow.
Proceedings ArticleDOI

Unsupervised Part-Based Disentangling of Object Shape and Appearance

TL;DR: In this paper, an unsupervised approach for disentangling appearance and shape by learning parts consistently over all instances of a category is presented, which can be applied to a wide range of object categories and diverse tasks including pose prediction, image synthesis, and video-to-video translation.
Proceedings ArticleDOI

End-to-End Weakly-Supervised Semantic Alignment

TL;DR: In this article, a differentiable soft inlier scoring module is proposed to compute the quality of the alignment based on geometrically consistent correspondences, which reduces the effect of background clutter.
Proceedings ArticleDOI

Robust Point Cloud Registration Framework Based on Deep Graph Matching

TL;DR: Wu et al. as discussed by the authors proposed a novel deep graph matching-based framework for point cloud registration, where they first transform point clouds into graphs and extract deep features for each point, then they develop a module based on Deep Graph Matching (DGM) to calculate a soft correspondence matrix.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

A mathematical theory of communication

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Related Papers (5)