Open Access · Posted Content

Hyperpixel Flow: Semantic Correspondence with Multi-layer Neural Features

TLDR
Hyperpixel Flow represents images by "hyperpixels" that leverage a small number of relevant features selected from early to late layers of a convolutional neural network, and exploits these condensed features for effective real-time semantic matching.
Abstract
Establishing visual correspondences under large intra-class variations requires analyzing images at different levels, from features linked to semantics and context to local patterns, while being invariant to instance-specific details. To tackle these challenges, we represent images by "hyperpixels" that leverage a small number of relevant features selected among early to late layers of a convolutional neural network. Taking advantage of the condensed features of hyperpixels, we develop an effective real-time matching algorithm based on Hough geometric voting. The proposed method, hyperpixel flow, sets a new state of the art on three standard benchmarks as well as a new dataset, SPair-71k, which contains a significantly larger number of image pairs than existing datasets, with more accurate and richer annotations for in-depth analysis.
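As a rough sketch of the idea, the snippet below assembles "hyperpixel"-style features by concatenating a few intermediate layers of a pretrained ResNet and matches them densely. It assumes PyTorch and torchvision; the chosen layers, the bilinear upsampling, and the simple argmax nearest-neighbour matching are illustrative stand-ins for the paper's learned layer selection and regularized Hough voting, not the authors' implementation.

import torch
import torch.nn.functional as F
import torchvision

# Pretrained ImageNet backbone (downloads weights on first use).
backbone = torchvision.models.resnet101(weights="IMAGENET1K_V1").eval()

def intermediate_features(img, layers=("layer1", "layer2", "layer3")):
    """Collect the requested intermediate ResNet feature maps."""
    feats = {}
    x = backbone.conv1(img)
    x = backbone.bn1(x)
    x = backbone.relu(x)
    x = backbone.maxpool(x)
    for name in ("layer1", "layer2", "layer3", "layer4"):
        x = getattr(backbone, name)(x)
        if name in layers:
            feats[name] = x
    return feats

def hyperpixel(img, layers=("layer1", "layer2", "layer3")):
    """Concatenate selected layers, upsampled to the finest selected resolution."""
    feats = intermediate_features(img, layers)
    target = feats[layers[0]].shape[-2:]            # spatial size of the earliest layer
    stacked = [F.interpolate(feats[k], size=target, mode="bilinear",
                             align_corners=False) for k in layers]
    f = torch.cat(stacked, dim=1)                   # B x (sum of channels) x H x W
    return F.normalize(f, dim=1)                    # L2-normalize each position

# Dense matching by cosine similarity between the hyperpixels of two images.
with torch.no_grad():
    src = hyperpixel(torch.randn(1, 3, 224, 224))
    trg = hyperpixel(torch.randn(1, 3, 224, 224))
    B, C, H, W = src.shape
    sim = torch.einsum("bchw,bcij->bhwij", src, trg)   # all pairwise similarities
    match = sim.view(B, H * W, H * W).argmax(dim=2)    # best target cell per source cell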


Citations
Book Chapter

MotionSqueeze: Neural Motion Feature Learning for Video Understanding

TL;DR: This work proposes a trainable neural module, dubbed MotionSqueeze, for effective motion feature extraction, and demonstrates that the proposed method provides a significant gain on four standard benchmarks for action recognition with only a small amount of additional cost, outperforming the state of the art on Something-Something-V1&V2 datasets.
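A common building block for this kind of learned motion feature is a local correlation volume between consecutive frames. The toy sketch below (my own simplification, not the MotionSqueeze module itself) computes such a volume and feeds it to a small convolutional head.

import torch
import torch.nn.functional as F

def local_correlation(feat_t, feat_tp1, max_disp=3):
    """Correlate each position in frame t with a (2*max_disp+1)^2 neighbourhood
    in frame t+1, producing one similarity channel per displacement."""
    B, C, H, W = feat_t.shape
    pad = F.pad(feat_tp1, [max_disp] * 4)
    corrs = []
    for dy in range(2 * max_disp + 1):
        for dx in range(2 * max_disp + 1):
            shifted = pad[:, :, dy:dy + H, dx:dx + W]
            corrs.append((feat_t * shifted).sum(dim=1, keepdim=True))
    return torch.cat(corrs, dim=1)                  # B x (2*max_disp+1)^2 x H x W

# Toy usage: the correlation volume becomes the input of a learnable motion head.
corr = local_correlation(torch.randn(2, 64, 28, 28), torch.randn(2, 64, 28, 28))
motion_head = torch.nn.Conv2d(corr.shape[1], 64, kernel_size=3, padding=1)
motion_feat = motion_head(corr)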
Proceedings Article

Semantic Correspondence as an Optimal Transport Problem

TL;DR: This work solves the problem of establishing dense correspondences across semantically similar images by converting the maximization problem into an optimal transport formulation and incorporating staircase weights into the optimal transport algorithm to act as empirical distributions.
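For intuition, the sketch below runs entropic-regularized optimal transport (Sinkhorn iterations) over a cosine-distance cost between two feature sets. It is a generic toy: uniform marginals stand in for the paper's staircase weights, and all names are mine rather than the paper's code.

import torch

def sinkhorn(cost, mu, nu, eps=0.05, n_iters=50):
    """Entropic-regularized OT between histograms mu and nu via Sinkhorn iterations."""
    K = torch.exp(-cost / eps)              # Gibbs kernel
    u = torch.ones_like(mu)
    v = torch.ones_like(nu)
    for _ in range(n_iters):
        u = mu / (K @ v)
        v = nu / (K.t() @ u)
    return u[:, None] * K * v[None, :]      # transport plan

# Toy usage: match 100 source features to 120 target features.
src = torch.nn.functional.normalize(torch.randn(100, 256), dim=1)
trg = torch.nn.functional.normalize(torch.randn(120, 256), dim=1)
cost = 1.0 - src @ trg.t()                  # cosine distance as matching cost
mu = torch.full((100,), 1.0 / 100)          # uniform marginals here; the paper instead
nu = torch.full((120,), 1.0 / 120)          # uses staircase weights as empirical distributions
plan = sinkhorn(cost, mu, nu)
match = plan.argmax(dim=1)                  # best target index per source feature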
Proceedings Article

Correspondence Networks With Adaptive Neighbourhood Consensus

TL;DR: This paper proposes a convolutional neural network architecture, called adaptive neighbourhood consensus network (ANC-Net), that can be trained end-to-end with sparse key-point annotations, to handle the task of establishing dense visual correspondences between images containing objects of the same category.
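The starting point of such methods is a dense 4D correlation tensor between the two images. The sketch below builds that tensor and applies a simple, non-learned mutual nearest-neighbour filter as a stand-in for ANC-Net's learned adaptive 4D convolutions.

import torch
import torch.nn.functional as F

def dense_correlation(fa, fb):
    """One similarity score per (source position, target position) pair;
    reshaping the result to (H, W, H', W') gives the 4D correlation tensor."""
    B, C, H, W = fa.shape
    fa = F.normalize(fa, dim=1).view(B, C, H * W)
    fb = F.normalize(fb, dim=1).view(B, C, H * W)
    return torch.bmm(fa.transpose(1, 2), fb)        # B x HW x HW

def mutual_nn_filter(corr):
    """Rescale each score by how close it is to being a mutual best match;
    a non-learned surrogate for learned neighbourhood consensus."""
    max_src = corr.max(dim=2, keepdim=True).values.clamp(min=1e-8)
    max_trg = corr.max(dim=1, keepdim=True).values.clamp(min=1e-8)
    return corr * (corr / max_src) * (corr / max_trg)

corr = mutual_nn_filter(dense_correlation(torch.randn(1, 256, 20, 20),
                                          torch.randn(1, 256, 20, 20)))
match = corr.argmax(dim=2)                           # best target cell per source cell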
Posted Content

SPair-71k: A Large-scale Benchmark for Semantic Correspondence

TL;DR: A new large-scale benchmark dataset of semantically paired images, SPair-71k, is presented; it contains 70,958 image pairs with diverse variations in viewpoint and scale, is significantly larger than existing datasets, and provides more accurate and richer annotations.
Book Chapter

Deep Hough-Transform Line Priors

TL;DR: This work reduces the dependency on labeled data by building on the classic knowledge-based priors while using deep networks to learn features, and shows that adding prior knowledge improves data efficiency as line priors no longer need to be learned from data.
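The classic prior in question is Hough voting for lines: every edge pixel votes for all (rho, theta) lines passing through it. The NumPy sketch below shows only that plain accumulator, not the paper's differentiable in-network version.

import numpy as np

def hough_lines(edge_map, n_theta=180, n_rho=200):
    """Accumulate Hough votes for lines parameterized as rho = x*cos(theta) + y*sin(theta)."""
    H, W = edge_map.shape
    thetas = np.linspace(0.0, np.pi, n_theta, endpoint=False)
    rho_max = np.hypot(H, W)
    accumulator = np.zeros((n_rho, n_theta))
    ys, xs = np.nonzero(edge_map)
    for theta_idx, theta in enumerate(thetas):
        rhos = xs * np.cos(theta) + ys * np.sin(theta)                 # signed distances
        bins = np.round((rhos + rho_max) / (2 * rho_max) * (n_rho - 1)).astype(int)
        np.add.at(accumulator, (bins, theta_idx), 1)                   # cast votes
    return accumulator, thetas

# Toy usage: strongest line hypothesis in a random binary edge map.
acc, thetas = hough_lines(np.random.rand(64, 64) > 0.95)
rho_idx, theta_idx = np.unravel_index(acc.argmax(), acc.shape)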
References
Proceedings Article

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won first place in the ILSVRC 2015 classification task.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieves state-of-the-art performance on ImageNet classification.
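A sketch of an architecture matching that description is shown below; channel counts and kernel sizes follow the common torchvision variant and are illustrative rather than an exact replication of the original.

import torch.nn as nn

# Five convolutional layers (some followed by max-pooling) and three
# fully-connected layers ending in 1000 class logits; expects 224x224 RGB input.
alexnet_like = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=11, stride=4, padding=2), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(64, 192, kernel_size=5, padding=2), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Conv2d(192, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.Conv2d(256, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
    nn.MaxPool2d(kernel_size=3, stride=2),
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 1000),                 # logits; the softmax is applied in the loss
)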
Proceedings Article

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Posted Content

Deep Residual Learning for Image Recognition

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
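The core idea is the residual block with an identity shortcut: the stacked layers learn a residual F(x) that is added back to the input. A minimal sketch (not the full ResNet, and without downsampling shortcuts) is shown below.

import torch
import torch.nn as nn

class BasicResidualBlock(nn.Module):
    """Two 3x3 convolutions whose output is added to the identity shortcut,
    which eases optimization of very deep stacks."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)          # identity shortcut

y = BasicResidualBlock(64)(torch.randn(1, 64, 56, 56))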
Proceedings Article

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
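For reference, the sketch below computes the orientation histogram of a single HOG cell; block normalization, overlapping cells, and the sliding-window SVM classifier of the full detector are omitted.

import numpy as np

def hog_cell_histogram(patch, n_bins=9):
    """Accumulate gradient magnitudes into unsigned-orientation bins (0..180 degrees)
    for one cell, then L2-normalize the histogram."""
    gy, gx = np.gradient(patch.astype(np.float64))
    magnitude = np.hypot(gx, gy)
    orientation = np.rad2deg(np.arctan2(gy, gx)) % 180.0
    bins = np.minimum((orientation / (180.0 / n_bins)).astype(int), n_bins - 1)
    hist = np.zeros(n_bins)
    np.add.at(hist, bins.ravel(), magnitude.ravel())
    return hist / (np.linalg.norm(hist) + 1e-6)

descriptor = hog_cell_histogram(np.random.rand(8, 8))   # one 8x8 cell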