Large-Scale Image Retrieval with Attentive Deep Local Features

doi:10.1109/ICCV.2017.374

Open Access, Proceedings Article

Hyeonwoo Noh, +4 more

- pp 3476-3485

TLDR

The paper proposes DELF (DEep Local Feature), an attentive local feature descriptor for large-scale image retrieval, based on convolutional neural networks trained only with image-level annotations on a landmark image dataset.

Abstract:

We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for keypoint selection, which shares most network layers with the descriptor. This framework can be used for image retrieval as a drop-in replacement for other keypoint detectors and descriptors, enabling more accurate feature matching and geometric verification. Our system produces reliable confidence scores to reject false positives; in particular, it is robust against queries that have no correct match in the database. To evaluate the proposed descriptor, we introduce a new large-scale dataset, referred to as the Google-Landmarks dataset, which involves challenges in both database and query such as background clutter, partial occlusion, multiple landmarks, objects in variable scales, etc. We show that DELF outperforms the state-of-the-art global and local descriptors in the large-scale setting by significant margins.
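
The attention-based keypoint selection described in the abstract can be illustrated with a minimal sketch. It assumes the attention branch yields one non-negative score per feature-map location and that the highest-scoring locations are kept as keypoints; the function name `select_keypoints`, the toy values, and the L2 normalization step are illustrative assumptions, not the authors' implementation.

```python
import math

def select_keypoints(features, scores, k):
    """Return the k highest-scoring locations with L2-normalized descriptors.

    features: H x W grid of descriptor vectors (lists of floats)
    scores:   H x W grid of attention scores (assumed non-negative)
    """
    candidates = [
        (scores[y][x], (y, x))
        for y in range(len(scores))
        for x in range(len(scores[y]))
    ]
    # Highest attention score first; ties broken by location for determinism.
    candidates.sort(key=lambda c: (-c[0], c[1]))
    selected = []
    for score, (y, x) in candidates[:k]:
        vec = features[y][x]
        # Guard against an all-zero descriptor by falling back to norm 1.0.
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        selected.append(((y, x), score, [v / norm for v in vec]))
    return selected

# Toy 2x2 feature map with 2-D descriptors (all values are made up).
features = [[[3.0, 4.0], [1.0, 0.0]],
            [[0.0, 2.0], [1.0, 1.0]]]
scores = [[0.9, 0.1],
          [0.2, 0.7]]
keypoints = select_keypoints(features, scores, k=2)
```

Each returned entry holds the location, its attention score, and the normalized descriptor, so low-attention regions (for example, background clutter) would simply never enter the matching stage.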

Citations

Journal Article

Leveraging CNNs for Panoramic Image Matching Based on Improved Cube Projection Model

Tian Gao, +5 more

- 05 Jul 2023 -

Remote Sensing

TL;DR: This paper proposes the Improved Cube Projection Model (ICPM) for feature point extraction and matching in panoramic images using convolutional neural networks (CNNs); it projects panoramas into split-frame perspective images with significant overlap in six directions.

Journal Article

Learning Condition-Invariant Scene Representations for Place Recognition across the Seasons Using Auto-Encoder and ICA

Tarikul Islam, +2 more

- 30 Nov 2022 -

Journal of Electrical and Computer Engin...

TL;DR: This article proposes independent component analysis (ICA) and an auto-encoder for place recognition; the ICA-based algorithm achieved a 91.05% accuracy rate, better than the baseline algorithms, and the route-finding rate obtained with the auto-encoder is also reported as acceptable.

Book Chapter

Deep Learning-Based Image Retrieval in the JPEG Compressed Domain

TL;DR: In this paper, the authors propose a unified image retrieval model that takes DCT coefficients as input and efficiently extracts global and local features directly in the JPEG compressed domain for accurate retrieval.

Proceedings Article

Multi-Deep Features Fusion Algorithm for Clothing Image Recognition

Zhiqiang He, +4 more

TL;DR: This paper uses target detection and a deep residual network (ResNet) to extract comprehensive clothing features, focusing on the clothing itself during extraction, and proposes a multi-deep features fusion algorithm for clothing image recognition.

Proceedings Article

Orthogonal Vector-Decomposed Disentanglement Network of Interactive Image Retrieval for Fashion Outfit Recommendation

Chen Chen, +3 more

TL;DR: A novel Orthogonal Vector-Decomposed Disentanglement Network (OVDDN) is presented, which proposes to leverage the disentangled parts to learn a controllable denoising embedding space and maintain the cross-modal semantic consistency of paired images.

References

Proceedings Article

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the approach won 1st place on the ILSVRC 2015 classification task.

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

Journal Article

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Journal Article

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is a benchmark in object category classification and detection on hundreds of object categories and millions of images; it has been run annually from 2010 to present, attracting participation from more than fifty institutions.

Journal Article

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981 -

Communications of the ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

Related Papers (5)

Deep Residual Learning for Image Recognition

Distinctive Image Features from Scale-Invariant Keypoints

Object retrieval with large vocabularies and fast spatial matching

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

Video Google: a text retrieval approach to object matching in videos