Billion-scale similarity search with GPUs

Open AccessPosted Content

Billion-scale similarity search with GPUs

Jeff Johnson, +2 more

- 28 Feb 2017 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

In this paper, the authors propose a design for k-selection that operates at up to 55% of theoretical peak performance, enabling a nearest neighbor implementation that is 8.5x faster than prior GPU state of the art.

Abstract:

Similarity search finds application in specialized database systems handling complex data such as images or videos, which are typically represented by high-dimensional features and require specific indexing structures. This paper tackles the problem of better utilizing GPUs for this task. While GPUs excel at data-parallel tasks, prior approaches are bottlenecked by algorithms that expose less parallelism, such as k-min selection, or make poor use of the memory hierarchy. We propose a design for k-selection that operates at up to 55% of theoretical peak performance, enabling a nearest neighbor implementation that is 8.5x faster than prior GPU state of the art. We apply it in different similarity search scenarios, by proposing optimized design for brute-force, approximate and compressed-domain search based on product quantization. In all these setups, we outperform the state of the art by large margins. Our implementation enables the construction of a high accuracy k-NN graph on 95 million images from the Yfcc100M dataset in 35 minutes, and of a graph connecting 1 billion vectors in less than 12 hours on 4 Maxwell Titan X GPUs. We have open-sourced our approach for the sake of comparison and reproducibility.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Nils Reimers, +1 more

TL;DR: Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity is presented.

...read moreread less

Posted Content

Deep Clustering for Unsupervised Learning of Visual Features

Mathilde Caron, +3 more

- 15 Jul 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents DeepCluster, a clustering method that jointly learns the parameters of a neural network and the cluster assignments of the resulting features and outperforms the current state of the art by a significant margin on all the standard benchmarks.

...read moreread less

Posted Content

Dense Passage Retrieval for Open-Domain Question Answering

Vladimir Karpukhin, +7 more

- 10 Apr 2020 -

arXiv: Computation and Language

TL;DR: This work shows that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual-encoder framework.

...read moreread less

Posted ContentDOI

Biological Structure and Function Emerge from Scaling Unsupervised Learning to 250 Million Protein Sequences

Alexander Rives, +8 more

- 29 Apr 2019 -

bioRxiv

TL;DR: This work uses unsupervised learning to train a deep contextual language model on 86 billion amino acids across 250 million protein sequences spanning evolutionary diversity, enabling state-of-the-art supervised prediction of mutational effect and secondary structure, and improving state- of- the-art features for long-range contact prediction.

...read moreread less

Journal ArticleDOI

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives, +12 more

- 13 Apr 2021 -

Proceedings of the National Academy of S...

TL;DR: This paper used unsupervised learning to train a deep contextual language model on 86 billion amino acids across 250 million protein sequences spanning evolutionary diversity, which contains information about biological properties in its representations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013 -

arXiv: Computation and Language

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.

...read moreread less

Proceedings Article

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Song Han, +3 more

TL;DR: Deep Compression as mentioned in this paper proposes a three-stage pipeline: pruning, quantization, and Huffman coding to reduce the storage requirement of neural networks by 35x to 49x without affecting their accuracy.

...read moreread less

Journal ArticleDOI

Product Quantization for Nearest Neighbor Search

Hervé Jégou, +2 more

- 01 Jan 2011 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper introduces a product quantization-based approach for approximate nearest neighbor search to decompose the space into a Cartesian product of low-dimensional subspaces and to quantize each subspace separately.

...read moreread less

Proceedings ArticleDOI

Sorting networks and their applications

Kenneth E. Batcher

TL;DR: To achieve high throughput rates today's computers perform several operations simultaneously; not only are I/O operations performed concurrently with computing, but also, in multiprocessors, several computing operations are done concurrently.

...read moreread less

Collapse

Billion-scale similarity search with GPUs

Citations

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Deep Clustering for Unsupervised Learning of Visual Features

Dense Passage Retrieval for Open-Domain Question Answering

Biological Structure and Function Emerge from Scaling Unsupervised Learning to 250 Million Protein Sequences

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

References

Deep Residual Learning for Image Recognition

Distributed Representations of Words and Phrases and their Compositionality

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Product Quantization for Nearest Neighbor Search

Sorting networks and their applications

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Attention is All you Need

Deep Residual Learning for Image Recognition

Adam: A Method for Stochastic Optimization

Glove: Global Vectors for Word Representation