The sketchy database: learning to retrieve badly drawn bunnies

doi:10.1145/2897824.2925954

Open AccessJournal ArticleDOI

The sketchy database: learning to retrieve badly drawn bunnies

Patsorn Sangkloy, +3 more

- Vol. 35, Iss: 4, pp 119

Chats0

TLDR

The Sketchy database is presented, the first large-scale collection of sketch-photo pairs and it is shown that the learned representation significantly outperforms both hand-crafted features as well as deep features trained for sketch or photo classification.

Abstract:

We present the Sketchy database, the first large-scale collection of sketch-photo pairs. We ask crowd workers to sketch particular photographic objects sampled from 125 categories and acquire 75,471 sketches of 12,500 objects. The Sketchy database gives us fine-grained associations between particular photos and sketches, and we use this to train cross-domain convolutional networks which embed sketches and photographs in a common feature space. We use our database as a benchmark for fine-grained retrieval and show that our learned representation significantly outperforms both hand-crafted features as well as deep features trained for sketch or photo classification. Beyond image retrieval, we believe the Sketchy database opens up new opportunities for sketch and image understanding and synthesis.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Deeper, Broader and Artier Domain Generalization

Da Li, +3 more

TL;DR: In this article, a low-rank parameterized CNN model is proposed for domain generalization, which can learn from multiple training domains and extract a domain-agnostic model that can then be applied to an unseen domain.

...read moreread less

Posted Content

Deeper, Broader and Artier Domain Generalization

Da Li, +3 more

- 09 Oct 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper builds upon the favorable domain shift-robust properties of deep learning methods, and develops a low-rank parameterized CNN model for end-to-end DG learning that outperforms existing DG alternatives.

...read moreread less

Proceedings Article

A Neural Representation of Sketch Drawings

David Ha, +1 more

TL;DR: Sketch-rnn is presented, a recurrent neural network (RNN) able to construct stroke-based drawings of common objects that is trained on thousands of crude human-drawn images representing hundreds of classes.

...read moreread less

Proceedings ArticleDOI

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Patsorn Sangkloy, +4 more

TL;DR: In this paper, the authors proposed a deep adversarial image synthesis architecture that is conditioned on sketched boundaries and sparse color strokes to generate realistic cars, bedrooms, or faces, which allows users to scribble over the sketch to indicate preferred color for objects.

...read moreread less

Posted Content

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Patsorn Sangkloy, +4 more

- 02 Dec 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A deep adversarial image synthesis architecture that is conditioned on sketched boundaries and sparse color strokes to generate realistic cars, bedrooms, or faces is proposed and demonstrates a sketch based image synthesis system which allows users to scribble over the sketch to indicate preferred color for objects.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Journal Article

Visualizing Data using t-SNE

Laurens van der Maaten, +1 more

- 01 Jan 2008 -

Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

...read moreread less

Collapse

The sketchy database: learning to retrieve badly drawn bunnies

Citations

Deeper, Broader and Artier Domain Generalization

Deeper, Broader and Artier Domain Generalization

A Neural Representation of Sketch Drawings

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

Scribbler: Controlling Deep Image Synthesis with Sketch and Color

References

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

ImageNet Large Scale Visual Recognition Challenge

Microsoft COCO: Common Objects in Context

Visualizing Data using t-SNE

Related Papers (5)

How do humans sketch objects

Sketch Me That Shoe

Deep Residual Learning for Image Recognition

ImageNet: A large-scale hierarchical image database

A Neural Representation of Sketch Drawings