scispace - formally typeset
Open AccessJournal ArticleDOI

Sketch-based manga retrieval using manga109 dataset

Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.
Abstract
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Residual Dense Network for Image Super-Resolution

TL;DR: This paper proposes residual dense block (RDB) to extract abundant local features via dense connected convolutional layers and uses global feature fusion in RDB to jointly and adaptively learn global hierarchical features in a holistic way.
Posted Content

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

TL;DR: This work proposes a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections, and proposes a channel attention mechanism to adaptively rescale channel-wise features by considering interdependencies among channels.
Book ChapterDOI

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

TL;DR: Very deep residual channel attention networks (RCAN) as mentioned in this paper proposes a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections Each residual group contains some residual blocks with short skip connections.
Proceedings ArticleDOI

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

TL;DR: In this paper, the Laplacian pyramid super-resolution network (LapSRN) is proposed to progressively reconstruct the sub-band residuals of high-resolution images.
Posted Content

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

TL;DR: This paper proposes the Laplacian Pyramid Super-Resolution Network (LapSRN) to progressively reconstruct the sub-band residuals of high-resolution images and generates multi-scale predictions in one feed-forward pass through the progressive reconstruction, thereby facilitates resource-aware applications.
References
More filters
Posted Content

A Survey on Learning to Hash

TL;DR: This paper presents a comprehensive survey of the learning to hash algorithms, categorize them according to the manners of preserving the similarities into: pairwise similarity preserving, multiwise Similarity preserving, implicit similarity preserve, as well as quantization, and discusses their relations.
Journal ArticleDOI

Sketch classification and classification-driven analysis using Fisher vectors

TL;DR: An approach for sketch classification based on Fisher vectors that significantly outperforms existing techniques and is able to recover semantic aspects of the individual sketches, such as the quality of the drawing and the importance of each part of the sketch for the recognition.
Journal ArticleDOI

A Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval

TL;DR: An in-depth analysis of the state-of-the-art framework of VLAD and Product Quantization proposed by Jegou is made, which develops an enhanced framework that significantly outperforms the previous best reported accuracy results on standard benchmarks and is more efficient.
Proceedings ArticleDOI

Gradient field descriptor for sketch based retrieval and localization

TL;DR: Gradient Field HoG (GF-HOG) is introduced as a depiction invariant image descriptor, encapsulating local spatial structure in the sketch and facilitating efficient codebook based retrieval.
Proceedings ArticleDOI

Sketch-a-Net that Beats Humans

TL;DR: A multi-scale multi-channel deep neural network framework that yields sketch recognition performance surpassing that of humans, and not only delivers the best performance on the largest human sketch dataset to date, but also is small in size making efficient training possible using just CPUs.
Related Papers (5)