Sketch-based manga retrieval using manga109 dataset
Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.Abstract:
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).read more
Citations
More filters
Proceedings ArticleDOI
Residual Dense Network for Image Super-Resolution
TL;DR: This paper proposes residual dense block (RDB) to extract abundant local features via dense connected convolutional layers and uses global feature fusion in RDB to jointly and adaptively learn global hierarchical features in a holistic way.
Posted Content
Image Super-Resolution Using Very Deep Residual Channel Attention Networks
TL;DR: This work proposes a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections, and proposes a channel attention mechanism to adaptively rescale channel-wise features by considering interdependencies among channels.
Book ChapterDOI
Image Super-Resolution Using Very Deep Residual Channel Attention Networks
TL;DR: Very deep residual channel attention networks (RCAN) as mentioned in this paper proposes a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections Each residual group contains some residual blocks with short skip connections.
Proceedings ArticleDOI
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution
TL;DR: In this paper, the Laplacian pyramid super-resolution network (LapSRN) is proposed to progressively reconstruct the sub-band residuals of high-resolution images.
Posted Content
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution
TL;DR: This paper proposes the Laplacian Pyramid Super-Resolution Network (LapSRN) to progressively reconstruct the sub-band residuals of high-resolution images and generates multi-scale predictions in one feed-forward pass through the progressive reconstruction, thereby facilitates resource-aware applications.
References
More filters
Posted Content
A Survey on Learning to Hash
TL;DR: This paper presents a comprehensive survey of the learning to hash algorithms, categorize them according to the manners of preserving the similarities into: pairwise similarity preserving, multiwise Similarity preserving, implicit similarity preserve, as well as quantization, and discusses their relations.
Journal ArticleDOI
Sketch classification and classification-driven analysis using Fisher vectors
TL;DR: An approach for sketch classification based on Fisher vectors that significantly outperforms existing techniques and is able to recover semantic aspects of the individual sketches, such as the quality of the drawing and the importance of each part of the sketch for the recognition.
Journal ArticleDOI
A Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval
Eleftherios Spyromitros-Xioufis,Symeon Papadopoulos,Ioannis Kompatsiaris,Grigorios Tsoumakas,Ioannis Vlahavas +4 more
TL;DR: An in-depth analysis of the state-of-the-art framework of VLAD and Product Quantization proposed by Jegou is made, which develops an enhanced framework that significantly outperforms the previous best reported accuracy results on standard benchmarks and is more efficient.
Proceedings ArticleDOI
Gradient field descriptor for sketch based retrieval and localization
TL;DR: Gradient Field HoG (GF-HOG) is introduced as a depiction invariant image descriptor, encapsulating local spatial structure in the sketch and facilitating efficient codebook based retrieval.
Proceedings ArticleDOI
Sketch-a-Net that Beats Humans
TL;DR: A multi-scale multi-channel deep neural network framework that yields sketch recognition performance surpassing that of humans, and not only delivers the best performance on the largest human sketch dataset to date, but also is small in size making efficient training possible using just CPUs.