scispace - formally typeset
Open AccessJournal ArticleDOI

Sketch-based manga retrieval using manga109 dataset

Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.
Abstract
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Revised Spatial Transformer Network towards Improved Image Super-resolutions

TL;DR: The revised spatial transformer network can be used in the future for simultaneous geometric transformation and imagesuper-resolution, which solve the practical applications of image super-resolution in real life.
Journal ArticleDOI

Progressive residual networks for image super-resolution

TL;DR: A novel Progressive Residual Network (PRNet) is proposed to integrate hierarchical and scale features for single image SR, which works well for both small and large scaling factors.
Journal ArticleDOI

Image Super-Resolution With Deep Variational Autoencoders

TL;DR: VDVAE-SR tackles image super-resolution using transfer learning on pretrained VDVAEs, a new model that aims to exploit the most recent deep VAE methodologies to improve upon the results of similar models.
Proceedings ArticleDOI

Information-Growth Attention Network for Image Super-Resolution

TL;DR: IGAN as discussed by the authors proposes an information-growth attention mechanism to pay attention to features involving large information growth capacity by assimilating the difference from current features to the former features within a network.
Journal ArticleDOI

Multi-scale skip-connection network for image super-resolution

TL;DR: A multi-scale skip-connection network (MSN) to improve the visual quality of the image SR by being evaluated on a wide variety of images and achieving an advantage over the state-of-the-art methods in terms of both numerical results and visual quality.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.
Proceedings ArticleDOI

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Related Papers (5)