scispace - formally typeset
Open AccessJournal ArticleDOI

Sketch-based manga retrieval using manga109 dataset

Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.
Abstract
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Sketch-Based Image Retrieval With Multi-Clustering Re-Ranking

TL;DR: An SBIR re-ranking approach based on multi-clustering is proposed, which uses the semantic information of three types of images: edge maps, object images and natural images themselves to improve the performance of sketch-based image retrieval methods.
Proceedings ArticleDOI

Text detection in manga by combining connected-component-based and region-based classifications

TL;DR: A method to detect text regions in manga using classifiers for both connected components and regions is developed and a text region dataset of manga is developed, which enables learning and detailed evaluations of methods used to detecttext regions.
Journal ArticleDOI

Blind single image super-resolution with a mixture of deep networks

TL;DR: A mixture model of deep networks is proposed, which is capable of clustering SR tasks of different blur kernels into a set of groups, and derives a lower bound of the likelihood function, which circumvents the intractability in direct maximum likelihood estimation.
Posted Content

KOALAnet: Blind Super-Resolution using Kernel-Oriented Adaptive Local Adjustment

TL;DR: The proposed KOALAnet produces the most natural results for artistic photographs with intentional blur, which are not over-sharpened, by effectively handling images mixed with in-focus and out-of-focus areas.
Journal ArticleDOI

Multi-attention augmented network for single image super-resolution

TL;DR: A multi-attention augmented network, which mainly consists of content-, orientation- and position-aware modules, is proposed, which develops an attention augmented U-net structure to form the content-aware module in order to learn and combine multi-scale informative features within a large receptive field.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.
Proceedings ArticleDOI

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Related Papers (5)