Sketch-based manga retrieval using manga109 dataset
Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.Abstract:
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).read more
Citations
More filters
Proceedings ArticleDOI
Super-Resolution Based on Back-Projection of Interpolated Image
TL;DR: A new image super-resolution method to combine fast image interpolation with iterative back-projection is proposed that does not require any external pre-trained datasets and has low computation time while the quality of the reconstructed image can be measured up to the high programming complexity methods such as the dictionary and deep convolutional neural networks.
Proceedings ArticleDOI
A Content-Based Multi-Scale Network for Single Image Super-Resolution
Jiahuan Ji,Baojiang Zhong +1 more
TL;DR: In this article , a content-based multi-scale network (CMNet) is proposed for conducting single image super-resolution (SISR), which is motivated by the fact that the contents of real-world images normally have different scales.
Proceedings ArticleDOI
Extraction of Frame Sequences in the Manga Context
Christian Roggia,Fabio Persia +1 more
TL;DR: In this paper, a novel approach to comics segmentation and sequencing is proposed by taking advantage of existing machine learning concepts which are used to generate an artificial intelligence (AI) capable of correctly detecting panels within an image.
Journal ArticleDOI
Leveraging Expert Knowledge for Label Noise Mitigation in Machine Learning
TL;DR: In this article, a novel method for reducing the effect of label noise is introduced, where the rules are created from expert knowledge to identify the incorrect non-expert training data and the violating data samples are weighted less to mitigate their effects during model training.
References
More filters
Journal ArticleDOI
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI
Histograms of oriented gradients for human detection
Navneet Dalal,Bill Triggs +1 more
TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI
The Pascal Visual Object Classes (VOC) Challenge
TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI
Multiresolution gray-scale and rotation invariant texture classification with local binary patterns
TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.
Proceedings ArticleDOI
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.