scispace - formally typeset
Open AccessJournal ArticleDOI

Sketch-based manga retrieval using manga109 dataset

Reads0
Chats0
TLDR
A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.
Abstract
Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

RSHAN: Image super-resolution network based on residual separation hybrid attention module

TL;DR: Wang et al. as mentioned in this paper proposed Residual Separation Hybrid Attention Module (RSHAM), which fuses the local features extracted by the convolutional neural network (CNN) branch and the long-range dependencies extracted by Transformers.
Proceedings ArticleDOI

Self-calibrated Attention Residual Network for Image Super-Resolution

TL;DR: Li et al. as mentioned in this paper proposed a self-calibrated attention residual network (SARN), which consists of carefully designed multi-scale paths, which can capture rich structure information from different scales.
Proceedings ArticleDOI

Edge Guided Learning for Image Super-resolution with Realistic Textures

TL;DR: An edge-guided SR neural network is proposed by introducing a plug-in edge detection module and incorporating a new edge loss, which increases the reconstruction accuracy and reduces artifacts, which indicates the ability of reconstructing realistic textures can be transferred well from Edge-SRN to a small model.
Posted Content

End-to-end Alternating Optimization for Blind Super Resolution.

TL;DR: Zhang et al. as discussed by the authors proposed an alternating optimization algorithm to estimate the blur kernel and restore the SR image in a single model, which can largely outperform state-of-the-art methods.

ScaMP: Scalable Meta-Parallelism for Deep Learning Search

TL;DR: ScaMP as discussed by the authors proposes Scalable Meta-Parallelism for Deep Learning Search (SCaMP), a distributed Hyperparameter Optimization (HPO) and Neural Architecture Search (NAS) framework that supports out-of-core models with flexible parallelism schemes.
References
More filters
Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Journal ArticleDOI

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.
Proceedings ArticleDOI

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
Related Papers (5)