Sketch-based manga retrieval using manga109 dataset

doi:10.1007/S11042-016-4020-Z

Open AccessJournal ArticleDOI

Sketch-based manga retrieval using manga109 dataset

Yusuke Matsui, +4 more

- 01 Oct 2017 -

Multimedia Tools and Applications

- Vol. 76, Iss: 20, pp 21811-21838

Chats0

TLDR

A manga-specific image retrieval system that consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization is proposed.

Abstract:

Manga (Japanese comics) are popular worldwide. However, current e-manga archives offer very limited search support, i.e., keyword-based search by title or author. To make the manga search experience more intuitive, efficient, and enjoyable, we propose a manga-specific image retrieval system. The proposed system consists of efficient margin labeling, edge orientation histogram feature description with screen tone removal, and approximate nearest-neighbor search using product quantization. For querying, the system provides a sketch-based interface. Based on the interface, two interactive reranking schemes are presented: relevance feedback and query retouch. For evaluation, we built a novel dataset of manga images, Manga109, which consists of 109 comic books of 21,142 pages drawn by professional manga artists. To the best of our knowledge, Manga109 is currently the biggest dataset of manga images available for research. Experimental results showed that the proposed framework is efficient and scalable (70 ms from 21,142 pages using a single computer with 204 MB RAM).

Citations

PDF

Open Access

More filters

Journal ArticleDOI

RSHAN: Image super-resolution network based on residual separation hybrid attention module

Ying Shen, +3 more

- 01 Jun 2023 -

Engineering Applications of Artificial I...

TL;DR: Wang et al. as mentioned in this paper proposed Residual Separation Hybrid Attention Module (RSHAM), which fuses the local features extracted by the convolutional neural network (CNN) branch and the long-range dependencies extracted by Transformers.

...read moreread less

Proceedings ArticleDOI

Self-calibrated Attention Residual Network for Image Super-Resolution

Anqi Rong, +3 more

TL;DR: Li et al. as mentioned in this paper proposed a self-calibrated attention residual network (SARN), which consists of carefully designed multi-scale paths, which can capture rich structure information from different scales.

...read moreread less

Proceedings ArticleDOI

Edge Guided Learning for Image Super-resolution with Realistic Textures

Zhan Li, +5 more

TL;DR: An edge-guided SR neural network is proposed by introducing a plug-in edge detection module and incorporating a new edge loss, which increases the reconstruction accuracy and reduces artifacts, which indicates the ability of reconstructing realistic textures can be transferred well from Edge-SRN to a small model.

...read moreread less

Posted Content

End-to-end Alternating Optimization for Blind Super Resolution.

Zhengxiong Luo, +4 more

- 14 May 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Zhang et al. as discussed by the authors proposed an alternating optimization algorithm to estimate the blur kernel and restore the SR image in a single model, which can largely outperform state-of-the-art methods.

...read moreread less

DOI

ScaMP: Scalable Meta-Parallelism for Deep Learning Search

Quentin Anthony, +3 more

TL;DR: ScaMP as discussed by the authors proposes Scalable Meta-Parallelism for Deep Learning Search (SCaMP), a distributed Hyperparameter Optimization (HPO) and Neural Architecture Search (NAS) framework that supports out-of-core models with flexible parallelism schemes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

Mark Everingham, +4 more

- 01 Jun 2010 -

International Journal of Computer Vision

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.

...read moreread less

Journal ArticleDOI

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

Timo Ojala, +2 more

- 01 Jul 2002 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.

...read moreread less

Proceedings ArticleDOI

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, +2 more

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.

...read moreread less

Collapse

Related Papers (5)

Accurate Image Super-Resolution Using Very Deep Convolutional Networks

Jiwon Kim, +2 more

A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics

David Martin, +3 more

Sketch-based manga retrieval using manga109 dataset

Citations

RSHAN: Image super-resolution network based on residual separation hybrid attention module

Self-calibrated Attention Residual Network for Image Super-Resolution

Edge Guided Learning for Image Super-resolution with Realistic Textures

End-to-end Alternating Optimization for Blind Super Resolution.

ScaMP: Scalable Meta-Parallelism for Deep Learning Search

References

Distinctive Image Features from Scale-Invariant Keypoints

Histograms of oriented gradients for human detection

The Pascal Visual Object Classes (VOC) Challenge

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Related Papers (5)

Accurate Image Super-Resolution Using Very Deep Convolutional Networks

A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics

Enhanced Deep Residual Networks for Single Image Super-Resolution

Deep Residual Learning for Image Recognition

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network