Text Detection and Recognition in Imagery: A Survey

doi:10.1109/TPAMI.2014.2366765

Open AccessJournal ArticleDOI

Text Detection and Recognition in Imagery: A Survey

Qixiang Ye, +1 more

- 01 Jul 2015 -

IEEE Transactions on Pattern Analysis an...

- Vol. 37, Iss: 7, pp 1480-1500

TLDR

This review provides a fundamental comparison and analysis of the remaining problems in the field and summarizes the fundamental problems and enumerates factors that should be considered when addressing these problems.

Abstract:

This paper analyzes, compares, and contrasts technical challenges, methods, and the performance of text detection and recognition research in color imagery It summarizes the fundamental problems and enumerates factors that should be considered when addressing these problems Existing techniques are categorized as either stepwise or integrated and sub-problems are highlighted including text localization, verification, segmentation and recognition Special issues associated with the enhancement of degraded text and the processing of video text, multi-oriented, perspectively distorted and multilingual text are also addressed The categories and sub-categories of text are illustrated, benchmark datasets are enumerated, and the performance of the most representative approaches is compared This review provides a fundamental comparison and analysis of the remaining problems in the field

Citations

PDF

Open Access

More filters

Journal ArticleDOI

An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition

Baoguang Shi, +2 more

- 01 Nov 2017 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Zhang et al. as mentioned in this paper proposed a novel neural network architecture, which integrates feature extraction, sequence modeling and transcription into a unified framework, and achieved remarkable performances in both lexicon free and lexicon-based scene text recognition tasks.

...read moreread less

Journal ArticleDOI

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020 -

International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

Proceedings ArticleDOI

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, +6 more

TL;DR: This work proposes a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes, and significantly outperforms state-of-the-art methods in terms of both accuracy and efficiency.

...read moreread less

Journal ArticleDOI

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

Jianqi Ma, +6 more

- 23 Mar 2018 -

IEEE Transactions on Multimedia

TL;DR: The Rotation Region Proposal Networks are designed to generate inclined proposals with text orientation angle information that are adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation.

...read moreread less

Posted Content

Object Detection in 20 Years: A Survey

Zhengxia Zou, +3 more

- 13 May 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper extensively reviews 400+ papers of object detection in the light of its technical evolution, spanning over a quarter-century's time (from the 1990s to 2019), and makes an in-deep analysis of their challenges as well as technical improvements in recent years.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Handbook of Psycholinguistics

Morton Ann Gernsbacher

TL;DR: The authors, The Neuropsychology of language and its relationship with the human brain: A Guide to Research on the Perception of Speech and its Implications for Research and Theory, The authors.

...read moreread less

Proceedings ArticleDOI

Detecting text in natural scenes with stroke width transform

Boris Epshtein, +2 more

TL;DR: A novel image operator is presented that seeks to find the value of stroke width for each image pixel, and its use on the task of text detection in natural images is demonstrated.

...read moreread less

Journal ArticleDOI

Limits on super-resolution and how to break them

Simon Baker, +1 more

- 01 Sep 2002 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work derives a sequence of analytical results which show that the reconstruction constraints provide less and less useful information as the magnification factor increases, and proposes a super-resolution algorithm which attempts to recognize local features in the low-resolution images and then enhances their resolution in an appropriate manner.

...read moreread less

Proceedings ArticleDOI

ICDAR 2013 Robust Reading Competition

Dimosthenis Karatzas, +9 more

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.

...read moreread less

Journal ArticleDOI

reCAPTCHA: Human-Based Character Recognition via Web Security Measures

Luis von Ahn, +4 more

- 12 Sep 2008 -

Science

TL;DR: This research explored whether human effort can be channeled into a useful purpose: helping to digitize old printed material by asking users to decipher scanned words from books that computerized optical character recognition failed to recognize.

...read moreread less