Scene Text Detection via Connected Component Clustering and Nontext Filtering

doi:10.1109/TIP.2013.2249082

Journal ArticleDOI

Scene Text Detection via Connected Component Clustering and Nontext Filtering

Hyung Il Koo, +1 more

- 01 Jun 2013 -

IEEE Transactions on Image Processing

- Vol. 22, Iss: 6, pp 2296-2305

Chats0

TLDR

A new scene text detection algorithm based on two machine learning classifiers that allows us to generate candidate word regions and the other filters out nontext ones, and extends the approach to exploit multichannel information.

Abstract:

In this paper, we present a new scene text detection algorithm based on two machine learning classifiers: one allows us to generate candidate word regions and the other filters out nontext ones. To be precise, we extract connected components (CCs) in images by using the maximally stable extremal region algorithm. These extracted CCs are partitioned into clusters so that we can generate candidate regions. Unlike conventional methods relying on heuristic rules in clustering, we train an AdaBoost classifier that determines the adjacency relationship and cluster CCs by using their pairwise relations. Then we normalize candidate word regions and determine whether each region contains text or not. Since the scale, skew, and color of each candidate can be estimated from CCs, we develop a text/nontext classifier for normalized images. This classifier is based on multilayer perceptrons and we can control recall and precision rates with a single free parameter. Finally, we extend our approach to exploit multichannel information. Experimental results on ICDAR 2005 and 2011 robust reading competition datasets show that our method yields the state-of-the-art performance both in speed and accuracy.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

ICDAR 2013 Robust Reading Competition

Dimosthenis Karatzas, +9 more

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.

...read moreread less

Proceedings ArticleDOI

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, +6 more

TL;DR: This work proposes a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes, and significantly outperforms state-of-the-art methods in terms of both accuracy and efficiency.

...read moreread less

Journal ArticleDOI

Text Detection and Recognition in Imagery: A Survey

Qixiang Ye, +1 more

- 01 Jul 2015 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This review provides a fundamental comparison and analysis of the remaining problems in the field and summarizes the fundamental problems and enumerates factors that should be considered when addressing these problems.

...read moreread less

Journal ArticleDOI

Robust Text Detection in Natural Scene Images

Xu-Cheng Yin, +3 more

- 01 May 2014 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An accurate and robust method for detecting texts in natural scene images using a fast and effective pruning algorithm to extract Maximally Stable Extremal Regions (MSERs) as character candidates using the strategy of minimizing regularized variations is proposed.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Proceedings ArticleDOI

Rapid object detection using a boosted cascade of simple features

Paul A. Viola, +1 more

TL;DR: A machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates and the introduction of a new image representation called the "integral image" which allows the features used by the detector to be computed very quickly.

...read moreread less

Journal ArticleDOI

Additive Logistic Regression : A Statistical View of Boosting

Jerome H. Friedman, +2 more

- 01 Apr 2000 -

Annals of Statistics

TL;DR: This work shows that this seemingly mysterious phenomenon of boosting can be understood in terms of well-known statistical principles, namely additive modeling and maximum likelihood, and develops more direct approximations and shows that they exhibit nearly identical results to boosting.

...read moreread less

Journal ArticleDOI

Robust wide-baseline stereo from maximally stable extremal regions

Jiri Matas, +3 more

- 01 Sep 2004 -

Image and Vision Computing

TL;DR: The high utility of MSERs, multiple measurement regions and the robust metric is demonstrated in wide-baseline experiments on image pairs from both indoor and outdoor scenes.

...read moreread less

Proceedings ArticleDOI

Robust wide baseline stereo from maximally stable extremal regions

Jiri Matas, +3 more

TL;DR: The wide-baseline stereo problem, i.e. the problem of establishing correspondences between a pair of images taken from different viewpoints, is studied and an efficient and practically fast detection algorithm is presented for an affinely-invariant stable subset of extremal regions, the maximally stable extremal region (MSER).

...read moreread less

Collapse

Scene Text Detection via Connected Component Clustering and Nontext Filtering

Citations

ICDAR 2015 competition on Robust Reading

ICDAR 2013 Robust Reading Competition

EAST: An Efficient and Accurate Scene Text Detector

Text Detection and Recognition in Imagery: A Survey

Robust Text Detection in Natural Scene Images

References

Gradient-based learning applied to document recognition

Rapid object detection using a boosted cascade of simple features

Additive Logistic Regression : A Statistical View of Boosting

Robust wide-baseline stereo from maximally stable extremal regions

Robust wide baseline stereo from maximally stable extremal regions

Related Papers (5)

Detecting text in natural scenes with stroke width transform

Real-time scene text localization and recognition

ICDAR 2013 Robust Reading Competition

Detecting texts of arbitrary orientations in natural images

Detecting and reading text in natural scenes