scispace - formally typeset
Journal ArticleDOI

Automatic detection and recognition of signs from natural scenes

TLDR
This paper proposes a local intensity normalization method to effectively handle lighting variations, followed by a Gabor transform to obtain local features, and finally a linear discriminant analysis (LDA) method for feature selection.
Abstract
In this paper, we present an approach to automatic detection and recognition of signs from natural scenes, and its application to a sign translation task. The proposed approach embeds multiresolution and multiscale edge detection, adaptive searching, color analysis, and affine rectification in a hierarchical framework for sign detection, with different emphases at each phase to handle the text in different sizes, orientations, color distributions and backgrounds. We use affine rectification to recover deformation of the text regions caused by an inappropriate camera view angle. The procedure can significantly improve text detection rate and optical character recognition (OCR) accuracy. Instead of using binary information for OCR, we extract features from an intensity image directly. We propose a local intensity normalization method to effectively handle lighting variations, followed by a Gabor transform to obtain local features, and finally a linear discriminant analysis (LDA) method for feature selection. We have applied the approach in developing a Chinese sign translation system, which can automatically detect and recognize Chinese signs as input from a camera, and translate the recognized text into English.

read more

Citations
More filters
Journal ArticleDOI

Text Detection and Recognition in Imagery: A Survey

TL;DR: This review provides a fundamental comparison and analysis of the remaining problems in the field and summarizes the fundamental problems and enumerates factors that should be considered when addressing these problems.
Journal ArticleDOI

Histogram of Gabor Phase Patterns (HGPP): A Novel Object Representation Approach for Face Recognition

TL;DR: The proposed methods are successfully applied to face recognition, and the experiment results on the large-scale FERET and CAS-PEAL databases show that the proposed algorithms significantly outperform other well-known systems in terms of recognition rate.
Book ChapterDOI

A method for text localization and recognition in real-world images

TL;DR: The paper is first to report both text detection and recognition results on the standard and rather challenging ICDAR 2003 dataset, and the text localization works for number of alphabets and the method is easily adapted to recognition of other scripts, e.g. cyrillics.
Journal ArticleDOI

A Hybrid Approach to Detect and Localize Texts in Natural Scene Images

TL;DR: A hybrid approach to robustly detect and localize texts in natural scene images using a text region detector, a conditional random field model, and a learning-based energy minimization method are presented.
Journal ArticleDOI

Text String Detection From Natural Scenes by Structure-Based Partition and Grouping

TL;DR: A new framework to detect text strings with arbitrary orientations in complex natural scene images with outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation.
References
More filters
Journal ArticleDOI

Face recognition by elastic bunch graph matching

TL;DR: A system for recognizing human faces from single images out of a large database containing one image per person, based on a Gabor wavelet transform, which is constructed from a small get of sample image graphs.
Book ChapterDOI

Face Recognition by Elastic Bunch Graph Matching

TL;DR: A system for recognizing human faces from single images out of a large database with one image per person, using the bunch graph, which is constructed from a small set of sample image graphs.
Journal ArticleDOI

Representation of local geometry in the visual system

TL;DR: It is shown that a convolution with certain reasonable receptive field (RF) profiles yields the exact partial derivatives of the retinal illuminance blurred to a specified degree and how this representation can function as the substrate for “point processors” computing geometrical features such as edge curvature.
Journal ArticleDOI

Automatic text detection and tracking in digital video

TL;DR: This work presents algorithms for detecting and tracking text in digital video that implements a scale-space feature extractor that feeds an artificial neural processor to detect text blocks.
Proceedings ArticleDOI

Automatic text location in images and video frames

TL;DR: Compared with some traditional text location methods, this method has the following advantages: 1) low computational cost; 2) robust to font size; and 3) high accuracy.
Related Papers (5)