scispace - formally typeset
Open Access

Applications of Text Detection and its Challenges: A Review

Reads0
Chats0
TLDR
In this article, the authors present a survey of text detection and recognition from images to a large extent, including general documents such as newspapers, books and magazines, forms, scientific documents, maps, architectural and engineering drawings, and scene images with textual information.
Abstract
The rising need for automation of systems has effected the development of text detection and recognition from images to a large extent. Text recognition has a wide range of applications, each with scenario dependent challenges and complications. How can these challenges be mitigated? What image processing techniques can be applied to make the text in the image machine readable? How can text be localized and separated from non textual information? How can the text image be converted to digital text format? This paper attempts to answer these questions in chosen scenarios. The types of document images that we have surveyed include general documents such as newspapers, books and magazines, forms, scientific documents, unconstrained documents such as maps, architectural and engineering drawings, and scene images with textual information.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Expense Control: A Gamified, Semi-Automated, Crowd-Based Approach For Receipt Capturing

TL;DR: It is found that the crowd-based approach to enhance the outcome of optical character recognition in the domain of receipt capturing to keep track of expenses is appreciated, that the approach reduces the error rate of captured receipts significantly, and that the gamification provided additional motivation to contribute more and thereby enrich the database.
Journal ArticleDOI

Hindi Text Document Classification System Using SVM and Fuzzy: A Survey

TL;DR: A new idea of Hindi printed and handwritten document classification system using support vector machine and fuzzy logic first pre-processes and then classifies textual imaged documents into predefined categories.

Hand-drawn Electric Circuit Diagram Understanding Using 2D Dynamic programming

TL;DR: In this paper, a twodimensional dynamic programming technique (2D-DP) is used for symbol hypothesis generation, which can correctly locate symbols even when they are drawn temporally overlapped with each other.
Journal ArticleDOI

Advanced Applications on Bilingual Document Analysis and Processing Systems

TL;DR: A journey of bilingual NLP and image-based document classification systems is discussed and an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs is provided.
Journal ArticleDOI

A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

TL;DR: This article proposes a system that performs better than existing systems, and shows the results of experiments on this and other proposed systems.
References
More filters
Journal ArticleDOI

A Survey of Digital Map Processing Techniques

TL;DR: This article presents an overview of existing map processing techniques, bringing together the past and current research efforts in this interdisciplinary field, to characterize the advances that have been made, and to identify future research directions and opportunities.
Book

The document spectrum for page layout analysis

TL;DR: The document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Journal ArticleDOI

A Laplacian Approach to Multi-Oriented Text Detection in Video

TL;DR: Experimental results show that the proposed method is able to handle graphics text and scene text of both horizontal and nonhorizontal orientation.
Journal ArticleDOI

Recognizing mathematical expressions using tree transformation

TL;DR: A robust and efficient system for recognizing typeset and handwritten mathematical notation that allows robust handling of unexpected input, increases the scalability of the system, and provides the groundwork for handling dialects of mathematical notation.