Open Access
Applications of Text Detection and its Challenges: A Review
M. P. Nevetha,A. Baskar +1 more
Reads0
Chats0
TLDR
In this article, the authors present a survey of text detection and recognition from images to a large extent, including general documents such as newspapers, books and magazines, forms, scientific documents, maps, architectural and engineering drawings, and scene images with textual information.Abstract:
The rising need for automation of systems has effected the development of text detection and recognition from images to a large extent. Text recognition has a wide range of applications, each with scenario dependent challenges and complications. How can these challenges be mitigated? What image processing techniques can be applied to make the text in the image machine readable? How can text be localized and separated from non textual information? How can the text image be converted to digital text format? This paper attempts to answer these questions in chosen scenarios. The types of document images that we have surveyed include general documents such as newspapers, books and magazines, forms, scientific documents, unconstrained documents such as maps, architectural and engineering drawings, and scene images with textual information.read more
Citations
More filters
Proceedings ArticleDOI
Expense Control: A Gamified, Semi-Automated, Crowd-Based Approach For Receipt Capturing
TL;DR: It is found that the crowd-based approach to enhance the outcome of optical character recognition in the domain of receipt capturing to keep track of expenses is appreciated, that the approach reduces the error rate of captured receipts significantly, and that the gamification provided additional motivation to contribute more and thereby enrich the database.
Journal ArticleDOI
Hindi Text Document Classification System Using SVM and Fuzzy: A Survey
Shalini Puri,Satya Prakash Singh +1 more
TL;DR: A new idea of Hindi printed and handwritten document classification system using support vector machine and fuzzy logic first pre-processes and then classifies textual imaged documents into predefined categories.
Hand-drawn Electric Circuit Diagram Understanding Using 2D Dynamic programming
TL;DR: In this paper, a twodimensional dynamic programming technique (2D-DP) is used for symbol hypothesis generation, which can correctly locate symbols even when they are drawn temporally overlapped with each other.
Journal ArticleDOI
Advanced Applications on Bilingual Document Analysis and Processing Systems
Shalini Puri,Satya Prakash Singh +1 more
TL;DR: A journey of bilingual NLP and image-based document classification systems is discussed and an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs is provided.
Journal ArticleDOI
A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents
Shalini Puri,Satya Prakash Singh +1 more
TL;DR: This article proposes a system that performs better than existing systems, and shows the results of experiments on this and other proposed systems.
References
More filters
Journal ArticleDOI
A Survey of Digital Map Processing Techniques
TL;DR: This article presents an overview of existing map processing techniques, bringing together the past and current research efforts in this interdisciplinary field, to characterize the advances that have been made, and to identify future research directions and opportunities.
Book
The document spectrum for page layout analysis
TL;DR: The document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Journal ArticleDOI
A Laplacian Approach to Multi-Oriented Text Detection in Video
TL;DR: Experimental results show that the proposed method is able to handle graphics text and scene text of both horizontal and nonhorizontal orientation.
Journal ArticleDOI
Recognizing mathematical expressions using tree transformation
TL;DR: A robust and efficient system for recognizing typeset and handwritten mathematical notation that allows robust handling of unexpected input, increases the scalability of the system, and provides the groundwork for handling dialects of mathematical notation.