scispace - formally typeset
Journal ArticleDOI

A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

Reads0
Chats0
TLDR
This article proposes a system that performs better than existing systems, and shows the results of experiments on this and other proposed systems.
Abstract
Thisarticleproposesabi-leveledimageclassificationsystemtoclassifyprintedandhandwritten Englishdocumentsintomutuallyexclusivepredefinedcategories.Theproposedsystemfollowsthe stepsofpreprocessing, segmentation, featureextraction, andSVMbasedcharacterclassification atlevel1,andwordassociationandfuzzymatchingbaseddocumentclassificationatlevel2.The systemarchitectureanditsmodularstructurediscussvarioustaskstagesandtheirfunctionalities. Further,acasestudyondocumentclassificationisdiscussedtoshowtheinternalscorecomputations ofwordsandkeywordswithfuzzymatching.Theexperimentsonproposedsystemillustratethatthe systemachievespromisingresultsinthetime-efficientmannerandachievesbetteraccuracywith lesscomputation timeforprinteddocuments thanhandwrittenones.Finally, theperformanceof theproposedsystemiscomparedwiththeexistingsystemsanditisobservedthatproposedsystem performsbetterthanmanyothersystems. KeywoRDS Confidence Computation, Document Image, Fuzzy Matching, Handwritten Documents, Performance Analysis, Printed Documents, SVM, Text Image Classification, Word Association

read more

Citations
More filters
Journal ArticleDOI

Intelligent Recognition and Teaching of English Fuzzy Texts Based on Fuzzy Computing and Big Data

TL;DR: In this paper, in-depth research and analysis is conducted on the intelligent recognition and teaching of English fuzzy text through parallel projection and region expansion and the substring representation in the model ensures the generation of unregistered word vectors.

Applications of Text Detection and its Challenges: A Review

TL;DR: In this article, the authors present a survey of text detection and recognition from images to a large extent, including general documents such as newspapers, books and magazines, forms, scientific documents, maps, architectural and engineering drawings, and scene images with textual information.
Journal ArticleDOI

Advanced Applications on Bilingual Document Analysis and Processing Systems

TL;DR: A journey of bilingual NLP and image-based document classification systems is discussed and an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs is provided.
Journal ArticleDOI

Opinion Classification of Product Reviews Using Naïve Bayes, Logistic Regression and Sentiwordnet: Challenges and Survey

A Dadhich, +1 more
TL;DR: A detailed review and comparative analysis of various existing sentiment analysis algorithms especially for the Amazon products, which have worked upon the supervised learning techniques called Naïve Bayes, logistic regression and SentiWordNet are presented.
References
More filters
Proceedings ArticleDOI

Script based text identification: a multi-level architecture

TL;DR: The proposed framework presents a top-down approach by performing page, block/paragraph and word level script identification in multiple stages by utilizing texture and shape based information embedded in the documents at different levels for feature extraction.
Journal ArticleDOI

Separation of Handwritten and Machine-Printed Texts from Noisy Documents Using Contourlet Transform

TL;DR: A new algorithm is proposed for separation of machine-printed and handwritten texts using correlation coefficients and probabilities-based moments features and it provides a better text separation performance compared to that of other state-of-the-art approaches.
Book ChapterDOI

Word-Level Script Identification from Handwritten Multi-script Documents

TL;DR: A combination of shape based and texture based features are used to identify the script of the handwritten word images written in any of five scripts namely, Bangla, Devnagari, Malayalam, Telugu and Roman.
Proceedings ArticleDOI

Applications of Text Detection and its Challenges: A Review

TL;DR: The rising need for automation of systems has effected the development of text detection and recognition from images to a large extent, and this paper attempts to answer questions in chosen scenarios.
Journal ArticleDOI

A Hybrid Hindi Printed Document Classification System Using SVM and Fuzzy: An Advancement

TL;DR: A new advanced tri-layered segmentation and bi-leveled-classifier-based Hindi printed document classification system, which categorizes imaged documents into pre-defined mutually exclusive categories by using SVM and Fuzzy matching at character and document classifications, respectively.