scispace - formally typeset
Journal ArticleDOI

A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

Reads0
Chats0
TLDR
This article proposes a system that performs better than existing systems, and shows the results of experiments on this and other proposed systems.
Abstract
Thisarticleproposesabi-leveledimageclassificationsystemtoclassifyprintedandhandwritten Englishdocumentsintomutuallyexclusivepredefinedcategories.Theproposedsystemfollowsthe stepsofpreprocessing, segmentation, featureextraction, andSVMbasedcharacterclassification atlevel1,andwordassociationandfuzzymatchingbaseddocumentclassificationatlevel2.The systemarchitectureanditsmodularstructurediscussvarioustaskstagesandtheirfunctionalities. Further,acasestudyondocumentclassificationisdiscussedtoshowtheinternalscorecomputations ofwordsandkeywordswithfuzzymatching.Theexperimentsonproposedsystemillustratethatthe systemachievespromisingresultsinthetime-efficientmannerandachievesbetteraccuracywith lesscomputation timeforprinteddocuments thanhandwrittenones.Finally, theperformanceof theproposedsystemiscomparedwiththeexistingsystemsanditisobservedthatproposedsystem performsbetterthanmanyothersystems. KeywoRDS Confidence Computation, Document Image, Fuzzy Matching, Handwritten Documents, Performance Analysis, Printed Documents, SVM, Text Image Classification, Word Association

read more

Citations
More filters
Journal ArticleDOI

Intelligent Recognition and Teaching of English Fuzzy Texts Based on Fuzzy Computing and Big Data

TL;DR: In this paper, in-depth research and analysis is conducted on the intelligent recognition and teaching of English fuzzy text through parallel projection and region expansion and the substring representation in the model ensures the generation of unregistered word vectors.

Applications of Text Detection and its Challenges: A Review

TL;DR: In this article, the authors present a survey of text detection and recognition from images to a large extent, including general documents such as newspapers, books and magazines, forms, scientific documents, maps, architectural and engineering drawings, and scene images with textual information.
Journal ArticleDOI

Advanced Applications on Bilingual Document Analysis and Processing Systems

TL;DR: A journey of bilingual NLP and image-based document classification systems is discussed and an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs is provided.
Journal ArticleDOI

Opinion Classification of Product Reviews Using Naïve Bayes, Logistic Regression and Sentiwordnet: Challenges and Survey

A Dadhich, +1 more
TL;DR: A detailed review and comparative analysis of various existing sentiment analysis algorithms especially for the Amazon products, which have worked upon the supervised learning techniques called Naïve Bayes, logistic regression and SentiWordNet are presented.
References
More filters
Proceedings ArticleDOI

Script Identification A Han and Roman Script Perspective

TL;DR: This work proposes a system to address the problem of identification of scripts from a single document page using directional features along with a Gaussian Kernel-based Support Vector Machine, and gets promising results.
Proceedings ArticleDOI

Machine printed handwritten text discrimination using Radon transform and SVM classifier

TL;DR: This paper addresses the problem of identifying each type by using the Radon transform and Support Vector Machines, which is conducted at three steps: preprocessing, feature generation and classification.
Journal ArticleDOI

Word-Wise Thai and Roman Script Identification

TL;DR: An SVM-based method is proposed for identification of word-wise printed Roman and Thai scripts from a single line of a document page and it is obtained 99.62% script identification accuracy from the proposed scheme.
Proceedings ArticleDOI

A fuzzy approach for word level script identification of text in low resolution display board images using wavelet features

TL;DR: A new fuzzy based approach for word level script identification of text in low resolution images of display boards is presented that is robust and insensitive to the variations in size and style of font, number of characters, thickness and spacing between characters, noise, and other degradations.
Proceedings ArticleDOI

A technical study and analysis of text classification techniques in N - Lingual documents

TL;DR: A technical study and analysis is presented to show N-lingual document classification for normal text, printed and handwritten documents and three statistically analyzed charts are shown, which are based on content type classification, language-mode pair and most-to-least preferred languages of existing algorithms.