Journal ArticleDOI
A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents
Shalini Puri,Satya Prakash Singh +1 more
Reads0
Chats0
TLDR
This article proposes a system that performs better than existing systems, and shows the results of experiments on this and other proposed systems.Abstract:
Thisarticleproposesabi-leveledimageclassificationsystemtoclassifyprintedandhandwritten Englishdocumentsintomutuallyexclusivepredefinedcategories.Theproposedsystemfollowsthe stepsofpreprocessing, segmentation, featureextraction, andSVMbasedcharacterclassification atlevel1,andwordassociationandfuzzymatchingbaseddocumentclassificationatlevel2.The systemarchitectureanditsmodularstructurediscussvarioustaskstagesandtheirfunctionalities. Further,acasestudyondocumentclassificationisdiscussedtoshowtheinternalscorecomputations ofwordsandkeywordswithfuzzymatching.Theexperimentsonproposedsystemillustratethatthe systemachievespromisingresultsinthetime-efficientmannerandachievesbetteraccuracywith lesscomputation timeforprinteddocuments thanhandwrittenones.Finally, theperformanceof theproposedsystemiscomparedwiththeexistingsystemsanditisobservedthatproposedsystem performsbetterthanmanyothersystems. KeywoRDS Confidence Computation, Document Image, Fuzzy Matching, Handwritten Documents, Performance Analysis, Printed Documents, SVM, Text Image Classification, Word Associationread more
Citations
More filters
Journal ArticleDOI
Intelligent Recognition and Teaching of English Fuzzy Texts Based on Fuzzy Computing and Big Data
Ling Liu,Sang-Bing Tsai +1 more
TL;DR: In this paper, in-depth research and analysis is conducted on the intelligent recognition and teaching of English fuzzy text through parallel projection and region expansion and the substring representation in the model ensures the generation of unregistered word vectors.
Applications of Text Detection and its Challenges: A Review
M. P. Nevetha,A. Baskar +1 more
TL;DR: In this article, the authors present a survey of text detection and recognition from images to a large extent, including general documents such as newspapers, books and magazines, forms, scientific documents, maps, architectural and engineering drawings, and scene images with textual information.
Journal ArticleDOI
Advanced Applications on Bilingual Document Analysis and Processing Systems
Shalini Puri,Satya Prakash Singh +1 more
TL;DR: A journey of bilingual NLP and image-based document classification systems is discussed and an overview of their methods, feature extraction techniques, document sets, classifiers, and accuracy for English-Hindi and other language pairs is provided.
Journal ArticleDOI
Opinion Classification of Product Reviews Using Naïve Bayes, Logistic Regression and Sentiwordnet: Challenges and Survey
A Dadhich,B Thankachan +1 more
TL;DR: A detailed review and comparative analysis of various existing sentiment analysis algorithms especially for the Amazon products, which have worked upon the supervised learning techniques called Naïve Bayes, logistic regression and SentiWordNet are presented.
References
More filters
Proceedings ArticleDOI
Script based text identification: a multi-level architecture
TL;DR: The proposed framework presents a top-down approach by performing page, block/paragraph and word level script identification in multiple stages by utilizing texture and shape based information embedded in the documents at different levels for feature extraction.
Journal ArticleDOI
Separation of Handwritten and Machine-Printed Texts from Noisy Documents Using Contourlet Transform
Parul Sahare,Sanjay B. Dhok +1 more
TL;DR: A new algorithm is proposed for separation of machine-printed and handwritten texts using correlation coefficients and probabilities-based moments features and it provides a better text separation performance compared to that of other state-of-the-art approaches.
Book ChapterDOI
Word-Level Script Identification from Handwritten Multi-script Documents
TL;DR: A combination of shape based and texture based features are used to identify the script of the handwritten word images written in any of five scripts namely, Bangla, Devnagari, Malayalam, Telugu and Roman.
Proceedings ArticleDOI
Applications of Text Detection and its Challenges: A Review
M. P. Nevetha,A. Baskar +1 more
TL;DR: The rising need for automation of systems has effected the development of text detection and recognition from images to a large extent, and this paper attempts to answer questions in chosen scenarios.
Journal ArticleDOI
A Hybrid Hindi Printed Document Classification System Using SVM and Fuzzy: An Advancement
Shalini Puri,Satya Prakash Singh +1 more
TL;DR: A new advanced tri-layered segmentation and bi-leveled-classifier-based Hindi printed document classification system, which categorizes imaged documents into pre-defined mutually exclusive categories by using SVM and Fuzzy matching at character and document classifications, respectively.