scispace - formally typeset
Search or ask a question
Topic

Devanagari

About: Devanagari is a research topic. Over the lifetime, 655 publications have been published within this topic receiving 7428 citations. The topic is also known as: Deva nagari & Hindi Script.


Papers
More filters
Proceedings ArticleDOI
01 Dec 2014
TL;DR: A Word and Character Segmentation method for machine printed Devanagari text and some basic morphological operations on the scanned document images are proposed and got much better results.
Abstract: Finding Structural Layout, Text Line Segmentation, Word Level Segmentation and Character Level Segmentation is major step in offline OCR systems for Devanagari Script in Document Image Processing. This paper proposes a Word and Character Segmentation method for machine printed Devanagari text. A complete word and character segmentation system for Devanagari printed text is presented here. Sometimes, interline space and fused characters make line segmentation and character segmentation a difficult task respectively. We have tested our method on documents in Marathi scripts. A novel technique of character segmentation for printed Devanagari text is presented here. After removing the Shirorekha (header line) of Devanagari text, the bounding boxes are used to surround the segmented characters. Results obtained from this method are encouraging because of morphological operations. In this method we are proposing some basic morphological operations on the scanned document images and got much better results.

10 citations

Proceedings ArticleDOI
27 Jan 2008
TL;DR: An OCR-based technique for word spotting in Devanagari printed documents that achieves an average precision of 80% for 10 queries and 67% for 20 queries for a set of 64 documents containing 5780 word images.
Abstract: This paper describes an OCR-based technique for word spotting in Devanagari printed documents. The system accepts a Devanagari word as input and returns a sequence of word images that are ranked according to their similarity with the input query. The methodology involves line and word separation, pre-processing document words, word recognition using OCR and similarity matching. We demonstrate a Block Adjacency Graph (BAG) based document cleanup in the pre-processing phase. During word recognition, multiple recognition hypotheses are generated for each document word using a font-independent Devanagari OCR. The similarity matching phase uses a cost based model to match the word input by a user and the OCR results. Experiments are conducted on document images from the publicly available ILT and Million Book Project dataset. The technique achieves an average precision of 80% for 10 queries and 67% for 20 queries for a set of 64 documents containing 5780 word images. The paper also presents a comparison of our method with template-based word spotting techniques.

10 citations

Proceedings ArticleDOI
11 Jul 2018
TL;DR: The improved accuracy of Hindi, English and Bangla digit dataset is shown by using the proposed approach and also performing a number of cross-validation experiments on all three datasets using image augmentation.
Abstract: Handwritten digit recognition has turned into one of the demanding areas of research in the field of image processing. Many approaches have been proposed which include a statistical method, fuzzy technique, and neural network for feature classification and feature selection but have not been found to use convolutional autoencoder for Devanagari digit after performing image augmentation on the training dataset. This paper shows the use of unsupervised training using convolutional autoencoder with deep ConvNet in order to detect handwritten Devanagari digits, i.e., 0–9. Convolutional autoencoder is the type of autoencoder that is used to encode the input for extracting important features and then try to reconstruct the input image. This paper shows the improved accuracy of Hindi, English and Bangla digit dataset by using the proposed approach and also performing a number of cross-validation experiments on all three datasets using image augmentation.

10 citations

01 Jan 2010
TL;DR: A brief survey of Devanagari script, research and different classifiers approach used for Character Recognition is done and database of handwritten characters is created.
Abstract: In this paper we have done a brief survey of Devanagari script, research and different classifiers approach used for Character Recognition .We have collected and created database of handwritten characters. We have preprocessed it, extracted feature using local intensity distribution of gradient and used in our experiment. We have used KNN Classifier. Experiment result illustrates the accuracy, rejection, error percentage of classification. Keywords- Devanagari script, KNN Classifier, accuracy, rejection, error percentage of classification

10 citations

Journal ArticleDOI
TL;DR: A new technique for recognition of handwritten Devanagari Script using deep learning architecture that learns hierarchies of features using an unsupervised stacked Restrict...
Abstract: In this paper, we have proposed a new technique for recognition of handwritten Devanagari Script using deep learning architecture. In any OCR or classification system extracting discriminating feature is most important and crucial step for its success. Accuracy of such system often depends on the good feature representation. Deciding upon the appropriate features for classification system is highly subjective and requires lot of experience to decide proper set of features for a given classification system. For handwritten Devanagari characters it is very difficult to decide on optimal set of good feature to get good recognition rate. These methods use raw pixel values as features. Deep Learning architectures learn hierarchies of features. In this work, first image is preprocessed to remove noise, converted to binary image, resized to fixed size of 30×40 and then convert to gray scale image using mask operation, it blurs the edges of the images. Then we learn features using an unsupervised stacked Restrict...

10 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
77% related
Support vector machine
73.6K papers, 1.7M citations
75% related
Image segmentation
79.6K papers, 1.8M citations
74% related
Convolutional neural network
74.7K papers, 2M citations
74% related
Encryption
98.3K papers, 1.4M citations
73% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202342
202298
202148
202061
201938
201843