Author

R.M.K. Sinha

Bio: R.M.K. Sinha is an academic researcher from the Indian Institute of Technology Kanpur. The author has contributed to research in the topics of Devanagari and natural language, has an h-index of 13, and has co-authored 32 publications receiving 766 citations. Previous affiliations of R.M.K. Sinha include Université du Québec and Institut national de la recherche scientifique.

Papers
Journal ArticleDOI
TL;DR: A two-pass algorithm for the segmentation and decomposition of Devanagari composite characters/symbols into their constituent symbols is presented, and the recognition rate achieved on the segmented conjuncts is reported.

143 citations
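
The projection-profile flavor of such two-pass segmentation can be sketched in a few lines. The code below is a hypothetical illustration, not the paper's algorithm: pass one erases the header line (shirorekha) located from the horizontal projection and cuts symbols at empty columns of the vertical projection; pass two re-cuts boxes wide enough to be candidate composites. All names and thresholds are assumptions.

import numpy as np

def segment_two_pass(word_img, wide_factor=1.5):
    # word_img: binarized numpy array, 1 = ink, 0 = background (assumed).
    # Pass 1: take the row with the strongest horizontal projection as
    # the header line (shirorekha) and erase a small band around it so
    # that the vertical projection separates the constituent symbols.
    h_proj = word_img.sum(axis=1)
    header_row = int(np.argmax(h_proj))
    img = word_img.copy()
    img[max(0, header_row - 1):header_row + 2, :] = 0

    v_proj = img.sum(axis=0)
    boxes, start = [], None
    for x, ink in enumerate(v_proj):
        if ink > 0 and start is None:
            start = x
        elif ink == 0 and start is not None:
            boxes.append((start, x))
            start = None
    if start is not None:
        boxes.append((start, len(v_proj)))

    # Pass 2: a box much wider than the median is treated as a candidate
    # composite (e.g. a conjunct) and is re-cut at its weakest column.
    widths = [b - a for a, b in boxes] or [0]
    median_w = float(np.median(widths))
    final = []
    for a, b in boxes:
        if b - a > max(3, wide_factor * median_w):
            cut = a + 1 + int(np.argmin(v_proj[a + 1:b - 1]))
            final.extend([(a, cut), (cut, b)])
        else:
            final.append((a, b))
    return header_row, final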

Journal ArticleDOI
01 Jul 2000
TL;DR: The reading process has been widely studied, and researchers generally agree that knowledge in different forms and at different levels plays a vital role; this is the underlying philosophy of the Devanagari document recognition system described in this work.
Abstract: The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital role. This is the underlying philosophy of the Devanagari document recognition system described in this work. The knowledge sources we use are mostly statistical in nature or in the form of a word dictionary tailored specifically for optical character recognition (OCR). We do not perform any reasoning on these. However, we explore their relative importance and role in the hierarchy. Some of the knowledge sources are acquired a priori by an automated training process while others are extracted from the text as it is processed. A complete Devanagari OCR system has been designed and tested with real-life printed documents of varying size and font. Most of the documents used were photocopies of the original. A performance of approximately 90% correct recognition is achieved.

132 citations
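
The two kinds of knowledge sources named in the abstract, one trained a priori and one extracted from the text as it is processed, can be pictured with a small sketch. Everything below (the sources chosen, the scoring rule, the weights 100 and 1000) is a hypothetical illustration of the general idea, not the system's actual design.

from collections import Counter

def build_knowledge_sources(training_corpus, document_words_so_far):
    # A priori source, acquired by automated training: character bigram
    # counts over a training corpus of correctly spelled words.
    bigrams = Counter(w[i:i + 2] for w in training_corpus
                      for i in range(len(w) - 1))
    # Online source, extracted from the text as it is processed: word
    # frequencies seen so far in the current document.
    doc_freq = Counter(document_words_so_far)
    return bigrams, doc_freq

def score_hypothesis(word, bigrams, doc_freq, dictionary):
    # Combine the sources without any reasoning over them: statistical
    # plausibility, document-level evidence, and a flat bonus for
    # membership in the OCR-tailored word dictionary (assumed weights).
    stat = sum(bigrams[word[i:i + 2]] for i in range(len(word) - 1))
    return stat + 100 * doc_freq[word] + (1000 if word in dictionary else 0)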

Journal ArticleDOI
TL;DR: This paper presents the design of a post-processor that corrects recognized Devanagari symbol strings; the accumulated penalty value for a word gives a measure of its confidence level.

73 citations
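
The penalty-accumulation idea can be sketched as a weighted match against a dictionary: each substitution needed to turn the OCR output into a legal word adds a cost, cheap for known confusions and expensive otherwise, and the total doubles as an inverse confidence measure. This is a minimal sketch of that idea under assumed names and costs, limited to substitutions.

def correct_with_penalty(symbols, dictionary, confusion_cost):
    # Find the dictionary word reachable with the smallest accumulated
    # substitution penalty; the penalty measures (lack of) confidence.
    best_word, best_penalty = None, float("inf")
    for word in dictionary:
        if len(word) != len(symbols):
            continue  # sketch keeps to substitutions; no inserts/deletes
        penalty = sum(confusion_cost.get((s, w), 5.0)  # 5.0 = assumed default
                      for s, w in zip(symbols, word) if s != w)
        if penalty < best_penalty:
            best_word, best_penalty = word, penalty
    return best_word, best_penalty

# e.g. confusion_cost = {("gha", "dha"): 0.5} would make that (hypothetical)
# mix-up cheap to undo, so words differing only there score high confidence.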

Proceedings ArticleDOI
20 Sep 1999
TL;DR: A schema for the description of the shapes of Devanagari characters, and its application to their recognition, is presented; it exploits certain features of the script both to reduce the search space and to create a reference with respect to which correspondence can be established during matching.
Abstract: The paper presents a schema for the description of shapes of Devanagari characters and its application in their recognition. It exploits certain features of the script in both reducing the search space and creating a reference with respect to which correspondence could be established, during the matching process. The description prototypes are constructed using the real-life script after segmentation so that the aberrations introduced during the inevitable process of segmentation get accounted for in the description. This has been tested on printed Devanagari text with a success of approximately 70% without any post-processing and 88% correct recognition with the help of a word dictionary.

62 citations
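
One way to picture a script-feature-based description with a built-in reference is to encode strokes relative to the header line and compare descriptions set-wise. The following is a loose, hypothetical sketch of that general idea, not the paper's schema; the feature encoding and similarity measure are assumptions.

def describe(strokes, header_y):
    # Encode each stroke (x0, y0, x1, y1) by its dominant direction and
    # its position relative to the header line, the script-given
    # reference frame that anchors correspondence during matching.
    desc = set()
    for x0, y0, x1, y1 in strokes:
        direction = "V" if abs(x1 - x0) < abs(y1 - y0) else "H"
        zone = "above" if max(y0, y1) < header_y else "below"
        desc.add((direction, zone))
    return desc

def match_score(desc_a, desc_b):
    # Jaccard similarity between a test description and a prototype;
    # prototypes built from segmented real-life script already carry
    # the aberrations that segmentation introduces.
    union = desc_a | desc_b
    return len(desc_a & desc_b) / len(union) if union else 0.0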

Proceedings ArticleDOI
22 Oct 1995
TL;DR: An English-to-Indian-languages machine-aided translation system named ANGLABHARTI has been developed; its strategy is better than the transfer approach but falls short of a genuine interlingua, in the sense that it forgoes complete disambiguation/understanding of the text to be translated.
Abstract: An English-to-Indian-languages machine-aided translation system named ANGLABHARTI has been developed. It uses a pattern-directed approach with context-free-grammar-like structures. A 'pseudo-target' is generated which is applicable to a group of Indian languages. A set of rules is acquired through corpus analysis to identify the plausible constituents with respect to which movement rules for the 'pseudo-target' are constructed. A number of semantic tags are used to resolve sense ambiguity in the source language. Alternative meanings for unresolved ambiguities are retained in the pseudo-target language code. A text generator module for each target language transforms the pseudo-target language into the target language. A corrector for ill-formed sentences is applied for each target language, and finally a human-engineered post-editing package is used to make the final corrections; the post-editor needs to know only the target language. The strategy used in ANGLABHARTI lies between the transfer and the interlingua approaches. It is better than the transfer approach, as one translation is valid for a host of target languages, but it falls short of a genuine interlingua in the sense that it forgoes complete disambiguation/understanding of the text to be translated.

58 citations
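
The pseudo-target idea, analyze the English input once and generate per language, can be illustrated with a toy pipeline. The rule, lexicon entries, transliterations, and SOV reordering below are all invented for illustration and are not ANGLABHARTI's actual rules or data.

# Hypothetical sketch: one pattern-directed analysis of English, then a
# per-language text generator over a shared pseudo-target.
LEXICON = {
    "child": {"hi": ["baccha"], "ta": ["kuzhandhai"]},
    "eats":  {"hi": ["khata hai"], "ta": ["saapidugiradhu"]},
    "rice":  {"hi": ["chawal", "bhaat"], "ta": ["saadham"]},  # ambiguity kept
}

def to_pseudo_target(english_svo):
    subject, verb, obj = english_svo
    # Pseudo-target: language-neutral roles reordered SOV, which suits the
    # assumed group of target languages; unresolved senses stay as lists.
    return [("SUBJ", subject), ("OBJ", obj), ("VERB", verb)]

def generate(pseudo, lang):
    # Text generator module: pick the first retained sense per slot; a
    # corrector and human post-editing would follow in the full pipeline.
    return " ".join(LEXICON[word][lang][0] for _, word in pseudo)

pseudo = to_pseudo_target(("child", "eats", "rice"))
print(generate(pseudo, "hi"))  # -> "baccha chawal khata hai"
print(generate(pseudo, "ta"))  # same pseudo-target, second language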


Cited by
Journal ArticleDOI
TL;DR: A comprehensive survey of thinning methodologies, including iterative deletion of pixels and nonpixel-based methods, is presented and the relationships among them are explored.
Abstract: A comprehensive survey of thinning methodologies is presented. A wide range of thinning algorithms, including iterative deletion of pixels and nonpixel-based methods, is covered. Skeletonization algorithms based on medial axis and other distance transforms are not considered. An overview of the iterative thinning process and the pixel-deletion criteria needed to preserve the connectivity of the image pattern is given first. Thinning algorithms are then considered in terms of these criteria and their modes of operation. Nonpixel-based methods that usually produce a center line of the pattern directly in one pass without examining all the individual pixels are discussed. The algorithms are considered in great detail and scope, and the relationships among them are explored.

1,827 citations
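
The survey's central family, iterative deletion of pixels under connectivity-preserving criteria, is typified by the classical Zhang-Suen algorithm. Below is a compact sketch of it as an illustration of the genre, not of any specific algorithm singled out by the survey; it assumes a binarized numpy image with a one-pixel background border.

import numpy as np

def zhang_suen_thin(img):
    # Iteratively delete boundary pixels in two sub-iterations whose
    # conditions preserve connectivity. img is 0/1, 1 = ink.
    img = img.copy().astype(np.uint8)
    changed = True
    while changed:
        changed = False
        for step in (0, 1):
            to_delete = []
            for y in range(1, img.shape[0] - 1):
                for x in range(1, img.shape[1] - 1):
                    if img[y, x] == 0:
                        continue
                    # Neighbors P2..P9, clockwise from north.
                    p = [img[y-1, x], img[y-1, x+1], img[y, x+1],
                         img[y+1, x+1], img[y+1, x], img[y+1, x-1],
                         img[y, x-1], img[y-1, x-1]]
                    b = sum(p)                         # nonzero neighbors
                    a = sum((p[i] == 0) and (p[(i+1) % 8] == 1)
                            for i in range(8))         # 0->1 transitions
                    if not (2 <= b <= 6 and a == 1):
                        continue  # deletion would break connectivity
                    if step == 0:
                        if p[0]*p[2]*p[4] == 0 and p[2]*p[4]*p[6] == 0:
                            to_delete.append((y, x))
                    else:
                        if p[0]*p[2]*p[6] == 0 and p[0]*p[4]*p[6] == 0:
                            to_delete.append((y, x))
            for y, x in to_delete:
                img[y, x] = 0
            changed = changed or bool(to_delete)
    return img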

Book
25 Nov 1996
TL;DR: Algorithms for Image Processing and Computer Vision, 2nd Edition provides the tools to speed development of image processing applications.
Abstract: A cookbook of algorithms for common image processing applications. Thanks to advances in computer hardware and software, algorithms have been developed that support sophisticated image processing without requiring an extensive background in mathematics. This bestselling book has been fully updated with the newest of these, including 2D vision methods in content-based searches, details on modern classifier methods, and the use of graphics cards as image processing computational aids. It is an ideal reference for software engineers and developers, advanced programmers, graphics programmers, scientists, and other specialists who require highly specialized image processing. The book saves hours of mathematical calculation by using distributed processing and GPU programming, and gives non-mathematicians the shortcuts needed to program relatively sophisticated applications. Algorithms for Image Processing and Computer Vision, 2nd Edition provides the tools to speed development of image processing applications.

1,517 citations

Journal ArticleDOI
TL;DR: Research aimed at correcting words in text has focused on three progressively more difficult problems: (1) nonword error detection; (2) isolated-word error correction; and (3) context-dependent word correction. The article surveys documented findings on spelling error patterns and the techniques developed for each problem.
Abstract: Research aimed at correcting words in text has focused on three progressively more difficult problems: (1) nonword error detection; (2) isolated-word error correction; and (3) context-dependent word correction. In response to the first problem, efficient pattern-matching and n-gram analysis techniques have been developed for detecting strings that do not appear in a given word list. In response to the second problem, a variety of general and application-specific spelling correction techniques have been developed. Some of them were based on detailed studies of spelling error patterns. In response to the third problem, a few experiments using natural-language-processing tools or statistical-language models have been carried out. This article surveys documented findings on spelling error patterns, provides descriptions of various nonword detection and isolated-word error correction techniques, reviews the state of the art of context-dependent word correction techniques, and discusses research issues related to all three areas of automatic error correction in text.

1,417 citations
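
Problems (1) and (2) have standard textbook instances: word-list lookup for nonword detection, and minimum edit distance for isolated-word correction. A minimal sketch of both follows; the function names and the max_dist cutoff are assumptions, and the surveyed literature covers far more refined variants.

def edit_distance(a, b):
    # Classic dynamic-programming (Levenshtein) edit distance.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (ca != cb))) # substitution
        prev = cur
    return prev[-1]

def detect_and_correct(token, word_list, max_dist=2):
    # Problem (1): a token absent from the word list is a nonword.
    # Problem (2): correct it in isolation via the nearest dictionary word.
    if token in word_list:
        return token, False                 # not an error
    best = min(word_list, key=lambda w: edit_distance(token, w))
    if edit_distance(token, best) <= max_dist:
        return best, True                   # corrected
    return token, True                      # flagged, no confident fix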

Journal ArticleDOI
TL;DR: This work presents algorithms for detecting and tracking text in digital video; the system implements a scale-space feature extractor that feeds an artificial neural processor to detect text blocks.
Abstract: Text that appears in a scene or is graphically added to video can provide an important supplemental source of index information as well as clues for decoding the video's structure and for classification. In this work, we present algorithms for detecting and tracking text in digital video. Our system implements a scale-space feature extractor that feeds an artificial neural processor to detect text blocks. Our text tracking scheme consists of two modules: a sum of squared difference (SSD) based module to find the initial position and a contour-based module to refine the position. Experiments conducted with a variety of video sources show that our scheme can detect and track text robustly.

635 citations
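
The SSD module's role, finding a text block's new position by template match, is simple to sketch: slide the block from the previous frame over a small search window in the current frame and keep the offset with the minimum sum of squared differences. This is a generic SSD search under assumed names, not the paper's implementation, and the contour-based refinement stage is omitted.

import numpy as np

def ssd_track(prev_frame, cur_frame, box, search=8):
    # box = (y, x, h, w): text block location in the previous frame.
    y, x, h, w = box
    template = prev_frame[y:y + h, x:x + w].astype(np.float64)
    best, best_dy, best_dx = np.inf, 0, 0
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            ny, nx = y + dy, x + dx
            if (ny < 0 or nx < 0 or ny + h > cur_frame.shape[0]
                    or nx + w > cur_frame.shape[1]):
                continue  # candidate window falls outside the frame
            patch = cur_frame[ny:ny + h, nx:nx + w].astype(np.float64)
            ssd = float(((patch - template) ** 2).sum())
            if ssd < best:
                best, best_dy, best_dx = ssd, dy, dx
    # Initial position estimate; a refinement module would adjust it.
    return (y + best_dy, x + best_dx, h, w), best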

Journal ArticleDOI
TL;DR: A review of the OCR work done on Indian language scripts is presented, along with the scope of future work and the further steps needed for Indian-script OCR development.

592 citations