scispace - formally typeset
Search or ask a question
Author

Sukalpa Chanda

Other affiliations: Griffith University, MediaTech Institute, Uppsala University  ...read more
Bio: Sukalpa Chanda is an academic researcher from Østfold University College. The author has contributed to research in topics: Optical character recognition & Feature extraction. The author has an hindex of 14, co-authored 47 publications receiving 444 citations. Previous affiliations of Sukalpa Chanda include Griffith University & MediaTech Institute.

Papers
More filters
Proceedings ArticleDOI
26 Jul 2009
TL;DR: A two-stage approach for word-wise identification of English, Devnagari and Bengali (Bangla) scripts is proposed, which allows identifying scripts with high speed, yet less accuracy when dealing with noisy data.
Abstract: A two-stage approach for word-wise identification of English (Roman), Devnagari and Bengali (Bangla) scripts is proposed. This approach balances the tradeoff between recognition accuracy and processing speed. The 1st stage allows identifying scripts with high speed, yet less accuracy when dealing with noisy data. The advanced 2nd stage processes only those samples that yield low recognition confidence in the first stage. For both stages a rough character segmentation is performed and features are computed on segmented character components. Features used in the 1st stage are a 64-dimensional chain-code-histogram feature, while 400-dimensional gradient features are used in the 2nd stage. Final classification of a word to a particular script is done via majority voting of each recognized character component of the word. Extensive experiments with various confidence scores were conducted and reported here. The overall recognition accuracy and speed is remarkable. Correct classification of 98.51% on 11,123 test words is achieved, even when the recognition-confidence is as high as 95% at both stages.

47 citations

Proceedings ArticleDOI
25 Aug 2013
TL;DR: Experiments show that the super resolution technique with gradient features has performed well, and an accuracy of 87.5% was achieved when testing on 896 words from three different scripts, and the use of proper pre-processing approaches can be helpful in applying traditional script identification techniques to video frames.
Abstract: Script identification is an essential step for the efficient use of the appropriate OCR in multilingual document images. There are various techniques available for script identification from printed and handwritten document images, but script identification from video frames has not been explored much. This paper presents a study of some pre-processing techniques and features for word-wise script identification from video frames. Traditional features, namely Zernike moments, Gabor and gradient, have performed well for handwritten and printed documents having simple backgrounds and adequate resolution for OCR. Video frames are mostly coloured and suffer from low resolution, blur, background noise, to mention a few. In this paper, an attempt has been made to explore whether the traditional script identification techniques can be useful in video frames. Three feature extraction techniques, namely Zernike moments, Gabor and gradient features, and SVM classifiers were considered for analyzing three popular scripts, namely English, Bengali and Hindi. Some pre-processing techniques such as super resolution and skeletonization of the original word images were used in order to overcome the inherent problems with video. Experiments show that the super resolution technique with gradient features has performed well, and an accuracy of 87.5% was achieved when testing on 896 words from three different scripts. The study also reveals that the use of proper pre-processing approaches can be helpful in applying traditional script identification techniques to video frames.

38 citations

Proceedings ArticleDOI
27 Mar 2012
TL;DR: A writer identification system for Oriya script is proposed which is capable of performing reasonably well even with small amount of text, and experiments with curvature feature are reported here.
Abstract: Automatic identification of an individual based on his/her handwriting characteristics is an important forensic tool. In a computational forensic scenario, presence of huge amount of text/information in a questioned document cannot be ensured. Lack of data threatens system reliability in such cases. We here propose a writer identification system for Oriya script which is capable of performing reasonably well even with small amount of text. Experiments with curvature feature are reported here, using Support Vector Machine (SVM) as classifier. We got promising results of 94.00% writer identification accuracy at first top choice and 99% when considering first three top choices.

37 citations

01 Jan 2015
TL;DR: In this article, a two-stage approach is followed, in the first stage, a decision is made as whether an image should undergo a pre-processing step (depending on image characteristics); in the second stage, wavelet features are extracted from the image using a sliding window texture analysis-based technique.
Abstract: Bridge inspection is a pathway to bridge condition rating assessment, and is an essential element of any bridge management system (BMS). The success of a BMS is highly dependent on the quality of bridge inspection outcomes and accurate estimation of future bridge condition ratings. However, existing visual bridge inspection methods suffer several limitations due to human subjective judgment. In order to minimise such limitations, a feasibility study has been performed to enhance the current visual inspection method using optical image processing techniques. However, the accuracy of the inspection outcomes still requires further improvement. This paper proposes an automatic bridge inspection approach employing wavelet-based image features along with support vector machines (SVM) for automatic detection of cracks in bridge images. A two-stage approach is followed, in the first stage, a decision is made as whether an image should undergo a pre-processing step (depending on image characteristics); in the second stage, wavelet features are extracted from the image using a sliding window texture analysis-based technique. Consequently, an average accuracy of 92% (effect of training image types on accuracy) is obtained even when undertaking experiments with noisy and complex bridge images.

34 citations

Proceedings ArticleDOI
23 Aug 2010
TL;DR: A system to encounter adverse situation in the context of Bengali script, with promising results of 95.19% writer identification accuracy at first top choice and 99.03% when considering first three top choices is proposed.
Abstract: Automatic identification of an individual based on his/her handwriting characteristics is an important forensic tool. In a computational forensic scenario, presence of huge amount of text/information in a questioned document cannot be always ensured. Also, compromising in terms of systems reliability under such situation is not desirable. We here propose a system to encounter such adverse situation in the context of Bengali script. Experiments with discrete directional feature and gradient feature are reported here, along with Support Vector Machine (SVM) as classifier. We got promising results of 95.19% writer identification accuracy at first top choice and 99.03% when considering first three top choices.

31 citations


Cited by
More filters
01 Jan 1990
TL;DR: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article, where the authors present an overview of their work.
Abstract: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article.

2,933 citations

Journal ArticleDOI
TL;DR: A convolutional neural network is proposed to detect crack patches in each video frame, while the proposed data fusion scheme maintains the spatiotemporal coherence of cracks in videos, and the Naïve Bayes decision making discards false positives effectively.
Abstract: Regular inspection of nuclear power plant components is important to guarantee safe operations. However, current practice is time consuming, tedious, and subjective, which involves human technicians reviewing the inspection videos and identifying cracks on reactors. A few vision-based crack detection approaches have been developed for metallic surfaces, and they typically perform poorly when used for analyzing nuclear inspection videos. Detecting these cracks is a challenging task since they are tiny, and noisy patterns exist on the components’ surfaces. This study proposes a deep learning framework, based on a convolutional neural network (CNN) and a Naive Bayes data fusion scheme, called NB-CNN, to analyze individual video frames for crack detection while a novel data fusion scheme is proposed to aggregate the information extracted from each video frame to enhance the overall performance and robustness of the system. To this end, a CNN is proposed to detect crack patches in each video frame, while the proposed data fusion scheme maintains the spatiotemporal coherence of cracks in videos, and the Naive Bayes decision making discards false positives effectively. The proposed framework achieves a 98.3% hit rate against 0.1 false positives per frame that is significantly higher than state-of-the-art approaches as presented in this paper.

649 citations

Posted Content
Fan Yang1, Lei Zhang1, Sijia Yu1, Danil V. Prokhorov2, Xue Mei2, Haibin Ling1 
TL;DR: A novel network architecture, named feature pyramid and hierarchical boosting network (FPHBN), is proposed, Inspired by recent advances of deep learning in computer vision, for pavement crack detection that outperforms the state-of-the-art crack detection, edge detection, and semantic segmentation methods.
Abstract: Pavement crack detection is a critical task for insuring road safety. Manual crack detection is extremely time-consuming. Therefore, an automatic road crack detection method is required to boost this progress. However, it remains a challenging task due to the intensity inhomogeneity of cracks and complexity of the background, e.g., the low contrast with surrounding pavements and possible shadows with similar intensity. Inspired by recent advances of deep learning in computer vision, we propose a novel network architecture, named Feature Pyramid and Hierarchical Boosting Network (FPHBN), for pavement crack detection. The proposed network integrates semantic information to low-level features for crack detection in a feature pyramid way. And, it balances the contribution of both easy and hard samples to loss by nested sample reweighting in a hierarchical way. To demonstrate the superiority and generality of the proposed method, we evaluate the proposed method on five crack datasets and compare it with state-of-the-art crack detection, edge detection, semantic segmentation methods. Extensive experiments show that the proposed method outperforms these state-of-the-art methods in terms of accuracy and generality.

265 citations

Journal ArticleDOI
TL;DR: A systematic review of the last 10 years of the literature on handwritten signatures with respect to the new scenario is reported, focusing on the most promising domains of research and trying to elicit possible future research directions in this subject.
Abstract: Handwritten signatures are biometric traits at the center of debate in the scientific community. Over the last 40 years, the interest in signature studies has grown steadily, having as its main reference the application of automatic signature verification, as previously published reviews in 1989, 2000, and 2008 bear witness. Ever since, and over the last 10 years, the application of handwritten signature technology has strongly evolved and much research has focused on the possibility of applying systems based on handwritten signature analysis and processing to a multitude of new fields. After several years of haphazard growth of this research area, it is time to assess its current developments for their applicability in order to draw a structured way forward. This perspective reports a systematic review of the last 10 years of the literature on handwritten signatures with respect to the new scenario, focusing on the most promising domains of research and trying to elicit possible future research directions in this subject.

184 citations

Journal ArticleDOI
01 Nov 2011
TL;DR: In this paper, the state of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in various sections of the paper.
Abstract: In India, more than 300 million people use Devanagari script for documentation. There has been a significant improvement in the research related to the recognition of printed as well as handwritten Devanagari text in the past few years. State of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in this paper. All feature-extraction techniques as well as training, classification and matching techniques useful for the recognition are discussed in various sections of the paper. An attempt is made to address the most important results reported so far and it is also tried to highlight the beneficial directions of the research till date. Moreover, the paper also contains a comprehensive bibliography of many selected papers appeared in reputed journals and conference proceedings as an aid for the researchers working in the field of Devanagari OCR.

159 citations