Author

Mita Nasipuri

Other affiliations: MCKV Institute of Engineering
Bio: Mita Nasipuri is an academic researcher from Jadavpur University. The author has contributed to research in topics: Facial recognition system & Feature extraction. The author has an h-index of 36 and has co-authored 437 publications receiving 5294 citations. Previous affiliations of Mita Nasipuri include MCKV Institute of Engineering.


Papers
Journal ArticleDOI
01 May 2012
TL;DR: A methodology where local regions of varying heights and widths are created dynamically and genetic algorithm (GA) is applied on these local regions to sample the optimal set of local regions from where an optimal feature set can be extracted that has the best discriminating features.
Abstract: Identification of local regions from where optimal discriminating features can be extracted is one of the major tasks in the area of pattern recognition. To locate such regions, different kinds of region sampling techniques are used in the literature. There is no standard methodology to identify exactly such regions. Here we have proposed a methodology where local regions of varying heights and widths are created dynamically. Genetic algorithm (GA) is then applied on these local regions to sample the optimal set of local regions from where an optimal feature set can be extracted that has the best discriminating features. We have evaluated the proposed methodology on a data set of handwritten Bangla digits. In the present work, we have randomly generated seven sets of local regions, and from every set GA selects an optimal group of local regions which produces the best recognition performance with a support vector machine (SVM) based classifier. Other popular optimization techniques like simulated annealing (SA) and hill climbing (HC) have also been evaluated with the same data set, and maximum recognition accuracies were found to be 97%, 96.7% and 96.7% for GA, SA and HC, respectively. We have also compared the performance of the present technique with those of other zone based techniques on the same database.

159 citations
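
The selection step described in the abstract above (GA choosing a subset of candidate local regions) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `ga_select_regions`, the population settings, and especially `toy_fitness` are assumptions made for illustration. In the paper the fitness of a subset is the recognition performance of an SVM-based classifier on features drawn from the selected regions; here a stand-in score is used so the example is self-contained.

```python
import random

def ga_select_regions(n_regions, fitness, pop_size=20, generations=30,
                      mutation_rate=0.05, seed=0):
    """Search for a 0/1 mask over candidate regions that maximises `fitness`.

    Hypothetical helper: a plain GA with tournament selection, one-point
    crossover, bit-flip mutation and one elite per generation.
    Requires n_regions >= 2.
    """
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_regions)]
                  for _ in range(pop_size)]

    def tournament():
        # Pick two distinct individuals and keep the fitter one.
        a, b = rng.sample(population, 2)
        return a if fitness(a) >= fitness(b) else b

    for _ in range(generations):
        # Elitism: carry the current best mask over unchanged.
        children = [max(population, key=fitness)[:]]
        while len(children) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randint(1, n_regions - 1)      # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Flip each bit independently with probability mutation_rate.
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]
            children.append(child)
        population = children

    best = max(population, key=fitness)
    return best, fitness(best)

# Stand-in fitness (assumption): agreement with a fixed "useful regions" mask.
# A real setup would train and validate a classifier on the selected regions.
TARGET = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1]

def toy_fitness(mask):
    return sum(int(m == t) for m, t in zip(mask, TARGET))

best_mask, best_score = ga_select_regions(len(TARGET), toy_fitness)
print(best_mask, best_score)
```

Swapping `toy_fitness` for a cross-validated classifier score would bring the sketch closer to the setting the abstract describes, at the cost of a much slower fitness evaluation.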

Journal ArticleDOI
01 Sep 2015
TL;DR: The experimental results show that the csFCM algorithm outperforms k-means, FCM and some other recently proposed FCM-based algorithms on image segmentation, both qualitatively and quantitatively, in terms of cluster validity functions, segmentation accuracy, tissue segmentation accuracy and the receiver operating characteristic (ROC) curve.
Abstract: A conditional spatial fuzzy C-means (csFCM) clustering algorithm to improve the robustness of the conventional FCM algorithm is presented. The method incorporates conditional effects and spatial information into the membership functions. The algorithm resolves the problem of sensitivity to noise and intensity inhomogeneity in magnetic resonance imaging (MRI) data. The experimental results on four volumes of simulated and one volume of real-patient MRI brain images, each one having 51 images, support the efficiency of the csFCM algorithm. The csFCM algorithm has superior performance in terms of qualitative and quantitative studies on the image segmentation results than the k-means, FCM and some other recently proposed FCM-based algorithms.
The fuzzy C-means (FCM) algorithm has gained significant importance in medical image segmentation because it learns without supervision and is more tolerant of variations and noise than other methods. In this paper, we propose a conditional spatial fuzzy C-means (csFCM) clustering algorithm to improve the robustness of the conventional FCM algorithm. This is achieved through the incorporation of conditioning effects imposed by an auxiliary (conditional) variable corresponding to each pixel, which describes a level of involvement of the pixel in the constructed clusters, and spatial information into the membership functions. The problem of sensitivity to noise and intensity inhomogeneity in magnetic resonance imaging (MRI) data is effectively reduced by incorporating local and global spatial information into a weighted membership function.
The experimental results on four volumes of simulated and one volume of real-patient MRI brain images, each one having 51 images, show that the csFCM algorithm has superior performance in terms of qualitative and quantitative studies such as cluster validity functions, segmentation accuracy, tissue segmentation accuracy and receiver operating characteristic (ROC) curve on the image segmentation results than the k-means, FCM and some other recently proposed FCM-based algorithms.

147 citations
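
For orientation, the conventional FCM baseline that the csFCM abstract above builds on can be sketched as follows. This is only the standard algorithm applied to scalar intensities, not csFCM: it omits the conditional variable and the local and global spatial terms the abstract describes. The function names, the evenly spaced initialisation, and the sample values are assumptions chosen for illustration.

```python
def _memberships(values, centers, m, eps=1e-12):
    """Standard FCM membership: u_ik = 1 / sum_j (d_ik / d_jk)^(2/(m-1))."""
    power = 2.0 / (m - 1.0)
    rows = []
    for x in values:
        # eps avoids division by zero when a value coincides with a center.
        dists = [abs(x - c) + eps for c in centers]
        rows.append([1.0 / sum((d_i / d_j) ** power for d_j in dists)
                     for d_i in dists])
    return rows

def fcm_1d(values, n_clusters=2, m=2.0, n_iter=50):
    """Conventional fuzzy C-means on scalar intensities (n_clusters >= 2)."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / (n_clusters - 1)
    # Assumed initialisation: centers evenly spaced over the value range.
    centers = [lo + i * step for i in range(n_clusters)]
    for _ in range(n_iter):
        u = _memberships(values, centers, m)
        # Center update: mean of the values weighted by u_ik^m.
        centers = [
            sum((row[i] ** m) * x for row, x in zip(u, values))
            / sum(row[i] ** m for row in u)
            for i in range(n_clusters)
        ]
    # Recompute memberships so they match the returned centers.
    return centers, _memberships(values, centers, m)

# Made-up intensities forming two well-separated groups.
intensities = [0.0, 0.1, 0.2, 9.8, 9.9, 10.0]
centers, memberships = fcm_1d(intensities)
print(centers)
```

Each membership row sums to one across clusters, which is the constraint that spatially weighted FCM variants also preserve when they reweight the membership function.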

Journal ArticleDOI
TL;DR: This work proposed a patch-based classifier (PBC) using a convolutional neural network (CNN) for automatic classification of histopathological breast images, evaluated on the ICIAR 2018 breast histology image dataset, which comprises four classes: normal, benign, in situ and invasive cancer.

144 citations

Journal ArticleDOI
01 Aug 2012
TL;DR: A new combination of PCA/MPCA and QTLR features for OCR of handwritten numerals is introduced, and it has been observed that the MPCA+QTLR feature combination outperforms the PCA+QTLR feature combination and most other conventional features available in the literature.
Abstract: Principal Component Analysis (PCA) and Modular PCA (MPCA) are well known statistical methods for recognition of facial images. PCA/MPCA alone, however, is found to be insufficient to achieve the high classification accuracy required for handwritten character recognition applications. This is due to the shortcomings of those methods in representing certain local morphometric information present in the character patterns. On the other hand, Quad-tree based hierarchically derived Longest-Run (QTLR) features, a popular type of topological feature for character recognition, miss some global statistical information of the characters. In this paper, we have introduced a new combination of PCA/MPCA and QTLR features for OCR of handwritten numerals. The performance of the designed feature combination is evaluated on handwritten numerals of five popular scripts of the Indian sub-continent, viz., Arabic, Bangla, Devanagari, Latin and Telugu, with a Support Vector Machine (SVM) based classifier. From the results it has been observed that the MPCA+QTLR feature combination outperforms the PCA+QTLR feature combination and most other conventional features available in the literature.

125 citations

Journal ArticleDOI
TL;DR: This paper has described the preparation of a benchmark database for research on off-line Optical Character Recognition (OCR) of document images of handwritten Bangla text and Bangla text mixed with English words, which is the first handwritten database in this area available as an open source document.
Abstract: In this paper, we have described the preparation of a benchmark database for research on off-line Optical Character Recognition (OCR) of document images of handwritten Bangla text and Bangla text mixed with English words. This is the first handwritten database in this area, as mentioned above, available as an open source document. Because India is a multilingual country with a colonial past, multi-script document pages are very common. The database contains 150 handwritten document pages, among which 100 pages are written purely in Bangla script and the remaining 50 pages are written in Bangla text mixed with English words. This database for off-line handwritten scripts is collected from different data sources. After collecting the document pages, all the documents have been preprocessed and distributed into two groups, i.e., CMATERdb1.1.1, containing document pages written in Bangla script only, and CMATERdb1.2.1, containing document pages written in Bangla text mixed with English words. Finally, we have also provided the ground truth images for line segmentation. To generate the ground truth images, we have first labeled each line in a document page automatically by applying one of our previously developed line extraction techniques [Khandelwal et al., PReMI 2009, pp. 369–374] and then corrected any possible error by using our developed tool GT Gen 1.1. Line extraction accuracies of 90.6% and 92.38% are achieved on the two databases, respectively, using our algorithm. Both the databases along with the ground truth annotations and the ground truth generating tool are available freely at http://code.google.com/p/cmaterdb.

119 citations


Cited by
Journal ArticleDOI


08 Dec 2001-BMJ
TL;DR: There is, I think, something ethereal about i, the square root of minus one, which seemed an odd beast at that time: an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Christopher M. Bishop
01 Jan 2006
TL;DR: Covers probability distributions, linear models for regression and classification, neural networks, kernel methods, graphical models, mixture models and EM, approximate inference, sampling methods, continuous latent variables, sequential data, and combining models.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

01 Jan 1990
TL;DR: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article.
Abstract: An overview of the self-organizing map algorithm, on which the papers in this issue are based, is presented in this article.

2,933 citations