scispace - formally typeset
Search or ask a question
Author

Samay Pashine

Bio: Samay Pashine is an academic researcher. The author has contributed to research in topics: Deep learning & Handwriting recognition. The author has an hindex of 1, co-authored 3 publications receiving 6 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: This paper has performed handwritten digit recognition with the help of MNIST datasets using Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and Convolution Neural Network (CNN) models.
Abstract: The reliance of humans over machines has never been so high such that from object classification in photographs to adding sound to silent movies everything can be performed with the help of deep learning and machine learning algorithms. Likewise, Handwritten text recognition is one of the significant areas of research and development with a streaming number of possibilities that could be attained. Handwriting recognition (HWR), also known as Handwritten Text Recognition (HTR), is the ability of a computer to receive and interpret intelligible handwritten input from sources such as paper documents, photographs, touch-screens and other devices [1]. Apparently, in this paper, we have performed handwritten digit recognition with the help of MNIST datasets using Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and Convolution Neural Network (CNN) models. Our main objective is to compare the accuracy of the models stated above along with their execution time to get the best possible model for digit recognition.

26 citations

Journal ArticleDOI
TL;DR: In this paper, the authors have performed handwritten digit recognition with the help of MNIST datasets using Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and Convolution Neural Network (CNN) models.
Abstract: The reliance of humans over machines has never been so high such that from object classification in photographs to adding sound to silent movies everything can be performed with the help of deep learning and machine learning algorithms. Likewise, Handwritten text recognition is one of the significant areas of research and development with a streaming number of possibilities that could be attained. Handwriting recognition (HWR), also known as Handwritten Text Recognition (HTR), is the ability of a computer to receive and interpret intelligible handwritten input from sources such as paper documents, photographs, touch-screens and other devices [1]. Apparently, in this paper, we have performed handwritten digit recognition with the help of MNIST datasets using Support Vector Machines (SVM), Multi-Layer Perceptron (MLP) and Convolution Neural Network (CNN) models. Our main objective is to compare the accuracy of the models stated above along with their execution time to get the best possible model for digit recognition.

1 citations

Posted Content
TL;DR: In this article, the authors analyzed several state-of-the-art neural networks (MesoNet, ResNet-50, VGG-19, and Xception Net) and compared them against each other to find an optimal solution for various scenarios like real-time deep fake detection to be deployed in online social media platforms.
Abstract: Deep Learning as a field has been successfully used to solve a plethora of complex problems, the likes of which we could not have imagined a few decades back. But as many benefits as it brings, there are still ways in which it can be used to bring harm to our society. Deep fakes have been proven to be one such problem, and now more than ever, when any individual can create a fake image or video simply using an application on the smartphone, there need to be some countermeasures, with which we can detect if the image or video is a fake or real and dispose of the problem threatening the trustworthiness of online information. Although the Deep fakes created by neural networks, may seem to be as real as a real image or video, it still leaves behind spatial and temporal traces or signatures after moderation, these signatures while being invisible to a human eye can be detected with the help of a neural network trained to specialize in Deep fake detection. In this paper, we analyze several such states of the art neural networks (MesoNet, ResNet-50, VGG-19, and Xception Net) and compare them against each other, to find an optimal solution for various scenarios like real-time deep fake detection to be deployed in online social media platforms where the classification should be made as fast as possible or for a small news agency where the classification need not be in real-time but requires utmost accuracy.

Cited by
More filters
Book
11 Sep 2017

21 citations

Proceedings ArticleDOI
21 May 2021
TL;DR: In this paper, a CNN based on the kara-model was used to classify handwritten digit images with RMSprop optimizer for optimizing the model, which achieved 99.06% of training accuracy and 98.80% of testing accuracy with epoch 10.
Abstract: In the era of research, pattern recognition is one of the most famous and widely used area in the field of research work. There are various types of patterns are available for the researches like: audio, video, handwritten digit images and handwritten characters images etc. In this paper, we concentrate in the field of handwritten digit recognition for classification of patterns. We have used famous handwritten digit datasets named as MNIST, which is collection of 70000 images. Many of machine learning and deep learning techniques have been already used by the researches for handwritten digit recognition like Support Vector Machine (SVM), RFC, K-nearest Neighbor (K-NN), Multilayer Perceptron (MLP), Convolutional Neural Network (CNN) etc. In this research work, we have suggested CNN as deep learning technique on keras for MNIST handwritten digit recognition and compare the performance of CNN with SVM and KNN. The proposed CNN based on keras model used to classify handwritten digit images with RMSprop optimizer for optimizing the model. The main contribution of this research work is to increase the convolutional layer with pooling and dropout and also tuned the model using filter, kernel size and number of neurons. The proposed CNN model achieves 99.06% of training accuracy and 98.80% of testing accuracy with epoch 10. Experiment results reveals that proposed CNN is more effective compare to other techniques.

18 citations

Journal ArticleDOI
TL;DR: In this article , a new model based on center-symmetric local binary patterns (CS-LBPs) is proposed, which addresses the issues of LBCNNs.
Abstract: The writing style of the same writer varies from instance to instance in Arabic and English handwritten digit recognition, making handwritten digit recognition challenging. Currently, deep learning approaches are applied in many applications, including convolutional neural networks (CNNs) modified to produce other models, such as local binary convolutional neural networks (LBCNNs). An LBCNN is created by fusing a local binary pattern (LBP) with a CNN by reformulating the LBP as a convolution layer called a local binary convolution (LBC). However, LBCNNs suffer from the random assignment of 1, 0, or -1 to LBC weights, making LBCNNs less robust. Nevertheless, using another LBP-based technique, such as center-symmetric local binary patterns (CS-LBPs), can address such issues. In this paper, a new model based on CS-LBPs is proposed called center-symmetric local binary convolutional neural networks (CS-LBCNN), which addresses the issues of LBCNNs. Furthermore, an enhanced version of CS-LBCNNs called threshold center-symmetric local binary convolutional neural networks (TCS-LBCNNs) is proposed, which addresses another issue related to the zero-thresholding function. Finally, the proposed models are compared to state-of-the-art models, proving their ability by producing a more accurate and significant classification rate than the existing LBCNN models. For the bilingual dataset, the TCS-LBCNN enhances the accuracy of the LBCNN and CS-LBCNN by 0.15% and 0.03%, respectively. In addition, the comparison shows that the accuracy acquired by the TCS-LBCNN is the second-highest using the MNIST and MADBase datasets.

4 citations

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed an adaptive graph construction and low-rank representation of weighted noise (ACLWN) model, which is composed of an adaptive representation graph construction model named ARG and an adaptive weighted sparse representation graph learning model named AWSG.
Abstract: Many graph construction methods for clustering cannot consider both local and global data structures in the construction of initial graph. Meanwhile, redundant features or even outliers and data with important characteristics are addressed equally in the graph optimization process. These lead to the learned representation graph may not capture the optimal structure when clustering. This paper proposes a novel model for clustering, named adaptive graph construction and low-rank representation of weighted noise (ACLWN), to overcome these problems. ACLWN is composed of an adaptive representation graph construction model named ARG, and an adaptive weighted sparse representation graph learning model named AWSG. In ARG, manifold learning and sparse representation are employed to capture the local structure of data. In AWSG, an adaptive weighted matrix is proposed to strengthen the important features and improve the robustness of the low-dimensional representation graph. Moreover, constraints such as non-negative low-rank, sparsity and distance regularization terms are imposed to capture the local and global structures of data. Comprehensive experimental results show that our method outperforms the compared state-of-the-art methods. The low-dimensional representation graph constructed by ACLWN is more suitable for clustering.

1 citations

Journal ArticleDOI
TL;DR: In this paper , a deep learning-based technique, namely, EfficientDet-D4, was proposed to detect and categorize the numerals into their respective classes from zero to nine, achieving an average accuracy of 99.83%.
Abstract: Handwritten digit recognition (HDR) shows a significant application in the area of information processing. However, correct recognition of such characters from images is a complicated task due to immense variations in the writing style of people. Moreover, the occurrence of several image artifacts like the existence of intensity variations, blurring, and noise complicates this process. In the proposed method, we have tried to overcome the aforementioned limitations by introducing a deep learning- (DL-) based technique, namely, EfficientDet-D4, for numeral categorization. Initially, the input images are annotated to exactly show the region of interest (ROI). In the next phase, these images are used to train the EfficientNet-B4-based EfficientDet-D4 model to detect and categorize the numerals into their respective classes from zero to nine. We have tested the proposed model over the MNIST dataset to demonstrate its efficacy and attained an average accuracy value of 99.83%. Furthermore, we have accomplished the cross-dataset evaluation on the USPS database and achieved an accuracy value of 99.10%. Both the visual and reported experimental results show that our method can accurately classify the HDR from images even with the varying writing style and under the presence of various sample artifacts like noise, blurring, chrominance, position, and size variations of numerals. Moreover, the introduced approach is capable of generalizing well to unseen cases which confirms that the EfficientDet-D4 model is an effective solution to numeral recognition.

1 citations