scispace - formally typeset
Proceedings ArticleDOI

Scene Text Analysis using Deep Belief Networks

Reads0
Chats0
TLDR
This paper is the first paper to report scene text recognition using deep belief networks and achieves improved recognition results on Chars74K English, Kannada and SVT-CHAR dataset in comparison to the state-of-art algorithms.
Abstract
This paper focuses on the recognition and analysis of text embedded in scene images using Deep learning. The proposed approach uses deep learning architectures for automated higher order feature extraction, thereby improving classification accuracies in comparison to handcrafted features used traditionally. Exhaustive experiments have been performed with Deep Belief Networks and Convolutional Deep Neural Networks with varied training algorithms like Contrastive Divergence, De-noising Score Matching and supervised learning algorithms such as logistic regression and Multi-layer perceptron. These algorithms have been validated on 4 standard datasets: Chars 74K English, Chars 74K Kannada, ICDAR 2003 Robust OCR dataset and SVT-CHAR dataset. The proposed network achieves improved recognition results on Chars74K English, Kannada and SVT-CHAR dataset in comparison to the state-of-art algorithms. For ICDAR 2003 dataset, the proposed network is marginally worse in comparison to Deep Convolutional networks. Although deep belief networks have been considerably used for several applications, according to the knowledge of the authors, this is the first paper to report scene text recognition using deep belief networks.

read more

Citations
More filters
Journal ArticleDOI

A Novel Dataset for English-Arabic Scene Text Recognition (EASTR)-42K and Its Evaluation Using Invariant Feature Extraction on Detected Extremal Regions

TL;DR: A novel technique by using adapted maximally stable extremal region (MSER) technique and extracts scale-invariant features from MSER detected region is presented and the adapted MDLSTM network is presented to tackle the complexities of cursive scene text.
Patent

Image classification neural networks

TL;DR: In this article, a neural network system that includes multiple subnetworks that includes a first subnetwork including multiple first modules, each first module including: a pass-through convolutional layer configured to process the subnetwork input for the first sub-network to generate a passthrough output; an average pooling stack of neural network layers that collectively processes the sub-networks inputs to generate an average Pooling output.
Journal ArticleDOI

An intelligent approach for automated argument based legal text recognition and summarization using machine learning

TL;DR: A machine learning based automated legal model is proposed to enhance the efficiency of the legal support system with an accuracy of 94% to assist the victims with prompt delivery of justice and legal professionals in reducing their workload.
Proceedings ArticleDOI

A memory efficient DNA sequence alignment technique using pointing matrix

TL;DR: The proposed DNA sequence alignment technique uses a novel concept of pointing matrix where the directed path in the pointing matrix ensures faster and accurate finding of the optimal alignment pertaining with the accuracy ensured by the well known Needleman & Wunsch algorithm.
Proceedings ArticleDOI

A hardware-based high-throughput DNA sequence alignment scheme

TL;DR: An improved DNA sequence alignment scheme using a matrix named neighbouring matrix to meet the requirement of high speed processing and can align 43% — 69% more sequences than that of same DNA sequence pairs of variable lengths.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Journal ArticleDOI

Gradient-based learning applied to document recognition

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.
Journal ArticleDOI

Reducing the Dimensionality of Data with Neural Networks

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.
Journal ArticleDOI

A fast learning algorithm for deep belief nets

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
Journal ArticleDOI

Representation Learning: A Review and New Perspectives

TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.
Related Papers (5)