Proceedings ArticleDOI

Scene Text Analysis using Deep Belief Networks

TL;DR: This is the first paper to report scene text recognition using deep belief networks; the proposed network achieves improved recognition results on the Chars74K English, Chars74K Kannada and SVT-CHAR datasets in comparison to state-of-the-art algorithms.
Abstract: This paper focuses on the recognition and analysis of text embedded in scene images using deep learning. The proposed approach uses deep learning architectures for automated higher-order feature extraction, thereby improving classification accuracies in comparison to the handcrafted features used traditionally. Exhaustive experiments have been performed with Deep Belief Networks and Convolutional Deep Neural Networks using varied training algorithms such as Contrastive Divergence and Denoising Score Matching, and supervised learning algorithms such as logistic regression and the multi-layer perceptron. These algorithms have been validated on four standard datasets: Chars74K English, Chars74K Kannada, ICDAR 2003 Robust OCR and SVT-CHAR. The proposed network achieves improved recognition results on the Chars74K English, Chars74K Kannada and SVT-CHAR datasets in comparison to state-of-the-art algorithms. On the ICDAR 2003 dataset, the proposed network is marginally worse than deep convolutional networks. Although deep belief networks have been used considerably for several applications, to the best of the authors' knowledge, this is the first paper to report scene text recognition using deep belief networks.
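The abstract names Contrastive Divergence as one of the DBN training algorithms. As a rough illustration (not the paper's code; function name, shapes, and hyperparameters are hypothetical), a single CD-1 update for one binary RBM layer can be sketched as:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_update(W, b_v, b_h, v0, lr=0.1, rng=np.random.default_rng(0)):
    """One CD-1 step for a binary RBM on a batch of visible vectors v0."""
    # Positive phase: hidden probabilities given the data.
    ph0 = sigmoid(v0 @ W + b_h)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Negative phase: one step of Gibbs sampling back to the visibles.
    pv1 = sigmoid(h0 @ W.T + b_v)
    ph1 = sigmoid(pv1 @ W + b_h)
    # Gradient approximation: data statistics minus reconstruction statistics.
    n = v0.shape[0]
    W += lr * (v0.T @ ph0 - pv1.T @ ph1) / n
    b_v += lr * (v0 - pv1).mean(axis=0)
    b_h += lr * (ph0 - ph1).mean(axis=0)
    return W, b_v, b_h
```

In a DBN, updates like this are applied layer by layer before the supervised classifier (logistic regression or MLP in the paper) is trained on top.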
Citations
Journal ArticleDOI
TL;DR: A novel technique is presented that uses an adapted maximally stable extremal region (MSER) detector and extracts scale-invariant features from the MSER-detected regions; an adapted MDLSTM network is also presented to tackle the complexities of cursive scene text.
Abstract: © 2019 IEEE. The recognition of text in natural scene images is a practical yet challenging task due to large variations in backgrounds, textures, fonts, and illumination. English as a secondary language is extensively used in Gulf countries along with Arabic script. This paper therefore introduces a 42K-image English-Arabic scene text recognition dataset. The dataset includes text images in both English and Arabic scripts, with the prime focus on Arabic script, and can be employed for the evaluation of text segmentation and recognition tasks. To provide insight to other researchers, experiments have been carried out on the segmentation and classification of Arabic as well as English text, with error rates of 5.99% and 2.48%, respectively. The paper presents a novel technique that uses an adapted maximally stable extremal region (MSER) detector and extracts scale-invariant features from the MSER-detected regions. To select discriminant and comprehensive features, the size of the invariant feature set is restricted to those features which exist in the extremal region. An adapted MDLSTM network is presented to tackle the complexities of cursive scene text. Research on Arabic scene text is in its infancy; thus this paper presents benchmark work in the field of text analysis.

20 citations

Patent
30 Dec 2016
TL;DR: In this article, a neural network system is described that includes multiple subnetworks, among them a first subnetwork of multiple first modules; each first module comprises a pass-through convolutional layer, an average pooling stack of neural network layers, two stacks of convolutional layers, and a concatenation layer that combines their outputs.
Abstract: A neural network system that includes: multiple subnetworks that includes: a first subnetwork including multiple first modules, each first module including: a pass-through convolutional layer configured to process the subnetwork input for the first subnetwork to generate a pass-through output; an average pooling stack of neural network layers that collectively processes the subnetwork input for the first subnetwork to generate an average pooling output; a first stack of convolutional neural network layers configured to collectively process the subnetwork input for the first subnetwork to generate a first stack output; a second stack of convolutional neural network layers that are configured to collectively process the subnetwork input for the first subnetwork to generate a second stack output; and a concatenation layer configured to concatenate the pass-through output, the average pooling output, the first stack output, and the second stack output to generate a first module output for the first module.
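The module described in the claim computes four parallel branch outputs and concatenates them along the channel axis. A toy sketch of that wiring (the branch functions below are stand-ins, not the patented layers; all names and shapes are hypothetical):

```python
import numpy as np

def module_output(x, branches):
    """Apply each parallel branch to the same input and concatenate
    the branch outputs along the channel (last) axis."""
    return np.concatenate([f(x) for f in branches], axis=-1)

# Toy stand-ins for the four branches described in the claim:
passthrough = lambda x: x                          # pass-through conv (identity here)
avg_pool = lambda x: np.full_like(x, x.mean())     # average-pooling stack (global mean here)
stack1 = lambda x: np.maximum(x, 0)                # first conv stack (ReLU stand-in)
stack2 = lambda x: x * 0.5                         # second conv stack (scaling stand-in)

x = np.ones((2, 4))  # batch of 2, 4 channels
y = module_output(x, [passthrough, avg_pool, stack1, stack2])  # shape (2, 16)
```

The key point is only the topology: every branch sees the same subnetwork input, and the module output has the branches' channels stacked side by side.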

12 citations

Journal ArticleDOI
TL;DR: A machine learning based automated legal model is proposed that enhances the efficiency of the legal support system with an accuracy of 94%, assisting victims with prompt delivery of justice and helping legal professionals reduce their workload.
Abstract: It is essential to provide a structured data feed to the computer to accomplish any task, so that it can process flawlessly and generate the desired output within minimal computational time. Generally, computer programmers should provide a structured data feed to the computer program for its successful execution. The hardcopy document should be scanned to generate its corresponding computer-readable softcopy version of the file. This process also proves to be a budget-friendly approach to disengage human resources from the entire process of record maintenance. Due to this automation, the workload of the existing manpower is reduced to a significant level. This concept may prove beneficial for the delivery of any type of service to the ultimate beneficiary (i.e., the citizen) in a minimal time frame. The administration has to deal with various issues of citizens due to the pressure of a huge population seeking legal help to resolve their issues, leading to large numbers of pending legal cases at several courts of the country. To assist victims with prompt delivery of justice and legal professionals in reducing their workload, this paper proposes a machine learning based automated legal model to enhance the efficiency of the legal support system with an accuracy of 94%.

7 citations

Proceedings ArticleDOI
01 Nov 2016
TL;DR: The proposed DNA sequence alignment technique uses a novel concept of a pointing matrix, where the directed path in the pointing matrix ensures faster and accurate discovery of the optimal alignment while matching the accuracy of the well-known Needleman-Wunsch algorithm.
Abstract: A memory-efficient approach for pair-wise DNA sequence alignment is presented in this paper. The proposed DNA sequence alignment technique uses a novel concept of a pointing matrix. The directed path in the pointing matrix ensures faster and accurate discovery of the optimal alignment while matching the accuracy of the well-known Needleman-Wunsch algorithm. The proposed technique is tested on DNA nucleotides but could be applied to RNA or protein sequences as well, and is also suitable for any formal language processing. The complete approach has been simulated for ten cases (DNA sequence sizes from 64 to 1024 nucleotides) with 100 pseudo-randomly generated DNA sequence pairs in each case. The proposed approach consumes slightly more time (≈ 9-11%) for matrix formation, but takes 34-42% less time to find the optimal alignment compared to the approach based on [2]. For longer DNA sequences, the space requirement of the proposed approach is significantly lower than that of the Needleman-Wunsch-based approach [2], even though it uses two matrices.
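For reference, the Needleman-Wunsch baseline the paper compares against fills a scoring matrix by dynamic programming. A minimal score-only sketch (textbook algorithm, not the paper's pointing-matrix variant; the scoring parameters here are illustrative):

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Classic Needleman-Wunsch global alignment; returns the optimal score."""
    n, m = len(a), len(b)
    F = [[0] * (m + 1) for _ in range(n + 1)]
    # First row and column: all-gap prefixes.
    for i in range(1, n + 1):
        F[i][0] = i * gap
    for j in range(1, m + 1):
        F[0][j] = j * gap
    # Each cell takes the best of diagonal (match/mismatch) and two gap moves.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            F[i][j] = max(F[i - 1][j - 1] + s,
                          F[i - 1][j] + gap,
                          F[i][j - 1] + gap)
    return F[n][m]
```

The proposed pointing matrix accelerates the traceback step over this O(nm) table; the dynamic-programming fill itself is what costs the reported extra 9-11%.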

3 citations

Proceedings ArticleDOI
01 Dec 2016
TL;DR: An improved DNA sequence alignment scheme using a matrix named the neighbouring matrix is presented to meet the requirement of high-speed processing; the proposed scheme can align 43-69% more sequences than [2] on the same DNA sequence pairs of variable lengths.
Abstract: An improved DNA sequence alignment scheme using a matrix named the neighbouring matrix is presented in this paper. This paper also presents a simple hardware design of the proposed global sequence alignment approach to meet the requirement of high-speed processing. Additionally, this architecture efficiently uses SRAM to boost DNA sequence alignment in real time, employing a few logic circuits for the fundamental operations of sequence alignment. The complete scheme is simulated for ten cases, with sequence sizes ranging from 64 to 1024 nucleotides and 100 pseudo-randomly generated DNA sequence pairs in each case. The proposed approach consumes almost half the time to align two DNA sequences compared to the approach based on [2]. Performance is measured in terms of Million Alignments Per Second (MAPS). The simulation results show that the proposed scheme can align 43-69% more sequences than [2] on the same DNA sequence pairs of variable lengths.

2 citations

References
Proceedings Article
03 Dec 2012
TL;DR: A large deep convolutional neural network, consisting of five convolutional layers (some followed by max-pooling layers) and three fully-connected layers with a final 1000-way softmax, achieved state-of-the-art performance on ImageNet classification.
Abstract: We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0%, which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.
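The dropout regularizer mentioned in the abstract zeroes random activations at training time. A minimal sketch of the common "inverted" formulation (illustrative only; this is not the exact variant used in the paper, and the function name and shapes are hypothetical):

```python
import numpy as np

def dropout(x, p=0.5, train=True, rng=np.random.default_rng(0)):
    """Inverted dropout: zero each activation with probability p at
    train time, and scale survivors by 1/(1-p) so the expected
    activation matches test time, when the input passes through unchanged."""
    if not train or p == 0.0:
        return x
    mask = (rng.random(x.shape) >= p).astype(x.dtype)
    return x * mask / (1.0 - p)
```

Because the scaling happens during training, no rescaling is needed at inference, which is why the test-time path is a plain identity.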

73,978 citations

Journal ArticleDOI
01 Jan 1998
TL;DR: In this article, gradient-based learning with convolutional neural networks is reviewed for handwritten character recognition, and a graph transformer network (GTN) paradigm is proposed that allows multimodule recognition systems to be trained globally with gradient-based methods.
Abstract: Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank cheque is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal cheques. It is deployed commercially and reads several million cheques per day.

42,067 citations

Journal ArticleDOI
28 Jul 2006-Science
TL;DR: In this article, an effective way of initializing the weights is described that allows deep autoencoder networks to learn low-dimensional codes which work much better than principal components analysis as a tool to reduce the dimensionality of data.
Abstract: High-dimensional data can be converted to low-dimensional codes by training a multilayer neural network with a small central layer to reconstruct high-dimensional input vectors. Gradient descent can be used for fine-tuning the weights in such "autoencoder" networks, but this works well only if the initial weights are close to a good solution. We describe an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data.

16,717 citations

Journal ArticleDOI
TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
Abstract: We show how to use "complementary priors" to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.
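The greedy layer-at-a-time recipe from this reference can be sketched as follows: train one RBM, then feed its hidden activations upward as the next layer's training data. This is an illustrative sketch only (per-layer training is compressed into a few CD-1 sweeps, and all function names, sizes, and hyperparameters are hypothetical):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def train_layer(x, n_hidden, lr=0.1, sweeps=3, seed=0):
    """Per-layer training stub: a few CD-1 sweeps for one binary RBM."""
    rng = np.random.default_rng(seed)
    W = 0.01 * rng.standard_normal((x.shape[1], n_hidden))
    b_h = np.zeros(n_hidden)
    b_v = np.zeros(x.shape[1])
    for _ in range(sweeps):
        ph0 = sigmoid(x @ W + b_h)
        h0 = (rng.random(ph0.shape) < ph0).astype(float)
        pv1 = sigmoid(h0 @ W.T + b_v)
        ph1 = sigmoid(pv1 @ W + b_h)
        W += lr * (x.T @ ph0 - pv1.T @ ph1) / len(x)
    return W, b_h

def greedy_pretrain(data, layer_sizes):
    """Train a DBN one layer at a time: each trained layer's hidden
    activations become the training data for the layer above it."""
    weights, x = [], data
    for n_hidden in layer_sizes:
        W, b_h = train_layer(x, n_hidden)
        weights.append((W, b_h))
        x = sigmoid(x @ W + b_h)  # propagate up to train the next layer
    return weights
```

The slower fine-tuning stage the abstract mentions (a contrastive wake-sleep procedure) would then adjust the whole stack jointly, starting from these layer-wise weights.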

15,055 citations


"Scene Text Analysis using Deep Belief Networks" refers to background or methods in this paper

  • ...architecture is from [11] where a 3 layer DBN architecture is used for the handwriting recognition problem on MNIST dataset....


  • ...The main motivation for a deep network was conceived from the comparison study of RBM and DBN [11], which emphasizes that using more layers of RBMs increases the representational capability of the network....


  • ...Deep belief networks are probabilistic generative models that have several layers and can be trained by both unsupervised and supervised techniques [11]....


Journal ArticleDOI
TL;DR: Recent work in the area of unsupervised feature learning and deep learning is reviewed, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks.
Abstract: The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind the data. Although specific domain knowledge can be used to help design representations, learning with generic priors can also be used, and the quest for AI is motivating the design of more powerful representation-learning algorithms implementing such priors. This paper reviews recent work in the area of unsupervised feature learning and deep learning, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks. This motivates longer term unanswered questions about the appropriate objectives for learning good representations, for computing representations (i.e., inference), and the geometrical connections between representation learning, density estimation, and manifold learning.

11,201 citations


"Scene Text Analysis using Deep Belief Networks" refers to background in this paper

  • ...Representation learning [1] uses layers of unsupervised algorithms which learn abstract concepts from the data and thus is able to increase the accuracy of the classifier....
