Proceedings Article

Text recognition using deep BLSTM networks

TL;DR
A Deep Bidirectional Long Short-Term Memory (BLSTM) based Recurrent Neural Network architecture for text recognition that uses Connectionist Temporal Classification (CTC) for training, learning the labels of an unsegmented sequence with unknown alignment.
Abstract
This paper presents a Deep Bidirectional Long Short-Term Memory (BLSTM) based Recurrent Neural Network architecture for text recognition. The architecture uses Connectionist Temporal Classification (CTC) for training, learning the labels of an unsegmented sequence with unknown alignment. The work is motivated by the results of Deep Neural Networks on isolated numeral recognition and by improved speech recognition using Deep BLSTM-based approaches. The Deep BLSTM architecture is chosen for its ability to access long-range context, learn sequence alignment, and work without segmented data. Because CTC and the forward-backward algorithm align the output labels, there are no Unicode re-ordering issues, and hence no lexicon or post-processing schemes are needed. The approach is script-independent and segmentation-free. The system has been implemented for the recognition of unsegmented words of printed Oriya text, on which it achieves a 4.18% character-level error rate and a 12.11% word error rate.
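
To make the architecture concrete, here is a minimal sketch of a stacked BLSTM trained with CTC loss, written in PyTorch. It is an illustration, not the authors' implementation: the feature dimension, hidden size, depth, and label count are hypothetical placeholders.

import torch
import torch.nn as nn

class DeepBLSTM(nn.Module):
    """Stacked bidirectional LSTM with a per-frame softmax over labels + CTC blank."""
    def __init__(self, n_features=48, n_hidden=128, n_layers=3, n_labels=64):
        super().__init__()
        # Hypothetical sizes; the paper's actual configuration may differ.
        self.blstm = nn.LSTM(n_features, n_hidden, num_layers=n_layers,
                             bidirectional=True)
        self.fc = nn.Linear(2 * n_hidden, n_labels + 1)  # +1 for the CTC blank

    def forward(self, x):          # x: (T, N, n_features) frame sequence
        h, _ = self.blstm(x)       # h: (T, N, 2 * n_hidden)
        return self.fc(h).log_softmax(dim=-1)  # (T, N, n_labels + 1)

model = DeepBLSTM()
ctc = nn.CTCLoss(blank=0)          # aligns outputs to unsegmented targets

x = torch.randn(100, 4, 48)                    # 100 frames, batch of 4 word images
targets = torch.randint(1, 65, (4, 20))        # label indices 1..64 (0 = blank)
input_lens = torch.full((4,), 100, dtype=torch.long)
target_lens = torch.full((4,), 20, dtype=torch.long)

loss = ctc(model(x), targets, input_lens, target_lens)
loss.backward()

The blank label is what lets the network emit per-frame predictions without knowing the alignment in advance: the CTC loss marginalizes over all valid alignments via the forward-backward algorithm, which is exactly why no segmented training data is needed.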


Citations

Using BLSTM for Interpretation of 2D Languages - Case of Handwritten Mathematical Expressions.

TL;DR: The proposed solution transforms the mathematical expression description into a sequence containing both symbol labels and relationship labels, so that classical supervised sequence labeling with recurrent neural networks can be applied.
Posted Content

Cooking Is All About People: Comment Classification On Cookery Channels Using BERT and Classification Models (Malayalam-English Mix-Code).

TL;DR: The statistical analysis of results indicates that Multinomial Naïve Bayes, K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Random Forests, and Decision Trees offer similar levels of accuracy in comment classification.
Book Chapter

An Efficient Multi Lingual Optical Character Recognition System for Indian Languages Through Use of Bharati Script

TL;DR: Thorough experimental analysis at varied noise levels confirms the promising character recognition accuracy of the proposed OCR model, which outperforms state-of-the-art OCR systems for Indian scripts.
Patent

Text emotion classification method based on emotion center

TL;DR: In this article, a text emotion classification method based on an emotion center is proposed: the distance between the text vector and the category's emotion-center vector is added to the loss function, a BLSTM performs preliminary decoding of the text, and an attention mechanism improves the accuracy of the text representation.
Book Chapter

A Performance Evaluation of Several Artificial Neural Networks for Mapping Speech Spectrum Parameters.

TL;DR: The results show that, for this application of neural networks, architectures with more layers or more neurons are not the most suitable, in terms of both training time and the fit achieved.
References
Journal Article

Long short-term memory

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1,000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
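
For reference, the cell update behind the "constant error carousel" can be written as follows; this is the now-standard formulation with a forget gate, which was added after the original 1997 paper:

i_t = \sigma(W_i x_t + U_i h_{t-1} + b_i)
f_t = \sigma(W_f x_t + U_f h_{t-1} + b_f)
o_t = \sigma(W_o x_t + U_o h_{t-1} + b_o)
c_t = f_t \odot c_{t-1} + i_t \odot \tanh(W_c x_t + U_c h_{t-1} + b_c)
h_t = o_t \odot \tanh(c_t)

The additive recurrence on the cell state c_t, rather than a repeated matrix multiplication, is what lets error gradients flow across long time lags without vanishing.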
Journal Article

A fast learning algorithm for deep belief nets

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.
Proceedings Article

Speech recognition with deep recurrent neural networks

TL;DR: This paper investigates deep recurrent neural networks, which combine the multiple levels of representation that have proved so effective in deep networks with the flexible use of long-range context that empowers RNNs.
Proceedings Article

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

TL;DR: This paper presents a novel method for training RNNs to label unsegmented sequences directly, avoiding the need for both pre-segmented training data and post-processing of the outputs.
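
To illustrate the label mapping that CTC uses, the hypothetical snippet below implements the standard collapse rule (merge repeated symbols, then delete blanks). It is a sketch of the decoding convention, not code from the paper.

def ctc_collapse(path, blank="-"):
    """Map a CTC path to a label sequence: merge repeats, then drop blanks."""
    out = []
    prev = None
    for symbol in path:
        if symbol != prev and symbol != blank:  # keep first of each run, skip blanks
            out.append(symbol)
        prev = symbol
    return "".join(out)

# Both paths below map to "abb": a blank separates the two b's.
assert ctc_collapse("aa-b-b") == "abb"
assert ctc_collapse("-a--bb-b") == "abb"

A blank between two identical labels is what allows repeated characters (the double "b" above) to survive the merge step.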