Text recognition using deep BLSTM networks

doi:10.1109/ICAPR.2015.7050699

Proceedings ArticleDOI

Text recognition using deep BLSTM networks

- pp 1-6

TLDR

A Deep Bidirectional Long Short Term Memory (LSTM) based Recurrent Neural Network architecture for text recognition that uses Connectionist Temporal Classification (CTC) for training to learn the labels of an unsegmented sequence with unknown alignment.

Abstract:

This paper presents a Deep Bidirectional Long Short Term Memory (LSTM) based Recurrent Neural Network architecture for text recognition. This architecture uses Connectionist Temporal Classification (CTC) for training to learn the labels of an unsegmented sequence with unknown alignment. This work is motivated by the results of Deep Neural Networks for isolated numeral recognition and improved speech recognition using Deep BLSTM based approaches. Deep BLSTM architecture is chosen due to its ability to access long range context, learn sequence alignment and work without the need of segmented data. Due to the use of CTC and forward backward algorithms for alignment of output labels, there are no unicode re-ordering issues, thus no need of lexicon or postprocessing schemes. This is a script independent and segmentation free approach. This system has been implemented for the recognition of unsegmented words of printed Oriya text. This system achieves 4.18% character level error and 12.11% word error rate on printed Oriya text.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Gait-Based Human Identification by Combining Shallow Convolutional Neural Network-Stacked Long Short-Term Memory and Deep Convolutional Neural Network

Ganbayar Batchuluun, +3 more

- 22 Oct 2018 -

IEEE Access

TL;DR: The proposed gait-based human identification based on the front and back view images of humans captured in both high- and low-illumination environments outperforms previous methods.

...read moreread less

Journal ArticleDOI

Multiobjective optimization for recognition of isolated handwritten Indic scripts

Anisha Gupta, +3 more

- 01 Dec 2019 -

Pattern Recognition Letters

TL;DR: A modified opposition-based multiobjective Harmony Search algorithm has been proposed to select the local regions from handwritten character images based on their rankings in a three-dimensional pareto-front based on recognition accuracy and redundancy.

...read moreread less

Proceedings ArticleDOI

KPTI: Katib's Pashto Text Imagebase and Deep Learning Benchmark

Riaz Ahmad, +5 more

TL;DR: This paper presents the first Pashto text image database for scientific research and thereby the first dataset with complete handwritten and printed text line images which ultimately covers all alphabets of Arabic and Persian languages.

...read moreread less

Journal ArticleDOI

In-air handwritten English word recognition using attention recurrent translator

Ji Gan, +1 more

- 01 Jul 2019 -

Neural Computing and Applications

TL;DR: An attention-based model, called attention recurrent translator, is proposed for the in-air handwritten English word recognition, which is considerably different from connectionist temporal classification (CTC) and achieves a word recognition accuracy of 97.74%.

...read moreread less

Journal ArticleDOI

Improving the DBLSTM for on-line Arabic handwriting recognition

Rania Maalej, +1 more

- 01 Jul 2020 -

Multimedia Tools and Applications

TL;DR: This work proposes a new online Arabic handwriting recognition system based on DBLSTM that relies on three techniques that were applied in order to enhance its performance, and the best tested architecture gives a reduction of 10.99% in label error rate.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Journal ArticleDOI

A fast learning algorithm for deep belief nets

Geoffrey E. Hinton, +2 more

- 01 Jul 2006 -

Neural Computation

TL;DR: A fast, greedy algorithm is derived that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory.

...read moreread less

Proceedings ArticleDOI

Speech recognition with deep recurrent neural networks

Alex Graves, +2 more

TL;DR: This paper investigates deep recurrent neural networks, which combine the multiple levels of representation that have proved so effective in deep networks with the flexible use of long range context that empowers RNNs.

...read moreread less

Posted Content

Speech Recognition with Deep Recurrent Neural Networks

Alex Graves, +2 more

- 22 Mar 2013 -

arXiv: Neural and Evolutionary Computing

TL;DR: In this paper, deep recurrent neural networks (RNNs) are used to combine the multiple levels of representation that have proved so effective in deep networks with the flexible use of long range context that empowers RNNs.

...read moreread less

Proceedings ArticleDOI

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

Alex Graves, +3 more

TL;DR: This paper presents a novel method for training RNNs to label unsegmented sequences directly, thereby solving both problems of sequence learning and post-processing.

...read moreread less

Collapse

Related Papers (5)

Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks

Adnan Ul-Hasan, +4 more

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

Text recognition using deep BLSTM networks

Citations

Gait-Based Human Identification by Combining Shallow Convolutional Neural Network-Stacked Long Short-Term Memory and Deep Convolutional Neural Network

Multiobjective optimization for recognition of isolated handwritten Indic scripts

KPTI: Katib's Pashto Text Imagebase and Deep Learning Benchmark

In-air handwritten English word recognition using attention recurrent translator

Improving the DBLSTM for on-line Arabic handwriting recognition

References

Long short-term memory

A fast learning algorithm for deep belief nets

Speech recognition with deep recurrent neural networks

Speech Recognition with Deep Recurrent Neural Networks

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

Related Papers (5)

Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks

Long short-term memory

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

Unconstrained On-line Handwriting Recognition with Recurrent Neural Networks

Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks