Journal Article

Bidirectional LSTM with attention mechanism and convolutional layer for text classification

Gang Liu, +1 more
14 Apr 2019 · Vol. 337, pp 325-338
TLDR
This paper proposes a novel, unified architecture that combines a bidirectional LSTM (BiLSTM), an attention mechanism, and a convolutional layer, and shows that it outperforms other state-of-the-art text classification methods in classification accuracy.
About
This article was published in Neurocomputing on 2019-04-14 and has received 581 citations to date. It focuses on the topics: word embedding and recurrent neural network.
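The TL;DR names the components but not how they connect. A minimal sketch, assuming one plausible wiring (embedding, then BiLSTM, then soft attention re-weighting of the timesteps, then a 1-D convolution with max-pooling feeding a classifier) is given below; all layer sizes and the class name BiLSTMAttnConv are illustrative assumptions, not the paper's actual configuration.

```python
import torch
import torch.nn as nn

class BiLSTMAttnConv(nn.Module):
    """Hedged sketch of a BiLSTM -> attention -> convolution text
    classifier. Hyperparameters are assumptions, not the paper's."""
    def __init__(self, vocab_size, embed_dim=300, hidden=128, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.bilstm = nn.LSTM(embed_dim, hidden, batch_first=True,
                              bidirectional=True)
        self.attn = nn.Linear(2 * hidden, 1)           # per-timestep score
        self.conv = nn.Conv1d(2 * hidden, 100, kernel_size=3, padding=1)
        self.fc = nn.Linear(100, n_classes)

    def forward(self, tokens):                          # tokens: (B, T)
        x = self.embed(tokens)                          # (B, T, E)
        h, _ = self.bilstm(x)                           # (B, T, 2H)
        w = torch.softmax(self.attn(h), dim=1)          # (B, T, 1)
        h = h * w                                       # re-weight timesteps
        c = torch.relu(self.conv(h.transpose(1, 2)))    # (B, 100, T)
        pooled = c.max(dim=2).values                    # global max-pool
        return self.fc(pooled)                          # class logits
```

At train time the logits would feed a cross-entropy loss, and pretrained word vectors (the article's "word embedding" topic tag suggests as much) could initialize the embedding table.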


Citations
Journal Article

A review on the attention mechanism of deep learning

TL;DR: An overview of the state-of-the-art attention models proposed in recent years is given and a unified model that is suitable for most attention structures is defined.
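Most of the attention variants such a survey unifies share one core computation: score each key against a query, normalize the scores with a softmax, and take the weighted sum of the values. A minimal sketch of that core in its scaled dot-product form follows; the function name and tensor shapes are our assumptions.

```python
import torch

def scaled_dot_product_attention(q, k, v):
    """Common attention core: score, softmax, weighted sum.
    q: (B, Tq, d), k and v: (B, Tk, d); returns (B, Tq, d) and weights."""
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d ** 0.5    # (B, Tq, Tk)
    weights = torch.softmax(scores, dim=-1)        # attention distribution
    return weights @ v, weights
```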
Journal Article

ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis

TL;DR: An Attention-based Bidirectional CNN-RNN Deep Model (ABCDM) is proposed and evaluated on sentiment polarity detection, achieving state-of-the-art results on both long-review and short-tweet polarity classification.

Synthesis Lectures on Human Language Technologies

TL;DR: This book gives a comprehensive view of state-of-the-art techniques used to build spoken dialogue systems, presents dialogue modelling and system development issues relevant in both academic and industrial environments, and discusses requirements and challenges for advanced interaction management and future research.
Journal Article

CNN-based transfer learning-BiLSTM network: A novel approach for COVID-19 infection detection.

TL;DR: Two deep learning architectures are proposed that automatically detect positive COVID-19 cases using chest X-ray images, and the proposed architecture is shown to achieve outstanding success in infection detection.
Journal Article

Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism

TL;DR: An attention-based Bi-LSTM+CNN hybrid model is proposed that capitalizes on the advantages of LSTM and CNN with an additional attention mechanism; the hybrid model produces more accurate classification results, as well as higher recall and F1 scores, than individual multi-layer perceptron (MLP), CNN, or LSTM models.
References
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
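The update rule behind that TL;DR is compact enough to sketch directly. The NumPy function below implements one Adam step as described in the paper; the default hyperparameters follow the paper's recommendations, while the function name adam_step is ours.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3,
              beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update: moment estimates, bias correction, parameter step."""
    m = beta1 * m + (1 - beta1) * grad           # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2      # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                 # bias-corrected moments
    v_hat = v / (1 - beta2 ** t)                 # (t is the step count, >= 1)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```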
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: State-of-the-art image classification performance was achieved by a deep convolutional neural network that consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
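The layer inventory in the TL;DR corresponds to an architecture along these lines. The channel and kernel sizes below follow the commonly cited AlexNet configuration for 224x224 RGB inputs; treat them as an approximation rather than the paper's exact network.

```python
import torch.nn as nn

# Sketch of the described architecture: five convolutional layers (some
# followed by max-pooling) and three fully-connected layers ending in
# 1000-way logits; the softmax is applied by the loss at train time.
alexnet_like = nn.Sequential(
    nn.Conv2d(3, 64, kernel_size=11, stride=4, padding=2), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Conv2d(64, 192, kernel_size=5, padding=2), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Conv2d(192, 384, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.Conv2d(256, 256, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(3, stride=2),
    nn.Flatten(),
    nn.Linear(256 * 6 * 6, 4096), nn.ReLU(),
    nn.Linear(4096, 4096), nn.ReLU(),
    nn.Linear(4096, 1000),
)
```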
Journal Article

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
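The "constant error carousel" the TL;DR mentions is the additively updated cell state c, through which gradients flow without vanishing. One step of a gated LSTM cell can be sketched as follows; note this is the modern formulation with a forget gate, which was added after the original 1997 paper, and the shapes and names are our assumptions.

```python
import numpy as np

def sigmoid(a):
    return 1 / (1 + np.exp(-a))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. W: (4H, D), U: (4H, H), b: (4H,) stack the input,
    forget, output, and candidate transforms; H is the hidden size."""
    H = h.shape[0]
    z = W @ x + U @ h + b                  # all four pre-activations
    i = sigmoid(z[0:H])                    # input gate
    f = sigmoid(z[H:2 * H])                # forget gate
    o = sigmoid(z[2 * H:3 * H])            # output gate
    g = np.tanh(z[3 * H:])                 # candidate cell update
    c = f * c + i * g                      # additive 'carousel' update
    h = o * np.tanh(c)                     # gated hidden output
    return h, c
```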
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best performing techniques based on different types of neural networks.
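Of the two architectures proposed there, skip-gram trains word vectors so that a centre word predicts nearby context words. The sketch below uses the negative-sampling objective from the authors' follow-up work because it fits in a few lines; the function name, learning rate, and table layout are our assumptions.

```python
import numpy as np

def sgns_step(center, context, negatives, W_in, W_out, lr=0.025):
    """One skip-gram negative-sampling update on integer word ids.
    W_in and W_out are (V, D) input and output embedding tables."""
    v = W_in[center]                       # centre vector (D,), a view
    grad_v = np.zeros_like(v)
    pairs = [(context, 1.0)] + [(n, 0.0) for n in negatives]
    for word, label in pairs:
        u = W_out[word]
        g = 1 / (1 + np.exp(-u @ v)) - label   # sigmoid(u.v) - label
        grad_v += g * u
        W_out[word] -= lr * g * v              # update output vector
    W_in[center] -= lr * grad_v                # update centre vector
    return W_in, W_out
```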
Journal Article

Deep learning in neural networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, reviewing deep supervised learning, unsupervised learning, reinforcement learning and evolutionary computation, and indirect search for short programs encoding deep and large networks.