Open Access Posted Content

A C-LSTM Neural Network for Text Classification

TLDR
C-LSTM is a novel and unified model for sentence representation and text classification that outperforms both CNN and LSTM, achieving excellent performance on sentiment classification and question classification tasks.
Abstract
Neural network models have been demonstrated to be capable of achieving remarkable performance in sentence and document modeling. Convolutional neural networks (CNN) and recurrent neural networks (RNN) are two mainstream architectures for such modeling tasks, and they adopt very different ways of understanding natural language. In this work, we combine the strengths of both architectures and propose a novel and unified model called C-LSTM for sentence representation and text classification. C-LSTM utilizes a CNN to extract a sequence of higher-level phrase representations, which are then fed into a long short-term memory recurrent neural network (LSTM) to obtain the sentence representation. C-LSTM is thus able to capture both local features of phrases and global, temporal sentence semantics. We evaluate the proposed architecture on sentiment classification and question classification tasks. The experimental results show that C-LSTM outperforms both CNN and LSTM and achieves excellent performance on these tasks.
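The pipeline described in the abstract (convolution over word embeddings to form phrase features, then an LSTM over those features, then a classifier) can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the embedding size, filter count, window width, and number of classes are assumed values.

```python
# Minimal PyTorch sketch of the C-LSTM idea: a 1-D convolution extracts
# n-gram (phrase) features, the resulting feature sequence is fed into an
# LSTM, and the final hidden state serves as the sentence representation.
# All hyperparameters below are illustrative assumptions.
import torch
import torch.nn as nn

class CLSTM(nn.Module):
    def __init__(self, vocab_size, embed_dim=300, num_filters=150,
                 kernel_size=3, hidden_dim=150, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        # Convolution over the word dimension yields one phrase-level
        # feature vector per window position.
        self.conv = nn.Conv1d(embed_dim, num_filters, kernel_size)
        self.lstm = nn.LSTM(num_filters, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)

    def forward(self, tokens):                        # tokens: (batch, seq_len)
        x = self.embedding(tokens)                    # (batch, seq_len, embed_dim)
        x = torch.relu(self.conv(x.transpose(1, 2)))  # (batch, filters, seq_len - k + 1)
        x = x.transpose(1, 2)                         # (batch, steps, filters)
        _, (h_n, _) = self.lstm(x)                    # h_n: (1, batch, hidden_dim)
        return self.fc(h_n[-1])                       # class logits

logits = CLSTM(vocab_size=10000)(torch.randint(0, 10000, (4, 20)))
```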

Citations
Proceedings ArticleDOI

Hierarchical Attention Networks for Document Classification

TL;DR: Experiments conducted on six large-scale text classification tasks demonstrate that the proposed architecture outperforms previous methods by a substantial margin.
Journal ArticleDOI

A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures

TL;DR: The LSTM cell and its variants are reviewed to explore the learning capacity of the LSTM cell, and LSTM networks are divided into two broad categories: LSTM-dominated networks and integrated LSTM networks.
Journal ArticleDOI

Predicting residential energy consumption using CNN-LSTM neural networks

TL;DR: This paper proposes a CNN-LSTM neural network that extracts spatial and temporal features to effectively predict housing energy consumption, achieving nearly perfect prediction performance for electric energy consumption that was previously difficult to predict.
Journal ArticleDOI

Text Classification Algorithms: A Survey

TL;DR: An overview of text classification algorithms is discussed, covering text feature extraction, dimensionality reduction methods, existing algorithms and techniques, and evaluation methods.
Journal ArticleDOI

Bidirectional LSTM with attention mechanism and convolutional layer for text classification

TL;DR: This paper proposes a novel and unified architecture that combines a bidirectional LSTM (BiLSTM), an attention mechanism, and a convolutional layer, and that outperforms other state-of-the-art text classification methods in classification accuracy.
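A rough sketch of how such a combination can be wired together is shown below. The layer ordering (convolution, then BiLSTM, then additive attention) and all dimensions are assumptions made for illustration, not the cited paper's exact configuration.

```python
# Illustrative sketch of a conv + BiLSTM + attention text classifier.
# Ordering and sizes are assumed, not taken from the cited paper.
import torch
import torch.nn as nn

class ConvBiLSTMAttention(nn.Module):
    def __init__(self, vocab_size, embed_dim=300, num_filters=128,
                 hidden_dim=128, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.conv = nn.Conv1d(embed_dim, num_filters, kernel_size=3, padding=1)
        self.bilstm = nn.LSTM(num_filters, hidden_dim, batch_first=True,
                              bidirectional=True)
        self.attn = nn.Linear(2 * hidden_dim, 1)       # additive attention scorer
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, tokens):                          # (batch, seq_len)
        x = self.embedding(tokens).transpose(1, 2)      # (batch, embed, seq)
        x = torch.relu(self.conv(x)).transpose(1, 2)    # (batch, seq, filters)
        h, _ = self.bilstm(x)                           # (batch, seq, 2*hidden)
        weights = torch.softmax(self.attn(h), dim=1)    # (batch, seq, 1)
        context = (weights * h).sum(dim=1)              # attention-weighted sum
        return self.fc(context)                         # class logits

logits = ConvBiLSTMAttention(vocab_size=10000)(torch.randint(0, 10000, (4, 20)))
```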
References
Posted Content

When Are Tree Structures Necessary for Deep Learning of Representations?

TL;DR: The authors show that recursive neural models can outperform simple recurrent neural networks (such as LSTMs) on several tasks, including sentiment classification at the sentence and phrase level, matching questions to answer phrases, discourse parsing, and semantic relation extraction.
Proceedings ArticleDOI

Molding CNNs for text: non-linear, non-consecutive convolutions

TL;DR: This work revises the temporal convolution operation in CNNs to better adapt it to text processing, appealing to tensor algebra and using low-rank n-gram tensors to directly exploit interactions between words already at the convolution stage.
Posted Content

Self-Adaptive Hierarchical Sentence Model

TL;DR: Both qualitative and quantitative analyses show that AdaSent can automatically form and select representations suitable for the task at hand during training, yielding superior classification performance over competitor models on 5 benchmark data sets.
Proceedings ArticleDOI

Discriminative Neural Sentence Modeling by Tree-Based Convolution

TL;DR: This paper proposes a tree-based convolutional neural network (TBCNN) that leverages constituency or dependency trees of sentences to extract structural features, which are then aggregated by max pooling.
Posted Content

Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

TL;DR: A model is introduced that is able to represent the meaning of documents by embedding them in a low dimensional vector space, while preserving distinctions of word and sentence order crucial for capturing nuanced semantics.