scispace - formally typeset
Open AccessPosted Content

A C-LSTM Neural Network for Text Classification

Reads0
Chats0
TLDR
C-LSTM is a novel and unified model for sentence representation and text classification that outperforms both CNN and LSTM and can achieve excellent performance on these tasks.
Abstract
Neural network models have been demonstrated to be capable of achieving remarkable performance in sentence and document modeling. Convolutional neural network (CNN) and recurrent neural network (RNN) are two mainstream architectures for such modeling tasks, which adopt totally different ways of understanding natural languages. In this work, we combine the strengths of both architectures and propose a novel and unified model called C-LSTM for sentence representation and text classification. C-LSTM utilizes CNN to extract a sequence of higher-level phrase representations, and are fed into a long short-term memory recurrent neural network (LSTM) to obtain the sentence representation. C-LSTM is able to capture both local features of phrases as well as global and temporal sentence semantics. We evaluate the proposed architecture on sentiment classification and question classification tasks. The experimental results show that the C-LSTM outperforms both CNN and LSTM and can achieve excellent performance on these tasks.

read more

Citations
More filters
Proceedings ArticleDOI

A Hybrid Algorithm for Text Classification Based on CNN-BLSTM with Attention

TL;DR: An effective text classification framework is proposed, which is the hybrid of different weights of character-level and word-level features through concatenation based on Convolutional Neural Network-bidirectional long short-term memory with attention (BACNN).
Proceedings ArticleDOI

BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features Based Ensemble Learning

TL;DR: This article used contextual embeddings extracted from pre-trained transformer models to represent the text data at various granularities and propose a multi-granularity knowledge sharing approach for news propaganda detection.
Proceedings ArticleDOI

Opinion Expression Detection via Deep Bidirectional C-GRUs

Xiaoxia Xie
TL;DR: This work considers opinion expression detection as a sequence labeling task and describes deep neural network frameworks that consist of convolutional neural networks (CNNs) and bidirectional gated units (Bi-GRUs) that provide the framework to effectively detect opinion expression.
Proceedings ArticleDOI

FrAppLe: A Framework for Apprenticeship Learning

TL;DR: A framework that helps in developing apprentice agents that learn from Human demonstration and also proposes a process to learn and self-diagnose the agent's action for apprenticeship learning, which alleviates the need to explicitly program the apprentice agents.
Proceedings ArticleDOI

Personality Prediction Based on Content of Facebook Users:A Literature Review

TL;DR: An overview on the personality prediction based on language features is provided and may act as a support material for those who wish to know about personality prediction from the Facebook status update content.
References
More filters
Journal ArticleDOI

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
Proceedings ArticleDOI

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.
Posted Content

Sequence to Sequence Learning with Neural Networks

TL;DR: This paper presents a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure, and finds that reversing the order of the words in all source sentences improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.
Related Papers (5)