Text Classification Algorithms: A Survey

doi:10.3390/INFO10040150

Open AccessJournal ArticleDOI

Text Classification Algorithms: A Survey

Kamran Kowsari, +5 more

- 17 Apr 2019 -

arXiv: Learning

TLDR

An overview of text classification algorithms is discussed, which covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods.

Abstract:

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods. Finally, the limitations of each technique and their application in the real-world problem are discussed.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning--based Text Classification: A Comprehensive Review

Shervin Minaee, +5 more

- 17 Apr 2021 -

ACM Computing Surveys

TL;DR: This paper provided a comprehensive review of more than 150 deep learning-based models for text classification developed in recent years, and discussed their technical contributions, similarities, and strengths, and provided a quantitative analysis of the performance of different deep learning models on popular benchmarks.

...read moreread less

Combining Instance-Based Learning and Logistic Regression for Multilabel Classification.

Weiwei Cheng, +1 more

TL;DR: This paper proposes a new approach to multilabel classification, which is based on a framework that unifies instance-based learning and logistic regression, comprising both methods as special cases, and allows one to capture interdependencies between labels and to combine model-based and similarity-based inference for multILabel classification.

...read moreread less

Journal ArticleDOI

COVID-19 public sentiment insights and machine learning for tweets classification

Jim Samuel, +5 more

- 01 Jun 2020 -

Information-an International Interdiscip...

TL;DR: Insight is provided into the progress of fear-sentiment over time as COVID-19 approached peak levels in the United States, using descriptive textual analytics supported by necessary textual data visualizations and two essential machine learning classification methods are provided.

...read moreread less

Posted ContentDOI

COVID-19 Public Sentiment Insights and Machine Learning for Tweets Classification

Jim Samuel, +5 more

- 03 Jun 2020 -

medRxiv

TL;DR: Insight is provided into the progress of fear-sentiment over time as COVID-19 approached peak levels in the United States, using descriptive textual analytics supported by necessary textual data visualizations and a methodological overview of two essential machine learning classification methods.

...read moreread less

Posted Content

A Survey on Text Classification: From Shallow to Deep Learning

Qian Li, +7 more

- 02 Aug 2020 -

arXiv: Computation and Language

TL;DR: A taxonomy for text classification according to the text involved and the models used for feature extraction and classification is created, dealing with both the technical developments and benchmark datasets that support tests of predictions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Journal ArticleDOI

Deep learning

Yann LeCun, +4 more

- 28 May 2015 -

Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

...read moreread less

Journal ArticleDOI

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

...read moreread less

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning as mentioned in this paper is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts, and it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

...read moreread less

Collapse

Text Classification Algorithms: A Survey

Citations

Deep Learning--based Text Classification: A Comprehensive Review

Combining Instance-Based Learning and Logistic Regression for Multilabel Classification.

COVID-19 public sentiment insights and machine learning for tweets classification

COVID-19 Public Sentiment Insights and Machine Learning for Tweets Classification

A Survey on Text Classification: From Shallow to Deep Learning

References

Adam: A Method for Stochastic Optimization

Long short-term memory

Deep learning

Gradient-based learning applied to document recognition

Deep Learning

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Data Classification: Algorithms and Applications

Survey of Classification Techniques in Data Mining

A review on Video Classification with Methods, Findings, Performance, Challenges, Limitations and Future Work

Human Face Detection Techniques: A Comprehensive Review and Future Research Directions

Trending Questions (1)