Open Access Proceedings Article

DOC: Deep Open Classification of Text Documents

TL;DR
This paper proposes a novel deep learning-based approach to open classification of text documents that dramatically outperforms existing state-of-the-art techniques.
Abstract
Traditional supervised learning makes the closed-world assumption that the classes that appear in the test data must have appeared in training. This also applies to text learning or text classification. As learning is used increasingly in dynamic open environments, where some new/test documents may not belong to any of the training classes, identifying these novel documents during classification presents an important problem. This problem is called open-world classification or open classification. This paper proposes a novel deep learning based approach that dramatically outperforms existing state-of-the-art techniques.
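According to the paper, DOC replaces the usual softmax output layer of a CNN text classifier with per-class sigmoids (1-vs-rest) and tightens each class's decision threshold by fitting a Gaussian to that class's scores. As a minimal sketch of the rejection idea only, not the paper's implementation, the snippet below substitutes a TF-IDF bag-of-words with one logistic (sigmoid) classifier per seen class; the toy corpus, the fixed 0.5 threshold, and the function names are illustrative assumptions.

    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression

    # Toy corpus: only two classes are ever seen in training.
    train_docs = ["the team won the match", "stocks fell sharply today",
                  "the striker scored a late goal",
                  "markets rallied after the report"]
    train_labels = np.array([0, 1, 0, 1])  # 0 = sports, 1 = finance

    vec = TfidfVectorizer()
    X = vec.fit_transform(train_docs)

    # 1-vs-rest: one independent sigmoid scorer per seen class, echoing
    # DOC's replacement of softmax with per-class sigmoid outputs.
    classifiers = {c: LogisticRegression().fit(X, (train_labels == c).astype(int))
                   for c in np.unique(train_labels)}

    def open_classify(doc, threshold=0.5):
        # Reject when no class's score clears its threshold. DOC tightens
        # these thresholds per class via Gaussian fitting; a fixed 0.5 is
        # used here purely for illustration.
        x = vec.transform([doc])
        scores = {c: clf.predict_proba(x)[0, 1] for c, clf in classifiers.items()}
        best = max(scores, key=scores.get)
        return best if scores[best] >= threshold else "rejected"

    print(open_classify("the goalkeeper made a brilliant save"))  # a seen class
    print(open_classify("a new recipe for chocolate cake"))       # likely rejected

A document whose best per-class score stays below the threshold is flagged as novel rather than forced into one of the seen classes, which is the essence of open classification.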



Citations
Book

Lifelong Machine Learning

TL;DR: As statistical machine learning matures, it is time to make a major effort to break the isolated learning tradition and to study lifelong learning to bring machine learning to new heights.
Posted Content

Deep Learning for Anomaly Detection: A Survey.

TL;DR: A structured and comprehensive overview of research methods in deep learning-based anomaly detection, grouping state-of-the-art techniques into categories based on the underlying assumptions and approach adopted.
Journal Article

Recent Advances in Open Set Recognition: A Survey

TL;DR: This paper provides a comprehensive survey of existing open set recognition techniques, covering related definitions, model representations, datasets, evaluation criteria, and algorithm comparisons; it highlights the limitations of existing approaches and points out promising directions for subsequent research.
Journal Article

A Unifying Review of Deep and Shallow Anomaly Detection

TL;DR: This review aims to identify the common underlying principles and the assumptions that are often made implicitly by various methods in deep learning, and draws connections between classic “shallow” and novel deep approaches and shows how this relation might cross-fertilize or extend both directions.
Posted Content

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems

TL;DR: A Transferable Dialogue State Generator (TRADE) that generates dialogue states from utterances using a copy mechanism, facilitating transfer when predicting (domain, slot, value) triplets not encountered during training.
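TRADE's full model is an encoder-decoder with soft-gated copying; the snippet below is only a toy illustration of the copy-mechanism step it relies on, mixing a generation distribution over the vocabulary with an attention-induced copy distribution over the source utterance. The vocabulary, utterance, and probability values are invented for illustration.

    import numpy as np

    vocab = ["<unk>", "book", "a", "table", "for", "two"]
    source_tokens = ["book", "a", "table", "for", "two"]  # toy utterance

    def copy_step(p_vocab, attention, p_gen):
        # Final distribution = p_gen * generate-from-vocab
        #                    + (1 - p_gen) * copy-from-source (via attention).
        p_final = p_gen * p_vocab
        for attn, tok in zip(attention, source_tokens):
            p_final[vocab.index(tok)] += (1.0 - p_gen) * attn
        return p_final

    p_vocab = np.full(len(vocab), 1.0 / len(vocab))    # uncertain generator
    attention = np.array([0.05, 0.05, 0.1, 0.1, 0.7])  # attends to "two"
    print(copy_step(p_vocab, attention, p_gen=0.3).round(3))

Because the copy term puts mass directly on source tokens, the model can emit slot values it never saw as training labels, which is what enables transfer to unseen (domain, slot, value) triplets.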
References
Journal Article

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
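As a compact illustration of that constant-error-carousel idea, here is a single LSTM step in NumPy: the cell state is updated additively (c = f*c + i*g), which is what lets error flow across long time lags. The gate ordering, dimensions, and initialization are arbitrary choices for the sketch, not the paper's original formulation.

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def lstm_step(x, h, c, W, b):
        # One LSTM step: input, forget, and output gates plus candidate g.
        z = W @ np.concatenate([x, h]) + b
        i, f, o, g = np.split(z, 4)
        i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
        c = f * c + i * g          # additive update: the "carousel"
        h = o * np.tanh(c)
        return h, c

    # Toy dimensions (illustrative): 3-dim input, 4-dim hidden state.
    rng = np.random.default_rng(0)
    n_in, n_h = 3, 4
    W = rng.normal(scale=0.1, size=(4 * n_h, n_in + n_h))
    b = np.zeros(4 * n_h)
    h, c = np.zeros(n_h), np.zeros(n_h)
    for t in range(5):             # run a short toy sequence
        h, c = lstm_step(rng.normal(size=n_in), h, c, W, b)
    print(h.round(3))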
Book

Deep Learning

TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and to understand the world in terms of a hierarchy of concepts; it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and video games.
Proceedings Article

Convolutional Neural Networks for Sentence Classification

TL;DR: The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, including sentiment analysis and question classification, and a simple architecture modification is proposed to allow the use of both task-specific and static vectors.
Posted Content

Convolutional Neural Networks for Sentence Classification

TL;DR: In this article, CNNs are trained on top of pre-trained word vectors for sentence-level classification tasks and a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks.
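The architecture both entries describe is small enough to sketch: convolutions of several widths over word embeddings followed by max-over-time pooling and a linear classifier. The PyTorch sketch below uses randomly initialized embeddings and invented dimensions; the paper initializes the embeddings from pre-trained word2vec vectors.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class KimStyleCNN(nn.Module):
        # Minimal Kim (2014)-style sentence classifier: convolutions of
        # several widths over word embeddings, max-over-time pooling, logits.
        def __init__(self, vocab_size=10000, emb_dim=300, n_classes=2,
                     widths=(3, 4, 5), n_filters=100):
            super().__init__()
            self.emb = nn.Embedding(vocab_size, emb_dim)
            self.convs = nn.ModuleList(
                [nn.Conv1d(emb_dim, n_filters, w) for w in widths])
            self.fc = nn.Linear(n_filters * len(widths), n_classes)

        def forward(self, token_ids):                # (batch, seq_len)
            x = self.emb(token_ids).transpose(1, 2)  # (batch, emb, seq)
            pooled = [F.relu(conv(x)).max(dim=2).values for conv in self.convs]
            return self.fc(torch.cat(pooled, dim=1))  # class logits

    logits = KimStyleCNN()(torch.randint(0, 10000, (8, 20)))  # 8 sentences, 20 tokens
    print(logits.shape)  # torch.Size([8, 2])

Max-over-time pooling keeps only the strongest response of each filter, so variable-length sentences map to a fixed-size feature vector.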
Journal Article

Bidirectional recurrent neural networks

TL;DR: It is shown how the proposed bidirectional structure can be easily modified to allow efficient estimation of the conditional posterior probability of complete symbol sequences without making any explicit assumption about the shape of the distribution.
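A minimal PyTorch sketch of the bidirectional structure: the sequence is read forward and backward and the two hidden states are concatenated, so the output at every step is conditioned on the entire input, which is what the paper exploits when estimating posteriors over complete symbol sequences. The sizes here are arbitrary.

    import torch
    import torch.nn as nn

    # Forward and backward passes run in parallel; their hidden states
    # are concatenated along the feature dimension.
    birnn = nn.RNN(input_size=8, hidden_size=16, bidirectional=True,
                   batch_first=True)
    x = torch.randn(4, 10, 8)   # 4 sequences, 10 steps, 8 features each
    out, h_n = birnn(x)
    print(out.shape)            # torch.Size([4, 10, 32]): fwd + bwd states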