Journal ArticleDOI
Learning representations for weakly supervised natural language processing tasks
TL;DR: Novel techniques for extracting features from n-gram models, Hidden Markov Models, and other statistical language models are investigated, including a novel Partial Lattice Markov Random Field model.

Abstract: Finding the right representations for words is critical for building accurate NLP systems when domain-specific labeled data for the task is scarce. This article investigates novel techniques for extracting features from n-gram models, Hidden Markov Models, and other statistical language models, including a novel Partial Lattice Markov Random Field model. Experiments on part-of-speech tagging and information extraction, among other tasks, indicate that features taken from statistical language models, in combination with more traditional features, outperform traditional representations alone, and that graphical model representations outperform n-gram models, especially on sparse and polysemous words.
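The article's own models are not reproduced here, but the core idea — deriving word features from a statistical language model and concatenating them with traditional features — can be sketched with a toy HMM. In this hedged example, the transition, emission, and initial-state parameters are illustrative assumptions (not the article's learned models); per-token state posteriors from forward-backward serve as soft features alongside one-hot word features.

```python
import numpy as np

# Hypothetical toy HMM parameters for illustration only (not from the article):
# 2 hidden states, vocabulary of 3 word types.
pi = np.array([0.6, 0.4])            # initial state distribution
A = np.array([[0.7, 0.3],
              [0.4, 0.6]])           # state transition probabilities
B = np.array([[0.5, 0.4, 0.1],
              [0.1, 0.3, 0.6]])      # per-state emission probabilities

def state_posteriors(obs):
    """Forward-backward: P(state_t = k | all observations) for each token."""
    T, K = len(obs), len(pi)
    alpha = np.zeros((T, K))
    beta = np.zeros((T, K))
    alpha[0] = pi * B[:, obs[0]]                      # forward pass
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[:, obs[t]]
    beta[-1] = 1.0                                    # backward pass
    for t in range(T - 2, -1, -1):
        beta[t] = A @ (B[:, obs[t + 1]] * beta[t + 1])
    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)   # normalize per token

sentence = [0, 2, 1]                         # word indices into the toy vocabulary
hmm_feats = state_posteriors(sentence)       # (T, K) soft HMM-state features
onehot = np.eye(3)[sentence]                 # traditional one-hot word features
features = np.hstack([onehot, hmm_feats])    # combined representation, shape (T, 5)
```

The combined `features` matrix is what a downstream tagger or extractor would consume; the HMM columns give sparse or ambiguous words a distributional signal that one-hot features alone lack.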
Citations
Proceedings ArticleDOI
Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks
TL;DR: This paper proposes a novel model dubbed Piecewise Convolutional Neural Networks (PCNNs) with multi-instance learning to address the wrong-label problem that arises when using distant supervision for relation extraction; it adopts a convolutional architecture with piecewise max pooling to automatically learn relevant features.
Proceedings ArticleDOI
Tailoring Continuous Word Representations for Dependency Parsing
TL;DR: It is found that all embeddings yield significant parsing gains, including some recent ones that can be trained in a fraction of the time of others, suggesting their complementarity.
Journal ArticleDOI
A survey on the application of recurrent neural networks to statistical language modeling
TL;DR: This paper presents a survey on the application of recurrent neural networks to the task of statistical language modeling, and gives an overview of the most important extensions.
Proceedings ArticleDOI
Deep Multilingual Correlation for Improved Word Embeddings
TL;DR: Deep non-linear transformations of word embeddings in two languages are learned using the recently proposed deep canonical correlation analysis, improving their quality and consistency on multiple word and bigram similarity tasks.
Proceedings ArticleDOI
Unsupervised Morphology Induction Using Word Embeddings
Radu Soricut, Franz Josef Och +1 more
TL;DR: A language-agnostic, unsupervised method for inducing morphological transformations between words that relies on certain regularities manifest in high-dimensional vector spaces and is capable of discovering a wide range of morphological rules.
References
Journal ArticleDOI
Maximum likelihood from incomplete data via the EM algorithm
Journal ArticleDOI
Latent Dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article
Latent Dirichlet Allocation
TL;DR: This paper proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models, including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Journal ArticleDOI
A tutorial on hidden Markov models and selected applications in speech recognition
TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Journal ArticleDOI
Indexing by Latent Semantic Analysis
TL;DR: A new method for automatic indexing and retrieval that takes advantage of implicit higher-order structure in the association of terms with documents ("semantic structure") in order to improve the detection of relevant documents on the basis of terms found in queries.