Open Access Proceedings Article

Unsupervised POS Induction with Word Embeddings

TL;DR: This paper showed that word embeddings can also add value to the problem of unsupervised POS induction, replacing multinomial distributions over the vocabulary with multivariate Gaussian distributions over word embeddings and observing consistent improvements in eight languages.
Abstract
Unsupervised word embeddings have been shown to be valuable as features in supervised learning problems; however, their role in unsupervised problems has been less thoroughly explored. In this paper, we show that embeddings can likewise add value to the problem of unsupervised POS induction. In two representative models of POS induction, we replace multinomial distributions over the vocabulary with multivariate Gaussian distributions over word embeddings and observe consistent improvements in eight languages. We also analyze the effect of various choices while inducing word embeddings on “downstream” POS induction results.
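To make the abstract's core modeling change concrete, here is a minimal sketch (an illustration under assumed toy dimensions, not the authors' code): in an HMM-style POS inducer, each tag's emission term switches from a multinomial over the vocabulary to a multivariate Gaussian density over a pre-trained word embedding, while the rest of the model is unchanged.

```python
# Minimal sketch of the emission-distribution swap described in the
# abstract. All names and dimensions here are hypothetical.
import numpy as np
from scipy.stats import multivariate_normal

rng = np.random.default_rng(0)
num_tags, vocab_size, dim = 12, 1000, 50

# Hypothetical pre-trained embeddings: one row per word type.
embeddings = rng.normal(size=(vocab_size, dim))

# Baseline: multinomial emission, P(word | tag), a categorical over the vocabulary.
multinomial = rng.dirichlet(np.ones(vocab_size), size=num_tags)

def multinomial_logprob(tag, word_id):
    return np.log(multinomial[tag, word_id])

# Replacement: Gaussian emission, p(embedding(word) | tag) = N(mu_tag, Sigma_tag).
means = rng.normal(size=(num_tags, dim))
covs = np.stack([np.eye(dim) for _ in range(num_tags)])

def gaussian_logprob(tag, word_id):
    return multivariate_normal.logpdf(embeddings[word_id],
                                      mean=means[tag], cov=covs[tag])

# Either log-probability plugs into the same forward-backward or Viterbi
# machinery; only the emission term changes.
print(multinomial_logprob(3, 42), gaussian_logprob(3, 42))
```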


Citations
Book Chapter

Redefining Context Windows for Word Embedding Models: An Experimental Study

TL;DR: This paper presented a systematic analysis of context windows based on a set of four distinct hyperparameters, trained continuous Skip-gram models on two English-language corpora for various combinations of these hyperparameters, and evaluated them on both lexical similarity and analogy tasks.
Proceedings Article

Mutual Information Maximization for Simple and Accurate Part-Of-Speech Induction

Karl Stratos
TL;DR: This work addresses part-of-speech (POS) induction by maximizing the mutual information between the induced label and its context, and shows that the variational lower bound is robust whereas the generalized Brown objective is vulnerable.
Journal Article

Sparse Coding of Neural Word Embeddings for Multilingual Sequence Labeling

TL;DR: A sequence labeling framework which solely utilizes sparse indicator features derived from dense distributed word representations and obtains (near) state-of-the-art performance for both part-of-speech tagging and named entity recognition in a variety of languages.

Normalization and parsing algorithms for uncertain input

TL;DR: This research attempts to improve the automatic analysis of spontaneous language by translating it to 'normal' language, developing a modular normalization model, MoNoise, which reaches new state-of-the-art performance on a variety of languages.
Journal Article

Learning With Annotation of Various Degrees

TL;DR: A novel deep conditional random field model is proposed which is trained in an end-to-end manner to smoothly handle fully, partially, and unlabeled sequences within a unified framework, and could be one of the first works to utilize partially labeled instances for sequence labeling.
References
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: This paper proposed two novel model architectures for computing continuous vector representations of words from very large data sets; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best-performing techniques based on different types of neural networks.
Journal Article

Indexing by Latent Semantic Analysis

TL;DR: A new method for automatic indexing and retrieval is described that takes advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.
Journal Article

Natural Language Processing (Almost) from Scratch

TL;DR: A unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including part-of-speech tagging, chunking, named entity recognition, and semantic role labeling is proposed.
Proceedings Article

A unified architecture for natural language processing: deep neural networks with multitask learning

TL;DR: This work describes a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense using a language model.
Journal Article

Class-based n-gram models of natural language

TL;DR: This work addresses the problem of predicting a word from previous words in a sample of text and discusses n-gram models based on classes of words, finding that these models are able to extract classes that have the flavor of either syntactically based groupings or semantically based groupings, depending on the nature of the underlying statistics.
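For intuition, a class-based bigram model of the kind this reference describes factors the word bigram probability through word classes: P(w_i | w_{i-1}) = P(w_i | c(w_i)) · P(c(w_i) | c(w_{i-1})). The toy sketch below (an assumption for illustration; the word-to-class mapping and corpus are invented) estimates both factors by counting.

```python
# Toy sketch of a class-based bigram model: the word-to-class mapping
# and corpus are hypothetical, chosen only to illustrate the factoring
# P(w_i | w_{i-1}) = P(w_i | c(w_i)) * P(c(w_i) | c(w_{i-1})).
from collections import Counter

word2class = {"the": "DET", "a": "DET", "dog": "NOUN", "cat": "NOUN",
              "runs": "VERB", "sleeps": "VERB"}
corpus = "the dog runs the cat sleeps a dog sleeps".split()

classes = [word2class[w] for w in corpus]
class_bigrams = Counter(zip(classes, classes[1:]))
class_prev_counts = Counter(classes[:-1])  # denominator for P(c | c_prev)
class_counts = Counter(classes)            # denominator for P(w | c)
word_counts = Counter(corpus)

def bigram_prob(prev_word, word):
    c_prev, c = word2class[prev_word], word2class[word]
    p_trans = class_bigrams[(c_prev, c)] / class_prev_counts[c_prev]
    p_emit = word_counts[word] / class_counts[c]
    return p_trans * p_emit

print(bigram_prob("the", "dog"))  # 1.0 * 2/3
```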