Open Access Proceedings Article
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
John Lafferty, Andrew McCallum, Fernando Pereira
pp. 282–289
TL;DR
This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Abstract
We present conditional random fields, a framework for building probabilistic models to segment and label sequence data. Conditional random fields offer several advantages over hidden Markov models and stochastic grammars for such tasks, including the ability to relax strong independence assumptions made in those models. Conditional random fields also avoid a fundamental limitation of maximum entropy Markov models (MEMMs) and other discriminative Markov models based on directed graphical models, which can be biased towards states with few successor states. We present iterative parameter estimation algorithms for conditional random fields and compare the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
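The conditional, sequence-level normalization the abstract describes can be sketched for a toy linear chain. Everything below (the two states, the binary observation alphabet, the emission and transition weights) is invented for illustration; the paper's iterative parameter estimation algorithms are not shown, only the definition of p(y | x) and the forward recursion for the partition function Z(x).

```python
import math
from itertools import product

# Toy linear-chain CRF sketch (all scores are made-up illustrative weights):
#   score(y, x) = sum_t emit[y_t][x_t] + sum_t trans[y_{t-1}][y_t]
#   p(y | x)    = exp(score(y, x)) / Z(x),  Z(x) summing over ALL label sequences.
STATES = ["A", "B"]
emit = {"A": {0: 1.0, 1: -1.0}, "B": {0: -1.0, 1: 1.0}}
trans = {"A": {"A": 0.5, "B": -0.5}, "B": {"A": -0.5, "B": 0.5}}

def score(labels, obs):
    """Unnormalized log-score of one label sequence for one observation sequence."""
    s = sum(emit[y][x] for y, x in zip(labels, obs))
    s += sum(trans[a][b] for a, b in zip(labels, labels[1:]))
    return s

def partition(obs):
    """Z(x) via the forward algorithm in O(T * |S|^2) instead of O(|S|^T)."""
    alpha = {y: math.exp(emit[y][obs[0]]) for y in STATES}
    for x in obs[1:]:
        alpha = {y: sum(alpha[yp] * math.exp(trans[yp][y] + emit[y][x])
                        for yp in STATES)
                 for y in STATES}
    return sum(alpha.values())

def prob(labels, obs):
    """Globally normalized conditional probability p(labels | obs)."""
    return math.exp(score(labels, obs)) / partition(obs)

obs = [0, 1, 1]
# Because normalization is over whole label sequences, these sum to 1.
total = sum(prob(list(ys), obs) for ys in product(STATES, repeat=len(obs)))
```

The global normalization by Z(x), computed once per input sequence, is exactly what distinguishes this from a MEMM's per-state normalization and avoids the label bias toward low-entropy states mentioned in the abstract.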
Citations
Proceedings Article
Online Large-Margin Training of Dependency Parsers
TL;DR: An effective training algorithm for linearly scored dependency parsers is presented, implementing online large-margin multi-class training on top of efficient parsing techniques for dependency trees.
Posted Content
Natural Language Processing (almost) from Scratch
TL;DR: The authors proposed a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including: part-of-speech tagging, chunking, named entity recognition, and semantic role labeling.
Journal Article
Domain adaptation for statistical classifiers
Hal Daumé, Daniel Marcu
TL;DR: This work introduces a statistical formulation of the domain adaptation problem in terms of a simple mixture model, presents an instantiation of this framework for maximum entropy classifiers and their linear-chain counterparts, and reports improved performance on three real-world tasks across four data sets from the natural language processing domain.
Proceedings Article
Learning Conditional Random Fields for Stereo
Daniel Scharstein, Chris Pal
TL;DR: This paper constructs a large number of stereo datasets with ground-truth disparities, uses a subset of them to learn the parameters of conditional random fields (CRFs), and presents experimental results illustrating the potential of this approach for automatically learning the parameters of models with richer structure than standard hand-tuned MRF models.
Proceedings Article
Learning to map sentences to logical form: structured classification with probabilistic categorial grammars
Luke Zettlemoyer, Michael Collins
TL;DR: A learning algorithm is described that takes as input a training set of sentences labeled with expressions in the lambda calculus and induces a grammar for the problem, along with a log-linear model that represents a distribution over syntactic and semantic analyses conditioned on the input sentence.
References
More filters
Journal Article
A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
Yoav Freund, Robert E. Schapire
TL;DR: The model studied can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting; the multiplicative weight-update Littlestone–Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases but applicable to a considerably more general class of learning problems.
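The multiplicative weight-update rule this summary refers to can be sketched on toy per-expert losses. The beta value and the loss sequence below are illustrative choices, not from the paper; the point is only the mechanism: weight each expert down by beta raised to its loss, and predict with the normalized weights.

```python
import math

def hedge(losses_per_round, beta=0.5):
    """Multiplicative weight-update sketch over a fixed pool of experts.

    losses_per_round: list of rounds, each a list of per-expert losses in [0, 1].
    Returns the algorithm's cumulative expected loss and the final weights.
    """
    n = len(losses_per_round[0])
    w = [1.0] * n                 # uniform initial weights
    total_loss = 0.0
    for losses in losses_per_round:
        z = sum(w)
        p = [wi / z for wi in w]  # current distribution over experts
        total_loss += sum(pi * li for pi, li in zip(p, losses))
        # Multiplicative update: good experts (low loss) keep their weight.
        w = [wi * beta ** li for wi, li in zip(w, losses)]
    return total_loss, w

# Illustrative data: expert 0 is always right, expert 1 always wrong.
rounds = [[0.0, 1.0]] * 10
loss, weights = hedge(rounds)
```

After ten rounds the bad expert's weight has decayed to beta**10 while the good expert's is untouched, so the algorithm's cumulative loss stays close to the best expert's (here, zero) plus a small regret term.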
Gradient-based learning applied to document recognition
Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner
TL;DR: This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task; convolutional neural networks are shown to outperform all other techniques.
Book
Foundations of Statistical Natural Language Processing
TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.
Book
Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
TL;DR: This book gives a unified, up-to-date and self-contained account, with a Bayesian slant, of such methods and, more generally, of probabilistic methods for sequence analysis.
Journal Article
A maximum entropy approach to natural language processing
TL;DR: A maximum-likelihood approach for automatically constructing maximum entropy models is presented, and an efficient implementation of this approach is described, using several problems in natural language processing as examples.
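The maximum-likelihood construction this summary describes can be sketched on a one-feature toy problem. The data and the indicator feature below are invented, and plain gradient ascent is used rather than the paper's own fitting procedure; the sketch shows the defining maxent property, that at the maximum-likelihood weights the model's expected feature count matches the empirical count.

```python
import math

# Toy maximum entropy classifier: p(y | x) proportional to exp(sum_i w_i * f_i(x, y)).
# Two classes {0, 1}; data and feature are illustrative, not from the paper.
data = [(1, 1), (1, 1), (1, 0), (0, 0)]   # (x, y) observations

def features(x, y):
    # Single hypothetical indicator feature: fires when the label matches the input.
    return [1.0 if x == y else 0.0]

def prob(w, x):
    """Conditional distribution over labels given x under weights w."""
    scores = {y: math.exp(sum(wi * fi for wi, fi in zip(w, features(x, y))))
              for y in (0, 1)}
    z = sum(scores.values())
    return {y: s / z for y, s in scores.items()}

# Gradient ascent on the conditional log-likelihood. The gradient is the
# empirical feature count minus the model-expected feature count.
w = [0.0]
for _ in range(500):
    grad = [0.0]
    for x, y in data:
        p = prob(w, x)
        grad[0] += features(x, y)[0] - sum(p[yy] * features(x, yy)[0]
                                           for yy in (0, 1))
    w = [wi + 0.1 * gi for wi, gi in zip(w, grad)]
```

For this data the feature fires 3 times out of 4, so the fitted model puts probability 3/4 on the matching label, i.e. the weight converges to ln 3.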