Incremental Parsing with the Perceptron Algorithm
Michael Collins, Brian Roark
pp. 111–118
TLDR
It is demonstrated that training a perceptron model to combine with the generative model during search provides a 2.1 percent F-measure improvement over the generative model alone, to 88.8 percent.
Abstract
This paper describes an incremental parsing approach where parameters are estimated using a variant of the perceptron algorithm. A beam-search algorithm is used during both training and decoding phases of the method. The perceptron approach was implemented with the same feature set as that of an existing generative model (Roark, 2001a), and experimental results show that it gives competitive performance to the generative model on parsing the Penn treebank. We demonstrate that training a perceptron model to combine with the generative model during search provides a 2.1 percent F-measure improvement over the generative model alone, to 88.8 percent.
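The core idea in the abstract — a perceptron whose updates are driven by the same beam search used at decoding time — can be illustrated with a toy sketch. This is not the authors' implementation: the feature function, the tagging-style action set, and all names below are illustrative assumptions.

```python
# Toy sketch of beam-search decoding with perceptron training, in the spirit
# of the paper's approach. Features and data here are illustrative only.
from collections import defaultdict

def features(prefix, word, action):
    # Toy features: (word, action) and (previous action, action) pairs.
    prev = prefix[-1] if prefix else "<s>"
    return [("word_act", word, action), ("bigram", prev, action)]

def score(weights, feats):
    return sum(weights[f] for f in feats)

def beam_decode(weights, words, actions, beam_size=4):
    # Each hypothesis is (cumulative score, sequence of actions so far).
    beam = [(0.0, [])]
    for w in words:
        candidates = []
        for s, prefix in beam:
            for a in actions:
                candidates.append(
                    (s + score(weights, features(prefix, w, a)), prefix + [a]))
        candidates.sort(key=lambda c: -c[0])
        beam = candidates[:beam_size]  # prune to the top hypotheses
    return beam[0][1]

def perceptron_train(data, actions, epochs=5, beam_size=4):
    weights = defaultdict(float)
    for _ in range(epochs):
        for words, gold in data:
            pred = beam_decode(weights, words, actions, beam_size)
            if pred != gold:
                # Standard perceptron update: reward gold features,
                # penalize the features of the incorrect prediction.
                for seq, delta in ((gold, 1.0), (pred, -1.0)):
                    for i, w in enumerate(words):
                        for f in features(seq[:i], w, seq[i]):
                            weights[f] += delta
    return weights

# Toy usage: learn to map each word to its uppercase form.
data = [(["a", "b"], ["A", "B"]), (["b", "a"], ["B", "A"])]
w = perceptron_train(data, actions=["A", "B"])
print(beam_decode(w, ["a", "b"], ["A", "B"]))  # prints ['A', 'B']
```

In the paper's setting the hypotheses in the beam are incremental parse prefixes rather than tag sequences, but the interplay of search and update is the same: the model is penalized for whatever the beam search actually returns.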
Citations
Proceedings ArticleDOI
Moses: Open Source Toolkit for Statistical Machine Translation
Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, C. Corbett Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Elena Constantin, Evan Herbst
TL;DR: An open-source toolkit for statistical machine translation whose novel contributions are support for linguistically motivated factors, confusion network decoding, and efficient data formats for translation models and language models.
Proceedings Article
Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
TL;DR: This work proposes a curriculum learning strategy to gently change the training process from a fully guided scheme using the true previous token, towards a less guided scheme which mostly uses the generated token instead.
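The curriculum described in that summary amounts to a probability schedule over which previous token the model is fed. A minimal sketch, assuming an inverse-sigmoid decay (one of several schedules; the function names here are illustrative):

```python
# Illustrative sketch of the scheduled-sampling idea: start by feeding the
# gold previous token, and gradually switch to the model's own prediction.
import math
import random

def inverse_sigmoid_decay(step, k=10.0):
    # Probability of feeding the *true* previous token; decays toward 0
    # as training progresses. k controls how fast the schedule decays.
    return k / (k + math.exp(step / k))

def choose_prev_token(true_token, model_token, step, rng=random):
    # With probability eps(step), use the gold token (teacher forcing);
    # otherwise feed the model its own previous prediction.
    eps = inverse_sigmoid_decay(step)
    return true_token if rng.random() < eps else model_token
```

Early in training `eps` is close to 1 (fully guided), and it shrinks so that later updates are computed mostly from the model's own generated prefix.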
Proceedings ArticleDOI
Online Large-Margin Training of Dependency Parsers
TL;DR: An effective training algorithm for linearly-scored dependency parsers that implements online large-margin multi-class training on top of efficient parsing techniques for dependency trees is presented.
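The "online large-margin" training mentioned above can be sketched as a single MIRA-style update for a linear model: take the smallest weight change that makes the gold structure outscore the predicted one by the loss. The feature representation and function name below are illustrative assumptions, not the paper's implementation.

```python
# Hedged sketch of one online large-margin (MIRA-style) update for a
# linear model over sparse feature dictionaries.
def mira_update(weights, gold_feats, pred_feats, loss):
    # Feature difference between the correct and predicted structures.
    diff = {}
    for f, v in gold_feats.items():
        diff[f] = diff.get(f, 0.0) + v
    for f, v in pred_feats.items():
        diff[f] = diff.get(f, 0.0) - v
    margin = sum(weights.get(f, 0.0) * v for f, v in diff.items())
    norm_sq = sum(v * v for v in diff.values())
    if norm_sq == 0.0:
        return weights  # identical features: nothing to separate
    # Smallest step that makes the gold score beat the prediction by `loss`.
    tau = max(0.0, (loss - margin) / norm_sq)
    for f, v in diff.items():
        weights[f] = weights.get(f, 0.0) + tau * v
    return weights

# Usage: with zero weights, gold feature "a" vs. predicted feature "b"
# and loss 1.0, the update yields weights {"a": 0.5, "b": -0.5}.
w = mira_update({}, {"a": 1.0}, {"b": 1.0}, loss=1.0)
```

After the update, the gold structure outscores the prediction by exactly the loss, which is the defining property of the passive-aggressive/MIRA family.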
Proceedings Article
Grammar as a foreign language
TL;DR: The domain agnostic attention-enhanced sequence-to-sequence model achieves state-of-the-art results on the most widely used syntactic constituency parsing dataset, when trained on a large synthetic corpus that was annotated using existing parsers.
Proceedings Article
Online Learning of Approximate Dependency Parsing Algorithms.
Ryan McDonald, Fernando Pereira
TL;DR: This paper extends the maximum spanning tree dependency parsing framework to incorporate higher-order feature representations and allow dependency structures with multiple parents per word, and shows that those extensions can make the MST framework computationally intractable, but that the intractability can be circumvented with new approximate parsing algorithms.
References
Proceedings Article
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Proceedings ArticleDOI
Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms
TL;DR: Experimental results on part-of-speech tagging and base noun phrase chunking are given, in both cases showing improvements over results for a maximum-entropy tagger.
Journal ArticleDOI
An efficient boosting algorithm for combining preferences
TL;DR: This work describes and analyzes an efficient algorithm called RankBoost for combining preferences based on the boosting approach to machine learning, and gives theoretical results describing the algorithm's behavior both on the training data and on new test data not seen during training.
Proceedings Article
An Efficient Boosting Algorithm for Combining Preferences
TL;DR: RankBoost is an algorithm for combining preferences based on the boosting approach to machine learning; it applies to problems such as combining the results of different search engines, or the "collaborative filtering" task of ranking movies for a user based on rankings provided by other users.
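The heart of a RankBoost-style round is a multiplicative re-weighting of preference pairs: pairs the current weak ranker already orders correctly are down-weighted, so later rounds focus on the remaining misordered pairs. A toy sketch, where the weak ranker, data, and names are illustrative assumptions:

```python
# Toy sketch of one RankBoost-style boosting round over preference pairs.
import math

def rankboost_round(pairs, scores, h, alpha):
    # `pairs` maps (worse, better) instance pairs to distribution weights D;
    # `h` is the weak ranker chosen this round, `alpha` its vote weight.
    new_pairs = {}
    for (x0, x1), d in pairs.items():
        # Pairs that h orders correctly (h(x1) > h(x0)) get down-weighted.
        new_pairs[(x0, x1)] = d * math.exp(alpha * (h(x0) - h(x1)))
    z = sum(new_pairs.values())
    for p in new_pairs:
        new_pairs[p] /= z  # renormalize to a distribution
    # The ensemble ranking function accumulates alpha-weighted weak rankers.
    for x in scores:
        scores[x] += alpha * h(x)
    return new_pairs, scores
```

Repeating this round with freshly chosen weak rankers yields the final ranking as the alpha-weighted sum of weak ranker outputs.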