Hierarchical Phrase-Based Translation

doi:10.1162/COLI.2007.33.2.201

Journal ArticleDOI

Hierarchical Phrase-Based Translation

David Chiang

- 01 Jun 2007 -

Computational Linguistics

- Vol. 33, Iss: 2, pp 201-228

Chats0

TLDR

A statistical machine translation model that uses hierarchical phrasesphrases that contain subphrasing that is formally a synchronous context-free grammar but is learned from a parallel text without any syntactic annotations is presented.

Abstract:

We present a statistical machine translation model that uses hierarchical phrases---phrases that contain subphrases. The model is formally a synchronous context-free grammar but is learned from a parallel text without any syntactic annotations. Thus it can be seen as combining fundamental ideas from both syntax-based translation and phrase-based translation. We describe our system's training and decoding methods in detail, and evaluate it for translation speed and translation accuracy. Using BLEU as a metric of translation accuracy, we find that our system performs significantly better than the Alignment Template System, a state-of-the-art phrase-based system.

Citations

PDF

Open Access

More filters

Proceedings Article

A Simple, Fast, and Effective Reparameterization of IBM Model 2

Chris Dyer, +2 more

TL;DR: A simple log-linear reparameterization of IBM Model 2 that overcomes problems arising from Model 1’'s strong assumptions and Model 2’s overparameterization is presented.

...read moreread less

Proceedings ArticleDOI

Six Challenges for Neural Machine Translation.

Philipp Koehn, +1 more

TL;DR: The authors explore six challenges for NMT: domain mismatch, amount of training data, rare words, long sentences, word alignment, and beam search, and show both deficiencies and improvements over the quality of phrase-based statistical machine translation.

...read moreread less

Journal ArticleDOI

Synthesis Lectures on Human Language Technologies

Philip Williams, +3 more

Proceedings ArticleDOI

Addressing the Rare Word Problem in Neural Machine Translation

Thang Luong, +4 more

TL;DR: This paper proposed and implemented an effective technique to address the problem of out-of-vocabulary (OOV) word translation in NMT, which trains an NMT system on data that is augmented by the output of a word alignment algorithm, and then uses this information in a post-processing step that translates every OOV word using a dictionary.

...read moreread less

Proceedings ArticleDOI

Modeling Coverage for Neural Machine Translation

Zhaopeng Tu, +4 more

TL;DR: This paper propose a coverage vector to keep track of the attention history and feed it to the attention model to adjust future attention, which enables NMT system to consider more about untranslated source words.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Introduction to Algorithms

Thomas H. Cormen, +2 more

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.

...read moreread less

Proceedings ArticleDOI

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

...read moreread less

Introduction to Algorithms

Adhi Harmoko S, +4 more

Proceedings Article

SRILM – An Extensible Language Modeling Toolkit

Andreas Stolcke

TL;DR: The functionality of the SRILM toolkit is summarized and its design and implementation is discussed, highlighting ease of rapid prototyping, reusability, and combinability of tools.

...read moreread less

Journal Article

The mathematics of statistical machine translation: parameter estimation

Peter Fitzhugh Brown, +3 more

- 01 Jun 1993 -

Computational Linguistics

TL;DR: The authors describe a series of five statistical models of the translation process and give algorithms for estimating the parameters of these models given a set of pairs of sentences that are translations of one another.

...read moreread less

Collapse

Computational Linguistics

Hierarchical Phrase-Based Translation

Citations

A Simple, Fast, and Effective Reparameterization of IBM Model 2

Six Challenges for Neural Machine Translation.

Synthesis Lectures on Human Language Technologies

Addressing the Rare Word Problem in Neural Machine Translation

Modeling Coverage for Neural Machine Translation

References

Introduction to Algorithms

Bleu: a Method for Automatic Evaluation of Machine Translation

Introduction to Algorithms

SRILM – An Extensible Language Modeling Toolkit

The mathematics of statistical machine translation: parameter estimation

Related Papers (5)

Bleu: a Method for Automatic Evaluation of Machine Translation

Statistical phrase-based translation

Minimum Error Rate Training in Statistical Machine Translation

Moses: Open Source Toolkit for Statistical Machine Translation

A systematic comparison of various statistical alignment models