Guiding Neural Machine Translation Decoding with External Knowledge
Rajen Chatterjee, Matteo Negri, Marco Turchi, Marcello Federico, Lucia Specia, Frédéric Blain +5 more
pp. 157–168
TLDR
This work proposes a “guide” mechanism that enhances an existing NMT decoder with the ability to prioritize and adequately handle translation options presented in the form of XML annotations of source words.
Abstract:
© 2017 The Authors. Published by Association for Computational Linguistics. This is an open access article available under a Creative Commons licence.
The published version can be accessed at the following link on the publisher’s website: http://dx.doi.org/10.18653/v1/W17-4716
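The XML annotations the TL;DR refers to can be pictured with a small sketch, assuming Moses-style markup; the `<n>` tag name and `translation` attribute are illustrative assumptions, not necessarily the paper's exact format:

```python
import xml.etree.ElementTree as ET

# Hypothetical Moses-style markup: the <n> tag pins a suggested translation
# for the source word it wraps (tag/attribute names are assumptions).
src = '<seg>The <n translation="Bundeskanzlerin">chancellor</n> spoke .</seg>'
root = ET.fromstring(src)

# Collect (source word, suggested translation) pairs for the decoder to prioritize.
constraints = [(n.text, n.get("translation")) for n in root.iter("n")]
```

The guide mechanism would then steer the decoder toward emitting the suggested target word whenever the annotated source word is being translated.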
Citations
Book
Neural Machine Translation
TL;DR: A comprehensive treatment of the topic, ranging from introduction to neural networks, computation graphs, description of the currently dominant attentional sequence-to-sequence model, recent refinements, alternative architectures and challenges.
Posted Content
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
Matt Post, David Vilar +1 more
TL;DR: This work presents an algorithm for lexically constrained decoding with a complexity of O(1) in the number of constraints and demonstrates the algorithm’s remarkable ability to properly place constraints, and uses it to explore the shaky relationship between model and BLEU scores.
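The beam-bank idea behind this O(1) behavior can be illustrated with a toy helper (the `allocate_banks` function and its leftover-slot policy are a simplification for illustration, not Post and Vilar's actual dynamic scheme): the fixed beam is split into `num_constraints + 1` banks, one per number of constraints already satisfied, so total decoding work does not grow with the number of constraints.

```python
def allocate_banks(beam_size, num_constraints):
    # One bank per possible count of satisfied constraints: 0, 1, ..., C.
    banks = num_constraints + 1
    base, extra = divmod(beam_size, banks)
    # Hypothetical policy: give leftover slots to banks that have
    # satisfied more constraints, nudging hypotheses toward completion.
    return [base + (1 if i >= banks - extra else 0) for i in range(banks)]
```

For a beam of 5 and 2 constraints this yields banks of sizes [1, 2, 2]; the real algorithm reassigns unused bank slots dynamically at each decoding step.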
Posted Content
Guiding Neural Machine Translation with Retrieved Translation Pieces
TL;DR: This paper proposes a simple, fast, and effective method for recalling previously seen translation examples and incorporating them into the NMT decoding process, and compares favorably to an alternative retrieval-based method with respect to accuracy, speed, and simplicity of implementation.
Posted Content
Jointly Learning to Align and Translate with Transformer Models
TL;DR: The authors extract discrete alignments from the attention probabilities learnt during regular neural machine translation model training and leverage them in a multi-task framework to optimize towards translation and alignment objectives, and demonstrate that their approach produces competitive results compared to GIZA++ trained IBM alignment models without sacrificing translation accuracy.
Proceedings ArticleDOI
CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech.
TL;DR: In this paper, the authors describe the creation of the first large-scale, multilingual, expert-based dataset of hate-speech/counter-narrative pairs, which has been built with the effort of more than 100 operators from three different NGOs that applied their training and expertise to the task.
References
Proceedings ArticleDOI
Bleu: a Method for Automatic Evaluation of Machine Translation
TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.
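The two ingredients BLEU combines, modified (clipped) n-gram precision and a brevity penalty, can be sketched in a toy single-reference form; this is an illustration with no smoothing, not the official scoring implementation:

```python
import math
from collections import Counter

def bleu(candidate, reference, max_n=4):
    # Modified n-gram precision for n = 1..max_n over tokenized sentences.
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(tuple(candidate[i:i + n]) for i in range(len(candidate) - n + 1))
        ref = Counter(tuple(reference[i:i + n]) for i in range(len(reference) - n + 1))
        overlap = sum(min(c, ref[g]) for g, c in cand.items())  # clipped counts
        precisions.append(overlap / max(sum(cand.values()), 1))
    if min(precisions) == 0:
        return 0.0  # any empty n-gram overlap zeroes the geometric mean
    # Brevity penalty discourages overly short candidates.
    bp = 1.0 if len(candidate) >= len(reference) else \
        math.exp(1 - len(reference) / len(candidate))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)
```

A candidate identical to its reference scores 1.0; a candidate sharing no n-grams scores 0.0.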
Proceedings Article
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
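The soft-search amounts to attention: score each source position, normalize with a softmax, and take the weighted sum as the context for predicting the next target word. A minimal sketch, assuming a hypothetical `soft_attention` helper with precomputed alignment scores (the real model learns the scores with a small network):

```python
import math

def soft_attention(scores, values):
    # Softmax over alignment scores (shift by max for numerical stability).
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]  # attention weights, sum to 1
    # Context vector: attention-weighted sum of the source-position vectors.
    context = [sum(w * v[i] for w, v in zip(weights, values))
               for i in range(len(values[0]))]
    return context, weights
```

With equal scores the weights are uniform, i.e. the model attends to all source positions equally.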
Posted Content
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: In this paper, the authors propose to use a soft-searching model to find the parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
Proceedings Article
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.
TL;DR: Adaptive subgradient methods as discussed by the authors dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning, which allows us to find needles in haystacks in the form of very predictive but rarely seen features.
Journal Article
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization
TL;DR: This work describes and analyzes an apparatus for adaptively modifying the proximal function, which significantly simplifies setting a learning rate and results in regret guarantees that are provably as good as the best proximal function that can be chosen in hindsight.
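The per-coordinate adaptation these two entries describe (AdaGrad) can be sketched in a few lines; `adagrad_step` is a hypothetical helper, not the authors' code. Squared gradients accumulate per coordinate, so frequently updated coordinates get a shrinking effective step size while rare but predictive features keep a large one:

```python
import math

def adagrad_step(params, grads, accum, lr=0.1, eps=1e-8):
    # accum[i] holds the running sum of squared gradients for coordinate i.
    for i, g in enumerate(grads):
        accum[i] += g * g
        # Effective learning rate decays as sqrt of accumulated squared gradient.
        params[i] -= lr * g / (math.sqrt(accum[i]) + eps)
    return params, accum
```

The eps term only guards against division by zero on the very first update of a coordinate.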