Open Access Book

Discourse in Statistical Machine Translation

TL;DR
This thesis addresses the technical and linguistic aspects of discourse-level processing in phrase-based statistical machine translation (SMT) with a focus on connected texts.
Abstract
This thesis addresses the technical and linguistic aspects of discourse-level processing in phrase-based statistical machine translation (SMT). Connected texts can have complex text-level linguisti ...

Citations
Proceedings Article

Evaluating Discourse Phenomena in Neural Machine Translation

TL;DR: This paper investigated the performance of multi-encoder NMT models trained on subtitles for English to French and found that decoding the concatenation of the previous and current sentence leads to good performance.
Posted Content

Neural Machine Translation with Extended Context

TL;DR: In this pilot study, interesting cross-sentential attention patterns that improve textual coherence in translation at least in some selected cases are observed.
Proceedings Article

Using Discourse Structure Improves Machine Translation Evaluation

TL;DR: Two discourse-aware similarity measures are designed, which use all-subtree kernels to compare discourse parse trees in accordance with the Rhetorical Structure Theory and can help improve a number of existing machine translation evaluation metrics both at the segment- and at the system-level.
Proceedings Article

Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation

TL;DR: This work described the design, the evaluation setup, and the results of the DiscoMT 2015 shared task, which included two subtasks, relevant to both the machine translation (MT) and the discourse communities: pronoun-focused translation and cross-lingual pronoun prediction.
Proceedings Article

Validation of an Automatic Metric for the Accuracy of Pronoun Translation (APT)

TL;DR: A reference-based metric is defined and assessed to evaluate the accuracy of pronoun translation (APT), which automatically aligns a candidate and a reference translation using GIZA++ augmented with specific heuristics, and then counts the number of identical or different pronouns.
References
Journal Article

A mathematical theory of communication

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
Journal Article

Optimization by Simulated Annealing

TL;DR: There is a deep and useful connection between statistical mechanics and multivariate or combinatorial optimization (finding the minimum of a given function depending on many parameters), and a detailed analogy with annealing in solids provides a framework for optimization of very large and complex systems.
Journal Article

Equation of state calculations by fast computing machines

TL;DR: In this article, a modified Monte Carlo integration over configuration space is used to investigate the properties of a two-dimensional rigid-sphere system with a set of interacting individual molecules, and the results are compared to free volume equations of state and a four-term virial coefficient expansion.
Journal Article

Learning representations by back-propagating errors

TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
Proceedings Article

Bleu: a Method for Automatic Evaluation of Machine Translation

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.