
Showing papers by "Kevin Duh published in 2010"


Proceedings Article
09 Oct 2010
TL;DR: An automatic evaluation metric based on rank correlation coefficients modified with precision is proposed and meta-evaluation of the NTCIR-7 PATMT JE task data shows that this metric outperforms conventional metrics.
Abstract: Automatic evaluation of Machine Translation (MT) quality is essential to developing high-quality MT systems. Various evaluation metrics have been proposed, and BLEU is now used as the de facto standard metric. However, when we consider translation between distant language pairs such as Japanese and English, most popular metrics (e.g., BLEU, NIST, PER, and TER) do not work well. It is well known that Japanese and English have completely different word orders, and special care must be taken with word order in translation. Otherwise, translations with wrong word order often lead to misunderstanding and incomprehensibility. For instance, SMT-based Japanese-to-English translators tend to translate 'A because B' as 'B because A.' Thus, word order is the most important problem for distant language translation. However, conventional evaluation metrics do not significantly penalize such word order mistakes. Therefore, locally optimizing these metrics leads to inadequate translations. In this paper, we propose an automatic evaluation metric based on rank correlation coefficients modified with precision. Our meta-evaluation of the NTCIR-7 PATMT JE task data shows that this metric outperforms conventional metrics.
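The core idea, rank correlation scaled by unigram precision, can be sketched as follows (a simplified illustration, not the paper's exact formulation; the function name and the one-to-one word-matching restriction are ours):

```python
from itertools import combinations

def rank_correlation_precision(hyp, ref):
    """Toy metric: Kendall-style rank correlation over matched word
    positions, scaled by unigram precision so extra words are penalized."""
    # Match only words occurring exactly once in both sentences.
    ranks = [ref.index(w) for w in hyp
             if hyp.count(w) == 1 and ref.count(w) == 1]
    if len(ranks) < 2:
        return 0.0
    # Fraction of matched word pairs appearing in the same relative order.
    pairs = list(combinations(ranks, 2))
    tau = sum(1 for a, b in pairs if a < b) / len(pairs)
    precision = len(ranks) / len(hyp)
    return tau * precision
```

For instance, 'a because b' scored against itself gives 1.0, while the order-swapped 'b because a' gives 0.0, even though both contain exactly the same unigrams.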

335 citations


Proceedings Article
15 Jul 2010
TL;DR: This paper proposes an alternative single reordering rule: Head Finalization, a syntax-based preprocessing approach that offers the advantage of simplicity and shows that its result, Head Final English (HFE), follows almost the same order as Japanese.
Abstract: English is a typical SVO (Subject-Verb-Object) language, while Japanese is a typical SOV language. Conventional Statistical Machine Translation (SMT) systems work well within each of these language families. However, SMT-based translation from an SVO language to an SOV language does not work well because their word orders are completely different. Recently, a few groups have proposed rule-based preprocessing methods to mitigate this problem (Xu et al., 2009; Hong et al., 2009). These methods rewrite SVO sentences to derive more SOV-like sentences by using a set of handcrafted rules. In this paper, we propose an alternative single reordering rule: Head Finalization. This is a syntax-based preprocessing approach that offers the advantage of simplicity. We do not have to be concerned about part-of-speech tags or rule weights because the powerful Enju parser allows us to implement the rule at a general level. Our experiments show that its result, Head Final English (HFE), follows almost the same order as Japanese. We also show that this rule improves automatic evaluation scores.
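The single rule can be sketched on a toy binary tree in which each node marks which daughter is the syntactic head (the actual method reads heads off Enju's HPSG output; this tuple representation is ours):

```python
def head_finalize(node):
    """Recursively move each node's head daughter to the end, turning
    head-initial (SVO-like) structure into head-final (SOV-like) order.
    Internal nodes are (head_side, left, right); leaves are strings."""
    if isinstance(node, str):
        return [node]
    head_side, left, right = node
    l, r = head_finalize(left), head_finalize(right)
    # If the head is the left daughter, swap so the head comes last.
    return r + l if head_side == "left" else l + r
```

A toy tree for 'John hit the ball' with the verb as left head of the VP yields ['John', 'the', 'ball', 'hit'], mirroring Japanese SOV order.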

95 citations


Proceedings Article
15 Jul 2010
TL;DR: This paper proposes a novel method for long distance, clause-level reordering in statistical machine translation (SMT), which separately translates clauses in the source sentence and reconstructs the target sentence using the clause translations with non-terminals.
Abstract: This paper proposes a novel method for long distance, clause-level reordering in statistical machine translation (SMT). The proposed method separately translates clauses in the source sentence and reconstructs the target sentence using the clause translations with non-terminals. The non-terminals are placeholders for embedded clauses, by which we reduce complicated clause-level reordering to simple word-level reordering. Its translation model is trained using a bilingual corpus with clause-level alignment, which can be automatically annotated by our alignment algorithm with a syntactic parser in the source language. We achieved significant improvements of 1.4% in BLEU and 1.3% in TER by using Moses, and 2.2% in BLEU and 3.5% in TER by using our hierarchical phrase-based SMT, for the English-to-Japanese translation of research paper abstracts in the medical domain.
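The placeholder mechanics can be sketched as below (the translate argument is a toy stand-in for the clause-level SMT system; all names and data are illustrative):

```python
def translate_with_clauses(clauses, translate):
    """clauses maps a clause id to its token list; tokens of the form
    '#N' are non-terminal placeholders for embedded clause N. Each clause
    is translated separately (placeholders survive translation), then the
    target sentence is reconstructed by substituting translations in."""
    def expand(cid):
        out = []
        for tok in translate(clauses[cid]):
            if tok.startswith("#"):
                out.extend(expand(int(tok[1:])))
            else:
                out.append(tok)
        return out
    return expand(0)
```

Note that the placeholder '#1' moves around like an ordinary word during clause translation, which is how clause-level reordering reduces to word-level reordering.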

41 citations


01 Jan 2010
TL;DR: It is shown that sometimes out-of-domain data may help word alignment more than it helps phrase coverage, and that more flexible combination of data along different parts of the training pipeline may lead to better results.
Abstract: Numerous empirical results have shown that combining data from multiple domains often improves statistical machine translation (SMT) performance. For example, if we desire to build SMT for the medical domain, it may be beneficial to augment the training data with bitext from another domain, such as parliamentary proceedings. Despite the positive results, it is not clear exactly how and where additional out-of-domain data helps in the SMT training pipeline. In this work, we analyze this problem in detail, considering the following hypotheses: out-of-domain data helps by either (a) improving word alignment or (b) improving phrase coverage. Using a multitude of datasets (IWSLT-TED, EMEA, Europarl, OpenSubtitles, KDE), we show that sometimes out-of-domain data may help word alignment more than it helps phrase coverage, and that more flexible combination of data along different parts of the training pipeline may lead to better results.

19 citations


Proceedings Article
15 Jul 2010
TL;DR: A new framework for N-best reranking on sparse feature sets is proposed, where each N-best list corresponds to a distinct task, together with a meta-algorithm that uses multitask learning (such as ℓ1/ℓ2 regularization) to discover common feature representations across N-best lists.
Abstract: We propose a new framework for N-best reranking on sparse feature sets. The idea is to reformulate the reranking problem as a Multitask Learning problem, where each N-best list corresponds to a distinct task. This is motivated by the observation that N-best lists often show significant differences in feature distributions. Training a single reranker directly on this heterogeneous data can be difficult. Our proposed meta-algorithm solves this challenge by using multitask learning (such as ℓ1/ℓ2 regularization) to discover common feature representations across N-best lists. This meta-algorithm is simple to implement, and its modular approach allows one to plug-in different learning algorithms from existing literature. As a proof of concept, we show statistically significant improvements on a machine translation system involving millions of features.
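The ℓ1/ℓ2 (group-lasso) regularizer at the heart of such multitask approaches can be sketched as follows (the weight-matrix layout and function name are ours, not the paper's):

```python
import math

def l1_l2_penalty(W):
    """Group-lasso penalty for multitask learning: W[t][f] is the weight
    of feature f in task (N-best list) t. Taking the l2 norm across tasks
    for each feature and summing those norms (l1) drives entire feature
    columns to zero jointly, yielding a shared feature representation."""
    n_tasks = len(W)
    n_features = len(W[0])
    return sum(
        math.sqrt(sum(W[t][f] ** 2 for t in range(n_tasks)))
        for f in range(n_features)
    )
```

Features whose cross-task norm is driven to zero are discarded in every N-best list at once, which is what makes the selected sparse representation common to all lists.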

17 citations


Proceedings Article
23 Aug 2010
TL;DR: The approach extends the synchronous context-free grammar rules of the hierarchical phrase-based model to include reordered source strings, allowing efficient calculation of reordering model scores during decoding, and shows that the BLEU scores obtained by the proposed system were better than those obtained by a standard hierarchical phrase-based machine translation system.
Abstract: Hierarchical phrase-based machine translation can capture global reordering with synchronous context-free grammar, but has little ability to evaluate the correctness of word orderings during decoding. We propose a method to integrate a word-based reordering model into hierarchical phrase-based machine translation to overcome this weakness. Our approach extends the synchronous context-free grammar rules of the hierarchical phrase-based model to include reordered source strings, allowing efficient calculation of reordering model scores during decoding. Our experimental results on a Japanese-to-English basic travel expression corpus showed that the BLEU scores obtained by our proposed system were better than those obtained by a standard hierarchical phrase-based machine translation system.
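A sketch of the mechanism, assuming each extended rule stores its source positions permuted into target order, with a toy distance-based penalty standing in for the word-based reordering model (both functions are illustrative, not the paper's actual model):

```python
def reorder_source(src_tokens, order):
    """Apply a rule's permutation (target position -> source position)
    to produce the reordered source string scored during decoding."""
    return [src_tokens[i] for i in order]

def distortion_score(order):
    """Toy stand-in for a word-based reordering model: penalize
    non-monotone jumps in the reordered source positions."""
    return -sum(abs(order[i] - order[i - 1] - 1) for i in range(1, len(order)))
```

Here reorder_source(['a', 'b', 'c'], [2, 0, 1]) yields ['c', 'a', 'b'], and the monotone permutation [0, 1, 2] incurs no penalty, so the decoder can score reorderings cheaply rule by rule.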

14 citations


01 Jan 2010
TL;DR: The two components of the NTT statistical machine translation system, a decoder and a reranker, are described, and the results for the evaluation campaign of IWSLT 2008 are presented.
Abstract: The NTT Statistical Machine Translation System consists of two primary components: a statistical machine translation decoder and a reranker. The decoder generates k-best translation candidates using hierarchical phrase-based translation based on a synchronous context-free grammar. The decoder employs a linear combination of several real-valued scores from the translation and language models. The reranker reorders the k-best translation candidates using Ranking SVMs with a large number of sparse features. This paper describes the two components and presents the results for the evaluation campaign of IWSLT 2008.
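The decoder's linear feature combination amounts to a weighted sum of model scores, roughly as below (feature names are illustrative, not the system's actual feature set):

```python
def linear_score(weights, features):
    """Linear combination of real-valued model scores, used to rank
    k-best translation candidates."""
    return sum(weights[name] * value for name, value in features.items())
```

For example, with weights {'tm': 1.0, 'lm': 0.5, 'wp': -0.2} and candidate scores {'tm': -2.0, 'lm': -4.0, 'wp': 10.0}, the candidate's total score is -6.0.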

6 citations


Proceedings Article
15 Jul 2010
TL;DR: The authors participated in the SemEval-2010 Japanese Word Sense Disambiguation (WSD) task and focused on investigating domain differences, incorporating topic features, and predicting new unknown senses with Support Vector Machine and Maximum Entropy classifiers.
Abstract: We participated in the SemEval-2010 Japanese Word Sense Disambiguation (WSD) task (Task 16) and focused on the following: (1) investigating domain differences, (2) incorporating topic features, and (3) predicting new unknown senses. We experimented with Support Vector Machines (SVM) and Maximum Entropy (MEM) classifiers. We achieved 80.1% accuracy in our experiments.

5 citations