Topic

Rule-based machine translation

About: Rule-based machine translation is a research topic. Over the lifetime, 8804 publications have been published within this topic receiving 240581 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

[...]

Melvin Johnson¹, Mike Schuster¹, Quoc V. Le¹, Maxim Krikun¹, Yonghui Wu¹, Zhifeng Chen¹, Nikhil Thorat¹, Fernanda B. Viégas¹, Martin Wattenberg¹, Greg S. Corrado¹, Macduff Hughes¹, Jeffrey Dean¹ - Show less +8 more•Institutions (1)

Google¹

09 Oct 2017-Transactions of the Association for Computational Linguistics

TL;DR: This work proposes a simple solution to use a single Neural Machine Translation (NMT) model to translate between multiple languages using a shared wordpiece vocabulary, and introduces an artificial token at the beginning of the input sentence to specify the required target language.

...read moreread less

Abstract: We propose a simple solution to use a single Neural Machine Translation (NMT) model to translate between multiple languages. Our solution requires no changes to the model architecture from a standard NMT system but instead introduces an artificial token at the beginning of the input sentence to specify the required target language. Using a shared wordpiece vocabulary, our approach enables Multilingual NMT using a single model. On the WMT’14 benchmarks, a single multilingual model achieves comparable performance for English→French and surpasses state-of-the-art results for English→German. Similarly, a single multilingual model surpasses state-of-the-art results for French→English and German→English on WMT’14 and WMT’15 benchmarks, respectively. On production corpora, multilingual models of up to twelve language pairs allow for better translation of many individual pairs. Our models can also learn to perform implicit bridging between language pairs never seen explicitly during training, showing that transfer learning and zero-shot translation is possible for neural translation. Finally, we show analyses that hints at a universal interlingua representation in our models and show some interesting examples when mixing languages.

...read moreread less

1,288 citations

Journal Article•DOI•

Hierarchical Phrase-Based Translation

[...]

David Chiang¹•Institutions (1)

University of Maryland, College Park¹

01 Jun 2007-Computational Linguistics

TL;DR: A statistical machine translation model that uses hierarchical phrasesphrases that contain subphrasing that is formally a synchronous context-free grammar but is learned from a parallel text without any syntactic annotations is presented.

...read moreread less

Abstract: We present a statistical machine translation model that uses hierarchical phrases---phrases that contain subphrases. The model is formally a synchronous context-free grammar but is learned from a parallel text without any syntactic annotations. Thus it can be seen as combining fundamental ideas from both syntax-based translation and phrase-based translation. We describe our system's training and decoding methods in detail, and evaluate it for translation speed and translation accuracy. Using BLEU as a metric of translation accuracy, we find that our system performs significantly better than the Alignment Template System, a state-of-the-art phrase-based system.

...read moreread less

1,265 citations

Proceedings Article•DOI•

Discriminative Training and Maximum Entropy Models for Statistical Machine Translation

[...]

Franz Josef Och¹, Hermann Ney¹•Institutions (1)

RWTH Aachen University¹

06 Jul 2002

TL;DR: A framework for statistical machine translation of natural languages based on direct maximum entropy models, which contains the widely used source-channel approach as a special case and shows that a baseline statistical machinetranslation system is significantly improved using this approach.

...read moreread less

Abstract: We present a framework for statistical machine translation of natural languages based on direct maximum entropy models, which contains the widely used source-channel approach as a special case. All knowledge sources are treated as feature functions, which depend on the source language sentence, the target language sentence and possible hidden variables. This approach allows a baseline machine translation system to be extended easily by adding new feature functions. We show that a baseline statistical machine translation system is significantly improved using this approach.

...read moreread less

1,216 citations

Book•

Language Development: Form and Function in Emerging Grammars

[...]

Lois Bloom

15 Jul 1970

TL;DR: This paper used nonlinguistic information from situational and behavioral context to infer the semantic intent of utterances in order to analyze the development of linguistic expression, and demonstrated the extent (and limitations) of the child's knowledge of basic grammatical relations in the earliest two-word utterances.

...read moreread less

Abstract: The research reported is in investigation into the early acquisition of grammar by three children from the age of approximately 19 months. Nonlinguistic information from situational and behavioral context was used to infer the semantic intent of utterances in order to analyze the development of linguistic expression. Previous psycholinguistic studies of child language had described utterances in terms of the orderly distribution with which words occurred in juxtaposition. In this study, by making judgments of semantic intent, it was possible to describe the inherent structure of utterances so that conclusions could be drawn about the child's knowledge of semantic-syntactic relationship in the derivation of sentences. For example, when the child said "Mommy sock" and Mommy was putting the child's sock on the child, it was clear that a different semantic interpretation was intended than when the child said "Mommy sock" and picked up Mommy's sock. The syntactic components of generative transformational grammars were proposed for those samples of the children's language in which mean length of utterance was less than 1.5 morphemes.For the psychologist, the book provides added insight into the relative development of syntactic expression and underlying cognitive function. It was clear, for example, that the two did not develop hand in hand. For the linguist, the book provides additional evidence for the growing conclusion that child language is not incoherent. There is strong evidence presented to demonstrate the extent (and limitations) of the child's knowledge of basic grammatical relations in the earliest two-word utterances. For the speech pathologist concerned with language disorders in children, the evidence presented and the resulting conclusions should provide important hypotheses for application in treatment.One of the major contributions that this book will make to the literature on child language is the presentation of a large body of data in support of the conclusions that have been drawn. There is an extensive catalog of the children's earliest two-word utterances, negative sentences, and syntactic and single-word lexicons. This evidence should prove invaluable to other researchers in the field.

...read moreread less

1,149 citations

Journal Article•DOI•

The Alignment Template Approach to Statistical Machine Translation

[...]

Franz Josef Och¹, Hermann Ney²•Institutions (2)

Google¹, RWTH Aachen University²

01 Dec 2004-Computational Linguistics

TL;DR: A phrase-based statistical machine translation approach the alignment template approach is described, which allows for general many-to-many relations between words and is easier to extend than classical statistical machinetranslation systems.

...read moreread less

Abstract: A phrase-based statistical machine translation approach — the alignment template approach — is described. This translation approach allows for general many-to-many relations between words. Thereby, the context of words is taken into account in the translation model, and local changes in word order from source to target language can be learned explicitly. The model is described using a log-linear modeling approach, which is a generalization of the often used source–channel approach. Thereby, the model is easier to extend than classical statistical machine translation systems. We describe in detail the process for learning phrasal translations, the feature functions used, and the search algorithm. The evaluation of this approach is performed on three different tasks. For the German–English speech VERBMOBIL task, we analyze the effect of various system components. On the French–English Canadian HANSARDS task, the alignment template system obtains significantly better results than a single-word-based translation model. In the Chinese–English 2002 National Institute of Standards and Technology (NIST) machine translation evaluation it yields statistically significantly better NIST scores than all competing research and commercial translation systems.

...read moreread less

1,031 citations

Collapse

Network Information

Performance

Metrics

9,214

Papers

255,695

Citations

No. of papers in the topic in previous years
Year	Papers
2023	127
2022	282
2021	136
2020	183
2019	174
2018	174

Rule-based machine translation

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics