Home
/
Authors
/
Katsuhito Sudoh

Author

Katsuhito Sudoh

Nara Institute of Science and Technology

Other affiliations: Nippon Telegraph and Telephone, Kyoto University

Bio: Katsuhito Sudoh is an academic researcher from Nara Institute of Science and Technology. The author has contributed to research in topics: Machine translation & Example-based machine translation. The author has an hindex of 19, co-authored 99 publications receiving 1386 citations. Previous affiliations of Katsuhito Sudoh include Nippon Telegraph and Telephone & Kyoto University.

Papers published on a yearly basis

2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2007
2006
2005
2003

Papers

PDF

Open Access

More filters

Proceedings Article•

Automatic Evaluation of Translation Quality for Distant Language Pairs

[...]

Hideki Isozaki¹, Tsutomu Hirao¹, Kevin Duh¹, Katsuhito Sudoh¹, Hajime Tsukada¹ - Show less +1 more•Institutions (1)

Nippon Telegraph and Telephone¹

09 Oct 2010

TL;DR: An automatic evaluation metric based on rank correlation coefficients modified with precision is proposed and meta-evaluation of the NTCIR-7 PATMT JE task data shows that this metric outperforms conventional metrics.

...read moreread less

Abstract: Automatic evaluation of Machine Translation (MT) quality is essential to developing high-quality MT systems. Various evaluation metrics have been proposed, and BLEU is now used as the de facto standard metric. However, when we consider translation between distant language pairs such as Japanese and English, most popular metrics (e.g., BLEU, NIST, PER, and TER) do not work well. It is well known that Japanese and English have completely different word orders, and special care must be paid to word order in translation. Otherwise, translations with wrong word order often lead to misunderstanding and incomprehensibility. For instance, SMT-based Japanese-to-English translators tend to translate 'A because B' as 'B because A.' Thus, word order is the most important problem for distant language translation. However, conventional evaluation metrics do not significantly penalize such word order mistakes. Therefore, locally optimizing these metrics leads to inadequate translations. In this paper, we propose an automatic evaluation metric based on rank correlation coefficients modified with precision. Our meta-evaluation of the NTCIR-7 PATMT JE task data shows that this metric outperforms conventional metrics.

...read moreread less

335 citations

Proceedings Article•

Adaptation Data Selection using Neural Language Models: Experiments in Machine Translation

[...]

Kevin Duh¹, Graham Neubig¹, Katsuhito Sudoh², Hajime Tsukada²•Institutions (2)

Nara Institute of Science and Technology¹, Nippon Telegraph and Telephone²

01 Aug 2013

TL;DR: It is found that neural language models are indeed viable tools for data selection: while the improvements are varied, they are fast to train on small in-domain data and can sometimes substantially outperform conventional n-grams.

...read moreread less

Abstract: Data selection is an effective approach to domain adaptation in statistical machine translation. The idea is to use language models trained on small in-domain text to select similar sentences from large general-domain corpora, which are then incorporated into the training data. Substantial gains have been demonstrated in previous works, which employ standard ngram language models. Here, we explore the use of neural language models for data selection. We hypothesize that the continuous vector representation of words in neural language models makes them more effective than n-grams for modeling unknown word contexts, which are prevalent in general-domain text. In a comprehensive evaluation of 4 language pairs (English to German, French, Russian, Spanish), we found that neural language models are indeed viable tools for data selection: while the improvements are varied (i.e. 0.1 to 1.7 gains in BLEU), they are fast to train on small in-domain data and can sometimes substantially outperform conventional n-grams.

...read moreread less

129 citations

Proceedings Article•

Head Finalization: A Simple Reordering Rule for SOV Languages

[...]

Hideki Isozaki¹, Katsuhito Sudoh¹, Hajime Tsukada¹, Kevin Duh¹•Institutions (1)

Nippon Telegraph and Telephone¹

15 Jul 2010

TL;DR: This paper proposes an alternative single reordering rule: Head Finalization, a syntax-based preprocessing approach that offers the advantage of simplicity and shows that its result, Head Final English (HFE), follows almost the same order as Japanese.

...read moreread less

Abstract: English is a typical SVO (Subject-Verb-Object) language, while Japanese is a typical SOV language. Conventional Statistical Machine Translation (SMT) systems work well within each of these language families. However, SMT-based translation from an SVO language to an SOV language does not work well because their word orders are completely different. Recently, a few groups have proposed rule-based preprocessing methods to mitigate this problem (Xu et al., 2009; Hong et al., 2009). These methods rewrite SVO sentences to derive more SOV-like sentences by using a set of handcrafted rules. In this paper, we propose an alternative single reordering rule: Head Finalization. This is a syntax-based preprocessing approach that offers the advantage of simplicity. We do not have to be concerned about part-of-speech tags or rule weights because the powerful Enju parser allows us to implement the rule at a general level. Our experiments show that its result, Head Final English (HFE), follows almost the same order as Japanese. We also show that this rule improves automatic evaluation scores.

...read moreread less

95 citations

Journal Article•DOI•

Incorporating discourse features into confidence scoring of intention recognition results in spoken dialogue systems

[...]

Ryuichiro Higashinaka¹, Katsuhito Sudoh¹, Mikio Nakano¹•Institutions (1)

Nippon Telegraph and Telephone¹

01 Mar 2006-Speech Communication

TL;DR: Experimental results show that incorporating discourse features significantly improves the confidence scoring of intention recognition results, and conventional methods may be insufficient since the intention recognition result is a result of discourse processing.

...read moreread less

47 citations

Proceedings Article•

Overview of the 5th Workshop on Asian Translation

[...]

Toshiaki Nakazawa¹, Katsuhito Sudoh², Shohei Higashiyama³, Chenchen Ding⁴, Raj Dabre¹, Hideya Mino, Isao Goto⁴, Win Pa Pa⁵, Anoop Kunchukuttan⁶, Sadao Kurohashi¹ - Show less +6 more•Institutions (6)

Kyoto University¹, Nara Institute of Science and Technology², Kobe University³, National Institute of Information and Communications Technology⁴, University of Computer Studies, Yangon⁵, Indian Institute of Technology Bombay⁶

01 Jan 2018

TL;DR: The results of the shared tasks from the 4th workshop on Asian translation (WAT2017) including J↔E, J→C scientific paper translation subtasks, C→J, K→J patent translation subtask, H→E mixed domain subtasks and J→E newswire subtasks are presented.

...read moreread less

Abstract: This paper presents the results of the shared tasks from the 4th workshop on Asian translation (WAT2017) including J↔E, J↔C scientific paper translation subtasks, C↔J, K↔J, E↔J patent translation subtasks, H↔E mixed domain subtasks, J↔E newswire subtasks and J↔E recipe subtasks. For the WAT2017, 12 institutions participated in the shared tasks. About 300 translation results have been submitted to the automatic evaluation server, and selected submissions were manually evaluated.

...read moreread less

47 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Collapse

Cited by

PDF

Open Access

More filters

Posted Content•

fairseq: A Fast, Extensible Toolkit for Sequence Modeling.

[...]

Myle Ott¹, Sergey Edunov¹, Alexei Baevski¹, Angela Fan¹, Sam Gross¹, Nathan Ng, David Grangier², Michael Auli¹ - Show less +4 more•Institutions (2)

Facebook¹, Google²

01 Apr 2019-arXiv: Computation and Language

TL;DR: fairseq as discussed by the authors is an open-source sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks, and supports distributed training across multiple GPUs and machines.

...read moreread less

Abstract: fairseq is an open-source sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks. The toolkit is based on PyTorch and supports distributed training across multiple GPUs and machines. We also support fast mixed-precision training and inference on modern GPUs. A demo video can be found at this https URL

...read moreread less

1,650 citations

Proceedings Article•DOI•

fairseq: A Fast, Extensible Toolkit for Sequence Modeling

[...]

Myle Ott¹, Sergey Edunov¹, Alexei Baevski¹, Angela Fan¹, Sam Gross¹, Nathan Ng, David Grangier², Michael Auli¹ - Show less +4 more•Institutions (2)

Facebook¹, Google²

01 Apr 2019

TL;DR: Fairseq is an open-source sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks and supports distributed training across multiple GPUs and machines.

...read moreread less

1,535 citations

Posted Content•

BERTScore: Evaluating Text Generation with BERT

[...]

Tianyi Zhang¹, Varsha Kishore, Felix Wu¹, Kilian Q. Weinberger¹, Yoav Artzi¹ - Show less +1 more•Institutions (1)

Cornell University¹

21 Apr 2019-arXiv: Computation and Language

TL;DR: This work proposes BERTScore, an automatic evaluation metric for text generation that correlates better with human judgments and provides stronger model selection performance than existing metrics.

...read moreread less

Abstract: We propose BERTScore, an automatic evaluation metric for text generation. Analogously to common metrics, BERTScore computes a similarity score for each token in the candidate sentence with each token in the reference sentence. However, instead of exact matches, we compute token similarity using contextual embeddings. We evaluate using the outputs of 363 machine translation and image captioning systems. BERTScore correlates better with human judgments and provides stronger model selection performance than existing metrics. Finally, we use an adversarial paraphrase detection task to show that BERTScore is more robust to challenging examples when compared to existing metrics.

...read moreread less

1,456 citations

Proceedings Article•

BERTScore: Evaluating Text Generation with BERT

[...]

Tianyi Zhang¹, Varsha Kishore, Felix Wu¹, Kilian Q. Weinberger¹, Yoav Artzi¹ - Show less +1 more•Institutions (1)

Cornell University¹

30 Apr 2020

TL;DR: This article proposed BERTScore, an automatic evaluation metric for text generation, which computes a similarity score for each token in the candidate sentence with each token from the reference sentence. But instead of exact matches, they compute token similarity using contextual embeddings.

...read moreread less

Abstract: We propose BERTScore, an automatic evaluation metric for text generation. Analogously to common metrics, BERTScore computes a similarity score for each token in the candidate sentence with each token in the reference sentence. However, instead of exact matches, we compute token similarity using contextual embeddings. We evaluate using the outputs of 363 machine translation and image captioning systems. BERTScore correlates better with human judgments and provides stronger model selection performance than existing metrics. Finally, we use an adversarial paraphrase detection task and show that BERTScore is more robust to challenging examples compared to existing metrics.

...read moreread less

819 citations

Journal Article•DOI•

A primer on neural network models for natural language processing

[...]

Yoav Goldberg¹•Institutions (1)

Bar-Ilan University¹

01 Sep 2016-Journal of Artificial Intelligence Research

TL;DR: This tutorial surveys neural network models from the perspective of natural language processing research, in an attempt to bring natural-language researchers up to speed with the neural techniques.

...read moreread less

Abstract: Over the past few years, neural networks have re-emerged as powerful machine-learning models, yielding state-of-the-art results in fields such as image recognition and speech processing. More recently, neural network models started to be applied also to textual natural language signals, again with very promising results. This tutorial surveys neural network models from the perspective of natural language processing research, in an attempt to bring natural-language researchers up to speed with the neural techniques. The tutorial covers input encoding for natural language tasks, feed-forward networks, convolutional networks, recurrent networks and recursive networks, as well as the computation graph abstraction for automatic gradient computation.

...read moreread less

760 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse