Statistical Analysis of Machine Translation Evaluation Systems for English-Hindi Language Pair

doi:10.2174/2213275912666190716100145

Journal Article

Vol. 13, Iss. 5, pp. 864-870

TLDR

The paper discusses the importance of automatic machine translation evaluation and compares several evaluation metrics by statistically analyzing metric scores against human evaluations to find which metric correlates most highly with human scores.

Abstract:

Automatic Machine Translation (AMT) evaluation metrics have recently become popular in the machine translation community, driven by the growing use of machine translation engines and by the growth of machine translation as a field. A translator is an important tool for breaking barriers between communities, especially in countries like India, where people speak 22 different languages and their many variations. With the rise of machine translation engines, there is a need for a system that evaluates how well they perform; this is the role of machine translation evaluation. This paper discusses the importance of automatic machine translation evaluation and compares several evaluation metrics by statistically analyzing metric scores and human evaluations to find which metric correlates most highly with human scores. The correlation between the automatic and human evaluation scores, and the correlations among the five automatic evaluation scores, are examined at the sentence level. In addition, a hypothesis is set up and p-values are calculated to determine how significant these correlations are. The results are presented as graphs that show the trend of the correlation between the automatic metric scores and the human scores. Of the five metrics considered in the study, METEOR shows the highest correlation with human scores.
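The sentence-level analysis described in the abstract can be illustrated with a minimal sketch. The abstract does not say which correlation coefficient or significance test the authors used, so the choices here are assumptions: Pearson's r for the correlation, and a two-sided permutation test for the p-value (picked because it needs only the Python standard library). The function names and the score lists are hypothetical, and the numbers are made up for illustration; they are not data from the paper.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(vx * vy)


def permutation_p_value(xs, ys, trials=10000, seed=0):
    """Two-sided permutation p-value for the null hypothesis of no correlation.

    Assumption: this test stands in for whatever significance test the
    paper actually used, which the abstract does not specify.
    """
    rng = random.Random(seed)
    observed = abs(pearson(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(trials):
        rng.shuffle(shuffled)
        # Count shuffles whose correlation is at least as extreme as observed.
        if abs(pearson(xs, shuffled)) >= observed - 1e-12:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (trials + 1)


# Hypothetical per-sentence scores (illustrative only, not from the paper).
human = [0.2, 0.5, 0.4, 0.9, 0.7, 0.3, 0.8, 0.6]
metric = [0.25, 0.45, 0.5, 0.85, 0.65, 0.35, 0.7, 0.6]

r = pearson(human, metric)
p = permutation_p_value(human, metric)
print(f"r = {r:.3f}, p = {p:.4f}")
```

Running the same two functions once per metric against the human scores, and once per pair of metrics, would give the metric-to-human and metric-to-metric comparisons that the abstract describes.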
