The WebNLG Challenge: Generating Text from RDF Data

doi:10.18653/V1/W17-3518

Open AccessProceedings ArticleDOI

The WebNLG Challenge: Generating Text from RDF Data

- pp 124-133

TLDR

The microplanning task is introduced, data preparation, evaluation methodology, participant results and a brief description of the participating systems are provided.

Abstract:

The WebNLG challenge consists in mapping sets of RDF triples to text It provides a common benchmark on which to train, evaluate and compare “microplanners”, ie generation systems that verbalise a given content by making a range of complex interacting choices including referring expression generation, aggregation, lexicalisation, surface realisation and sentence segmentation In this paper, we introduce the microplanning task, describe data preparation, introduce our evaluation methodology, analyse participant results and provide a brief description of the participating systems

Citations

PDF

Open Access

More filters

Book Chapter

Language and thought

D Laplane

TL;DR: This article showed that the meaning of a word depends on a context of signification which may or may not be supplied verbally and that the initial progress made by children is not related to language but to brain maturation.

...read moreread less

Posted Content

BLEURT: Learning Robust Metrics for Text Generation

Thibault Sellam, +2 more

- 09 Apr 2020 -

arXiv: Computation and Language

TL;DR: BLEURT, a learned evaluation metric for English based on BERT, can model human judgment with a few thousand possibly biased training examples and yields superior results even when the training data is scarce and out-of-distribution.

...read moreread less

Proceedings ArticleDOI

BLEURT: Learning Robust Metrics for Text Generation

Thibault Sellam, +2 more

TL;DR: This paper proposed BLEURT, a learned evaluation metric for English based on BERT, which can model human judgment with a few thousand possibly biased training examples and achieved state-of-the-art results on the last three years of the WMT Metrics shared task and the WebNLG data set.

...read moreread less

Posted Content

ToTTo: A Controlled Table-To-Text Generation Dataset

Ankur P. Parikh, +6 more

- 29 Apr 2020 -

arXiv: Computation and Language

TL;DR: An open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, produce a one-sentence description.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

...read moreread less

Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

...read moreread less

Posted Content

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

- 01 Sep 2014 -

arXiv: Computation and Language

TL;DR: In this paper, the authors propose to use a soft-searching model to find the parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

...read moreread less

Proceedings Article

A Study of Translation Edit Rate with Targeted Human Annotation

Matthew Snover, +4 more

TL;DR: A new, intuitive measure for evaluating machine translation output that avoids the knowledge intensiveness of more meaning-based approaches, and the labor-intensiveness of human judgments is defined.

...read moreread less

Collapse

The WebNLG Challenge: Generating Text from RDF Data

Citations

Knowledge Graphs

Language and thought

BLEURT: Learning Robust Metrics for Text Generation

BLEURT: Learning Robust Metrics for Text Generation

ToTTo: A Controlled Table-To-Text Generation Dataset

References

Bleu: a Method for Automatic Evaluation of Machine Translation

Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

Moses: Open Source Toolkit for Statistical Machine Translation

A Study of Translation Edit Rate with Targeted Human Annotation

Related Papers (5)

Bleu: a Method for Automatic Evaluation of Machine Translation

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

ROUGE: A Package for Automatic Evaluation of Summaries

Attention is All you Need

Adam: A Method for Stochastic Optimization