Anastasia Shimorina

Researcher at Centre national de la recherche scientifique

Publications - 20

Citations - 935

Anastasia Shimorina is an academic researcher from Centre national de la recherche scientifique. The author has contributed to research in topics: Natural language generation & Machine translation. The author has an hindex of 10, co-authored 18 publications receiving 641 citations.

Papers

PDF

Open Access

More filters

Proceedings ArticleDOI

Creating Training Corpora for NLG Micro-Planners

Claire Gardent, +3 more

TL;DR: This paper proposes the corpus generation framework as a novel method for creating challenging data sets from which NLG models can be learned which are capable of handling the complex interactions occurring during in micro-planning between lexicalisation, aggregation, surface realisation, referring expression generation and sentence segmentation.

...read moreread less

Proceedings ArticleDOI

The WebNLG Challenge: Generating Text from RDF Data

Claire Gardent, +3 more

TL;DR: The microplanning task is introduced, data preparation, evaluation methodology, participant results and a brief description of the participating systems are provided.

...read moreread less

Proceedings ArticleDOI

Split and Rephrase

Shashi Narayan, +3 more

TL;DR: This paper proposed a new sentence simplification task (Split-and-Rephrase) where the aim is to split a complex sentence into a meaning preserving sequence of shorter sentences, which can be used as a preprocessing step which facilitates and improves the performance of parsers, semantic role labelers and machine translation systems.

...read moreread less

Posted Content

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.

Sebastian Gehrmann, +55 more

- 02 Feb 2021 -

arXiv: Computation and Language

TL;DR: GEM as discussed by the authors is a living benchmark for natural language generation (NLG), its Evaluation and Metrics, which provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested.

...read moreread less

Proceedings ArticleDOI

Handling Rare Items in Data-to-Text Generation

Anastasia Shimorina, +1 more

TL;DR: It is shown that rare items strongly impact performance and that combining delexicalisation and copying yields the strongest improvement; that copying underperforms for rare and unseen items and that the impact of these two mechanisms greatly varies depending on how the dataset is constructed and on how it is split into train, dev and test.

...read moreread less