A
Anastasia Shimorina
Researcher at Centre national de la recherche scientifique
Publications - 20
Citations - 935
Anastasia Shimorina is an academic researcher from Centre national de la recherche scientifique. The author has contributed to research in topics: Natural language generation & Machine translation. The author has an hindex of 10, co-authored 18 publications receiving 641 citations.
Papers
More filters
Proceedings ArticleDOI
Creating Training Corpora for NLG Micro-Planners
TL;DR: This paper proposes the corpus generation framework as a novel method for creating challenging data sets from which NLG models can be learned which are capable of handling the complex interactions occurring during in micro-planning between lexicalisation, aggregation, surface realisation, referring expression generation and sentence segmentation.
Proceedings ArticleDOI
The WebNLG Challenge: Generating Text from RDF Data
TL;DR: The microplanning task is introduced, data preparation, evaluation methodology, participant results and a brief description of the participating systems are provided.
Proceedings ArticleDOI
Split and Rephrase
TL;DR: This paper proposed a new sentence simplification task (Split-and-Rephrase) where the aim is to split a complex sentence into a meaning preserving sequence of shorter sentences, which can be used as a preprocessing step which facilitates and improves the performance of parsers, semantic role labelers and machine translation systems.
Posted Content
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.
Sebastian Gehrmann,Tosin P. Adewumi,Karmanya Aggarwal,Pawan Sasanka Ammanamanchi,Aremu Anuoluwapo,Antoine Bosselut,Khyathi Raghavi Chandu,Miruna Clinciu,Dipanjan Das,Kaustubh Dhole,Wanyu Du,Esin Durmus,Ondřej Dušek,Chris Chinenye Emezue,Varun Gangal,Cristina Garbacea,Tatsunori Hashimoto,Yufang Hou,Yacine Jernite,Harsh Jhamtani,Yangfeng Ji,Shailza Jolly,Mihir Kale,Dhruv Kumar,Faisal Ladhak,Aman Madaan,Mounica Maddela,Khyati Mahajan,Saad Mahamood,Bodhisattwa Prasad Majumder,Pedro Henrique Martins,Angelina McMillan-Major,Simon Mille,Emiel van Miltenburg,Moin Nadeem,Shashi Narayan,Vitaly Nikolaev,Rubungo Andre Niyongabo,Salomey Osei,Ankur P. Parikh,Laura Perez-Beltrachini,Niranjan Ramesh Rao,Vikas Raunak,Juan Diego Rodriguez,Sashank Santhanam,João Sedoc,Thibault Sellam,Samira Shaikh,Anastasia Shimorina,Marco Antonio Sobrevilla Cabezudo,Hendrik Strobelt,Nishant Subramani,Wei Xu,Diyi Yang,Akhila Yerukola,Jiawei Zhou +55 more
TL;DR: GEM as discussed by the authors is a living benchmark for natural language generation (NLG), its Evaluation and Metrics, which provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested.
Proceedings ArticleDOI
Handling Rare Items in Data-to-Text Generation
TL;DR: It is shown that rare items strongly impact performance and that combining delexicalisation and copying yields the strongest improvement; that copying underperforms for rare and unseen items and that the impact of these two mechanisms greatly varies depending on how the dataset is constructed and on how it is split into train, dev and test.