Loïc Barrault
Publications - 12
Citations - 292
Loïc Barrault is an academic researcher who has contributed to research on topics including computer science and speaker diarisation. The author has an h-index of 5 and has co-authored 12 publications receiving 292 citations.
Papers
Journal Article
No Language Left Behind: Scaling Human-Centered Machine Translation
NLLB Team,Marta R. Costa-jussà,James Cross,Onur Çelebi,Maha Elbayad,Kenneth Heafield,Kevin Heffernan,Elahe Kalbassi,Janice Si-Man Lam,Daniel Licht,Jean Maillard,Anna Sun,Skyler Wang,Guillaume Wenzek,Alison Youngblood,Bapi Akula,Loïc Barrault,Gabriel Mejia Gonzalez,Prangthip Hansanti,John Hoffman,Semarley Jarrett,Kaushik Ram Sadagopan,Dirk Rowe,Shannon Spruit,Chau Tran,Pierre Andrews,Necip Fazil Ayan,Shruti Bhosale,Sergey Edunov,Angela Fan,Cynthia Gao,Vedanuj Goswami,Francisco Guzmán,Philipp Koehn,Alexandre Mourachko,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Jeff Wang +38 more
TL;DR: A conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages is developed, laying important groundwork towards realizing a universal translation system.
Proceedings Article
Findings of the IWSLT 2022 Evaluation Campaign
Antonios Anastasopoulos,Loïc Barrault,Luisa Bentivogli,Marcely Zanon Boito,Ondřej Bojar,Roldano Cattoni,Anna Currey,Georgiana Dinu,Kevin K. Duh,Maha Elbayad,Clara Emmanuel,Yannick Estève,Margarita Frederico,Christian Federmann,Souhir Gahbiche,Hongyu Gong,Roman Grundkiewicz,Barry Haddow,Benjamin Hsu,Dávid Javorský,Věra Kloudová,Surafel Melaku Lakew,Xutai Ma,Prashant Mathur,Paul McNamee,Kenton Murray,Maria Nadejde,Satoshi Nakamura,M. Cristina Negri,Jan Niehues,Xing Niu,John Ortega,Juan Pino,Elizabeth Salesky,Jiatong Shi,Matthias Sperber,Sebastian Stüker,K. Sudoh,Marco Turchi,Yogesh Virkar,Alex Waibel,Chang Wang,Shinji Watanabe +42 more
TL;DR: For each shared task of the 19th International Conference on Spoken Language Translation, the purpose of the task, the data that were released, the evaluation metrics that were applied, the submissions that were received and the results that were achieved are detailed.
Proceedings Article
Speech Resources in the Tamasheq Language
Marcely Zanon Boito,Fethi Bougares,Florentin Barbier,Souhir Gahbiche,Loïc Barrault,Mickael Rouvier,Yannick Estève +6 more
TL;DR: This paper shares a massive amount of unlabeled audio data (671 hours) in five languages: French from Niger, Fulfulde, Hausa, Tamasheq and Zarma, and a smaller 17-hour parallel corpus of audio recordings in Tamasheq, with utterance-level translations in the French language.
Proceedings Article
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
TL;DR: The authors propose a method that evaluates the percentage of the source contribution to a generated translation. Hallucinations are translations that are disconnected from the source, hence they can be identified by low source contribution.
Proceedings Article
ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks
Marcely Zanon Boito,John Ortega,Hugo Riguidel,Antoine Laurent,Loïc Barrault,Fethi Bougares,Firas Chaabani,Ha Nguyen,Florentin Barbier,Souhir Gahbiche,Yannick Estève +10 more
TL;DR: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2022: low-resource and dialect speech translation. It highlights that self-supervised models trained on smaller sets of target data are more effective for low-resource end-to-end ST fine-tuning than large off-the-shelf models.