Tommi Jauhiainen
Researcher at University of Helsinki
Publications - 39
Citations - 590
Tommi Jauhiainen is an academic researcher at the University of Helsinki. The author has contributed to research on language identification and language models, has an h-index of 12, and has co-authored 33 publications receiving 420 citations.
Papers
Journal ArticleDOI
Automatic Language Identification in Texts: A Survey
TL;DR: This survey of language identification (LI) research introduces a unified notation, covers evaluation methods, applications, and off-the-shelf LI systems that do not require training by the end user, and proposes future directions for research in LI.
Proceedings Article
HeLI, a Word-Based Backoff Method for Language Identification
TL;DR: Describes the Helsinki language identification method, HeLI, and the resources created for and used in the 3rd edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial 2016 workshop.
Proceedings Article
A Report on the VarDial Evaluation Campaign 2020
Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubešić, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri +10 more
TL;DR: The VarDial Evaluation Campaign 2020 included three shared tasks, each focusing on a different challenge of language and dialect identification: Romanian Dialect Identification (RDI), Social Media Variety Geolocation (SMG), and Uralic Language Identification (ULI).
Proceedings Article
HeLI-based Experiments in Swiss German Dialect Identification
TL;DR: The SUKI team's submission using HeLI with adaptive language models obtained the best results in the shared task with a macro F1-score of 0.686, which is clearly higher than the other submitted results.
Proceedings ArticleDOI
A Report on the Third VarDial Evaluation Campaign
Marcos Zampieri, Shervin Malmasi, Yves Scherrer, Tanja Samardžić, Francis M. Tyers, Miikka Silfverberg, Natalia Klyueva, Tung-Le Pan, Chu-Ren Huang, Radu Tudor Ionescu, Andrei M. Butnaru, Tommi Jauhiainen +11 more
TL;DR: Presents the findings of the Third VarDial Evaluation Campaign, organized as part of the sixth edition of the workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with NAACL 2019.