scispace - formally typeset
T

Tommi Jauhiainen

Researcher at University of Helsinki

Publications -  39
Citations -  590

Tommi Jauhiainen is an academic researcher from University of Helsinki. The author has contributed to research in topics: Language identification & Language model. The author has an hindex of 12, co-authored 33 publications receiving 420 citations.

Papers
More filters
Journal ArticleDOI

Automatic Language Identification in Texts: A Survey

TL;DR: A unified notation is introduced for evaluation methods, applications, as well as off-the-shelf LI systems that do not require training by the end user, to propose future directions for research in LI.
Proceedings Article

HeLI, a Word-Based Backoff Method for Language Identification

TL;DR: The Helsinki language identification method, HeLI, and the resources it created for and used in the 3rd edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial 2016 workshop are described.
Proceedings Article

A Report on the VarDial Evaluation Campaign 2020

TL;DR: The VarDial Evaluation Campaign 2020 included three shared tasks each focusing on a different challenge of language and dialect identification: Romanian Dialect Identification (RDI), Social Media Variety Geolocation (SMG), and Uralic Language Identification (ULI).
Proceedings Article

HeLI-based Experiments in Swiss German Dialect Identification

TL;DR: The SUKI team's submission using HeLI with adaptive language models obtained the best results in the shared task with a macro F1-score of 0.686, which is clearly higher than the other submitted results.
Proceedings ArticleDOI

A Report on the Third VarDial Evaluation Campaign

TL;DR: The findings of the Third VarDial Evaluation Campaign organized as part of the sixth edition of the workshop on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects (VarDial), co-located with NAACL 2019 are presented.