M
Milan Straka
Researcher at Charles University in Prague
Publications - 80
Citations - 3412
Milan Straka is an academic researcher from Charles University in Prague. The author has contributed to research in topics: Treebank & Czech. The author has an hindex of 20, co-authored 76 publications receiving 2655 citations.
Papers
More filters
Proceedings ArticleDOI
Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe
Milan Straka,Jana Straková +1 more
TL;DR: An update to UDPipe 1.0 (Straka et al., 2016), a trainable pipeline which performs sentence segmentation, tokenization, POS tagging, lemmatization and dependency parsing, which provides models for all 50 languages of UD 2.0.
Proceedings Article
CoNLL 2018 Shared Task : Multilingual Parsing from Raw Text to Universal Dependencies
Daniel Zeman,Jan Hajič,Martin Popel,Martin Potthast,Milan Straka,Filip Ginter,Joakim Nivre,Slav Petrov +7 more
TL;DR: This overview paper defines the task and the updated evaluation methodology, describes data preparation, report and analyze the main results, and provides a brief categorization of the different approaches of the participating systems.
Proceedings Article
UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing
TL;DR: UDPipe, a pipeline processing CoNLL-U-formatted files, performs tokenization, morphological analysis, part-of-speech tagging, lemmatization and dependency parsing for nearly all treebanks of Universal Dependencies 1.2.
Proceedings ArticleDOI
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
Daniel Zeman,Martin Popel,Milan Straka,Jan Hajič,Joakim Nivre,Filip Ginter,Juhani Luotolahti,Sampo Pyysalo,Slav Petrov,Martin Potthast,Francis M. Tyers,Elena Badmaeva,Memduh Gökırmak,Anna Nedoluzhko,Silvie Cinková,Jaroslava Hlaváčová,Václava Kettnerová,Zdenka Uresova,Jenna Kanerva,Stina Ojala,Anna Missilä,Christopher D. Manning,Sebastian Schuster,Siva Reddy,Dima Taji,Nizar Habash,Herman Leung,Marie-Catherine de Marneffe,Manuela Sanguinetti,Maria Simi,Hiroshi Kanayama,Valeria dePaiva,Kira Droganova,Héctor Martínez Alonso,Ça ugrı Çöltekin,Umut Sulubacak,Hans Uszkoreit,Vivien Macketanz,Aljoscha Burchardt,Kim Harris,Katrin Marheinecke,Georg Rehm,Tolga Kayadelen,Mohammed Attia,Ali Elkahky,Zhuoran Yu,Emily Pitler,Saran Lertpradit,Michael Mandl,Jesse Kirchner,Hector Fernandez Alcalde,Jana Strnadová,Esha Banerjee,Ruli Manurung,Antonio Stella,Atsuko Shimada,Sookyoung Kwak,Gustavo Mendonça,Tatiana Lando,Rattima Nitisaroj,Josie Li +60 more
TL;DR: The task and evaluation methodology is defined, how the data sets were prepared, report and analyze the main results, and a brief categorization of the different approaches of the participating systems are provided.
Proceedings ArticleDOI
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk,Milan Straka +1 more
TL;DR: It is found that fine-tuning a multilingual BERT self-attention model pretrained on 104 languages can meet or exceed state-of-the-art UPOS, UFeats, Lemmas, (and especially) UAS, and LAS scores, without requiring any recurrent or language-specific components.