Open Access, Proceedings Article

Findings of the VarDial Evaluation Campaign 2017

TLDR
The VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which was organized as part of the fourth edition of the VarDial workshop at EACL’2017, is presented.
Abstract
We present the results of the VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which we organized as part of the fourth edition of the VarDial workshop at EACL’2017. This year, we included four shared tasks: Discriminating between Similar Languages (DSL), Arabic Dialect Identification (ADI), German Dialect Identification (GDI), and Cross-lingual Dependency Parsing (CLP). A total of 19 teams submitted runs across the four tasks, and 15 of them wrote system description papers.


Citations
Proceedings Article

Identification of Differences between Dutch Language Varieties with the VarDial 2018 Dutch-Flemish Subtitle Data

TL;DR: With the goal of discovering differences between Belgian and Netherlandic Dutch, Team Taurus participated in the Dutch-Flemish Subtitles task of VarDial 2018 using a rather simple marker-based method with a wide range of lexical, lexico-syntactic and syntactic features, and placed second.
Proceedings Article

ZCU-NLP at MADAR 2019: Recognizing Arabic Dialects

TL;DR: This paper presents systems for the MADAR Shared Task on Arabic Fine-Grained Dialect Identification; experiments with recurrent neural networks showed that simpler machine learning algorithms were more successful.
Proceedings Article

Dialect-robust Evaluation of Generated Text

TL;DR: The authors proposed NANO, which introduces regional and language information to the metric's pretraining to improve the robustness of text generation metrics to dialect variation. But their work is limited to English-to-German text generation.
Proceedings Article

German Dialect Identification Using Classifier Ensembles

TL;DR: The GDI classification entry to the second German Dialect Identification (GDI) shared task organized within the scope of the VarDial Evaluation Campaign 2018 is presented, based on SVM classifier ensembles trained on characters and words.

Parsing Approaches for Swiss German

TL;DR: This paper applies different cross-lingual parsing strategies to Swiss German, making use of Standard German resources, and shows around 60% Labelled Attachment Score for all approaches.
References
Proceedings Article

Okapi at TREC

TL;DR: Much of the work involved investigating plausible methods of applying Okapi-style weighting to phrases, and expansion using terms from the top documents retrieved by a pilot search on topic terms was used.
Proceedings Article

Parallel Data, Tools and Interfaces in OPUS

TL;DR: New data sets and their features, additional annotation tools and models provided from the website and essential interfaces and on-line services included in the OPUS project are reported.
Proceedings Article

Universal Dependencies v1: A Multilingual Treebank Collection

TL;DR: This paper describes v1 of the universal guidelines, the underlying design principles, and the currently available treebanks for 33 languages, and highlights the need for sound comparative evaluation and cross-lingual learning experiments.
Proceedings Article

Universal Dependency Annotation for Multilingual Parsing

TL;DR: A new collection of treebanks with homogeneous syntactic dependency annotation for six languages: German, English, Swedish, Spanish, French and Korean is presented, made freely available in order to facilitate research on multilingual dependency parsing.
Journal Article

Bootstrapping parsers via syntactic projection across parallel texts

TL;DR: Parallel text is used to help solve the problem of creating syntactic annotation in more languages: annotate the English side of a parallel corpus, project the analysis to the second language, and train a stochastic analyzer on the resulting noisy annotations.