Open Access Proceedings Article

Findings of the VarDial Evaluation Campaign 2017

TLDR
The VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which was organized as part of the fourth edition of the VarDial workshop at EACL’2017, is presented.
Abstract
We present the results of the VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects, which we organized as part of the fourth edition of the VarDial workshop at EACL’2017. This year, we included four shared tasks: Discriminating between Similar Languages (DSL), Arabic Dialect Identification (ADI), German Dialect Identification (GDI), and Cross-lingual Dependency Parsing (CLP). A total of 19 teams submitted runs across the four tasks, and 15 of them wrote system description papers.

Citations
Proceedings Article

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

TL;DR: The paper defines the task and evaluation methodology, describes how the data sets were prepared, reports and analyzes the main results, and provides a brief categorization of the different approaches of the participating systems.
Proceedings Article

Fine-Grained Arabic Dialect Identification

TL;DR: This paper presents the first results on a fine-grained dialect classification task covering 25 specific cities from across the Arab World, in addition to Standard Arabic, and builds several classification systems and explores a large space of features.
Proceedings Article

CAMeL Tools: An Open Source Python Toolkit for Arabic Natural Language Processing

TL;DR: The design of CAMeL Tools and the functionalities it provides are described, including utilities for pre-processing, morphological modeling, dialect identification, named entity recognition, and sentiment analysis.
Proceedings Article

The MADAR Shared Task on Arabic Fine-Grained Dialect Identification

TL;DR: This shared task is the first to target a large set of dialect labels at the city and country levels and was organized as part of The Fourth Arabic Natural Language Processing Workshop, collocated with ACL 2019.
References
Proceedings Article

ArchiMob - A Corpus of Spoken Swiss German

TL;DR: A bootstrapping approach to automatic normalisation using different machine-translation-inspired methods is presented, and the performance of part-of-speech taggers on the authors' data is evaluated to show how the same bootstrapping approach improves part-of-speech tagging by 10% over four rounds.
Journal Article

Overview of TweetLID: Tweet Language Identification at SEPLN 2014

TL;DR: This article presents a summary of the TweetLID shared task and workshop held at SEPLN 2014, which briefly summarizes the data collection and annotation process, the development and evaluation of the shared task, as well as the results achieved by the participants.
Proceedings Article

The NRC System for Discriminating Similar Languages

TL;DR: This work describes the system built by the National Research Council Canada for the “Discriminating between Similar Languages” (DSL) shared task, which uses various statistical classifiers and makes predictions based on a two-stage process to reach the best performance among all systems submitted to the open and closed tasks.

Cross-Lingual Dependency Parsing with Universal Dependencies and Predicted PoS Labels

TL;DR: This paper quantifies the differences that can be observed when gold-standard PoS labels are replaced with predicted ones; the results should inform application developers who rely on cross-lingual models that have not been tested in real-life settings.
Posted Content

Discriminating Similar Languages: Evaluations and Explorations

TL;DR: An analysis of the performance of machine learning classifiers on discriminating between similar languages and language varieties is presented and an upper bound on possible performance using ensemble and oracle combination is estimated.