Open Access Proceedings Article

Universal Dependency Annotation for Multilingual Parsing

TLDR
A new collection of treebanks with homogeneous syntactic dependency annotation for six languages (German, English, Swedish, Spanish, French, and Korean) is presented and made freely available to facilitate research on multilingual dependency parsing.
Abstract
We present a new collection of treebanks with homogeneous syntactic dependency annotation for six languages: German, English, Swedish, Spanish, French, and Korean. To show the usefulness of such a resource, we present a case study of cross-lingual transfer parsing with more reliable evaluation than has been possible before. This ‘universal’ treebank is made freely available in order to facilitate research on multilingual dependency parsing.



Citations
Proceedings Article

Universal Dependencies v1: A Multilingual Treebank Collection

TL;DR: This paper describes v1 of the universal guidelines, the underlying design principles, and the currently available treebanks for 33 languages, as well as highlighting the needs for sound comparative evaluation and cross-lingual learning experiments.
Book

Neural Network Methods in Natural Language Processing

TL;DR: Neural networks are a family of powerful machine learning models that have been widely used in natural language processing applications such as machine translation, syntactic parsing, and multi-task learning.
Proceedings Article

CamemBERT: a Tasty French Language Model

TL;DR: This paper investigates the feasibility of training monolingual Transformer-based language models for languages other than English, taking French as an example and evaluating the models on part-of-speech tagging, dependency parsing, named entity recognition, and natural language inference tasks.
Proceedings Article

Universal Stanford dependencies: A cross-linguistic typology

TL;DR: This work proposes a two-layered taxonomy: a set of broadly attested universal grammatical relations, to which language-specific relations can be added. It retains the lexicalist stance of Stanford Dependencies, which leads to a particular, partially new treatment of compounding, prepositions, and morphology.
Proceedings Article

Low Resource Dependency Parsing: Cross-lingual Parameter Sharing in a Neural Network Parser

TL;DR: This work proposes a learning method that needs less data, based on the observation that there are underlying shared structures across languages, and exploits cues from a different source language in order to guide the learning process.
References
Book

Dependency Parsing

TL;DR: This book surveys the three major classes of parsing models that are in current use: transition-based, graph-based, and grammar-based models, and gives a thorough introduction to the methods that are most widely used today.
Book Chapter

The Prague Dependency Treebank

TL;DR: Inspired by the Penn Treebank, the most widely used syntactically annotated corpus of English, this work decided to develop a similarly sized corpus of Czech with a rich annotation scheme.
Journal Article

Bootstrapping parsers via syntactic projection across parallel texts

TL;DR: This work uses parallel text to help solve the problem of creating syntactic annotation in more languages: the English side of a parallel corpus is annotated, the analysis is projected to the second language, and a stochastic analyzer is trained on the resulting noisy annotations.
Proceedings Article

Transition-based Dependency Parsing with Rich Non-local Features

TL;DR: This paper shows that the accuracy of transition-based dependency parsers can be improved by considering even richer feature sets than those employed in previous systems, improving accuracy in the standard Penn Treebank setup and rivaling the best overall results.
Proceedings Article

Multi-Source Transfer of Delexicalized Dependency Parsers

TL;DR: This work demonstrates that delexicalized parsers can be directly transferred between languages, producing significantly higher accuracies than unsupervised parsers and shows that simple methods for introducing multiple source languages can significantly improve the overall quality of the resulting parsers.