Universal Dependency Annotation for Multilingual Parsing

Open AccessProceedings Article

Universal Dependency Annotation for Multilingual Parsing

Ryan McDonald, +12 more

- Vol. 2, pp 92-97

Chats0

TLDR

A new collection of treebanks with homogeneous syntactic dependency annotation for six languages: German, English, Swedish, Spanish, French and Korean is presented, made freely available in order to facilitate research on multilingual dependency parsing.

Abstract:

We present a new collection of treebanks with homogeneous syntactic dependency annotation for six languages: German, English, Swedish, Spanish, French and Korean. To show the usefulness of such a resource, we present a case study of crosslingual transfer parsing with more reliable evaluation than has been possible before. This ‘universal’ treebank is made freely available in order to facilitate research on multilingual dependency parsing. 1

Citations

PDF

Open Access

More filters

Multi-source Cross-lingual Delexicalized Parser Transfer: Prague or Stanford?

Rudolf Rosa

TL;DR: It is found that best results can be obtained by parsing the target sentences with parsers trained on treebanks using both of the adposition annotation styles in parallel, and combining all the resulting parse trees together after having converted them to the Stanford adposition style.

...read moreread less

Proceedings ArticleDOI

Baseline Models for Pronoun Prediction and Pronoun-Aware Translation

Jörg Tiedemann

TL;DR: B baseline models for the cross-lingual pronoun prediction task and the pronoun-focused translation task at DiscoMT 2015 are presented and the impact of various contextual features on the prediction performance is discussed.

...read moreread less

Proceedings ArticleDOI

Automatic Selection of Context Configurations for Improved Class-Specific Word Representations

Ivan Vulić, +4 more

TL;DR: The authors proposed a simple yet effective framework for an automatic selection of class-specific context configurations, based on universal dependency relations between words, and efficiently search this space with an adapted beam search algorithm.

...read moreread less

Proceedings ArticleDOI

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, +1 more

TL;DR: New methods to automatically assess the similarity of sequence tagging datasets to identify beneficial auxiliary data for MTL or TL setups are proposed and empirically show that their similarity measures correlate with the change in test score of neural networks that use the auxiliary dataset for M TL to increase the main task performance.

...read moreread less

Proceedings ArticleDOI

Automatic Proposition Extraction from Dependency Trees: Helping Early Prediction of Alzheimer's Disease from Narratives

Andre Luiz Verucci da Cunha, +3 more

TL;DR: This work proposes a novel approach to obtaining the ID automatically from a text using an automation of Chand et al.'s ID manual, and consists of a rule-based system acting upon dependency trees.

...read moreread less

Collapse

References

PDF

Open Access

More filters

ReportDOI

Building a large annotated corpus of English: the penn treebank

Mitchell Marcus, +2 more

- 01 Jun 1993 -

Computational Linguistics

TL;DR: As a result of this grant, the researchers have now published on CDROM a corpus of over 4 million words of running text annotated with part-of- speech (POS) tags, which includes a fully hand-parsed version of the classic Brown corpus.

...read moreread less

Proceedings ArticleDOI

Accurate Unlexicalized Parsing

Dan Klein, +1 more

TL;DR: It is demonstrated that an unlexicalized PCFG can parse much more accurately than previously shown, by making use of simple, linguistically motivated state splits, which break down false independence assumptions latent in a vanilla treebank grammar.

...read moreread less

Proceedings Article

Generating Typed Dependency Parses from Phrase Structure Parses

Marie-Catherine de Marneffe, +2 more

TL;DR: A system for extracting typed dependency parses of English sentences from phrase structure parses that captures inherent relations occurring in corpus texts that can be critical in real-world applications is described.

...read moreread less

Proceedings ArticleDOI

CoNLL-X Shared Task on Multilingual Dependency Parsing

Sabine Buchholz, +1 more

TL;DR: How treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured is described and general conclusions about multi-lingual parsing are drawn.

...read moreread less

Proceedings ArticleDOI

The Stanford Typed Dependencies Representation

Marie-Catherine de Marneffe, +1 more

TL;DR: This paper examines the Stanford typed dependencies representation, which was designed to provide a straightforward description of grammatical relations for any user who could benefit from automatic text understanding, and considers the underlying design principles of the Stanford scheme.

...read moreread less

Collapse

Related Papers (5)

Building a large annotated corpus of English: the penn treebank

Mitchell Marcus, +2 more

- 01 Jun 1993 -

Computational Linguistics

Universal Dependency Annotation for Multilingual Parsing

Citations

Multi-source Cross-lingual Delexicalized Parser Transfer: Prague or Stanford?

Baseline Models for Pronoun Prediction and Pronoun-Aware Translation

Automatic Selection of Context Configurations for Improved Class-Specific Word Representations

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Automatic Proposition Extraction from Dependency Trees: Helping Early Prediction of Alzheimer's Disease from Narratives

References

Building a large annotated corpus of English: the penn treebank

Accurate Unlexicalized Parsing

Generating Typed Dependency Parses from Phrase Structure Parses

CoNLL-X Shared Task on Multilingual Dependency Parsing

The Stanford Typed Dependencies Representation

Related Papers (5)

Building a large annotated corpus of English: the penn treebank

CoNLL-X Shared Task on Multilingual Dependency Parsing

Universal Dependencies v1: A Multilingual Treebank Collection

The Stanford Typed Dependencies Representation

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding