Open Access · Journal Article · DOI

Fine-Grained Prediction of Syntactic Typology: Discovering Latent Structure with Supervised Learning

TL;DR: This article uses a large collection of realistic synthetic languages as training data to predict how often direct objects follow their verbs, how often adjectives follow their nouns, and in general the directionalities of all dependency relations.
Abstract
We show how to predict the basic word-order facts of a novel language given only a corpus of part-of-speech (POS) sequences. We predict how often direct objects follow their verbs, how often adjectives follow their nouns, and in general the directionalities of all dependency relations. Such typological properties could be helpful in grammar induction. While this problem is usually regarded as unsupervised learning, our innovation is to treat it as supervised learning, using a large collection of realistic synthetic languages as training data. The supervised learner must identify surface features of a language’s POS sequence (hand-engineered or neural features) that correlate with the language’s deeper structure (latent trees). Our experiments show that: 1) given a small set of real languages, it helps to add many synthetic languages to the training data; 2) our system is robust even when the POS sequences include noise; and 3) on this task our system outperforms a grammar induction baseline by a large margin.
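To make the setup concrete, here is a minimal sketch of typology prediction as supervised learning. Everything here is an illustrative assumption, not the paper's actual system: the feature set, the toy corpora, and the helper surface_features are hypothetical, and scikit-learn stands in for whatever learner the paper uses. The key idea it illustrates is that each training example is an entire language: surface statistics of its POS sequences map to the gold directionality of one dependency relation (e.g., the fraction of direct objects that follow their verb), with synthetic languages supplying the training set.

# Minimal sketch: one (language -> directionality) example per training row.
from collections import Counter
from sklearn.linear_model import Ridge

TAGS = ["NOUN", "VERB", "ADJ", "ADP"]

def surface_features(corpus):
    # Directional co-occurrence proportions: how often tag a precedes tag b
    # within a sentence. A crude stand-in for the paper's richer
    # hand-engineered (or neural) features of the POS sequence.
    counts = Counter()
    for sent in corpus:
        for i, a in enumerate(sent):
            for b in sent[i + 1:]:
                counts[(a, b)] += 1
    total = sum(counts.values()) or 1
    return [counts[(a, b)] / total for a in TAGS for b in TAGS]

# Toy synthetic "languages" with known gold directionality (hypothetical data).
vo_lang = [["NOUN", "VERB", "NOUN"], ["NOUN", "VERB", "ADJ", "NOUN"]]
ov_lang = [["NOUN", "NOUN", "VERB"], ["NOUN", "ADJ", "NOUN", "VERB"]]
X = [surface_features(vo_lang), surface_features(ov_lang)]
y = [0.9, 0.1]  # gold fraction of direct objects that follow their verb

model = Ridge(alpha=1.0).fit(X, y)

# A novel language is typed from its POS corpus alone, with no trees in sight.
novel = [["NOUN", "VERB", "NOUN"], ["NOUN", "VERB", "ADJ", "NOUN"]]
print(model.predict([surface_features(novel)]))  # -> close to 0.9

The design point is that the learner never sees trees at test time: it must read the directionality off surface POS statistics, which is what the synthetic training languages teach it to do.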



Citations
Journal Article · DOI

Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing

TL;DR: The survey shows that, to date, the use of information in existing typological databases has yielded consistent but modest improvements in system performance, owing both to intrinsic limitations of the databases and to under-employment of the typological features they include.
Journal Article · DOI

PhaseLink: A Deep Learning Approach to Seismic Phase Association

TL;DR: This work presents PhaseLink, a grid-free deep-learning framework for earthquake phase association, trained on tens of millions of synthetic sequences of P- and S-wave arrival times generated with a simple 1D velocity model; it is expected to improve the resolution of seismicity catalogs, add stability to real-time seismic monitoring, and streamline automated processing of large seismic data sets.
Proceedings Article · DOI

On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing

TL;DR: The authors compare encoders and decoders based on Recurrent Neural Networks (RNNs) with modified self-attentive architectures for cross-lingual transfer, positing that an order-agnostic model transfers better to distant languages: RNN-based architectures transfer well to languages close to English, while self-attentive models have better overall cross-lingual transferability and perform especially well on distant languages.
References
Journal Article · DOI

Long Short-Term Memory

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
Proceedings Article

Understanding the difficulty of training deep feedforward neural networks

TL;DR: The objective is to understand why standard gradient descent from random initialization performs so poorly with deep neural networks, in order to better explain recent relative successes and help design better algorithms in the future.
Book

Typology and Universals

TL;DR: The book presents a comprehensive introduction to the method and theory used in studying typology and universals, providing students and researchers with extensive examples of language universals in phonology, morphology, syntax, and semantics.
Journal Article · DOI

Applications of stochastic context-free grammars using the Inside-Outside algorithm

TL;DR: Describes two speech-recognition applications of stochastic context-free grammars trained automatically via the Inside-Outside algorithm: the grammars are used to model VQ-encoded speech for isolated word recognition and are compared directly to HMMs on the same task.
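For orientation, the Inside-Outside algorithm referenced above estimates the rule probabilities of a stochastic context-free grammar by EM; its core is the inside recursion, given here in standard textbook form for a grammar in Chomsky normal form (this is the classical formulation, not notation taken from the paper):

\beta(A, i, i) = P(A \to w_i)
\beta(A, i, j) = \sum_{A \to B\,C} \sum_{k=i}^{j-1} P(A \to B\,C)\, \beta(B, i, k)\, \beta(C, k+1, j)

where \beta(A, i, j) is the probability that nonterminal A derives the span w_i ... w_j. Outside probabilities are computed by an analogous top-down pass, and expected rule counts from the two yield the EM update for the rule probabilities.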