A Generative Model of Phonotactics

doi:10.1162/TACL_A_00047

Open Access, Journal Article

Richard Futrell, +3 more

27 Feb 2017, Transactions of the Association for Comp..., Vol. 5, Iss. 1, pp. 73-86

TLDR

The paper presents a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a language. The full model robustly assigns higher probabilities to held-out forms than a sophisticated N-gram model for all 14 languages evaluated.

Abstract:

We present a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a language. Unlike most computational models of phonotactics (Hayes and Wilson, 2008; Goldsmith and Riggle, 2012), we take a fully generative approach, modeling a process where forms are built up out of subparts by phonologically-informed structure building operations. We learn an inventory of subparts by applying stochastic memoization (Johnson et al., 2006; Goodman et al., 2008) to a generative process for phonemes structured as an and-or graph, based on concepts of feature hierarchy from generative phonology (Clements, 1985; Dresher, 2009). Subparts are combined in a way that allows tier-based feature interactions. We evaluate our models’ ability to capture phonotactic distributions in the lexicons of 14 languages drawn from the WOLEX corpus (Graff, 2012). Our full model robustly assigns higher probabilities to held-out forms than a sophisticated N-gram model for all languages. We also present novel analyses that probe model behavior in more detail.
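
The stochastic memoization mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' model: it assumes a simple Pólya-urn (Chinese restaurant process) reuse scheme with a concentration parameter `alpha`, and the names `stochastic_memoize`, `make_syllable_sampler`, and the toy onset and vowel inventories are hypothetical, chosen only for illustration. Each call either reuses a previously generated subpart, with probability proportional to how often it has been used, or builds a fresh one from the base process.

```python
import random


def stochastic_memoize(base, alpha, rng):
    """Wrap `base` so that calls tend to reuse earlier outputs.

    Assumed scheme (Polya urn): after n calls, a fresh value is drawn
    from `base` with probability alpha / (n + alpha); otherwise a past
    output is reused, chosen in proportion to how often it was returned.
    Requires alpha > 0.
    """
    history = []  # every value returned so far, with repeats

    def sample():
        n = len(history)
        if rng.random() < alpha / (n + alpha):
            value = base()  # build a new subpart from scratch
        else:
            value = rng.choice(history)  # reuse a stored subpart
        history.append(value)
        return value

    return sample


def make_syllable_sampler(rng):
    """Toy base process: an onset consonant followed by a vowel."""
    onsets = ["p", "t", "k", "s", "m"]  # hypothetical inventory
    vowels = ["a", "i", "u"]  # hypothetical inventory

    def base():
        return rng.choice(onsets) + rng.choice(vowels)

    return base


rng = random.Random(0)
sample_subpart = stochastic_memoize(make_syllable_sampler(rng), alpha=1.0, rng=rng)
draws = [sample_subpart() for _ in range(200)]
```

With a small `alpha`, a handful of subparts dominate `draws`, which is the "rich get richer" behavior that lets an inventory of frequently reused subparts emerge from the data.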

Citations

Grundzüge der Phonologie

Phonotactic Complexity and Its Trade-offs

Miller's monkey updated: Communicative efficiency and the statistics of words in natural language

Why do human languages have homophones

References

The Sound Pattern of English

A Bayesian Analysis of Some Nonparametric Problems

A Constructive Definition of Dirichlet Priors

The geometry of phonological features

Related Papers (5)

A Maximum Entropy Model of Phonotactics and Phonotactic Learning

The Sound Pattern of English

Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition

Explaining sonority projection effects

Some controversial questions in phonological theory