Unsupervised learning of vowel categories from infant-directed speech

doi:10.1073/PNAS.0705369104

Gautam K. Vallabha, +4 more

14 Aug 2007

Proceedings of the National Academy of S...

Vol. 104, Iss. 33, pp. 13273-13278

TLDR

An algorithm, based on Expectation–Maximization, is presented here for learning the categories from a sequence of vowel tokens without receiving any category information with each vowel token, or knowing in advance the number of categories to learn, or having access to the entire data ensemble.

Abstract:

Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation–Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (/ɪ, i, ɛ, e/ for English, /i, iː, e, eː/ for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, was also able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.

Keywords: language acquisition, speech perception, expectation maximization, online learning
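The three constraints listed in the abstract (no labels, unknown number of categories, one token at a time) can be illustrated with a small sketch. This is not the authors' algorithm: it is a generic online, EM-style update for a one-dimensional Gaussian mixture that starts with more components than needed and prunes those whose mixing weight decays. The function name `learn_categories_online`, all parameter names and defaults, and the synthetic two-category token stream are assumptions made for illustration only.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def learn_categories_online(tokens, init_range, init_var, n_init=8,
                            lr=0.02, prune_below=0.02, min_var=1e-6, seed=0):
    """Hypothetical online mixture learner: one pass, no labels, no fixed category count."""
    rng = random.Random(seed)
    # Start with more candidate categories than expected; weak ones are pruned later.
    comps = [{"w": 1.0 / n_init, "mean": rng.uniform(*init_range), "var": init_var}
             for _ in range(n_init)]
    for x in tokens:  # each token is seen once and then discarded
        # E-step for a single token: responsibility of each component.
        likes = [c["w"] * gaussian_pdf(x, c["mean"], c["var"]) for c in comps]
        total = sum(likes)
        if total <= 0.0:
            continue  # token is numerically unexplained by every component; skip it
        for c, like in zip(comps, likes):
            r = like / total
            delta = x - c["mean"]
            # Incremental M-step: nudge each parameter in proportion to r.
            c["w"] += lr * (r - c["w"])
            c["mean"] += lr * r * delta
            c["var"] = max(c["var"] + lr * r * (delta * delta - c["var"]), min_var)
        # Competition: drop components whose mixing weight has decayed, then renormalize.
        survivors = [c for c in comps if c["w"] >= prune_below]
        comps = survivors or [max(comps, key=lambda c: c["w"])]
        norm = sum(c["w"] for c in comps)
        for c in comps:
            c["w"] /= norm
    return comps


if __name__ == "__main__":
    # Synthetic stream with two categories (values are illustrative, not from the paper).
    rng = random.Random(1)
    stream = (rng.gauss(300.0, 30.0) if rng.random() < 0.5 else rng.gauss(600.0, 30.0)
              for _ in range(5000))
    for c in learn_categories_online(stream, init_range=(200.0, 700.0), init_var=2500.0):
        print(round(c["w"], 3), round(c["mean"], 1), round(c["var"], 1))
```

Because the token stream is consumed once through a generator, the learner never holds the full data set, which mirrors constraint (iii). How many components survive depends on the learning rate, the pruning threshold, and the initialization, so the sketch makes no claim about recovering a particular number of categories.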
