Unsupervised learning of vowel categories from infant-directed speech

Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distrib...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings of the National Academy of Sciences - PNAS Vol. 104; no. 33; pp. 13273 - 13278
Main Authors	Vallabha, Gautam K, McClelland, James L, Pons, Ferran, Werker, Janet F, Amano, Shigeaki
Format	Journal Article
Language	English
Published	United States National Academy of Sciences 14.08.2007 National Acad Sciences
Subjects	Algorithms artificial intelligence Aural learning Babies Baby talk Biological Sciences Covariance matrices Gaussian distributions Humans Infant Infants Learning Learning rate Neural networks Online learning Perceptual learning Speech Training Vowels
Online Access	Get full text
ISSN	0027-8424 1091-6490 1091-6490
DOI	10.1073/pnas.0705369104

Cover

More Information
Summary:	Infants rapidly learn the sound categories of their native language, even though they do not receive explicit or focused training. Recent research suggests that this learning is due to infants' sensitivity to the distribution of speech sounds and that infant-directed speech contains the distributional information needed to form native-language vowel categories. An algorithm, based on Expectation-Maximization, is presented here for learning the categories from a sequence of vowel tokens without (i) receiving any category information with each vowel token, (ii) knowing in advance the number of categories to learn, or (iii) having access to the entire data ensemble. When exposed to vowel tokens drawn from either English or Japanese infant-directed speech, the algorithm successfully discovered the language-specific vowel categories (/I, i, ε, e/ for English, /i, i{tricolon}, e, e{tricolon}/ for Japanese). A nonparametric version of the algorithm, closely related to neural network models based on topographic representation and competitive Hebbian learning, also was able to discover the vowel categories, albeit somewhat less reliably. These results reinforce the proposal that native-language speech categories are acquired through distributional learning and that such learning may be instantiated in a biologically plausible manner.
Bibliography:	SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 Contributed by James L. McClelland, June 16, 2007 Author contributions: G.K.V. and J.L.M. designed research; G.K.V. performed research; F.P., J.F.W., and S.A. contributed new reagents/analytic tools; F.P., J.F.W., and S.A. analyzed data; and G.K.V. and J.L.M. wrote the paper.
ISSN:	0027-8424 1091-6490 1091-6490
DOI:	10.1073/pnas.0705369104