scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Wordbank: an open repository for developmental vocabulary data.

01 May 2017-Journal of Child Language (J Child Lang)-Vol. 44, Iss: 3, pp 677-694
TL;DR: Wordbank as mentioned in this paper is a structured database of parent-report data combined with a browsable web interface for exploring patterns of vocabulary growth at the level of both individual children and particular words.
Abstract: The MacArthur-Bates Communicative Development Inventories (CDIs) are a widely used family of parent-report instruments for easy and inexpensive data-gathering about early language acquisition. CDI data have been used to explore a variety of theoretically important topics, but, with few exceptions, researchers have had to rely on data collected in their own lab. In this paper, we remedy this issue by presenting Wordbank, a structured database of CDI data combined with a browsable web interface. Wordbank archives CDI data across languages and labs, providing a resource for researchers interested in early language, as well as a platform for novel analyses. The site allows interactive exploration of patterns of vocabulary growth at the level of both individual children and particular words. We also introduce wordbankr, a software package for connecting to the database directly. Together, these tools extend the abilities of students and researchers to explore quantitative trends in vocabulary development.
Citations
More filters
Journal ArticleDOI
TL;DR: It is argued that a focus on individual differences (IDs) provides a crucial source of evidence that bears strongly upon core issues in theories of the acquisition and processing of language; specifically, the role of experience in language acquisition, processing, and attainment, and the architecture of the language system.

230 citations


Cites background from "Wordbank: an open repository for de..."

  • ...For instance, vocabulary production norms from Wordbank shows that a child in the 90th percentile at 16 months knows the same number of words as a child in 10th percentile at 26 months [37]....

    [...]

  • ...(A) Cross-sectional MacArthur-Bates Communicative Development Inventory vocabulary production data from 4687 English-speaking children aged 16–30 months [37]....

    [...]

Journal ArticleDOI
01 Jul 2017-Infancy
TL;DR: The ManyBabies project, the instantiation of this proposal, will not only help to estimate how robust and replicable these phenomena are, but also gain new theoretical insights into how they vary across ages, linguistic communities, and measurement methods.
Abstract: The ideal of scientific progress is that we accumulate measurements and integrate these into theory, but recent discussion of replicability issues has cast doubt on whether psychological research conforms to this model. Developmental research—especially with infant participants—also has discipline-specific replicability challenges, including small samples and limited measurement methods. Inspired by collaborative replication efforts in cognitive and social psychology, we describe a proposal for assessing and promoting replicability in infancy research: large-scale, multi-laboratory replication efforts aiming for a more precise understanding of key developmental phenomena. The ManyBabies project, our instantiation of this proposal, will not only help us estimate how robust and replicable these phenomena are, but also gain new theoretical insights into how they vary across ages, linguistic communities, and measurement methods. This project has the potential for a variety of positive outcomes, including less-biased estimates of theoretically important effects, estimates of variability that can be used for later study planning, and a series of best-practices blueprints for future infancy research.

198 citations


Cites background from "Wordbank: an open repository for de..."

  • ...1Although both the recent Infancy registered reports submission route and efforts for sharing observational data are notable exceptions; cf. Adolph, Gilmore, Freeman, Sanderson, & Millman, 2012; Frank et al., 2016; MacWhinney, 2000; Rose & MacWhinney, 2014; VanDam et al., 2016)....

    [...]

Journal Article
TL;DR: Two alternative growth principles are introduced and test: preferential acquisition—words enter the lexicon not because they are related to well-connected words, but because they connect well to other words in the learning environment— and the lure of the associates—new words are favored in proportion to their connections with known words.
Abstract: Analyses of adult semantic networks suggest a learning mechanism involving preferential attachment: A word is more likely to enter the lexicon the more connected the known words to which it is related. We introduce and test two alternative growth principles: preferential acquisition—words enter the lexicon not because they are related to well-connected words, but because they connect well to other words in the learning environment—and the lure of the associates—new words are favored in proportion to their connections with known words. We tested these alternative principles using longitudinal analyses of developing networks of 130 nouns children learn prior to the age of 30 months. We tested both networks with links between words represented by features and networks with links represented by associations. The feature networks did not predict age of acquisition using any growth model. The associative networks grew by preferential acquisition, with the best model incorporating word frequency, number of phonological neighbors, and connectedness of the new word to words in the learning environment, as operationalized by connectedness to words typically acquired by the age of 30 months.

198 citations

Journal ArticleDOI
07 Dec 2018-Science
TL;DR: The results suggest that recessive coding variants account for a small fraction of currently undiagnosed nonconsanguineous individuals, and that the role of noncoding variants, incomplete penetrance, and polygenic mechanisms need further exploration.
Abstract: We estimated the genome-wide contribution of recessive coding variation in 6040 families from the Deciphering Developmental Disorders study. The proportion of cases attributable to recessive coding variants was 3.6% in patients of European ancestry, compared with 50% explained by de novo coding mutations. It was higher (31%) in patients with Pakistani ancestry, owing to elevated autozygosity. Half of this recessive burden is attributable to known genes. We identified two genes not previously associated with recessive developmental disorders, KDM5B and EIF3F, and functionally validated them with mouse and cellular models. Our results suggest that recessive coding variants account for a small fraction of currently undiagnosed nonconsanguineous individuals, and that the role of noncoding variants, incomplete penetrance, and polygenic mechanisms need further exploration.

146 citations

Journal ArticleDOI
TL;DR: Future advances in computational models will be necessary to connect the developmentally changing content and statistics of infant experience to the internal machinery that does the learning.

142 citations

References
More filters
Journal Article
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
Abstract: Copyright (©) 1999–2012 R Foundation for Statistical Computing. Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies. Permission is granted to copy and distribute modified versions of this manual under the conditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one. Permission is granted to copy and distribute translations of this manual into another language, under the above conditions for modified versions, except that this permission notice may be stated in a translation approved by the R Core Team.

272,030 citations

Book
13 Aug 2009
TL;DR: This book describes ggplot2, a new data visualization package for R that uses the insights from Leland Wilkisons Grammar of Graphics to create a powerful and flexible system for creating data graphics.
Abstract: This book describes ggplot2, a new data visualization package for R that uses the insights from Leland Wilkisons Grammar of Graphics to create a powerful and flexible system for creating data graphics. With ggplot2, its easy to: produce handsome, publication-quality plots, with automatic legends created from the plot specification superpose multiple layers (points, lines, maps, tiles, box plots to name a few) from different data sources, with automatically adjusted common scales add customisable smoothers that use the powerful modelling capabilities of R, such as loess, linear models, generalised additive models and robust regression save any ggplot2 plot (or part thereof) for later modification or reuse create custom themes that capture in-house or journal style requirements, and that can easily be applied to multiple plots approach your graph from a visual perspective, thinking about how each component of the data is represented on the final plot. This book will be useful to everyone who has struggled with displaying their data in an informative and attractive way. You will need some basic knowledge of R (i.e. you should be able to get your data into R), but ggplot2 is a mini-language specifically tailored for producing graphics, and youll learn everything you need in the book. After reading this book youll be able to produce graphics customized precisely for your problems,and youll find it easy to get graphics out of your head and on to the screen or page.

29,504 citations

Book
Roger Brown1
01 Jan 1973
TL;DR: This article studied the early stages of grammatical constructions and the meanings they convey in pre-school children and found that the order of their acquisition is almost identical across children and is predicted by their relative semantic and grammatical complexity.
Abstract: For many years, Roger Brown and his colleagues have studied the developing language of pre-school children--the language that ultimately will permit them to understand themselves and the world around them. This longitudinal research project records the conversational performances of three children, studying both semantic and grammatical aspects of their language development. These core findings are related to recent work in psychology and linguistics--and especially to studies of the acquisition of languages other than English, including Finnish, German, Korean, and Samoan. Roger Brown has written the most exhaustive and searching analysis yet undertaken of the early stages of grammatical constructions and the meanings they convey. The five stages of linguistic development Brown establishes are measured not by chronological age-since children vary greatly in the speed at which their speech develops--but by mean length of utterance. This volume treats the first two stages. Stage I is the threshold of syntax, when children begin to combine words to make sentences. These sentences, Brown shows, are always limited to the same small set of semantic relations: nomination, recurrence, disappearance, attribution, possession, agency, and a few others. Stage II is concerned with the modulations of basic structural meanings--modulations for number, time, aspect, specificity--through the gradual acquisition of grammatical morphemes such as inflections, prepositions, articles, and case markers. Fourteen morphemes are studied in depth and it is shown that the order of their acquisition is almost identical across children and is predicted by their relative semantic and grammaticalcomplexity. It is, ultimately, the intent of this work to focus on the nature and development of knowledge: knowledge concerning grammar and the meanings coded by grammar; knowledge inferred from performance, from sentences and the settings in which they are spoken, and from signs of comprehension or incomprehension of sentences.

4,302 citations

Reference EntryDOI
30 Jan 2010
TL;DR: The Peabody Picture Vocabulary Test (PPVT) as discussed by the authors is an individually administered, norm-referenced test of single-word receptive (or hearing) vocabulary.
Abstract: The Peabody Picture Vocabulary Test (PPVT) is an individually administered, norm-referenced test of single-word receptive (or hearing) vocabulary. Originally published in 1959, the PPVT has been revised several times and currently exists in its fourth edition (PPVT-4; Dunn & Dunn, 2007). In addition to assessing receptive vocabulary, test authors report that the PPVT-4 may be used as a means of estimating verbal development (Dunn & Dunn, 2007). Normed with a sample of 3,540 individuals ages 2½ to 90 representative of March 2004 U.S. census data, the PPVT-4 features two parallel forms (Form A and Form B), each consisting of 228 test items. Items consist of two stimuli, a word spoken by the examiner and four pictures on a single card; the examinee selects the picture that best represents the examiner's spoken word. Raw scores may be translated into age-based standard scores (i.e., M = 100; SD = 15), percentile ranks, stanines, age equivalents, and grade equivalents. Keywords: receptive language; language screener

4,281 citations

Journal ArticleDOI
TL;DR: ggplot2 as mentioned in this paper is an implementation in R of The Grammar of Graphics, a systematic approach to the specification of statistical graphics that was introduced in a book previously reviewed in the Journal of Statistical Software by Cox (2007).
Abstract: ggplot2: Elegant Graphics for Data Analysis is a new addition to the UseR! series by Springer, probably the fastest expanding source of resources for computational statistics at the current moment. The books in this series are all linked with R, either presenting a new package developed by the own authors of the book or describing how to applying statistical techniques with the different packages available in R. ggplot2 is an implementation in R of The Grammar of Graphics (Wilkinson 2005) a systematic approach to the specification of statistical graphics that was introduced in a book previously reviewed in the Journal of Statistical Software by Cox (2007). This implementation has been developed by Hadley Wickham, who is also the author of the book reviewed here.

4,089 citations