scispace - formally typeset
Journal ArticleDOI

Age-of-acquisition ratings for 30,000 English words

Reads0
Chats0
TLDR
This megastudy presents age-of-acquisition ratings for 30,121 English content words (nouns, verbs, and adjectives) using the Web-based crowdsourcing technology offered by the Amazon Mechanical Turk to indicate that the ratings collected are as valid and reliable as those collected in laboratory conditions.
Abstract
We present age-of-acquisition (AoA) ratings for 30,121 English content words (nouns, verbs, and adjectives). For data collection, this megastudy used the Web-based crowdsourcing technology offered by the Amazon Mechanical Turk. Our data indicate that the ratings collected in this way are as valid and reliable as those collected in laboratory conditions (the correlation between our ratings and those collected in the lab from U.S. students reached .93 for a subsample of 2,500 monosyllabic words). We also show that our AoA ratings explain a substantial percentage of the variance in the lexical-decision data of the English Lexicon Project, over and above the effects of log frequency, word length, and similarity to other words. This is true not only for the lemmas used in our rating study, but also for their inflected forms. We further discuss the relationships of AoA with other predictors of word recognition and illustrate the utility of AoA ratings for research on vocabulary growth.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Norms of valence, arousal, and dominance for 13,915 English lemmas

TL;DR: This work extended the ANEW database to nearly 14,000 English lemmas, providing researchers with a much richer source of information, including gender, age, and educational differences in emotion norms.
Journal ArticleDOI

Concreteness ratings for 40 thousand generally known English word lemmas

TL;DR: A comparison with the existing concreteness norms indicates that participants, as before, largely focused on visual and haptic experiences.
Journal ArticleDOI

SUBTLEX-UK: A new and improved word frequency database for British English

TL;DR: A new measure of word frequency, the Zipf scale, is introduced, which the authors hope will stop the current misunderstandings of the word frequency effect.
Proceedings Article

MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text

TL;DR: MCTest is presented, a freely available set of stories and associated questions intended for research on the machine comprehension of text that requires machines to answer multiple-choice reading comprehension questions about fictional stories, directly tackling the high-level goal of open-domain machine comprehension.
Journal ArticleDOI

Automatically Assessing Lexical Sophistication: Indices, Tools, Findings, and Application

TL;DR: The Tool for the Automatic Analysis of LExical Sophistication (TAALES), which calculates text scores for 135 classic and newly developed lexical indices related to word frequency, range, bigram and trigram frequency, academic language, and psycholinguistic word information, is introduced.
References
More filters
Posted Content

Conducting Behavioral Research on Amazon's Mechanical Turk

TL;DR: In this article, the authors demonstrate how to use Mechanical Turk for conducting behavioral research and lower the barrier to entry for researchers who could benefit from this platform, and illustrate the mechanics of putting a task on Mechanical Turk including recruiting subjects, executing the task, and reviewing the work submitted.
Journal ArticleDOI

Conducting behavioral research on Amazon's Mechanical Turk.

TL;DR: It is shown that when taken as a whole Mechanical Turk can be a useful tool for many researchers, and how the behavior of workers compares with that of experts and laboratory subjects is discussed.
Proceedings ArticleDOI

Cheap and Fast -- But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks

TL;DR: This work explores the use of Amazon's Mechanical Turk system, a significantly cheaper and faster method for collecting annotations from a broad base of paid non-expert contributors over the Web, and proposes a technique for bias correction that significantly improves annotation quality on two tasks.
Journal ArticleDOI

The English Lexicon Project.

TL;DR: The motivation for this project, the methods used to collect the data, and the search engine that affords access to the behavioral measures and descriptive lexical statistics for these stimuli are described.
Journal ArticleDOI

Moving beyond Kučera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English

TL;DR: The size of the corpus, the language register on which the corpus is based, and the definition of the frequency measure were investigated, finding that lemma frequencies are not superior to word form frequencies in English and that a measure of contextual diversity is better than a measure based on raw frequency of occurrence.
Related Papers (5)