Literature DB >> 25715025

Word knowledge in the crowd: Measuring vocabulary size and word prevalence in a massive online experiment.

Emmanuel Keuleers1, Michaël Stevens, Paweł Mandera, Marc Brysbaert.   

Abstract

We use the results of a large online experiment on word knowledge in Dutch to investigate variables influencing vocabulary size in a large population and to examine the effect of word prevalence-the percentage of a population knowing a word-as a measure of word occurrence. Nearly 300,000 participants were presented with about 70 word stimuli (selected from a list of 53,000 words) in an adapted lexical decision task. We identify age, education, and multilingualism as the most important factors influencing vocabulary size. The results suggest that the accumulation of vocabulary throughout life and in multiple languages mirrors the logarithmic growth of number of types with number of tokens observed in text corpora (Herdan's law). Moreover, the vocabulary that multilinguals acquire in related languages seems to increase their first language (L1) vocabulary size and outweighs the loss caused by decreased exposure to L1. In addition, we show that corpus word frequency and prevalence are complementary measures of word occurrence covering a broad range of language experiences. Prevalence is shown to be the strongest independent predictor of word processing times in the Dutch Lexicon Project, making it an important variable for psycholinguistic research.

Entities:  

Keywords:  Ageing; Bilingualism; Crowdsourcing; Frequency; Herdan's law; Prevalence

Mesh:

Year:  2015        PMID: 25715025     DOI: 10.1080/17470218.2015.1022560

Source DB:  PubMed          Journal:  Q J Exp Psychol (Hove)        ISSN: 1747-0218            Impact factor:   2.143


  22 in total

1.  Orthographic Knowledge and Lexical Form Influence Vocabulary Learning.

Authors:  James Bartolotti; Viorica Marian
Journal:  Appl Psycholinguist       Date:  2016-07-26

2.  Accounting for item-level variance in recognition memory: Comparing word frequency and contextual diversity.

Authors:  Brendan T Johns
Journal:  Mem Cognit       Date:  2021-11-22

3.  Prevalence norms for 40,777 Catalan words: An online megastudy of vocabulary size.

Authors:  Marc Guasch; Roger Boada; Jon Andoni Duñabeitia; Pilar Ferré
Journal:  Behav Res Methods       Date:  2022-09-09

4.  EmoPro - Emotional prototypicality for 1286 Spanish words: Relationships with affective and psycholinguistic variables.

Authors:  Miguel Ángel Pérez-Sánchez; Hans Stadthagen-Gonzalez; Marc Guasch; José Antonio Hinojosa; Isabel Fraga; Javier Marín; Pilar Ferré
Journal:  Behav Res Methods       Date:  2021-02-24

5.  It's all in the delivery: Effects of context valence, arousal, and concreteness on visual word processing.

Authors:  Bryor Snefjella; Victor Kuperman
Journal:  Cognition       Date:  2016-08-24

6.  Towards a distributed connectionist account of cognates and interlingual homographs: evidence from semantic relatedness tasks.

Authors:  Eva D Poort; Jennifer M Rodd
Journal:  PeerJ       Date:  2019-05-16       Impact factor: 2.984

7.  Speech error and tip of the tongue diary for mobile devices.

Authors:  Michael S Vitevitch; Cynthia S Q Siew; Nichol Castro; Rutherford Goldstein; Jeremy A Gharst; Jeriprolu J Kumar; Erica B Boos
Journal:  Front Psychol       Date:  2015-08-13

8.  Lexical Influences on Spoken Spondaic Word Recognition in Hearing-Impaired Patients.

Authors:  Annie Moulin; Céline Richard
Journal:  Front Neurosci       Date:  2015-12-23       Impact factor: 4.677

9.  How Many Words Do We Know? Practical Estimates of Vocabulary Size Dependent on Word Definition, the Degree of Language Input and the Participant's Age.

Authors:  Marc Brysbaert; Michaël Stevens; Paweł Mandera; Emmanuel Keuleers
Journal:  Front Psychol       Date:  2016-07-29

10.  Age-Related Differences in Lexical Access Relate to Speech Recognition in Noise.

Authors:  Rebecca Carroll; Anna Warzybok; Birger Kollmeier; Esther Ruigendijk
Journal:  Front Psychol       Date:  2016-07-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.