| Literature DB >> 27524974 |
Marc Brysbaert1, Michaël Stevens1, Paweł Mandera1, Emmanuel Keuleers1.
Abstract
Based on an analysis of the literature and a large scale crowdsourcing experiment, we estimate that an average 20-year-old native speaker of American English knows 42,000 lemmas and 4,200 non-transparent multiword expressions, derived from 11,100 word families. The numbers range from 27,000 lemmas for the lowest 5% to 52,000 for the highest 5%. Between the ages of 20 and 60, the average person learns 6,000 extra lemmas or about one new lemma every 2 days. The knowledge of the words can be as shallow as knowing that the word exists. In addition, people learn tens of thousands of inflected forms and proper nouns (names), which account for the substantially high numbers of 'words known' mentioned in other publications.Entities:
Keywords: reading; vocabulary size; word knowledge
Year: 2016 PMID: 27524974 PMCID: PMC4965448 DOI: 10.3389/fpsyg.2016.01116
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Extract from the word list of Google Books Ngram viewer.
| ekam |
| ekamantam |
| ekatvam |
| eke |
| ekiben |
| ekistic |
| ekklesia |
| ekklesiologische |
| ekkuklema |
| ekonomicheskoye |
| ekonomicznego |
| ekonomisk |
| eks |
| ekstatic |
| ektexine |
| ekun |
| E.K. |
| EKAW’2000 |
| EKG |
| EKV |
Extract from a lemma list showing the existence of word families.
| nomad |
| nomadic |
| nomadically |
| nomadism |
| nomenclatorial |
| nomenclatural |
| nomenclature |
| nominal |
| nominalist |
| nominalization |
| nominally |
| nominate |
| nominated |
| nomination |
| nominative |
| nominator |
| nominee |
| nomothetic |
| non |
| non-absorbent |
Various estimates of the number of English words known by adults (typically first-year university students), together with the way in which “words” were defined and the task used.
| Study | Estimate | Definition of “word” | Task |
|---|---|---|---|
| 215,000 | All entries from Webster’s New International Dictionary | Meaning production | |
| 14,400 | Lemmas present both in Miriam-Webster’s Pocket Dictionary and Webster’s Seventh Collegiate Dictionary (list of 19,750 words) | Familiarity rating | |
| 17,200 | Base words (sic) from Webster’s Third New International Dictionary, excluding proper nouns, derived words, and compounds. | Indicate whether word is known or not | |
| 17,000 | Functionally important lemmas (sic) from the Oxford American Dictionary, with the exception of abbreviations, hyphenated words, affixes, contractions, interjections, letters, multiword entries, slang, capitalized entries, foreign words, alternate spellings, and outdated words. | Subjective estimates of knowledge | |
| 40,000 | Distinct lemmas (sic) from a corpus based on school textbooks; excludes proper nouns and a limited number of very transparent derived words and compounds. | Various tests | |
| 12,000 | Same as in | Multiple choice questions related to the meaning of the words | |
| 9,800 | Same as in | Provide synonym or explanation for words known |
Estimates of the words known by 20-year-olds and 60-year-olds at the low end and the high end.
| Person | Number of alphabetical types encountered | Number of lemmas known (max = 61,800) | Number of base words known (max = 18,300) |
|---|---|---|---|
| Low end | 84,000 | 27,100 | 6,100 |
| Median | 42,000 | 11,100 | |
| High end | 292,000 | 51,700 | 14,900 |
| Low end | 157,000 | 35,100 | 9,000 |
| Median | 48,200 | 13,400 | |
| High end | 543,000 | 56,400 | 16,700 |