Literature DB >> 12109025

Using Internet search engines to estimate word frequency.

Irene V Blair1, Geoffrey R Urland, Jennifer E Ma.   

Abstract

The present research investigated Internet search engines as a rapid, cost-effective alternative for estimating word frequencies. Frequency estimates for 382 words were obtained and compared across four methods: (1) Internet search engines, (2) the Kucera and Francis (1967) analysis of a traditional linguistic corpus, (3) the CELEX English linguistic database (Baayen, Piepenbrock, & Gulikers, 1995), and (4) participant ratings of familiarity. The results showed that Internet search engines produced frequency estimates that were highly consistent with those reported by Kucera and Francis and those calculated from CELEX, highly consistent across search engines, and very reliable over a 6-month period of time. Additional results suggested that Internet search engines are an excellent option when traditional word frequency analyses do not contain the necessary data (e.g., estimates for forenames and slang). In contrast, participants' familiarity judgments did not correspond well with the more objective estimates of word frequency. Researchers are advised to use search engines with large databases (e.g., AltaVista) to ensure the greatest representativeness of the frequency estimates.

Entities:  

Mesh:

Year:  2002        PMID: 12109025     DOI: 10.3758/bf03195456

Source DB:  PubMed          Journal:  Behav Res Methods Instrum Comput        ISSN: 0743-3808


  11 in total

1.  Memory enhancement for emotional words: are emotional words more vividly remembered than neutral words?

Authors:  Elizabeth A Kensinger; Suzanne Corkin
Journal:  Mem Cognit       Date:  2003-12

2.  The emotional memory effect: differential processing or item distinctiveness?

Authors:  Stephen R Schmidt; Bonnie Saari
Journal:  Mem Cognit       Date:  2007-12

3.  The multidetermined nature of idiom processing.

Authors:  Maya R Libben; Debra A Titone
Journal:  Mem Cognit       Date:  2008-09

4.  The interpretation of ambiguous trimorphemic words in sentence context.

Authors:  Alexander Pollatsek; Denis Drieghe; Linnaea Stockall; Roberto G de Almeida
Journal:  Psychon Bull Rev       Date:  2010-02

5.  Taboo words: the effect of emotion on memory for peripheral information.

Authors:  Rebecca Guillet; Jason Arndt
Journal:  Mem Cognit       Date:  2009-09

6.  Beyond words: evidence for automatic language-gesture integration of symbolic gestures but not dynamic landscapes.

Authors:  Dana Vainiger; Ludovica Labruna; Richard B Ivry; Michal Lavidor
Journal:  Psychol Res       Date:  2013-01-10

7.  Impaired memory retrieval correlates with individual differences in cortisol response but not autonomic response.

Authors:  Tony W Buchanan; Daniel Tranel; Ralph Adolphs
Journal:  Learn Mem       Date:  2006 May-Jun       Impact factor: 2.460

8.  An ecological alternative to Snodgrass & Vanderwart: 360 high quality colour images with norms for seven psycholinguistic variables.

Authors:  Francisco Javier Moreno-Martínez; Pedro R Montoro
Journal:  PLoS One       Date:  2012-05-25       Impact factor: 3.240

9.  True gender ratios and stereotype rating norms.

Authors:  Alan Garnham; Sam Doehren; Pascal Gygax
Journal:  Front Psychol       Date:  2015-07-22

10.  Inter-subject variability modulates phonological advance planning in the production of adjective-noun phrases.

Authors:  Violaine Michel Lange; Marina Laganaro
Journal:  Front Psychol       Date:  2014-01-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.