Literature DB >> 35787033

Historical representations of social groups across 200 years of word embeddings from Google Books.

Tessa E S Charlesworth1, Aylin Caliskan2, Mahzarin R Banaji1.   

Abstract

Using word embeddings from 850 billion words in English-language Google Books, we provide an extensive analysis of historical change and stability in social group representations (stereotypes) across a long timeframe (from 1800 to 1999), for a large number of social group targets (Black, White, Asian, Irish, Hispanic, Native American, Man, Woman, Old, Young, Fat, Thin, Rich, Poor), and their emergent, bottom-up associations with 14,000 words and a subset of 600 traits. The results provide a nuanced picture of change and persistence in stereotypes across 200 y. Change was observed in the top-associated words and traits: Whether analyzing the top 10 or 50 associates, at least 50% of top associates changed across successive decades. Despite this changing content of top-associated words, the average valence (positivity/negativity) of these top stereotypes was generally persistent. Ultimately, through advances in the availability of historical word embeddings, this study offers a comprehensive characterization of both change and persistence in social group representations as revealed through books of the English-speaking world from 1800 to 1999.

Entities:  

Keywords:  attitude change; natural language processing; stereotype change; word embeddings

Mesh:

Year:  2022        PMID: 35787033      PMCID: PMC9282454          DOI: 10.1073/pnas.2121798119

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


  15 in total

1.  A model of (often mixed) stereotype content: competence and warmth respectively follow from perceived status and competition.

Authors:  Susan T Fiske; Amy J C Cuddy; Peter Glick; Jun Xu
Journal:  J Pers Soc Psychol       Date:  2002-06

2.  Stereotype persistence and change among college students.

Authors:  G M GILBERT
Journal:  J Abnorm Psychol       Date:  1951-04

3.  Semantics derived automatically from language corpora contain human-like biases.

Authors:  Aylin Caliskan; Joanna J Bryson; Arvind Narayanan
Journal:  Science       Date:  2017-04-14       Impact factor: 47.728

4.  Gender Stereotypes in Natural Language: Word Embeddings Show Robust Consistency Across Child and Adult Language Corpora of More Than 65 Million Words.

Authors:  Tessa E S Charlesworth; Victor Yang; Thomas C Mann; Benedek Kurdi; Mahzarin R Banaji
Journal:  Psychol Sci       Date:  2021-01-05

5.  Norms of valence, arousal, and dominance for 13,915 English lemmas.

Authors:  Amy Beth Warriner; Victor Kuperman; Marc Brysbaert
Journal:  Behav Res Methods       Date:  2013-12

6.  Patterns of Implicit and Explicit Attitudes: I. Long-Term Change and Stability From 2007 to 2016.

Authors:  Tessa E S Charlesworth; Mahzarin R Banaji
Journal:  Psychol Sci       Date:  2019-01-03

7.  What's in a Name? The Hidden Historical Ideologies Embedded in the Black and African American Racial Labels.

Authors:  Erika V Hall; Sarah S M Townsend; James T Carter
Journal:  Psychol Sci       Date:  2021-10-25

8.  Characterizing the Google Books Corpus: Strong Limits to Inferences of Socio-Cultural and Linguistic Evolution.

Authors:  Eitan Adam Pechenick; Christopher M Danforth; Peter Sheridan Dodds
Journal:  PLoS One       Date:  2015-10-07       Impact factor: 3.240

9.  Guideline for improving the reliability of Google Ngram studies: Evidence from religious terms.

Authors:  Nadja Younes; Ulf-Dietrich Reips
Journal:  PLoS One       Date:  2019-03-22       Impact factor: 3.240

10.  From Text to Thought: How Analyzing Language Can Advance Psychological Science.

Authors:  Joshua Conrad Jackson; Joseph Watts; Johann-Mattis List; Curtis Puryear; Ryan Drabble; Kristen A Lindquist
Journal:  Perspect Psychol Sci       Date:  2021-10-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.