| Literature DB >> 35774418 |
Ashwini Ashokkumar1, James W Pennebaker2.
Abstract
To what degree can we determine people's connections with groups through the language they use? In recent years, large archives of behavioral data from social media communities have become available to social scientists, opening the possibility of tracking naturally occurring group identity processes. A feature of most digital groups is that they rely exclusively on the written word. Across 3 studies, we developed and validated a language-based metric of group identity strength and demonstrated its potential in tracking identity processes in online communities. In Studies 1a-1c, 873 people wrote about their connections to various groups (country, college, or religion). A total of 2 language markers of group identity strength were found: high affiliation (more words like we, togetherness) and low cognitive processing or questioning (fewer words like think, unsure). Using these markers, a language-based unquestioning affiliation index was developed and applied to in-class stream-of-consciousness essays of 2,161 college students (Study 2). Greater levels of unquestioning affiliation expressed in language predicted not only self-reported university identity but also students' likelihood of remaining enrolled in college a year later. In Study 3, the index was applied to naturalistic Reddit conversations of 270,784 people in 2 online communities of supporters of the 2016 presidential candidates-Hillary Clinton and Donald Trump. The index predicted how long people would remain in the group (3a) and revealed temporal shifts mirroring members' joining and leaving of groups (3b). Together, the studies highlight the promise of a language-based approach for tracking and studying group identity processes in online groups.Entities:
Keywords: LIWC; group identity; group processes; language analysis; naturalistic observation
Year: 2022 PMID: 35774418 PMCID: PMC9229362 DOI: 10.1093/pnasnexus/pgac022
Source DB: PubMed Journal: PNAS Nexus ISSN: 2752-6542
Correlations of self-reported and language indices in Studies 1a, 1b, and 1c.
| Self-reported measures | Affiliation | Self-focus (I-words) | Questioning (cognitive processing) | Engagement (word count) |
|---|---|---|---|---|
| Study 1a: USA ( | ||||
| Identity fusion with group | 0.19** | −0.001 | −0.14* | −0.16* |
| Progroup behavior | 0.21*** | −0.03 | −0.10 | −0.12* |
| Study 1b: religion ( | ||||
| Identity fusion with group | 0.14** | 0.06 | −0.31*** | 0.10* |
| Progroup behavior | 0.16** | 0.13** | −0.24*** | 0.003 |
| Study 1c: college ( | ||||
| Identity fusion with group | 0.25*** | 0.09 | −0.17** | −0.19** |
| Progroup behavior | 0.33*** | 0.13* | −0.15* | −0.17** |
Note: *indicates P < 0.05. **indicates P < 0.01. ***indicates P < 0.001.
Excerpts from sample responses with low and high unquestioning affiliation scores. Words from the affiliation dictionary are in green font, and words in the cognitive processing dictionary are in red font.
| High unquestioning affiliation | Low unquestioning affiliation |
|---|---|
| “I | “I'm just |
Fig. 1.(a) and (b). Average unquestioning affiliation scores among short-, medium-, and long-term members of The_Donald (1a) and hillaryclinton (1b). Group members who stayed in the group longer conveyed higher levels of unquestioning affiliation in their language on average. Error bars represent confidence intervals.
Fig. 2.(a) and (b). Change in unquestioning affiliation expressed in natural language after joining and before leaving The_Donald (2a) and hillaryclinton (2b). The y-axis shows 3 days rolling means of the unquestioning affiliation score. The graphs indicate that linguistic expressions of unquestioning affiliation generally increase after joining a group and drop before leaving the group. Error bands represent confidence intervals.