| Literature DB >> 35089448 |
Ethan Kutlu1,2, Mehrgol Tiv3, Stefanie Wulff4,5, Debra Titone3.
Abstract
Upon hearing someone's speech, a listener can access information such as the speaker's age, gender identity, socioeconomic status, and their linguistic background. However, an open question is whether living in different locales modulates how listeners use these factors to assess speakers' speech. Here, an audio-visual test was used to measure whether listeners' accentedness judgments and intelligibility (i.e., speech perception) can be modulated depending on racial information in faces that they see. American, British, and Indian English were used as three different English varieties of speech. These speech samples were presented with either a white female face or a South Asian female face. Two experiments were completed in two locales: Gainesville, Florida (USA) and Montreal, Quebec (Canada). Overall, Montreal listeners were more accurate in their transcription of sentences (i.e., intelligibility) compared to Gainesville listeners. Moreover, Gainesville listeners' ability to transcribe the same spoken sentences decreased for all varieties when listening to speech paired with South Asian faces. However, seeing a white or a South Asian face did not impact speech intelligibility for the same spoken sentences for Montreal listeners. Finally, listeners' accentedness judgments increased for American English and Indian English when the visual information changed from a white face to a South Asian face in Gainesville, but not in Montreal. These findings suggest that visual cues for race impact speech perception to a greater degree in locales with greater ecological diversity.Entities:
Keywords: Accents; Audio-visual; Ecological diversity; Race; Speech perception
Mesh:
Year: 2022 PMID: 35089448 PMCID: PMC8799814 DOI: 10.1186/s41235-022-00354-0
Source DB: PubMed Journal: Cogn Res Princ Implic ISSN: 2365-7464
Descriptive demographic background information
| Sample ( | ||||
|---|---|---|---|---|
| Gainesville | Montreal | |||
| Age | 20.4 (1.2) | 25 | 20.4 (1.5) | 25 |
| Age of Acquisition | ||||
| English | 0.68 (1.4) Min = 0, Max = 4 | 0.04 (0.2) Min = 0, Max = 1 | ||
| French | 0.56(2.8) Min = 14, Max = 14 | 3.08(2.1) Min = 0, Max = 5 | ||
| Spanish | 4(6.1) Min = 0, Max = 14 | 3.52(5.8) Min = 11, Max = 15 | ||
| Daily use (%) | ||||
| English | 87.6 (13.8) | 84.48(13.3 | ||
| French | 0 (0) | 13.8(13.1) | ||
| Spanish | 11.6(13.6) | 0.08(0.2) | ||
| Gender | ||||
| Female | 22 | 24 | ||
| Male | 2 | 1 | ||
| Queer/Non-binary | 1 | |||
| Racial/Ethnic Identity | ||||
| Black | ||||
| East Asian | 1 | 2 | ||
| Middle Eastern | 1 | |||
| Pacific Islander | ||||
| Southeast Asian | 1 | |||
| White | 12 | 19 | ||
| Latin American | 3 | |||
| Bi/multiracial | 8 | 3 | ||
Fig. 1Overall language entropy distribution in Montreal and in Gainesville
Fig. 2The design of the intelligibility task (both pictures are allowed to be used for publication purposes)
Fig. 3The design of the accentedness task
Summaries of the Mean and Standard Deviation (in parenthesis) of the accentedness judgments and intelligibility scores
| Gainesville | Montreal | |||
|---|---|---|---|---|
| White | South Asian | White | South Asian | |
| AmE | 2.0(1.36) | 3.5(1.90) | 2.4(1.70) | 2.2(1.63) |
| BrE | 4.8(1.40) | 5.6(1.80) | 6.4(1.66) | 6.5(1.5) |
| IndE | 7.2(1.48) | 7.5(1.38) | 7.8(1.19) | 7.6(1.06) |
| AmE | 0.98(0.08) | 0.97(0.09) | 0.98(0.08) | 0.98(0.07) |
| BrE | 0.98(0.08) | 0.93(0.16) | 0.98(0.07) | 0.98(0.10) |
| IndE | 0.96(0.13) | 0.83(0.28) | 0.98(0.10) | 0.97(0.11) |
AmE refers to American English, BrE refers to British English, and IndE refers to Indian English
Summary of the linear mixed-effects model results of the proportions as the dependent variable
| Proportion | |||
|---|---|---|---|
| Fixed Effects | Estimates | SE | |
| Intercept | 0.92 | 0.01 | |
| Face (white) | 0.07 | 0.01 | |
| Variety (British vs. American) | − 0.01 | 0.00 | 0.094 |
| Variety (Indian vs. American + British) | − 0.04 | 0.00 | |
| Location (Montreal) | 0.06 | 0.01 | |
| Predictability(Low) | − 0.01 | 0.01 | 0.155 |
| Face (white):Variety (British vs. American) | 0.01 | 0.01 | 0.053 |
| Face (white):Variety (Indian vs. American + British) | 0.03 | 0.00 | |
| Face(white):Location(Montreal) | − 0.07 | 0.01 | |
| Variety (British vs. American):Location(Montreal) | 0.01 | 0.01 | 0.305 |
| Variety (Indian vs. American + British):Location(Montreal) | 0.04 | 0.00 | |
| Face(white):Variety(British vs. American):Location(Montreal) | − 0.00 | 0.01 | 0.681 |
| Face(white):Variety (Indian vs. American + British):Location(Montreal) | − 0.03 | 0.01 | |
| Observations | 6120 | ||
| Marginal R2 / Conditional R2 | 0.092 / 0.232 | ||
Significant results are in bold
Fig. 4Proportions of intelligibility scores in Montreal and in Gainesville for each face and variety
Summary of the Cumulative Link Mixed Model with accentedness judgments as to the dependent variable
| Proportion | |||
|---|---|---|---|
| Fixed effects | Estimates | SE | |
| Face (white) | − 1.16 | 0.09 | |
| Variety (British vs. American) | 1.30 | 0.08 | |
| Variety (Indian vs. American + British) | 1.31 | 0.05 | |
| Location (Montreal) | − 0.10 | 0.29 | 0.716 |
| Predictability (Low) | − 0.16 | 0.06 | |
| Face (white):Variety (British vs. American) | 0.41 | 0.11 | |
| Face (white):Variety (Indian vs. American + British) | 0.32 | 0.06 | |
| Face(white):Location(Montreal) | 1.35 | 0.13 | |
| Variety (British vs. American):Location (Montreal) | 1.27 | 0.12 | |
| Variety (Indian vs. American + British):Location(Montreal) | 0.01 | 0.06 | 0.793 |
| Face(white):Variety(British vs. American):Location(Montreal) | − 0.54 | 0.16 | |
| Face(white):Variety (Indian vs. American + British):Location(Montreal) | − 0.12 | 0.09 | 0.206 |
| Observations | 3060 | ||
| Marginal R2 / Conditional R2 | 0.635/0.721 | ||
Significant results are in bold
Fig. 5Accentedness judgments in Montreal and in Gainesville for each face and variety type
Percentage of intelligibility with(n = 10) and without noise (n = 14) for each American English, British English and Indian English speaker. Only speakers who had 85% and above intelligibility were used in the actual experiment indicated with *
| English | Other language(s) | Percentage of intelligibility | Percentage of intelligibility with noise | |
|---|---|---|---|---|
| Speaker 1* | American | None | 98% | 92% |
| Speaker 2* | American | None | 98.5% | 90% |
| Speaker 3* | British | None | 96.2% | 90% |
| Speaker 4 | British | None | 82% | 78% |
| Speaker 5 | British | None | 92% | 83.4% |
| Speaker 6 | British | None | 90% | 82% |
| Speaker 7 | British | None | 93% | 84% |
| Speaker 8* | British | None | 94.4% | 88% |
| Speaker 9 | Indian | Tamil | 77% | 65% |
| Speaker 10 | Indian | Tamil | 81% | 79% |
| Speaker 11* | Indian | Tamil, Telugu | 91% | 89% |
| Speaker 12 | Indian | Tamil | 68% | 65% |
| Speaker 13* | Indian | Telugu | 87% | 85% |
| Speaker 14 | Indian | Telugu | 79% | 72.5% |