| Literature DB >> 31012064 |
Louise Stringer1,2, Paul Iverson3.
Abstract
Research into non-native (L2) speech perception has increased the need for specialized experimental materials. The Non-Native Speech Recognition (NNSR) sentences are a new large-scale set of speech recognition materials for research with L2 speakers of English at CEFR level B1 (North, Ortega, & Sheehan, 2010) and above. The set comprises 439 triplets of sentences in three related conditions: semantically predictable, neutral, and anomalous. The sentences were created by combining a strongly or weakly contextually constrained sentence frame with a congruent or anomalous final keyword, and they were matched on a number of factors during development, to maintain consistency across conditions. This article describes the development process of the NNSR sentences, along with results of speech-in-noise intelligibility testing for L2 and native English speakers. Suggestions for the sentences' application in a range of investigations and experimental designs are also discussed.Entities:
Keywords: L2 speech perception; Stimulus set
Mesh:
Year: 2020 PMID: 31012064 PMCID: PMC7148274 DOI: 10.3758/s13428-019-01251-z
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Composition of the three semantic conditions
| Condition | Sentence Frame | Final Keyword | Example 1 | Example 2 |
|---|---|---|---|---|
| Predictable | Strongly constrained | Congruous | ||
| Neutral | Weakly constrained | Congruous | My coat is made of nice | Children like burgers with delicious |
| Anomalous | Strongly constrained | Incongruous |
Content overlapping across sentences within a triplet is shown in bold, the pointer words that generate the context are underlined, and final keywords are capitalized
Native languages of cloze test participants
| Cloze Test P1 | Cloze Test P2 | Cloze Test P3 | Cloze Test N1 | Cloze Test N2 |
|---|---|---|---|---|
| intermediate NN | ||||
Albanian (1) Arabic (2) Cantonese (2) Dutch (2) English (18) French (1) German (7) Hungarian (3) Italian (1) Korean (1) Polish (1) Romanian (2) Serbian (1) Spanish (2) | Dutch (1) English (10) French (1) German (2) Romanian (1) Serbian (1) Spanish (2) | Bosnian (1) French (1) German (1) Hindi (2) Hungarian (5) Italian (2) Korean (3) Russian (2) Spanish (9) Thai (1) Vietnamese (8) | English (9) Kiswahili (1) Korean (2) Mandarin (1) Slovak (1) | Cantonese (1) English (9) German (1) Romanian (1) Slovak (1) |
The number of speakers of each language is given in parentheses
Properties of the sentence frame sets
| Strongly Constrained | Weakly Constrained | |
|---|---|---|
| Syllable count | 8.40 (1.58) | 8.35 (1.32) |
| Total word count | 6.51 (1.15) | 6.39 (1.03) |
| Pointer word count (per sentence) | 2.53 (0.49) | 2.52 (0.50) |
| Pointer word count (across whole set)* | 1,100 (623 unique) | 1,087 (425 unique) |
| Pointer word frequency (across whole set) | 1.79 (1.69) | 2.56 (3.57) |
Values are given in the form mean (SD), except in the case of *
Properties of the congruous and incongruous final keyword sets
| Congruous | Incongruous | |
|---|---|---|
| Syllable count | 1.795 (0.841) | 1.797 (0.839) |
| Lexical frequency (SUBTLEX Log10) | 3.14 (0.60) | 3.12 (0.59) |
| Phonological neighborhood density (CLEARPOND) | 12.43 (13.79) | 12.47 (13.83) |
| Phonological Levenshtein distance (English Lexicon Project) | 1.91 (0.87) | 1.89 (0.87) |
| Concreteness (MRC) | 543.9 (84.70) | 493.7 (105.78) |
Values are given in the form mean (SD)
Fig. 1Recognition accuracy of the 18 lists of semantically neutral sentences presented in speech-shaped noise, averaged across the three noise levels
Fig. 2Speech-in-noise recognition accuracy for Spanish and English listeners as a function of noise level