| Literature DB >> 30606083 |
Odette Scharenborg1, Sofoklis Kakouros1,2, Brechtje Post3, Fanny Meunier4.
Abstract
This paper investigates whether sentence accent detection in a non-native language is dependent on (relative) similarity between prosodic cues to accent between the non-native and the native language, and whether cross-linguistic differences in the use of local and more widely distributed (i.e., non-local) cues to sentence accent detection lead to differential effects of the presence of background noise on sentence accent detection in a non-native language. We compared Dutch, Finnish, and French non-native listeners of English, whose cueing and use of prosodic prominence is gradually further removed from English, and compared their results on a phoneme monitoring task in different levels of noise and a quiet condition to those of native listeners. Overall phoneme detection performance was high for the native and the non-native listeners, but deteriorated to the same extent in the presence of background noise. Crucially, relative similarity between the prosodic cues to sentence accent of one's native language compared to that of a non-native language does not determine the ability to perceive and use sentence accent for speech perception in that non-native language. Moreover, proficiency in the non-native language is not a straightforward predictor of sentence accent perception performance, although high proficiency in a non-native language can seemingly overcome certain differences at the prosodic level between the native and non-native language. Instead, performance is determined by the extent to which listeners rely on local cues (English and Dutch) versus cues that are more distributed (Finnish and French), as more distributed cues survive the presence of background noise better.Entities:
Keywords: Sentence accent detection; acoustic cues; cross-linguistic influence; native and non-native listening; noise; phoneme detection; prosody
Mesh:
Year: 2019 PMID: 30606083 PMCID: PMC7066482 DOI: 10.1177/0023830918819573
Source DB: PubMed Journal: Lang Speech ISSN: 0023-8309 Impact factor: 1.500
The number of participants and their mean age and mean LexTale score per listener/language group.
| Listener group |
| Age | LexTale | Fixed effect estimates | ||||
|---|---|---|---|---|---|---|---|---|
| Mean |
| Mean |
| β |
|
| ||
| English | 47 | 20.8 | 2.7 | 98.6 | 2.6 | 98.560 | 1.257 | <0.001 |
| Dutch | 42 | 23.0 | 4.2 | 84.5 | 11.2 | −11.762 | 1.798 | <0.001 |
| Finnish | 45 | 27.3 | 6.7 | 86.8 | 9.7 | −18.279 | 1.975 | <0.001 |
| French | 32 | 29.8 | 8.6 | 80.3 | 8.9 | −14.018 | 1.830 | <0.001 |
All participants had a LexTale score of minimally 63, which corresponds (roughly) to an upper intermediate or B2 level of proficiency (Lemhöfer & Broersma, 2012). The column “Fixed effect estimates” shows the results of the linear regression analysis with the English listener group on the intercept.
SD, standard deviation; SE, standard error; p, significant difference.
Overview of all 48 target-bearing words and the experimental sentences in which they were embedded.
| Carrier sentence | Target-bearing word |
|---|---|
| The documentary about poverty interested the common viewers. | common |
| The material for the tires was tested by the car manufacturer. | car |
| The statement of the crown witness led to the pickpocket’s arrest. | crown |
| The area near the coast responded to the tornado warning. | coast |
| The young man on the corner was wearing the paper hat. | paper |
| The company of tap dancers negotiated with the pop star’s agent. | pop star |
| The ducklings in the pond were fighting for the cake crumbs. | pond |
| The woman with the parrot went into the teacher’s office. | parrot |
| The actions of the crew led to the test lab’s evacuation. | test lab |
| The rising price of packaging worried the tennis racket manufacturer. | tennis |
| The group of students from Thailand photographed the crocodile wrestler. | Thailand |
| The mayor of the town will check on his popularity ratings. | town |
| The members of the Pomo tribe lived off the cotton trade. | cotton |
| The entrepreneurs in the transport sector suffered most from the coalmine’s closure. | coalmine |
| The flavor of the coffee was ruined by the polluted water. | coffee |
| The investigator of the crime was shot by the taxi driver. | crime |
| The manager of the kitchen staff was upset about the pumpkin soup. | pumpkin |
| The visitors of the theme park came to look at the pygmy hippo’s young. | pygmy |
| The archaeologists from Poland went excavating in the Kalahari desert. | Poland |
| The man in the purple suit was welcomed by the tailor’s assistant. | purple |
| The farm with the cornfields was close to the toy factory. | toy |
| The article on prostate cancer was praised at the 12th Medical Congress. | twelfth |
| The reporter from | telegraph |
| The flight to the tropics crashed near Palm Island. | tropics |
| The bones of the triceratops were found by the Cuban archaeologist. | Cuban |
| The owner of the potato farm refused to go to the counselor’s party. | counselor |
| The villa with the carport must belong to the tobacco farmer. | carport |
| The member of the cabinet was involved in the plagiarism incident. | cabinet |
| The old man with the tattoos had worked as a prison guard. | prison |
| The mistress of the king was given a pearl necklace. | pearl |
| The members of parliament were outraged by the tabloid’s allegations. | parliament |
| The suspension of the pastor was confirmed at the college meeting. | pastor |
| The watcher on the patio saw the tournament finals. | tournament |
| The voice of the caller was hard to hear through the telephone connection. | telephone |
| The attendant at the toll booth looked through the passenger window. | toll |
| The chauffeur of the teenager refused to pick up her cousin’s dog. | teenager |
| The shed for the tools was built in the cabin’s garden. | cabin |
| The children in the primary school were drawing on the crayon boxes. | crayon |
| The remains of the camp were found by the tiger hunter. | camp |
| The personnel officer of the company interviewed the polo player. | company |
| The demonstrators at the train station were arrested by the police officer. | police |
| The box with the contraband was hidden in the parking lot. | parking |
| The group of tourists from Prague received information about the Tate Gallery. | Prague |
| The owner of the pawn shop checked the customer’s items. | pawn |
| The shed near the park ranger’s house was made of teak wood. | teak |
| The program about cocaine was meant for television broadcast. | television |
| The help of the tutor was appreciated by the pupil’s mother. | tutor |
| The group of turtle enthusiasts wrote about Korean mythology. | turtle |
The target phoneme is the first phoneme of the target-bearing word.
Overview of all 48 filler sentences; words in capitals denote words carrying sentence accent.
| Filler sentence | Target phoneme | Target-bearing word |
|---|---|---|
| The villagers were against the expansion of the CAMPSITE. |
| campsite |
| The second-hand CAR was bought by the geology student. |
| car |
| The PAINTINGS in the gallery turned out to be forgeries. |
| paintings |
| The ambassador’s wife wanted to order a new TABLE. |
| table |
| The value of the bonds was altered with the devalued CURRENCY. |
| currency |
| The delegation from the USA established PEACE between the Arab nations. |
| peace |
| The PERPETRATOR of the bombing was a member of the Afghan army. |
| perpetrator |
| The TOURISTS in the underground were greatly amused by the Japanese graffiti. |
| tourists |
| The participant in the running relay race sprained her KNEE. |
| * |
| His OLD Volvo broke down while he was on holiday in Kent. |
| * |
| The victim of the gang’s vicious attack had cuts on his BACK. |
| * |
| The FLUTE ensemble performed in the school building. |
| * |
| The mother of three daughters wrote to the BOARDING school. |
| * |
| The STUDENT from Australia went to the book signing. |
| * |
| The General of the Army was killed during the CIVIL uprising. |
| * |
| The bridge nearest the CITY was sabotaged by the Libyan army. |
| * |
| The CAPTAIN of the expedition refused to turn back to the nearest harbor. |
| captain |
| The blonde woman won a prize at the CONTEST. |
| contest |
| The intern at the lab researched the rat’s sleeping PATTERN. |
| pattern |
| The army officer was not happy about his POSTING to the desert war. |
| posting |
| The PEOPLE on the square rallied against gay marriage. |
| people |
| The figure skating TEAM from Brazil lost the world championship. |
| team |
| The cricket grounds of the village were maintained by the TEST match player. |
| test |
| The rich playboy’s TIME was mostly spent on his luxury yacht. |
| time |
| The director knew nothing of the SCANDAL’s outcome. |
| * |
| The celebrity was arrested for driving under INFLUENCE. |
| * |
| The board of the health center went to the conference on DONOR kidneys. |
| * |
| The members of the Women’s Association held a collection for the VICTIMS of the flood. |
| * |
| The personnel manager of the department store fired the LAZY salesman. |
| * |
| The university vice-chancellor’s opinion was reported on the EVENING news. |
| * |
| The association of online shoppers objected to the NEW delivery procedure. |
| * |
| The entrance to the dormitory was located near the BAPTIST church |
| * |
| The CHECKING of the ballots was interrupted when the machine failed. |
| * |
| The ROAD to the bridge was flooded by the rain. |
| * |
| The SCIENCE students listened to the violin concerto by Beethoven. |
| * |
| The Californian SENATOR proposed the motion to dismiss. |
| * |
| The ATTITUDE of the businessman aroused his associates’ anger. |
| * |
| The head of staff of the BANK was facing a dollar shortage. |
| * |
| The FATHER of the deaf child was given instructions on the baby monitor’s use. |
| * |
| The report of the SKIING accident was aired on the evening news. |
| * |
| The old folks’ CLUB had gone on its regular Friday afternoon bus trip. |
| club |
| The CLUB of derby fans attended the new driver’s first race. |
| club |
| The members of the board asked about the budget CUTS. |
| cuts |
| The chicken farmers voted against the minister’s new fertilizer POLICY. |
| policy |
| The PRIEST from Denver did not attend the Easter celebration. |
| priest |
| The Basque TERRORISTS were responsible for the assassination. |
| terrorists |
| The residents of the neighborhood were annoyed by the noisy TRAFFIC. |
| traffic |
| The brain surgeon was unable to remove the smallest of the TUMORS. |
| tumors |
The column denoted “Target phoneme” indicates the phoneme displayed on the computer screen. The “*” in the column “Target-bearing word” indicates that no target phoneme was present.
Figure 1.Original signal waveform, energy, F0, and spectral tilt for the deaccented condition of the sentence The suspension of the /p/astor was confirmed at the college meeting.
Vertical solid lines (red) demarcate the target word and vertical dashed lines (black) the target phoneme /p/. Horizontal dashed lines (black) indicate the feature values during unvoiced segments.
F0, fundamental frequency.
Figure 2.Boxplots for the target words for mean, standard deviation, max, and range of energy, F0, and spectral tilt.
Standard Dev., standard deviation; Max, maximum; F0, fundamental frequency.
Effect size d for the differences between the two prosodic conditions for the word-level acoustic descriptors max, mean, standard deviation, and range, and for the target bearing-words.
| Maximum | Mean | |||||||
|---|---|---|---|---|---|---|---|---|
| Clean | +5 dB | 0 dB | –5 dB | Clean | +5 dB | 0 dB | −5 dB | |
| Energy |
|
|
|
|
| 0.29 |
| 0.16 |
| F0 |
|
|
| 0.35 |
|
|
|
|
| Spectral tilt |
|
|
|
|
| 0.33 | 0.35 |
|
| Duration | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 |
| SD | Range | |||||||
| Clean | +5 dB | 0 dB | −5 dB | Clean | +5 dB | 0 dB | −5 dB | |
| Energy |
|
|
|
|
|
|
|
|
| F0 |
|
|
|
|
|
|
|
|
| Spectral tilt |
|
|
|
|
|
|
|
|
| Duration | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 | 0.17 |
The results are presented for clean speech and for three noise levels (+5, 0, −5 dB SNR). Values in bold indicate statistically significant differences between the two prosodic conditions.
dB, decibel(s); SD, standard deviation; SNR, signal/noise ratio.
p < 0.05, **p < 0.01, ***p < 0.001.
Figure 3.The proportion of detected target phonemes for the four listener groups and the four listening conditions, plotted per language group.
The deaccented condition is represented with the rounded markers and the accented condition with the squared markers.
Figure 4.The proportion of detected target phonemes for the four listener groups plotted per listening condition, aggregated over the two prosodic conditions.
SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 1.904 | 0.313 |
|
| Prosodic condition | 0.440 | 0.097 |
|
| Listening condition: | 0.087 | 0.161 | 0.588 |
| Listening condition: | −0.213 | 0.179 | 0.236 |
| Listening condition: | −0.899 | 0.193 |
|
| Language: Finnish | −0.304 | 0.207 | 0.142 |
| Language: French | −0.163 | 0.235 | 0.487 |
| Language: Dutch | −0.525 | 0.209 |
|
n = 5312 observations. Bold indicates a significant effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses with acoustic measures, for all listener groups together.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 2.211 | 0.417 |
|
| Prosodic condition | 0.286 | 0.142 |
|
| Listening condition: | 0.266 | 0.182 | 0.144 |
| Listening condition: | −0.060 | 0.198 | 0.763 |
| Listening condition: | −0.715 | 0.210 |
|
| Maximum energy | 0.162 | 0.120 | 0.176 |
| Language: Finnish | −0.368 | 0.212 | 0.082[ |
| Language: French | −0.194 | 0.243 | 0.425 |
| Language: Dutch | −0.517 | 0.217 |
|
| Duration | −0.263 | 0.122 |
|
| Listening condition: | −0.480 | 0.161 |
|
| Listening condition: | −0.025 | 0.144 | 0.860 |
| Listening condition: | 0.227 | 0.139 | 0.103 |
| Language: Finnish × duration | 0.258 | 0.134 | 0.054[ |
| Language: French × duration | 0.302 | 0.167 | 0.070[ |
| Language: Dutch × duration | −0.059 | 0.132 | 0.656 |
n = 5162. Bold indicates a significant effect; * indicates a marginal effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses with acoustic measures, for the native English listener group.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 3.006 | 0.268 |
|
| Listening condition: | 0.351 | 0.348 | 0.313 |
| Listening condition: | 0.037 | 0.315 | 0.907 |
| Listening condition: | −1.152 | 0.269 |
|
| Maximum energy | 0.267 | 0.220 | 0.224 |
| Listening condition: | −0.696 | 0.360 | 0.053[ |
| Listening condition: | −0.095 | 0.303 | 0.754 |
| Listening condition: | 0.253 | 1.178 | 0.239 |
n = 1472. Bold indicates a significant effect; * indicates a marginal effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses with acoustic measures, for the Dutch non-native listener group.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 2.596 | 0.397 |
|
| Listening condition: | 0.299 | 0.317 | 0.345 |
| Listening condition: | −0.399 | 0.361 | 0.269 |
| Listening condition: | −0.874 | 0.404 |
|
| Mean energy | 0.201 | 0.088 |
|
| Duration | −0.322 | 0.098 |
|
n = 1319. Bold indicates a significant effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses with acoustic measures, for the Finnish non-native listener group.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 2.646 | 0.292 |
|
| Listening condition: | −0.095 | 0.263 | 0.718 |
| Listening condition: | −0.101 | 0.279 | 0.718 |
| Listening condition: | −0.856 | 0.272 |
|
| Maximum F0 | 0.0355 | 0.105 |
|
n = 1472. Bold indicates a significant effect; * indicates a marginal effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s).
Fixed effect estimates for the best-fitting models of performance for the target phoneme detection accuracy analyses with acoustic measures, for the French non-native listeners.
| Fixed effect | β |
|
|
|---|---|---|---|
| Intercept | 2.602 | 0.350 |
|
| Listening condition: | 0.950 | 0.492 | 0.054[ |
| Listening condition: | 0.191 | 0.338 | 0.564 |
| Listening condition: | −0.507 | 0.314 | 0.106 |
| LexTale | 0.225 | 0.230 | 0.328 |
| Maximum energy | 0.193 | 0.341 | 0.571 |
| Maximum F0 | 0.107 | 0.466 | 0.818 |
| Mean F0 | 0.101 | 0.391 | 0.797 |
| Mean tilt | −0.518 | 0.371 | 0.162 |
| LexTale × maximum energy | 0.613 | 0.263 |
|
| LexTale × mean F0 | 0.862 | 0.340 |
|
| LexTale × mean tilt | −0.875 | 0.266 |
|
n = 840. Bold indicates a significant effect; * indicates a marginal effect.
SE, standard error; p, significant difference; SNR, signal/noise ratio; dB, decibel(s); LexTale, standardized test of vocabulary knowledge.