| Literature DB >> 36137111 |
Albert Lee1, Faith Chiu2, Yi Xu3.
Abstract
This study explored the contexts in which native Japanese listeners have difficulty identifying prosodic focus. Using a 4AFC identification task, we compared native Japanese listeners' focus identification accuracy in different lexical accent × focus location conditions using resynthesised speech stimuli, which varied only in fundamental frequency. Experiment 1 compared the identification accuracy in lexical accent × focus location conditions using both natural and resynthesised stimuli. The results showed that focus identification rates were similar with the two stimulus types, thus establishing the reliability of the resynthesised stimuli. Experiment 2 explored these conditions further using only resynthesised stimuli. Narrow foci bearing the lexical pitch accent were always more correctly identified than unaccented ones, whereas the identification rate for final focus was the lowest among all focus locations. From these results, we argue that the difficulty of focus perception in Japanese is attributed to (i) the blocking of PFC by unaccented words, and (ii) similarity in F0 contours between lexical pitch accent and narrow focus, including in particular the similarity between downstep and PFC. Focus perception is therefore contingent on other concurrent communicative functions which may sometimes take precedence in a +PFC language.Entities:
Year: 2022 PMID: 36137111 PMCID: PMC9499294 DOI: 10.1371/journal.pone.0274176
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Averaged F0 contours of initial / penultimate / final narrow vs. neutral × accented vs. unaccented focus [data from 9].
Stimuli used in the present study [adapted from 9].
For easy illustration, here the accusative case marker–o (which collocates with mita ‘saw’) and the dative case marker–ni (which collocates with nita ‘resembled) are presented as though belonging to Word III; syntactically they are part of Word II.
| Word I | Word II | Word III | ||||
|---|---|---|---|---|---|---|
| Short | Accented | Me’i-ga | × | mo’mo | × | -o mi’ta |
| Unaccented | Mei-ga | momo | -ni nita | |||
| Long | Accented | Mu’umin-ga | × | bu’dou | × | -o mi’ta |
| Unaccented | Noumin-ga | budou | -ni nita | |||
Fig 2Example of functional labelling using PENTAtrainer.
Fig 3F0 contour of a natural vs. corresponding synthesised utterance (me’i-ga mo’mo-o mi’ta) in neutral focus.
X-axis shows actual time.
Fig 4Mean identification accuracy by focus condition and accent condition of the focused word (or of Word 1 in case of neutral focus).
Mean identification accuracy (chance = 25%) by accent combination and focus location.
Accent combinations are in descending order of mean identification accuracy. Cell colouring signals the percentage of correct responses (25% < green < 50% < yellow < 75% < red) (colour online).
| Accuracy (%) | ||||||
|---|---|---|---|---|---|---|
| Accent comb. | Initial | Medial | Final | Neutral |
|
|
|
| 79 | 63 | 38 | 46 | 56 | 18 |
|
| 77 | 29 | 44 | 44 | 48 | 20 |
|
| 27 | 52 | 50 | 60 | 47 | 14 |
|
| 33 | 35 | 38 | 75 | 45 | 20 |
|
| 63 | 48 | 27 | 42 | 45 | 15 |
|
| 69 | 21 | 17 | 58 | 41 | 26 |
|
| 21 | 38 | 10 | 77 | 36 | 29 |
|
| 23 | 27 | 25 | 54 | 32 | 15 |
Model summary: Correct response ~ Focus + Accent + FocusAccent + Focus × FocusAccent + (Focus | Group:Subject).
Pr(>|z|)-values: * < .05, ** < .01, *** < .001. In the parameters, ‘I’ = Initial focus, ‘M’ = Medial, ‘F’ = Final, ‘N’ = Neutral, ‘A’ = Accented, ‘U’ = Unaccented.
| Parameters | Fixed | Random | |||
|---|---|---|---|---|---|
|
|
|
| Participant | ||
| SD | |||||
| (Intercept) | -.072 | .555 | -.129 | 1.977 | |
| Focus (Final) | -1.893 | .683 | -2.771 | ** | 2.548 |
| Focus (Initial) | -.752 | .665 | -1.131 | 2.515 | |
| Focus (Medial) | -1.122 | .647 | -1.735 | . | 2.455 |
| Accent (AAA) | 1.044 | .292 | 3.579 | *** | |
| Accent (AAU) | .410 | .285 | 1.437 | ||
| Accent (AUA) | .884 | .306 | 2.888 | ** | |
| Accent (AUU) | .485 | .303 | 1.600 | ||
| Accent (UAA) | .874 | .265 | 3.303 | *** | |
| Accent (UUA) | .514 | .283 | 1.819 | . | |
| Accent (UUU) | 1.134 | .273 | 4.152 | *** | |
| FocusAccent | -1.358 | .325 | -4.176 | *** | |
| Focus (F) x FocusAccent | 1.674 | .446 | 3.755 | *** | |
| Focus (I) x FocusAccent | 3.896 | .399 | 9.760 | *** | |
| Focus (M) x FocusAccent | 2.351 | .429 | 5.485 | *** | |
Confusion matrix summarizing mean response rates in Experiment 2.
| Response | |||||
|---|---|---|---|---|---|
| Initial | Medial | Final | Neutral | ||
| Target | Initial |
| 10% | 4% | 38% |
| Medial | 15% |
| 5% | 41% | |
| Final | 13% | 17% |
| 39% | |
| Neutral | 18% | 20% | 5% |
| |