| Literature DB >> 28626493 |
Jelena Krivokapić1,2, Mark K Tiede2, Martha E Tyrone2,3.
Abstract
The primary goal of this work is to examine prosodic structure as expressed concurrently through articulatory and manual gestures. Specifically, we investigated the effects of phrase-level prominence (Experiment 1) and of prosodic boundaries (Experiments 2 and 3) on the kinematic properties of oral constriction and manual gestures. The hypothesis guiding this work is that prosodic structure will be similarly expressed in both modalities. To test this, we have developed a novel method of data collection that simultaneously records speech audio, vocal tract gestures (using electromagnetic articulometry) and manual gestures (using motion capture). This method allows us, for the first time, to investigate kinematic properties of body movement and vocal tract gestures simultaneously, which in turn allows us to examine the relationship between speech and body gestures with great precision. A second goal of the paper is thus to establish the validity of this method. Results from two speakers show that manual and oral gestures lengthen under prominence and at prosodic boundaries, indicating that the effects of prosodic structure extend beyond the vocal tract to include body movement.Entities:
Keywords: EMA; Prosodic boundaries; Vicon; electro- magnetic articulometry; gestures; motion capture; prosodic prominence; speech production
Year: 2017 PMID: 28626493 PMCID: PMC5472837 DOI: 10.5334/labphon.75
Source DB: PubMed Journal: Lab Phonol ISSN: 1868-6346
Question-answer pairs for Experiment 1. The target word is Bob and the constrictions of interest are the two /b/ consonants.
| Condition | Context question | Answer |
|---|---|---|
| Does Lenny want to see Bob? | Anna [wants to see Bob]deaccented. | |
| What is going on?v | [Anna wants to see Bob]broad. | |
| Who does Anna want to see? | Anna wants to see [Bob]narrow. | |
| Does Anna want to see Mary? | Anna wants to see [Bob]contrastive. |
Stimuli for Experiment 2 for the target word MIma. The target boundary is before MIma, and the relevant constriction is the first /m/ (underlined here but not when the stimuli were presented to participants).
| Condition | Utterance |
|---|---|
| 1. word | There are other things. I saw |
| 2. ip | Mary would like to see Shaw, |
| 3. IP | There are other things I saw. |
Stimuli for Experiment 3 for the target word DIdad. The target boundary is after DIdad, and the relevant constriction is the last /d/ (underlined here but not in the experiment).
| Condition | Utterance |
|---|---|
| 1. word | Mary would like to get the new |
| 2. ip | Mary would like to get the new DIdad, Ynette, and Bobby for her birthday. |
| 3. IP | Mary would like to get the new DIdad. In Ette this would be quite easy. |
Figure 1Movement tracking. EMA sensors on the left and motion capture markers on the right.
Figure 2Experimental setup.
Number of tokens excluded from the analysis for Experiments 2 and 3.
| Boundary | Experiment 1 (phrase initial) | Experiment 2 (phrase final) | ||
|---|---|---|---|---|
|
| ||||
| Speaker 1 | Speaker 2 | Speaker 1 | Speaker 2 | |
| word | 4 excluded (IP boundary produced) | 8 excluded (IP boundary produced) | 0 | 17 sentences produced with IP boundary (condition excluded) |
| Ip/IP1 | 2 excluded (incorrect target word/disfluency) | 1 (incorrect target word) | 0 | 0 |
| IP/IP2 | 3 excluded (incorrect target word/disfluency) | 1 (disfluency) | 0 | 0 |
Figure 3Labeling example for Bob. The identified landmarks shown here are (for /b/): gesture onset (left edge of the box), nucleus onset (left edge of the shaded box), maximum constriction (dashed line), nucleus offset (right end of the shaded box), gesture offset (right end of the box). LA: lip aperture trajectory and velocity, FING: finger vertical displacement trajectory and tangential velocity.
Examined variables.
| Speech variable | Description | Pointing gesture variable | Description |
|---|---|---|---|
| CLOSEDUR | duration of the constriction closing movement (from onset to maximum constriction) | POINTINGDUR | duration of the pointing movement (from onset to maximum finger displacement) |
| CLOSEDURACC | constriction closing movement acceleration duration (from onset to peak velocity) | POINTINGDURACC | pointing movement accelera- tion duration (from onset to peak velocity) |
| OPENDUR | duration of the constriction opening movement (from maximum constriction to gesture release) | RETURNDUR | duration of the return movement (from maximum finger displacement to gesture release) |
| OPENDURACC | constriction opening movement acceleration duration (from maximum constriction to peak velocity) | RETURNDURACC | return movement acceleration duration (from maximum finger displacement to peak velocity) |
Results for prominence lengthening, Speaker 1. Means (SE) in ms., ANOVA, Fisher’s PLSD.
| C1 LA closing movement duration (CLOSEDUR) | |
|---|---|
| broad = 86.36 (3.4) | contrastive, broad: |
| narrow = 92.73 (3.4) | contrastive, narrow: |
| contrastive = 116.67 (3.25) | contrastive > broad, narrow |
| broad = 134.55 (5.31) | contrastive, broad: |
| narrow = 145.46 (5.31) | contrastive > broad |
| contrastive = 156.67 (5.08) | |
| broad = 75.45 (4.51) | contrastive, broad: |
| narrow = 81.82 (4.51) | contrastive, narrow: |
| contrastive = 95.83 (4.31) | contrastive > broad, narrow |
| broad = 117.5 (3.99) | broad, narrow: |
| narrow = 102.5 (3.99) | broad > narrow |
| contrastive = 111.82 (3.41) | |
| broad = 37.47 (3.88) | contrastive, broad: |
| narrow = 53.2 (3.63) | narrow, broad: |
| contrastive = 58.51 (3.1) | contrastive, narrow > broad |
| broad = 515.65 (10.64) | contrastive, broad: |
| narrow = 525.91 (10.64) | contrastive, narrow: |
| contrastive = 561.2 (10.19) | contrastive > broad, narrow |
Results for prominence lengthening, Speaker 2. Means (SE) in ms., ANOVA, Fisher’s PLSD.
| C1 LA closing movement duration (CLOSEDUR) | |
|---|---|
| deaccented = 151.57 (10.7) | contrastive, deaccented: |
| broad = 167.5 (10.7) | narrow, deaccented: |
| narrow = 191.67 (10.7) | contrastive, narrow > deaccented |
| contrastive = 194.17 (10.7) | |
| deaccented = 167.5 (9.4) | contrastive, deaccented: |
| broad = 190 (9.4) | contrastive, broad: |
| narrow = 230.83 (9.4) | narrow, deaccented: p = 0.0001 |
| contrastive = 262.5 (9.4) | narrow, broad: |
| contrastive, narrow: | |
| contrastive > narrow > deaccented, broad | |
| deaccented = 111.67 (8.57) | contrastive, deaccented: |
| broad = 138.33 (8.57) | contrastive, broad: |
| narrow = 166.67 (8.57) | narrow, deaccented: |
| contrastive = 198.33 (8.57) | contrastive, narrow: |
| narrow, broad: | |
| broad, deaccented: | |
| contrastive > narrow > broad > deaccented | |
| deaccented = 110 (2.88) | contrastive, broad: |
| broad = 105 (2.88) | contrastive, deaccented: |
| narrow = 123.33 (2.88) | narrow, broad: |
| contrastive = 122.5 (2.88) | narrow, deaccented: |
| contrastive, narrow > broad, deaccented | |
| deaccented = 512.5 (21.02) | contrastive, broad: |
| broad = 483.65 (21.02) | contrastive, deaccented: |
| narrow = 574.88 (21.02) | narrow, broad: |
| contrastive = 615.95 (21.02) | narrow, deaccented: |
| contrastive, narrow > broad, deaccented | |
| deaccented = 297.5 (27.59) | contrastive, broad: |
| broad = 265 (27.59) | contrastive, deaccented: |
| narrow = 353.62 (27.59) | narrow, broad: |
| contrastive = 431.67 (27.59) | contrastive > broad, deaccented |
| narrow > broad | |
| deaccented = 170 (21.46) | contrastive, deaccented: |
| broad = 172.5 (21.46) | contrastive, broad: |
| narrow = 179.17 (21.46) | contrastive, narrow: |
| contrastive = 264.17 (21.46) | contrastive > narrow, broad, deaccented |
Figure 4Labeling example for miMA, for the utterance, “There are other things. I saw miMA being stolen in broad daylight by a cop”. The identified landmarks shown here are (for /m/): gesture onset (left edge of the box), nucleus onset (left edge of the shaded box), maximum constriction (dashed line), nucleus offset (right end of the shaded box), gesture offset (right end of the box). LA: lip aperture trajectory and velocity. FING: finger vertical displacement trajectory and tangential velocity.
Results for phrase-initial lengthening, Speaker 1. Means (SE) in z-scores, ANOVA, Fisher’s PLSD.
| LA closing movement duration (CLOSEDUR) | |
|---|---|
| Word = –0.61 (0.2) | Word, IP1: |
| IP1 = 0.56 (0.19) | Word, IP2: |
| IP2 = 0.05 (0.19) | IP2, IP1 > Word |
| Word = –0.45 (0.21) | Word, IP1: |
| IP1 = 0.25 (0.2) | Word, IP2: |
| IP2 = 0.18 (0.21) | IP2, IP1 > Word |
| Word = –0.51 (0.20) | Word, IP1: |
| IP1 = 0.07 (0.19) | Word, IP2: |
| IP2 = 0.31 (0.20) | IP2, IP1 > Word |
Results for phrase-initial lengthening, Speaker 2. Means (SE) in z-scores, ANOVA, Fisher’s PLSD.
| LA closing movement duration (CLOSEDUR) | |
|---|---|
| Word = –0.97 (0.2) | Word, IP1: |
| IP1 = 0.38 (0.17) | Word, IP2: |
| IP2 = 0.2 (0.17) | IP2, IP1 > Word |
| Word = –0.6 (0.23) | Word, IP1: |
| IP1 = 0.09 (0.19) | Word, IP2: |
| IP2 = 0.33 (0.19) | IP2, IP1 > Word |
| Word = –0.61 (0.19) | Word, IP1: |
| IP1 = 0.46 (0.15) | IP1, IP2: |
| IP2 = –0.21 (0.15) | IP1 > IP2, Word |
| Word = –0.66 (0.21) | Word, IP1: |
| IP1 = 0.68 (0.17) | Word, IP2: |
| IP2 = –0.22 (0.17) | IP1 > IP2, Word |
| Word = –0.79 (0.2) | Word, IP1: |
| IP1 = –0.11 (0.17) | Word, IP2: |
| IP2 = 0.48 (0.17) | IP2, IP1: |
| IP2 > IP1 > Word | |
| Word = –0.51 (0.24) | Word, IP2: |
| IP1 = 0.046 (0.2) | IP2 > Word |
| IP2 = 0.31 (0.2) | |
Figure 5Labeling example for diDAD, for the utterance, “Mary would like to get the new diDAD. In Ette this would be quite easy”. The identified landmarks (for /d/) shown here are: gesture onset (left edge of the box), nucleus onset (left edge of the shaded box), maximum constriction (dashed line), nucleus offset (right end of the shaded box), gesture offset (right end of the box). TT: vertical tongue tip trajectory and vertical velocity. FING: finger vertical displacement and tangential velocity.
Results for phrase-final lengthening, both speakers. Means (SE) in z-scores, ANOVA, Fisher’s PLSD.
| TT opening movement duration OPENDUR (Speaker 1) | |
|---|---|
| Word = –0.3 (0.19) | Word, IP1: |
| IP1 = 0.42 (0.19) | IP1, IP2: |
| IP2 = –0.13 (0.19) | IP1 > Word, IP2 |
| Word = 0.01 (0.19) | IP1, IP2: |
| IP1 = 0.5 (0.19) | IP1 > IP2 |
| IP2 = –0.51 (0.19) | |
| Word = –1.17 (0.09) | Word, IP1: |
| IP1 = 0.14 (0.09) | Word, IP2: |
| IP2 = 0.96 (0.09) | IP1, IP2: |
| IP2 > IP1 > Word | |
| IP1 = –0.91 (0.11) | |
| IP2 = 0.77 (0.1) | IP2 > IP1 |