Julia Merrill, Pauline Larrouy-Maestri.
Abstract
Similarities and differences between speech and song are often examined. However, the perceptual definition of these two types of vocalization is challenging. Indeed, the prototypical characteristics of speech or song support top-down processes, which influence listeners' perception of acoustic information. In order to examine vocal features associated with speaking and singing, we propose an innovative approach designed to facilitate bottom-up mechanisms in perceiving vocalizations by using material situated between speech and song: speechsong. Twenty-five participants were asked to evaluate 20 performances of a speechsong composition by Arnold Schoenberg, "Pierrot lunaire" op. 21 from 1912, on 20 features of vocal-articulatory expression. Raters provided reliable judgments concerning the vocal features used by the performers and did not show strong appeal or specific expectations in reference to Schoenberg's piece. By examining the relationship between the vocal features and the impression of song or speech, the results confirm the importance of pitch (height, contour, range), but also point to the relevance of register, timbre, tension and faucal distance. Besides highlighting vocal features associated with speech and song, this study supports the relevance of the present approach of focusing on a theoretical middle category in order to better understand vocal expression in song and speech.
Keywords: categorization; speechsong; sprechgesang; vocal expression; voice
Year: 2017 PMID: 28744233 PMCID: PMC5504174 DOI: 10.3389/fpsyg.2017.01108
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
List of recordings.
| Conductor | Performer | Year of recording | Label and code |
| Angius, Marco | Rado, Livia | ca. 2010 | STR33962 |
| Atherton, David | Thomas, Mary | 1973 | Decca 4256262 |
| none | Goltz, Jennifer | ca. 2007 | MSR Classics MS1208 |
| Boulez, Pierre | Minton, Yvonne | 1977 | B00000281B |
| Boulez, Pierre | Pilarczyk, Helga | 1961 | WER6778-2 |
| Boulez, Pierre | Schäfer, Christine | 1997 | E4576302 |
| Ceccanti, Mauro | Bergamasco, Sonia | 1997 | Arts 47389-2 |
| Craft, Robert | Silja, Anja | ca. 2000 | Naxos 8557523 |
| de Leeuw, Reinbert | Sukowa, Barbara | 1988 | Koch Schwann 310117 |
| Engelen, Robin | Janssen, Jaqueline | 2003 | FUG504 |
| Gould, Glenn | Rideout, Patricia | 1974 | 74645266428 |
| Gourzi, Konstanzia | Doufexis, Stella | ca. 2007 | NEOS10709 |
| Herreweghe, Philippe | Pousseur, Marianne | 1991 | HMA1951390 |
| Leibowitz, René | Semser, Ethel | ca. 1954 | BAM LD 016 |
| Rattle, Simon | Manning, Jane | 1977 | CHAN6534 |
| Rufer, Josef | Burmester, Irmen | 1949 | Audite 21.412 |
| Schönberg, Arnold | Stiedry-Wagner, Erika | 1940 | CBS MPK 45695 |
| Sinopoli, Giuseppe | Castellani, Luisa | 1997 | Teldec 3984-22901-2 |
| Weisberg, Arthur | DeGaetani, Jan | 1970 | H-71251 |
| Zender, Hans | Kammer, Salome | 1994 | MDG 613 0579-2 |
List of recordings with conductor, performer, year of recording, label and code.
Questionnaire on vocal-articulatory expression.
| Pitch | Average pitch | Low | High |
| | Pitch variability | Inflected | Monotone |
| | Pitch range | Wide | Narrow |
| | Pitch changes | Sudden | Continuous |
| Loudness | Loudness range | Wide | Narrow |
| Sound of voice | Resonance | Full | Thin |
| | Timbre | Dark | Bright |
| | Faucal distance | Wide | Constricted |
| | Vocal onset, offset | Soft | Hard |
| | Modifications: ~ Variance | Varying | Constant |
| | ~ Range | Wide | Narrow |
| | ~ Changes | Sudden | Gradual |
| | Register | Chest voice | Head voice |
| | Noisiness | Breathy | |
| | | Pressed | |
| | | Creaky | |
| | | Harsh | |
| | Other modulations | Vibrato | |
| | | Tremolo (tremble) | |
| Articulation | Precision of articulation | Precise | Imprecise |
| | Vowel duration | Lengthened | Shortened |
| | Consonant duration | Lengthened | Shortened |
| Other | (Phonation) tension | Tense | Relaxed |
| | Mode of phonation | Speaking | Singing |
| | Rhythm | Staccato | Legato |
| | Overall tempo | Slow | Fast |
| | Blending of voice + flute | Dis-harmonic | Harmonic |
The five rational categories (1st column), namely pitch, loudness, sound of voice, articulation, and other, are divided into several features (2nd and 3rd column) describing the vocal expression; columns 3 and 4 give the bipolar characteristics. Features on noisiness and other modulations are unipolar.
Inter-rater agreement.
| Measure | Liking | Adequateness | Coherence | Profession |
| Proportion of positive and significant correlations | 8.33% | 20.00% | 32.67% | 39.00% |
| Median correlation coefficient | 0.519 | 0.530 | 0.565 | 0.655 |
| Mean correlation coefficient | 0.548 | 0.561 | 0.575 | 0.646 |
| Standard deviation | 0.107 | 0.098 | 0.092 | 0.126 |
Estimation of the agreement between the 25 raters (proportion of positive and significant Spearman correlation coefficient/phi coefficients, median correlation coefficient, mean correlation coefficient and standard deviation) when rating the general liking of each piece, the adequateness, the coherence of the interpretations, and the profession of the performers.
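As a minimal sketch of how such agreement statistics can be derived, the following Python code computes pairwise Spearman correlations between raters and summarizes them. It is not the authors' analysis script: the function names, the `ratings` layout, and the toy data are assumptions made for illustration, and the significance testing behind the "positive and significant" row is omitted (only the share of positive coefficients is reported).

```python
from itertools import combinations
from statistics import mean, median, stdev


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; assumes neither input is constant."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def agreement_summary(ratings):
    """Summarize all pairwise rater correlations.

    ratings: dict mapping a rater id to that rater's scores,
    one score per recording, in the same recording order.
    """
    coefs = [
        spearman(ratings[a], ratings[b])
        for a, b in combinations(sorted(ratings), 2)
    ]
    return {
        "n_pairs": len(coefs),
        "prop_positive": sum(c > 0 for c in coefs) / len(coefs),
        "median": median(coefs),
        "mean": mean(coefs),
        "sd": stdev(coefs),
    }


# Hypothetical toy data: three raters scoring five recordings.
toy_ratings = {
    "r1": [1, 2, 3, 4, 5],
    "r2": [2, 1, 4, 3, 5],
    "r3": [5, 4, 3, 2, 1],
}
summary = agreement_summary(toy_ratings)
```

With 25 raters, the same function would summarize 300 rater pairs per question. Whether the published median, mean, and standard deviation cover all pairs or only the significant ones is not stated in the caption, so the sketch simply uses all pairs.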
Regression analyses.
| Category | Feature | Liking β | Liking p | Liking direction | Adequateness β | Adequateness p | Adequateness direction | Coherence β | Coherence p | Coherence direction | Profession β | Profession p | Profession direction |
| Pitch | Average pitch | −0.219 | 0.000 | −0.173 | −0.170 | 0.001 | −0.076 | −0.002 | 0.994 | | 0.105 | 0.654 | |
| | Variability | −0.183 | 0.000 | 0.012 | −0.093 | 0.075 | | 0.596 | 0.016 | −0.034 | −0.065 | 0.785 | |
| | Range | 0.008 | 0.844 | | −0.062 | 0.229 | | 0.589 | 0.020 | 0.048 | 0.409 | 0.110 | |
| | Changes | −0.090 | 0.039 | 0.063 | −0.139 | 0.007 | −0.167 | 0.020 | 0.932 | | 0.082 | 0.713 | |
| Loudness | Range | 0.070 | 0.087 | | 0.039 | 0.428 | | 0.100 | 0.657 | | 0.219 | 0.326 | |
| Sound of voice | Resonance | −0.133 | 0.003 | −0.432 | −0.073 | 0.164 | | 0.099 | 0.696 | | 0.359 | 0.141 | |
| | Timbre | −0.034 | 0.415 | | −0.010 | 0.837 | | 0.686 | 0.007 | 0.183 | −0.085 | 0.704 | |
| | Faucal distance | −0.098 | 0.023 | −0.340 | 0.042 | 0.413 | | 0.774 | 0.001 | 0.308 | 0.226 | 0.333 | |
| | Sound | −0.103 | 0.016 | −0.162 | 0.085 | 0.091 | | −0.009 | 0.972 | | 0.569 | 0.020 | 0.101 |
| | Variance | −0.137 | 0.002 | −0.002 | −0.010 | 0.843 | | 0.166 | 0.487 | | 0.100 | 0.671 | |
| | Range | 0.0024 | 0.560 | | 0.026 | 0.602 | | −0.431 | 0.103 | | 0.245 | 0.319 | |
| | Changes | −0.042 | 0.351 | | −0.025 | 0.641 | | 0.391 | 0.121 | | 0.342 | 0.168 | |
| | Register | −0.039 | 0.345 | | −0.070 | 0.158 | | 0.400 | 0.083 | | −0.182 | 0.398 | |
| Articul. | Precision | −0.008 | 0.842 | | −0.174 | 0.000 | −0.161 | 0.092 | 0.689 | | −0.329 | 0.149 | |
| | Vowel duration | −0.106 | 0.006 | 0.272 | 0.033 | 0.467 | | 0.629 | 0.006 | −0.205 | −0.047 | 0.823 | |
| | Cons. duration | −0.004 | 0.926 | | −0.093 | 0.051 | 0.159 | 0.126 | 0.628 | | 0.093 | 0.719 | |
| | Tension | 0.010 | 0.798 | | 0.000 | 0.992 | | 0.410 | 0.086 | | 0.241 | 0.309 | |
| | Mode of phonation | −0.113 | 0.002 | 0.076 | −0.191 | 0.000 | −0.080 | 0.150 | 0.468 | | −1.303 | 0.000 | −0.408 |
| | Rhythm | 0.042 | 0.297 | | 0.017 | 0.725 | | −0.094 | 0.700 | | −0.392 | 0.088 | |
| | Tempo | −0.074 | 0.070 | | 0.020 | 0.673 | | −0.204 | 0.386 | | 0.235 | 0.311 | |
Beta-weights and p-values of the specific items included in the two linear and the two logistic regression analyses, performed separately for each general question (i.e., liking, adequateness, coherence, and profession). Dark cells represent significant effects (p < 0.05), gray cells represent marginal effects (0.05 < p < 0.10) and white cells correspond to non-significant predictors (p > 0.10). The "direction" columns contain the correlation coefficients between the vocal expression and the general question. If the significance level of p < 0.05 is reached (marked with an asterisk), the sign of the correlation coefficient indicates the direction of the relation between vocal features and the general question. Note that non-significant correlations reflect the non-specificity of direction of the vocal features influencing the general rating. For Direction: *p < 0.05.
Figure 1. Features associated with song and speech. Illustration of the significant correlations between the different features (register, average pitch, pitch range, pitch variability, timbre, faucal distance, and tension; y-axis) and mode of phonation (spoken vs. sung, around "just about right," JAR; x-axis).
Cross-table pitch changes.
| Count | 27 | 23 | 32 | 82 | ||
| Count | 59 | 108 | 49 | 216 | ||
| Count | 34 | 94 | 71 | 199 | ||
| Total | Count | 120 | 225 | 152 | 497 | |
Cross-table (Count vs. Expected count) for the feature mode of phonation in comparison to pitch changes. The JAR-ratings were not included in the Chi-squared tests and are therefore depicted in gray.
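The expected counts mentioned in the caption did not survive extraction, but under independence they follow from the marginal totals alone. The sketch below is illustrative rather than the authors' procedure: the function names are assumptions, the `toy` table holds hypothetical counts, and only the marginal totals are taken from the cross-table.

```python
def expected_counts(row_totals, col_totals):
    """Expected cell counts under independence: row total * column total / N."""
    grand_total = sum(row_totals)
    if grand_total != sum(col_totals):
        raise ValueError("row and column totals must sum to the same N")
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square(observed):
    """Pearson chi-squared statistic for a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    expected = expected_counts(row_totals, col_totals)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )


# Marginal totals as printed in the cross-table (rows, then columns).
expected = expected_counts([82, 216, 199], [120, 225, 152])

# Hypothetical 2x2 counts, only to show the statistic being computed.
toy = [[10, 20], [30, 40]]
toy_statistic = chi_square(toy)
```

Because the JAR ratings were excluded from the reported tests, the statistic in the study would be computed on the reduced table without the JAR row and column; which row and column those are cannot be read from the extracted table, so the sketch does not attempt that step.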