Marina Scheumann, Anna-Elisa Roser, Wiebke Konerding, Eva Bleich, Hans-Jürgen Hedrich, Elke Zimmermann.
Abstract
INTRODUCTION: Human speech communicates not only linguistic information but also paralinguistic features, e.g. information about the identity and the arousal state of the sender. Comparable morphological and physiological constraints on vocal production in mammals suggest that sender identity and arousal state are encoded in similar ways across mammals. To explore this hypothesis and to investigate whether specific acoustic parameters encode sender identity while others encode arousal, we studied infants of the domestic cat (Felis silvestris catus). Kittens are an excellent model for analysing vocal correlates of sender identity and arousal. They depend strongly on the care of their mother, so the acoustic conveyance of sender identity and arousal may be important for their survival.
Year: 2012 PMID: 23259698 PMCID: PMC3551667 DOI: 10.1186/1742-9994-9-36
Source DB: PubMed Journal: Front Zool ISSN: 1742-9994 Impact factor: 3.172
Description of measured acoustic parameters
| Parameter | Description |
|---|---|
| Call duration [ms] | Time between the onset and the offset of a call. |
| ICI [ms] | Time between the offset of a call and the onset of the successive call. |
| Peaktime [ms] | Time between the onset and the maximum amplitude of a call. |
| MeanF0 [Hz] | Mean fundamental frequency of a call. |
| MinF0 [Hz] | Minimum fundamental frequency of a call. |
| MaxF0 [Hz] | Maximum fundamental frequency of a call. |
| SDF0 [Hz] | Standard deviation of the fundamental frequency of a call. |
| Peak [Hz] | Frequency with maximum energy over a call. |
| MeanF1 [Hz] | Mean frequency of the first formant of a call. |
| SDF1 [Hz] | Standard deviation of the first formant frequency of a call. |
| BWF1 [Hz] | Bandwidth of the first formant frequency of a call. |
| MeanF2 [Hz] | Mean frequency of the second formant of a call. |
| SDF2 [Hz] | Standard deviation of the second formant frequency of a call. |
| BWF2 [Hz] | Bandwidth of the second formant frequency of a call. |
| MeanF3 [Hz] | Mean frequency of the third formant of a call. |
| SDF3 [Hz] | Standard deviation of the third formant frequency of a call. |
| BWF3 [Hz] | Bandwidth of the third formant frequency of a call. |
| F2–F1 [Hz] | Difference between the mean of the second and the first formant frequency. |
| Consistency | Mean maximum correlation of power spectra of successive 25 ms time steps of a call. |
| Cepstral peak [V] | Value of the peak at the fundamental period of a cepstrum for the middle 10 ms of the call. |
| Voiced [%] | Percentage of voiced frames of a call. |
| MaxHNR [dB] | Maximum harmonic-to-noise ratio of a call. |
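
The temporal and F0 definitions above reduce to simple arithmetic once call boundaries and an F0 track are available. The following is a minimal Python sketch, not the authors' analysis pipeline: the function names, the `(onset_ms, offset_ms, peak_ms)` tuple layout, and the choice of the sample standard deviation are assumptions made for illustration.

```python
from statistics import mean, stdev

def temporal_parameters(calls):
    """Compute call duration, ICI and peaktime from annotated calls.

    calls: chronologically ordered list of (onset_ms, offset_ms, peak_ms)
    tuples (hypothetical layout, not taken from the source).
    """
    # Call duration: time between onset and offset of a call.
    durations = [offset - onset for onset, offset, _ in calls]
    # Peaktime: time between onset and the maximum amplitude of a call.
    peaktimes = [peak - onset for onset, _, peak in calls]
    # ICI: time between the offset of a call and the onset of the next one.
    icis = [calls[i + 1][0] - calls[i][1] for i in range(len(calls) - 1)]
    return durations, icis, peaktimes

def f0_parameters(f0_track_hz):
    """Summarise an F0 track (one value per analysis frame, in Hz)."""
    return {
        "MeanF0": mean(f0_track_hz),
        "MinF0": min(f0_track_hz),
        "MaxF0": max(f0_track_hz),
        # Assumption: sample SD; the source does not say which estimator was used.
        "SDF0": stdev(f0_track_hz),
    }
```

For example, two calls annotated as `(0, 500, 200)` and `(1500, 2100, 1800)` give durations of 500 and 600 ms, one ICI of 1000 ms, and peaktimes of 200 and 300 ms.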
Results of the one-way ANOVA testing for differences between individuals for each acoustic parameter and arousal condition, and the correlation coefficients with the three most important PCs for the DFA. LOW = low arousal condition; HIGH = high arousal condition. Bold p-values indicate a significant difference (p < 0.05); bold loading factors mark parameters with loading factors higher than 0.700 on the respective PC.
| | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Call duration [ms] | 6.632 | -.368 | .365 | .324 | 5.574 | -.147 | -.375 | .132 | ||
| ICI [ms] | 1.230 | 0.256 | -.080 | -.113 | -.218 | 1.894 | .255 | .168 | .124 | |
| Peaktime [ms] | 2.688 | -.203 | .056 | .487 | 3.978 | -.032 | -.412 | .145 | ||
| MeanF0 [Hz] | 25.331 | .047 | .304 | 20.199 | -.380 | .018 | ||||
| MinF0 [Hz] | 16.034 | -.149 | .050 | 10.574 | -.400 | .650 | -.312 | |||
| MaxF0 [Hz] | 27.394 | .213 | .426 | 17.166 | -.341 | .166 | ||||
| SDF0 [Hz] | 2.806 | -.361 | .363 | .380 | 5.921 | .012 | .371 | .602 | ||
| Peak [Hz] | 3.919 | .387 | .255 | .132 | 10.444 | -.236 | .071 | |||
| MeanF1 [Hz] | 8.305 | .207 | -.081 | 8.631 | -.677 | -.202 | .403 | |||
| SDF1 [Hz] | 3.170 | -.122 | .048 | 4.558 | -.047 | .172 | .614 | |||
| BWF1 [Hz] | 1.848 | -.015 | .582 | -.311 | 1.953 | .207 | .112 | .486 | ||
| MeanF2 [Hz] | 4.287 | -.668 | .078 | .326 | 11.260 | .315 | .174 | |||
| SDF2 [Hz] | 2.322 | -.147 | .289 | -.080 | 4.707 | .406 | -.133 | .414 | ||
| BWF2 [Hz] | 1.130 | .335 | -.065 | -.146 | .072 | 1.896 | .281 | -.143 | .031 | |
| MeanF3 [Hz] | 3.411 | -.538 | .022 | .047 | 5.251 | .658 | .170 | .046 | ||
| SDF3 [Hz] | 1.727 | .052 | -.247 | .306 | .102 | 2.442 | .107 | -.451 | .455 | |
| BWF3 [Hz] | 2.086 | -.152 | -.363 | .047 | 2.386 | -.031 | -.313 | .045 | ||
| F2-F1 [Hz] | 6.789 | -.385 | .333 | 14.675 | .329 | -.072 | ||||
| Consistency | 2.072 | .205 | -.474 | .195 | 5.135 | .140 | -.110 | -.695 | ||
| Cepstral peak [V] | 3.902 | .174 | .632 | .038 | 3.501 | -.230 | .074 | .492 | ||
| Voiced [%] | 2.569 | .459 | -.061 | .090 | 1.963 | .091 | .351 | .025 | ||
| MaxHNR [dB] | 4.058 | .556 | -.197 | .171 | 2.174 | -.438 | .169 | -.156 | ||
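
As a reminder of what a one-way ANOVA F value expresses, the sketch below computes the F statistic for one acoustic parameter with calls grouped by individual. It is an illustration under stated assumptions rather than the authors' procedure: the function name and the grouping of values per kitten are hypothetical, and the source does not say which software was used.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups.

    groups: one list of parameter values per individual (hypothetical layout).
    """
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    k = len(groups)      # number of individuals
    n = len(all_values)  # total number of calls

    # Between-individual sum of squares: group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-individual sum of squares: values around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n - k
    f_value = (ss_between / df_between) / (ss_within / df_within)
    return f_value, df_between, df_within
```

A large F means that the parameter varies more between individuals than within them, which is what makes it a candidate cue to sender identity.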
Mean and standard deviation of the acoustic parameters for the Low and High arousal conditions, results of the dependent t-test comparing both arousal levels for each acoustic parameter, and the correlation coefficient with PC1. Bold p-values indicate a significant difference; ↑ value is higher in the High than in the Low arousal condition; ↓ value is lower in the High than in the Low arousal condition; bold loading factors mark parameters with loading factors higher than 0.700 on the respective PC.
| Parameter | LOW mean | LOW SD | HIGH mean | HIGH SD | t | p | Change | PC1 loading |
|---|---|---|---|---|---|---|---|---|
| Call duration [ms] | 566.34 | 168.62 | 707.10 | 186.09 | −2.81 | | ↑ | |
| ICI [ms] | 2072.53 | 1442.76 | 1075.38 | 652.32 | 2.58 | | ↓ | .482 |
| Peaktime [ms] | 0.23 | 0.08 | 0.29 | 0.12 | −1.92 | .072 | | -.640 |
| MeanF0 [Hz] | 1305.42 | 238.49 | 1105.10 | 184.60 | 3.82 | | ↓ | |
| MinF0 [Hz] | 931.71 | 274.86 | 746.65 | 166.58 | 3.12 | | ↓ | |
| MaxF0 [Hz] | 1517.21 | 249.64 | 1316.52 | 221.48 | 3.68 | | ↓ | .686 |
| SDF0 [Hz] | 154.99 | 37.51 | 149.56 | 37.23 | .50 | .623 | | -.077 |
| Peak [Hz] | 1648.68 | 327.77 | 2493.48 | 676.96 | −5.24 | | ↑ | -.610 |
| MeanF1 [Hz] | 2112.80 | 420.47 | 2642.38 | 325.62 | −4.84 | | ↑ | -.509 |
| SDF1 [Hz] | 696.02 | 273.16 | 549.43 | 159.78 | 2.13 | | ↓ | .107 |
| BWF1 [Hz] | 1120.22 | 466.85 | 623.80 | 376.95 | 3.60 | | ↓ | .442 |
| MeanF2 [Hz] | 7034.30 | 570.61 | 6758.39 | 511.89 | 1.79 | .091 | | .095 |
| SDF2 [Hz] | 981.97 | 286.65 | 987.88 | 297.99 | -.07 | .948 | | -.247 |
| BWF2 [Hz] | 1977.26 | 512.96 | 1858.01 | 732.96 | .75 | .463 | | -.067 |
| MeanF3 [Hz] | 11320.63 | 552.86 | 11240.16 | 524.16 | .47 | .642 | | .118 |
| SDF3 [Hz] | 1134.20 | 249.39 | 1273.83 | 231.80 | −1.52 | .148 | | -.588 |
| BWF3 [Hz] | 2044.74 | 1128.99 | 3017.33 | 1602.68 | −2.03 | .058 | | -.410 |
| F2–F1 [Hz] | 4921.50 | 694.44 | 4116.01 | 745.54 | 4.31 | | ↓ | .348 |
| Consistency | 0.89 | 0.02 | 0.86 | 0.03 | 3.03 | | ↓ | .312 |
| Cepstral peak [V] | 2.36 | 0.59 | 2.69 | 0.61 | −1.69 | .110 | | -.366 |
| Voiced [%] | 98.23 | 1.71 | 96.26 | 2.67 | 2.53 | | ↓ | |
| MaxHNR [dB] | 31.73 | 4.61 | 28.78 | 3.27 | 2.51 | | ↓ | .576 |
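
The dependent t-test compares each individual's Low and High arousal values pairwise. The sketch below shows one way such a t value could be computed; it is not the authors' code. The function name is hypothetical, and the sign convention (Low minus High) is inferred from the table, where parameters that increase under High arousal (↑) carry negative t values.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(low, high):
    """Return (t, df) for a dependent (paired) t-test.

    low, high: per-individual values in the Low and High arousal condition,
    in the same individual order (hypothetical input layout).
    """
    # Assumption: differences are Low minus High, so a parameter that is
    # higher under High arousal yields a negative t value.
    diffs = [l - h for l, h in zip(low, high)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)
    return mean(diffs) / standard_error, n - 1
```

With N = 18 individuals the test has 17 degrees of freedom, matching the t(17) reported in the Figure 1 caption.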
Figure 1. Mean and standard deviation for the Low and High arousal conditions for the acoustic parameters of kitten isolation calls that had an important impact on the classification of arousal; t(17)≥|2.53|, N=18, p≤0.022.
Figure 2. Scatterplot for PC1 and PC2 of the arousal analysis.
Figure 3. Examples of kitten isolation calls: (a) harmonic isolation call without non-linear phenomena, (b) isolation call with a frequency jump and a chaotic component, (c) isolation call with subharmonics.