| Literature DB >> 24032022 |
Maija Hausen1, Ritva Torppa, Viljami R Salmela, Martti Vainio, Teppo Särkämö.
Abstract
Disorders of music and speech perception, known as amusia and aphasia, have traditionally been regarded as dissociated deficits based on studies of brain damaged patients. This has been taken as evidence that music and speech are perceived by largely separate and independent networks in the brain. However, recent studies of congenital amusia have broadened this view by showing that the deficit is associated with problems in perceiving speech prosody, especially intonation and emotional prosody. In the present study the association between the perception of music and speech prosody was investigated with healthy Finnish adults (n = 61) using an on-line music perception test including the Scale subtest of Montreal Battery of Evaluation of Amusia (MBEA) and Off-Beat and Out-of-key tasks as well as a prosodic verbal task that measures the perception of word stress. Regression analyses showed that there was a clear association between prosody perception and music perception, especially in the domain of rhythm perception. This association was evident after controlling for music education, age, pitch perception, visuospatial perception, and working memory. Pitch perception was significantly associated with music perception but not with prosody perception. The association between music perception and visuospatial perception (measured using analogous tasks) was less clear. Overall, the pattern of results indicates that there is a robust link between music and speech perception and that this link can be mediated by rhythmic cues (time and stress).Entities:
Keywords: MBEA; music perception; speech prosody perception; visuospatial perception; word stress
Year: 2013 PMID: 24032022 PMCID: PMC3759063 DOI: 10.3389/fpsyg.2013.00566
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Characteristics of the participants.
| Male/female | 21/40 (34/66%) |
| Mean age (range) | 39.0 (19–59) |
| Education Level | |
| Primary level | 0 (0%) |
| Secondary level | 23 (38%) |
| Lowest level tertiary | 6 (10%) |
| Bachelor level | 17 (28%) |
| Master level or higher | 15 (25%) |
| Mean education in years (range) | 17.1 (10–32) |
| Musical education: no/yes | 19/42 (31/69%) |
| Musical playschool | 4 (7%) |
| Special music class in school | 6 (10%) |
| Private lessons or with parents | 23 (37%) |
| Music institute or conservatory | 13 (21%) |
| Independent music learning | 26 (43%) |
| Mean musical training in years (range) | 3.7 (0–19) |
| Self-reported cognitive problems | |
| Reading problems | 5 (8%) |
| Speech problems | 3 (5%) |
| Spatial orientation problems | 5 (8%) |
| Problems in maths | 12 (20%) |
| Attentional problems | 5 (8%) |
| Memory problems | 6 (10%) |
The differences between the cues for word stress in first and second stressed syllables in compound/phrase utterances.
| Compound | 14 | 2.0 (69.2) | 9.2 (2.4) | 8.6 (5.7) |
| Phrase | 16 | −33.3 (98.6) | 4.8 (2.7) | 1.1 (2.6) |
| T-test between compounds vs. phrases |
The mean differences were calculated as follows (a): Duration: the duration of the first syllable vowel (nucleus) of the first part of the compound minus the duration of the first syllable vowel (nucleus) in the second part of the compound or phrase, i.e., “kIssankEllo” or “kIssan kEllo”, respectively. (b) f0 and intensity: the peak value of the f0/intensity in the first part of the compound minus the peak value of the f0/intensity in second part of the compound/phrase. The f0 differences were calculated in semitones. One f0 value was missing due to creaky voice.
Figure 1Example of the spectrum of a compound word (above; audio file The scale is 9–400 Hz for f0 and 0–100 dB for intensity.
Figure 2Example of the word stress task. The left picture represents a compound word “kissankello” and the right picture a phrase “kissan kello.”
Figure 3Example of the visuospatial task with the original sequence of Gabor figures (A) and a sequence with a change in the location and orientation of one of the Gabor figures (B). Note that in the actual test, only a single Gabor was presented at a time.
Basic descriptive statistics of the music perception test.
| Scale | 19–30 (63.3–100%) | 25.0 (83.4%) | 3.2 (10.5%) |
| Off-beat | 16–23 (66.7–95.8%) | 19.8 (82.4%) | 2.4 (10.1%) |
| Out-of-key | 15–24 (63.0–100%) | 20.3 (84.6%) | 3.3 (13.7%) |
| Total | 55–74 (70.5–94.9%) | 65.1 (83.5%) | 6.5 (8.3%) |
Figure 4Distributions of the music perception subtest and total scores.
Figure 5Scatter plots indicating the relationships between the three music perception subtests.
Other tests of perception and memory: basic descriptive statistics.
| Speech prosody perception | 19–30 (63–100%) | 25.0 (83%) | 2.7 (9%) |
| Visuospatial perception | 17–30 (57–100%) | 23.8 (79%) | 2.9 (10%) |
| Auditory working memory | 10–22 (33–73%) | 15.8 (53%) | 3.0 (10%) |
| Pitch perception | |||
| No change trials | 13–40 (33–100%) | 32.4 (81%) | 6.4 (16%) |
| Change trials | 22–38 (55–95%) | 30.3 (76%) | 4.4 (11%) |
| 3 Hz change (1/16 semitone) | 0–7 (0-88%) | 2.7 (34%) | 2.1 (26%) |
| 7 Hz change (1/8 semitone) | 0–8 (0–100%) | 4.5 (57%) | 2.0 (25%) |
| 15 Hz change (1/4 semitone) | 4–8 (50–100%) | 7.1 (89%) | 0.9 (12%) |
| 36 Hz change (1/2 semitone) | 6–8 (75–100%) | 7.8 (98%) | 0.4 (0%) |
| 62 Hz change (1 semitone) | 8 (100%) | 8 (100%) | 0 (0%) |
| Pitch discrimination threshold (Hz) | 3.0–26.1 | 9.9 | 4.7 |
Background variables' associations with the music perception total score.
| Gender: female/male | 40/21 | 84/82 | |
| First language: Finnish/Swedish | 58/3 | 84/81 | |
| Self-reported cognitive problems | |||
| Problems in reading: yes/no | 5/53 | 83/84 | |
| Attention problems: yes/no | 5/53 | 83/84 | |
| Problems in speech: yes/no | 3/55 | 79/84 | |
| Problems in mathematics: yes/no | 12/45 | 83/84 | |
| Memory problems: yes/no | 6/51 | 85/84 | |
| Problems in visuospatial orientation: yes/no | 5/52 | 82/84 | |
| Suspected or mild hearing problems: yes/no | 12/49 | 81/84 |
Average music perception scores of the age groups.
| 19–29 | 17 | 81.2 (6.2) |
| 30–39 | 14 | 85.0 (6.8) |
| 40–49 | 12 | 89.0 (3.2) |
| 50–59 | 18 | 80.7 (6.5) |
Correlations between speech prosody and visuospatial perception, music perception and possible confounding variables.
| Pitch perception: change trials ( | 0.01 | 0.05 | 0.31 |
| No change trials ( | −0.06 | 0.07 | −0.15 |
| All trials ( | −0.08 | 0.14 | 0.09 |
| Pitch discrimination threshold ( | −0.13 | −0.03 | −0.32 |
| Auditory working memory ( | 0.26 | 0.10 | 0.10 |
| Digit span forwards ( | 0.26 | 0.07 | 0.07 |
| Digin span backwards ( | 0.13 | 0.11 | 0.06 |
| Music education (years) ( | 0.12 | 0.02 | 0.32 |
| General education (years) ( | 0.08 | −0.11 | 0.10 |
p < 0.01;
p < 0.05.
Regression analysis.
| 1 | 0.15 | 0.15 | ||||
| Music education | 0.28 | 2.27 | ||||
| Age group | −0.20 | −1.62 | ||||
| 2 | 0.21 | 0.06 | ||||
| Music education | 0.28 | 2.25 | ||||
| Age group | −0.14 | −1.07 | ||||
| Auditory working memory | −0.01 | −0.05 | ||||
| Pitch discrimination threshold | −0.25 | −2.07 | ||||
| 3 | 0.29 | 0.08 | ||||
| Music education | 0.28 | 2.37 | ||||
| Age group | −0.08 | −0.61 | ||||
| Auditory working memory | −0.02 | −0.19 | ||||
| Pitch discrimination threshold | −0.26 | −2.23 | ||||
| Visuospatial perception | 0.28 | 2.40 | ||||
| 4 | 0.37 | 0.08 | ||||
| Music education | 0.28 | 2.47 | ||||
| Age group | −0.09 | −0.74 | ||||
| Auditory working memory | −0.10 | −0.85 | ||||
| Pitch discrimination threshold | −0.23 | −2.03 | ||||
| Visuospatial perception | 0.27 | 2.42 | ||||
| Word stress perception | 0.30 | 2.65 | ||||
| 1 | 0.11 | 0.11 | ||||
| Music education | 0.15 | 1.20 | ||||
| Age group | −0.26 | −2.00 | ||||
| 2 | 0.11 | 0.00 | ||||
| Music education | 0.15 | 1.15 | ||||
| Age group | −0.25 | −1.79 | ||||
| Auditory working memory | −0.01 | 0.10 | ||||
| Pitch discrimination threshold | −0.04 | −0.30 | ||||
| 3 | 0.16 | 0.05 | ||||
| Music education | 0.15 | 1.19 | ||||
| Age group | −0.20 | −1.44 | ||||
| Auditory working memory | 0.00 | 0.01 | ||||
| Pitch discrimination threshold | −0.05 | −0.36 | ||||
| Visuospatial perception | 0.22 | 1.71 | ||||
| 4 | 0.17 | 0.01 | ||||
| Music education | 0.15 | 1.18 | ||||
| Age group | −0.20 | −1.47 | ||||
| Auditory working memory | −0.03 | −0.22 | ||||
| Pitch discrimination threshold | −0.03 | −0.25 | ||||
| Visuospatial perception | 0.21 | 1.67 | ||||
| Word stress perception | 0.12 | 0.91 | ||||
| 1 | 0.10 | 0.10 | ||||
| Music education | 0.31 | 2.40 | ||||
| Age group | −0.04 | −0.31 | ||||
| 2 | 0.17 | 0.06 | ||||
| Music education | 0.32 | 2.51 | ||||
| Age group | −0.01 | −0.04 | ||||
| Auditory working memory | −0.12 | −0.98 | ||||
| Pitch discrimination threshold | −0.23 | −1.82 | ||||
| 3 | 0.19 | 0.02 | ||||
| Music education | 0.32 | 2.54 | ||||
| Age group | 0.03 | 0.21 | ||||
| Auditory working memory | −0.13 | −1.06 | ||||
| Pitch discrimination threshold | −0.24 | −1.87 | ||||
| Visuospatial perception | 0.15 | 1.23 | ||||
| 4 | 0.24 | 0.05 | ||||
| Music education | 0.32 | 2.58 | ||||
| Age group | 0.00 | 0.00 | ||||
| Auditory working memory | −0.20 | −1.53 | ||||
| Pitch discrimination threshold | −0.21 | −1.68 | ||||
| Visuospatial perception | 0.14 | 1.18 | ||||
| Word stress perception | 0.24 | 1.96 | ||||
| 1 | 0.05 | 0.05 | ||||
| Music education | 0.14 | 1.07 | ||||
| Age group | −0.15 | −1.15 | ||||
| 2 | 0.17 | 0.11 | ||||
| Music education | 0.12 | 0.90 | ||||
| Age group | −0.04 | −0.33 | ||||
| Auditory working memory | 0.13 | 1.06 | ||||
| Pitch discrimination threshold | −0.32 | −2.52 | ||||
| 3 | 0.23 | 0.06 | ||||
| Music education | 0.11 | 0.96 | ||||
| Age group | 0.01 | 0.10 | ||||
| Auditory working memory | 0.12 | 0.97 | ||||
| Pitch discrimination threshold | −0.33 | −2.67 | ||||
| Visuospatial perception | 0.26 | 2.15 | ||||
| 4 | 0.33 | 0.09 | ||||
| Music education | 0.12 | 1.18 | ||||
| Age group | 0.00 | 0.00 | ||||
| Auditory working memory | 0.04 | 0.32 | ||||
| Pitch discrimination threshold | −0.29 | −2.48 | ||||
| Visuospatial perception | 0.25 | 2.15 | ||||
| Word stress perception | 0.32 | 2.73 | ||||
p < 0.001;
p < 0.01;
p < 0.05;
p < 0.10.
Figure 6Scatter plots indicating the relationships between the word stress task and the music perception test.