| Literature DB >> 27713712 |
Anna Dora Manca1, Mirko Grimaldi1.
Abstract
Speech sound perception is one of the most fascinating tasks performed by the human brain. It involves a mapping from continuous acoustic waveforms onto the discrete phonological units computed to store words in the mental lexicon. In this article, we review the magnetoencephalographic studies that have explored the timing and morphology of the N1m component to investigate how vowels and consonants are computed and represented within the auditory cortex. The neurons that are involved in the N1m act to construct a sensory memory of the stimulus due to spatially and temporally distributed activation patterns within the auditory cortex. Indeed, localization of auditory fields maps in animals and humans suggested two levels of sound coding, a tonotopy dimension for spectral properties and a tonochrony dimension for temporal properties of sounds. When the stimulus is a complex speech sound, tonotopy and tonochrony data may give important information to assess whether the speech sound parsing and decoding are generated by pure bottom-up reflection of acoustic differences or whether they are additionally affected by top-down processes related to phonological categories. Hints supporting pure bottom-up processing coexist with hints supporting top-down abstract phoneme representation. Actually, N1m data (amplitude, latency, source generators, and hemispheric distribution) are limited and do not help to disentangle the issue. The nature of these limitations is discussed. Moreover, neurophysiological studies on animals and neuroimaging studies on humans have been taken into consideration. We compare also the N1m findings with the investigation of the magnetic mismatch negativity (MMNm) component and with the analogous electrical components, the N1 and the MMN. We conclude that N1 seems more sensitive to capture lateralization and hierarchical processes than N1m, although the data are very preliminary. Finally, we suggest that MEG data should be integrated with EEG data in the light of the neural oscillations framework and we propose some concerns that should be addressed by future investigations if we want to closely line up language research with issues at the core of the functional brain mechanisms.Entities:
Keywords: N1; auditory cortex; consonants; magnetoencephalography; oscillatory rhythms; tonochrony; tonotopy; vowels
Year: 2016 PMID: 27713712 PMCID: PMC5031792 DOI: 10.3389/fpsyg.2016.01413
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Lateral view of the left hemisphere with a focus on the main human auditory areas housed in the supratemporal plane. The colored patches show the different cortical fields which has been emphasized in the reviewed literature. Green: A1, primary auditory cortex in the Brodmann area (BA) 41. Indigo: PT, planum temporale. Yellow: STG, superior temporal gyrus in the BA 22. Purple: STS, superior temporal sulcus.
Figure 2F1-F2 Hz scatterplot of the stressed British English vowels produced by a native 50-year-old male speaker (our recording). 68.27% confidence ellipse corresponding to ±1 standard deviation from the bivariate mean (the symbol within the ellipse indicates the mean formant value). F1 is inversely correlated with articulatory tongue height (+high/−high), while F2 reflects place of articulation in the horizontal (−back/+back and −round/+round) dimension.
Figure 3MEG setup. The recording systems is situated in a magnetically shielded room maintaining an electromagnetically quiet recording environment. Subjects are positioned either sitting or supine with their heads in the recording helmet that covers as much of the skull/brain as possible. The detectors embedded in the helmet work as high-gain low-noise amplifier of the magnetic field elicited by the neuronal activity (Poeppel and Hickok, 2015). A 4D Neuroimaging MEG system that uses the magnetometer sensors is showed. From https://en.wikipedia.org/wiki/Magnetoencephalography.
Figure 4(A) Representation of the auditory N1m/N1 wave evoked from 275 channels to a kHz tone presented to the right ear. The peak at 100 ms post-stimulus onset, measured in femto-Tesla (fT) is evidenced. (B) The topographic map represents the direction and amplitude of the response at the N1m peak. Adapted from Sereda et al. (2013: p. 302). (C) The 3D space within the brain along the classical Talairach coordinates: The direction of x axis is from left to right, that of y axis to the front, and the z axis thus points up. (D) Average location and orientation of the equivalent current dipole sources fitted in the bilateral auditory cortical areas. Adapted from Cirelli et al. (2014).
Results for amplitude and latency of the stimuli (vowels and consonants) applied in the N1m/N1 studies.
| Eulitz et al., | 11 | German synthetic | [a, i, u, æ, œ] 1 kHz burst | – | – |
| Diesch et al., | 11 | Synthetic | [a, æ, u, ø, i] 1 kHz sine tone | – | [a], [æ] < [u] |
| Poeppel et al., | 6 | Synthetic | [a, i, u] 500 Hz pure tone | – | [a] < [i], [u] |
| Mäkelä et al., | 10 | Finnish semi-synth. | [a, o, u] | – | – |
| Obleser et al., | 12 | German synthetic | [a, e, i] | [a] < [i, e] | [a] < [i], [e] |
| Eulitz et al., | 1 | Synthetic | [a, e, i] | – | [a] < [e]—left hemisphere [e] < [i]—right hemisphere |
| Obleser et al., | 21 | German natural | [ø, o] | [ø] > [o] | [o] > [ø] |
| Obleser et al., | 20 | German natural | [a, e, i, ø, y, o, u] | – | [o], [u]—the latest peaks [i] > [e] [y]-[ø] > [i]-[e] |
| Scharinger et al., | 12 | Turkish natural | [i. y, ɛ, ɑ, œ, ɔ ɯ, u] | [u] > [ɑ] [u] > [y] [i] > [ɯ] | [a] < [i] |
| Scharinger et al., | 14 | American English natural | [æ. ɛ, | [ɛ] (as deviant preceded by the standard [æ] > [ɛ] (as standard) | – |
| Grimaldi et al., | 11 | SI natural | [i, ɛ, ɑ, ɔ, u] | [i, u] > [ɑ, ɔ, ɛ] | [ɑ, ɔ, ɛ] > [i, u] |
| Kuriki et al., | 8 | Synthetic | [na], [ka], [ha], [a] | – | [ka], [ha] < [a] [na]—the longest peak |
| Poeppel et al., | 6 | Synthetic | [bæ, pæ, dæ, tæ] | – | [bæ, pæ, dæ, tæ] > left hemisphere in the discrimination task |
| Gage et al., | [b], [p], [d], [t], [g], [k], [f], [r], [m], [r], [s] | ([b], [p], [d], [t], [g], [k]) > ([f], [r], [m], [r], [s]) | ([f], [r], [m], [r], [s]) > right hemisphere | ||
| Gage et al., | 6 | German | [ba, [da], [ga] | – | [ba] > [da], [ga]—right hemisphere |
| Obleser et al., | 22 | Natural German | [bø], [dø], [gø] [bo], [do], [go] | – | [go] > than others |
| Obleser et al., | 19 | Natural German | [d], [t], [g], [k] | ([d], [g]) > ([t], [k]) | ([d], [t]) < ([g], [k]) ([d], [g]) > ([t], [k]) |
| Scharinger et al., | 13 | American English natural | [aja], [awa] | – | [awa] < [aja] (as deviants) |
| Scharinger et al., | 15 | American English natural | [ava] [aʒa] | – | [aʒa] < [ava] |
| Scharinger et al., | [aja], [awa], [ava], [aʒa] | [awa], [aʒa] > [aja], [ava] | – |
The labeled Sj indicates the number of subjects that were submitted to statistical analysis. The column labeled Language reports the variety of languages and the type of stimuli used in the experimental protocols. The symbol (–) means that the results did not reach a statistically significant level or that they were not reported. All studies listed report results from both hemispheres, unless otherwise noted.
Pitch (F0), Formant Frequency (F1, F2, F3 in Hz) and formant distance (F2-F1) values of the vowels used as stimuli in the reviewed studies.
| Eulitz et al., | /i/ | 250 | 2700 | 3400 | ||
| Poeppel et al., | m 100 f 200 | M 280 f 310 | M 2250 f 2790 | M 2890 f 3310 | ||
| Obleser et al., | From 129 to 119 | 250 | 2700 | 3400 | ||
| Obleser et al., | Min 127 max 132 | Min 267 max 287 | Min 2048 max 2120 | Min 2838 max 328 | Min 198 max 125 | |
| Shestakova et al., | From 129 to 119 | 370 | 2250 | 2800 | ||
| Scharinger et al., | 184.31 | 531.50 | 2239.90 | 3009.50 | ||
| Grimaldi et al., | 294 | 2325 | 2764 | 2031 | 294 | |
| Obleser et al., | /e/ | From 129 to 119 | 370 | 2250 | 2800 | |
| Obleser et al., | Min 109 max 125 | Min 302 max 322 | Min 2055 max 2143 | Max 2890 min 2711 | Min 1741 max 1821 | |
| Scharinger et al., | /ɛ/ | 550 | 2100 | 2700 | ||
| Scharinger et al., | 177.19 | 801.00 | 2008.80 | 2895.80 | ||
| Grimaldi et al., | 145 | 549 | 1880 | 2489 | 1330 | |
| Eulitz et al., | /ae/ | 606 | 2077 | 2656 | ||
| Diesch et al., | 600 | 2080 | 2650 | |||
| Scharinger et al., | 171.25 | 1023.30 | 1760.60 | 2712.60 | ||
| Eulitz et al., | /a/ | 780 | 1250 | 2600 | ||
| Poeppel et al., | M 100 f 200 | M 710 f 850 | M 1100 f 1220 | M 2540 f 2810 | ||
| Mäkelä et al., | 330 | |||||
| Grimaldi et al., | 140 | 794 | 1231 | 2528 | 418 | |
| Obleser et al., | From 129 to 119 | 780 | 1250 | 2600 | ||
| Obleser et al., | Min 103 max 113 | Min 552 max 747 | Min 1188 max 1224 | Min 2663 max 3171 | Min 442 max 641 | |
| Scharinger et al., | 680 | 1200 | 2700 | 680 | 1200 | |
| Mäkelä et al., | /o/ | 350 | ||||
| Obleser et al., | M 123 f 223 | M 317 f 390 | M 516 f 904 | M 2601 f 2871 | M 199 f 514 | |
| Obleser et al., | Min 109 max 1125 | Min 293 max 346 | Min 471 max 609 | Min 2481 max 2688 | Min 131 max 303 | |
| Scharinger et al., | 500 | 900 | 3000 | |||
| Grimaldi et al., | 140 | 550 | 856 | 2551 | 306 | |
| Eulitz et al., | /u/ | 250 | 600 | 2500 | ||
| Poeppel et al., | M 100 f 200 | M 310 f 370 | M 870 f 950 | M 2250 f 2670 | ||
| Obleser et al., | M 123 f 223 | M 318 f 417 | M 1357 f 1731 | M 1980 f 2627 | M 1039 f 1314 | |
| Obleser et al., | Min 112 max 118 | Min 231 max 256 | Min 522 max 645 | Min 2117 max 2292 | Min 266 max 415 | |
| Scharinger et al., | 350 | 800 | 2900 | |||
| Grimaldi et al., | 130 | 310 | 660 | 2437 | 349 | |
| Diesch et al., | /ø/ | 350 | 1400 | 2500 | ||
| Poeppel et al., | M 100 f 200 | M 310 f 370 | M 870 f 950 | M 2250 f 2670 | M 100 f 200 | |
| Obleser et al., | M 123 f 223 | M 318 f 417 | M 1357 f 1731 | M 1980 f 2627 | M 1039 f 1314 | |
| Obleser et al., | Min 108 max 125 | Min 301 max 325 | Min 133 max 1447 | Min 1945 max 2079 | Min 981 max 1142 | |
| Obleser et al., | /y/ | Min 115 max 144 | Min 238 max 248 | Min 1516 max 1769 | Min 1987 max 2097 | Min 1275 max 1528 |
| Scharinger et al., | 300 | 2000 | 2600 | |||
| Scharinger et al., | /ɯ/ | 350 | 1800 | 2600 |
The tokens of the vowels that have been used in the experimental protocols are generally spoken by native male speakers with some exceptions reported in the table: (m) stands for male voice and (f) for female voice. Empty cells mean that the study has not tested that vowel category.
Figure 5(A) FDG (2-Fluoro-2-Deoxy-D-[14C(U) Glucose) reconstruction of the activity patterns in left dorsal auditory cortex evoked by vowels [i] and [o] in Gerbils. Vowel representation in field A1 appeared as a dorso-ventral stripe along the isofrequency axis and was highlighted by using a pseudo-color transformation relative to the optical density of the corpus callosum. Large formant distances F2-F1, as in [i], led to stripes that extended far dorsally (white bracket with arrows), whereas stripes obtained with small formant distances, as in [o] ended close to the dorso-ventral level of the roof of the corpus striatum (C str. dashed line). At the top, topographic representation of formant distance F2-F1 along the isofrequency axis in A1 of the vowel-evoked FDG that were used in the study. From Ohl and Scheich (1997: p. 9442). Copyright (1997) National Academy of Sciences, U.S.A. (B) Graphical Representation of the relationship between mean distances of the source locations of [a]–[i], [a]–[u], [u]–[i] vowel pairs—measured via an ECD models- and relative acoustic–phonetic dissimilarity (black line) measured as F2/F1 ratio. The representational centers of the N1m show relative distances that resemble a F2-F1 vowel space and indicate a phonemotopic organization in the supratemporal plane. Adapted from Shestakova et al. (2004: p. 348).
Results for the Absolute ECD distances and for the significant effects driving the dipole location along the medial-lateral, anterior-posterior, and inferior-superior dimensions.
| Eulitz et al., | 11 | Synthetic | [a, i, u, æ, œ] | – | – | – | – |
| Diesch and Luce, | 11 | Synthetic | [a, æ, u, ø, i] | – | F1 | – | – |
| Poeppel et al., | 6 | Synthetic | [a, i, u] | – | – | – | – |
| Mäkelä et al., | 10 | Finnish synthetic | [a-o-u] | [a-o] < (d) [a-u]—left hemisphere | – | F1 and F2 | – |
| Obleser et al., | 11 | German synthetic | [a-e-i] | [a-i] > (d) [e- < i] | – | – | F1 and Height |
| Shestakova et al., | 11 | Russian natural | [a-u-i] | [a-i], [a-u] > (d) [i-u] | – | – | F2-F1 |
| Eulitz et al., | 1 | German synthetic | [a-e-i] | [a-i] < (d) [e-i] | F2-F1 | – | – |
| Obleser et al., | 14 | German natural | [ø-o] | – | Place | – | – |
| Obleser et al., | 20 | German natural | [a-e-i-ø-y-o-u] | [e-i], [o-u] > (d) [e-i], [ø-y] [a-i] > (d) [e-i] | – | Place | – |
| Scharinger et al., | 12 | Turkish natural | [i-ɯ-ɛ-ɑ-y-œ-ɔ-u] | [a-œ] > (d) [a-o] [u-ɛ] > (d) [u-i] | F2 Round | Place | Round and Height in front vowels |
| Scharinger et al., | 14 | American English natural | [æ. ɛ, | [œ- | – | Height | Height—left hemisphere |
| Kuriki et al., | 6 | Synthetic | [na], [ka], [ha], [a] | – | Onset of the low-amplitude high-frequency consonant | – | – |
| Obleser et al., | 16 | Natural German | [bø], [dø], [gø] [bo], [do], [go] | – | – | Place | – |
| Obleser et al., | 19 | German natural | [d], [t], [g], [k] | – | Place | Place | Place |
| Scharinger et al., | 13 | American English natural | [aja], [awa] | – | – | Place—Labial—(as deviant)–left hemisphere | – |
| Scharinger et al., | 15 | American English natural | [ava] [aʒa] | – | – | Place—Labial (as deviant and as standard)—left hemisphere | – |
| Scharinger et al., | [aja], [awa], [ava] [aʒa] | – | – | Place—Labial | – |
The labeled Sj reports the number of the subjects that were submitted to statistical analysis. The symbol (–) means that the results did not reach statistically significance. Empty cells in column labeled Absolute ECD distance (d), means that the study has not performed that kind of analysis. All studies listed report results from both hemispheres, unless otherwise noted.
Figure 6Graphical representation of the main trends emerging from the N1m ECD analysis along the three-dimensional spaces slicing human brain in lateral-medial (. The symbol (*) indicates that the topographical gradient was explained in terms of acoustics effects rather than of featural variables.