| Literature DB >> 28963569 |
Clément François1,2,3, Maria Teixidó4, Sylvain Takerkart5, Thaïs Agut6,7, Laura Bosch4,8,6,9, Antoni Rodriguez-Fornells4,8,10.
Abstract
Words and melodies are some of the basic elements infants are able to extract early in life from the auditory input. Whether melodic cues contained in songs can facilitate word-form extraction immediately after birth remained unexplored. Here, we provided converging neural and computational evidence of the early benefit of melodies for language acquisition. Twenty-eight neonates were tested on their ability to extract word-forms from continuous flows of sung and spoken syllabic sequences. We found different brain dynamics for sung and spoken streams and observed successful detection of word-form violations in the sung condition only. Furthermore, neonatal brain responses for sung streams predicted expressive vocabulary at 18 months as demonstrated by multiple regression and cross-validation analyses. These findings suggest that early neural individual differences in prosodic speech processing might be a good indicator of later language outcomes and could be considered as a relevant factor in the development of infants' language skills.Entities:
Mesh:
Year: 2017 PMID: 28963569 PMCID: PMC5622081 DOI: 10.1038/s41598-017-12798-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1After a learning phase of 3.5 minutes (left side, learning phase), an implicit test (right side, test phase) was performed in which illegal word-forms violating the statistical structure (e.g. a legal ABC word-form in the language would become a CBA illegal word-form) appeared at pseudo-random positions in the stream. In the flat contour condition, all the syllables were spoken on the same pitch, thus resulting in a monotonous stream of syllables in which the only cue to word segmentation were the transitional probabilities between adjacent syllables. In the melodically enriched condition, each syllable was associated to a unique pitch, thus resulting in a continuous stream of sung syllables in which transitional probabilities and pitch modulations could be used for word segmentation. The blue traces represent the constant pitch used in the flat contour condition for both learning and test phases. The red traces represent the varying pitches used in the melodically enriched condition for both learning and test phases. In both conditions, the syllable duration was set to 350 ms thus leading to 1050 ms tri-syllabic pseudo-words.
Figure 2Time-course of the Learning effects. Grand averages ERPs at F3 electrode across 27 newborns recorded during each block (thick = 1st block, dotted = 2nd block) of the learning phases in both conditions (flat and melodically enriched). The isovoltage topographical maps show the distribution of the mean amplitude in the 200–500 ms latency band for each learning block.
Test phase: Results of the t-tests comparing the difference waveform (illegal minus legal words) against zero for the two time-windows (300–400 and 800–900 ms) and for the 10 electrodes in the two conditions.
| Flat Contour | 300–400 | 800–900 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M | SD |
|
|
| Cohen’s d | M | SD |
|
|
| Cohen’s d | |
| Fp1 | −0.07 | 2.1 | −0.2 | 0.87 | 0.94 | 0.02 | −0.13 | 1.4 | −0.5 | 0.64 | 0.88 | 0.19 |
| Fp2 | −0.19 | 2.2 | −0.4 | 0.66 | 0.94 | 0.27 | −0.24 | 1.1 | −1.0 | 0.31 | 0.88 | 0.41 |
| F4 | 0.06 | 1.9 | 0.1 | 0.88 | 0.94 | 0.19 | −0.08 | 1.3 | −0.3 | 0.76 | 0.88 | 0.12 |
| Fz | 0.03 | 1.8 | 0.1 | 0.94 | 0.94 | 0.08 | −0.07 | 1.5 | −0.2 | 0.80 | 0.88 | 0.09 |
| F3 | −0.38 | 2.3 | −0.8 | 0.42 | 0.94 | 0.37 | −0.17 | 1.2 | −0.7 | 0.50 | 0.88 | 0.32 |
| T7 | −0,98 | 2.1 | −2.3 | 0.02 | 0.20 | 0.8 | −0.47 | 1.3 | −1.9 | 0.07 | 0.70 | 0.76 |
| C3 | −0.34 | 2.6 | −0.7 | 0.52 | 0.94 | 0.21 | −0.21 | 1.7 | −0.6 | 0.52 | 0.88 | 0.26 |
| Cz | −0.21 | 3.1 | −0.3 | 0.74 | 0.94 | 0.31 | −0.05 | 1.8 | −0.1 | 0.89 | 0.89 | 0.06 |
| C4 | −0.04 | 2.3 | −0.1 | 0.94 | 0.94 | 0.06 | 0.12 | 1.5 | 0.4 | 0.67 | 0.88 | 0.17 |
| T8 | 0.66 | 2.2 | 1.6 | 0.13 | 0.65 | 0.05 | 0.09 | 1.1 | 0.4 | 0.70 | 0.88 | 0.15 |
|
|
|
| ||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Fp1 | 1.27 | 2.3 | 2.9 | 0.008 | 0.07 | 1.1 | 1.49 | 2.0 | 3.8 |
| 0.003 |
|
| Fp2 | 1.33 | 2.8 | 2.4 | 0.02 | 0.07 | 0.9 | 1.53 | 2.0 | 3.9 |
| 0.003 |
|
| F4 | 1.13 | 2.9 | 1.9 | 0.06 | 0.10 | 0.8 | 1.17 | 1.9 | 3.1 |
| 0.01 |
|
| Fz | 1.06 | 3.2 | 1.7 | 0.10 | 0.14 | 0.7 | 1.15 | 2.1 | 2.8 |
| 0.01 |
|
| F3 | 1.29 | 3.0 | 2.2 | 0.03 | 0.07 | 0.9 | 1.32 | 2.3 | 2.9 |
| 0.01 |
|
| T7 | 0.77 | 0.5 | 1.6 | 0.12 | 0.15 | 0.6 | 0.48 | 1.9 | 1.3 | 0.2 | 0.20 | 0.5 |
| C3 | 1.10 | 2.7 | 2.1 | 0.04 | 0.08 | 0.8 | 1.02 | 2.0 | 2.6 |
| 0.01 |
|
| Cz | 1.40 | 3.2 | 2.2 | 0.03 | 0.07 | 0.9 | 1.38 | 1.9 | 3.7 |
| 0.003 |
|
| C4 | 0.69 | 2.9 | 1.2 | 0.24 | 0.26 | 0.5 | 0.86 | 2.0 | 2.2 | 0.04 | 0.05 | 0.9 |
| T8 | 0.52 | 0.6 | 0.9 | 0.39 | 0.39 | 0.3 | 0.57 | 1.7 | 1.7 | 0.10 | 0.11 | 0.6 |
The significant differences remaining significant after FDR correction (n = 10 comparisons; P < 0.05) are highlighted in bold.
Figure 3Grand averages ERP at F3 electrode across 26 newborns for legal and illegal word (thick = legal, dotted = illegal words) of the test phases in both conditions. The topographical maps show the distribution of the mean amplitude for illegal words in the significant time-windows.
Figure 4Link between neonatal brain responses in the learning and test phases and expressive vocabulary measures obtained from the MCDI and the Bayley-III language subscale. (A) the scatter plots show the correlations between both the raw MCDI scores and the language subscale score of the Bayley-III and the learning brain dynamics averaged across all channels and for each condition. A marginally significant correlation is observed between MCDI scores and the learning brain dynamics in the melodically enriched condition after FDR correction (n = 4 comparisons, P < 0.05). (B) the scatter plots show the correlations between the raw MCDI scores and the Bayley-III language subscale score and the mean amplitude of the difference waveform (illegal minus legal words) for the two time-windows averaged across all EEG channels. The correlation between the MCDI scores and the second MMR in the melodically enriched condition survives the FDR correction (n = 4 comparisons, P < 0.05) and is highlighted in red.
Coefficients for the multiple regression with the four neonatal EEG features as predictors (N = 13).
| Model | Unstandardized coefficients | Standardized coefficients | |||
|---|---|---|---|---|---|
| B | SE | βk |
|
| |
|
| |||||
| (Constant) | 49.098 | 13.378 | 3.595 | 0.007 | |
| L_flat | 0.212 | 5.844 | 0.009 | 0.036 | 0.97 |
| L_melody | 6.684 | 3.291 | 0.465 | 2.031 | 0.07 |
| TW1_melody | −3.822 | 7.503 | −0.174 | −0.509 | 0.62 |
| TW2_melody | −17.552 | 13.821 | −0.421 | −1.270 | 0.24 |
|
| |||||
| (Constant) | 47.575 | 10.800 | 4.405 | 0.001 | |
| L_melody | 6.263 | 2.896 | 0.436 | 2.163 | 0.05 |
| TW2_melody | −22.929 | 8.403 | −0.550 | −2.729 | 0.02 |
The abbreviations L_flat and L_melody correspond to the learning brain dynamics (block 2 minus block 1) in the flat contour and melodically enriched condition respectively. Viol_melody1 and 2 correspond to the difference waveform mean amplitude in the first and second time-windows for the melodically enriched condition (see Methods).
Figure 5Results of the learning phases (Left): Learning modulations of the ERP mean amplitude in the 200–500 ms latency band for the flat contour (red bars) and melodically enriched conditions (blue bars). Bars denote average amplitudes across all channels (with SEMs) in the two learning blocks. An opposite pattern of ERP modulation is observed in the melodically enriched as compared to the flat contour condition. Results of the test phases (Right): Bars denote average brain response in the 300–400 and 800–900 ms latency band (with SEMs) for legal and illegal word. Significant differences were found in the melodically enriched condition only (blue bars).