| Literature DB >> 31430627 |
Elaine C Thompson1, Jennifer Krizman1, Travis White-Schwoch1, Trent Nicol1, Ryne Estabrook2, Nina Kraus3.
Abstract
Hearing in noisy environments is a complicated task that engages attention, memory, linguistic knowledge, and precise auditory-neurophysiological processing of sound. Accumulating evidence in school-aged children and adults suggests these mechanisms vary with the task's demands. For instance, co-located speech and noise demands a large cognitive load and recruits working memory, while spatially separating speech and noise diminishes this load and draws on alternative skills. Past research has focused on one or two mechanisms underlying speech-in-noise perception in isolation; few studies have considered multiple factors in tandem, or how they interact during critical developmental years. This project sought to test complementary hypotheses involving neurophysiological, cognitive, and linguistic processes supporting speech-in-noise perception in young children under different masking conditions (co-located, spatially separated). Structural equation modeling was used to identify latent constructs and examine their contributions as predictors. Results reveal cognitive and language skills operate as a single factor supporting speech-in-noise perception under different masking conditions. While neural coding of the F0 supports perception in both co-located and spatially separated conditions, neural timing predicts perception of spatially separated listening exclusively. Together, these results suggest co-located and spatially separated speech-in-noise perception draw on similar cognitive/linguistic skills, but distinct neural factors, in early childhood.Entities:
Keywords: Auditory development; Auditory processing; Cognition; Electrophysiology; FFR; Language; Speech-in-noise perception; Structural equation modeling
Mesh:
Year: 2019 PMID: 31430627 PMCID: PMC6886664 DOI: 10.1016/j.dcn.2019.100672
Source DB: PubMed Journal: Dev Cogn Neurosci ISSN: 1878-9293 Impact factor: 6.464
Fig. 1The theoretically proposed structural equation model (M1) is comprised of latent constructs (ovals) and manifest variables (rectangles). Latent constructs include speech-in-noise perception (shared and differential), cognition, language, and neural processing. Manifest variables, or measured indicators, are listed for each latent construct.
Fig. 2Path diagrams for M2 and M3. These models reflect refined versions of the conceptual model (M1), where the multifactor latent variable is eliminated (M2), and the cognition and language latent constructs are combined (M3).
Factor loadings of manifest variables on latent constructs of the final model (M2). HINT = Hearing in Noise Test; CELF = Clinical Evaluation of Language Fundamentals; WJIII = Woodcock Johnson Test of Cognitive Abilities-Third Edition; FFR = frequency following response.
| Speech-in-noise perception SUM | Co-located | HINT Front (dB SNR) | 1 | --- | --- |
| Spatially separated | HINT Right (dB SNR) | 1 | --- | --- | |
| Speech-in-noise perception DIFF | Co-located | HINT Front (dB SNR) | 1 | 0.088 | <0.001 |
| Spatially separated | HINT Right (dB SNR) | −1 | 0.024 | <0.001 | |
| Language | Syntax and semantics | CELF Formulated Sentences (scaled score) | 0.599 | 0.088 | <0.001 |
| Word morphology | CELF Word Structure (scaled score) | 0.689 | 0.077 | <0.001 | |
| Sentence memory | CELF Recalling Sentences (scaled score) | 0.792 | 0.063 | <0.001 | |
| Cognition | Auditory working memory | WJIII Auditory Working Memory (scaled score) | 0.669 | 0.075 | <0.001 |
| Word memory | WJIII Memory for Words (scaled score) | 0.809 | 0.055 | <0.001 | |
| Short-term memory | WJIII Numbers Reversed (scaled score) | 0.636 | 0.08 | <0.001 | |
| Forward memory | Leiter Forward Memory (scaled score) | 0.402 | 0.113 | <0.001 | |
| Attention | Leiter Attention Sustained (scaled score) | 0.36 | 0.109 | 0.001 | |
| Neural (F0) | F0 processing (μV) | FFR | 1 | --- | --- |
| Neural (Timing) | Transition timing (ms) | FFR | 1 | --- | --- |
Correlations among manifest variables. *p < 0.05; **p < 0.01.
| Co-located SPIN (dB SNR) | Spatially separated SPIN (dB SNR) | F0 amplitude (uV) | Transition timing (ms) | Auditory working memory (scaled score) | Memory for words (scaled score) | Numbers reversed (scaled score) | Attention (scaled score) | Forward memory (scaled score) | Formulated sentences (scaled score) | Word structure (scaled score) | Recalling sentences (scaled score) | Age (years) | Sex | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Co-located SPIN (dB SNR) | 1 | .365** | −0.160 | −0.124 | −.245* | −0.081 | 0.042 | −0.043 | −.251* | −0.227 | −.294* | −.286** | −.259** | 0.024 |
| Spatially separated SPIN (dB SNR) | .365** | 1 | −.268** | 0.105 | −0.121 | −0.197 | 0.056 | −0.061 | −0.144 | −0.128 | −.273* | −0.161 | −0.109 | −0.053 |
| F0 amplitude (uV) | −0.160 | −.268** | 1 | 0.067 | 0.025 | 0.011 | 0.003 | −0.191 | 0.009 | −0.030 | 0.113 | 0.084 | 0.122 | .256* |
| Transition timing (ms) | −0.124 | 0.105 | 0.067 | 1 | 0.132 | 0.119 | 0.216 | 0.079 | 0.157 | .260* | 0.122 | 0.003 | −0.041 | .254* |
| Auditory working memory (scaled score) | −.245* | −0.121 | 0.025 | 0.132 | 1 | .530** | .442** | .260* | 0.183 | .339** | .460** | .540** | −0.006 | 0.030 |
| Memory for words (scaled score) | −0.081 | −0.197 | 0.011 | 0.119 | .530** | 1 | .519** | .256* | .263* | .400** | .424** | .552** | −.302** | 0.108 |
| Numbers reversed (scaled score) | 0.042 | 0.056 | 0.003 | 0.216 | .442** | .519** | 1 | 0.143 | .345** | .328** | .381** | .457** | −0.112 | 0.179 |
| Attention (scaled score) | −0.043 | −0.061 | −0.191 | 0.079 | .260* | .256* | 0.143 | 1 | .337** | .347** | 0.192 | .276* | −.235* | −0.171 |
| Forward memory (scaled score) | −.251* | −0.144 | 0.009 | 0.157 | 0.183 | .263* | .345** | .337** | 1 | .253* | 0.220 | 0.228 | −0.007 | −0.093 |
| Formulated sentences (scaled score) | −0.227 | −0.128 | −0.030 | .260* | .339** | .400** | .328** | .347** | .253* | 1 | .543** | .469** | −0.027 | −0.184 |
| Word structure (scaled score) | −.294* | −.273* | 0.113 | 0.122 | .460** | .424** | .381** | 0.192 | 0.220 | .543** | 1 | .541** | 0.061 | −0.052 |
| Recalling sentences (scaled score) | −.286** | −0.161 | 0.084 | 0.003 | .540** | .552** | .457** | .276* | 0.228 | .469** | .541** | 1 | −0.043 | 0.066 |
| Age (years) | −.259** | −0.109 | 0.122 | −0.041 | −0.006 | −.302** | −0.112 | −.235* | −0.007 | −0.027 | 0.061 | −0.043 | 1 | 0.090 |
| Sex | 0.024 | −0.053 | .256* | .254* | 0.030 | 0.108 | 0.179 | −0.171 | −0.093 | −0.184 | −0.052 | 0.066 | 0.090 | 1 |
Standardized coefficients of latent constructs predicting the shared and differential effects of co-located and spatially separated speech-in-noise perception.
| SUM by F0 processing | −0.245 | 0.098 | 0.012 | −0.251 | 0.092 | 0.007 |
| Transition timing | 0.074 | 0.098 | 0.453 | 0.083 | 0.096 | 0.384 |
| Cognition | −0.114 | 0.419 | 0.785 | --- | --- | --- |
| Language | −0.195 | 0.4 | 0.626 | --- | --- | --- |
| Cognition/Language | --- | --- | --- | −0.306 | 0.1 | 0.002 |
| Sex | 0.04 | 0.112 | 0.719 | 0.045 | 0.097 | 0.639 |
| Age | −0.175 | 0.156 | 0.263 | −0.192 | 0.093 | 0.04 |
| DIFF by F0 processing | −0.222 | 0.108 | 0.04 | −0.198 | 0.098 | 0.043 |
| Transition timing | 0.251 | 0.105 | 0.016 | 0.238 | 0.1 | 0.017 |
| Cognition | −0.353 | 0.466 | 0.448 | --- | --- | --- |
| Language | 0.247 | 0.448 | 0.582 | --- | --- | --- |
| Cognition/Language | --- | --- | --- | −0.087 | 0.112 | 0.434 |
| Sex | −0.038 | 0.121 | 0.752 | −0.08 | 0.101 | 0.429 |
| Age | −0.028 | 0.172 | 0.873 | 0.064 | 0.099 | 0.518 |
| Cognition & Language | 0.836 | 0.074 | <0.001 | |||
| F0 processing & Language | 0.075 | 0.118 | 0.527 | |||
| F0 processing & Cognition | −0.008 | 0.121 | 0.946 | |||
| Transition tming & Language | 0.126 | 0.122 | 0.302 | |||
| Transition tming & Cognition | 0.213 | 0.116 | 0.066 | |||
| Transition tming & F0 processing | 0.062 | 0.1 | 0.532 | |||
| DIFF & SUM | 0.616 | 0.067 | <0.001 | |||
Model comparisons show M2 had the best fit. When comparing models, a significant Chi Square test indicates the model with a greater number of parameters should be accepted, while non-significant Chi Square tests indicate the model with fewer parameters should be accepted. M1 refers to the theoretically proposed model with a neural latent construct, while M3 refers to the model in which cognition and language latent constructs were combined.
| M1 | M2 | |
|---|---|---|
| Parameters | 58 | 64 |
| RMSEA | 0.067 | 0.059 |
| CFI | 0.88 | 0.916 |
| TLI | 0.822 | 0.863 |
| LL | −3498.704 | −3263.385 |
| Chi-Square Estimate | 89.646 | 74.919 |
| Chi-Square df | 61 | 55 |
| Model Comparison | ||
| p-value | 0.022 |
Over and above age and sex, neural and cognitive/language skills at ˜age 3 predict spatially separated speech-in-noise perception at ˜age 6. Because children participated in our longitudinal study, we were poised to retroactively examine these measures at a child’s first visit (˜age 3–4) and their predictive utility on sentence-in-noise perception two years later (˜5-6). Unique predictors included F0 processing (β = -0.234; p = 0.020) and sentence memory (β = -0.226; p = 0.026), but not transition timing (β = 0.164; p = 0.108).
| DV: Co-located | |||
|---|---|---|---|
| ΔR2 | β | p-value | |
| 0.054 | 0.091 | ||
| Sex | 0.072 | 0.490 | |
| Age | −0.225 | 0.034 | |
| 0.078 | 0.063 | ||
| Sex | 0.095 | 0.381 | |
| Age | −0.194 | 0.065 | |
| F0 processing | −0.100 | 0.334 | |
| Transition timing | −0.127 | 0.234 | |
| Sentence memory | −0.228 | 0.031 | |
| Total R2 | 0.132 | ||