| Literature DB >> 35022429 |
Patrick C M Wong1,2,3, Xin Kang4,5,6,7, Hon-Cheong So8,9, Kwong Wai Choy10.
Abstract
Research over the past two decades has identified a group of common genetic variants explaining a portion of variance in native language ability. The present study investigates whether the same group of genetic variants are associated with different languages and languages learned at different times in life. We recruited 940 young adults who spoke from childhood Chinese and English as their first (native) (L1) and second (L2) language, respectively, who were learners of a new, third (L3) language. For the variants examined, we found a general decrease of contribution of genes to language functions from native to foreign (L2 and L3) languages, with variance in foreign languages explained largely by non-genetic factors such as musical training and motivation. Furthermore, genetic variants that were found to contribute to traits specific to Chinese and English respectively exerted the strongest effects on L1 and L2. These results seem to speak against the hypothesis of a language- and time-universal genetic core of linguistic functions. Instead, they provide preliminary evidence that genetic contribution to language may depend at least partly on the intricate language-specific features. Future research including a larger sample size, more languages and more genetic variants is required to further explore these hypotheses.Entities:
Mesh:
Year: 2022 PMID: 35022429 PMCID: PMC8755716 DOI: 10.1038/s41598-021-04163-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
SNPs of language-related genes hypothesized to be associated with language proficiency that we examined in the present study.
| Gene | SNP | Population | Phenotype | Major allele | Minor allele | References | Total N of cases |
|---|---|---|---|---|---|---|---|
| rs11860694 | Majority European (UK) | Non-word repetition in English in SLI individuals | G = 0.76 | C = 0.24 | [ | 879 | |
| rs7619451 | European (Swedish) | Reading comprehension in dyslexic individuals | G = 0.82 | T = 0.18 | [ | 801 | |
| rs6564903 | Majority European (UK) | Non-word repetition in English in SLI individuals | C = 0.80 | T = 0.20 | [ | 876 | |
| rs2538976 | Majority European (UK) | SLI diagnosis in English | T = 0.54 | C = 0.46 | [ | 881 | |
| European (Australian) | Early communicative behaviour scores in English in SLI individuals | [ | |||||
| rs2538991 | European (UK) | SLI diagnosis in English | C = 0.64 | A = 0.36 | [ | 880 | |
| rs4680 | East Asian (Han Chinese) | Immediate memory, visuospatial and language scores in Chinese | G = 0.72 | A = 0.28 | [ | 871 | |
| rs1087266 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | A = 0.59 | G = 0.41 | [ | 739 | |
| rs2274305 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | C = 0.81 | T = 0.19 | [ | 791 | |
| Asian (Uyghur) | Dyslexia diagnosis in Uyghur | T = 0.58 | C = 0.42 | [ | 765 | ||
| rs4599626 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | C = 0.83 | A = 0.17 | [ | 750 | |
| rs6456593 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | C = 0.63 | G = 0.37 | [ | 738 | |
| rs6940827 | East Asian (Han Chinese) | Dyslexia diagnosis in Chinese | G = 0.82 | A = 0.18 | [ | 766 | |
| rs807724 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | T = 0.96 | C = 0.04 | [ | 763 | |
| East Asian (Han Chinese) | Reading fluency, character reading, morphological production and tone deletion in Chinese | [ | |||||
| Majority European (UK) | Dyslexia diagnosis in English | [ | |||||
| European (UK) | Single word reading and non-word repetition in English | [ | |||||
| rs9461045 | Asian (Uyghur) | Dyslexia diagnosis in Uyghur | T = 0.62 | C = 0.38 | [ | 807 | |
| Majority European (UK) | Forced word choice test, irregular word coding, and single-word spelling in English in dyslexic individuals | [ | |||||
| European (UK) | Single word reading and non-word repetition in English | [ | |||||
| Majority European (UK) | Forced word choice test, irregular word coding, single-word reading and single-word spelling in English | [ | |||||
| rs889869 | European (German) | Dyslexia diagnosis in German | G = 0.84 | A = 0.16 | [ | 809 | |
| rs2255526 | East Asian (Han Chinese) | Dyslexia diagnosis in Chinese | A = 0.78 | G = 0.22 | [ | 755 | |
| rs3743205 | East Asian (Han Chinese) | One minute reading, digit rapid naming, non-word repetition and left–right reversal in Chinese | C = 0.97 | T = 0.03 | [ | 813 | |
| rs57809907 | Majority European (UK) | Forced word choice test in English in SLI individuals | C = 0.99 | A = 0.01 | [ | 811 | |
| rs1800497 | European (US) | Artificial grammar learning | G = 0.60 | A = 0.41 | [ | 816 | |
| rs11629841 | East Asian (Han Chinese) | Character dictation and orthographic judgment in Chinese | T = 0.96 | G = 0.05 | [ | 764 | |
| rs1852469 | East Asian (Han Chinese) | Diagnosis of speech sound disorder in Chinese | A = 0.69 | T = 0.31 | [ | 756 | |
| rs2396722 | East Asian (Han Chinese) | Diagnosis of speech sound disorder in Chinese | T = 0.51 | C = 0.49 | [ | 739 | |
| rs6980093 | European (Italian) | Semantic fluency and single-word reading in Italian in dyslexic individuals | A = 0.62 | G = 0.38 | [ | 865 | |
| rs3756821 | East Asian (Uyghur) | Dyslexia diagnosis in Uyghur | C = 0.77 | T = 0.23 | [ | 798 | |
| East Asian (Han Chinese) | Dyslexia diagnosis in Chinese | [ | |||||
| Majority European (US) | General reading skills and text reading in English in SLI individuals | [ | |||||
| rs4504469 | East Asian (Han Chinese) | Dyslexia diagnosis in Chinese | C = 0.87 | T = 0.13 | [ | 804 | |
| Asian (Indians) | Dyslexia diagnosis in Hindi | [ | |||||
| Majority European (US) | General reading skills in English | [ | |||||
| Majority European (UK) | Dyslexia diagnosis in English | [ | |||||
| rs807507 | East Asian (Han Chinese) | Onset detection test in Chinese in dyslexic individuals | G = 0.80 | C = 0.20 | [ | 809 | |
| rs6803202 | Majority European (Australian) | Non-word repetition in English | C = 0.59 | T = 0.41 | [ | 784 | |
| rs9722 | European (German) | Spelling test in German | G = 0.68 | A = 0.32 | [ | 797 |
Major and minor allele frequencies are those in our sample. Examples of relevant studies for each gene are listed under References.
Demographic information and phenotype scores of participants.
| Variables | Mean (SD) | Range | Total N of cases |
|---|---|---|---|
| Gender(F/M) | 696/244 | 940 | |
| Musical training (Y/N) | 760/172 | 932 | |
| Nonverbal IQ | 108.00 | 85–132 | 920 |
| Family SES | 37.00 | 1–66 | 877 |
| L3 age (years old) | 19.98 | 18–25 | 940 |
| L1 | 4.94 | 3–7 | 640 |
| L2 | 5.18 | 3–7 | 640 |
| L3 | − 0.031 | − 3.16 to 5.00 | 857 |
| L3 external motivation | 0.016 | − 2.61 to 2.89 | 929 |
| L3 internal motivation | 0.001 | − 3.41 to 0.65 | 929 |
| L3 attitude | 0.020 | − 4.94 to 2.33 | 926 |
| L3 anxiety | − 0.002 | − 3.05 to 1.98 | 926 |
L1 and L2 proficiency were represented by the composite grades of the Chinese and English subjects in the HKDSE exam. Gender (F/M) = female/male. Music training (Y/N) = have/have not received at least 1 year of musical training.
Figure 1Metamodel of the structural equation model (SEM). Language proficiency of L1, L2, and L3 was added as latent variables with HKDSE Chinese and English scores and L3 Global scores as indicators, respectively. The significant SNPs in the final models of the stepwise regression and all non-genetic factors were entered into the SEM.
The final model of bi-directional stepwise regression analyses for L1 proficiency using the original dataset.
| Predictor | Gene | Major allele | Estimate | Confidence intervals | Uncorrected | FDR corrected | Partial eta squared | ∆R2 |
|---|---|---|---|---|---|---|---|---|
| (Intercept) | 7.49 | 4.51–10.47 | ||||||
| rs6980093 | 0.12 | − 0.03 to 0.26 | 0.110 | 0.121 | 0.006 | 0.003 | ||
| 0. | ||||||||
| rs3765502 | − 0.65 | − 1.38 to 0.08 | 0.082 | 0.121 | 0.007 | 0.004 | ||
| rs2255526 | 0.15 | − 0.03 to 0.34 | 0.106 | 0.121 | 0.006 | 0.004 | ||
| rs6803202 | − 0.11 | − 0.25 to 0.04 | 0.162 | 0.162 | 0.005 | 0.002 | ||
| rs9722 | 0.15 | − 0.01 to 0.31 | 0.074 | 0.121 | 0.008 | 0.005 | ||
| rs1087266 | − 0.63 | − 1.38 to 0.11 | 0.094 | 0.121 | 0.007 | 0.004 | ||
| 0. |
Gender, family SES, and three SNPs (rs1800497, rs6940827, rs6456593) are independently associated with L1 HKDSE grades. The original model included gender (Female = 1; Male = 0), music training (Yes = 1; No = 0), family SES, and the 28 hypothesized SNPs. * indicates p < 0.05 (uncorrected); ‡represents significant associations after FDR corrections for multiple comparisons.
Observations: 421.
R2/R2 adjusted: 0.119/0.096, p = 2.215e−07.
Significant values are in bold.
The final model of bi-directional stepwise regression analyses for L2 proficiency.
| Predictor | Gene | Major allele | Estimate | Confidence intervals | Uncorrected | FDR corrected p | Partial eta squared | ∆R2 |
|---|---|---|---|---|---|---|---|---|
| (Intercept) | 4.22 | 3.60–4.83 | ||||||
| Gender | 0.21 | 0.01–0.40 | 0.040* | 0.060 | 0.010 | 0.007 | ||
| rs2538976 | T | 0.13 | − 0.04 to 0.29 | 0.125 | 0.153 | 0.006 | 0.003 | |
| − | ||||||||
| rs4599626 | C | − 0.12 | − 0.28 to 0.05 | 0.155 | 0.155 | 0.005 | 0.002 | |
| rs9461045 | T | − 0.10 | − 0.24 to 0.03 | 0.136 | 0.153 | 0.005 | 0.003 |
Music training, family SES, and three SNPs on CNTNAP2 (rs2538991) and FOXP2 (rs6980093, rs1852469) are independently associated with L2 HKDSE grades. The original model included gender (Female = 1; Male = 0), music (Yes = 1; No = 0), family SES, and the 28 hypothesized SNPs. * indicates p < 0.05 (uncorrected); ‡represents significant associations after FDR corrections for multiple comparisons.
Observations: 421.
R2/R2 adjusted: 0.138/0.119, p = 6.72e−10.
Significant values are in bold.
The final model of bi-directional stepwise regression analyses for L3 proficiency with motivation variables included as additional predictors.
| Predictor | Gene | Major allele | Estimate | Confidence intervals | Uncorrected p | FDR corrected p | Partial eta squared | ∆R2 |
|---|---|---|---|---|---|---|---|---|
| (Intercept) | − 0.01 | − 0.35 to 0.32 | 0.939 | |||||
| External | − 0.06 | − 0.14 to 0.02 | 0.133 | 0.154 | 0.004 | 0.002 | ||
| rs2538991 | 0.09 | − 0.03 to 0.21 | 0.141 | 0.154 | 0.004 | 0.002 | ||
| rs4680 | − 0.09 | − 0.22 to 0.04 | 0.154 | 0.154 | 0.004 | 0.002 | ||
| rs6456593 | − 0.14 | − 0.26 to − 0.02 | 0.026 | 0.052 | 0.010 | 0.007 | ||
Internal motivation and S100B (rs9722) are independently associated with L3 Global Scores. The original model included gender (Female = 1; Male = 0), music training (Yes = 1; No = 0), family SES, external motivation, internal motivation, attitude, and anxiety, and 28 SNPs. * indicates p < 0.05 (uncorrected); ‡represents significant associations after FDR corrections for multiple comparisons.
Observations: 510.
R2/R2 adjusted: 0.088/0.077, p = 2.614e−08.
Significant values are in bold.
Path coefficients of structural equation models (SEMs) for L1, L2, and L3.
| Language | Path | Major allele | Gene | Unstandardized | Standardized | z value | P value | 95% CI |
|---|---|---|---|---|---|---|---|---|
| L1 | ||||||||
| Music | 0.067 | 0.024 | 0.431 | 0.666 | [− 0.238 to 0.372] | |||
| rs9722 | G | 0.068 | 0.041 | 0.924 | 0.355 | [− 0.076 to 0.213] | ||
| rs2538991 | C | 0.024 | 0.015 | 0.290 | 0.772 | [− 0.136 to 0.183] | ||
| rs6980093 | A | 0.023 | 0.014 | 0.173 | 0.862 | [− 0.234 to 0.28] | ||
| rs1852469 | A | 0.085 | 0.052 | 0.627 | 0.531 | [− 0.182 to 0.352] | ||
| rs6456593 | C | 0.093 | 0.057 | 1.172 | 0.241 | [− 0.063 to 0.249] | ||
| Gender | 0.160 | 0.072 | 1.648 | 0.099 | [− 0.030 to 0.350] | |||
| rs9722 | G | − 0.064 | − 0.043 | − 1.052 | 0.293 | [− 0.184 to 0.056] | ||
| rs6980093 | A | 0.157 | 0.113 | 1.548 | 0.122 | [− 0.042 to 0.356] | ||
| rs1800497 | G | − 0.047 | − 0.034 | − 0.784 | 0.433 | [− 0.164 to 0.070] | ||
| rs6940827 | G | − 0.043 | − 0.023 | − 0.516 | 0.606 | [− 0.205 to 0.120] | ||
| rs6456593 | C | − 0.085 | − 0.059 | − 1.396 | 0.163 | [− 0.205 to 0.035] | ||
| L3 | L1 | 0.008 | 0.009 | 0.164 | 0.870 | [− 0.084 to 0.099] | ||
| Gender | − 0.103 | − 0.048 | − 1.19 | 0.234 | [− 0.273 to 0.067] | |||
| Family SES | − 0.005 | − 0.085 | − 1.907 | 0.056 | [− 0.010 to 0.000] | |||
| Music | − 0.108 | − 0.045 | − 1.14 | 0.254 | [− 0.295 to 0.078] | |||
| Attitude | − 0.018 | − 0.02 | − 0.498 | 0.619 | [− 0.091 to 0.054] | |||
| Anxiety | 0.026 | 0.029 | 0.748 | 0.455 | [− 0.043 to 0.095] | |||
| External | − 0.033 | − 0.034 | − 0.858 | 0.391 | [− 0.108 to 0.042] | |||
| rs2538991 | C | 0.044 | 0.032 | 0.78 | 0.435 | [− 0.066 to 0.154] | ||
| rs6980093 | A | − 0.185 | − 0.135 | − 1.908 | 0.056 | [− 0.376 to 0.005] | ||
| rs1852469 | A | 0.145 | 0.102 | 1.434 | 0.152 | [− 0.053 to 0.343] | ||
| rs1800497 | G | − 0.079 | − 0.058 | − 1.376 | 0.169 | [− 0.191 to 0.033] | ||
| rs6940827 | G | 0.062 | 0.035 | 0.913 | 0.361 | [− 0.071 to 0.194] | ||
Gender and Music were coded as dummy variables with 1 = Female, 0 = Male; 1 = Have received at least 1 year of musical training, 0 = Have received less than 1 year of musical training or have not received any musical training at all. Both unstandardized and standardized beta coefficients between the two variables indicated by the path were reported. L1, L2, and L3 are latent variables of language proficiency with Chinese HKDSE grades, English HKDSE grades, and L3 Global scores as their indicators, respectively. In total, 609 participants were included in the SEM.
Significant values are in bold.
Risk alleles of SNPs that were reported to be linked with language abilities in the present and in the literature.
| SNPs | Gene | Risk allele in our study | Risk allele in the literature | Phenotypes | Population | Sample size | References |
|---|---|---|---|---|---|---|---|
| rs1800497 | A | A | Grammatical rule learning | European (USA) | 22 adults | [ | |
| rs1852469 | A | T | Speech sound disorder | East Asian (Han Chinese) | 150 patients with speech sound disorder and 140 healthy controls | [ | |
| rs6980093 | G | G | Expressive language , fluency | European (Italian) | 699 population-based cohort and 572 children with developmental dyslexia | [ | |
| rs2538991 | A | C | Specific Language Impairment (SLI) | European (USA) | 847 members of 184 families | [ | |
| rs6456593 | C | C | Developmental dyslexia (DD) | Asian (Uyghur) | 392 Uyghur children aged 8–12 years old | [ | |
| rs6940827 | G | G | Developmental dyslexia (DD) | Asian (Han Chinese) | 54 trios aged between 5 and 16 year | [ | |
| rs9722 | A | A | Developmental dyslexia (DD) | European (Finland, Germany and Sweden) | 100 participants with DD | [ |