| Literature DB >> 29666269 |
Mellissa M C DeMille1, Kevin Tang2, Chintan M Mehta1, Christopher Geissler3, Jeffrey G Malins1,4, Natalie R Powers5, Beatrice M Bowen5, Andrew K Adams5, Dongnhu T Truong1, Jan C Frijters6, Jeffrey R Gruen7,5,8.
Abstract
DCDC2 is a gene strongly associated with components of the phonological processing system in animal models and in multiple independent studies of populations and languages. We propose that it may also influence population-level variation in language component usage. To test this hypothesis, we investigated the evolution and worldwide distribution of the READ1 regulatory element within DCDC2, and compared its distribution with variation in different language properties. The mutational history of READ1 was estimated by examining primate and archaic hominin sequences. This identified duplication and expansion events, which created a large number of polymorphic alleles based on internal repeat units (RU1 and RU2). Association of READ1 alleles was studied with respect to the numbers of consonants and vowels for languages in 43 human populations distributed across five continents. Using population-based approaches with multivariate ANCOVA and linear mixed effects analyses, we found that the RU1-1 allele group of READ1 is significantly associated with the number of consonants within languages independent of genetic relatedness, geographic proximity, and language family. We propose that allelic variation in READ1 helped create a subtle cognitive bias that was amplified by cultural transmission, and ultimately shaped consonant use by different populations over time.Entities:
Keywords: DCDC2; READ1; genetics; language; phoneme
Mesh:
Substances:
Year: 2018 PMID: 29666269 PMCID: PMC5948951 DOI: 10.1073/pnas.1710472115
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Alignment of READ1 repeat units in primates. Repeat units are depicted as colored dots for 4-bp motifs, and as rectangles for longer motifs, and are labeled with the same colors used in . Human-1 (an RU1-2 allele) is the most common READ1 allele, Human-3 is the most common of the RU1-1 alleles, and Human-4 is identical to the Neanderthal allele. The Denisovan allele is similar to human alleles. Chimpanzee (Pan paniscus and P. trogdolytes) has two alleles that differ by one GGAA repeat; the longest is represented here. Gorilla gorilla lacks the final “A” in the first motif, represented by the red X at the end of the motif.
Fig. 2.Distribution of RU1-1, RU1-2, and deletion of READ1 in 43 world populations. Populations used in this study are distributed on the y axis in an approximation of the geographic distribution out of Africa and colored by the continental group to which they were assigned in this study. The x axis shows the distribution of frequencies of three major categories of READ1 alleles: RU1-1 in red, RU1-2 in blue, and the microdeletion in green.
Fig. 3.Linear regressions of consonants and vowels by RU1-1 frequencies. Log10 transformed numbers of consonants (solid dots) and vowels (open triangles) were regressed separately against the frequencies of RU1-1 alleles in 43 populations.
Multivariate ANCOVA examining the relationship between the number of consonants and vowels, compared with the frequency of RU1-1, RU1-2, and the deletion
| Variable | Pillai | ||
| RU1-1 | 0.229 | 5.51 | 0.024 |
| RU1-2 | 0.006 | 0.12 | 0.888 |
| Deletion | 0.023 | 0.44 | 0.888 |
P ≤ 0.05.
FDR corrected.
Post hoc univariate analysis of the effect of RU1-1 on vowels and consonants independently while controlling for PCs1-3
| Response | β | ||
| Vowels | −0.61 | −0.8 | 0.471 |
| Consonants | 1.76 | 3.2 | 0.003 |
P ≤ 0.01.
Results of a linear mixed effects model using RU1-1 as the response variable with PCs 1–3 as fixed effects, language family, and continental grouping as random effects, and using chromosome numbers as weights
| Predictor | β | |||
| Vowels | −0.002 | −2.01 | 0.046 | (−0.005, 0.0002) |
| Consonants | 0.002 | 3.14 | 0.004 | (0.0008, 0.003) |
Linear mixed effects model using RU1-1 (95% CI). Bias was less than 10−6 for both parameters.
P value under t distribution with 2 and 37 degrees of freedom.
Bootstrap estimation of error using 10,000 replicates with 95% confidence intervals.