| Literature DB >> 34315907 |
Penelope Benchek1, Robert P Igo1, Heather Voss-Hoynes1, Barbara Lewis2,3, Catherine M Stein4, Sudha K Iyengar5, Yvonne Wren6, Gabrielle Miller2, Barbara Truitt1, Wen Zhang7, Michael Osterman1, Lisa Freebairn2, Jessica Tag2, H Gerry Taylor8,9, E Ricky Chan1, Panos Roussos7,10.
Abstract
Speech sound disorders (SSD) manifest as difficulties in phonological memory and awareness, oral motor function, language, vocabulary, reading, and spelling. Families enriched for SSD are rare, and typically display a cluster of deficits. We conducted a genome-wide association study (GWAS) in 435 children from 148 families in the Cleveland Family Speech and Reading study (CFSRS), examining 16 variables representing 6 domains. Replication was conducted using the Avon Longitudinal Study of Parents and Children (ALSPAC). We identified 18 significant loci (combined p < 10-8) that we pursued bioinformatically. We prioritized 5 novel gene regions with likely functional repercussions on neural pathways, including those which colocalized with differentially methylated regions in our sample. Polygenic risk scores for receptive language, expressive vocabulary, phonological awareness, phonological memory, spelling, and reading decoding associated with increasing clinical severity. In summary, neural-genetic influence on SSD is primarily multigenic and acts on genomic regulatory elements, similar to other neurodevelopmental disorders.Entities:
Year: 2021 PMID: 34315907 PMCID: PMC8316336 DOI: 10.1038/s41525-021-00225-5
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Annotation of most significant loci with replication in CFSRS and ALSPAC.
| Locus (Chr location) | Gene(s) | Associated SNPsa | Independently associated SNPs (after conditional analysis)a | CFRS Lead SNP MAF | CFRS Lead SNP | ALSPAC best replication | Fisher combined | Locus priority score | Expression in brain/neural tissue | Associated with Communication and/or psych phenotype | Associated with multiple CFSRS traits | Promoter (muscle, brain, neural) | Enhancer (muscle, brain, neural) | eQTL (muscle, brain,neural) | Target of FOXP2 (brain) | Methylation / meQTL |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1:30732871 | LINC01648;MATN1 | 1 | 1 | 0.14 | 6.1 × 10−6 | 4.1 × 10−2 | 4.1 × 10−6 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | |
| 1:55494735* | BSND;PCSK9 | 5 | 1 | 0.42 | 1.6 × 10−2 | 3.8 × 10−8 | 1.4 × 10−8 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | |
| 1:146988760 | LINC00624 | 1 | 1 | 0.81 | 6.7 × 10−6 | 4.1 × 10−3 | 5.1 × 10−7 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 1 | |
| 1:159028378 | IFI16, AIM2 | 23 | 1 | 0.27 | 6.9 × 10−6 | 9.8 × 10−3 | 1.2 × 10−6 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 2 | |
| 2:143378805 | LRP1B;KYNU | 4 | 1 | 0.76 | 5.0 × 10−6 | 8.9 × 10−3 | 8.0 × 10−7 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | |
| 2:169280713 | STK39;CERS6 | 1 | 1 | 0.5 | 2.4 × 10−6 | 4.8 × 10−2 | 2.0 × 10−6 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | |
| 3:1942898 | CNTN6;CNTN4 | 1 | 1 | 0.15 | 6.3 × 10−6 | 1.8 × 10−2 | 1.9 × 10−6 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | |
| 3:39743136 | MOBP;MYRIP | 1 | 1 | 0.13 | 8.9 × 10−6 | 4.5 × 10−2 | 6.3 × 10−6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| 4:27297733 | LINC02261;MIR4275 | 9 | 1 | 0.09 | 6.1 × 10−7 | 3.4 × 10−2 | 3.6 × 10−7 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | |
| 4:73572756 | ADAMTS3;COX18 | 7 | 1 | 0.05 | 2.6 × 10−6 | 4.4 × 10−2 | 2.6 × 10−6 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | |
| 4:77531588 | SHROOM3 | 1 | 1 | 0.14 | 8.1 × 10−6 | 4.3 × 10−3 | 6.4 × 10−7 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | |
| 5:72144005 | TNPO1 | 1 | 1 | 0.43 | 2.1 × 10−6 | 3.6 × 10−2 | 1.3 × 10−6 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | |
| 5:132043351 | KIF3A | 1 | 1 | 0.48 | 4.0 × 10−6 | 4.7 × 10−3 | 3.6 × 10−7 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | |
| 5:170102906 | KCNIP1 | 2 | 1 | 0.12 | 5.7 × 10−6 | 3.5 × 10−2 | 3.4 × 10−6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| 5:172924967 | MIR8056;LOC285593 | 15 | 1 | 0.08 | 5.6 × 10−7 | 5.8 × 10−3 | 6.7 × 10−8 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | |
| 7:123604182 | SPAM1 | 10 | 1 | 0.13 | 7.4 × 10−6 | 1.6 × 10−2 | 2.3 × 10−6 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | |
| 7:154706515 | DPP6;PAXIP1-AS2 | 1 | 1 | 0.05 | 2.5 × 10−4 | 7.2 × 10−6 | 3.9 × 10−8 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| 9:114335864 | PTGR1;ZNF483 | 0 | 1 | 0.24 | 5.4 × 10−4 | 5.5 × 10−6 | 6.2 × 10−8 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | |
| 10:46027420 | MARCH8 | 2 | 1 | 0.39 | 1.6 × 10−6 | 3.1 × 10−2 | 9.1 × 10−7 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | |
| 12:21002703 | SLCO1B3 | 2 | 1 | 0.43 | 6.4 × 10−6 | 7.5 × 10−3 | 8.8 × 10−7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| 12:103677691** | LOC101929058; C12orf42 | 1 | 1 | 0.07 | 1.2 × 10−5 | 5.8 × 10−5 | 1.6 × 10−8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 12:131389783 | RAN; ADGRD1 | 1 | 1 | 0.33 | 7.2 × 10−6 | 4.4 × 10−3 | 5.8 × 10−7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 13:28329109 | POLR1D; GSX1 | 18 | 1 | 0.38 | 7.8 × 10−6 | 3.8 × 10−2 | 4.8 × 10−6 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | |
| 13:79839523 | LINC00331; RBM26 | 10 | 1 | 0.41 | 1.2 × 10−6 | 4.0 × 10−2 | 8.9 × 10−7 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | |
| 14:35837476 | PSMA6; NFKBIA | 26 | 1 | 0.19 | 1.1 × 10−6 | 2.1 × 10−2 | 4.4 × 10−7 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 2 | |
| 14:59210646 | DACT1; LINC01500 | 7 | 1 | 0.05 | 3.1 × 10−6 | 3.6 × 10−2 | 1.9 × 10−6 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | |
| 14:93195374 | LGMN | 1 | 1 | 0.14 | 8.1 × 10−6 | 4.4 × 10−2 | 5.7 × 10−6 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | |
| 14:94993936* | SERPINA12; SERPINA4 | 5 | 1 | 0.17 | 1.7 × 10−1 | 3.0 × 10−9 | 5.0 × 10−8 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | |
| 14:99858970 | BCL11B; SETD3 | 1 | 1 | 0.38 | 6.5 × 10−6 | 4.9 × 10−2 | 5.1 × 10−6 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 2 | |
| 16:77231207 | MON1B | 1 | 1 | 0.42 | 9.4 × 10−6 | 4.6 × 10−3 | 6.8 × 10−6 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 2 | |
| 18:4023876 | DLGAP1 | 1 | 1 | 0.19 | 2.9 × 10−6 | 3.9 × 10−2 | 1.9 × 10−6 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | |
| 18:40822793 | RIT2; SYT4 | 1 | 1 | 0.05 | 6.8 × 10−6 | 2.8 × 10−2 | 3.2 × 10−6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 18:56462735 | MALT1; LINC01926 | 1 | 1 | 0.14 | 8.8 × 10−6 | 1.1 × 10−2 | 1.5 × 10−6 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 |
aAssociated SNPs include those associated in CFSRS (p < 10−5) and Alspac (p < 0.05) or Fisher combined p < 10−7
*Alspac led locus. No CFSRS SNPs showed association at p < 10−5.
**CFSRS p = 1.3 × 10−5 and ALSPAC p = 5.8 × 10−5 (Fisher p = 1.6 × 10−8).
DACT1 is associated with multiple CFS traits, but not in SNPs that replicated.
Note Bold values represents the sum of the column to the right.
Characteristic table for CFSRS GWAS sample.
| 435 | |
| Number of families | 148 |
| Median age in children (sd) | 6.6 (5.7) |
| Median age in parents (sd) | 36 (8) |
| Female, | 194 (45%) |
| CAS | 47 (11%) |
| SSD + LI (no CAS) | 70 (16%) |
| SSD only | 119 (27%) |
| Lang only | 17 (4%) |
| No CAS/SSD/Lang | 177 (41%) |
| Missing | 5 (1%) |
| CAS | 43 (31%) |
| SSD + LI (no CAS) | 39 (28%) |
| SSD only | 49 (36%) |
| Lang only | 3 (2%) |
| No CAS/SSD/Lang | 3 (2%) |
| 1 (lowest) | 3 (1%) |
| 2 | 30 (7%) |
| 3 | 67 (15%) |
| 4 | 167 (38%) |
| 5 (highest) | 147 (34%) |
| Missing | 21 (5%) |
aSample considered is the union of all samples across the 16 tests. Specific test sample sizes and age ranges are shown in Supplementary Table 1.
Fig. 1Genetic correlation matrix across traits in CFSRS.
Figure 1 shows cross-trait correlation results for each pair of tests using GCTA’s bivariate REML analysis[69]. The cross-trait correlation was tested under the null hypothesis of 0 correlation. Circles shown are for results significant at p < 0.05, with increasing diameter/color corresponding with increasing correlation (circles omitted otherwise). Traits: Phonological memory MSW.PPC, MSW, NSW, NSW.PPC), Spelling (TWS), Reading (WRMT-ID, WRMT-AT, WIAT-RC), Language (CELF.E, CELF.R, WIAT.LC), Vocabulary (EOWPVT, PPVT), phonological awareness (Elision).
Fig. 2Locus zoom plots for most significant findings.
Figure 2 shows association results for the top loci. p-Values displayed are for CFSRS and are for the test for which the top SNP was observed. Circles show p-values for SNP associations and triangles show p-values for methylation associations (specifically those for which the top SNP is an meQTL). The plot shows the top SNP for each region ±200 kb. a IFI16 region. rs855865 was associated with NSW in CFSRS (p = 7 × 10−6) and with vocabulary (WISC-V) in ALSPAC (p = 0.01). This region also includes an meQTL (rs12124059, p = 4 × 10−8) for methylation marker cg07196514, and this methylation marker (cg07196514) was also associated with NSW (p = 0.018). b NFKBIA region. rs57645874 was associated with Elision in CFSRS (p = 1 × 10−6) and with reading accuracy (NARA-A) in ALSPAC (p = 0.02). This region also contains an meQTL, rs4981288, for cg07166546 (p = 2 × 10−50), and this methylation marker was associated with Elision (p = 3 × 10−5), TWS (p = 0.0005) and WRMT-ID (p = 0.002). c DACT1 region. rs856379 was associated with MSW in CFSRS (p = 3 × 10−6) and with nonword reading (ALSPACread) in ALSPAC (p = 0.036). This SNP is an meQTL for methylation marker cg13972423 (p = 3 × 10−5). d SETD3 region rs1257267 was associated with WRMT-AT in CFSRS (p = 6.58 × 10−6) and with nonsense word repetition (CNrep5) in ALSPAC (p = 0.05). While only 1 SNP replicated between CFSRS and ALSPAC, 14 additional SNPs showed association in CFSRS at p < 10−5. This SNP (rs1257267) is an meQTL for cg18949721 (p = 4 × 10−12), and methylation marker cg18949721 was also associated with WRMT-AT (p = 0.003). e MON1B region. rs4888606 was associated with MSW in CFSRS (p = 9 × 10−6) and with nonword reading (ALSPACread) in ALSPAC (p = 0.046). While only 1 SNP replicated between CFSRS and ALSPAC, 18 additional SNPs showed association in CFSRS at p < 10−5. This SNP (rs4888606) falls in an intron of MON1B and is an meQTL for cg06128999 (p = 4 × 10−23) and cg05007098 (p = 1 × 10−15); these 2 methylation markers were also associated with MSW (p = 0.045 and p = 0.12, respectively). Functional annotation is in Supplementary Fig. 2.
Fig. 3Polygenic risk scores across major domains.
We constructed polygenic risk scores for 587 individuals who were both genotyped and had clinical subgroup information available. Polygenic risk scores are displayed by quantile across the clinical subgroups for six endophenotypes representing the major domains (a Receptive language; b Expressive vocabulary; c Phonological awareness; d Phonological memory; e Spelling; f Reading decoding).