| Literature DB >> 28866788 |
Andrew K Adams1, Shelley D Smith2, Dongnhu T Truong3, Erik G Willcutt4,5, Richard K Olson4,5, John C DeFries4,5, Bruce F Pennington6, Jeffrey R Gruen7,8.
Abstract
Eleven loci with prior evidence for association with reading and language phenotypes were sequenced in 96 unrelated subjects with significant impairment in reading performance drawn from the Colorado Learning Disability Research Center collection. Out of 148 total individual missense variants identified, the chromosome 7 genes CCDC136 and FLNC contained 19. In addition, a region corresponding to the well-known DYX2 locus for RD contained 74 missense variants. Both allele sets were filtered for a minor allele frequency ≤0.01 and high Polyphen-2 scores. To determine if observations of these alleles are occurring more frequently in our cases than expected by chance in aggregate, counts from our sample were compared to the number of observations in the European subset of the 1000 Genomes Project using Fisher's exact test. Significant P values were achieved for both CCDC136/FLNC (P = 0.0098) and the DYX2 locus (P = 0.012). Taken together, this evidence further supports the influence of these regions on reading performance. These results also support the influence of rare variants in reading disability.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28866788 PMCID: PMC5702371 DOI: 10.1007/s00439-017-1838-z
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 1Depiction of the DCDC2/KIAA0319 region sequenced. Gene sizes are proportional per sizes listed in human genome build GRCh37/hg19. Intergenic distances are approximate, total region length approximately 1.04 Mbp
Loci sequenced for this study
| Locus | Chr. Loc. | Sequenced coordinates | Size (Mb) | Rationale for selection |
|---|---|---|---|---|
|
| 6p22.3 | 24,124,528–25,166,900 | 1.04 | Reading candidatea (Meng et al. |
|
| 7q31.1 | 113,708,455–114,351,800 | 0.643 | Reading candidatea (Peter et al. |
|
| 7q32.1 | 128,417,100–128,501,990 | 0.085 | Reading candidatea (Gialluisi et al. |
|
| 7q35-q36.1 | 145,792,593–148,183,542 | 2.4 | Reading and language candidatea,b (Vernes et al. |
|
| 9q34.2 | 136,498,622–136,524,722 | 0.026 | Preliminary association with reading (data not shown) |
|
| 11q23.2 | 113,279,322–113,350,600 | 0.071 | Language candidateb (Eicher et al. |
|
| 13q21.32 | 66,843,132–67,813,032 | 0.97 | Language candidateb (Bartlett et al. |
|
| 15q21.1 | 47,458,034–48,078,585 | 0.628 | Language candidateb (Ercan-Sencicek et al. |
|
| 15q21.3 | 55,647,446–55,819,300 | 0.303 | RD candidatec (Taipale et al. |
|
| 22q11.21 | 36,129,602–36,432,802 | 0.309 | Reading and language candidatea,b (Grigorenko et al. |
|
| 22q12.3 | 19,922,500–19,961,200 | 0.172 | Reading candidatea (Gialluisi et al. |
Each locus is named for the major associated gene(s) present. Columns are chromosomal location, sequenced coordinates in NCBIv37/hg19, approximate size of the locus in Mb, and rationale for selection. Genes selected based on literature support include corresponding citation
a Association with reading component phenotype or preliminary association evidence
b Association with a language phenotype or SLI
c Association with RD as a categorical trait
Observed missense variant counts in each sequenced locus
| Locus | # of Genes | # of Variants |
|---|---|---|
|
| 9 | 74 |
|
| 1 | 3 |
|
| 2 | 19 |
|
| 1 | 2 |
|
| 1 | 10 |
|
| 1 | 7 |
|
| 1 | 7 |
|
| 1 | 6 |
|
| 2 | 10 |
|
| 2 | 8 |
|
| 4 | 2 |
| Total | 25 | 148 |
Loci are named for the major associated gene(s) present. The number of genes in each individual locus is also reported
Variants in CCDC136 and FLNC with MAF <0.01 and PolyPhen-2 scores greater than 0.75
| Variant (11 total) | Chr | Position | Gene | Rs ID | AA change | SIFT | PolyPhen | CDS Obs | 1KGE Obs | gnomAD |
|---|---|---|---|---|---|---|---|---|---|---|
| 7:128441492-SNV | 7 | 128441492 |
| rs185493260 | Arg/His | 0 | 1 | 3 | 5 | 2.43 × 10−3 |
| 7:128434498-SNV | 7 | 128434498 |
| rs143537881 | Gly/Asp | 0.01 | 0.923 | 2 | 6 | 1.03 × 10−2 |
| 7:128449696-SNV | 7 | 128449696 |
| rs142055253 | Lys/Glu | 0 | 0.792 | 1 | 3 | 2.38 × 10−3 |
| 7:128446877-Frameshift | 7 | 128446877 |
| Novel | N/A | N/A | N/A | 1 | 0 | N/A |
| 7:128483880-SNV | 7 | 128483880 |
| rs768103657 | Gly/Arg | 0 | 1 | 1 | 0 | 7.41 × 10−5 |
| 7:128480184-SNV | 7 | 128480184 |
| rs189525930 | Gly/Arg | 0 | 0.981 | 1 | 3 | 1.29 × 10– 3 |
| 7:128485240-SNV | 7 | 128485240 |
| rs146953558 | Arg/Cys | 0.02 | 0.953 | 1 | 7 | 9.52 × 10−3 |
| 7:128483367-SNV | 7 | 128483367 |
| rs374983276 | Arg/Cys | 0 | 0.932 | 1 | 0 | 2.48 × 10−4 |
| 7:128491324-SNV | 7 | 128491324 |
| rs181067717 | Arg/Cys | 0.01 | 0.901 | 2 | 5 | 8.17 × 10−3 |
| 7:128481578-SNV | 7 | 128481578 |
| rs34972246 | Asp/Ala | 0 | 0.898 | 1 | 5 | 4.97 × 10−3 |
| 7:128494547-SNV | 7 | 128494547 |
| rs202223616 | Glu/Gln | 0.11 | 0.81 | 1 | 1 | 1.15 × 10−3 |
| Variant | 15 | 35 | ||||||||
| Reference | 177 | 971 |
Each variant is listed with chromosome, genomic location in hg19 coordinates, gene, rsID (if one has been assigned), SIFT score, PolyPhen-2 score, number of observations in our RD data set (CDS), number of observations in European subjects (N = 503) from the 1000 Genomes Project, and allele frequency in the gnomAD database. Amino acid changes are presented using the standard three-letter codes as reference/alternate. Variants without an rsID are novel to this study. Allele counts (variant and reference) are presented at the bottom of the table. Two-tailed Fisher’s exact test P value = 0.0098
Variants in DCDC2/KIAA0319 locus with MAF <0.01 and PolyPhen-2 scores greater than 0.75
| Variant (15 total) | Chr | Position | Gene | Rs ID | AA change | SIFT | PolyPhen | CDS Obs | 1KGE Obs | gnomAD |
|---|---|---|---|---|---|---|---|---|---|---|
| 6:24520644-SNV | 6 | 24520644 |
| rs149482918 | Ala/Thr | 0.03 | 0.986 | 1 | 0 | 7.89 × 10−6 |
| 6:24520719-SNV | 6 | 24520719 |
| rs115784602 | Val/Met | 0.01 | 0.955 | 1 | 4 | 4.96 × 10−3 |
| 6:24178856-SNV | 6 | 24178856 |
| rs766600237 | Pro/Leu | 0.03 | 0.874 | 1 | 0 | 3.17 × 10−5 |
| 6:24174966-SNV | 6 | 24174966 |
| rs145154884 | Val/Met | 0 | 0.791 | 1 | 1 | 5.53 × 10−5 |
| 6:24848281-SNV | 6 | 24848281 |
| rs375981215 | Thr/Lys | 0 | 1 | 1 | 0 | 4.73 × 10−5 |
| 6:24495234-SNV | 6 | 24495234 |
| rs200793796 | Cys/Gly | 0.02 | 0.991 | 2 | 10 | 9.39 × 10−3 |
| 6:24472814-SNV | 6 | 24472814 |
| rs62400471 | Arg/Cys | 0.02 | 0.912 | 1 | 1 | 6.39 × 10−4 |
| 6:24476431-SNV | 6 | 24476431 |
| rs61754637 | Asn/Ser | 0.04 | 0.771 | 1 | 3 | 2.10 × 10−3 |
| 6:24559241-SNV | 6 | 24559241 |
| rs773691612 | Gly/Arg | 0.24 | 0.956 | 1 | 0 | 3.24 × 10−5 |
| 6:24596526-SNV | 6 | 24596526 |
| rs148566925 | Pro/Thr | 0.06 | 0.869 | 1 | 4 | 4.87 × 10−3 |
| 6:24566953-SNV | 6 | 24566953 |
| rs113411083 | Arg/Trp | 0.08 | 0.835 | 1 | 6 | 4.68 × 10−3 |
| 6:24596675-SNV | 6 | 24596675 |
| rs201302072 | Arg/Leu | 0.15 | 0.778 | 1 | 0 | 1.58 × 10−4 |
| 6:24412567-SNV | 6 | 24412567 |
| rs200245617 | Val/Ile | 0.12 | 0.973 | 1 | 0 | 4.39 × 10−4 |
| 6:24667009-SNV | 6 | 24667009 |
| rs146127613 | Leu/Val | 0.01 | 0.923 | 1 | 4 | 3.00 × 10−3 |
| 6:24667074-SNV | 6 | 24667074 |
| rs61760186 | Cys/Tyr | 0.35 | 0.781 | 1 | 5 | 4.47 × 10−3 |
| Variant | 16 | 38 | ||||||||
| Reference | 176 | 968 |
Each variant is listed with chromosome, genomic location in hg19 coordinates, gene, rsID (if one has been assigned), SIFT score, PolyPhen-2 score, number of observations in our RD data set (CDS), number of observations in European subjects (N = 503) from the 1000 Genomes Project, and allele frequency in the gnomAD database. Amino acid changes are presented using standard three-letter codes as reference/alternate. Variants and reference alleles are presented as counts out of number of observed alleles. Two-tailed Fisher’s exact test P value = 0.0118
Fig. 2Heatmap of LD across CCDC136/FLNC. Coloring is based on D’ values with darker gray indicating high LD between marker pairs. Genes are aligned to markers on the heatmap, but are not accurately scaled. The major LD block joining CCDC136 and FLNC is outlined in black