| Literature DB >> 28628100 |
Madeleine R Geisheker1, Gabriel Heymann2,3, Tianyun Wang4, Bradley P Coe1, Tychele N Turner1, Holly A F Stessman1, Kendra Hoekzema1, Malin Kvarnung5,6, Marie Shaw7, Kathryn Friend7,8, Jan Liebelt9, Christopher Barnett9,10, Elizabeth M Thompson9,11, Eric Haan9,11, Hui Guo4, Britt-Marie Anderlid5,6, Ann Nordgren5,6, Anna Lindstrand5,6, Geert Vandeweyer12, Antonino Alberti13, Emanuela Avola13, Mirella Vinci14, Stefania Giusto15, Tiziano Pramparo16, Karen Pierce16, Srinivasa Nalabolu16, Jacob J Michaelson17, Zdenek Sedlacek18, Gijs W E Santen19, Hilde Peeters20, Hakon Hakonarson21,22,23, Eric Courchesne16, Corrado Romano13, R Frank Kooy12, Raphael A Bernier3, Magnus Nordenskjöld5,6, Jozef Gecz7,24, Kun Xia4, Larry S Zweifel2,3, Evan E Eichler1,25.
Abstract
Although de novo missense mutations have been predicted to account for more cases of autism than gene-truncating mutations, most research has focused on the latter. We identified the properties of de novo missense mutations in patients with neurodevelopmental disorders (NDDs) and highlight 35 genes with excess missense mutations. Additionally, 40 amino acid sites were recurrently mutated in 36 genes, and targeted sequencing of 20 sites in 17,688 patients with NDD identified 21 new patients with identical missense mutations. One recurrent site substitution (p.A636T) occurs in a glutamate receptor subunit, GRIA1. This same amino acid substitution in the homologous but distinct mouse glutamate receptor subunit Grid2 is associated with Lurcher ataxia. Phenotypic follow-up in five individuals with GRIA1 mutations shows evidence of specific learning disabilities and autism. Overall, we find significant clustering of de novo mutations in 200 genes, highlighting specific functional domains and synaptic candidate genes important in NDD pathology.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28628100 PMCID: PMC5539915 DOI: 10.1038/nn.4589
Source DB: PubMed Journal: Nat Neurosci ISSN: 1097-6256 Impact factor: 24.884
Figure 1Burden and recurrence of de novo missense mutations
Bars for cases and controls represent observed data and error bars indicate the 95% confidence interval (CI) for the observed proportions (Clopper-Pearson method). Box-and-whisker plots for downsampled cases represent the distribution of one million permutations. Boxes show interquartile range (IQR) with lines at the median and whiskers are 1.5 times the IQR. Asterisks indicate p < 0.05. a) 4,301 out of 8,477 cases (50.7%) and 1,042 out of 2,178 controls (47.8%) have one or more de novo missense mutations (denovo-db v.0.9) that are rare in the general population (MAF < 0.1% in ESP). The fraction of individuals with one or more de novo missense mutation is significantly higher in cases (p = 0.016, OR = 1.12, two-sided Fisher’s exact test) even after downsampling (empirical p = 9.22×10−4, OR = 1.12 [1.06–1.19]). b) The number of genes with two or more mutations in downsampled cases is significantly greater than controls (empirical p = 0.011, OR = 1.26 [1.10–1.42]), as is the number of genes with three or more mutations (empirical p = 3.1×10−5, OR = 3.13 [2.22–4.03]).
Figure 2Severity of de novo missense mutations
a) De novo missense mutations are more likely to be deleterious in cases (n = 5,807 mutations) versus controls (n = 1,475 mutations) and the positive likelihood ratio increases as severity increases (as measured by Combined Annotation Dependent Depletion (CADD) score). b) The distribution of CADD scores skews significantly as the number of de novo missense mutations per gene in cases increases (p = 5.87×10−29, one-way ANOVA) indicating an enrichment for genes with pathogenic mutations. Boxes show IQR with notches representing the 95% CI of the median; whiskers are 1.5 times the IQR. Circles are outliers.
New recurrent mutations at targeted missense sites.
| Gene | Site | Alternate | Protein ID | Mutations in | Codon | Codon | Mutations identified with | Total | Codon | Codon | ExAC v.0.3 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Inherited | Unknown | ||||||||||||
| p.Arg203 | Trp | NP_060496.2 | 6 | 1.03E-24 | 1.13E-17 | 2 (St, Tr) | 1 (Ad) | 8 | 1.37E-29 | 1.51E-22 | 1 | ||
| p.Glu198 | Lys | NP_006236.1 | 4 | 3.84E-18 | 4.22E-11 | 1 (Tr) | 4 | 3.48E-16 | 3.83E-09 | 0 | |||
| p.Asn107 | Ser, Thr | NP_060936.1 | 5 | 5.26E-17 | 5.79E-10 | 1 (Tr) | 5 | 1.47E-14 | 1.62E-07 | 0 | |||
| p.Arg937 | Cys, His | NP_001035232.1 | 3 | 9.47E-12 | 1.04E-04 | 1 (ACGC) | 1 (ACGC) | 4 | 8.26E-14 | 9.09E-07 | 0 | ||
| p.Ile500 | Val, Thr | NP_005350.1 | 2 | 1.12E-07 | 1 | 2 (An, Le) | 4 | 1.88E-13 | 2.07E-06 | 0 | |||
| p.Gly503 | Arg, Glu | NP_002825.3 | 3 | 1.66E-11 | 1.83E-04 | 1 | 3 | 5.10E-10 | 5.62E-03 | 0 | |||
| p.Ala636 | Thr | NP_000818.2 | 2 | 1.11E-07 | 1 | 1 (St) | 2 (St) | 3 | 4.89E-10 | 5.39E-03 | 0 | ||
| p.Arg379 | His | NP_001035232.1 | 2 | 7.39E-08 | 8.14E-01 | 1 (Ad) | 2 | 7.04E-07 | 1 | 0 | |||
| p.Arg718 | Trp, Gln | NP_001821.2 | 2 | 9.57E-08 | 1 | 1 (St) | 2 | 9.11E-07 | 1 | 0/2 | |||
| p.Arg230 | Cys | NP_004510.1 | 2 | 3.36E-07 | 1 | 1 (Tr) | 2 | 3.20E-06 | 1 | 0 | |||
| p.Arg473 | Gln | NP_037382.2 | 2 | 6.54E-07 | 1 | 3 (Ad) | 2 | 3.49E-06 | 1 | 5 | |||
| p.Glu590 | Lys | NP_056082.2 | 2 | 5.38E-07 | 1 | 1 (Tr) | 2 | 5.12E-06 | 1 | 0 | |||
Bonferroni family-wise error rate (FWER) correction based on 1.1E7 codons in genome.
In two affected siblings.
Allele in denovo-db v.0.9 has 0 occurrences in ExAC; allele found with smMIPs has been seen twice.
ACGC, Autism Clinical and Genetic Resources in China; Ad, Adelaide; AGRE, Autism Genetic Resource Exchange; An, Antwerp; Le, Leuven; St, Stockholm; Tr, Troina
Figure 3Recurrent mutations fall in or near functional domains
Published mutations in NDD patients are above the protein and new mutations identified by targeted sequencing are below the protein. De novo mutations (lightning bolt) and paternally inherited mutations (blue) are indicated. Inheritance is unknown for the remaining mutations. Protein domains are from UniProt. a) PACS1, NP_060496.2. b) ALG13, NP_060936.1. c) SMAD4, NP_005350.1. d) SATB2, NP_001165988.1.
Rare* clustered missense mutations identified by targeted sequencing (CADD >20).
| Gene | Site or cluster | Protein ID | denovo- | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| N | Missense | N | Missense | N | Missense | Known | ||||
| p.Arg203 | NP_060496.2 | 6 | 3023 | 0 | 17689 | 4 | 26166 | 10 | 8 | |
| p.Glu198 | NP_006236.1 | 4 | 3023 | 1 | 17689 | 3 | 26166 | 7 | 4 | |
| p.Arg937 | NP_001035232.1 | 3 | 3023 | 0 | 17689 | 5 | 26166 | 8 | 5 | |
| p.Gln264 | NP_066288.2 | 2 | 3023 | 0 | 17689 | 7 | 26166 | 9 | 5 | |
| p.Asn107 | NP_060936.1 | 5 | 3023 | 0 | 17689 | 1 | 26166 | 6 | 5 | |
| p.Ala636 | NP_000818.2 | 2 | 3023 | 0 | 17689 | 5 | 26166 | 7 | 4 | |
| p.Ser260 | NP_001123577.1 | 3 | 3023 | 0 | 17689 | 2 | 26166 | 5 | 3 | |
| p.Gly214-p.Asn215 | NP_055006.1 | 3 | 169 | 0 | 11631 | 1 | 20108 | 4 | 3 | |
| p.Leu219-p.Gly220 | NP_066288.2 | 2 | 169 | 0 | 11631 | 2 | 20108 | 4 | 3 | |
| p.Arg399-p.Glu402 | NP_001165988.1 | 2 | 169 | 0 | 11631 | 3 | 20108 | 5 | 3 | |
| p.Arg182 | NP_055040.2 | 2 | 3023 | 0 | 17689 | 2 | 26166 | 4 | 3 | |
| p.Arg1617-p.Gly1625 | NP_055006.1 | 2 | 169 | 0 | 11631 | 2 | 20108 | 4 | 3 | |
| p.Gly503 | NP_002825.3 | 3 | 3023 | 0 | 17689 | 6 | 26166 | 9 | 3 | |
| p.Gly544 | NP_001027392.1 | 2 | 3023 | 0 | 17689 | 5 | 26166 | 7 | 3 | |
| p.Thr47-p.Cys48 | NP_075044.2 | 2 | 169 | 0 | 11631 | 4 | 20108 | 6 | 2 | |
| p.Arg230 | NP_004510.1 | 2 | 3023 | 0 | 17689 | 3 | 26166 | 5 | 3 | |
| p.Trp1848 | NP_003487.1 | 2 | 3023 | 2 | 17689 | 9 | 26166 | 11 | 3 | |
| p.Val641-p.Asn647 | NP_009029.3 | 2 | 169 | 0 | 11631 | 2 | 20108 | 4 | 2 | |
| p.Thr379-p.Glu387 | NP_060142.3 | 2 | 169 | 0 | 11631 | 2 | 20108 | 4 | 2 | |
| p.Arg853 | NP_001035232.1 | 2 | 3023 | 1 | 17689 | 2 | 26166 | 4 | 2 | |
| p.Thr4 | NP_001012662.1 | 2 | 3023 | 0 | 11113 | 3 | 19590 | 5 | 2 | |
| p.Arg379 | NP_001035232.1 | 2 | 3023 | 0 | 17689 | 6 | 26166 | 8 | 2 | |
| p.Val381-p.Arg389 | NP_001165988.1 | 2 | 169 | 0 | 11631 | 1 | 20108 | 3 | 2 | |
| p.Ile500 | NP_005350.1 | 2 | 3023 | 0 | 17689 | 6 | 26166 | 8 | 4 | |
| p.Ser173-p.Gly176 | NP_057029.8 | 2 | 169 | 0 | 11631 | 3 | 20108 | 5 | 2 | |
| p.Arg718 | NP_001821.2 | 2 | 3023 | 1 | 17689 | 4 | 26166 | 6 | 2 | |
| p.Pro65 | NP_009075.1 | 2 | 3023 | 0 | 17689 | 5 | 26166 | 7 | 2 | |
| p.Ser1380-p.Glu1389 | NP_001243398.1 | 2 | 169 | 0 | 11631 | 4 | 20108 | 6 | 2 | |
| p.Pro1461 | NP_009049.2 | 2 | 3023 | 0 | 17689 | 1 | 26166 | 3 | 2 | |
| p.Pro1219-p.Val1221 | NP_004377.2 | 2 | 169 | 0 | 11631 | 8 | 20108 | 10 | 2 | |
| p.Thr267-p.Arg269 | NP_001161744.1 | 2 | 169 | 0 | 11631 | 1 | 20108 | 3 | 2 | |
| p.Glu590 | NP_056082.2 | 2 | 3023 | 1 | 17689 | 3 | 26166 | 5 | 2 | |
| p.Arg473 | NP_037382.2 | 2 | 3023 | 1 | 11113 | 10 | 19590 | 12 | 2 | |
| p.Arg525 | NP_003061.3 | 2 | 3023 | 0 | 17689 | 3 | 26166 | 5 | 2 | |
| p.Trp271 | NP_006584.1 | 1 | 3023 | 0 | 17689 | 3 | 26166 | 4 | 1 | |
| p.Arg964 | NP_055791.1 | 1 | 3023 | 0 | 17689 | 8 | 26166 | 9 | 1 | |
|
| ||||||||||
Minor allele frequency (MAF) < 0.01% in ExAC v.0.3 and ESP v.0.0.30.
denovo-db v.0.9 and smMIPs.
Figure 4Functional effect of recurrent GRIA1 missense mutations
a) Linear representation of annotated domains in the protein GRIA1, a.k.a. GluA1 (NP_001244950.1) as defined in UniProt. b) A recurrent mutation observed only in NDD cases (n = 6 patients) falls within a highly conserved M3 transmembrane domain, important in channel gating. The alanine mutated in these patients (red) is homologous to the one that causes severe ataxia in Lurcher mice in the delta-2 subunit of this receptor (GRID2). c–e) Example current traces from a 1.8s voltage-ramp from −100mV to +80mV for c) WT, d) A636T, and e) heteromeric WT/A636T transfected HEK cells. The three current traces per panel correspond to voltage-ramp currents in the presence of normal extracellular solution (NaCl), extracellular solution supplemented with 50 uM NBQX (NBQX), and the isolated GluA1 dependent current determined by subtracting the NBQX current from the NaCl current (NaCl – NBQX). f) Average leak current at −60mV. The GluA1-mediated current (NaCl – NBQX) was determined at −60mV and averaged across cells (n = 5 (WT), 7 (A636T), and 5 (heteromeric); p = 0.024, F = 4.91, 2 degrees of freedom, one-way ANOVA). Data are mean +/− S.E.M.
Figure 5Proteins with excessive clustering of missense mutations in NDD cases
The pattern of de novo missense mutations in cases with NDDs is contrasted with rare missense variants from the 1000 Genomes Project (1KG) and private missense mutations from the Exome Aggregation Consortium (ExAC) excluding neuropsychiatric cases. Missense mutations are colored by severity (CADD heatmap) and recurrent de novo mutations at a specific amino acid position are indicated (triangle). Significance of clustering was calculated based on comparison to ExAC using CLUMP. a) GRIN1 (NP_001172019.1) shows greater missense mutation clustering in NDD patients (CLUMP = 1.68, p = 0.013) with region-specific significance corresponding to the transmembrane domains (amino acids 550–845; Fisher’s exact test p = 5.6×10−8). b) Similarly, missense mutations cluster for GRIN2B (NP_000825.2; CLUMP = 1.34, p = 0.003) in particular between the second and fourth transmembrane domains (amino acids 600–820; p = 2.0×10−9). c) KCNK3 (NP_002237.1) patient missense mutations cluster (CLUMP = 0.54, p = 0.036) near the first transmembrane domain (amino acids 120–140, p = 9.4×10−5, OR = Inf). The average per-base rate of ExAC samples with ≥10× coverage across the exon harboring mutations in cases was 79.1%. d) KCNQ2 (NP_742105.1) shows several missense mutation hotspots (CLUMP = 0.36, p < 1×10−3) corresponding to the pore and voltage sensor of the channel (amino acids 200–350, p = 2.0×10−14). e) Finally, patients show more severe CTCF (NP_006556.1) missense mutations that cluster (CLUMP = 1.0, p = 0.007) at two locations between the fourth and seventh C2H2 zinc finger motifs (amino acids 350–490, p = 9.1×10−8).