| Literature DB >> 25086666 |
Kaitlin E Samocha1, Elise B Robinson2, Stephan J Sanders3, Christine Stevens4, Aniko Sabo5, Lauren M McGrath6, Jack A Kosmicki7, Karola Rehnström8, Swapan Mallick9, Andrew Kirby10, Dennis P Wall11, Daniel G MacArthur10, Stacey B Gabriel12, Mark DePristo13, Shaun M Purcell14, Aarno Palotie15, Eric Boerwinkle16, Joseph D Buxbaum17, Edwin H Cook18, Richard A Gibbs5, Gerard D Schellenberg19, James S Sutcliffe20, Bernie Devlin21, Kathryn Roeder22, Benjamin M Neale2, Mark J Daly2.
Abstract
Spontaneously arising (de novo) mutations have an important role in medical genetics. For diseases with extensive locus heterogeneity, such as autism spectrum disorders (ASDs), the signal from de novo mutations is distributed across many genes, making it difficult to distinguish disease-relevant mutations from background variation. Here we provide a statistical framework for the analysis of excesses in de novo mutation per gene and gene set by calibrating a model of de novo mutation. We applied this framework to de novo mutations collected from 1,078 ASD family trios, and, whereas we affirmed a significant role for loss-of-function mutations, we found no excess of de novo loss-of-function mutations in cases with IQ above 100, suggesting that the role of de novo mutations in ASDs might reside in fundamental neurodevelopmental processes. We also used our model to identify ∼1,000 genes that are significantly lacking in functional coding variation in non-ASD samples and are enriched for de novo loss-of-function mutations identified in ASD cases.Entities:
Mesh:
Year: 2014 PMID: 25086666 PMCID: PMC4222185 DOI: 10.1038/ng.3050
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Evaluation of the rates of de novo mutations in ASD cases and unaffected siblings. The observed and expected rate of mutations by type per exome for unaffected siblings[2] and ASD cases, including some unpublished US and Finnish trios[2–6] (a). (b) The number of genes with multiple de novo mutations in unaffected siblings and ASD cases across studies. The average number of expected genes with multiple de novo mutations was determined by simulation. LoF = Loss-of-function. DNMs = de novo mutations.
| a) Genome-wide excesses of mutational events | ||||
|---|---|---|---|---|
|
| ||||
| Unaffected Siblings | ||||
|
| ||||
| Mutation Type | Observed events per exome | Expected events per exome | p-value | |
|
| ||||
| Synonymous | 0.21 | 0.27 | 0.0218 | Two-tailed |
| Missense | 0.61 | 0.62 | 0.8189 | Two-tailed |
| Loss-of-Function | 0.09 | 0.09 | 0.4508 | One-tailed |
|
| ||||
| n = 343 families | ||||
Individually significant genes identified from the analysis of de novo mutations in ASD cases. Genes with multiple loss-of-function (LoF) de novo mutations across 1,078 ASD cases. LoF mutations include nonsense, frameshift, and splice site-disrupting mutations. “# LoF Expected” refers to the expected number of de novo LoF mutations based on the probability of mutation for the gene as determined by our model. The genome-wide significance threshold is 1×10−6
| Gene | Mutations | # LoF Observed | # LoF Expected | p-value |
|---|---|---|---|---|
| DYRK1A | nonsense, splice, frameshift | 3 | 0.0072 | 6.15E-08 |
| SCN2A | nonsense, nonsense, frameshift | 3 | 0.0178 | 9.20E-07 |
| CHD8 | nonsense, splice, frameshift | 3 | 0.0221 | 1.76E-06 |
| KATNAL2 | splice, splice | 2 | 0.0049 | 1.19E-05 |
| POGZ | frameshift, frameshift | 2 | 0.0133 | 8.93E-05 |
| ARID1B | frameshift, frameshift | 2 | 0.0178 | 1.57E-04 |
Evaluation of the rates of de novo mutations in cases with intellectual disability. (a) The observed and expected rate of mutations by type per exome for cases of intellectual disability (ID)[9,10]. (b) The number of genes with multiple de novo mutations in intellectual disability cases across studies. The average number of expected genes with multiple de novo mutations was determined by simulation. LoF = Loss-of-function. DNMs = de novo mutations.
| a) Genome-wide excesses of mutational events | ||||
|---|---|---|---|---|
|
| ||||
| ID Cases | ||||
|
| ||||
| Mutation Type | Observed events per exome | Expected events per exome | p-value | |
|
| ||||
| Synonymous | 0.19 | 0.27 | 0.0267 | Two-tailed |
| Missense | 0.70 | 0.62 | 0.2380 | Two-tailed |
| Loss-of-Function | 0.24 | 0.09 | 6.49E-07 | One-tailed |
|
| ||||
| n = 151 families | ||||
Individually significant genes identified from the analysis of de novo mutations from patients with intellectual disability. Genes with multiple functional de novo mutations across 151 cases of intellectual disability (ID)[9,10]. Loss-of-function (LoF) mutations include nonsense, frameshift, and splice site-disrupting mutations. The genome-wide significance threshold is 1×10−6. The number of mutations is either compared to the expected number for LoF only or for both LoF and missense, as indicated by the “# DNMs Expected” and “Test” columns.
| Gene | Mutations | #LoF | #Missense | # DNMs Expected | p-value | Test |
|---|---|---|---|---|---|---|
| SYNGAP1 | splice/frameshift/frameshift | 3 | 0 | 0.0017 | 8.15E-10 | LoF |
| SCN2A | missense/nonsense/frameshift/frameshift | 3 | 1 | 0.0025 | 2.56E-09 | LoF |
| SCN2A | missense/nonsense/frameshift/frameshift | 3 | 1 | 0.0187 | 5.01E-09 | LoF+mis |
| STXBP1 | missense/missense/splice | 1 | 2 | 0.0071 | 5.87E-08 | LoF+mis |
| TCF4 | missense/missense | 0 | 2 | 0.0069 | 2.39E-05 | LoF+mis |
| GRIN2A | missense/missense | 0 | 2 | 0.0162 | 1.34E-04 | LoF+mis |
| TRIO | missense/missense | 0 | 2 | 0.0333 | 5.60E-04 | LoF+mis |
Figure 1The expected and observed fraction of genes with a de novo loss-of-function mutation in ASD cases and unaffected controls for four gene sets of interest[2–6,10,15]. “Betancur” refers to a set of genes reported as disrupted in individuals with ASD or autistic features; of the 112 on the list, we could evaluate 111[12]. “FMRP” refers to the genes whose mRNAs are bound and regulated by the Fragile X Mental Retardation Protein (FMRP), as identified by Darnell and colleagues[13]. The “constrained” category is a set of 1,003 genes that we defined as significantly lacking rare missense variation, indicating intolerance to mutation. The targets of FMRP that are also considered constrained by our metric make up the “Constrained FMRP” category. * indicates p < 0.01; ** indicates p < 10−4.
Figure 2The distribution of missense Z scores and Z scores of de novo loss-of-function mutations identified in unaffected individuals, autism spectrum disorder (ASD) cases, and intellectual disability cases. (a) The distribution of missense Z scores. The red line indicates a Z score of 3.09, or the threshold for inclusion into the set of 1,003 constrained genes. (b) The missense Z scores for genes containing de novo LoF in unaffected individuals, ASD cases, and intellectual disability cases[2–6,9,10,15]. Black bars indicate the mean Z score of each group: 0.94, 1.68, and 2.46 for unaffected individuals, ASD cases, and intellectual disability cases, respectively. While the missense Z scores of the de novo LoF mutations found in unaffected siblings matched the overall distribution (Wilcoxon p=0.8325, n.s. = not significant), de novo LoF mutations found in both ASD and intellectual disability cases were significantly shifted towards more extreme constraint values (p < 10−6 for both). All p-values for deviation from the overall distribution are listed on the right side of the figure in bold. In addition, the distribution of missense Z scores between each of the three de novo lists were all individually significant at p < 0.05.