| Literature DB >> 31932770 |
Daniel P Howrigan1,2, Samuel A Rose3,4, Kaitlin E Samocha5,3, Menachem Fromer3,4, Felecia Cerrato3, Wei J Chen6, Claire Churchhouse5,3, Kimberly Chambert3, Sharon D Chandler7, Mark J Daly5,3, Ashley Dumont3, Giulio Genovese3, Hai-Gwo Hwu6, Nan Laird8, Jack A Kosmicki5,3, Jennifer L Moran3, Cheryl Roe9, Tarjinder Singh5,3, Shi-Heng Wang10, Stephen V Faraone9, Stephen J Glatt9, Steven A McCarroll3,11, Ming Tsuang7, Benjamin M Neale12,13.
Abstract
Protein-coding de novo mutations (DNMs) are significant risk factors in many neurodevelopmental disorders, whereas schizophrenia (SCZ) risk associated with DNMs has thus far been shown to be modest. We analyzed DNMs from 1,695 SCZ-affected trios and 1,077 published SCZ-affected trios to better understand the contribution to SCZ risk. Among 2,772 SCZ probands, exome-wide DNM burden remained modest. Gene set analyses revealed that SCZ DNMs were significantly concentrated in genes that were highly expressed in the brain, that were under strong evolutionary constraint and/or overlapped with genes identified in other neurodevelopmental disorders. No single gene surpassed exome-wide significance; however, 16 genes were recurrently hit by protein-truncating DNMs, corresponding to a 3.15-fold higher rate than the mutation model expectation (permuted 95% confidence interval: 1-10 genes; permuted P = 3 × 10-5). Overall, DNMs explain a small fraction of SCZ risk, and larger samples are needed to identify individual risk genes, as coding variation across many genes confers risk for SCZ in the population.Entities:
Mesh:
Year: 2020 PMID: 31932770 PMCID: PMC7007385 DOI: 10.1038/s41593-019-0564-3
Source DB: PubMed Journal: Nat Neurosci ISSN: 1097-6256 Impact factor: 24.884
Figure 1:DNM rates (dots), 95% CI (error bars), and DNM model expectations (dotted lines) among SCZ and control probands. Figure 1A: Exome-wide DNM rates split by primary annotation, where “All DNM” encompasses protein-truncating variants (PTV), missense, and synonymous DNMs. Poisson rate p-values (all two-sided and unadjusted) and rate enrichment compared to controls are listed to the right of the DNM rate. DNM rates split by the absence (Figure 1B) or presence (Figure 1C) of an extant allele at the same position in the 104k non-psychiatric gnomAD cohort.
Figure 2:DNM rates (dots) and 95% CI (error bars) compared to DNM model expectations (dotted line) for exome-sequenced trios with various mental disorders. Figure 2A: Exome-wide PTV rate and enrichment relative to DNM model expectations. Figure 2B: Missense rate among evolutionarily constrained genes (defined here as the union of high pLI, missense constraint, and RVIS intolerant gene sets) and enrichment relative to DNM model expectations. Figure 2C: Exome-wide synonymous rate and enrichment relative to DNM model expectations. All tests are Poisson rate tests and information regarding the published studies analyzed is available in Online Methods.
Figure 3:Partitioning gene set enrichment among evolutionarily constrained and neurodevelopmental disorder gene sets. Proportion enrichment using a binomial test is evaluated among all DNMs in SCZ probands (n=2772) relative to controls (n=2216; green) and the DNM model (blue), with proportion enrichment (dots), 95% CI (error bars), and unadjusted two-sided p-values (above dots) displayed. Figure 3A: Gene set enrichment in three evolutionary constraint metrics (missense constraint, pLI, and RVIS), with partitioning among how often genes are present in each set. Figure 3B: Gene set enrichment for genes with observed DNM PTVs in intellectual disability, developmental delay, and ASD probands. Shared recurrent PTV genes have identified two or more PTVs in more than one disorder. Unique recurrent PTV genes have identified two or more PTVs in only one disorder. Single PTV genes are not recurrent within any disorder, although may be shared across disorders.
Partitioning DNM enrichment in top constraint and neurodevelopmental gene sets
| DNM annotation | SCZ DNM count | Control DNM count | SCZ-control enrichment | SCZ-control | DNM model enrichment | DNM model |
|---|---|---|---|---|---|---|
| All coding DNM | 220 | 118 | 1.40 | 2e-3 | 1.40 | 9e-7 |
| PTV | 38 | 7 | 3.73 | 3e-4 | 2.27 | 3e-6 |
| Missense | 135 | 77 | 1.30 | 0.05 | 1.31 | 2e-3 |
| Synonymous | 47 | 34 | 1.11 | 0.62 | 1.27 | 0.11 |
| All coding DNM | 29 | 9 | 2.42 | 0.02 | 1.98 | 8e-4 |
| PTV | 8 | 0 | NA | 0.02 | 4.61 | 4e-4 |
| Missense | 17 | 6 | 2.11 | 0.11 | 1.76 | 0.03 |
| Synonymous | 4 | 3 | 1.08 | 0.92 | 1.21 | 0.58 |
SCZ proband (n=2772) fold-enrichment and p-values (all two-sided and unadjusted) are from Poisson rate tests (Online methods).
Figure 4:Partitioning of the genes expressed in neuronal cell types by their interaction with mRNAs highly active in the synapse (e.g. FMRP, RBFOX, CELF4), with interacting genes listed as “potentially synaptic”. Also included are the results from BrainSpan highly expressed genes (bottom comparison). Proportion enrichment (dots) of each gene set is tested in SCZ probands (n=2772) against the mutation model using a one-sample binomial test, with p-values (all two-sided and unadjusted) listed above each dot along with the 95% CI (error bars).
Gene recurrence rates in SCZ probands and controls
| DNM | Observed recurrent genes | Expected recurrent genes | Fold-enrichment | Empirical | |
|---|---|---|---|---|---|
| PTV | 296 | 16 | 5.1 | 3.15 | 3e-5 |
| Missense | 1763 | 150 | 136.6 | 1.10 | 0.09 |
| Synonymous | 635 | 29 | 21.2 | 1.37 | 0.05 |
| All DNM | 2694 | 314 | 286.2 | 1.10 | 0.02 |
| PTV | 212 | 4 | 2.7 | 1.51 | 0.27 |
| Missense | 1320 | 89 | 81.2 | 1.10 | 0.17 |
| Synonymous | 512 | 18 | 14.1 | 1.28 | 0.16 |
| All DNM | 2044 | 195 | 177.4 | 1.10 | 0.06 |
Fold-enrichment and empirical p-values (all one-sided and unadjusted) are from permutations using DNM model rates (Online methods).
Genes recurrently hit by PTV DNMs
| Gene symbol | Coding bases (canonical) | pLI | PTV DNM Poisson | PTV DNM | Missense DNM | PTV transmitted | PTV non-transmitted | Damaging Missense transmitted | Damaging Missense non-transmitted |
|---|---|---|---|---|---|---|---|---|---|
| 5124 | 1 | 7.77E-06 | 3 | 0 | 0 | 0 | 1 | 4 | |
| 714 | 0.56 | 1.27E-05 | 2 | 0 | 0 | 0 | 2 | 0 | |
| 375 | 0.08 | 2.43E-05 | 2 | 0 | 0 | 0 | 0 | 0 | |
| 1182 | 0 | 3.61E-05 | 2 | 0 | 1 | 0 | 0 | 0 | |
| 2052 | 0.34 | 1.50E-04 | 2 | 0 | 1 | 0 | 1 | 2 | |
| 3186 | 1 | 2.12E-04 | 2 | 0 | 0 | 0 | 6 | 6 | |
| 3645 | 1 | 6.03E-04 | 2 | 0 | 0 | 0 | 0 | 1 | |
| 4785 | 1 | 8.32E-04 | 2 | 0 | 0 | 0 | 3 | 3 | |
| 7221 | 0.84 | 1.04E-03 | 2 | 0 | 0 | 0 | 2 | 4 | |
| 9771 | 0 | 1.55E-03 | 2 | 1 | 0 | 3 | 4 | 3 | |
| 7746 | 1 | 3.04E-03 | 2 | 1 | 0 | 0 | 1 | 3 | |
| 9294 | 1 | 3.11E-03 | 2 | 0 | 4 | 0 | 11 | 2 | |
| 13461 | 0 | 5.07E-03 | 2 | 0 | 6 | 1 | 8 | 7 | |
| 15018 | 0.72 | 7.70E-03 | 2 | 1 | 1 | 0 | 14 | 12 | |
| 14736 | 1 | 8.13E-03 | 2 | 2 | 0 | 0 | 6 | 7 | |
| 107976 | 0 | 1.83E-01 | 2 | 3 | 14 | 8 | 59 | 58 |
Single gene p-values (all one-sided and unadjusted) are from a Poisson rate test of PTV counts against the DNM model (Online methods).
DNM validation counts
| Taiwanese cohort | Trios | Putative called DNMs | DNMs submitted for validation | Validated DNMs | Final DNM count |
|---|---|---|---|---|---|
| Agilent wave 1 | 575 | 1060 | 1060 | 586 (55.3%) | 586 |
| Agilent wave 2 | 532 | 628 | 628 | 517 (82.3%) | 517 |
| Nextera wave 3 | 588 | 653 | 68 | 59 (86.8%) | 644 |