| Literature DB >> 29262854 |
Hoang T Nguyen1, Julien Bryois2, April Kim3,4, Amanda Dobbyn5,6, Laura M Huckins5, Ana B Munoz-Manchado7, Douglas M Ruderfer8, Giulio Genovese3,9, Menachem Fromer10, Xinyi Xu11, Dalila Pinto5,11,12,13, Sten Linnarsson7, Matthijs Verhage14, August B Smit15, Jens Hjerling-Leffler7, Joseph D Buxbaum11, Christina Hultman2, Pamela Sklar5, Shaun M Purcell5,16, Kasper Lage3,4, Xin He17, Patrick F Sullivan2,18, Eli A Stahl19,20.
Abstract
BACKGROUND: Integrating rare variation from trio family and case-control studies has successfully implicated specific genes contributing to risk of neurodevelopmental disorders (NDDs) including autism spectrum disorders (ASD), intellectual disability (ID), developmental disorders (DDs), and epilepsy (EPI). For schizophrenia (SCZ), however, while sets of genes have been implicated through the study of rare variation, only two risk genes have been identified.Entities:
Keywords: Autism; De novo mutations; Developmental disorders; Epilepsy; Hierarchical model; Intellectual disability; Rare variants; Schizophrenia
Mesh:
Year: 2017 PMID: 29262854 PMCID: PMC5738153 DOI: 10.1186/s13073-017-0497-y
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Observed false discovery rates (oFDRs) and theoretical FDR with different combinations between and . Each panel is for one π value. For example, the top left panel shows oFDR and FDR for π=0.02. FDR false discovery rate, dn de novo, FDR false discovery rate, oFDR observed FDR, RR relative risk
Estimated parameters of proportions of risk genes (pi) and mean relative risk (meanRR) for DN and CC SCZ data and four other NDDs: ID, EPI, ASD and DD
| Parameter | Estimated | Lower credible | Upper credible |
|---|---|---|---|
| mode | interval boundary | interval boundary | |
| SCZ_pi (%) | 8.01 | 4.59 | 12.9 |
| SCZ_meanRR_silentFCPk_denovo | 1.22 | 1.00 | 2.16 |
| SCZ_meanRR_MiD_denovo | 1.44 | 1.00 | 3.16 |
| SCZ_meanRR_LoF_denovo | 12.25 | 4.79 | 22.22 |
| SCZ_meanRR_MiD+LoF_CCpop1 | 2.09 | 1.04 | 3.54 |
| SCZ_meanRR_MiD+LoF_CCpop2 | 2.44 | 1.05 | 5.73 |
| SCZ_meanRR_MiD+LoF_CCpop3 | 1.04 | 1 | 1.19 |
| ASD_pi (%) | 4.44 | 3.15 | 5.94 |
| ASD_meanRR_MiDdenovo | 3.71 | 2.06 | 8.71 |
| ASD_meanRR_LoFdenovo | 24.56 | 14.27 | 37.44 |
| ASD_meanRR_LoFcc | 4.04 | 2.08 | 8.24 |
| ID_pi (%) | 2.53 | 1.89 | 3.43 |
| ID_meanRR_MiDdenovo | 29.82 | 18.86 | 46.1 |
| ID_meanRR_LoFdenovo | 105.45 | 73.27 | 143.29 |
| DD_pi (%) | 2.84 | 2.29 | 3.45 |
| DD_meanRR_MiDdenovo | 23.42 | 13.97 | 33.97 |
| DD_meanRR_LoFdenovo | 88.32 | 67.54 | 115.09 |
| EPI_pi (%) | 1.14 | 0.52 | 2.1 |
| EPI_meanRR_MiDdenovo | 72.2 | 35.39 | 128.46 |
| EPI_meanRR_LoFdenovo | 89.71 | 45.31 | 169.43 |
These results were obtained by sampling three MCMC chains (20,000 times for each chain). These results are for three categories: loss of function (LoF) variants/mutations, missense damaging (MiD) variants/mutations, and silent within frontal cortex-derived DHS peaks (silentFCPk) variants.
ASD autism spectrum disorders, CC case–control, DD developmental disorder, DN de novo, EPI epilepsy, ID intellectual disability, LoF loss of function, MCMC Markov chain Monte Carlo, MiD missense damaging, NDD neurodevelopmental disorder, SCZ schizophrenia, silentFCPk silent within frontal cortex-derived DHS peaks
Fig. 2The densities of the proportion of risk genes (x-axis) and mean relative risk (y-axis) for SCZ data. These were obtained after 20,000 iterations of three MCMC chains. The first two case–control populations are derived from the Sweden data set while the third case–control population is the UK population. The scales on the y-axes are adjusted because mean relative risk varies between categories. LoF loss of function, MCMC Markov chain Monte Carlo, MiD missense damaging, Pop population, SCZ schizophrenia, silentFCPk, silent within frontal cortex-derived DNase I hypersensitive site peaks
Fig. 3The densities of the proportion of risk genes (x-axis) and mean relative risk (y-axis) for ASD, EPI, ID, and DD data. These were obtained after 20,000 iterations of three MCMC chains. For ASD, there are two de novo classes and one case–control class. For other disorders, only two de novo classes are publicly available for our current study. The scales on the y-axes are adjusted because mean relative risk varies between categories and disorders. ASD autism spectrum disorders, DD developmental disorder, EPI epilepsy, ID intellectual disability, LoF loss of function, MCMC Markov chain Monte Carlo, MiD missense damaging
Fig. 4GeNets network analysis for developmental disorder significant genes (p<2×10−3). These are 93 genome-wide significant genes from [70] and 98 significant genes (FDR < 0.05 from extTADA) not in the 93 genes. Triangular shapes are the 98 novel genes from extTADA. FDR false discovery rate
Fig. 5Number of risk genes for different sample sizes based on the genetic architecture predicted by extTADA. Case–control number is only for cases (or controls); therefore, if case–control number = 10,000, this means cases + controls = 20,000. The numbers in brackets show risk-gene numbers if we use only case–control data or only de novo mutation data
Enrichment of known gene sets from extTADA results for schizophrenia
| Gene set | Gene number | Overlapping gene number |
| FDR |
|---|---|---|---|---|
| Constrained | 1003 | 939 | 3.3e-06 | 6.2e-04 |
| pLI09 | 3488 | 3241 | 1.0e-05 | 8.2e-04 |
| rbfox2 | 3068 | 2895 | 1.3e-05 | 8.2e-04 |
| chd8.human_brain | 2798 | 2601 | 5.0e-05 | 2.3e-03 |
| rbfox13 | 3445 | 3230 | 1.7e-04 | 6.3e-03 |
| FMRP_targets | 839 | 792 | 2.1e-04 | 6.5e-03 |
| celf4 | 2675 | 2468 | 2.7e-04 | 7.1e-03 |
| Module.M3 | 162 | 145 | 5.6e-04 | 1.3e-02 |
| DD.allDenovoMiDandLoF | 1271 | 1271 | 7.0e-04 | 1.4e-02 |
| ARC | 28 | 25 | 1.0e-03 | 1.8e-02 |
| NMDAR_network | 61 | 58 | 1.5e-03 | 2.3e-02 |
| abnormal_emotionORaffect_behavior | 392 | 363 | 1.5e-03 | 2.3e-02 |
| AlleleBiasedExpression.Neuron | 802 | 619 | 1.9e-03 | 2.6e-02 |
| Module.M13 | 149 | 129 | 2.0e-03 | 2.6e-02 |
| abnormal_cued_conditioning_behavior | 74 | 67 | 2.5e-03 | 2.9e-02 |
| mGluR5 | 39 | 36 | 2.4e-03 | 2.9e-02 |
| abnormal_sensory_capabilitiesORreflexesORnociception | 607 | 579 | 4.5e-03 | 4.9e-02 |
| mir137 | 3260 | 2940 | 7.0e-03 | 6.5e-02 |
| abnormal_behavior | 2037 | 1937 | 7.0e-03 | 6.5e-02 |
| Pardinas2017_extTable9 | 534 | 522 | 7.0e-03 | 6.5e-02 |
| PSD-95_(core) | 65 | 57 | 8.0e-03 | 6.7e-02 |
| abnormal_excitatory_postsynaptic_currents | 73 | 67 | 8.0e-03 | 6.7e-02 |
| list.EPI.43genes.2017.Epi4K.2017 | 43 | 38 | 9.2e-03 | 7.2e-02 |
| abnormal_socialORconspecific_interaction | 257 | 238 | 9.4e-03 | 7.2e-02 |
| abnormal_associative_learning | 204 | 190 | 1.5e-02 | 1.1e-01 |
| abnormal_social_investigation | 64 | 54 | 1.8e-02 | 1.2e-01 |
| Module.M1 | 1244 | 1071 | 1.8e-02 | 1.2e-01 |
| synaptome | 1887 | 1816 | 1.9e-02 | 1.3e-01 |
| abnormal_motor_capabilitiesORcoordinationORmovement | 1398 | 1326 | 2.0e-02 | 1.3e-01 |
| CYFIP1_all | 37 | 34 | 2.1e-02 | 1.3e-01 |
| abnormal_fearORanxiety-related_behavior | 232 | 213 | 2.3e-02 | 1.4e-01 |
| abnormal_behavioral_response_to_xenobiotic | 219 | 208 | 3.0e-02 | 1.7e-01 |
| abnormal_learningORmemoryORconditioning | 449 | 414 | 3.1e-02 | 1.7e-01 |
| abnormal_brain_size | 193 | 180 | 3.6e-02 | 1.8e-01 |
| abnormal_contextual_conditioning_behavior | 95 | 88 | 3.4e-02 | 1.8e-01 |
| abnormal_excitatory_postsynaptic_potential | 64 | 58 | 3.5e-02 | 1.8e-01 |
| abnormal_aggression-related_behavior | 69 | 62 | 3.7e-02 | 1.8e-01 |
| Module.M2 | 38 | 35 | 4.1e-02 | 2.0e-01 |
| abnormal_discrimination_learning | 21 | 20 | 4.3e-02 | 2.0e-01 |
These p values were obtained from 10,000,000 simulations, and then adjusted using the method of [58]. The information for these gene sets is summarized in Additional file 1: Table S2. The second column (Gene number) shows the number of genes in the gene set. The third column shows the number of overlapping genes between the gene sets and the 19,358 genes used by extTADA.
FDR false discovery rate
Fig. 6Comparing between five conditions. Top left: Overlaps of top significant genes (FDR < 0.3). Top right: Correlations of posterior probabilities (PPs) between SCZ, ASD, DD, ID, and EPI (all p values <0.0001). These results are calculated using PPs from extTADA. Bottom: Overlaps of significant gene sets in SCZ, ASD, EPI, DD, and ID. These results are for 185 and 1,879 gene sets, respectively. ASD autism spectrum disorders, DD developmental disorder, EPI epilepsy, FDR false discovery rate, ID intellectual disability, PP posterior probability, SCZ schizophrenia