| Literature DB >> 30988527 |
Quan Wang1,2, Rui Chen1,2, Feixiong Cheng3,4,5, Qiang Wei1,2, Ying Ji1,2, Hai Yang1,2, Xue Zhong2,6, Ran Tao2,7, Zhexing Wen8, James S Sutcliffe1,2, Chunyu Liu9, Edwin H Cook10, Nancy J Cox2,6, Bingshan Li11,12.
Abstract
Genome-wide association studies (GWAS) have identified more than 100 schizophrenia (SCZ)-associated loci, but using these findings to illuminate disease biology remains a challenge. Here we present integrative risk gene selector (iRIGS), a Bayesian framework that integrates multi-omics data and gene networks to infer risk genes in GWAS loci. By applying iRIGS to SCZ GWAS data, we predicted a set of high-confidence risk genes, most of which are not the nearest genes to the GWAS index variants. High-confidence risk genes account for a significantly enriched heritability, as estimated by stratified linkage disequilibrium score regression. Moreover, high-confidence risk genes are predominantly expressed in brain tissues, especially prenatally, and are enriched for targets of approved drugs, suggesting opportunities to reposition existing drugs for SCZ. Thus, iRIGS can leverage accumulating functional genomics and GWAS data to advance our understanding of SCZ etiology and potential therapeutics.Entities:
Mesh:
Year: 2019 PMID: 30988527 PMCID: PMC6646046 DOI: 10.1038/s41593-019-0382-7
Source DB: PubMed Journal: Nat Neurosci ISSN: 1097-6256 Impact factor: 24.884
Figure 1.A schematic illustration of the iRIGS framework.
Each circle represents a candidate gene, and candidate genes from a GWAS locus are arranged horizontally. Candidate genes from different GWAS loci are piled up vertically. In the middle of the figure the L-1 loci have already been sampled, and for the L-th locus the colors of the genes represent the strength of the support from genomic features as well as the closeness to the L-1 sampled risk genes in the network space. After the sampling converges, the candidate gene with the highest PP at each locus is denoted as the inferred risk gene.
Enrichment of network-derived risk genes (NRGs) and high-confidence risk genes (HRGs) in gene sets implicated in SCZ.
| Gene set[ | NRG vs LBG | HRG vs WBG | HRG vs LBG | |||
|---|---|---|---|---|---|---|
| OR | OR | OR | ||||
| AutDB (781) | 1.23×10−8 | 10.75 (18) | 2.87×10−14 | 9.04 (27) | 4.20×10−16 | 18.22 |
| ECG (998) | 1.60×10−4 | 4.63 (17) | 3.21×10−16 | 8.85 (32) | 9.69×10−15 | 10.65 |
| Essential genes (3910) | 4.19×10−11 | 4.91 (48) | 9.23×10−8 | 3.35 (46) | 3.00×10−9 | 4.25 |
| FMRP-Darnell (832) | 1 | 1.85 (9) | 3.28×10−9 | 6.42 (22) | 5.95×10−8 | 6.86 |
| RBFOX1 (556) | 0.11 | 3.60 (8) | 4.98×10−4 | 4.71 (12) | 2.36×10−5 | 9.10 |
| miR-137 targets (281) | 4.24×10−5 | 9.79 (11) | 3.29 ×10−5 | 7.82 (10) | 5.69×10−5 | 11.18 |
| PSD (1444) | 4.03×10−5 | 4.52 (20) | 2.21×10−3 | 2.94 (19) | 8.74×10−5 | 4.42 |
| FMRP-Ascano (939) | 0.41 | 2.38 (10) | 1.55×10−3 | 3.51 (15) | 4.30×10−3 | 3.61 |
| CCS (73) | 1 | 2.03 (1) | 6.57×10−4 | 14.94 (5) | 4.38×10−3 | 21.34 |
| PRAZ (209) | 1 | 2.04 (2) | 1.82×10−3 | 7.14 (7) | 4.78×10−3 | 8.69 |
| mGluR5 (37) | 1 | 0 (0) | 0.02 | 17.60 (3) | 0.08 | 25.13 |
| PRP (336) | 1 | 2.76 (4) | 0.52 | 3.03 (5) | 1 | 2.48 |
| TADA (179) | 1 | 4.10 (2) | 1 | 2.22 (2) | 1 | 3.32 |
| ARC (25) | 1 | Inf (1) | 1 | 8.16 (1) | 1 | Inf |
| PSD-95 (107) | 1 | 8.20 (2) | 1 | 3.75 (2) | 1 | 8.31 |
| NMDAR (59) | 0.60 | 16.39 (2) | 1 | 3.37 (1) | 1 | 8.24 |
| SYV(107) | 1 | 2.73 (2) | 1 | 1.84 (1) | 1 | 2.06 |
| GABAA (18) | 1 | 0 (0) | 1 | 0 (0) | 1 | 0 |
In brackets are the numbers of genes in the corresponding gene sets. One-sided Fisher’s exact test and Bonferroni correction were used for enrichment analyses. Please refer to Methods for details of gene set abbreviations. Abbreviations: AutDB (autism genes from database AutDB), ECG (evolutionarily constrained genes), FMRP-Darnell (the fragile X mental retardation protein targets from), PSD (postsynaptic density genes), FMRP-Ascano (the fragile X mental retardation protein targets from), RBFOX1 (targets of RNA Binding Protein, Fox-1 Homolog 1), miR-137 targets (microRNA 137 targets), PRAZ (genes related to presynaptic active zone), CCS (calcium channel and signaling genes), mGluR5 (metabotropic glutamate receptor 5 complex), PRP (genes related to presynaptic proteins), ARC (neuronal activity-regulated cytoskeleton-associated proteins), PSD-95 (postsynaptic density protein 95 complex), TADA (genes from transmission and de novo association tests), NMDAR (N-methyl-D-aspartate receptor network genes), SYV (genes related to synaptic vesicles), GABAA (neurotransmitter gamma-aminobutyric acid receptors), NRG (network-derived risk gene), HRG (high-confident risk gene), LBG (local background gene), WBG (whole-genome background gene), OR (odds ratio).
Figure 2.Discovery of genomic features characteristic of SCZ risk genes.
Panel a) shows that network-derived risk genes (NRGs) are more likely to be differentially expressed (DE) compared to local background genes (LBGs). We directly used the P values of DE from the CommondMind Consortium to perform the comparison (one-sided Wilcoxon rank sum test, n = 99 and 562 for NRGs and LBGs respectively). Panel b) shows that NRGs capture more distal regulatory element (DRE)-promoter links based on the data from capture Hi-C, FANTOM5, and brain specific Hi-C (one-sided Wilcoxon rank sum test; for capture Hi-C and FANTOM5, n = 104 and 842 for NRGs and LBGs respectively; for brain specific Hi-C, n = 104 and 831 for NRGs and LBGs respectively). See main text and Supplementary Note for details. The box plots show median and the 25th and 75th percentiles. The whiskers extend from the box to the largest and smallest values no further than 1.5 * IQR from the box (where IQR is the inter-quartile range, or distance between the 25th and 75th percentiles).
Figure 3.Characteristics of predicted risk genes.
a) The distributions of the PPs of high-confidence risk genes (HRGs) and network-derived risk genes (NRGs) showed that HRGs carry significantly higher sampling posterior probabilities (PPs) than NRGs (one-sided Wilcoxon rank sum test, n = 107 for both HRGs and NRGs). The x-axis represents the ratio of maximum and median of PPs of candidate genes for each GWAS loci. b) Stratified LDSC to evaluate the enrichment of SCZ heritability explained by different groups of genes. The center values represent the enrichment and the error bars indicate the standard errors. c) The tissue-specificity of HRGs across tissues in GTEx showed that HRGs are highly expressed in brain-related tissues (one-sided Wilcoxon rank sum test and Bonferroni correction, n = 104 and 830 for NRGs and LBGs respectively). d) The expression of HRGs, the 65 non-nearest HRGs, the corresponding 65 nearest non-HRG genes and LBGs across developmental stages based on the BrainSpan data showed that HRGs and non-nearest HRGs are highly expressed at prenatal stages compared to postnatal stages, while the 65 corresponding nearest non-HRG genes and LBGs are not differentially expressed across developmental stages (one-sided Wilcoxon rank sum test using medians of expression at prenatal (n = 3) and postnatal (n = 4) stages). It also showed that HRGs have higher expression in brains than LBGs, consistent with the observation in c) that were based on GTEx data. The error bar plot shows the median and the 25th and 75th percentiles.
Selected high-confidence risk genes (HRGs) involved in biological functions implicated in SCZ.
| Gene | Descriptions | Nearest[ | Reference |
|---|---|---|---|
| Encodes an alpha-1 subunit of a voltage-dependent calcium channel and a target of miR-137 | Yes | ||
| A member of the voltage-gated calcium channel superfamily | No | ||
| Involved in calcium-induced regulation of ion channels; interacts with | No | ||
| An transcriptional factor essential for neurogenesis | Yes | ||
| Essential for cognitive development and is involved in long-term plasticity processes | Yes | ||
| An ionotropic glutamate receptor that mediates fast synaptic transmission | No | ||
| A glutamate-gated ion channel protein and a key mediator of synaptic plasticity; also a target of miR-137 | Yes | ||
| Encodes glutamate metabotropic receptor 3, one of the major excitatory neurotransmitters in central nervous system (CNS); has been extensively explored as a potential drug target in SCZ | Yes | ||
| Involved in neuronal plasticity and probably synapse formation; has been previously shown associating with the severity of depression in SCZ patients | Yes | ||
| Involved in the formation and remodeling of CNS synapses; knockdown of it directly impacts neurodevelopment process indicating a role in the molecular pathophysiology of psychiatric diseases, including ASD and SCZ | Yes | ||
| Encodes transcription factor 4 and involved in the initiation of neuronal differentiation | Yes | ||
| A zinc finger binding protein implicated in SCZ previously | Yes | ||
| Encodes a ligand-dependent transcriptional regulator; a potential ASD gene | Yes | ||
| A target of miR-137 and a potential SCZ gene | Yes | ||
Well-established SCZ genes.
Pontential SCZ genes of great interest predicted by iRIGS.
Nearest to the index SNPs or not.