| Literature DB >> 32565911 |
Hao-Chih Lee1,2, Osamu Ichikawa1,2,3, Benjamin S Glicksberg1,2,4, Aparna A Divaraniya1,2, Christine E Becker1,2, Pankaj Agarwal5, Joel T Dudley1,2.
Abstract
BACKGROUND: Mapping disease-associated genetic variants to complex disease pathophysiology is a major challenge in translating findings from genome-wide association studies into novel therapeutic opportunities. The difficulty lies in our limited understanding of how phenotypic traits arise from non-coding genetic variants in highly organized biological systems with heterogeneous gene expression across cells and tissues.Entities:
Keywords: Gene candidate discovery; Genome-wide association study; Network biology
Year: 2020 PMID: 32565911 PMCID: PMC7301559 DOI: 10.1186/s13040-020-00216-9
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Fig. 1Schematic diagram of GWAS component analysis. a We designed a two-stage method to map SNP associations to component associations. Associated GWAS components were integrated with Bayesian networks to facilitate therapeutic discovery. GWAS component is a component with a score significantly deviating from 0. b In the first stage, S-PrediXcan was used to infer gene associations from SNP associations. Gene-to-Component model were fitted by WGCNA, and the corresponding associations were computed as in Eq. (2) (see text). Models in B) were built using GTEx reference data
Gene pairs co-expressed in multiple tissues and genetically conserved gene pairs. P-values report the difference in ratios compared to the one estimated from gene pairs found in one tissue (bottom row)
| # tissues detected | # WGCNA edge (n1) | # genetically conserved edges (n2) | ratio (n2 / n1) | |
|---|---|---|---|---|
| 14 | 1 | 0 | 0 | 0.548 |
| 13 | 1 | 1 | 1 | 8.78 × 10−23 |
| 12 | 7 | 2 | 0.286 | 3.58 × 10−13 |
| 11 | 9 | 5 | 0.556 | 1.68 × 10−58 |
| 10 | 12 | 4 | 0.333 | 1.75 × 10−28 |
| 9 | 19 | 8 | 0.420 | 2.06 × 10− 69 |
| 8 | 28 | 15 | 0.536 | 6.96 × 10− 163 |
| 7 | 33 | 14 | 0.424 | 6.65 × 10− 120 |
| 6 | 43 | 14 | 0.326 | 3.58 × 10− 91 |
| 5 | 75 | 20 | 0.267 | 4.62 × 10− 104 |
| 4 | 180 | 45 | 0.25 | 1.18 × 10− 209 |
| 3 | 495 | 87 | 0.176 | 2.21 × 10− 261 |
| 2 | 3344 | 218 | 0.0652 | 1.96 × 10− 170 |
| 1 | 78,656 | 817 | 0.0104 | N/A |
Fig. 2Connected components formed by gene-pairs co-expressed in more than 2 tissues have well-defined biological functions, such as a ribosome complex, b electron transfer chain, c mini-chromosome maintenance and d Golgi vesicle transport
Fig. 3Component associations estimated using GWAS summary (x-axis) are consistent with estimated using individual-level data (y-axis). Data were simulated as a trait that can be attributed to a randomly chosen eigen-gene component with 10% (left), 5% (middle) and 2% (right) heritability. Red squares indicate the z-score of the causal component. We confirmed by Kolmogorov–Smirnov test that the z-scores from non-causal components (blue points) followed a normal distribution (p = 0.1691, 0.5393 and 0.2542 for 10, 5 and 2% (right) heritability, respectively)
Fig. 4GWAS components select cell-specific signatures. Cell type enrichment were used to determine cell types from genes selected by GWAS component analysis (a) and S-Predixcan (b). Colors indicate –log10(p-value). Results were generated by CTen web server
In silico validation of gene candidates for four disease phenotypes
| AMD | CD | UC | RA | |
|---|---|---|---|---|
| LINCS hits | 0.059 (3/51) | 0.143 (2/14) | 0.146 (6/41) | 0.326 (17/52) |
| MGI hits | 0.311 (42/135) | 0.326 (15/46) | 0.361 (35/97) | 0.493 (73/148) |
| LINCS hits | 0.093 (5/54) | 0.243 (18/74) | 0.121 (7/58) | 0.237 (22/93) |
| MGI hits | 0.227 (29/128) | 0.367 (50/136) | 0.269 (28/104) | 0.277 (57/206) |
| LINCS hits | −0.034 | −0.1 | 0.025 | 0.089 |
| MGI hits | 0.084 | − 0.047 | 0.092 | 0.216*** |
Numerators represent hits, and denominators represent the number of genes retrieved by GWAS components + BNs or S-PrediXcan. *** indicates p < 0.001
Gene candidates with known indications. Results are queried from DrugBank
| Target | Indication | Drug |
|---|---|---|
| ALOX5 | UC | Mesalazine |
| SLOC1A2 | RA | Hydrocortisone, Ibuprofen, Indomethacin |
| FCGR3A | RA | Etanercept |
| C1QA | RA | Etanercept |
| TNF | UC | Infliximab |
| TNF | RA | Chloroquine, Etanercept, Infliximab |
Fig. 5Downstream genes of selected gene candidates on the Bayesian networks: a HCK in artery aorta, b C1QA in adrenal gland, and c PDGFRA in adrenal gland. For simplicity, genes more than three steps away from a gene candidate were excluded