| Literature DB >> 31174177 |
Daniel E Carlin1, Samson H Fong2, Yue Qin3, Tongqiu Jia4, Justin K Huang5, Bokan Bao5, Chao Zhang5, Trey Ideker6.
Abstract
We present an accessible, fast, and customizable network propagation system for pathway boosting and interpretation of genome-wide association studies. This system-NAGA (Network Assisted Genomic Association)-taps the NDEx biological network resource to gain access to thousands of protein networks and select those most relevant and performative for a specific association study. The method works efficiently, completing genome-wide analysis in under 5 minutes on a modern laptop computer. We show that NAGA recovers many known disease genes from analysis of schizophrenia genetic data, and it substantially boosts associations with previously unappreciated genes such as amyloid beta precursor. On this and seven other gene-disease association tasks, NAGA outperforms conventional approaches in recovery of known disease genes and replicability of results. Protein interactions associated with disease are visualized and annotated in Cytoscape, which, in addition to standard programmatic interfaces, allows for downstream analysis.Entities:
Keywords: Bioinformatics; Biological Sciences; Genomics
Year: 2019 PMID: 31174177 PMCID: PMC6554232 DOI: 10.1016/j.isci.2019.05.025
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1NAGA Workflow
Red steps are upstream of the method; blue steps are provided by the NAGA python package.
Figure 2AUROC Results against Gold-Standard Disease Genes
Area under the receiver-operator curve (AUROC) for three different network GWAS methods, using the gene network shown in parentheses for (A) Schizophrenia, (B) Bipolar Disorder, (C) Type 1 Diabetes, (D) Type 2 Diabetes, (E) Hypertension, (F) Coronary Artery Disease, (G) Crohn's Disease, and (H) Rheumatoid Arthritis.
(I) Runtime for the methods.
Figure 3Application of NAGA to Schizophrenia
(A) Top 100 prioritized genes after network propagation of a schizophrenia GWAS dataset. Genes in the gold standard are represented by turquoise bars, whereas newly implicated genes are represented by red bars.
(B) Subnetwork associated with hottest network propagation scores. Subnetwork is visualized with the initial association scores mapped to node colors, with darker red corresponding to stronger association. Previously implicated schizophrenia genes appear as squares, and newly implicated genes appear as circles.
(C) Integrated Genomics Viewer (IGV) screenshot showing the genomic locus of APP, the second highest scoring gene from (A). IGV displays the log 10 p-value of association. APP contains SNPs that, before network propagation, achieve nominal but not global statistical significance of association.