| Literature DB >> 32241259 |
Astrid M Manuel1, Yulin Dai1, Leorah A Freeman2, Peilin Jia3, Zhongming Zhao4,5,6.
Abstract
BACKGROUND: Multiple sclerosis (MS) is a complex disease in which the immune system attacks the central nervous system. The molecular mechanisms contributing to the etiology of MS remain poorly understood. Genome-wide association studies (GWAS) of MS have identified a small number of genetic loci significant at the genome level, but they are mainly non-coding variants. Network-assisted analysis may help better interpret the functional roles of the variants with association signals and potential translational medicine application. The Dense Module Searching of GWAS tool (dmGWAS version 2.4) developed in our team is applied to 2 MS GWAS datasets (GeneMSA and IMSGC GWAS) using the human protein interactome as the reference network. A dual evaluation strategy is used to generate results with reproducibility.Entities:
Keywords: Drug target; GWAS; Gene set enrichment analysis; Multiple sclerosis; Network module; dmGWAS
Mesh:
Substances:
Year: 2020 PMID: 32241259 PMCID: PMC7118851 DOI: 10.1186/s12920-020-0674-5
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Pipeline for dmGWAS implementation and gene network assessment. Details are provided in the Materials and methods section
Fig. 2Top modules associated to GeneMSA GWAS and IMSGC GWAS. The top module of GeneMSA GWAS displays HDAC1 as central (a). The top 10 modules of GeneMSA were merged (b). The top module of IMSGC GWAS shows JAK2 as central (c). The top 10 modules of IMSGC were merged (d). Large nodes represent central genes and small nodes represent less central genes (based on measure of Betweenness). Colors of nodes represent Pascal reported gene-level p-values as specified by legend
Fig. 3Significant modules of dual evaluation regarding IMSGC GWAS as discovery set. The top module of dual evaluation shows MAPK1 as central (a). Modules are regarded as significant based on normalized evaluation and discovery z-scores. Significant modules are represented in red (b). The 20 significant modules were merged to show overall gene network (n = 56) (c). Large nodes represent central genes and small nodes represent less central genes (based on measure of Betweenness). Colors of nodes represent Pascal reported gene-level p-values as specified by legend
Gene set enrichment analysis of the modules from dual evaluationa
| GO term | # contributing genes/ term sizeb | Contributing genesc | adj. | |
|---|---|---|---|---|
| Molecular Function | ||||
| Phosphotyrosine residue binding | 3/18 | 1.916 × 10− 5 | 1.674 × 10− 3 | |
| Phosphatidylinositol-4,5-bisphosphate 3-kinase activity | 4/62 | 3.318 × 10− 5 | 1.674 × 10− 3 | |
| Virus receptor activity | 4/71 | 5.667 × 10−5 | 1.674 × 10−3 | |
| Phosphatidylinositol bisphosphate kinase activity | 4/72 | 5.987 × 10−5 | 1.674 × 10−3 | |
| Protein phosphorylated amino acid binding | 3/28 | 7.533 × 10−5 | 1.674 × 10−3 | |
| Biological Process | ||||
| T cell costimulation | 7/82 | 3.704 × 10−9 | 2.113 × 10−6 | |
| Lymphocyte costimulation | 7/83 | 4.037 × 10−9 | 2.113 × 10−6 | |
| JAK-STAT cascade involved in growth hormone signaling pathway | 4/15 | 8.415 × 10−8 | 2.937 × 10−5 | |
| Growth hormone receptor signaling pathway | 4/24 | 6.425 × 10−7 | 1.598 × 10− 4 | |
| Cellular response to growth hormone stimulus | 4/25 | 7.633 × 10−7 | 1.598 × 10−4 | |
| Cellular Component | ||||
| Nuclear pore | 5/84 | 4.311 × 10−6 | 3.794 × 10−4 | |
| Clathrin-coated pit | 4/69 | 4.666 × 10−5 | 2.053 × 10−3 | |
| Endoplasmic reticulum tubular network | 2/12 | 5.296 × 10−4 | 0.0136 | |
| Platelet alpha granule membrane | 2/13 | 6.248 × 10−4 | 0.0136 | |
| Cytosolic large ribosomal subunit | 3/67 | 9.556 × 10−4 | 0.0136 | |
aIn this dual evaluation, IMSGC GWAS was the discovery set and GeneMSA was the evaluation set
bContributing genes: the number of genes in the input gene set. Term size: the total number of genes in the corresponding GO term
cContributing genes: those in the input genes that contributed to the enrichment
dAdjusted p-value by the Benjamini-Hochberg method [28]
MS FDA drug target genes within top modules
| Drug name | Commercial product name | Target gene | Occurrence in top modules | GeneMSA gene-level | IMSGC gene-level |
|---|---|---|---|---|---|
| Gilenya | Central in top module of GeneMSA, present in 75/75 top 1 % GeneMSA modules | 0.263 | 0.249 | ||
| Ocrevus | Present in top modules of IMSGC GWAS and top module of dual evaluation, present in 76/76 top 1 % IMSGC modules | 0.212 | 4.37 × 10−9 | ||
| Tecfidera | Present in 1/76 IMSGC top one present modules | 0.172 | 2.22 × 10−6 | ||
| Present in 3/75 top 1 % GeneMSA modules and 7/76 IMSGC top 1 % modules | 0.084 | 1.84 × 10−4 |
aDimethyl fumarate has two target genes from module gene list (KEAP1 and RELA)