| Literature DB >> 35401145 |
Liang-Yong Xia1, Lihong Tang1, Hui Huang1, Jie Luo1.
Abstract
Alzheimer's disease (AD) is one of the most common neurodegenerative diseases. To identify AD-related genes from transcriptomics and help to develop new drugs to treat AD. In this study, firstly, we obtained differentially expressed genes (DEG)-enriched coexpression networks between AD and normal samples in multiple transcriptomics datasets by weighted gene co-expression network analysis (WGCNA). Then, a convergent genomic approach (CFG) integrating multiple AD-related evidence was used to prioritize potential genes from DEG-enriched modules. Subsequently, we identified candidate genes in the potential genes list. Lastly, we combined deepDTnet and SAveRUNNER to predict interaction among candidate genes, drug and AD. Experiments on five datasets show that the CFG score of GJA1 is the highest among all potential driver genes of AD. Moreover, we found GJA1 interacts with AD from target-drugs-diseases network prediction. Therefore, candidate gene GJA1 is the most likely to be target of AD. In summary, identification of AD-related genes contributes to the understanding of AD pathophysiology and the development of new drugs.Entities:
Keywords: Alzheimer's disease; deep learning; drug repurposing; drug-target interaction; transcriptomics
Year: 2022 PMID: 35401145 PMCID: PMC8985410 DOI: 10.3389/fnagi.2022.752858
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Figure 1A flowchart of the whole study. (1) Data collection from AlzData and ADNI; (2) Data preprocessing (e.g., eliminating the samples with missing data); (3) DEGs regarded with |logFC| > 0.1 and FDR < 0.05; (4) Enrichment of biological process analyzed by DAVID 6.8; (5) Use WGCNA to find AD-specific module; (6) Prioritize driver genes of AD by CFG score; (7) candidate genes with CFG≥5 are identified. (8) Collect the dataset of target, drug and disease; (9) Combine deepDTnet and SAveRUNNER to predict association between candidate genes and AD.
Brief descriptions for five datasets.
|
|
|
| |||
|---|---|---|---|---|---|
|
|
|
|
| ||
| Abbreviation | EC | HP | FC | TC | ADNI |
| No.of.gene | 15361 | 16313 | 11779 | 15462 | 49387 |
| Sample size(Control/AD) | 78 (39/39) | 141 (67/74) | 232 (128/104) | 91 (39/52) | 234 (194/40) |
| Age | 80 (29.6) | 81.7 (9.6) | 83 (9.4) | 81 (8.7) | 74.3 (6.5) |
| Male/Female/Unknown | 35/43/0 | 68/73/0 | 99/111/22 | 32/41/18 | 116/118/0 |
|
| NA | NA | NA | NA | 1142.9 (494.9) |
| Tau | NA | NA | NA | NA | 25.4 (11.6) |
These datasets come from AlzData and ADNI, respectively. Each dataset has multiple features. SDs are given in parentheses.
Figure 2Enhanced Volcano for illustrating DEGs in all datasets. The gene with |logFC| > 0.1 and FDR < 0.05 as DEGs shown in red node. (A) EC, (B) HP, (C) FC, (D) TC and (E) ADNI. Note: in ADNI dataset, DEGs by counting the frequency of 3 or above out of 10 occurrences.
Figure 3Venn diagram is used to represent relationships between EC (blue), HP (red), FC (green), TC (yellow) and ADNI (brown).
Figure 4Venn diagram is used to represent relationships between multiple datasets. (A) KEGG pathway and (B) GO term.
Significant KEGG pathways obtained from DAVID (P < 0.005).
|
|
|
|
|
|---|---|---|---|
| hsa00020 | Citrate cycle (TCA cycle) | hsa04966 | Collecting duct acid secretion |
| hsa00190 | Oxidative phosphorylation | hsa05010 | Alzheimer's disease |
| hsa00260 | Glycine, serine and threonine metabolism | hsa05012 | Parkinson's disease |
| hsa00620 | Pyruvate metabolism | hsa05014 | Amyotrophic lateral sclerosis |
| hsa01200 | Carbon metabolism | hsa05016 | Huntington disease |
| hsa01210 | 2-Oxocarboxylic acid metabolism | hsa05017 | Spinocerebellar ataxia |
| hsa01230 | Biosynthesis of amino acids | hsa05020 | Prion disease |
| hsa01522 | Endocrine resistance | hsa05022 | Pathways of neurodegeneration - multiple diseases |
| hsa03050 | Proteasome | hsa05032 | Morphine addiction |
| hsa04010 | MAPK signaling pathway | hsa05033 | Nicotine addiction |
| hsa04070 | Phosphatidylinositol signaling system | hsa05110 | Vibrio cholerae infection |
| hsa04071 | Sphingolipid signaling pathway | hsa05120 | Epithelial cell signaling in Helicobacter pylori infection |
| hsa04110 | Cell cycle | hsa05131 | Shigellosis |
| hsa04120 | Ubiquitin mediated proteolysis | hsa05132 | Salmonella infection |
| hsa04137 | Mitophagy - animal | hsa05140 | Leishmaniasis |
| hsa04140 | Autophagy - animal | hsa05145 | Toxoplasmosis |
| hsa04144 | Endocytosis | hsa05152 | Tuberculosis |
| hsa04145 | Phagosome | hsa05163 | Human cytomegalovirus infection |
| hsa04152 | AMPK signaling pathway | hsa05167 | Kaposi sarcoma-associated herpesvirus infection |
| hsa04211 | Longevity regulating pathway | hsa05169 | Epstein-Barr virus infection |
| hsa04218 | Cellular senescence | hsa05202 | Transcriptional misregulation in cancer |
| hsa04260 | Cardiac muscle contraction | hsa05205 | Proteoglycans in cancer |
| hsa04360 | Axon guidance | hsa05212 | Pancreatic cancer |
| hsa04625 | C-type lectin receptor signaling pathway | hsa05214 | Glioma |
| hsa04666 | Fc gamma R-mediated phagocytosis | hsa05215 | Prostate cancer |
| hsa04721 | Synaptic vesicle cycle | hsa05219 | Bladder cancer |
| hsa04722 | Neurotrophin signaling pathway | hsa05220 | Chronic myeloid leukemia |
| hsa04723 | Retrograde endocannabinoid signaling | hsa05223 | Non-small cell lung cancer |
| hsa04920 | Adipocytokine signaling pathway | hsa05225 | Hepatocellular carcinoma |
| hsa04932 | Non-alcoholic fatty liver disease | hsa05235 | PD-L1 expression and PD-1 checkpoint pathway in cancer |
| hsa04961 | Endocrine and other factor-regulated calcium reabsorption |
Figure 5Top 20 pathway of KEGG for five datasets (P < 0.005). (A) EC, (B) HP, (C) FC, (D) TC, and (E) ADNI.
Significant GO terms obtained from DAVID (P < 0.005).
|
|
|
|---|---|
| GO:0002223 | Stimulatory C-type lectin receptor signaling pathway |
| GO:0006888 | ER to Golgi vesicle-mediated transport |
| GO:0048015 | Phosphatidylinositol-mediated signaling |
| GO:0038128 | ERBB2 signaling pathway |
| GO:0007249 | I-kappaB kinase/NF-kappaB signaling |
| GO:0006672 | ceramide metabolic process |
| GO:0000165 | MAPK cascade |
| GO:0045944 | Positive regulation of transcription from RNA polymerase II promoter |
| GO:0007269 | Neurotransmitter secretion |
| GO:0035329 | Hippo signaling |
| GO:0006120 | Mitochondrial electron transport, NADH to ubiquinone |
| GO:0042776 | Mitochondrial ATP synthesis coupled proton transport |
| GO:0070125 | Mitochondrial translational elongation |
| GO:0032981 | Mitochondrial respiratory chain complex I assembly |
| GO:0007409 | Axonogenesis |
| GO:0048812 | Neuron projection morphogenesis |
| GO:0043524 | Negative regulation of neuron apoptotic process |
| GO:0007268 | Chemical synaptic transmission |
| GO:0060078 | Regulation of postsynaptic membrane potential |
| GO:0016079 | Synaptic vesicle exocytosis |
| GO:0048813 | Dendrite morphogenesis |
| GO:0090263 | Positive regulation of canonical Wnt signaling pathway |
| GO:0009967 | Positive regulation of signal transduction |
| GO:0051932 | Synaptic transmission, GABAergic |
| GO:0046034 | ATP metabolic process |
| GO:0070933 | Histone H4 deacetylation |
| GO:0007420 | Brain development |
| GO:0007417 | Central nervous system development |
| GO:0035357 | Peroxisome proliferator activated receptor signaling pathway |
| GO:0015986 | ATP synthesis coupled proton transport |
| GO:0040029 | Regulation of gene expression, epigenetic |
| GO:0007399 | Nervous system development |
| GO:0051966 | Regulation of synaptic transmission, glutamatergic |
| GO:0048488 | Synaptic vesicle endocytosis |
| GO:0010977 | Negative regulation of neuron projection development |
| GO:0060071 | Wnt signaling pathway, planar cell polarity pathway |
| GO:0006521 | Regulation of cellular amino acid metabolic process |
| GO:2000310 | Regulation of N-methyl-D-aspartate selective glutamate receptor activity |
| GO:0038061 | NIK/NF-kappaB signaling |
| GO:0035418 | Protein localization to synapse |
| GO:0060291 | Long-term synaptic potentiation |
The first column is GO terms ID; the second column is the name of GO terms.
Figure 6Module-trait relationships for five datasets.Each row represents different gene co-expression modules, and each column represents different clinical phenotypes. Number represent correlation coefficients and P-values are in parenthesis. Correlation strength is represented by continuous color, with red being positive, blue being negative. (A) EC, (B) HP, (C) FC, (D) TC, and (E) ADNI.
The 40 potential driver genes are prioritized by the CFG method based on AlzData database.
|
|
|
| |||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
| |||
|
|
| ||||||
|
| 2 | 2 |
| yes | 0.388** | 0.131 | 5 |
|
| 1 | 0 |
| yes | 0.270 | 0.526* | 4 |
|
| 3 | NA |
| yes | 0.352* | –0.023 | 4 |
|
| 5 | 2 |
| yes | –0.199 | –0.738** | 4 |
|
| 5 | 0 |
| yes | 0.482*** | 0.738** | 4 |
|
| 1 | 3 |
| NA | –0.304* | –0.506 | 4 |
|
| 1 | 32 |
| yes | –0.419** | –0.579* | 4 |
|
| 1 | 1 |
| yes | –0.688*** | –0.783*** | 4 |
|
| 1 | 0 |
| yes | 0.503*** | 0.662** | 4 |
|
| 3 | 8 |
| yes | 0.559*** | 0.120 | 4 |
|
| 1 | 0 |
| yes | –0.277 | –0.585* | 4 |
|
| 10 | 1 |
| yes | –0.128 | –0.638* | 4 |
|
| 1 | 0 |
| yes | –0.359* | 0.002 | 4 |
|
| 7 | 4 |
| yes | 0.800*** | 0.275 | 4 |
|
| 3 | 12 |
| yes | –0.488*** | –0.583* | 4 |
|
| 1 | 0 |
| yes | 0.310* | 0.282 | 4 |
|
| 8 | 0 |
| yes | 0.353* | 0.510 | 4 |
|
| 1 | 0 |
| yes | –0.433** | –0.772*** | 4 |
|
| 16 | 1 |
| NA | –0.332* | –0.497 | 4 |
|
| 0 | 1 |
| yes | –0.345* | –0.691** | 4 |
|
| 5 | 2 |
| yes | 0.32* | 0.609* | 4 |
|
| 1 | 1 |
| yes | 0.780*** | 0.718** | 4 |
|
| 2 | 0 |
| yes | 0.591*** | –0.107 | 4 |
|
| 1 | 1 |
| yes | 0.525*** | 0.008 | 4 |
|
| 7 | 3 |
| NA | 0.195 | 0.623* | 4 |
|
| 1 | 0 |
| yes | 0.114 | 0.662** | 4 |
|
| 2 | 17 |
| yes | 0.085 | 0.587* | 4 |
|
| 6 | 2 |
| yes | –0.064 | –0.387 | 4 |
|
| 8 | 3 |
| yes | 0.041 | 0.086 | 4 |
|
| 8 | 0 |
| yes | –0.416** | 0.546* | 4 |
|
| 1 | 0 |
| yes | 0.769*** | 0.616* | 4 |
|
| 2 | 41 |
| yes | 0.307* | –0.248 | 4 |
|
| 1 | 0 |
| yes | 0.879*** | 0.839*** | 4 |
|
| 8 | 0 |
| yes | 0.353* | 0.510 | 4 |
|
| 1 | 0 |
| yes | 0.432** | –0.069 | 4 |
|
| 1 | 0 |
| yes | –0.439** | –0.396 | 4 |
|
| 14 | 0 |
| yes | 0.438** | –0.022 | 4 |
|
| 7 | 1 |
| yes | 0.651*** | 0.494 | 4 |
|
| 0 | 5 |
| yes | 0.319* | –0.284 | 4 |
|
| 2 | 18 |
| yes | 0.623*** | 0.685** | 4 |
“NA,” not applicable due to missing related data for the target gene. AD, Alzheimer's disease; CFG, convergent functional genomics score based on the total number of lines of AD-related evidence; DEG, differentially expressed gene; eQTL, the total number of risk SNPs based on the IGAP data setthat were able to regulate expression of the target gene; GWAS, the total number of risk SNPs within the target gene based on the IGAP data set; PPI, AD core genes (APP, PSEN1, PSEN2, MAPT, and APOE) that had a significant protein-protein interaction with the target genes; Early_DEG: target gene is differentially expressed in AD mouse models before AD pathology emergence; Expression correlation of the target gene and AD pathology in AD mice was performed for the Aβ line AD mice in Mouse (marked as Aβ) and the Tau line AD mice in Mouse (marked as Tau). *P < 0.05; **P < 0.01; .
Figure 7Drug-GJA1-disease interaction network. The network contained candidate target GJA1 (green), Neurodegenerative Diseases (red) and 13 drugs (yellow).Gray indicate known interaction. Green and red lines and newly predicted interactions using deepDTnet and SAveRUNNER, respectively.