| Literature DB >> 36056063 |
Fengjun Zhang1, Mingyue Xia2, Yuan Liu3, Min Peng4, Jiarong Jiang5, Shuai Wang6, Qiong Zhao7, Cheng Yu8, Jinzhen Yu9, Dexian Xian1, Xiao Li10, Lin Zhang11.
Abstract
Dilated cardiomyopathy (DCM) is a condition of impaired ventricular remodeling and systolic diastole that is often complicated by arrhythmias and heart failure with a poor prognosis. This study attempted to identify autophagy-related genes (ARGs) with diagnostic biomarkers of DCM using machine learning and bioinformatics approaches. Differential analysis of whole gene microarray data of DCM from the Gene Expression Omnibus (GEO) database was performed using the NetworkAnalyst 3.0 platform. Differentially expressed genes (DEGs) matching (|log2FoldChange ≥ 0.8, p value < 0.05|) were obtained in the GSE4172 dataset by merging ARGs from the autophagy gene libraries, HADb and HAMdb, to obtain autophagy-related differentially expressed genes (AR-DEGs) in DCM. The correlation analysis of AR-DEGs and their visualization were performed using R language. Gene Ontology (GO) enrichment analysis and combined multi-database pathway analysis were served by the Enrichr online enrichment analysis platform. We used machine learning to screen the diagnostic biomarkers of DCM. The transcription factors gene regulatory network was constructed by the JASPAR database of the NetworkAnalyst 3.0 platform. We also used the drug Signatures database (DSigDB) drug database of the Enrichr platform to screen the gene target drugs for DCM. Finally, we used the DisGeNET database to analyze the comorbidities associated with DCM. In the present study, we identified 23 AR-DEGs of DCM. Eight (PLEKHF1, HSPG2, HSF1, TRIM65, DICER1, VDAC1, BAD, TFEB) molecular markers of DCM were obtained by two machine learning algorithms. Transcription factors gene regulatory network was established. Finally, 10 gene-targeted drugs and complications for DCM were identified.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36056063 PMCID: PMC9440113 DOI: 10.1038/s41598-022-19027-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Workflow diagram of the current study. GO, go ontology; TFs, transcription factors.
Figure 2DEGs differential analysis of GSE4172 dataset. (a) Heatmap of DEGs in GSE4172 dataset (n = 60, p < 0.05, |log2 FoldChange|≥ 0.8). (b) Asymptotic volcano map of gene expression in the GSE4172 dataset. The two vertical lines indicated gene expression ploidy changes > 0.8 and < -0.8, respectively, and the horizontal line indicated a p value of 0.05. The color of the dots represented the level of the p value. The top 10 significantly expressed genes among the DEGs were labeled on the graph.
Figure 3AR-DEGs were shown by Venn diagram. 366 DEGs-Up and 404 DEGs-Down were intersected with 232 and 796 autophagy-associated genes from the HADb and HAMDb autophagy gene pools, with 23 genes being identical. The number of intersecting genes was marked in the red box. DEGs-Up, differentially expressed up-regulated genes; DEGs-Down, differentially expressed down-regulated genes.
Figure 423 AR-DEGs in dilated cardiomyopathy (DCM) group and control group and their correlation. (a) Box plot of the expression levels of 23 DEGs-Down in the DCM and control groups. The blue box plots above the corresponding gene names indicated expression in control groups, whereas the red box plots indicated expression in DCM groups. (b) Correlation heatmap of 23 AR-DEGs. The color within the circle shape and the magnitude of the correlation value represented the strength of the correlation; red represented positive correlation and blue represented negative correlation. The darker the color, the larger the absolute value of the correlation value represented a stronger correlation.
GO category, GO pathways, corresponding p-values, and AR-DEGs.
| Go category | GO pathways | GO ID | p-value | AR-DEGs |
|---|---|---|---|---|
| Biological process | Regulation of autophagy | GO:0010506) | 5.24E−09 | |
| Positive regulation of autophagy | GO:0010508) | 5.2E−08 | ||
| Positive regulation of cellular catabolic process | GO:0031331) | 4.93E−07 | ||
| Positive regulation of cold-induced thermogenesis | GO:0120162) | 4.29E−06 | ||
| Positive regulation of metabolic process | GO:0009893) | 7.87E−06 | ||
| Macroautophagy | GO:0016236) | 9.99E−06 | ||
| Response to sodium arsenite | GO:1903935) | 1.26E−05 | ||
| Cellular response to sodium arsenite | GO:1903936) | 1.26E−05 | ||
| Cellular response to salt | GO:1902075) | 1.89E−05 | ||
| Negative regulation of tumor necrosis factor production | GO:0032720) | 1.95E−05 | ||
| Molecular function | Low-density lipoprotein particle receptor binding | GO:0050750) | 0.000315 | |
| Lipoprotein particle receptor binding | GO:0070325) | 0.00047 | ||
| Endoribonuclease activity | GO:0004521) | 0.000694 | ||
| Regulatory RNA binding | GO:0061980) | 0.000961 | ||
| Endonuclease activity | GO:0004519) | 0.001559 | ||
| Kinase binding | GO:0019900) | 0.001742 | ||
| Protein kinase binding | GO:0019901) | 0.002446 | ||
| Ribonuclease activity | GO:0004540) | 0.002595 | ||
| C-X-C chemokine receptor activity | GO:0016494) | 0.005737 | ||
| Intronic transcription regulatory region sequence-specific DNA binding | GO:0001161) | 0.005737 | ||
| Cellular component | lysosome | GO:0005764) | 7.4E−07 | |
| Lytic vacuole | GO:0000323) | 0.000105 | ||
| Lytic vacuole membrane | GO:0098852) | 0.003419 | ||
| Lysosomal lumen | GO:0043202) | 0.00436 | ||
| Endosome membrane | GO:0010008) | 0.005917 | ||
| Lysosomal membrane | GO:0005765) | 0.006172 | ||
| Nuclear stress granule | GO:0097165) | 0.006881 | ||
| AP-2 adaptor complex | GO:0030122) | 0.008023 | ||
| Clathrin coat of endocytic vesicle | GO:0030128) | 0.008023 | ||
| Mitochondrial outer membrane | GO:0005741) | 0.009136 |
Top 10 pathways from KEGG, BioPlanet, Reactome, WikiPathways databases and their corresponding p-values and genes for AR-DEGs.
| Databases | Pathways | p-value | Genes |
|---|---|---|---|
| KEGG | Longevity regulating pathway | 2.12E−04 | |
| AMPK signaling pathway | 3.42E−04 | ||
| Apelin signaling pathway | 5.04E−04 | ||
| Insulin signaling pathway | 5.04E−04 | ||
| cGMP-PKG signaling pathway | 8.96E−04 | ||
| Calcium signaling pathway | 0.002531 | ||
| Acute myeloid leukemia | 0.002673 | ||
| Mitophagy | 0.002752 | ||
| Adipocytokine signaling pathway | 0.002832 | ||
| Endocytosis | 0.002905 | ||
| BioPlanet | AMPK signaling | 6.34E−05 | |
| Mitochondrial pathway of apoptosis: BH3-only Bcl-2 family | 1.83E−04 | ||
| Phosphoinositides and their downstream targets | 3.73E−04 | ||
| PKB-mediated events | 4.70E−04 | ||
| TOR signaling | 6.94E−04 | ||
| Endocytosis | 0.001527 | ||
| Calcineurin-dependent NFAT signaling role in lymphocytes | 0.00181 | ||
| Acute myeloid leukemia | 0.001943 | ||
| ERK1/ERK2 MAPK pathway | 0.002518 | ||
| Adipocytokine signaling pathway | 0.002673 | ||
| Reactome | Macroautophagy Homo sapiens R-HSA-1632852 | 6.07E−05 | |
| Cellular responses to stress Homo sapiens R-HSA-2262752 | 7.49E−04 | ||
| mTOR signalling Homo sapiens R-HSA-165159 | 9.13E−04 | ||
| PKB-mediated events Homo sapiens R-HSA-109703 | 9.61E−04 | ||
| Disease Homo sapiens R-HSA-1643685 | 0.001206 | ||
| HIV Infection Homo sapiens R-HSA-162906 | 0.002028 | ||
| PI3K Cascade Homo sapiens R-HSA-109704 | 0.003693 | ||
| Degradation of the extracellular matrix Homo sapiens R-HSA-1474228 | 0.006547 | ||
| Infectious disease Homo sapiens R-HSA-5663205 | 0.007144 | ||
| Nef Mediated CD8 Down-regulation Homo sapiens R-HSA-182218 | 0.008023 | ||
| WikiPathway | PI3K-AKT-mTOR signaling pathway and therapeutic opportunities WP3844 | 5.28E−06 | |
| Leptin and adiponectin WP3934 | 5.66E−05 | ||
| AMP-activated protein kinase (AMPK) signaling WP1403 | 6.62E−05 | ||
| The influence of laminopathies on Wnt signaling WP4844 | 7.36E−04 | ||
| Target Of Rapamycin (TOR) Signaling WP1471 | 7.78E−04 | ||
| Synaptic signaling pathways associated with autism spectrum disorder WP4539 | 0.001499 | ||
| RAC1/PAK1/p38/MMP2 Pathway WP3303 | 0.002752 | ||
| Peptide GPCRs WP24 | 0.003249 | ||
| Leptin signaling pathway WP2034 | 0.003424 | ||
| IL-18 signaling pathway WP4754 | 0.003602 |
Figure 5(a) Identification results of GO terms related to biological processes, cellular components and molecular functions based on gene enrichment analysis. Higher p value indicated a higher number of genes involved in this GO ontology. (b) Identification of results from combined multi-pathway analysis by KEGG, WikiPathways, BioPlanet and Reactome.
Figure 6Screening of diagnostic biomarkers for DCM by machine learning algorithms. (a) Screening of optimal genes by LASSO regression model. (b) Plot of the best gene selected by SVM-RFE algorithm. (c) Venn diagram embodying the eight diagnostic biomarkers common to both machine learning algorithms. LASSO, least absolute shrinkage and selection operator; SVM-RFE, support vector machine-recursive feature elimination.
Figure 7Network of transcription factors interacting with 8 potential diagnostic biomarkers. The highlighted orange nodes indicated the 8 potential diagnostic biomarkers and the other pink nodes indicated transcription factors. The network consisted of 8 core genes, 46 nodes and 76 edges.
Drugs of choice for dilated cardiomyopathy.
| Term | p-value | Combined score | Genes |
|---|---|---|---|
| Arsenenous acid CTD 00000922 | 5.77E−05 | 76.35849 | |
| Melatonin CTD 00006260 | 0.000058 | 462.545 | |
| Metformin CTD 00006282 | 0.000115 | 338.4126 | |
| Tretinoin HL60 UP | 0.000152 | 154.1536 | |
| Imatinib CTD 00003267 | 0.000231 | 244.7684 | |
| Arsenenous acid CTD 00000922 | 0.000374 | 61.03121 | |
| Wortmannin CTD 00000504 | 0.000606 | 154.125 | |
| Isoflupredone HL60 UP | 0.000867 | 371.9284 | |
| Telmisartan CTD 00003021 | 0.001059 | 325.1773 | |
| Rosiglitazone CTD 00003139 | 0.001142 | 68.5849 |
Figure 8The process of identifying comorbidities in DCM.
Clinical information for the GSE4172 dataset.
| Sample | Group | Age | Gender | Ejection fraction | Left ventricular end diastolic diameter | Inflammation/PVB19 |
|---|---|---|---|---|---|---|
| GSM94836 | DCM | 45 | Male | 34 | 62 | Positive |
| GSM94837 | DCM | 62 | Male | 51 | 73 | Positive |
| GSM94838 | DCM | 31 | Male | 52 | 57 | Positive |
| GSM94839 | DCM | 67 | Male | 43 | 59 | Positive |
| GSM94840 | DCM | 60 | Male | 34 | 76 | Positive |
| GSM94841 | DCM | 69 | Male | 35 | 60 | Positive |
| GSM94842 | DCM | 55 | Female | 31 | 61 | Positive |
| GSM94843 | DCM | 31 | Female | 56 | 71 | Positive |
| GSM94831 | Healthy control | 36 | Female | 68 | 47 | Negative |
| GSM94854 | Healthy control | 46 | Female | 61 | 49 | Negative |
| GSM94855 | Healthy control | 26 | Female | 74 | 47 | Negative |
| GSM94870 | Healthy control | 36 | Male | 64 | 50 | Negative |