| Literature DB >> 36003956 |
Sitong Liu1,2, Tong Lu3, Qian Zhao1,2, Bingbing Fu1,2, Han Wang1,2, Ginhong Li1,2, Fan Yang1,2, Juan Huang1,2, Nan Lyu1,2.
Abstract
Background: Identifying new biomarkers of major depressive disorder (MDD) would be of great significance for its early diagnosis and treatment. Herein, we constructed a diagnostic model of MDD using machine learning methods.Entities:
Keywords: artificial neural network; bioinformatics analysis; machine learning; major depressive disorder; random forest
Year: 2022 PMID: 36003956 PMCID: PMC9393475 DOI: 10.3389/fnins.2022.949609
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
The information of datasets.
| Dataset | Platform | Organism | Tissue | Sample | |
|
|
| ||||
|
| |||||
| GSE19738 | GPL6848 | Homo Sapiens | Blood | 34 | 33 |
| GSE98793 | GPL570 | Homo Sapiens | Blood | 64 | 128 |
FIGURE 1Before and after box diagram of standardization of GSE19738 and GSE98793 datasets. (A) The flow chart of this study. (B) Box diagram of the GSE19738 dataset before correction; (C) Box diagram of the GSE98793 dataset before correction; (D) Box diagram of the GSE19738 dataset after correction; and (E) Box diagram of the GSE98793 dataset after correction; Red represents the MDD samples, and blue represents the normal samples.
FIGURE 2DEG identification of the GSE19738 dataset. (A) DEGs of peripheral blood samples from patients with MDD and healthy controls were obtained from the DEG heat map constructed from the GSE19738 dataset. Horizontal coordinate blue represents the control group, red represents the experimental group, blue indicates low expression, and red indicates high expression. (B) Volcano diagram, black indicates genes with no differential expression, blue indicates down-regulated genes, and red indicates up-regulated genes. (C) GO enrichment analysis. The outer circle represents the number of GO term, the outer circle number represents all genes in GO term, and the inner circle number represents the number of enriched genes. The inner circle pie chart represents the percentage of genes that are enriched. (D) KEGG pathways. The outer circle represents the KEGG ID, the outer circle number represents all genes in the KEGG pathway, and the inner circle number represents the number of genes enriched in the pathway. The inner circle pie chart represents the percentage of genes that are enriched. DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
GO enrichment analyses results.
| GO ID | Description | Number | ||
| GO: 0034341 | Response to interferon-gamma | 37 | 1.46E-24 | 4.15E-22 |
| GO: 0051607 | Defense response to virus | 49 | 3.74E-32 | 2.36E-29 |
| GO:0009615 | Response to virus | 58 | 5.70E-35 | 4.63E-32 |
| GO:0019221 | Cytokine-mediated signaling pathway | 80 | 2.27E-32 | 1.61E-29 |
| GO:0098542 | Defense response to other organism | 61 | 1.42E-24 | 4.15E-22 |
| GO:0045087 | Innate immune response | 104 | 8.81E-42 | 1.25E-38 |
| GO:0001817 | Regulation of cytokine production | 69 | 2.04E-25 | 6.84E-23 |
| GO:0001816 | Cytokine production | 69 | 3.01E-25 | 9.51E-23 |
| GO:0034097 | Response to cytokine | 107 | 2.77E-39 | 2.62E-36 |
| GO:0051707 | Response to other organism | 88 | 7.14E-30 | 3.11E-27 |
| GO:0043207 | Response to external biotic stimulus | 88 | 7.66E-30 | 3.11E-27 |
| GO:0071345 | Cellular response to cytokine stimulus | 93 | 1.43E-31 | 8.12E-29 |
| GO:0009607 | Response to biotic stimulus | 91 | 8.60E-31 | 4.45E-28 |
| GO:0006952 | Defense response | 147 | 2.56E-47 | 1.46E-43 |
| GO:0002252 | Immune effector process | 99 | 4.33E-30 | 2.05E-27 |
| GO:0006955 | Immune response | 160 | 1.07E-45 | 3.03E-42 |
| GO:0002682 | Regulation of immune system process | 110 | 8.62E-26 | 3.06E-23 |
| GO:0002376 | Immune system process | 185 | 1.02E-42 | 1.93E-39 |
| GO:0009605 | Response to external stimulus | 139 | 7.34E-26 | 2.78E-23 |
| GO:0006950 | Response to stress | 208 | 1.86E-39 | 2.12E-36 |
KEGG enrichment analyses results.
| KEGG ID | Description | Number | ||
| ko04668 | TNF signaling pathway | 16 | 4.09E-08 | 1.39E-06 |
| ko04064 | NF-kappa B signaling pathway | 19 | 1.94E-07 | 5.11E-06 |
| ko04216 | Ferroptosis | 11 | 9.06E-09 | 3.58E-07 |
| ko04217 | Necroptosis | 24 | 8.24E-12 | 9.77E-10 |
| ko04621 | NOD-like receptor signaling pathway | 27 | 2.20E-13 | 5.22E-11 |
| ko04380 | Osteoclast differentiation | 20 | 2.17E-10 | 1.71E-08 |
| ko05164 | Influenza A | 22 | 1.40E-09 | 8.30E-08 |
| ko05169 | Epstein-Barr virus infection | 27 | 4.83E-09 | 2.29E-07 |
| ko05162 | Measles | 18 | 6.86E-08 | 2.03E-06 |
| ko05160 | Hepatitis C | 18 | 2.47E-07 | 5.85E-06 |
| ko04620 | Toll-like receptor signaling pathway | 13 | 4.66E-06 | 9.21E-05 |
| ko04625 | C-type lectin receptor signaling pathway | 13 | 4.66E-06 | 9.21E-05 |
| ko05145 | Toxoplasmosis | 13 | 7.74E-06 | 1.41E-04 |
| ko05321 | Inflammatiory bowel disease (IBD) | 10 | 1.31E-05 | 2.07E-04 |
| ko05167 | Kaposi sarcoma-associated herpesvirus infection | 17 | 1.31E-05 | 2.07E-04 |
| ko05134 | Legionellosis | 9 | 1.80E-05 | 2.67E-04 |
| ko04978 | Mineral absorption | 9 | 2.74E-05 | 3.82E-04 |
| ko05161 | Hepatitis B | 15 | 3.21E-05 | 4.23E-04 |
| ko04062 | Chemokine signaling pathway | 16 | 6.01E-05 | 7.50E-04 |
| ko04920 | Adipocytokine signaling pathway | 9 | 8.36E-05 | 9.90E-04 |
FIGURE 3(A) Protein-protein interaction network of DEGs constructed using cytoscape. (B) The top 10 hub genes were explored using CytoHubba.
FIGURE 4(A) Random forest tree. (B) MeanDecreaseGini. (C) Artificial Neural Network model. Healthy stands for healthy group, and MDD stands for MDD group.
FIGURE 5Receiver operating characteristic curves for the artificial neural network. The AUC curve of GSE19738 training cohort is on the left, and the AUC curve of GSE98793 testing cohort is on the right.
FIGURE 6Evaluation and correlation analysis of immune cell infiltration. (A) Panoramic view of 22 immune cell infiltrates in peripheral blood samples; (B, C) High and low expression group of immune cell infiltration difference.