| Literature DB >> 34220416 |
Shu Zhao1, Zhiwei Bao1, Xinyi Zhao1, Mengxiang Xu1, Ming D Li1,2, Zhongli Yang1.
Abstract
BACKGROUND: Major depressive disorder (MDD) is a global health challenge that impacts the quality of patients' lives severely. The disorder can manifest in many forms with different combinations of symptoms, which makes its clinical diagnosis difficult. Robust biomarkers are greatly needed to improve diagnosis and to understand the etiology of the disease. The main purpose of this study was to create a predictive model for MDD diagnosis based on peripheral blood transcriptomes.Entities:
Keywords: biomarkers; depression; machine learning; major depressive disorder; meta-analysis
Year: 2021 PMID: 34220416 PMCID: PMC8249859 DOI: 10.3389/fnins.2021.645998
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
FIGURE 1Workflow of data processing. GEO, Gene Expression Omnibus; QC, quality control.
Basic information of collected microarray datasets.
| Study | GEO accession number | Country | Array platform | Samples MDD/Control | Number of genes after QC |
| GSE98793 | United Kingdom | Affymetrix Human Genome U133 Plus 2.0 Array | 64/64 | 20188 | |
| GSE19738 | Netherlands | Agilent-012391 Whole Human Genome Oligo Microarray G4112A | 33/34 | 13334 | |
| GSE38206 | France | Agilent-028004 SurePrint G3 Human GE 8x60K Microarray | 9/9 | 33074 | |
| GSE52790 | China | Affymetrix Human hGlue_3_0_v1 Array | 10/12 | 16951 | |
| GSE39653 | United States | Illumina HumanHT-12 V4.0 expression beadchip | 21/24 | 29328 | |
| GSE76826 | Japan | Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381 | 10/12 | 27382 | |
| GSE46743 | Germany | Illumina HumanHT-12 V3.0 expression beadchip | 69/91 | 8615 | |
| GSE58430 | China | Agilent-028004 SurePrint G3 Human GE 8x60K Microarray | 6/6 | 20188 | |
| GSE32280 | China | Affymetrix Human Genome U133 Plus 2.0 Array | 8/8 | 22879 |
FIGURE 2Volcano plot of MDD-related DEGs. Node colors define change direction in DEGs: red for upregulated genes, green for downregulated genes, and gray for not significant genes. Node size combines the effect size and FDR value: a larger node indicates that mean effect size of gene is large and FDR value is small.
FIGURE 3Expression of most significant DEGs between MDD and control group (a random-effects model) in different studies. Lines indicate 95% confidence intervals (CI), and the midpoint of each line is denoted by a square indicating the standardized mean difference (SMD) for each study. Diamond indicates overall SMD and 95% confidence interval.
AUC of single gene models for MDD diagnosis.
| Study | ||||||
| 0.67 | 0.66 | 0.70 | 0.65 | 0.60 | 0.71 | |
| 0.63 | 0.62 | 0.59 | 0.71 | 0.66 | 0.50 | |
| 0.65 | 0.57 | 0.55 | 0.51 | 0.65 | 0.61 | |
| 0.68 | 0.72 | 0.85 | 0.51 | 0.67 | 0.68 | |
| 0.82 | 0.68 | 0.72 | 0.72 | 0.63 | 0.62 | |
| 0.62 | 0.78 | 0.78 | 0.62 | 0.56 | 0.69 | |
| Mean ± SD | 0.68 ± 0.07 | 0.67 ± 0.07 | 0.70 ± 0.11 | 0.62 ± 0.09 | 0.63 ± 0.04 | 0.64 ± 0.08 |
Comparison of different models in the validation sets.
| Average value | SVM | kNN | NB | RF |
| AUC | 0.84 ± 0.09 | 0.73 ± 0.11 | 0.83 ± 0.09 | 0.81 ± 0.10 |
| Accuracy | 0.79 ± 0.11 | 0.69 ± 0.10 | 0.74 ± 0.13 | 0.76 ± 0.12 |
| Sensitivity | 0.80 ± 0.14 | 0.54 ± 0.16 | 0.81 ± 0.15 | 0.83 ± 0.14 |
| Specificity | 0.77 ± 0.10 | 0.82 ± 0.07 | 0.69 ± 0.14 | 0.70 ± 0.14 |
Evaluation of classification effect of the SVM model.
| Testing sample | Study | AUC | Accuracy | Sensitivity | Specificity |
| Training | 0.89 | 0.77 | 0.78 | 0.77 | |
| Internal test | 0.73 | 0.67 | 0.73 | 0.62 | |
| 0.91 | 0.80 | 0.90 | 0.71 | ||
| 0.83 | 0.86 | 0.90 | 0.83 | ||
| 0.86 | 0.77 | 0.70 | 0.83 | ||
| Independent test | 0.78 | 0.67 | 0.67 | 0.67 | |
| Mean ± SD in test datasets | 0.82 ± 0.07 | 0.75 ± 0.08 | 0.78 ± 0.11 | 0.74 ± 0.10 | |
FIGURE 4Comparison of prediction performance between SVM and single-gene models. Red lines represent ROC curves of SVM model in different studies, and lines with other colors represent ROC curves of various single gene models.