Literature DB >> 34220416

Identification of Diagnostic Markers for Major Depressive Disorder Using Machine Learning Methods.

Shu Zhao¹, Zhiwei Bao¹, Xinyi Zhao¹, Mengxiang Xu¹, Ming D Li^1,2, Zhongli Yang¹.

Abstract

BACKGROUND: Major depressive disorder (MDD) is a global health challenge that impacts the quality of patients' lives severely. The disorder can manifest in many forms with different combinations of symptoms, which makes its clinical diagnosis difficult. Robust biomarkers are greatly needed to improve diagnosis and to understand the etiology of the disease. The main purpose of this study was to create a predictive model for MDD diagnosis based on peripheral blood transcriptomes.
MATERIALS AND METHODS: We collected nine RNA expression datasets for MDD patients and healthy samples from the Gene Expression Omnibus database. After a series of quality control and heterogeneity tests, 302 samples from six studies were deemed suitable for the study. R package "MetaOmics" was applied for systematic meta-analysis of genome-wide expression data. Receiver operating characteristic (ROC) curve analysis was used to evaluate the diagnostic effectiveness of individual genes. To obtain a better diagnostic model, we also adopted the support vector machine (SVM), random forest (RF), k-nearest neighbors (kNN), and naive Bayesian (NB) tools for modeling, with the RF method being used for feature selection.
RESULTS: Our analysis revealed six differentially expressed genes (AKR1C3, ARG1, KLRB1, MAFG, TPST1, and WWC3) with a false discovery rate (FDR) < 0.05 between MDD patients and control subjects. We then evaluated the diagnostic ability of these genes individually. With single gene prediction, we achieved a corresponding area under the curve (AUC) value of 0.63 ± 0.04, 0.67 ± 0.07, 0.70 ± 0.11, 0.64 ± 0.08, 0.68 ± 0.07, and 0.62 ± 0.09, respectively, for these genes. Next, we constructed the classifiers of SVM, RF, kNN, and NB with an AUC of 0.84 ± 0.09, 0.81 ± 0.10, 0.73 ± 0.11, and 0.83 ± 0.09, respectively, in validation datasets, suggesting that the SVM classifier might be superior for constructing an MDD diagnostic model. The final SVM classifier including 70 feature genes was capable of distinguishing MDD samples from healthy controls and yielded an AUC of 0.78 in an independent dataset.
CONCLUSION: This study provides new insights into potential biomarkers through meta-analysis of GEO data. Constructing different machine learning models based on these biomarkers could be a valuable approach for diagnosing MDD in clinical practice.

Entities: Chemical

Keywords: biomarkers; depression; machine learning; major depressive disorder; meta-analysis

Year: 2021 PMID： 34220416 PMCID： PMC8249859 DOI： 10.3389/fnins.2021.645998

Source DB: PubMed Journal: Front Neurosci ISSN： 1662-453X Impact factor: 4.677

Introduction

From 1990 to 2016, major depressive disorder (MDD) was one of the five leading causes of years lived with disability (GBD 2016 Disease and Injury Incidence and Prevalence Collaborators, 2017). Patients with MDD have a higher risk of diabetes, stroke, cardiovascular disease, obesity, cancer, cognitive impairment, and Alzheimer’s disease (Otte et al., 2016). Moreover, MDD is one of the most common disorders associated with suicidal behavior. It has been estimated that the risk of suicide in MDD patients is increased substantially (greater than 10 times) compared with the general population (Chesney et al., 2014). Early diagnosis and appropriate treatment would undoubtedly reduce the incidence and mortality rate of MDD patients. However, like many other affective disorders, the complex etiology of MDD and the inevitable need for clinical judgment based on an individual’s medical history may cause a lack of reliability in diagnosis. More objective diagnostic methods thus are required. Previous studies have explored molecular biomarkers of MDD based on genomic, epigenetic, transcriptomic, and proteomic sources (Gururajan et al., 2016). Several types of molecules have been revealed with these approaches, which include mitochondrial DNA (Cai et al., 2015), small non-coding RNAs (Bocchio-Chiavetto et al., 2013), neurotransmitters (Belzeaux et al., 2010), neurotrophic and growth factors (Iga et al., 2007; Cattaneo et al., 2013), HPA axis-related molecules (Austin et al., 2003), and mediators of neuroinflammation (Cattaneo et al., 2013; Iacob et al., 2013). For example, several studies (Cattaneo et al., 2013; Iacob et al., 2013, 2014) reported increased expression of peripheral mRNAs for the pro-inflammatory cytokines interleukin (IL)-1α, IL-1β, IL-6, IL-8, IL-10, interferon (IFN)-γ, migration inhibitory factor (MIF), and tumor necrosis factor (TNF)-α in MDD patients compared with healthy control subjects. In addition, neuroimaging approaches, such as magnetic resonance imaging (MRI), electroencephalography (EEG), diffusion tensor imaging (DTI), near-infrared spectroscopy (NIRS), and molecular imaging (i.e., PET and SPECT) (Kang and Cho, 2020) have been used to discover biomarkers for diagnosis and treatment of MDD. This study focused on the peripheral transcriptomic biomarkers, which have been described as “sentinels of disease” (Liew et al., 2006). Because of the complicated and heterogeneous pathogenesis of MDD, there existed some limitations in the study of relevant transcriptomic biomarkers. For example, studies on brain-derived neurotrophic factor (BDNF) had inconsistent results. The studies of Karege et al. (2005) and Piccinni et al. (2008) reported a reduction in BDNF in depressed patients compared with healthy persons, but in the study of Serra-Millàs et al. (2011), MDD patients showed higher plasma BDNF concentrations. However, in another study, researchers found no significant difference in plasma BDNF concentrations between MDD patients and control subjects (Bocchio-Chiavetto et al., 2010). Based on these facts, it appears that identification of reliable biomarkers for predicting diagnosis and treatment of MDD remains a challenge. Therefore, in this study, meta-analysis was first performed to identify consistent biomarkers from different large-sample datasets. Although there are various meta-analyses of microarray data, they generally focus on one or a few genes; few have been developed for systematic integration of multiple microarray datasets (Sun et al., 2017; Li et al., 2018). The current commonly used meta-analysis method was proposed by Choi et al. (2003) and facilitates the detection of small but consistent expression changes and increases sensitivity and reliability. With this method, Chen et al. (2019) acquired gene expression data from eight commonly used in vitro macrophage models to perform a meta-analysis and identified consistently differentially expressed genes (DEGs) that have been implicated in inflammatory and metabolic processes. Forero et al. (2017) found MDD-related DEGs for blood, amygdala, cerebellum, anterior cingulate cortex, and prefrontal cortex regions based on GWES using meta-analysis. However, whether these DEGs would be useful as biomarkers has not been evaluated yet. MDD is influenced by both genetics and environment where the transcriptome feature patterns or feature function patterns may represent disease subtypes, outcome prognosis, drug benefit prediction, or specific biological process. The machine learning (Anttila et al., 2018) approach has an advantage in recognizing subtle patterns in large and noisy datasets, which is particularly useful in the study of complex transcriptome data. For example, Xu et al. (2012) employed the SVM-RFE approach to select genes for prediction of breast cancer prognosis and discovered a 50-gene signature that yielded significantly higher accuracy than the widely used 70-gene signature (van ‘t Veer et al., 2002). By adopting fuzzy forests of transcriptome data, Ciobanu et al. (2020) found that the downregulated TFRC (transferrin receptor) can predict recurrent MDD with an accuracy of 63%. The aim of this study was to identify potential transcriptional biosignatures that might be used for the diagnosis of MDD. Here, we applied meta-analysis to discover DEGs differing between MDD patients and healthy controls. Six significant DEGs with FDRs < 0.05 were investigated for their diagnostic capability. To obtain better diagnostic efficacy, we compared four ML models. Finally, an SVM prediction model consisting of 70 feature genes was constructed and validated by a reserved independent gene expression dataset.

Materials and Methods

Systematic Search of Microarray Expression Profiling Datasets

MDD-related keywords were searched in the Medical Subject Headings(MeSH) library[1]. Then we conducted a systematic search in the GEO repository[2] using the following search sentence: ((((((((MDD) OR major depressive disorders) OR depressive disorders) OR depressive syndromes) OR depression)) AND (((blood) OR peripheral blood) OR PB)) AND Homo sapiens [Organism]) AND Expression profiling by array [Filter]. A report was included in the analysis if the following criteria were satisfied: (1) used a case-control design; (2) patients did not have any diseases other than MDD; and (3) the patients were medication free. We finally obtained a total of 9 datasets (Figure 1).

FIGURE 1

Workflow of data processing. GEO, Gene Expression Omnibus; QC, quality control.

Initial Data Processing

Nine microarray datasets were retrieved from the GEO database: GSE98793, GSE19738, GSE38206, GSE52790, GSE39653, GSE76826, GSE58430, GSE32280, and GSE46743, with a sample size of 128, 67, 18, 22, 45, 22, 12, 16, and 160, respectively (Table 1).

TABLE 1

Basic information of collected microarray datasets.

Study	GEO accession number	Country	Array platform	Samples MDD/Control	Number of genes after QC
Leday et al. (2018)	GSE98793	United Kingdom	Affymetrix Human Genome U133 Plus 2.0 Array	64/64	20188
Spijker et al. (2010)	GSE19738	Netherlands	Agilent-012391 Whole Human Genome Oligo Microarray G4112A	33/34	13334
Belzeaux et al. (2012)	GSE38206	France	Agilent-028004 SurePrint G3 Human GE 8x60K Microarray	9/9	33074
Liu et al. (2014)	GSE52790	China	Affymetrix Human hGlue_3_0_v1 Array	10/12	16951
Savitz et al. (2013)	GSE39653	United States	Illumina HumanHT-12 V4.0 expression beadchip	21/24	29328
Miyata et al. (2016)	GSE76826	Japan	Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381	10/12	27382
Arloth et al. (2015)	GSE46743	Germany	Illumina HumanHT-12 V3.0 expression beadchip	69/91	8615
Wang et al. (2015)	GSE58430	China	Agilent-028004 SurePrint G3 Human GE 8x60K Microarray	6/6	20188
Yi et al. (2012)	GSE32280	China	Affymetrix Human Genome U133 Plus 2.0 Array	8/8	22879

Basic information of collected microarray datasets. For the GSE98793, GSE52790, and GSE32280 datasets, based on the Affymetrix platform (Thermo Fisher Scientific, Inc., Waltham, MA, United States), the raw CEL data were downloaded; and the Robust Multi-Array Average (RMA) method and the ‘‘Oligo’’ package from BioConductor[3] were used to normalize the data and annotate the probe information. For the GSE19738, GSE38206, GSE58430, and GSE76826 data, based on the Agilent platform (Agilent Technologies, Inc., Santa Clara, CA, United States), the quantile method was used to normalize the data. Annotation of the probe information was based on Agilent platform information. For the GSE39653 and GSE46743 datasets based on the Illumina platform (Illumina Inc., San Diego, CA, United States), the quantile method was used to normalize the data, and annotation of the probe information was based on Illumina platform information. Normalized signal intensity data were imported into BRB-Array Tools (v. 4.5)[4] for initial processing. We excluded those genes with more than 50% of the data missing. The most variable probe measured by inter-quartile range (IQR) was used to handle redundant probe sets that correspond to the same gene.

Microarray Gene Expression Meta-Analysis

Meta-analysis of microarray data was carried out in “MetaOmics” based on R language (Ma et al., 2019), which includes three packages: MetaQC, MetaDE, and MetaPath. MetaQC (Kang et al., 2012) was used for the quality control of datasets before meta-analysis, and the MetaDE (Wang et al., 2012) package was used to identifying differentially expressed genes. We used the following six quantitative quality control indexes to assess heterogeneity across different studies: internal homogeneity of co-expression structure among studies (IQC), external consistency of co-expression pattern with pathway database (EQC), accuracy of biomarker detection (AQCg), accuracy of enriched pathway detection (AQCp), consistency of differentially expressed genes (CQCg), and consistency of enriched pathway ranking (CQCp). Each QC index was defined as the minus log-transformed p-value from formal hypothesis testing in each QC criterion (Wang et al., 2012). Finally, standardized mean rank (SMR) was generated to assist decision making. In this study, datasets with SMR values > 5 were excluded from analysis (Esmaeili et al., 2020). MetaDE was used for identifying differentially expressed genes, and the meta-analysis method used in the current study was developed by Choi et al. (2003). The change of gene expression was represented as “effect size,” a standardized index measuring the magnitude of a treatment or covariate effect. The effect sizes of different studies were combined to obtain an estimate of the overall mean. Herein, we applied the random effects model, and FDR correction was used to control for multiple testing. Finally, genes were considered significant at FDR < 0.05.

Protein–Protein Interaction (PPI) Network Construction and Module Analysis

A total of 217,249 pairs of FIs were downloaded from Reactome (v. 2014[5]; Croft et al., 2011). These pairwise relations were derived from datasets of protein–protein interactions in BioGrid (Chatr-Aryamontri et al., 2015), the Database of Interacting Proteins (Salwinski et al., 2004), the Human Protein Reference Database (Keshava Prasad et al., 2009), I2D (Brown and Jurisica, 2007), IntACT (Orchard et al., 2014), and MINT (Licata et al., 2012), as well as from gene co-expression data derived from multiple high-throughput techniques, including yeast two-hybrid assays, mass spectrometry pull-down experiments, and DNA microarrays (Wu et al., 2010). The above interaction information was imported into Cytoscape software (v. 3.2.1[6]) to construct the FI network (Shannon et al., 2003). A spectral partition-based network clustering (Newman, 2006) was used to search for modules based on the FI network. A KEGG pathway enrichment analysis was used to analyze functions for each individual network module. We selected a size cutoff of 2 to filter out small network modules. An FDR value of < 0.05 was considered to represent significantly enriched processes or signaling pathways. Co-expression patterns of the genes in the same module were analyzed by using the Pearson correlation test.

Establishment of the ML Classifier

Supplementary Figure 3 shows an overview of the proposed ML method involving feature extraction, selection, classification, and validation. R package ‘‘caret’’ (v. 6.0-84[7]) was applied in the following steps. The six datasets included in this study were divided into two parts: GSE98793 (Leday et al., 2018), GSE19738 (Spijker et al., 2010), GSE39653 (Savitz et al., 2013), GSE52790 (Liu et al., 2014), and GSE76826 (Miyata et al., 2016) were used as discovery datasets and GSE38206 (Belzeaux et al., 2012) as an independent validation dataset. We performed meta-analysis on discovery sets to identify DGEs between MDD patients and healthy controls. DEGs with p < 0.01 were considered potential biomarkers for further feature selection. Then, the dataset GSE98793 with the largest sample size was used as the training dataset in both feature screening and ML modeling. The other four datasets in discovery sets were used for internal validation. We applied an RF algorithm to reduce the number of feature genes, with the following steps: (1) ranked genes in descending order based on their importance; (2) eliminated feature genes one by one according to the importance of each feature with the goal of producing a new feature set; and (3) repeated the above process with the new feature set. We used 10-fold cross-validation for verification and calculated the average accuracy value to assess the classification capability. Finally, the feature set with the highest average accuracy was selected for model construction. The following ML algorithms, SVM (Cortes and Vapnik, 1995), RF (Tin Kam, 1998), kNN (Altman, 1992), and NB (Friedman et al., 1997), were used to build prediction models for gene expression data. The aim of SVM algorithm is to identify a decision hyperplane that make the distance between the hyperplane and the instances that are closest to boundary is maximized. By introducing the concept of “soft margin” and using “kernel trick,” SVM performs well with linear indivisibility data (Boser et al., 1996). SVM with the radial basis function (RBF) kernel was used in this study and parameters “C” and “sigma” were returned. Random Forest is an ensembles-learning algorithm forming with a series of decision trees. Each tree is developed from a subset from the training data. The class of the new instance is determined by using the majority vote of individual trees in the forest (Tin Kam, 1998). In this study, R package “randomForest” was used in the modeling and we tuned the parameter “mtry.” In kNN, classification is based on the distance between the instances and an object is classified according to the status of its k nearest neighbors. R package “kknn” was used in the modeling, where we set distance = 2 (Euclidean Distance), and tuned the parameter “kmax.” Naive Bayes is a statistical classification algorithm based on theBayes theorem. The core idea of Naive Bayes is, if the probability ofinstance x belonging to A is greater than the probability of belonging to B under some attribute conditions, it is said that instance x belongs to A. R package “klaR” was used in the modeling and output parameter “usekernel.” Default values were used for other parameters in each model. To evaluate the overall performance of each model, leave-one-out cross-validation was performed. Details about the tested parameters and their corresponding test values for each model are provided in Supplementary Table 6. We used the average AUC to assess the classification capability of each model. Finally, a model with the highest average AUC in validation sets was chosen. To facilitate clinical application, we attempted to construct a model with fewer genes without affecting the accuracy of classification efficiency. We compared the classification ability of the model based on the average AUC value of discovery set. The criterion for determining the final model was that the model achieved the optimal average AUC value in the discovery sets. The feature genes in the final determined SVM classifier were used to perform the supervised clustering of samples and extents of expression. The clustering results were visualized using a heatmap (Gu et al., 2016).

Results

Data Sets Collection and Pre-processing

The information in the datasets is shown inTable 1. Dataset GSE46743 (Miyata et al., 2016) has a number of genes < 10,000 after QC was excluded. Eight eligible datasets were finally used for the following analysis, which consisted of a total of 161 MDD and 169 control samples. The quality of the eight datasets was assessed utilizing “MetaQC.” Among the eight microarray datasets, six were included in the further meta-analysis for DEGs (Supplementary Figure 1 and Supplementary Table 1), and the other two studies (Yi et al., 2012; Wang et al., 2015) were excluded because of their lower quantitative quality control scores (SMR < 5). We then combined the other six datasets and obtained a total of 9,263 common genes that were used as input for the meta-analysis, which revealed 137 DEGs with p < 0.01. Of them, 66 were upregulated and 71 downregulated in MDD (Figure 2). A detailed list of these DEGs is given in Supplementary Table 2. Figure 2 shows the six most significant DEGs with FDRs < 0.05; they are tyrosylprotein sulfotransferase 1 (TPST1), arginase 1 (ARG1), killer cell lectin-like receptor B1 (KLRB1), WWC family member 3 (WWC3), aldo-keto reductase family 1 member C3 (AKR1C3), and MAF bZIP transcription factor G (MAFG). The forest plots of the six genes’ expression in different datasets are shown in Figure 3. These six genes were associated with immune process, inflammatory response, and hormonal metabolic process (Supplementary Table 3). In short, these results implied that these six biomarkers might play important roles in MDD.

FIGURE 2

FIGURE 3

Expression of most significant DEGs between MDD and control group (a random-effects model) in different studies. Lines indicate 95% confidence intervals (CI), and the midpoint of each line is denoted by a square indicating the standardized mean difference (SMD) for each study. Diamond indicates overall SMD and 95% confidence interval.

Volcano plot of MDD-related DEGs. Node colors define change direction in DEGs: red for upregulated genes, green for downregulated genes, and gray for not significant genes. Node size combines the effect size and FDR value: a larger node indicates that mean effect size of gene is large and FDR value is small. Expression of most significant DEGs between MDD and control group (a random-effects model) in different studies. Lines indicate 95% confidence intervals (CI), and the midpoint of each line is denoted by a square indicating the standardized mean difference (SMD) for each study. Diamond indicates overall SMD and 95% confidence interval.

Protein–Protein Interaction Network and Module Pathway Enrichment

By mapping 137 MDD-related DEGs to the FI data, we constructed an MDD-related FI network comprising 137 nodes, of which 103 were isolated and 34 were classified into seven clusters (Supplementary Figure 2A). A topographical analysis of the FI network revealed five modules ranging in size from three to eight genes (Supplementary Figure 2B). We next explored the potential co-expression of the DEGs in each module. There was a moderate to high positive correlation among the expression of most genes in Modules 0, 3, and 4 (Supplementary Figure 2C). In Module 1, there was a moderate to high positive correlation among MLKL, CEP63, and CSNK1E, a moderate negative correlation among DNAJC7, MLKL, and CSNK1E (Supplementary Figure 2C). Most genes in Module 2 showed a low correlation between each other. To understand how the 26 genes of the five modules were related to the molecular mechanisms of MDD, we performed a functional enrichment analysis of these modules based on pathway annotation (Supplementary Table 4). The enriched pathways of Modules 0 and 2 were related to some elements and events in transcription and translation, such as the ribosome, spliceosome, RNA degradation, or mRNA surveillance pathway. Similarly, genes in Module 3 were involved in transcriptional mis-regulation in cancers. Module 1 was related mainly to signaling pathways associated with the immune response, for example, antigen processing and presentation and the IL-17 signaling pathway. Neurodegenerative disease-related pathways in KEGG were enriched in Module 4, which included genes involved in Alzheimer, Huntington, and Parkinson diseases. In addition, metabolic pathways were enriched in Module 4. Together, this identified MDD-related FI network of dysregulated pathways could serve as a pool of novel functional module genes for future investigation in the diagnosis of MDD.

Evaluation of Diagnostic Ability of Single Genes

Next, we constructed single gene models to distinguish MDD patientsfrom healthy control subjects. We chose the six most significant DEGsand calculated the diagnostic ability of these genes with ROC curveanalysis. Table 2 shows the predicted resultsof single gene model in different datasets, represented by AUC values. The gene with the most significant expression difference, TPST1, had an average AUC value of 0.68 and a predictive ability of 0.82 in the dataset of Miyata et al. (2016), but only 0.62 in the dataset of Belzeaux et al. (2012). The gene with the better predictive potency was KLRB1, with an average AUC value of 0.70 and an SD of 0.11. The performance of WWC3 was the worst, with an average AUC of 0.63 and an SD of 0.04. These results indicated that the model developed with individual genes was not effective for diagnosis of MDD in clinics.

TABLE 2

AUC of single gene models for MDD diagnosis.

Study	TPST1	ARG1	KLBR1	WWC3	AKR1C3	MAFG
Leday et al. (2018)	0.67	0.66	0.70	0.65	0.60	0.71
Spijker et al. (2010)	0.63	0.62	0.59	0.71	0.66	0.50
Savitz et al. (2013)	0.65	0.57	0.55	0.51	0.65	0.61
Liu et al. (2014)	0.68	0.72	0.85	0.51	0.67	0.68
Miyata et al. (2016)	0.82	0.68	0.72	0.72	0.63	0.62
Belzeaux et al. (2012)	0.62	0.78	0.78	0.62	0.56	0.69
Mean ± SD	0.68 ± 0.07	0.67 ± 0.07	0.70 ± 0.11	0.62 ± 0.09	0.63 ± 0.04	0.64 ± 0.08

AUC of single gene models for MDD diagnosis.

ML Classifier

The above analyses indicated that the average AUCs of most single gene models were less than 0.70, suggesting that more efficient diagnostic models were necessary. We used the transcriptome data from the discovery sets for meta-analysis and obtained 114 DEGs (Supplementary Table 5) with p < 0.01 as input for feature screening. First, the feature set containing 108 genes was selected as it had the highest accuracy in the training dataset (Supplementary Figure 4A). Based on this feature set, four ML classification methods were used for modeling and the parameters of each model were shown in Supplementary Table 6. All models yielded an average AUC > 0.7 in validation datasets (Table 3), with SVM producing the highest average AUC (0.84). We thus chose SVM as the final diagnostic model.

TABLE 3

Comparison of different models in the validation sets.

Average value	SVM	kNN	NB	RF
AUC	0.84 ± 0.09	0.73 ± 0.11	0.83 ± 0.09	0.81 ± 0.10
Accuracy	0.79 ± 0.11	0.69 ± 0.10	0.74 ± 0.13	0.76 ± 0.12
Sensitivity	0.80 ± 0.14	0.54 ± 0.16	0.81 ± 0.15	0.83 ± 0.14
Specificity	0.77 ± 0.10	0.82 ± 0.07	0.69 ± 0.14	0.70 ± 0.14

Comparison of different models in the validation sets. To facilitate clinical application, we attempted to construct a modelwith fewer genes. Feature genes ranked with an average AUC ofdiscovery datasets were picked at 10 intervals from the top 10 to number 108. As shown in Supplementary Figure 4B, the accuracy of the SVM classifier was improved with an increasing number of genes, and the average accuracy reached the top at 0.84 once 70 genes were selected. The SVM classifier was still able to distinguish MDD samples from the healthy controls in test datasets with an average AUC of 0.82, accuracy of 0.75, sensitivity of 0.78, and specificity of 0.74 (Table 4). In the independent dataset, the classifier achieved an AUC of 0.78 (Table 4), which was greatly better than the model with randomly selected 70 genes (Supplementary Table 9). Besides, we calculated positive predictive value (0.74 ± 0.06) and Matthews correlation coefficient (0.52 ± 0.14) in training and test data sets which also reflect a great performance of our model (Supplementary Table 8). The top six most significant genes mentioned above were all included in this SVM model (Supplementary Figure 5 and Supplementary Table 7). Compared with a single gene, the SVM model had better predictive performance (Figure 4).

TABLE 4

Evaluation of classification effect of the SVM model.

Testing sample	Study	AUC	Accuracy	Sensitivity	Specificity
Training	Leday et al., 2018	0.89	0.77	0.78	0.77
Internal test	Spijker et al., 2010	0.73	0.67	0.73	0.62
	Savitz et al., 2013	0.91	0.80	0.90	0.71
	Liu et al., 2014	0.83	0.86	0.90	0.83
	Miyata et al., 2016	0.86	0.77	0.70	0.83
Independent test	Belzeaux et al., 2012	0.78	0.67	0.67	0.67
Mean ± SD in test datasets		0.82 ± 0.07	0.75 ± 0.08	0.78 ± 0.11	0.74 ± 0.10

FIGURE 4

Comparison of prediction performance between SVM and single-gene models. Red lines represent ROC curves of SVM model in different studies, and lines with other colors represent ROC curves of various single gene models.

Evaluation of classification effect of the SVM model. Comparison of prediction performance between SVM and single-gene models. Red lines represent ROC curves of SVM model in different studies, and lines with other colors represent ROC curves of various single gene models.

Discussion

Although there have been numerous reports analyzing DEGs in MDD patients and healthy individuals, an exploration of diagnosis and etiology of MDD remains a challenge. To identify effective diagnostic biomarkers of MDD, in this study, we first conducted a meta-analysis of six studies, which revealed 137 DEGs with a p < 0.01. Then we identified functional module genes showing that these DEGs were involved in the processes of transcription and translation, inflammation, immune-related pathways, and neurodegenerative diseases. Six DEGs with FDR < 0.05 were investigated by ROC curve analysis for their potential to distinguish MDD patients from healthy controls. To improve predictive power, we applied four ML methods with RF being used for feature selection. Finally, we constructed an SVM model containing 70 feature genes and showed that it was superior to the single gene prediction model. In the first part of the present study, we used meta-analysis to identify reliable DEGs as biomarkers. Six differentially expressed genes with FDRs < 0.05 were identified in MDD and healthy control subjects and used to calculate classification efficiency. Among them, the functions of KLRB1, ARG1, and TPST1 were associated with immune and inflammatory responses (Supplementary Table 2). It is known that inflammation and immunity play an important role in the etiology of depression (Liu et al., 2019). For example, individuals with autoimmune diseases and severe infections are more likely to have depression (Malhi and Mann, 2018). Peripheral cytokine concentrations have been linked to brain function, wellbeing, and cognition (Bollen et al., 2017). Therefore, we speculate that KLRB1, ARG1, and TPST1 influence the occurrence and progression of MDD by participating in the immune or inflammation pathways. AKR1C3 has the activity of aldo-keto reductase (nicotinamide adenine nucleotide phosphate; NADP), which plays an important role in interconversion of androgens, estrogens, and progestins to their cognate inactive metabolites (Supplementary Table 2). Over the past several years, both clinical and preclinical studies have established a strong link between sex hormones and depression (Thériault and Perreault, 2019). For example, in women, the prevalence of depression correlates with changes in hormonal fluctuations, such as puberty, prior to menstruation, during the postpartum period, and after the onset of menopause (Thériault and Perreault, 2019). Even if these differentially expressed genes are biologically related to the MDD process, ROC analysis showed that the diagnostic AUC value of a single gene was generally < 0.7, and, importantly, there existed great differences in the effectiveness of a diagnostic model developed on the basis of individual genes among different datasets or studies. These results strongly indicated that including only the top DEGs in a diagnostic model might lack a reliable distinguishing effect in all datasets, which has been one of the reasons transcriptome DEGs are currently difficult to use as MDD biomarkers. Further in this study, four ML classifiers of screened feature genes were constructed. Among them, SVM produced better classification results. We then reduced the model size to 70 feature genes and found that the SVM model of these genes displayed acceptable performance in distinguishing MDD from control samples of discovery sets. The verification on an independent dataset exhibited an AUC of 0.78 and an accuracy of 0.67. The results showed that the predictive power of the model was superior to that of a single gene as an indicator of classification. The application of ML in some large-scale omics data is a popular undertaking, such as in cancer genomics (Zhou et al., 2018) and radiomics (Huang et al., 2018; Ding et al., 2019). In the depression-related studies, ML algorithms also have been used to find radiomics (Rubin-Falcone et al., 2018) and video- (Schultebraucks et al., 2020) and audio-based markers (Schultebraucks et al., 2020) for diagnosis or medication prediction. However, unlike cancers, in some studies of peripheral transcriptome biomarkers of MDD, it was difficult to find a relatively credible database such as TCGA (The Cancer Genome Atlas) as an independent validation. For example, one study constructed an elastic net model using immune-inflammatory signature to classify MDD and BD; it achieved high accuracy (AUC = 97%), but the result lacked the independent validation to acquire a true diagnostic effect of these biomarkers. In contrast to other studies on MDD biomarkers, the blood transcriptomic data used for modeling here were from multiple studies, and the validation data were completely independent from the training data. Besides, we identified feature biomarkers by using meta-analysis followed by RF. Therefore, our strategy could help in identifying consistent and reliable biomarkers from different studies and was more conducive to the evaluation of the generalization ability of the model. This study has several limitations. First, although existing data of MDD and healthy control subjects were used, it is unknown at this stage how much data are required to establish a reliable predictive model, which can be answered through empirical investigation. Second, the biomarkers used in this study were based on statistical significance, although the biological conception of these genes could also be considered. For example, our subsequent research can build a diagnostic model to refer to genes in the module of the PPI network. Third, because as many data as possible were used for modeling to improve the accuracy, independent validation data are needed in the future. MDD occurs in a heterogeneous patient population, which makes accurate diagnosis a challenge. To address this challenge, we conducted meta-analysis of six datasets and found significant DEGs. The TPST1, ARG1, KLRB1, WWC3, AKR1C3, MAF, and MAFG genes were highlighted as potential feature genes influencing MDD. In addition, we constructed four ML models and chose SVM as a diagnostic model for MDD. We finally obtained an SVM diagnostic model containing 70 feature genes with an average AUC of 0.83, whose diagnostic effectiveness was superior to that of a single gene. Together, this study provided some markers that may be prospective precise diagnosis targets for MDD. Besides, this study provides new insight into how meta-analysis and ML can be used to find relatively objective transcriptional markers for complex mental diseases.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

SZ, ZB, XZ, and MX collected the data, conducted the analysis, and drafted the manuscript. ML provided resources and revised the manuscript. ZY administered project and revised the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

59 in total

1. Transcriptomic profiling of peripheral blood CD4⁺ T-cells in asthmatics with and without depression.

Authors: Ting Wang; Yu-Lin Ji; Yin-Yin Yang; Xing-Yu Xiong; I-Ming Wang; Andrew J Sandford; Zong-An Liang; Jian-Qing He
Journal: Gene Date: 2015-04-11 Impact factor: 3.688

Review 2. Depression.

Authors: Gin S Malhi; J John Mann
Journal: Lancet Date: 2018-11-02 Impact factor: 79.321

3. Clinical variations modulate patterns of gene expression and define blood biomarkers in major depression.

Authors: Raoul Belzeaux; Christine Formisano-Tréziny; Anderson Loundou; Laurent Boyer; Jean Gabert; Jean-Claude Samuelian; François Féron; Jean Naudin; El Chérif Ibrahim
Journal: J Psychiatr Res Date: 2010-05-14 Impact factor: 4.791

4. Stimulated gene expression profiles as a blood marker of major depressive disorder.

Authors: Sabine Spijker; Jeroen S Van Zanten; Simone De Jong; Brenda W J H Penninx; Richard van Dyck; Frans G Zitman; Jan H Smit; Bauke Ylstra; August B Smit; Witte J G Hoogendijk
Journal: Biol Psychiatry Date: 2010-05-14 Impact factor: 13.382

5. A comprehensive regional analysis of genome-wide expression profiles for major depressive disorder.

Authors: Diego A Forero; Gina P Guio-Vega; Yeimy González-Giraldo
Journal: J Affect Disord Date: 2017-04-26 Impact factor: 4.839

6. Gene expression profiling predicts clinical outcome of breast cancer.

Authors: Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal: Nature Date: 2002-01-31 Impact factor: 49.962

7. Blood microRNA changes in depressed patients during antidepressant treatment.

Authors: Luisella Bocchio-Chiavetto; Elisabetta Maffioletti; Paola Bettinsoli; Caterina Giovannini; Stefano Bignotti; Daniela Tardito; Dario Corrada; Luciano Milanesi; Massimo Gennarelli
Journal: Eur Neuropsychopharmacol Date: 2012-08-25 Impact factor: 4.600

8. Blood-based gene expression profiles models for classification of subsyndromal symptomatic depression and major depressive disorder.

Authors: Zhenghui Yi; Zezhi Li; Shunying Yu; Chengmei Yuan; Wu Hong; Zuowei Wang; Jian Cui; Tieliu Shi; Yiru Fang
Journal: PLoS One Date: 2012-02-13 Impact factor: 3.240

9. Analysis of shared heritability in common disorders of the brain.

Authors: Verneri Anttila; Brendan Bulik-Sullivan; Hilary K Finucane; Raymond K Walters; Jose Bras; Laramie Duncan; Valentina Escott-Price; Guido J Falcone; Padhraig Gormley; Rainer Malik; Nikolaos A Patsopoulos; Stephan Ripke; Zhi Wei; Dongmei Yu; Phil H Lee; Patrick Turley; Benjamin Grenier-Boley; Vincent Chouraki; Yoichiro Kamatani; Claudine Berr; Luc Letenneur; Didier Hannequin; Philippe Amouyel; Anne Boland; Jean-François Deleuze; Emmanuelle Duron; Badri N Vardarajan; Christiane Reitz; Alison M Goate; Matthew J Huentelman; M Ilyas Kamboh; Eric B Larson; Ekaterina Rogaeva; Peter St George-Hyslop; Hakon Hakonarson; Walter A Kukull; Lindsay A Farrer; Lisa L Barnes; Thomas G Beach; F Yesim Demirci; Elizabeth Head; Christine M Hulette; Gregory A Jicha; John S K Kauwe; Jeffrey A Kaye; James B Leverenz; Allan I Levey; Andrew P Lieberman; Vernon S Pankratz; Wayne W Poon; Joseph F Quinn; Andrew J Saykin; Lon S Schneider; Amanda G Smith; Joshua A Sonnen; Robert A Stern; Vivianna M Van Deerlin; Linda J Van Eldik; Denise Harold; Giancarlo Russo; David C Rubinsztein; Anthony Bayer; Magda Tsolaki; Petra Proitsi; Nick C Fox; Harald Hampel; Michael J Owen; Simon Mead; Peter Passmore; Kevin Morgan; Markus M Nöthen; Martin Rossor; Michelle K Lupton; Per Hoffmann; Johannes Kornhuber; Brian Lawlor; Andrew McQuillin; Ammar Al-Chalabi; Joshua C Bis; Agustin Ruiz; Mercè Boada; Sudha Seshadri; Alexa Beiser; Kenneth Rice; Sven J van der Lee; Philip L De Jager; Daniel H Geschwind; Matthias Riemenschneider; Steffi Riedel-Heller; Jerome I Rotter; Gerhard Ransmayr; Bradley T Hyman; Carlos Cruchaga; Montserrat Alegret; Bendik Winsvold; Priit Palta; Kai-How Farh; Ester Cuenca-Leon; Nicholas Furlotte; Tobias Kurth; Lannie Ligthart; Gisela M Terwindt; Tobias Freilinger; Caroline Ran; Scott D Gordon; Guntram Borck; Hieab H H Adams; Terho Lehtimäki; Juho Wedenoja; Julie E Buring; Markus Schürks; Maria Hrafnsdottir; Jouke-Jan Hottenga; Brenda Penninx; Ville Artto; Mari Kaunisto; Salli Vepsäläinen; Nicholas G Martin; Grant W Montgomery; Mitja I Kurki; Eija Hämäläinen; Hailiang Huang; Jie Huang; Cynthia Sandor; Caleb Webber; Bertram Muller-Myhsok; Stefan Schreiber; Veikko Salomaa; Elizabeth Loehrer; Hartmut Göbel; Alfons Macaya; Patricia Pozo-Rosich; Thomas Hansen; Thomas Werge; Jaakko Kaprio; Andres Metspalu; Christian Kubisch; Michel D Ferrari; Andrea C Belin; Arn M J M van den Maagdenberg; John-Anker Zwart; Dorret Boomsma; Nicholas Eriksson; Jes Olesen; Daniel I Chasman; Dale R Nyholt; Andreja Avbersek; Larry Baum; Samuel Berkovic; Jonathan Bradfield; Russell J Buono; Claudia B Catarino; Patrick Cossette; Peter De Jonghe; Chantal Depondt; Dennis Dlugos; Thomas N Ferraro; Jacqueline French; Helle Hjalgrim; Jennifer Jamnadas-Khoda; Reetta Kälviäinen; Wolfram S Kunz; Holger Lerche; Costin Leu; Dick Lindhout; Warren Lo; Daniel Lowenstein; Mark McCormack; Rikke S Møller; Anne Molloy; Ping-Wing Ng; Karen Oliver; Michael Privitera; Rodney Radtke; Ann-Kathrin Ruppert; Thomas Sander; Steven Schachter; Christoph Schankin; Ingrid Scheffer; Susanne Schoch; Sanjay M Sisodiya; Philip Smith; Michael Sperling; Pasquale Striano; Rainer Surges; G Neil Thomas; Frank Visscher; Christopher D Whelan; Federico Zara; Erin L Heinzen; Anthony Marson; Felicitas Becker; Hans Stroink; Fritz Zimprich; Thomas Gasser; Raphael Gibbs; Peter Heutink; Maria Martinez; Huw R Morris; Manu Sharma; Mina Ryten; Kin Y Mok; Sara Pulit; Steve Bevan; Elizabeth Holliday; John Attia; Thomas Battey; Giorgio Boncoraglio; Vincent Thijs; Wei-Min Chen; Braxton Mitchell; Peter Rothwell; Pankaj Sharma; Cathie Sudlow; Astrid Vicente; Hugh Markus; Christina Kourkoulis; Joana Pera; Miriam Raffeld; Scott Silliman; Vesna Boraska Perica; Laura M Thornton; Laura M Huckins; N William Rayner; Cathryn M Lewis; Monica Gratacos; Filip Rybakowski; Anna Keski-Rahkonen; Anu Raevuori; James I Hudson; Ted Reichborn-Kjennerud; Palmiero Monteleone; Andreas Karwautz; Katrin Mannik; Jessica H Baker; Julie K O'Toole; Sara E Trace; Oliver S P Davis; Sietske G Helder; Stefan Ehrlich; Beate Herpertz-Dahlmann; Unna N Danner; Annemarie A van Elburg; Maurizio Clementi; Monica Forzan; Elisa Docampo; Jolanta Lissowska; Joanna Hauser; Alfonso Tortorella; Mario Maj; Fragiskos Gonidakis; Konstantinos Tziouvas; Hana Papezova; Zeynep Yilmaz; Gudrun Wagner; Sarah Cohen-Woods; Stefan Herms; Antonio Julià; Raquel Rabionet; Danielle M Dick; Samuli Ripatti; Ole A Andreassen; Thomas Espeseth; Astri J Lundervold; Vidar M Steen; Dalila Pinto; Stephen W Scherer; Harald Aschauer; Alexandra Schosser; Lars Alfredsson; Leonid Padyukov; Katherine A Halmi; James Mitchell; Michael Strober; Andrew W Bergen; Walter Kaye; Jin Peng Szatkiewicz; Bru Cormand; Josep Antoni Ramos-Quiroga; Cristina Sánchez-Mora; Marta Ribasés; Miguel Casas; Amaia Hervas; Maria Jesús Arranz; Jan Haavik; Tetyana Zayats; Stefan Johansson; Nigel Williams; Astrid Dempfle; Aribert Rothenberger; Jonna Kuntsi; Robert D Oades; Tobias Banaschewski; Barbara Franke; Jan K Buitelaar; Alejandro Arias Vasquez; Alysa E Doyle; Andreas Reif; Klaus-Peter Lesch; Christine Freitag; Olga Rivero; Haukur Palmason; Marcel Romanos; Kate Langley; Marcella Rietschel; Stephanie H Witt; Soeren Dalsgaard; Anders D Børglum; Irwin Waldman; Beth Wilmot; Nikolas Molly; Claiton H D Bau; Jennifer Crosbie; Russell Schachar; Sandra K Loo; James J McGough; Eugenio H Grevet; Sarah E Medland; Elise Robinson; Lauren A Weiss; Elena Bacchelli; Anthony Bailey; Vanessa Bal; Agatino Battaglia; Catalina Betancur; Patrick Bolton; Rita Cantor; Patrícia Celestino-Soper; Geraldine Dawson; Silvia De Rubeis; Frederico Duque; Andrew Green; Sabine M Klauck; Marion Leboyer; Pat Levitt; Elena Maestrini; Shrikant Mane; Daniel Moreno- De-Luca; Jeremy Parr; Regina Regan; Abraham Reichenberg; Sven Sandin; Jacob Vorstman; Thomas Wassink; Ellen Wijsman; Edwin Cook; Susan Santangelo; Richard Delorme; Bernadette Rogé; Tiago Magalhaes; Dan Arking; Thomas G Schulze; Robert C Thompson; Jana Strohmaier; Keith Matthews; Ingrid Melle; Derek Morris; Douglas Blackwood; Andrew McIntosh; Sarah E Bergen; Martin Schalling; Stéphane Jamain; Anna Maaser; Sascha B Fischer; Céline S Reinbold; Janice M Fullerton; José Guzman-Parra; Fermin Mayoral; Peter R Schofield; Sven Cichon; Thomas W Mühleisen; Franziska Degenhardt; Johannes Schumacher; Michael Bauer; Philip B Mitchell; Elliot S Gershon; John Rice; James B Potash; Peter P Zandi; Nick Craddock; I Nicol Ferrier; Martin Alda; Guy A Rouleau; Gustavo Turecki; Roel Ophoff; Carlos Pato; Adebayo Anjorin; Eli Stahl; Markus Leber; Piotr M Czerski; Cristiana Cruceanu; Ian R Jones; Danielle Posthuma; Till F M Andlauer; Andreas J Forstner; Fabian Streit; Bernhard T Baune; Tracy Air; Grant Sinnamon; Naomi R Wray; Donald J MacIntyre; David Porteous; Georg Homuth; Margarita Rivera; Jakob Grove; Christel M Middeldorp; Ian Hickie; Michele Pergadia; Divya Mehta; Johannes H Smit; Rick Jansen; Eco de Geus; Erin Dunn; Qingqin S Li; Matthias Nauck; Robert A Schoevers; Aartjan Tf Beekman; James A Knowles; Alexander Viktorin; Paul Arnold; Cathy L Barr; Gabriel Bedoya-Berrio; O Joseph Bienvenu; Helena Brentani; Christie Burton; Beatriz Camarena; Carolina Cappi; Danielle Cath; Maria Cavallini; Daniele Cusi; Sabrina Darrow; Damiaan Denys; Eske M Derks; Andrea Dietrich; Thomas Fernandez; Martijn Figee; Nelson Freimer; Gloria Gerber; Marco Grados; Erica Greenberg; Gregory L Hanna; Andreas Hartmann; Matthew E Hirschtritt; Pieter J Hoekstra; Alden Huang; Chaim Huyser; Cornelia Illmann; Michael Jenike; Samuel Kuperman; Bennett Leventhal; Christine Lochner; Gholson J Lyon; Fabio Macciardi; Marcos Madruga-Garrido; Irene A Malaty; Athanasios Maras; Lauren McGrath; Eurípedes C Miguel; Pablo Mir; Gerald Nestadt; Humberto Nicolini; Michael S Okun; Andrew Pakstis; Peristera Paschou; John Piacentini; Christopher Pittenger; Kerstin Plessen; Vasily Ramensky; Eliana M Ramos; Victor Reus; Margaret A Richter; Mark A Riddle; Mary M Robertson; Veit Roessner; Maria Rosário; Jack F Samuels; Paul Sandor; Dan J Stein; Fotis Tsetsos; Filip Van Nieuwerburgh; Sarah Weatherall; Jens R Wendland; Tomasz Wolanczyk; Yulia Worbe; Gwyneth Zai; Fernando S Goes; Nicole McLaughlin; Paul S Nestadt; Hans-Jorgen Grabe; Christel Depienne; Anuar Konkashbaev; Nuria Lanzagorta; Ana Valencia-Duarte; Elvira Bramon; Nancy Buccola; Wiepke Cahn; Murray Cairns; Siow A Chong; David Cohen; Benedicto Crespo-Facorro; James Crowley; Michael Davidson; Lynn DeLisi; Timothy Dinan; Gary Donohoe; Elodie Drapeau; Jubao Duan; Lieuwe Haan; David Hougaard; Sena Karachanak-Yankova; Andrey Khrunin; Janis Klovins; Vaidutis Kučinskas; Jimmy Lee Chee Keong; Svetlana Limborska; Carmel Loughland; Jouko Lönnqvist; Brion Maher; Manuel Mattheisen; Colm McDonald; Kieran C Murphy; Igor Nenadic; Jim van Os; Christos Pantelis; Michele Pato; Tracey Petryshen; Digby Quested; Panos Roussos; Alan R Sanders; Ulrich Schall; Sibylle G Schwab; Kang Sim; Hon-Cheong So; Elisabeth Stögmann; Mythily Subramaniam; Draga Toncheva; John Waddington; James Walters; Mark Weiser; Wei Cheng; Robert Cloninger; David Curtis; Pablo V Gejman; Frans Henskens; Morten Mattingsdal; Sang-Yun Oh; Rodney Scott; Bradley Webb; Gerome Breen; Claire Churchhouse; Cynthia M Bulik; Mark Daly; Martin Dichgans; Stephen V Faraone; Rita Guerreiro; Peter Holmans; Kenneth S Kendler; Bobby Koeleman; Carol A Mathews; Alkes Price; Jeremiah Scharf; Pamela Sklar; Julie Williams; Nicholas W Wood; Chris Cotsapas; Aarno Palotie; Jordan W Smoller; Patrick Sullivan; Jonathan Rosand; Aiden Corvin; Benjamin M Neale; Jonathan M Schott; Richard Anney; Josephine Elia; Maria Grigoroiu-Serbanescu; Howard J Edenberg; Robin Murray
Journal: Science Date: 2018-06-22 Impact factor: 47.728

10. Dysregulation of leukocyte gene expression in women with medication-refractory depression versus healthy non-depressed controls.

Authors: Eli Iacob; Kathleen C Light; Scott C Tadler; Howard R Weeks; Andrea T White; Ronald W Hughen; Timothy A Vanhaitsma; Lowry Bushnell; Alan R Light
Journal: BMC Psychiatry Date: 2013-10-21 Impact factor: 3.630

4 in total

1. A novel 4 immune-related genes as diagnostic markers and correlated with immune infiltrates in major depressive disorder.

Authors: Linna Ning; Zhou Yang; Jie Chen; Zhaopeng Hu; Wenrui Jiang; Lixia Guo; Yan Xu; Huiming Li; Fanghua Xu; Dandong Deng
Journal: BMC Immunol Date: 2022-02-13 Impact factor: 3.615

2. A machine learning model for predicting patients with major depressive disorder: A study based on transcriptomic data.

Authors: Sitong Liu; Tong Lu; Qian Zhao; Bingbing Fu; Han Wang; Ginhong Li; Fan Yang; Juan Huang; Nan Lyu
Journal: Front Neurosci Date: 2022-08-08 Impact factor: 5.152

3. Novel feature selection methods for construction of accurate epigenetic clocks.

Authors: Adam Li; Amber Mueller; Brad English; Anthony Arena; Daniel Vera; Alice E Kane; David A Sinclair
Journal: PLoS Comput Biol Date: 2022-08-19 Impact factor: 4.779

4. Gene Signatures Associated with Temporal Rhythm as Diagnostic Markers of Major Depressive Disorder and Their Role in Immune Infiltration.

Authors: Jing Wang; Pan Ai; Yi Sun; Hui Shi; Anshi Wu; Changwei Wei
Journal: Int J Mol Sci Date: 2022-09-30 Impact factor: 6.208

4 in total