Wei Gu 1 , Tinghong Ming 2 , Zhongwen Xie 3 . Show Affiliations »
Abstract
BACKGROUND: The clinical diagnosis of major depressive disorder (MDD) mainly relies on subjective assessment of depression-like behavior and clinical examination. In the present study, we aimed to develop a novel diagnostic model for special predicting of MDD. METHODS: The human brain GSE102556 DataSet and the blood GSE98793 and GSE76826 DataSets were downloaded from the Gene Expression Omnibus (GEO) database. We used novel algorithm, random forest (RF) plus artificial neural network (ANN), to examine gene biomarkers and establish a diagnostic model of MDD. RESULTS: Through the "limma" package in the R language, 2653 deferentially expressed genes (DEGs) were identified in the GSE102556 DataSet, and 1786 DEGs were identified in the GSE98793 DataSet, and a total of 100 shared DEGs. We applied GSE98793 TrainData 1 to an RF algorithm and thereby successfully selected 28 genes as biomarkers. Furthermore, the 28 biomarkers were verified by GSE98793 TestData 1, and the performance of these biomarkers was perfect. In addition, we further used an ANN algorithm to optimize the weight of each gene and employed GSE98793 TrainData 2 to build an ANN model through the neuralnet package by R language. Based on this algorithm, GSE98793 TestData 2 and independent blood GSE76826 were verified to correlate with MDD, with AUCs of 0.903 and 0.917, respectively. Conclusion: To the best of our knowledge, this is the first time that the classifier constructed via DEG biomarkers be used as an endophenotype for MDD clinical diagnosis. Our results may provide a new entry point for the diagnosis, treatment, outcome prediction, prognosis and recurrence of MDD. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.
BACKGROUND: The clinical diagnosis of major depressive disorder (MDD) mainly relies on subjective assessment of depression-like behavior and clinical examination. In the present study, we aimed to develop a novel diagnostic model for special predicting of MDD. METHODS: The human brain GSE102556 DataSet and the blood GSE98793 and GSE76826 DataSets were downloaded from the Gene Expression Omnibus (GEO) database. We used novel algorithm, random forest (RF) plus artificial neural network (ANN), to examine gene biomarkers and establish a diagnostic model of MDD. RESULTS: Through the "limma" package in the R language, 2653 deferentially expressed genes (DEGs) were identified in the GSE102556 DataSet, and 1786 DEGs were identified in the GSE98793 DataSet, and a total of 100 shared DEGs. We applied GSE98793 TrainData 1 to an RF algorithm and thereby successfully selected 28 genes as biomarkers. Furthermore, the 28 biomarkers were verified by GSE98793 TestData 1, and the performance of these biomarkers was perfect. In addition, we further used an ANN algorithm to optimize the weight of each gene and employed GSE98793 TrainData 2 to build an ANN model through the neuralnet package by R language. Based on this algorithm, GSE98793 TestData 2 and independent blood GSE76826 were verified to correlate with MDD, with AUCs of 0.903 and 0.917, respectively. Conclusion: To the best of our knowledge, this is the first time that the classifier constructed via DEG biomarkers be used as an endophenotype for MDD clinical diagnosis. Our results may provide a new entry point for the diagnosis, treatment, outcome prediction, prognosis and recurrence of MDD. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.
Entities: Chemical
Keywords:
Major depressive disorder; biomarkers; ensemble learning; gene expression profiling; genome-wide microarray analysis
Year: 2022
PMID: 35379119 DOI: 10.2174/1386207325666220404123433
Source DB: PubMed Journal: Comb Chem High Throughput Screen ISSN: 1386-2073 Impact factor: 1.339