| Literature DB >> 34198659 |
Xin Chen1,2,3, Zhigeng Pan1,2,3.
Abstract
Depression is a common mental health disease, which has great harm to public health. At present, the diagnosis of depression mainly depends on the interviews between doctors and patients, which is subjective, slow and expensive. Voice data are a kind of data that are easy to obtain and have the advantage of low cost. It has been proved that it can be used in the diagnosis of depression. The voice data used for modeling in this study adopted the authoritative public data set, which had passed the ethical review. The features of voice data were extracted by Python programming, and the voice features were stored in the format of CSV files. Through data processing, a big database, containing 1479 voice feature samples, was generated for modeling. Then, the decision tree screening model of depression was established by 10-fold cross validation and algorithm selection. The experiment achieved 83.4% prediction accuracy on voice data set. According to the prediction results of the model, the patients can be given early warning and intervention in time, so as to realize the health management of personal depression.Entities:
Keywords: decision tree; depression; early warning; public health; screening model; voice data
Year: 2021 PMID: 34198659 PMCID: PMC8296267 DOI: 10.3390/ijerph18126441
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Brief introduction of the generation process of depression screening model.
Voice sample numbering.
| Voice Types | Voice Sample Number |
|---|---|
| Interview | 1–18 |
| Passage reading | 19 |
| Vocabulary reading | 20–25 |
| Picture description | 26–28 |
| Thematic Apperception Test (TAT) | 29 |
Figure 2Visualization of voice features.
Figure 3Scatter diagram of V3 and V69 in the interview database. The blue dot means there is no depression. The orange dots represents depression.
Confusion matrix indicator formula.
| Indicators | Formula |
|---|---|
| Accuracy | Accuracy =
|
| Precision | Precision = |
| Sensitivity (Recall) | Sensitivity = Recall = |
| Specificity | Specificity = |
| F1 Score |
F1 Score = |
Performance indicators of different database decision tree models.
| Accuracy | Precision | Recall | Specificity | F1 Score | |
|---|---|---|---|---|---|
| interview | 83.4% | 81.9% | 79.0% | 86.8% | 80.5% |
| reading | 76.8% | 80.9% | 60.4% | 89.2% | 69.1% |
| picture | 75.0% | 66.7% | 84.1% | 68.1% | 74.4% |
| all | 83.4% | 83.5% | 76.8% | 88.5% | 80.0% |
Figure 4The depression screening and early warning model of this study.
Comparison of the results of similar studies.
| Accuracy | Precision | F1 Score | |
|---|---|---|---|
| Our study (interview) | 83.4% | 81.9% | 80.5% |
| Study 1 [ | 71% | 77% | 80% |
| Study 2 [ | 75.8% | Not provided | Not provided |