| Literature DB >> 32170136 |
Uli Niemann1, Petra Brueggemann2, Benjamin Boecking2, Birgit Mazurek2, Myra Spiliopoulou3.
Abstract
Tinnitus is a complex condition that is associated with major psychological and economic impairments - partly through various comorbidities such as depression. Understanding the interaction between tinnitus and depression may thus improve either symptom cluster's prevention, diagnosis and treatment. In this study, we developed and validated a machine learning model to predict depression severity after outpatient therapy (T1) based on variables obtained before therapy (T0). 1,490 patients with chronic tinnitus (comorbid major depressive disorder: 52.2%) who completed a 7-day multimodal treatment encompassing tinnitus-specific components, cognitive behavioural therapy, physiotherapy and informational counselling were included. 185 variables were extracted from self-report questionnaires and socio-demographic data acquired at T0. We used 11 classification methods to train models that reliably separate between subclinical and clinical depression at T1 as measured by the general depression questionnaire. To ensure highly predictive and robust classifiers, we tuned algorithm hyperparameters in a 10-fold cross-validation scheme. To reduce model complexity and improve interpretability, we wrapped model training around an incremental feature selection mechanism that retained features that contributed to model prediction. We identified a LASSO model that included all 185 features to yield highest predictive performance (AUC = 0.87 ± 0.04). Through our feature selection wrapper, we identified a LASSO model with good trade-off between predictive performance and interpretability that used only 6 features (AUC = 0.85 ± 0.05). Thus, predictive machine learning models can lead to a better understanding of depression in tinnitus patients, and contribute to the selection of suitable therapeutic strategies and concise and valid questionnaire design for patients with chronic tinnitus with or without comorbid major depressive disorder.Entities:
Mesh:
Year: 2020 PMID: 32170136 PMCID: PMC7069984 DOI: 10.1038/s41598-020-61593-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow. We extracted a total of 185 features from answers of 7 questionnaires from 1,490 patients. We trained multiple classification models to predict depression status after outpatient therapy using data collected prior to therapy commencement only. Cross-validation was used for performance evaluation. We embedded model training and evaluation in an incremental feature selection wrapper which retained only features which were identified to be important for the model.
Baseline characteristics of patients before treatment commencement (T0).
| Tinnitus status | ||||
|---|---|---|---|---|
| Total | compensated | decompensated | ||
| Number of subjects, n (%) | 1490 (100) | 1005 (67) | 485 (33) | |
| Age in years | 49.8 | 49.3 | 50.8 | 0.023 (TT) |
| Male sex, n (%) | 735 (49) | 514 (51) | 221 (46) | 0.050 (Chi) |
| Tinnitus duration in years, modus (%) | 5 (33) | 5 (32) | 5 (35) | 0.008 (MW) |
| Number of days until start of an intensive treatment | 9.5 | 8.9 | 10.8 | <0.001 (MW) |
| TQ total score | 38.6 | 29.0 | 58.6 | <0.001 (TT) |
| PSQ total score | 0.5 | 0.4 | 0.6 | <0.001 (TT) |
| SF8 general health score | 41.6 | 43.5 | 37.6 | <0.001 (MW) |
| ADSL depression score | 18.2 | 13.7 | 27.3 | <0.001 (MW) |
| Clinical depression, n (%) | 777 (52) | 362 (36) | 415 (86) | <0.001 (Chi) |
Baseline characteristics for the patients with compensated tinnitus and patients with decompensated tinnitus, respectively. Continuous variables are expressed as mean standard deviation. Categorical variables are expressed as absolute frequency (percentage). -values were calculated by unpaired two-tailed t-test (TT), Chi-square test (Chi) or two-tailed unpaired Mann-Whitney test (MW). TQ: German version of the Tinnitus Questionnaire[22]; PSQ: Perceived Stress Questionnaire[20]; SF8: Short Form 8 Health Survey[21]; ADSL: General Depression Scale Questionnaire - long form[19].
Figure 2Relationship between depression score after therapy and other features. Graphical representation of the relationship between the ADSL depression score at the end of therapy (y-axis) with other features (x-axis). Higher values on y-axis represent higher depression severity. Background color represents subclinical (blue) or clinical (red) depression status at the end of therapy. Slight jittering was applied to the points to mitigate overplotting. Marginal histograms depict univariate feature distributions.
Classification performance.
| Classification method | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| lasso | ridge | wknn | nb | svm | gpls | nnet | cart | c5.0 | rf | gbt | |
| 1 | 0.838 (185) | 0.822 (185) | 0.795 (185) | 0.795 (185) | 0.864 (185) | ||||||
| 2 | 0.856 (89) | 0.847 (86) | 0.845 (98) | 0.849 (70) | 0.530 (5) | 0.836 (80) | 0.807 (117) | 0.799 (106) | 0.803 (103) | 0.864 (109) | 0.855 (89) |
| 3 | 0.857 (50) | 0.854 (51) | 0.845 (65) | 0.829 (38) | 0.537 (4) | 0.836 (47) | 0.809 (87) | 0.794 (66) | 0.803 (62) | 0.866 (99) | 0.859 (52) |
| 4 | 0.856 (24) | 0.853 (31) | 0.837 (40) | 0.832 (26) | 0.542 (3) | 0.801 (59) | 0.799 (45) | 0.790 (39) | 0.865 (85) | 0.858 (38) | |
| 5 | 0.853 (17) | 0.853 (21) | 0.842 (28) | 0.838 (15) | 0.562 (2) | 0.838 (16) | 0.793 (45) | 0.811 (34) | 0.806 (24) | 0.865 (77) | 0.855 (24) |
| 6 | 0.854 (10) | 0.851 (15) | 0.847 (16) | 0.841 (13) | — | 0.837 (9) | 0.810 (25) | 0.817 (28) | 0.803 (23) | 0.863 (75) | 0.856 (16) |
| 7 | 0.850 (6) | 0.854 (11) | 0.833 (9) | — | — | 0.838 (6) | 0.812 (21) | 0.822 (24) | 0.804 (16) | 0.864 (69) | 0.854 (14) |
| 8 | — | 0.854 (9) | 0.829 (7) | — | — | — | 0.852 (12) | 0.822 (23) | 0.802 (13) | 0.865 (64) | 0.853 (11) |
| 9 | — | 0.854 (8) | 0.830 (6) | — | — | — | 0.802 (12) | 0.865 (59) | — | ||
| 10 | — | 0.853 (7) | — | — | — | — | 0.842 (4) | — | — | ||
| 11 | — | — | — | — | — | — | — | — | — | 0.865 (56) | — |
| 12 | — | — | — | — | — | — | — | — | — | 0.864 (51) | — |
| 13 | — | — | — | — | — | — | — | — | — | 0.864 (50) | — |
| 14 | — | — | — | — | — | — | — | — | — | 0.863 (47) | — |
Mean cross-validation AUC for each classifier with best parameter configuration and for each iteration (). The number of features are given in parenthesis. The best run per classifier is highlighted in boldface. All methods induce at least one model with AUC of 0.809 or higher. Empty cells indicate that the feature selection wrapper had already been terminated after a previous iteration.
Figure 3Coefficients and relative inclusion of features in cross-validation of lasso model. Median ( median absolute deviation) coefficients (top) and absolute frequency of inclusion of features (bottom) over 10 cross-validation iterations for the best lasso model. From 185 features, the depicted 40 features exhibit a nonzero model coefficient. The average frequency of feature inclusion is represented as horizontal line in the bottom subplot. Line ranges depict MAD (right). TQ: German version of the Tinnitus Questionnaire[22]; PSQ: Perceived Stress Questionnaire[20]; SF8: Short Form 8 Health Survey[21]; ADSL: General Depression Scale Questionnaire - long form[19]; SOZK: sociodemographics questionnaire[24].
Top-25 features of lasso model.
| Feature | Description | Coefficient |
|---|---|---|
| SOZK_nationality | German nationality | −0.370 |
| ADSL_adsl06 | “During the past week I felt depressed”. | 0.309 |
| ADSL_adsl19 | “During the past week I felt that people disliked me”. | 0.288 |
| PSQ_stress21 | “You enjoy yourself”. | −0.284 |
| SOZK_graduate | Graduation: university | −0.210 |
| ADSL_adsl11 | “During the past week my sleep was restless”. | 0.196 |
| ADSL_adsl03 | “During the past week I felt that I could not shake off the blues even with help from my family or friends”. | 0.175 |
| TQ_tin50 | Because of the noises I am unable to enjoy the radio or television. | 0.151 |
| TQ_tin47 | I am a victim of my noises. | 0.137 |
| ADSL_adsl02 | “During the past week I did not feel like eating; my appetite was poor”. | 0.132 |
| ADSL_adsl05 | “During the past week I had trouble keeping my mind on what I was doing”. | 0.132 |
| SF8_sf07 | “During the past 4 weeks, how much have you been bothered by emotional problems (such as feeling anxious, depressed or irritable)?” | 0.125 |
| ADSL_adsl10 | “During the past week I felt fearful”. | 0.107 |
| ADSL_adsl04 | “During the past week I felt I was just as good as other people”. | −0.107 |
| TQ_tin40 | I am able to forget about the noises when I am doing something interesting. | −0.104 |
| ADSL_adsl16 | “During the past week I enjoyed life”. | −0.085 |
| PSQ_stress15 | “Your problems seem to be piling up”. | 0.081 |
| TQ_tin07 | Most of the time the noises are fairly quiet. | −0.069 |
| ADSL_adsl08 | “During the past week I felt hopeful about the future”. | −0.064 |
| SF8_sf02 | “During the past 4 weeks, how much did physical health problems limit your physical activities (such as walking or climbing stairs)?” | 0.059 |
| SOZK_tinnitusdur | “How long have you been suffering from tinnitus (in years)?” | 0.058 |
| PSQ_stress28 | “You feel loaded down with responsibility”. | 0.055 |
| ADSL_adsl18 | “During the past week I felt sad”. | 0.053 |
| SOZK_job | Job status: currently employed | −0.050 |
| ADSL_adsl13 | “During the past week I talked less than usual”. | 0.049 |
Features with highest absolute coefficient in lasso model (iteration ). TQ: German version of the Tinnitus Questionnaire[22]; PSQ: Perceived Stress Questionnaire[20]; SF8: Short Form 8 Health Survey[21]; ADSL: General Depression Scale Questionnaire - long form[19]; SOZK: sociodemographics questionnaire[24].
Figure 4Predictive features. Distribution of features included in the lasso model of iteration for the patients with subclinical and clinical depression. Green squares and labels represent mean of continuous features. ADSL: General Depression Scale Questionnaire - long form[19]; PSQ: Perceived Stress Questionnaire[20]; SF8: Short Form 8 Health Survey[21]; TQ: German version of the Tinnitus Questionnaire[22].