| Literature DB >> 35137480 |
Moritz Philipp Günther1, Johannes Kirchebner2, Jan Ben Schulze1, Roland von Känel1, Sebastian Euler1.
Abstract
OBJECTIVE: In routine oncological treatment settings, psychological distress, including mental disorders, is overlooked in 30% to 50% of patients. High workload and a constant need to optimise time and costs require a quick and easy method to identify patients likely to miss out on psychological support.Entities:
Keywords: cancer; machine learning; mental disorders; psycho-oncology; psychological support
Mesh:
Year: 2022 PMID: 35137480 PMCID: PMC9286797 DOI: 10.1111/ecc.13555
Source DB: PubMed Journal: Eur J Cancer Care (Engl) ISSN: 0961-5423 Impact factor: 2.328
Descriptive statistics of study group
| Age | Mean | SD |
|---|---|---|
| Female | 61 | 15.25 |
| Male | 65 | 13.2 |
| Total | 63 | 14.43 |
FIGURE 1Source of data and data selection
FIGURE 2Overview of statistical procedures. Step 1—data preparation: Outcome variable ‘consultation/no consultation’ and 544 predictor variables were defined. Step 2—datasplitting: Split into 70% training dataset and 30% validation dataset. Step 3a to e—Model building and testing on training data I: Imputation by randomForest; upsampling of outcome ‘no consultation’ × 5.6; variable reduction via random forest; model building and hyperparmater tuning via ML algorithms—logistic regression, trees, random forest, gradient boosting, KNN (k‐nearest neighbour), support vector machines (SVM) and naive bayes; testing (selection) of best ML algorithm via ROC parameters. Step 4—Model building and testing on training data II: Nested resampling with imputation, upsampling, variable reduction and model building with hyperparameter tuning in inner loop and model testing on outer loop. Step 5—Model building and testing on validation data I: Imputation with stored weights from Step 3a. Step 6—Model building and testing on validation data II: Best model identified in Step 3e applied on imputed validation dataset and evaluated via ROC parameters. Step 7: Ranking of variables by indicative power
Machine learning models and performance in nested cross‐validation on training dataset
| Statistical procedure | Balanced Accuracy (%) | AUC | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) |
|---|---|---|---|---|---|---|
| Logistic regression | 77 | 0.76 | 83.8 | 70.1 | 97.1 | 26.3 |
| Tree | 80 | 0.74 | 80.6 | 79.4 | 97.9 | 25.3 |
| Random Forest | 80.6 | 0.74 | 80.1 | 81.2 | 98.1 | 25.2 |
|
|
|
|
|
|
|
|
| KNN | 76.5 | 0.54 | 54.3 | 98.7 | 99.8 | 15.2 |
| SVM | 79.5 | 0.74 | 80.9 | 78.1 | 97.8 | 25.3 |
| Naive Bayes | 79.6 | 0.74 | 78.7 | 80.5 | 98.1 | 23.2 |
Abbreviations: AUC, area under the curve (level of discrimination); KNN, k‐nearest neighbours; NPV, negative predictive value; PPV, positive predictive value; SVM, support vector machines.
Absolute and relative distribution of indicative variables on the complete dataset
| Variable code | Variable description | No psychiatric/psychotherapeutic treatment (%) | Psychiatric/psychotherapeutic treatment (%) |
|---|---|---|---|
| DTSREE | Distress screening applied | 1507/6222 (24.2) | 480/1096 (43.8) |
| HOSDUR | Longest inpatient treatment 28 days or more | 774/6,222 (12.4) | 410/1,096 (37.4) |
| FDX | Mental disorder present | 538/6,222 (8.6) | 256/1,096 (23.4) |
| Alt/Jung | Patient age 65 or older | 3,287/6,222 (52.8) | 409/1,096 (37.3) |
| Haut | Skin cancer present | 1,975/6,222 (31.7) | 147/1,096 (13.4) |
| Tumorboard | Tumour board held | 2,616/4,832 (54.1) | 622/732 (85) |
Final gradient boosting model performance measures on validation dataset
| Performance measures | % 95% confidence interval |
|---|---|
| Balanced accuracy | 68.5 [61.2, 75.9] |
| AUC | 0.75 [0.59, 0.78] |
| Sensitivity | 85.9 [85.9, 86] |
| Specificity | 51.2 [50.7, 51.7] |
| PPV | 98.9 [98.9, 98.9] |
| NPV | 6.5 [6.4, 6.7] |
Notes: AUC, area under the curve (level of discrimination); NPV, negative predictive value; PPV, positive predictive value.
FIGURE 3Variable importance. Abbreviations: Altjung, patient aged 65 or older; DTSCREE, distress screening applied; FDX, mental disorder present; Haut, skin cancer present; HOSDUR, longest inpatient treatment 28 days or more; Tumorboard, tumour board held