| Literature DB >> 35578254 |
Qiang Liu1,2, Georgia Salanti3, Franco De Crescenzo4,5,6, Edoardo Giuseppe Ostinelli4,5,6, Zhenpeng Li4,5, Anneka Tomlinson4,5, Andrea Cipriani4,5,6, Orestis Efthimiou4,3,7.
Abstract
BACKGROUND: The debate of whether machine learning models offer advantages over standard statistical methods when making predictions is ongoing. We discuss the use of a meta-learner model combining both approaches as an alternative.Entities:
Keywords: Depression; Dropout; Machine learning; PHQ-9; Statistical model
Mesh:
Year: 2022 PMID: 35578254 PMCID: PMC9112573 DOI: 10.1186/s12888-022-03986-0
Source DB: PubMed Journal: BMC Psychiatry ISSN: 1471-244X Impact factor: 4.144
Glossary
| Term | Definition |
|---|---|
| Area under the receiver operating characteristic curve (AUC) | A discrimination metric for classification problems, measuring the area under the entire receiver operating characteristic curve. AUC ranges from 0 to 1 with higher values indicating better performance. |
| Base-learner | A single, stand-alone statistical or machine learning model built for predicting a continuous or a binary outcome. |
| Bootstrapping | Random sampling data with replacement. |
| Calibration-in-the-large | A method for measuring the agreement between observed outcomes and predictions for classification problems, where the average predicted probability is compared with the observed event rate. A mismatch indicates that the model over- or underestimates the risk on average. |
| Deep neural network | A type of machine learning model that resembles how neurons in human brain work. |
| Mean absolute error (MAE) | MAE measures the average magnitude of errors, i.e., the difference between true/observed values and their predictions. Lower MAE indicates better performance. |
| Meta-learner | A statistical or machine learning model that uses as input the output of other models (i.e., base-learners), to predict an outcome of interest. |
| Multi-layer perceptron (MLP) | The simplest deep neural network model with multiple stacked hidden layers. |
| Overfitting | The case when a model fits too closely to the data used to develop the model (training data), but performs badly on new, testing data. |
| Permutation feature importance | A method to evaluate the importance of predictors used in machine learning models, by measuring the decrease in model performance when the predictor’s values are randomly shuffled. |
| Ridge regression | A statistical regression model which uses a penalized likelihood. The penalty has the effect of shrinking the estimated coefficients so that the model does not yield extreme predictions. |
Fig. 1Meta-learner architecture. Patient-level baseline information was used to independently develop the base-learners (i.e., a ridge regression model and an MLP). Their predictions were in turn used as sole predictors of the meta-learners. We explored two different types of meta-learners, namely linear regression (logistic regression for the binary outcome), and MLP. MLP: multi-layer perceptron
Fig. 2Histograms of predicted Patient Health Questionnaire-9 (PHQ-9) scores. A Ridge regression base-learner. B Multi-layer perceptron (MLP) base-learner. C Linear regression meta-learner without regularization. D MLP meta-learner
Evaluation on post treatment Patient Health Questionnaire (PHQ-9) and all-cause dropout predictions made by the statistical, machine learning base-learners and meta-learners
| PHQ-9 score MAE* [95% CI†] | Dropout AUC‡ [95% CI] | |
|---|---|---|
| Base-learner ridge regression | 4.64 [4.57, 4.71] | 0.597 [0.594, 0.601] |
| Base-learner MLP§ | 4.63 [4.57, 4.71] | 0.598 [0.594, 0.601] |
| Meta-learner linear/logistic regression | 4.54 [4.48, 4.60] | 0.604 [0.600, 0.606] |
| Meta-learner MLP | 4.52 [4.45, 4.59] | 0.604 [0.600, 0.606] |
*Mean absolute error.
†Confidence interval, calculated as the 2.5th to the 97.5th percentile of bootstrap estimates.
‡Area under the receiver operating characteristic curve.
§Multi-layer perceptron.
Permutation feature importance values of the meta-learners
| Outcome | Meta-learner | Base-learner | Change of measures (%) |
|---|---|---|---|
| PHQ-9,** measured by MAE†† | Linear regression | Ridge regression | 24.78 |
| MLP‡‡ | 0.30 | ||
| MLP | Ridge regression | 31.70 | |
| MLP | 2.34 | ||
| Dropout, measured by AUC§§ | Logistic regression | Ridge regression | 0.83 |
| MLP | 5.21 | ||
| MLP | Ridge regression | 0.42 | |
| MLP | 2.67 |
**Patient Health Questionnaire-9.
††Mean absolute error.
‡‡Multi-layer perceptron.
§§Area under the receiver operating characteristic curve.
Fig. 3Histograms of predicted probabilities of dropout. Colors are according to observed outcomes. A Ridge regression base-learner. B Multi-layer perceptron (MLP) base-learner. C Logistic regression meta-learner without regularization. D MLP meta-learner