| Literature DB >> 31624229 |
Paolo Fusar-Poli1,2,3,4, Dominic Stringer5, Alice M S Durieux5, Grazia Rutigliano6, Ilaria Bonoldi6, Andrea De Micheli6, Daniel Stahl5.
Abstract
Predicting the onset of psychosis in individuals at-risk is based on robust prognostic model building methods including a priori clinical knowledge (also termed clinical-learning) to preselect predictors or machine-learning methods to select predictors automatically. To date, there is no empirical research comparing the prognostic accuracy of these two methods for the prediction of psychosis onset. In a first experiment, no improved performance was observed when machine-learning methods (LASSO and RIDGE) were applied-using the same predictors-to an individualised, transdiagnostic, clinically based, risk calculator previously developed on the basis of clinical-learning (predictors: age, gender, age by gender, ethnicity, ICD-10 diagnostic spectrum), and externally validated twice. In a second experiment, two refined versions of the published model which expanded the granularity of the ICD-10 diagnosis were introduced: ICD-10 diagnostic categories and ICD-10 diagnostic subdivisions. Although these refined versions showed an increase in apparent performance, their external performance was similar to the original model. In a third experiment, the three refined models were analysed under machine-learning and clinical-learning with a variable event per variable ratio (EPV). The best performing model under low EPVs was obtained through machine-learning approaches. The development of prognostic models on the basis of a priori clinical knowledge, large samples and adequate events per variable is a robust clinical prediction method to forecast psychosis onset in patients at-risk, and is comparable to machine-learning methods, which are more difficult to interpret and implement. Machine-learning methods should be preferred for high dimensional data when no a priori knowledge is available.Entities:
Mesh:
Year: 2019 PMID: 31624229 PMCID: PMC6797779 DOI: 10.1038/s41398-019-0600-9
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Sociodemographic characteristics of the study population, including the derivation and validation dataset[28]
| Derivation dataset | Validation dataset | |||
|---|---|---|---|---|
| Mean | SD | Mean | SD | |
| Age (years)c | 34.4 | 18.92 | 31.98 | 18.54 |
| Count | % | Count | % | |
| Gender | ||||
| Male | 17303 | 48.81 | 27302 | 49.9 |
| Female | 16507 | 51.16 | 27398 | 50.07 |
| Missing | 10 | 0.03 | 16 | 0.03 |
| Ethnicity | ||||
| Black | 6879 | 20.34 | 7023 | 12.84 |
| White | 18627 | 55.08 | 35392 | 64.68 |
| Asian | 1129 | 3.34 | 2608 | 4.77 |
| Mixed | 1306 | 3.86 | 1957 | 3.58 |
| Other | 3466 | 10.25 | 2084 | 3.81 |
| Missing | 2413 | 7.13 | 5652 | 10.33 |
| ICD-10 Index spectrum diagnosis | ||||
| CHR-Pa | 314 | 0.93 | 50 | 0.09 |
| Acute and transient psychotic disorders | 553 | 1.64 | 725 | 1.33 |
| Substance use disorders | 7149 | 21.14 | 6507 | 11.89 |
| Bipolar mood disorders | 950 | 2.81 | 1526 | 2.79 |
| Non-bipolar mood disorders | 6302 | 18.63 | 8841 | 16.16 |
| Anxiety disorders | 8235 | 24.35 | 15960 | 29.17 |
| Personality disorders | 1286 | 3.8 | 2116 | 3.87 |
| Developmental disorders | 1412 | 4.18 | 3706 | 6.77 |
| Childhood/adolescence onset disorders | 4200 | 12.42 | 9629 | 17.6 |
| Physiological syndromes | 2555 | 7.55 | 4424 | 8.09 |
| Mental retardation | 864 | 2.55 | 1232 | 2.25 |
aLambeth and Southwark, n = 33820
bCroydon and Lewisham, n = 54716
cNot an ICD-10 Index spectrum diagnosis
Experiment 1: prognostic accuracy (Harrell’s C) for the original model (M1, diagnostic spectra) developed through Clinical-learning (a priori clinical knowledge) vs machine learning (LASSO and RIDGE). The EPV is >20 (55.6)
| Method | Derivation Data Set ( | Validation Data Set ( | Optimism | ||||
|---|---|---|---|---|---|---|---|
| Harrell’s C | SE | 95% C.I. | Harrell’s C | SE | 95% C.I. | ||
| Unregularized | 0.800 | 0.008 | 0.784–0.816 | 0.791 | 0.008 | 0.775–0.807 | 0.009 |
| Lasso | 0.798 | 0.008 | 0.782–0.814 | 0.789 | 0.008 | 0.773–0.805 | 0.009 |
| Ridge | 0.810 | 0.008 | 0.794–0.826 | 0.788 | 0.008 | 0.772–0.804 | 0.022 |
Experiment 2: prognostic performance of the revised models in the derivation dataset and the validation dataset, and their comparative performance
| Model | Type of clustering of ICD-10 index diagnoses | Harrell’s C | SE | 95% CI | |
|---|---|---|---|---|---|
| Derivation dataset | |||||
| M1 | Diagnostic spectra | 0.800 | 0.008 | 0.784 | 0.816 |
| M2 | Diagnostic categories | 0.811 | 0.008 | 0.795 | 0.824 |
| M3 | Diagnostic subdivisions | 0.833 | 0.008 | 0.821 | 0.847 |
| Validation dataset | |||||
| M1 | Diagnostic spectra | 0.791 | 0.008 | 0.776 | 0.807 |
| M2 | Diagnostic categories | 0.797 | 0.008 | 0.782 | 0.812 |
| M3 | Diagnostic subdivisions | 0.792 | 0.008 | 0.776 | 0.808 |
| M2-M1 | 0.006 | 0.003 | 0.001 | 0.012 | |
| M3-M1 | 0.001 | 0.005 | −0.009 | 0.011 | |
| M3-M2 | −0.005 | 0.005 | −0.015 | 0.004 | |
All models include age, gender, age by gender, ethnicity and ICD-10 index diagnosis (refined as specified in the methods)
Experiment 3a. Prognostic performance using machine-learning vs clinical-learning under variable EPVs
| Unregularized | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| M1 (diagnostic spectra) | M2 (diagnostic categories) | M3 (diagnostic subdivisions) | |||||||
| Cox Regression | LASSO | RIDGE | Cox Regression | LASSO | RIDGE | Cox Regression | LASSO | RIDGE | |
| Apparent performance | |||||||||
| C index | 0.800 | 0.793 | 0.790 | 0.811 | 0.799 | 0.803 | 0.827 | 0.812 | 0.813 |
| SE | 0.005 | 0.005 | 0.006 | 0.005 | 0.005 | 0.005 | 0.005 | 0.005 | 0.005 |
| Internal validation performance | |||||||||
| C index | 0.799 | 0.794 | 0.790 | 0.804 | 0.795 | 0.795 | 0.805 | 0.793 | 0.797 |
| SE | 0.017 | 0.017 | 0.017 | 0.017 | 0.017 | 0.017 | 0.017 | 0.017 | 0.017 |
| Events | 2011 | 2011 | 2011 | ||||||
| Degrees of freedom of predictors | 18 | 63 | 226 | ||||||
| EPV | 111.7 | 31.9 | 8.9 | ||||||
Upper part of the table: apparent performance of M1-M3 models in the whole dataset. Bottom part of the table: internal performance in the whole dataset using nested 10-fold CV and taking median values with 100 repetitions
EPV events per variables, calculated as the number of transitions to psychosis over the degrees of freedom of predictors. Categorical predictors are counted as the number of indicator categories they consist of (i.e. number of categories−7)
Fig. 1Experiment 3b.
Clinical-learning (unregularised regression) vs machine learning (LASSO and RIDGE) for the original model M1 with random sampling of varying sample sizes and decreasing EPV