| Literature DB >> 34215752 |
Jessica de Nijs1, Thijs J Burger2,3, Ronald J Janssen1, Seyed Mostafa Kia1, Daniël P J van Opstal1, Mariken B de Koning2,3, Lieuwe de Haan2,3, Wiepke Cahn1,4, Hugo G Schnack5.
Abstract
Schizophrenia and related disorders have heterogeneous outcomes. Individualized prediction of long-term outcomes may be helpful in improving treatment decisions. Utilizing extensive baseline data of 523 patients with a psychotic disorder and variable illness duration, we predicted symptomatic and global outcomes at 3-year and 6-year follow-ups. We classified outcomes as (1) symptomatic: in remission or not in remission, and (2) global outcome, using the Global Assessment of Functioning (GAF) scale, divided into good (GAF ≥ 65) and poor (GAF < 65). Aiming for a robust and interpretable prediction model, we employed a linear support vector machine and recursive feature elimination within a nested cross-validation design to obtain a lean set of predictors. Generalization to out-of-study samples was estimated using leave-one-site-out cross-validation. Prediction accuracies were above chance and ranged from 62.2% to 64.7% (symptomatic outcome), and 63.5-67.6% (global outcome). Leave-one-site-out cross-validation demonstrated the robustness of our models, with a minor drop in predictive accuracies of 2.3% on average. Important predictors included GAF scores, psychotic symptoms, quality of life, antipsychotics use, psychosocial needs, and depressive symptoms. These robust, albeit modestly accurate, long-term prognostic predictions based on lean predictor sets indicate the potential of machine learning models complementing clinical judgment and decision-making. Future model development may benefit from studies scoping patient's and clinicians' needs in prognostication.Entities:
Year: 2021 PMID: 34215752 PMCID: PMC8253813 DOI: 10.1038/s41537-021-00162-3
Source DB: PubMed Journal: NPJ Schizophr ISSN: 2334-265X
Baseline demographic and clinical characteristics of patients who completed baseline and follow-ups and of those who were not included in the study.
| Included patients ( | Excluded patients ( | ||
|---|---|---|---|
| Age in years, mean (SD) | 27.6 (7.4) | 26.6 (7.0) | 0.018 |
| No. (%) male sex | 402 (76.9) | 426 (77.6) | 0.775 |
| No. (%) white ethnicity | 449 (85.9) | 363 (72.6) | <0.001 |
| WAIS IQ, mean (SD) | 97.4 (16.1) | 92.1 (15.6) | <0.001 |
| Education patient, mean (SD) | 4.3 (2.0) | 3.8 (2.1) | <0.001 |
| Education father; SES, mean (SD) | 5.1 (2.5) | 4.7 (2.6) | 0.014 |
| Education mother; SES, mean (SD) | 4.4 (2.4) | 4.1 (2.5) | 0.054 |
| No. (%) employed/student | 241 (46.1) | 184 (41.2) | 0.124 |
| Illness duration in years, mean (SD) | 4.6 (4.2) | 3.9 (3.4) | 0.002 |
| No. (%) recent onset of psychosis in the past year | 101 (19.3) | 168 (30.7) | <0.001 |
| No. (%) DSM-IV schizophrenia diagnosis, 295.1,2,3 | 342 (65.4) | 341 (62.5) | 0.317 |
| No. (%) antipsychotic drug use present state | 479 (91.6) | 434 (99.3) | <0.001 |
| No. (%) clozapine use present state | 64 (12.2) | 81 (14.8) | 0.228 |
| No. (%) cannabis abuse/dependency present state | 160 (30.6) | 179 (32.6) | 0.479 |
| No. (%) other illicit drug use in the past | 324 (62.9) | 365 (69.5) | 0.024 |
| PANSS positive symptoms, mean (SD) | 12.2 (5.1) | 13.3 (5.5) | 0.001 |
| PANSS negative symptoms, mean (SD) | 13.3 (5.5) | 14.7 (6.3) | <0.001 |
| PANSS general symptoms, mean (SD) | 27.0 (7.8) | 29.0 (8.8) | <0.001 |
| PANSS total, mean (SD) | 52.4 (15.7) | 56.9 (17.5) | <0.001 |
| Global assessment of functioning; symptoms, mean (SD) | 57.9 (16.0) | 53.5 (15.3) | <0.001 |
| Global assessment of functioning; degree of disabilities, mean (SD) | 57.0 (15.6) | 51.3 (15.8) | <0.001 |
| No. (%) GAF score ≥65 | 173 (33.1) | 94 (21.2) | <0.001 |
| CAPE frequency symptoms, mean (SD) | 0.9 (0.5) | 0.9 (0.5) | 0.267 |
| CANSAS number of needs, mean (SD) | 6.7 (3.8) | 7.8 (3.9) | <0.001 |
WAIS IQ Wechsler Adult Intelligence Scale Intelligence Quotient, SES socioeconomic status, DSM-IV Diagnostic and Statistical Manual of Mental Disorders 4th edition, PANSS Positive and Negative Syndrome Scale, GAF global assessment of functioning, CAPE community assessment of psychic experiences, CANSAS Camberwell Assessment scale of Need Short Appraisal Schedule.
Internal validation with nested cross-validation and leave-one-site-out (LOSO) nested cross-validation predicting symptomatic and global outcome at T3 and T6 in multimodal models.
| Predictor/Model (outcome) | Internal BAC | Internal Sens/Spec | Internal PPV/NPV | Internal AUC | LOSO BAC | LOSO Sens/spec | LOSO PPV/NPV | LOSO AUC |
|---|---|---|---|---|---|---|---|---|
| PANSS, ill, demo, CANSAS (symptomatic outcome | 62.2 (1.7) | 77.9/42.6 | 68.9/54.1 | 0.60 | 61.3 | 59.7/62.9 | 73.1/47.7 | 0.61 |
| PANSS, ill, demo, CAPE (symptomatic outcome | 64.4 (1.9) | 76.0/50.0 | 72.2/54.8 | 0.63 | 63.8 | 62.9/64.7 | 75.8/48.5 | 0.65 |
| PANSS, ill, demo, CANSAS (symptomatic outcome | 64.7 (2.0) | 78.7/46.5 | 69.0/59.0 | 0.63 | 62.5 | 68.5/56.5 | 74.3/49.3 | 0.62 |
| PANSS, ill, demo, CAPE (symptomatic outcome | 62.3 (1.8) | 75.4/46.5 | 66.2/57.6 | 0.61 | 59.9 | 64.2/55.6 | 69.3/49.1 | 0.62 |
| PANSS, ill, demo, CANSAS (global outcome | 63.5 (1.9) | 66.3/59.7 | 65.6/60.5 | 0.63 | 63.5 | 66.1/60.8 | 68.5/59.6 | 0.62 |
| PANSS, ill, demo, CAPE (global outcome | 67.6 (1.3) | 74.9/58.4 | 70.1/64.2 | 0.67 | 64.8 | 65.8/63.8 | 72.7/56.5 | 0.58 |
| PANSS, ill, demo, CANSAS (global outcome | 67.6 (2.2) | 81.8/47.7 | 71.8/61.6 | 0.65 | 64.0 | 71.8/56.1 | 73.2/55.3 | 0.64 |
| PANSS, ill, demo, CAPE (global outcome | 67.3 (1.7) | 84.3/43.3 | 73.4/59.8 | 0.64 | 61.2 | 65.9/56.5 | 76.8/45.5 | 0.64 |
| EUFEST 4 weeks (symptomatic outcome | 62.7 | 61.3/64.0 | 69.0/45.1 | 0.62 | – | – | – | – |
| EUFEST 52 weeks (symptomatic outcome | 59.0 | 60.9/57.1 | 70.5/47.4 | 0.60 | – | – | – | – |
| EUFEST 4 weeks (symptomatic outcome remission | 62.4 | 58.1/66.7 | 72.0/50.9 | 0.62 | – | – | – | – |
| EUFEST 52 weeks (symptomatic outcome | 61.0 | 60.2/61.8 | 69.8/50.0 | 0.60 | – | – | – | – |
| EUFEST 4 weeks (global outcome | 60.4 | 61.7/59.1 | 64.5/57.9 | 0.61 | – | – | – | – |
| EUFEST 52 weeks (global outcome | 56.5 | 58.5/54.6 | 60.3/53.4 | 0.57 | – | – | – | – |
| EUFEST 4 weeks (global outcome | 62.0 | 61.4/62.7 | 73.2/50.3 | 0.62 | – | – | – | – |
| EUFEST 52 weeks (global outcome | 66.4 | 70.0/62.8 | 76.1/57.1 | 0.67 | – | – | – | – |
BAC (mean (SD)) is balanced accuracy (i.e. the average of sensitivity and specificity), sens is sensitivity, spec is specificity, PPV is positive predictive value, NPV is negative predictive value, AUC is area under the curve, T3 is follow-up at 3-years interval after the baseline, T6 is follow-up at 6-years interval after the baseline, PANSS Positive and Negative Syndrome Scale, ill illness related, demo demographic, CANSAS Camberwell assessment of need short appraisal, CAPE Community Assessment of Psychic Experiences, EUFEST 4 weeks is set of 10% best performing features of 4-week outcome prediction of the European First Episode Schizophrenia Trial, EUFEST 52 weeks is set of 10% best performing features of 52-week outcome prediction of the European First Episode Schizophrenia Trial.
Important baseline features by model.
| Baseline ( | Symptom outcome | Symptom outcome | Symptom outcome | Symptom outcome | Global outcome | Global outcome | Global outcome | Global outcome |
|---|---|---|---|---|---|---|---|---|
| ILL GAF disabilities | − | − | o | − | − | − | − | − |
| ILL GAF symptoms | o | − | − | − | − | − | − | − |
| PANSS poor judgment and Insight | + | + | + | + | o | + | + | o |
| PANSS hallucinatory behavior | + | + | × | + | + | o | + | + |
| PANSS flat affect | o | o | + | + | + | + | × | + |
| PANSS motor retardation | o | o | o | + | + | + | o | + |
| PANSS unusual thought content | o | o | o | + | + | + | o | + |
| ILL (health related) quality of Life | − | o | o | o | o | − | − | − |
| ILL status antipsychotics | o | o | + | + | o | o | + | o |
| ILL diagnosis schizophrenia/psychosis related disorders | − | − | − | o | o | o | o | o |
| PANSS passive/apathetic Social withdrawal | o | o | o | o | + | + | o | + |
| DEMO age | + | + | o | o | o | o | o | o |
| CANSAS number of met need | − | o | − | − | ||||
| CANSAS housing need | o | o | + | + | ||||
| CANSAS food need | o | − | + | o | ||||
| CANSAS number of no need | o | o | − | o | ||||
| CANSAS psychotic disorder unmet need | o | + | o | o | ||||
| CAPE feeling guilty | − | o | o | o | ||||
| CAPE feeling tense | o | o | + | o | ||||
| CAPE suicidal | + | o | o | o | ||||
| CAPE lack of activity | o | + | o | o | ||||
| CAPE hallucinations | o | + | o | o | ||||
| CAPE telepathy | o | o | o | + | ||||
| PANSS delusions | o | o | + | + | o | o | o | o |
| PANSS Suspiciousness/persecution | + | + | o | o | o | o | o | o |
| PANSS grandiosity | o | o | o | o | o | + | o | + |
| PANSS depression | o | + | o | o | o | o | o | + |
| PANSS lack of spontaneity | o | + | o | + | o | o | o | o |
| PANSS stereotyped thinking | o | o | o | o | + | + | o | o |
| PANSS difficulty abstract thinking | o | o | o | + | o | + | o | o |
| PANSS emotional withdrawal | o | o | o | + | o | + | o | o |
| PANSS tension | o | o | o | o | o | o | + | + |
Important baseline features: selected in at least one-fourth of the models’ top 10% most frequently selected features. 1Models contained PANSS, demographic, illness, and CANSAS features; 2Models contained PANSS, demographic, illness, and CAPE features. +: positive weight; −: negative weight; o: not selected in the top 10% most frequently selected features; empty cell: not included in the model. Note that low weights (or beta’s): ≤0.10 were not considered in this Table (see Supplementary Tables 5–12 for specific weights). Weights (−/+) are relative to poor outcomes (i.e. “positive” outcome).
Symptom symptomatic, T3 follow-up at 3-year interval after the baseline, T6 is follow-up at 6-year interval after the baseline, PANSS Positive and Negative Syndrome Scale, CANSAS Camberwell assessment of need short appraisal, CAPE Community Assessment of Psychic Experiences.
Fig. 1Frequency of inclusion and weight of features.
Frequency of inclusion of a feature against its (average) weight in the model; shown for prediction of global assessment of functioning (GAF) outcome at T3, containing Positive and Negative Syndrome Scale—general, negative, and positive subscale (PANSS—Gen, Neg, and Pos), Demographic, Illness-related and lifetime psychotic experiences (CAPE) related features. A positive weight reflects that scoring higher on this feature contributes to being classified as ‘poor outcome’. For features with negative weights the opposite holds.
Leave-one-site-out cross-validation site performance by model.
| Amsterdam | 81 | 60.8 | 104 | 69.5 | 81 | 69.3 | 104 | 60.1 |
| Groningen | 73 | 62.4 | 132 | 63.1 | 73 | 62.3 | 132 | 57.2 |
| Maastricht | 124 | 56.4 | 139 | 53.0 | 124 | 63.8 | 139 | 61.5 |
| Utrecht | 54 | 65.5 | 70 | 69.7 | 54 | 54.4 | 70 | 60.6 |
| Amsterdam | 80 | 66.4 | 100 | 68.1 | 77 | 63.9 | 98 | 65.4 |
| Groningen | 65 | 61.5 | 118 | 64.4 | 58 | 62.5 | 107 | 64.2 |
| Maastricht | 81 | 68.0 | 93 | 68.9 | 124 | 66.3 | 139 | 62.1 |
| Utrecht | 48 | 58.0 | 66 | 57.9 | 54 | 63.1 | 70 | 53.0 |
Rows mention the geographic site left out of model training. Columns mention models organized per timepoint and included modalities. 1Models contained PANSS, demographic, illness-related, and CANSAS features; 2Models contained PANSS, demographic, illness-related, and CAPE features.
BAC balanced accuracy, T3 follow-up at 3-year interval after the baseline, T6 is follow-up at 6-year interval after the baseline, PANSS Positive, and Negative Syndrome Scale, CANSAS Camberwell assessment of need short appraisal, CAPE community assessment of psychic experiences.
Fig. 2Machine learning training and validation design.
Machine learning pipeline. D = modality; M = Model; training sets in dark blue, test/validation sets in yellow. a Data selection (see Supplementary Fig. 1 for details) and preprocessing including scaling and imputation. b Unimodal models, to identify the most informative modalities. c Multimodal models consisting of 2–4 modalities, including recursive feature elimination (RFE). d External validation of multimodal models using leave-one-site-out (LOSO) validation, where one of the four geographic sites is held out of model training and used for external validation; SVM (support vector machine) setup: RFE is part of the SVM pipeline; (i) In the inner layer, a CV loop is used to find the optimal value for the cost hyperparameter C from 38 points equidistant in 2log, starting at 0.0001 and ending at 37.07. C sets a penalty for violating the margin of the hyperplane; (ii) the middle layer employs a CV loop for RFE, a feature selection algorithm. It starts by including all available features in the model and iteratively eliminates the least informative features from it until the stopping criterion is met. The smallest set of features with performance within 10% of the best-performing set is selected; (iii) in the outer layer, a CV loop is used to define feature weights in the training set (9/10th of the data) and test the accuracy of the model in the validation set (1/10th of the data). Repetition of this procedure yields 10 models, which are repeated 50 times to reduce dependency on the choice of train-test partitions. The final prediction for a patient is an ensemble constituting an average of 50 repetitions.