| Literature DB >> 31571320 |
Xin-Lu Cai1,2,3, Dong-Jie Xie1,4, Kristoffer H Madsen3,5,6, Yong-Ming Wang1,2,3, Sophie Alida Bögemann1,2,3, Eric F C Cheung7, Arne Møller3,8, Raymond C K Chan1,2,3,9.
Abstract
Machine learning has increasingly been applied to classification of schizophrenia in neuroimaging research. However, direct replication studies and studies seeking to investigate generalizability are scarce. To address these issues, we assessed within-site and between-site generalizability of a machine learning classification framework which achieved excellent performance in a previous study using two independent resting-state functional magnetic resonance imaging data sets collected from different sites and scanners. We established within-site generalizability of the classification framework in the main data set using cross-validation. Then, we trained a model in the main data set and investigated between-site generalization in the validated data set using external validation. Finally, recognizing the poor between-site generalization performance, we updated the unsupervised algorithm to investigate if transfer learning using additional unlabeled data were able to improve between-site classification performance. Cross-validation showed that the published classification procedure achieved an accuracy of 0.73 using majority voting across all selected components. External validation found a classification accuracy of 0.55 (not significant) and 0.70 (significant) using the direct and transfer learning procedures, respectively. The failure of direct generalization from one site to another demonstrates the limitation of within-site cross-validation and points toward the need to incorporate efforts to facilitate application of machine learning across multiple data sets. The improvement in performance with transfer learning highlights the importance of taking into account the properties of data when constructing predictive models across samples and sites. Our findings suggest that machine learning classification result based on a single study should be interpreted cautiously.Entities:
Keywords: generalizability; machine learning; reproducibility; schizophrenia spectrum disorders
Year: 2019 PMID: 31571320 PMCID: PMC7268030 DOI: 10.1002/hbm.24797
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
Summary of machine learning studies in schizophrenia based on rsfMRI
| Reference | Participants | Feature | Feature extraction | Classifier | Accuracy (%) |
|---|---|---|---|---|---|
| Shen, Wang, Liu, and Hu ( | SZ = 32, HC = 20 | FC among 116 regions | Correlation coefficient rank; LLE | C‐means clustering | 86.5 |
| Fan et al. ( | SZ = 31, HC = 31 | Functional brain networks from ICA | ICA, Grassmann manifold analysis | SVM | 85.5 |
| Du et al. ( | SZ = 28, HC = 28 | Spatial components from ICA |
| Majority voting | 93 |
| Tang, Wang, Cao, and Tan ( | SZ = 22, HC = 22 | FC by 90 regions | Correlation coefficient rank; PCA | Linear SVM | 93.2 |
| Bassett, Nelson, Mueller, Camchong, and Lim ( | SZ = 15, HC = 14 | FC by the graph | No | Linear kernel SVM | 75 |
| Venkataraman, Whitford, Westin, Golland, and Kubicki ( | SZ = 18, HC = 18 | FC by 77 ROIs | Select by prior knowledge and random forest | Majority voting | 75 |
| Anderson and Cohen ( | SZ = 72, HC = 74 | FNC by the graph | ICA | SVM | 65 |
| Arbabshirani et al. (2013) | SZ = 28, HC = 28 | FNC | ICA; visually inspect | K‐nearest neighbors | 96 |
| Fekete et al. ( | SZ = 8, HC = 10 | FNC by the graph | No | Multi kernel block diagonal optimization | 100 |
| Su, Wang, Shen, Feng, and Hu ( | SZ = 32, HC = 32 | FC | Correlation coefficient rank | Linear kernel SVM | 81.2 |
| Watanabe, Kessler, Scott, Angstadt, and Sripada ( | SZ = 54, HC = 67 | FC by 347 ROIs | Elastic‐net | SVM | 73.5 |
| Cheng, Newman, et al. ( | SZ = 415, HC = 405 | FC by BWAS | No | SVM | 75.8 |
| Chyzhyk, Savio, and Grana ( | SZ = 74, HC = 72 | FC/ local activity | Pearson's correlation coefficient | SVM | 91.2/100 |
| Savio and Grana ( | SZ = 72, HC = 74 | ALFF, fALFF, VMHC, ReHo | Voxel site saliency measures | SVM; RF | 80 |
| Cheng, Newman, et al. ( | SZ = 27, HC = 36 | BC of FC | Rank BC | SVM | 79 |
| Kaufmann et al. ( | SZ = 71, HC = 196 | FC from ICA | No | Regularized LDA | 84.4 |
| Kim et al. ( | SZ = 50, HC = 50 | FC among 116 regions | No | Deep neural network | 86 |
| Skatun et al. ( | SZ = 182, HC = 348 | FC from ICA | No | Regularized LDA | 78.3 |
| Cui, Liu, Song, et al. ( | SZ = 108, HC = 121 | FC by 90 ROIs | Two sample | SVM | 82.6 |
| Cao et al. (2018) | SZ = 43, HC = 29 | MI and FC between STC and other cortical regions | Top 10 features | SVM | 78.6 |
Abbreviations: BC, betweenness centrality; BWAS, brain‐wide association study; FLD, Fisher linear discriminant; FNC, functional network connectivity; HC, healthy controls; LASSO, least absolute shrinkage and selection operator; LDA, linear discriminative analysis; LLE, locally linear embedding; MI, mutual information; PCA, principle component analysis; RF, random forest; ROI, region of interest; STC, superior temporal cortex; SZ, patients with schizophrenia; VMHC, voxel‐mirrored homotopic connectivity.
Demographic and clinical information of two data sets
| HC | SZ |
|
| |
|---|---|---|---|---|
| The main data set | ||||
| Demographics | ||||
| Age (years) | 42.04 (12.165) | 43.22 (10.885) | −.515 | .608 |
| Gender (male%) | 35.29% | 41.18% | .374 | .684 |
| Education (years) | 12.80 (3.731) | 12.07 (2.946) | 1.105 | .272 |
| Estimated IQ* | 119.76 (11.735) | 107.68 (14.215) | 4.663 | .000 |
| Clinical characteristics | ||||
| Onset age (years) | 25.41 (9.185) | |||
| Course (years) | 16.66 (8.067) | |||
| PANSS total | 51.69 (14.771) | |||
| PANSS positive | 11.63 (4.858) | |||
| PANSS negative | 13.39 (5.783) | |||
| PANSS general | 26.67 (7.618) | |||
| CPZ equivalent dose (mg) | 236.56 (172.208) | |||
| The validated data set | ||||
| Demographics | ||||
| Age* (years) | 27.37 (7.344) | 36.5 (7.140) | −4.898 | .000 |
| Gender* (male%) | 44.44% | 82.35% | 9.580 | .003 |
| Education* (years) | 13.93 (2.814) | 12.18 (2.208) | 2.722 | .009 |
| Estimated IQ | 108.00 (20.196) | 93.94 (24.799) | 1.970 | .055 |
| Clinical characteristics | ||||
| Onset age (years) | 24.17 (6.644) | |||
| Course (years) | 10.54 (7.868) | |||
| PANSS total | 67.72 (12.378) | |||
| PANSS positive | 10.00 (3.873) | |||
| PANSS negative | 23.16 (3.118) | |||
| PANSS general | 30.84 (7.238) | |||
| CPZ equivalent dose (mg) | 287.05 (193.830) | |||
Note. Table values: mean (SD).
Abbreviations: CPZ: chlorpromazine; HC, healthy controls; PANSS, the Positive and Negative Syndrome Scale; SZ, patients with schizophrenia.
*p < 0.05.
Comparison of demographic and clinical information between two sites
| SZ | HC | |||||||
|---|---|---|---|---|---|---|---|---|
| Main | Validated |
|
| Main | Validated |
|
| |
| Demographics | ||||||||
| Age (years) | 43.22 (10.885) | 36.50 (7.14) | 3.168 | .002 | 42.04 (12.165) | 43.22 (10.885) | −.515 | .608 |
| Gender (male%) | 41.18% | 82.35% | 14.167 | .000 | 35.29% | 44.44% | .625 | .470 |
| Education (years) | 12.07 (2.946) | 12.18 (2.208) | −.182 | .856 | 12.80 (3.731) | 12.07 (2.946) | 1.105 | .272 |
| Estimated IQ | 107.68 (14.215) | 93.94 (24.799) | 3.206 | .002 | 119.76 (11.735) | 107.68 (14.215) | 4.663 | .000 |
| Clinical characteristics | ||||||||
| Onset age (years) | 25.41 (9.185) | 24.17 (6.644) | .594 | .554 | ||||
| Course (years) | 16.66 (8.067) | 10.54 (7.868) | 3.070 | .003 | ||||
| PANSS total | 51.69 (14.771) | 67.72 (12.378) | −4.678 | .000 | ||||
| PANSS positive | 11.63 (4.858) | 10.00 (3.873) | 1.461 | .148 | ||||
| PANSS negative | 13.39 (5.783) | 23.16 (3.118) | −7.884 | .000 | ||||
| PANSS general | 26.67 (7.618) | 30.84 (7.238) | −2.28 | .025 | ||||
| CPZ equivalent dose (mg) | 236.56 (172.208) | 287.05 (193.830) | −1.024 | .310 | ||||
| PANSS5 negative | 15.20 (6.636) | 28.17 (4.428) | −8.544 | .000 | ||||
| PANSS5 positive | 10.47 (4.888) | 8.17 (2.973) | 2.127 | .037 | ||||
| PANSS5 disorganized | 15.22 (4.500) | 17.09 (4.067) | −1.704 | .093 | ||||
| PANSS5 excited | 5.61 (2.155) | 5.92 (2.701) | −.533 | .596 | ||||
| PANSS5 anxiety | 8.75 (3.918) | 8.76 (3.358) | −.016 | .987 | ||||
Note. Table values: mean (SD).
Abbreviations: CPZ, chlorpromazine; HC, healthy controls; main, main data set; PANSS, the Positive and Negative Syndrome Scale; PANSS5, the symptom domains calculated by the five‐factor model; SZ, patients with schizophrenia; validated: validated data set.
Classification performance for each component
| Ind | Area | Validation type | Acc | Sens | Spec | Ind | Area | Validation type | Acc | Sens | Spec |
|---|---|---|---|---|---|---|---|---|---|---|---|
| (1) | Occipital lateral cortex | Internal | 0.686 | 0.588 | 0.784 | (9) | Posterior DMN | Internal | 0.441 | 0.392 | 0.490 |
| External1 | 0.417 | 0.485 | 0.333 | External1 | 0.450 | 0.424 | 0.481 | ||||
| External2 | 0.450 | 0.333 | 0.593 | External2 | 0.583 | 0.485 | 0.704 | ||||
| (2) | Occipital medial cortex | Internal | 0.578 | 0.510 | 0.647 | (10) | Middle cingulate | Internal | 0.686 | 0.510 | 0.863 |
| External1 | 0.533 | 0.455 | 0.630 | External1 | 0.433 | 0.152 | 0.778 | ||||
| External2 | 0.583 | 0.515 | 0.667 | External2 | 0.617 | 0.636 | 0.593 | ||||
| (3) | Cerebellum | Internal | 0.598 | 0.392 | 0.804 | (11) | Occipital medial cortex | Internal | 0.588 | 0.353 | 0.824 |
| External1 | 0.417 | 0.333 | 0.519 | External1 | 0.550 | 0.333 | 0.815 | ||||
| External2 | 0.517 | 0.333 | 0.741 | External2 | 0.550 | 0.697 | 0.370 | ||||
| (4) | Fusiform | Internal | 0.657 | 0.490 | 0.824 | (12) | Frontal superior cortex | Internal | 0.529 | 0.627 | 0.431 |
| External1 | 0.567 | 0.364 | 0.815 | External1 | 0.550 | 0.727 | 0.333 | ||||
| External2 | 0.467 | 0.242 | 0.741 | External2 | 0.533 | 0.576 | 0.481 | ||||
| (5) | Basal ganglia | Internal | 0.706 | 0.627 | 0.784 | (13) | DMN | Internal | 0.647 | 0.490 | 0.804 |
| External1 | 0.500 | 0.242 | 0.815 | External1 | 0.483 | 0.303 | 0.704 | ||||
| External2 | 0.667 | 0.606 | 0.741 | External2 | 0.617 | 0.576 | 0.667 | ||||
| (6) | Precentral gyrus | Internal | 0.578 | 0.686 | 0.471 | (14) | Central gyrus | Internal | 0.627 | 0.431 | 0.824 |
| External1 | 0.500 | 0.576 | 0.407 | External1 | 0.617 | 0.485 | 0.778 | ||||
| External2 | 0.633 | 0.576 | 0.704 | External2 | 0.483 | 0.242 | 0.778 | ||||
| (7) | Anterior DMN | Internal | 0.618 | 0.510 | 0.725 | (15) | DMN | Internal | 0.431 | 0.353 | 0.510 |
| External1 | 0.600 | 0.545 | 0.667 | External1 | 0.583 | 0.606 | 0.556 | ||||
| External2 | 0.550 | 0.788 | 0.259 | External2 | 0.650 | 0.606 | 0.704 | ||||
| (8) | Temporal lobe | Internal | 0.657 | 0.667 | 0.647 | (16) | Precuneus | Internal | 0.676 | 0.647 | 0.706 |
| External1 | 0.550 | 0.333 | 0.815 | External1 | 0.550 | 0.455 | 0.667 | ||||
| External2 | 0.667 | 0.788 | 0.519 | External2 | 0.583 | 0.333 | 0.889 |
Note. Ind, index; Acc, accuracy; Sens, sensitivity; Spec, specificity; Internal, internal validation within the main data set; External1, external validation generalizing to the validated data set from the main data set by the template based on the main data set; External2, external validation generalizing to the validated data set from the main data set by the template based on both data set.
Figure 1The flowchart shows machine learning procedures through internal and external validation. FLD, Fisher linear discrimination; GICA, group independent component analysis; PCA, principle component analysis; STR, spatial–temporal reconstruction
Figure 2Three orthogonal slices from selected components are shown in the figure. The mean components were calculated across all participants and converted into Z‐scores. Orthogonal projects are reproduced according to neurological convention
Selected spatial components used for classification
| Index | Component name | Spatial location | Peak MNI region | Peak MNI coordinates (mm) |
|---|---|---|---|---|
| (1) | Occipital lateral cortex | Left calcarine sulcus + left middle occipital gyrus + lingual gyrus | Left calcarine sulcus | −12, −93, −3 |
| (2) | Occipital medial cortex | Calcarine | Left calcarine sulcus | 0, −75, 9 |
| (3) | Cerebellum | Left lobule VIII, left crus I, and lobule VI of cerebellar hemisphere | Lobule IX of vermis | 3, −60, −39 |
| (4) | Fusiform | Fusiform gyrus + lingual gyrus | Right lingual gyrus | 24, −60, −9 |
| (5) | Basal ganglia | Putamen + thalamus + caudate nucleus | Left putamen | −21, 9, −3 |
| (6) | Precentral gyrus | Precentral gyrus + left postcentral gyrus | Left postcentral gyrus | −51, −9, 33 |
| (7) | Anterior DMN | Anterior cingulate gyrus + left medial frontal gyrus | Left medial orbitofrontal cortex | 0, 51, −3 |
| (8) | Temporal lobe | Middle temporal gyrus + right superior temporal gyrus | Right superior temporal gyrus | 60, −42, 12 |
| (9) | Posterior DMN | Precentral gyrus + left cuneus | Left precuneus | 0, −69, 36 |
| (10) | Middle cingulate | Midcingulate area | Left midcingulate area | 0, −33, 45 |
| (11) | Occipital medial cortex | Middle occipital gyrus + cuneus + superior occipital gyrus | Right middle occipital gyrus | 30, −81, 24 |
| (12) | Frontal superior cortex | Superior frontal gyrus + right supplementary motor area + middle frontal gyrus | Left supplementary motor area | 0, 6, 60 |
| (13) | DMN | Medial frontal gyrus | Left medial frontal gyrus | 0, 51, 33 |
| (14) | Central gyrus | Postcentral gyrus + precentral gyrus | Undefined | 0, −27, 66 |
| (15) | DMN | Middle frontal gyrus + superior frontal gyrus | Right midcingulate area | 3, 24, 36 |
| (16) | Precuneus | Precentral gyrus + superior parietal lobule | Left precuneus | 0, −57, 54 |
Figure 3Accuracy, sensitivity, and specificity based on individual features. Internal validation was conducted within the main data set, external validation1 was conducted by directly applying the classification procedure from the main data set to the validated data set, and external validation2 was conducted using an updated group ICA across both data sets but with all other steps being identical