| Literature DB >> 31737978 |
Du Lei1,2, Walter H L Pinaya2, Jonathan Young3, Therese van Amelsvoort4, Machteld Marcelis4,5, Gary Donohoe6, David O Mothersill6, Aiden Corvin7, Sandra Vieira2, Xiaoqi Huang1, Su Lui1, Cristina Scarpazza2,8, Celso Arango9, Ed Bullmore10, Qiyong Gong1,11,12, Philip McGuire2, Andrea Mechelli2.
Abstract
Schizophrenia is a severe psychiatric disorder associated with both structural and functional brain abnormalities. In the past few years, there has been growing interest in the application of machine learning techniques to neuroimaging data for the diagnostic and prognostic assessment of this disorder. However, the vast majority of studies published so far have used either structural or functional neuroimaging data, without accounting for the multimodal nature of the disorder. Structural MRI and resting-state functional MRI data were acquired from a total of 295 patients with schizophrenia and 452 healthy controls at five research centers. We extracted features from the data including gray matter volume, white matter volume, amplitude of low-frequency fluctuation, regional homogeneity and two connectome-wide based metrics: structural covariance matrices and functional connectivity matrices. A support vector machine classifier was trained on each dataset separately to distinguish the subjects at individual level using each of the single feature as well as their combination, and 10-fold cross-validation was used to assess the performance of the model. Functional data allow higher accuracy of classification than structural data (mean 82.75% vs. 75.84%). Within each modality, the combination of images and matrices improves performance, resulting in mean accuracies of 81.63% for structural data and 87.59% for functional data. The use of all combined structural and functional measures allows the highest accuracy of classification (90.83%). We conclude that combining multimodal measures within a single model is a promising direction for developing biologically informed diagnostic tools in schizophrenia.Entities:
Keywords: functional connectivity; graph theoretical analysis; machine learning; neuroimaging; schizophrenia
Mesh:
Year: 2019 PMID: 31737978 PMCID: PMC7268084 DOI: 10.1002/hbm.24863
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.399
Demographic and clinical characteristics of participantsa
| Dataset 1 | Dataset 2 | Dataset 3 | Dataset 4 | Dataset 5 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SCZ | CON | SCZ | CON | SCZ | CON | SCZ | CON | SCZ | CON | |
| Sample size | 68 | 72 | 56 | 132 | 49 | 63 | 32 | 83 | 90 | 102 |
| Disease stage | EST | – | EST | – | EST | – | EST | – | FE | – |
| Age (years) | 38.10 ± 14.13 | 35.87 ± 11.74 | 36.16 ± 8.52 | 30.99 ± 8.62 | 29.02 ± 6.39 | 29.60 ± 10.59 | 40.94 ± 10.90 | 28.09 ± 8.98 | 24.31 ± 7.77 | 30.56 ± 15.21 |
| Gender (M/F) | 55/13 | 51/21 | 42/14 | 69/63 | 38/11 | 25/38 | 24/8 | 38/45 | 33/57 | 49/53 |
| Handedness (R/L/B) | 56/10/2 | 69/1/2 | NA | NA | 40/7/2 | 54/7/2 | 32/0/0 | 83/0/0 | 90/0/0 | 102/0/0 |
| Education (years) | NA | NA | NA | NA | 16.61 ± 1.99 | 17.38 ± 2.00 | 14.72 ± 4.41 | 17.73 ± 3.28 | 12.13 ± 3.21 | 12.27 ± 3.18 |
| Medication (An/Dn) | 68/0 | NA | 45/4 | NA | 48/1 | NA | 23/5 | NA | 0/90 | NA |
| PANSS total | 58.78 ± 14.35 | NA | NA | NA | 44.16 ± 12.40 | NA | NA | NA | 96.46 ± 17.02 | NA |
| PANSS positive | 14.36 ± 4.78 | NA | NA | NA | 10.10 ± 4.43 | NA | NA | NA | 26.17 ± 5.40 | NA |
| PANSS negative | 15.00 ± 5.36 | NA | NA | NA | 10.81 ± 5.29 | NA | NA | NA | 17.90 ± 7.18 | NA |
| PANSS general | 29.42 ± 8.55 | NA | NA | NA | 23.24 ± 5.49 | NA | NA | NA | 48.05 ± 9.09 | NA |
| SAPS | NA | NA | 23.16 ± 17.00 | NA | NA | NA | 7.58 ± 12.28 | NA | NA | NA |
| SANS | NA | NA | 28.30 ± 16.14 | NA | NA | NA | 13.33 ± 17.85 | NA | NA | NA |
Abbreviations: SCZ, schizophrenia; CON, control; EST, established; FE, first episode; PANSS, Positive and Negative Syndrome Scale; SAPS, Scale for the Assessment of Positive Symptoms; SANS, Scale for the Assessment of Negative Symptoms; M, male; F, female; R, right; L, left; B, ambidextrous; An, antipsychotic medication, Dn, Drug‐naïve, NA, not available.
Data are presented as mean ± SD.
Patients were diagnosed with established schizophrenia if duration of illness was more than 24 months.
Data available for 49 of 56 patients.
Data available for 28 of 32 patients.
Data available for 50 of 68 patients.
Data available for 88 of 90 patients.
Data available for 50 of 56 patients.
Data available for 24 of 32 patients.
Figure 1Overview of the employed classification approach. Overview of the classification approach employed to estimate the diagnostic value of fMRI and rs‐fMRI data. ALFF, amplitude of low‐frequency fluctuation; Func M, functional connectivity matrix; GM, gray matter; ReHo, regional homogeneity; rs‐fMRI, resting‐state functional MRI; sMRI, structural MRI; Struct M, structural covariance matrix; WM, white matter
Single‐subject classification of patients with schizophrenia and healthy controls across different measures
| Measures | BAC (%) | SEN | SPEC |
| |
|---|---|---|---|---|---|
| Struct M | Dataset 1 | 77.26 | 61.67 | 92.86 | <.001 |
| Dataset 2 | 60.88 | 34.67 | 87.09 | <.001 | |
| Dataset 3 | 81.50 | 63.00 | 100.00 | <.001 | |
| Dataset 4 | 81.25 | 65.00 | 97.50 | <.001 | |
| Dataset 5 | 74.65 | 61.11 | 88.18 | <.001 | |
| Average | 75.11 | 57.09 | 93.13 | ||
| Pooled | 55.94 | 45.73 | 66.14 | =.02 | |
| Stratified | 56.74 | 44.79 | 68.70 | <.001 | |
| GM | Dataset 1 | 83.45 | 68.33 | 98.57 | <.001 |
| Dataset 2 | 67.26 | 40.67 | 93.85 | <.001 | |
| Dataset 3 | 81.17 | 64.00 | 98.33 | <.001 | |
| Dataset 4 | 84.17 | 68.33 | 100.00 | <.001 | |
| Dataset 5 | 76.38 | 56.67 | 96.09 | <.001 | |
| Average | 78.49 | 59.60 | 97.37 | ||
| Pooled | 59.11 | 41.43 | 76.79 | <.001 | |
| Stratified | 58.84 | 43.26 | 74.42 | <.001 | |
| WM | Dataset 1 | 73.36 | 52.62 | 94.64 | <.001 |
| Dataset 2 | 61.59 | 29.33 | 93.85 | <.001 | |
| Dataset 3 | 77.04 | 55.50 | 98.57 | <.001 | |
| Dataset 4 | 77.71 | 56.67 | 98.75 | <.001 | |
| Dataset 5 | 79.92 | 66.67 | 93.18 | <.001 | |
| Average | 73.92 | 52.16 | 95.80 | ||
| Pooled | 56.31 | 40.85 | 71.76 | =.003 | |
| Stratified | 55.68 | 40.05 | 71.31 | =.004 | |
| Struct M + GM + WM | Dataset 1 | 87.59 | 82.14 | 93.04 | <.001 |
| Dataset 2 | 64.40 | 35.67 | 93.13 | <.001 | |
| Dataset 3 | 86.92 | 80.50 | 93.33 | <.001 | |
| Dataset 4 | 86.94 | 75.00 | 98.89 | <.001 | |
| Dataset 5 | 82.32 | 80.00 | 84.64 | <.001 | |
| Average | 81.63 | 70.66 | 92.61 | ||
| Pooled | 58.39 | 51.67 | 65.11 | <.001 | |
| Stratified | 58.17 | 50.67 | 65.66 | <.001 | |
| Func M | Dataset 1 | 88.21 | 79.29 | 97.14 | <.001 |
| Dataset 2 | 76.35 | 55.00 | 97.69 | <.001 | |
| Dataset 3 | 85.25 | 70.50 | 100.00 | <.001 | |
| Dataset 4 | 81.87 | 65.00 | 98.75 | <.001 | |
| Dataset 5 | 83.65 | 71.11 | 96.18 | <.001 | |
| Average | 83.07 | 68.18 | 97.952 | ||
| Pooled | 58.58 | 45.15 | 72.01 | <.001 | |
| Stratified | 58.00 | 45.86 | 70.15 | <.001 | |
| ReHo | Dataset 1 | 84.07 | 68.14 | 100.00 | <.001 |
| Dataset 2 | 79.46 | 62.00 | 96.92 | <.001 | |
| Dataset 3 | 83.25 | 66.50 | 100.00 | <.001 | |
| Dataset 4 | 82.36 | 65.83 | 98.89 | <.001 | |
| Dataset 5 | 81.78 | 65.56 | 98.00 | <.001 | |
| Average | 82.18 | 65.61 | 98.76 | ||
| Pooled | 54.42 | 35.50 | 73.34 | =.039 | |
| Stratified | 54.25 | 35.7 | 72.8 | <.001 | |
| ALFF | Dataset 1 | 86.90 | 73.81 | 100.00 | <.001 |
| Dataset 2 | 78.62 | 63.33 | 93.90 | <.001 | |
| Dataset 3 | 82.92 | 67.50 | 98.33 | <.001 | |
| Dataset 4 | 86.25 | 72.50 | 100.00 | <.001 | |
| Dataset 5 | 80.32 | 65.56 | 95.09 | <.001 | |
| Average | 83.00 | 68.54 | 97.46 | ||
| Pooled | 57.49 | 39.52 | 75.47 | <.001 | |
| Stratified | 56.75 | 38.98 | 74.52 | <.001 | |
| Func M + ReHo + ALFF | Dataset 1 | 92.14 | 88.57 | 95.71 | <.001 |
| Dataset 2 | 78.93 | 64.67 | 93.19 | <.001 | |
| Dataset 3 | 87.64 | 81.00 | 94.29 | <.001 | |
| Dataset 4 | 90.28 | 84.17 | 96.39 | <.001 | |
| Dataset 5 | 88.97 | 86.67 | 91.27 | <.001 | |
| Average | 87.59 | 81.02 | 94.17 | ||
| Pooled | 57.73 | 52.66 | 62.40 | <.001 | |
| Stratified | 57.96 | 52.44 | 63.49 | <.001 | |
| Struct M + GM + WM + Func M + ReHo + ALFF | Dataset 1 | 95.71 | 94.29 | 97.14 | <.001 |
| Dataset 2 | 79.74 | 63.33 | 96.15 | <.001 | |
| Dataset 3 | 94.29 | 90.00 | 98.57 | <.001 | |
| Dataset 4 | 92.92 | 85.83 | 100.00 | <.001 | |
| Dataset 5 | 91.50 | 90.00 | 93.00 | <.001 | |
| Average | 90.83 | 84.69 | 96.97 | ||
| Pooled | 59.38 | 52.38 | 66.39 | <.001 | |
| Stratified | 58.27 | 52.33 | 64.21 | <.001 |
Abbreviations: ALFF, amplitude of low‐frequency fluctuation; BAC, balanced accuracy; Func M, functional connectivity matrix; GM, gray matter; Pooled, pooled the five datasets; ReHo, regional homogeneity; SEN, sensitivity; SPEC, specificity; Stratified, site‐stratified cross‐validation; Struct M, structural covariance matrix; WM, white matter.
Sensitivity and specificity were computed considering the patient group as the positive class.
Statistical significance was estimated using the permutation method (1,000 permutations).
Single‐subject classification of patients with schizophrenia and healthy controls using different training dataset across different measures
| Measures | Training dataset | Dataset 1 | Dataset 2 | Dataset 3 | Dataset 4 | Dataset 5 |
|---|---|---|---|---|---|---|
| Struct M | Dataset 1 | 57.60 | 56.8 | 63.69 | 55.95 | |
| Dataset 2 | 56.05 | 57.71 | 56.36 | 54.41 | ||
| Dataset 3 | 60.58 | 54.90 | 54.91 | 52.71 | ||
| Dataset 4 | 57.48 | 52.71 | 54.54 | 55.75 | ||
| Dataset 5 | 51.76 | 54.38 | 57.03 | 64.78 | ||
| GM | Dataset 1 | 62.26 | 64.74 | 63.82 | 56.31 | |
| Dataset 2 | 52.98 | 53.06 | 54.69 | 50.62 | ||
| Dataset 3 | 61.07 | 55.49 | 52.88 | 52.19 | ||
| Dataset 4 | 53.76 | 51.79 | 56.12 | 52.22 | ||
| Dataset 5 | 51.55 | 57.66 | 56.12 | 57.21 | ||
| WM | Dataset 1 | 52.19 | 63.27 | 64.06 | 53.04 | |
| Dataset 2 | 55.88 | 57.14 | 55.65 | 55.62 | ||
| Dataset 3 | 53.68 | 53.19 | 51.32 | 50.62 | ||
| Dataset 4 | 58.25 | 50.51 | 58.16 | 52.78 | ||
| Dataset 5 | 50.78 | 56.01 | 55.10 | 50.00 | ||
| Struct M + GM + WM | Dataset 1 | 65.48 | 67.46 | 69.22 | 59.12 | |
| Dataset 2 | 50.78 | 52.04 | 54.69 | 51.11 | ||
| Dataset 3 | 68.46 | 56.03 | 63.57 | 53.76 | ||
| Dataset 4 | 57.43 | 52.68 | 58.16 | 52.78 | ||
| Dataset 5 | 52.29 | 58.17 | 58.16 | 51.68 | ||
| Func M | Dataset 1 | 61.63 | 63.27 | 57.57 | 50.62 | |
| Dataset 2 | 54.41 | 54.08 | 59.38 | 52.22 | ||
| Dataset 3 | 54.45 | 53.22 | 59.73 | 56.11 | ||
| Dataset 4 | 59.56 | 55.25 | 55.10 | 52.22 | ||
| Dataset 5 | 56.62 | 57.55 | 52.27 | 52.25 | ||
| ReHo | Dataset 1 | 56.30 | 62.93 | 61.30 | 51.67 | |
| Dataset 2 | 50.74 | 51.02 | 54.69 | 50.56 | ||
| Dataset 3 | 56.62 | 56.01 | 59.38 | 51.73 | ||
| Dataset 4 | 57.35 | 58.17 | 53.06 | 50.56 | ||
| Dataset 5 | 52.21 | 54.73 | 50.23 | 53.12 | ||
| ALFF | Dataset 1 | 67.48 | 62.47 | 65.14 | 57.25 | |
| Dataset 2 | 56.62 | 52.04 | 58.17 | 52.29 | ||
| Dataset 3 | 62.50 | 57.41 | 72.10 | 53.79 | ||
| Dataset 4 | 50.00 | 54.46 | 50.00 | 50.56 | ||
| Dataset 5 | 52.21 | 54.98 | 50.00 | 49.76 | ||
| Func M + ReHo + ALFF | Dataset 1 | 69.45 | 77.44 | 65.61 | 56.5 | |
| Dataset 2 | 58.09 | 51.02 | 59.38 | 51.18 | ||
| Dataset 3 | 66.22 | 66.37 | 66.94 | 53.53 | ||
| Dataset 4 | 55.15 | 58.55 | 53.06 | 52.22 | ||
| Dataset 5 | 57.35 | 58.58 | 50.45 | 50.72 | ||
| Struct M + GM + WM + Func M + ReHo + ALFF | Dataset 1 | 64.45 | 72.11 | 65.14 | 55.26 | |
| Dataset 2 | 51.47 | 50.00 | 53.12 | 51.11 | ||
| Dataset 3 | 64.01 | 58.69 | 62.26 | 50.00 | ||
| Dataset 4 | 57.35 | 52.68 | 53.06 | 51.67 | ||
| Dataset 5 | 51.47 | 56.25 | 54.08 | 50.96 |
Abbreviations: ALFF, amplitude of low‐frequency fluctuation; Func M, functional connectivity matrix; GM, gray matter; ReHo, regional homogeneity; Struct M, structural covariance matrix; WM, white matter.
Ten brain regions making the greatest contribution to single‐subject classification across the different measures
| Struct M | Dataset 1 | Dataset 2 | Dataset 3 | Dataset 4 | Dataset 5 | Mean |
|---|---|---|---|---|---|---|
| Median cingulate and paracingulate gyri L | 0.0191 | 0.0383 | 0.0488 | 0.0516 | 0.0184 | 0.0352 |
| Paracentral lobule L | 0.0171 | 0.0503 | 0.0453 | 0.0308 | 0.0264 | 0.0340 |
| Heschl gyrus R | 0.0197 | 0.0448 | 0.0490 | 0.0323 | 0.0193 | 0.0330 |
| Heschl gyrus L | 0.0142 | 0.0496 | 0.0342 | 0.0430 | 0.0239 | 0.0330 |
| Calcarine L | 0.0200 | 0.0615 | 0.0348 | 0.0258 | 0.0211 | 0.0326 |
| Median cingulate and paracingulate gyri R | 0.0133 | 0.0507 | 0.0349 | 0.0388 | 0.0240 | 0.0323 |
| Angular gyrus R | 0.0243 | 0.0470 | 0.0354 | 0.0345 | 0.0196 | 0.0322 |
| Middle frontal gyrus R | 0.0099 | 0.0722 | 0.0284 | 0.0222 | 0.0264 | 0.0318 |
| Angular gyrus L | 0.0174 | 0.0596 | 0.0343 | 0.0202 | 0.0208 | 0.0305 |
| Temporal pole: Middle temporal gyrus R | 0.0137 | 0.0531 | 0.0348 | 0.0276 | 0.0226 | 0.0304 |
Note: All brain regions are identified using AAL (automated anatomical labeling). For matrix‐based measures (i.e., Struct M and Func M), the vectors are absolute values of the weights for the connectivity between each brain region and the remaining 89 regions across the different folds of the cross‐validation. For voxel‐wise measures (i.e., GM, WM, ReHo, and ALFF), the vectors are computed using a template mask based on the AAL atlas to extract the absolute value of weight for each brain regions across the different folds of the cross‐validation.
Abbreviations: ALFF, amplitude of low‐frequency fluctuation; Func M, functional connectivity matrix; GM, gray matter; L, left hemisphere; R, right hemisphere; ReHo, regional homogeneity; Struct M, structural covariance matrix; WM, white matter.
Figure 2Regions providing the greatest contribution to single‐subject classification. Ten brain regions making the greatest contribution to single‐subject classification for each of our six measures of interest. The nodes were mapped onto the cortical surfaces by using the BrainNet Viewer package (http://www.nitrc.org/projects/bnv). ACG, anterior cingulate and paracingulate gyri; ALFF, amplitude of low‐frequency fluctuation; ANG, angular gyrus; CAL, calcarine; CAU, caudate nucleus; CUN, cuneus; DCG, median cingulate and paracingulate gyri; Func M, functional connectivity matrix; GM, gray matter; HES, Heschl gyrus; IFGoperc, inferior frontal gyrus, opercular part; IFGtriang, inferior frontal gyrus, triangular part; IPL, inferior parietal gyrus; ITG, inferior temporal gyrus; L, left hemisphere; LING, lingual gyrus; MFG, middle frontal gyrus; MTG, middle temporal gyrus; ORBinf, inferior frontal gyrus, orbital part; ORBsupmed, superior frontal gyrus, medial orbital part; PCL, paracentral lobule; PCUN, precuneus; PreCG, Precental gyrus; PUT, putamen; R, right hemisphere; REC, gyrus rectus; ReHo, regional homogeneity; SMA, supplementary motor area; SOG, superior occipital gyrus; SPG, superior parietal gyrus; STG, superior temporal gyrus; Struct M, structural covariance matrix; THA, thalamus; TPOmid, temporal pole: middle temporal gyrus; TPOsup, temporal pole: superior temporal gyrus; WM, white matter