| Literature DB >> 22059690 |
J Mourao-Miranda1, A A T S Reinders, V Rocha-Rego, J Lappin, J Rondina, C Morgan, K D Morgan, P Fearon, P B Jones, G A Doody, R M Murray, S Kapur, P Dazzan.
Abstract
BACKGROUND: To date, magnetic resonance imaging (MRI) has made little impact on the diagnosis and monitoring of psychoses in individual patients. In this study, we used a support vector machine (SVM) whole-brain classification approach to predict future illness course at the individual level from MRI data obtained at the first psychotic episode.Entities:
Mesh:
Year: 2011 PMID: 22059690 PMCID: PMC3315786 DOI: 10.1017/S0033291711002005
Source DB: PubMed Journal: Psychol Med ISSN: 0033-2917 Impact factor: 7.723
Fig. 1The support vector machine (SVM) classifier. (a) Illustration of a classification problem between two groups (patients versus controls) for the simplified case of only two voxels. Each brain image (e.g. gray-matter map) corresponds to a point in the input space and each voxel in the image represents one dimension of this space. The gray circles represent the images of patients and the black circles images of healthy controls. The dashed lines represent hyperplanes or decision boundaries that separate the groups. (b) Illustration of the optimal hyperplane determined by the SVM algorithm. The optimal hyperplane (dashed line) is the one with the largest margin of separation between the two classes or groups. The symbols at the margin (circled) are the support vectors. During the training phase the SVM finds the optimal hyperplane or decision boundary. During the test phase the decision boundary can be applied to classify new examples (white squares). The optimal hyperplane is described by a weight vector and an off-set.
Sociodemographic and clinical characteristics of the patients included in the analyses
MRI, Magnetic resonance imaging; n.s., not significant.
Values given as percentage or mean ± standard deviation.
Change in diagnosis over follow-up does not simply reflect an increase in one diagnostic group and a decrease in another one, but a change in both directions, with approximately 45% of patients changing diagnosis at follow-up.
Information missing for nine subjects.
Information missing for 10 subjects.
Results of the support vector machine (SVM) classification
The results are give for the SVM classification in two classes: ratio of true positive (sensitivity), true negative (specificity), accuracy and statistical probability. The first column shows the groups of subjects considered in each classification. The second column shows the percentage of subjects in the first group correctly classified as pertaining to it (sensitivity). The third column shows the percentage of subjects in the second group correctly classified as non-pertaining to the first group (specificity). The fourth column shows the accuracy (arithmetic mean between sensitivity and specificity). The last column shows the statistical probability that the result has been obtained by chance. It was obtained after 1000 permutations within the subjects. The number of subjects considered in each classification was 56 (28 in each group).
Fig. 2Discrimination map or support vector machine (SVM) weight vector: continuous versus episodic course (top), continuous course versus healthy individuals (bottom). The colours represent the weight of each voxel in the classification function (the red scale represents positive weights and the blue scale represents negative weights). The SVM weight vector is a linear combination or weighted average of the support vectors, that is the training examples that are most difficult to separate and define the decision boundary. The weight vector is therefore a spatial representation of the decision boundary. Every voxel contributes with a certain weight to the decision boundary or classification function. Given a positive and a negative class (e.g. +1=episodic group; −1=continuous group), a positive weight for a voxel means the weighted average in that voxel was higher for the episodic group, and a negative weight means the weighted average was higher for the continuous group. Because the classifier is multivariate by nature, the combination of all voxels as a whole is identified as a global spatial pattern by which the groups differ (the discriminating pattern). Therefore, the discrimination map should not be interpreted as a standard statistical parametric map resulting from a mass-univariate statistical test to find group differences, and no local inferences should be made based on the SVM weights.
List of the most discriminating regions (cluster peaks) for the classifier episodic versus continuous
x, y, z are Talairach coordinates of the cluster peaks selected using 3Dclust in AFNI (http://afni.nimh.nih.gov/afni). The regions were estimated using the software Talairach Client (www.talairach.org/).
List of the most discriminating regions (cluster peaks) for the classifier continuous versus healthy individuals
x, y, z are Talairach coordinates of the cluster peaks selected using 3Dclust in AFNI (http://afni.nimh.nih.gov/afni). The regions were estimated using the software Talairach Client (www.talairach.org/).