| Literature DB >> 24179839 |
Gaëtan Garraux1, Christophe Phillips, Jessica Schrouff, Alexandre Kreisler, Christian Lemaire, Christian Degueldre, Christian Delcour, Roland Hustinx, André Luxen, Alain Destée, Eric Salmon.
Abstract
Most available pattern recognition methods in neuroimaging address binary classification problems. Here, we used relevance vector machine (RVM) in combination with booststrap resampling ('bagging') for non-hierarchical multiclass classification. The method was tested on 120 cerebral (18)fluorodeoxyglucose (FDG) positron emission tomography (PET) scans performed in patients who exhibited parkinsonian clinical features for 3.5 years on average but that were outside the prevailing perception for Parkinson's disease (PD). A radiological diagnosis of PD was suggested for 30 patients at the time of PET imaging. However, at follow-up several years after PET imaging, 42 of them finally received a clinical diagnosis of PD. The remaining 78 APS patients were diagnosed with multiple system atrophy (MSA, N = 31), progressive supranuclear palsy (PSP, N = 26) and corticobasal syndrome (CBS, N = 21), respectively. With respect to this standard of truth, classification sensitivity, specificity, positive and negative predictive values for PD were 93% 83% 75% and 96%, respectively using binary RVM (PD vs. APS) and 90%, 87%, 79% and 94%, respectively, using multiclass RVM (PD vs. MSA vs. PSP vs. CBS). Multiclass RVM achieved 45%, 55% and 62% classification accuracy for, MSA, PSP and CBS, respectively. Finally, a majority confidence ratio was computed for each scan on the basis of class pairs that were the most frequently assigned by RVM. Altogether, the results suggest that automatic multiclass RVM classification of FDG PET scans achieves adequate performance for the early differentiation between PD and APS on the basis of cerebral FDG uptake patterns when the clinical diagnosis is felt uncertain. This approach cannot be recommended yet as an aid for distinction between the three APS classes under consideration.Entities:
Keywords: Bagging; Boostrap resampling; Computer-aided diagnosis; Corticobasal syndrome; Data mining; Error-Correcting Output Code; FDG PET; Multiclass classification; Multiple system atrophy; Parkinson's disease; Pattern recognition; Progressive supranuclear palsy; Relevance vector machine
Year: 2013 PMID: 24179839 PMCID: PMC3778264 DOI: 10.1016/j.nicl.2013.06.004
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Demographic and clinical data.
| N | Gender (F/M) | Data at the time of PET assessment | Last available follow-up | |||
|---|---|---|---|---|---|---|
| Mean age (years) | Mean disease duration (years) | Mean LEDD (mg) | Mean disease duration (years) | |||
| PD | 42 | 17/25 | 56.9 ± 10.3 | 3.6 ± 3.1 | 442 ± 239 | 11.6 ± 5.1 |
| MSA | 31 | 18/13 | 66.0 ± 8.8 | 3.4 ± 2.9 | 559 ± 298 | 6.4 ± 3.9 |
| PSP | 26 | 9/17 | 69.4 ± 7.3 | 3.1 ± 2.4 | 281 ± 250 | 5.9 ± 4 |
| CBS | 21 | 15/6 | 67.8 ± 7 | 3.3 ± 2 | 164 ± 189 | 5.9 ± 2.9 |
| All classes | 120 | 59/61 | 63.9 ± 10.2 | 3.4 ± 2.7 | 386 ± 284 | 8.0 ± 5.0 |
LEDD = L-DOPA equivalent daily dose (Tomlinson et al., 2010).
Fig. 1Bootstrap resampling with replacement (“bagging”).
At each iteration, the whole FDG-PET dataset was split into training and test sets. A prediction was assigned to each test instance by each of the six trained RVM models. A single prediction (PD, MSA, PSP or CBS) was obtained from the six RVM models using an Error-Correcting Output Code (ECOC) approach (Dietterich and Bakiri, 1995).
Bootstrap resampling procedure (multiclass RVM).
| Bag #1 | Bag #2 | Bag #3 | Bag #n-2 | Bag #n-1 | Bag #n | Vote counting | Standard of truth (SOT) | RVM prediction accuracy | Majority confidence ratio | |
|---|---|---|---|---|---|---|---|---|---|---|
| Vote #1 | Vote #2 | Vote # 3 | Vote #n-2 | Vote #n-1 | Vote #n | Majority vote? | Clinical diagnosis | Majority vote = SOT? | ||
| Scan #1 | PD | – | MSA | PD | PD | – | NPD = 32 | PD | 1 | (32 − 2) / 35 ∗ 100 = 85% |
| Scan #2 | PD | PSP | – | MSA | MSA | PD | NPD = 13 | MSA | 0 | (13 − 9) / 31 ∗ 100 = 13% |
| · | · | · | · | · | · | · | · | · | · | · |
| · | · | · | · | · | · | · | · | · | · | · |
| · | · | · | · | · | · | · | · | · | · | · |
| Scan #120 | PD | PSP | CBS | – | – | PSP | NPD = 4 | CBS | 0 | (13 − 10) / 30 ∗ 100 = 10% |
– = scan included in the training set and not in the test set in this bootstrap sample.
Confusion matrix derived from bootstrap aggregation (bagging) in binary RVM.
| RVM classification | Diagnostic classes (SOT) | PPV & NPV | |
|---|---|---|---|
| PD | APS | ||
| PD | 13 | .75 | |
| APS | 3 | .96 | |
| Class accuracy (p-value) | .93 (0.0) | .83 (0.0) | |
The table shows class accuracies (with the associate p-value) and positive/negative predictive values (PPV and NPV). SOT = standard of truth. The number of scans correctly classified in each class is indicated in bold.
Accuracy of RVM classification and the radiological diagnosis at the time of PET.
| 42 PD patients | 78 APS patients | Total 120 patients | |
|---|---|---|---|
| Correct agreement | 26 (62%) | 63 (81%) | 89 (74%) |
| Incorrect agreement | 2 (5%) | 1 (1%) | 3 (3%) |
| Correct RVM, incorrect radiological | 13 (31%) | 2 (3%) | 15 (13%) |
| Correct radiological, incorrect RVM | 1 (2%) | 12 (15%) | 13 (11%) |
The table summarizes the accuracy of binary RVM classification and radiological diagnosis, for the two diagnostic classes (PD and APS) together (last column) or separately. RVM classification and the radiological diagnostic could be in agreement (correctly or incorrectly) or disagreement, with one correct and the other incorrect with respect to the standard of truth (SOT) given by the clinical diagnosis at the last available follow-up several years after PET assessment (Table 1).
Confusion matrix derived from bootstrap aggregation (bagging) in multiclass RVM.
| RVM classification | Diagnostic classes (SOT) | PPV/NPV | |||
|---|---|---|---|---|---|
| PD | MSA | PSP | CBS | ||
| PD | 6 | 2 | 2 | .79/.94 | |
| MSA | 1 | 5 | 1 | .67/.83 | |
| PSP | 1 | 7 | 5 | .52/.87 | |
| CBS | 2 | 4 | 5 | .54/.92 | |
| Class accuracy (p-value) | .90 (0.0) | .45 (.149) | .55 (.067) | .62 (.025) | |
The table shows class accuracies (with the associate p-value) and positive/negative predictive values (PPV and NPV). SOT = standard of truth. The number of scans correctly classified in each class is indicated in bold.
Fig. 2Positive predictive values as a function of the majority confidence ratio.
Variation of positive predictive value (PPV) when only scans above a classification confidence threshold are counted (total number and number of correctly classified): the threshold tc of confidence level is varied from 0 to 90% by steps of 10%.
Fig. 3Discriminant standardized maps between PD and APS.
Unthresholded discriminant standardized map computed from the binary (A) and multiclass (B) RVM analyses comparing PD and APS classes. The color scale represents the standardized values computed on the basis of the 100 discriminant images created during bootstrap resampling (Fig. 1). By convention here, the excess network (EN) where FDG uptake is relatively increased in the PD class as compared with the APS class is represented by positive standardized values while relative deficits are represented by negative standardized values. Most voxels have a standardized value close to zero and therefore their contribution to the distinction between the two diagnostic classes under consideration is rather variable across the bootstrap samplings. The discriminant maps are displayed on representative axial (Z = 0, 20 and 40 mm), and sagittal (X = 0 mm) slices through a standard T1-weighted MRI in stereotactic space. Z and X values at the bottom indicate the distance (in mm) of the image from the axial plane through the anterior and posterior commissures and from the parasagittal plane through the midline, respectively. L = Left.
ECOC approach.
| Class | Binary classifier | |||||
|---|---|---|---|---|---|---|
| PD vs MSA | PD vs PSP | PD vs CBD | MSA vs PSP | MSA vs CBD | PSP vs CBD | |
| PD | 1 | 1 | 1 | 0.5 | 0.5 | 0.5 |
| MSA | 0 | 0.5 | 0.5 | 1 | 1 | 0.5 |
| PSP | 0.5 | 0 | 0.5 | 0 | 0.5 | 1 |
| CBD | 0.5 | 0.5 | 0 | 0.5 | 0 | 0 |
| Test scan: PD | 0.5 | 0.7 | 0.8 | 0.4 | 0.6 | 0.5 |
Let's consider a test scan obtaining the probabilities shown in table S1, last line, for each binary classifier. When using the RVM probabilities, the distance measure used to assign the final class is the Manhattan distance, which would give for the four classes:where L represents the final score of each class. In the present case, the class ‘PD’ is assigned to the test point since it shows the smallest final score L, which leads to a correct classification.