| Literature DB >> 24820966 |
Guan Yu1, Yufeng Liu2, Kim-Han Thung3, Dinggang Shen4.
Abstract
Accurately identifying mild cognitive impairment (MCI) individuals who will progress to Alzheimer's disease (AD) is very important for making early interventions. Many classification methods focus on integrating multiple imaging modalities such as magnetic resonance imaging (MRI) and fluorodeoxyglucose positron emission tomography (FDG-PET). However, the main challenge for MCI classification using multiple imaging modalities is the existence of a lot of missing data in many subjects. For example, in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, almost half of the subjects do not have PET images. In this paper, we propose a new and flexible binary classification method, namely Multi-task Linear Programming Discriminant (MLPD) analysis, for the incomplete multi-source feature learning. Specifically, we decompose the classification problem into different classification tasks, i.e., one for each combination of available data sources. To solve all different classification tasks jointly, our proposed MLPD method links them together by constraining them to achieve the similar estimated mean difference between the two classes (under classification) for those shared features. Compared with the state-of-the-art incomplete Multi-Source Feature (iMSF) learning method, instead of constraining different classification tasks to choose a common feature subset for those shared features, MLPD can flexibly and adaptively choose different feature subsets for different classification tasks. Furthermore, our proposed MLPD method can be efficiently implemented by linear programming. To validate our MLPD method, we perform experiments on the ADNI baseline dataset with the incomplete MRI and PET images from 167 progressive MCI (pMCI) subjects and 226 stable MCI (sMCI) subjects. We further compared our method with the iMSF method (using incomplete MRI and PET images) and also the single-task classification method (using only MRI or only subjects with both MRI and PET images). Experimental results show very promising performance of our proposed MLPD method.Entities:
Mesh:
Year: 2014 PMID: 24820966 PMCID: PMC4018387 DOI: 10.1371/journal.pone.0096458
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of subject information.
| Class label | Category | Sample size | Age | Female/Male | Education | MMSE |
| pMCI | MRI+PET | 76 | 75.4 | 28/48 | 15.9 | 26.7 |
| MRI | 91 | 74.4 | 37/54 | 15.5 | 26.5 | |
| sMCI | MRI+PET | 126 | 74.9 | 38/88 | 15.6 | 27.4 |
| MRI | 100 | 75.0 | 37/63 | 15.5 | 27.1 |
(For the second column, MRI+PET represents the group of subjects with both MRI and PET features, and MRI represents the group of subjects with only MRI features. The last column shows the Mini-Mental State Examination (MMSE) score.).
Figure 1Left: Histogram of the canonical correlations between MRI features and PET features; Middle: Histogram of the number of selected MRI features for task 1 based on 50 times simulation. Each time we chose 76 pMCI subjects and 76 sMCI subjects randomly in task 1; Right: Histogram of the number of selected MRI features for task 2 based on 50 times simulation. Each time we chose 76 pMCI subjects and 76 sMCI subjects randomly in task 2. For the middle and right plots, the LASSO method is used for feature selection and 10-fold cross validation is used to choose the optimal number of features.
Classification performance of MLPD and iMSF using incomplete MRI and PET.
| pMCI(+1)/sMCI(−1) | k-fold CV | iMSFQ | iMSFL | MLPD |
| Accuracy | 2 | 0.6433 (0.0030) | 0.6420 (0.0035) |
|
| 5 | 0.6467 (0.0023) | 0.6482 (0.0031) |
| |
| 10 | 0.6581 (0.0022) | 0.6578 (0.0030) |
| |
| Sensitivity | 2 | 0.4913 (0.0072) | 0.4965 (0.0061) |
|
| 5 | 0.4853 (0.0063) | 0.5069 (0.0054) |
| |
| 10 | 0.4906 (0.0040) | 0.5223 (0.0038) |
| |
| Specificity | 2 |
| 0.7496 (0.0064) | 0.6671 (0.0044) |
| 5 |
| 0.7527 (0.0048) | 0.6671 (0.0022) | |
| 10 |
| 0.7582 (0.0044) | 0.6675 (0.0026) |
(For this experiment, we used all the available data from 393 subjects in total. iMSFQ and iMSFL indicate the iMSF method using quadratic loss function and logistic loss function respectively. The best value for each performance measure is highlighted in bold. The value in the parenthesis is the standard deviation.).
Comparison of the classification performance of MLPD and iMSF on task 2.
| pMCI(+1)/sMCI(−1) | k-fold CV | iMSFQ | iMSFL | MLPD |
| Accuracy | 2 | 0.6363 (0.0047) | 0.6386 (0.0055) |
|
| 5 | 0.6291 (0.0039) | 0.6346 (0.0041) |
| |
| 10 | 0.6321 (0.0038) | 0.6380 (0.0043) |
| |
| Sensitivity | 2 | 0.6026 (0.0101) | 0.6052 (0.0092) |
|
| 5 | 0.5990 (0.0074) | 0.6104 (0.0060) |
| |
| 10 | 0.5996 (0.0057) | 0.6146 (0.0067) |
| |
| Specificity | 2 | 0.6670 (0.0062) |
| 0.6700 (0.0065) |
| 5 | 0.6563 (0.0049) | 0.6567 (0.0055) |
| |
| 10 |
| 0.6590 (0.0054) | 0.6597 (0.0045) |
(For this experiment, we used all the available data from 393 subjects in total. The classification results for the subjects in task 2 are reported. iMSFQ and iMSFL indicate the iMSF method using quadratic loss function and logistic loss function respectively. The best value for each performance measure is highlighted in bold. The value in the parenthesis is the standard deviation.).
Classification performance of MLPD and SLPD.
| pMCI(+1)/sMCI(−1) | k-fold CV | SLPD | MLPD |
| Accuracy | 2 | 0.6423 (0.0029) |
|
| 5 | 0.6420 (0.0028) |
| |
| 10 | 0.6409 (0.0029) |
| |
| Sensitivity | 2 |
| 0.6431 (0.0065) |
| 5 | 0.6510 (0.0037) |
| |
| 10 | 0.6588 (0.0048) |
| |
| Specificity | 2 | 0.6342 (0.0040) |
|
| 5 | 0.6353 (0.0046) |
| |
| 10 | 0.6276 (0.0030) |
|
(For MLPD, all the available data are used. For SLPD, only the MRI features are used. The best value for each performance measure is highlighted in bold. The value in the parenthesis is the standard deviation.).
Comparison of the classification performance of MLPD and SLPD on task 1.
| pMCI(+1)/sMCI(−1) | k-fold CV | SLPD | MLPD |
| Accuracy | 2 | 0.6394 (0.0067) |
|
| 5 | 0.6538 (0.0047) |
| |
| 10 | 0.6588 (0.0031) |
| |
| Sensitivity | 2 | 0.6096 (0.0137) |
|
| 5 | 0.6608 (0.0063) |
| |
| 10 | 0.6551 (0.0064) |
| |
| Specificity | 2 | 0.6574 (0.0073) |
|
| 5 | 0.6497 (0.0053) |
| |
| 10 | 0.6611 (0.0036) |
|
(For MLPD, all the available data is used, but the classification results for the subjects in task 1 is reported. For SLPD, only use the subjects in task 1. The best value for each performance measure is highlighted in bold. The value in the parenthesis is the standard deviation.).
Figure 2Classification accuracy of MLPD and SLPD with respect to different predefined threshold .