| Literature DB >> 29618790 |
Seyed Abolfazl Valizadeh1,2, Franziskus Liem3, Susan Mérillat4,3, Jürgen Hänggi1, Lutz Jäncke5,6,7.
Abstract
We examined whether it is possible to identify individual subjects on the basis of brain anatomical features. For this, we analyzed a dataset comprising 191 subjects who were scanned three times over a period of two years. Based on FreeSurfer routines, we generated three datasets covering 148 anatomical regions (cortical thickness, area, volume). These three datasets were also combined to a dataset containing all of these three measures. In addition, we used a dataset comprising 11 composite anatomical measures for which we used larger brain regions (11LBR). These datasets were subjected to a linear discriminant analysis (LDA) and a weighted K-nearest neighbors approach (WKNN) to identify single subjects. For this, we randomly chose a data subset (training set) with which we calculated the individual identification. The obtained results were applied to the remaining sample (test data). In general, we obtained excellent identification results (reasonably good results were obtained for 11LBR using WKNN). Using different data manipulation techniques (adding white Gaussian noise to the test data and changing sample sizes) still revealed very good identification results, particularly for the LDA technique. Interestingly, using the small 11LBR dataset also revealed very good results indicating that the human brain is highly individual.Entities:
Mesh:
Year: 2018 PMID: 29618790 PMCID: PMC5884835 DOI: 10.1038/s41598-018-23696-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the identification results broken down for the different methods (LDA and WKNN) and the different datasets. Acc: accuracy, Sens: sensitivity, Spec: specificity, F1: F1-score.
| LDA | WKNN | |||||||
|---|---|---|---|---|---|---|---|---|
| Acc | Sens | Spec | F1 | Acc | Sens | Spec | F1 | |
| All | 1.00 | 0.99 | 1.00 | 0.99 | 1.00 | 0.70 | 1.00 | 0.65 |
| AREA | 1.00 | 0.99 | 1.00 | 0.99 | 1.00 | 0.99 | 1.00 | 0.99 |
| THICKNESS | 1.00 | 0.98 | 1.00 | 0.97 | 1.00 | 1.00 | 1.00 | 1.00 |
| VOLUME | 1.00 | 0.99 | 1.00 | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 |
| 11 LBR | 1.00 | 0.92 | 1.00 | 0.90 | 1.00 | 0.57 | 1.00 | 0.51 |
Summary of the Cochran’s Q test results for the two identification techniques.
| P-Value | significant McNemar | |
|---|---|---|
| LDA | *** | ALL vs. 11LBR AREA vs. 11LBR VOLUME vs. 11LBR |
| WKNN | *** | ALL vs. AREA ALL vs. THICKNESS All vs. VOLUME ALL vs. 11LBR AREA vs. 11 LBR THICKNESS vs. 11LBR VOLUME vs. 11LBR |
***p < 0.001.
Summary of the McNemar tests comparing the accuracies for the LDA and WKNN techniques broken down for the 5 different datasets.
| Dataset | P value | |
|---|---|---|
| WKNN vs. LDA | ALL |
|
| AREA | n.s. | |
| THICKNESS | n.s. | |
| VOLUME | n.s. | |
| 11 LBR |
|
***p < 0.001, significant after Bonferroni-Holm correction; n.s.: not significant.
Figure 1Identification results in the context of different noise levels added to the anatomical measures broken down for the different methods (a) LDA and (b) WKNN.
Summary of the Cochran tests comparing the identification results for the 6 different noise levels separately for the LDA and WKNN techniques.
| LDA | WKNN | |
|---|---|---|
| ALL | n.s. | n.s. |
| AREA | n.s. | n.s. |
| THICKNESS | ***a | ***c |
| VOLUME | n.s. | n.s. |
| 11LBR | ***b | n.s. |
n.s. not significant, ***p < 0.001; a–c: significant differences between 0% noise and all noise additions exceeding 15%.
Figure 2Sample size effect for the different identification techniques and the different datasets. a) LDA and b) WKNN.
Results of the stepwise LDAs for the ALL dataset. Shown are the anatomical measures contributing to the identification result separately for each step.
| Step | Feature name | ACC. | SENS. | SPEC. | F1 |
|---|---|---|---|---|---|
| 1 | rh_S_oc_middle_and_Lunatus_area | 0.99 | 0.04 | 0.99 | 0.03 |
| 2 | rh_Lat_Fis-ant-Horizont_area | 0.99 | 0.15 | 1.00 | 0.12 |
| 3 | rh_S_interm_prim-Jensen_volume | 0.99 | 0.34 | 1.00 | 0.29 |
| 4 | rh_S_subparietal_area | 0.99 | 0.48 | 1.00 | 0.41 |
| 5 | lh_S_suborbital_thickness | 1.00 | 0.55 | 1.00 | 0.50 |
| 6 | rh_S_oc_middle_and_Lunatus_volume | 1.00 | 0.63 | 1.00 | 0.56 |
| 7 | rh_S_orbital-H_Shaped_area | 1.00 | 0.66 | 1.00 | 0.60 |
| 8 | rh_G_front_middle_area | 1.00 | 0.79 | 1.00 | 0.74 |
| 9 | lhCortexVol | 1.00 | 0.82 | 1.00 | 0.78 |
| 10 | lh_G_temp_sup-Lateral_volume | 1.00 | 0.88 | 1.00 | 0.85 |
| 11 | lh_S_oc_sup_and_transversal_area | 1.00 | 0.93 | 1.00 | 0.91 |
In addition, the identification results (ACC: accuracy, SENS.: sensitivity, SPEC.: specificity, F1: F1 value as a harmonic mean of specificity and sensitivity) are shown for the test sample.
Results of the stepwise LDAs for the 11 LBR dataset.
| Step | Feature name | ACC. | SENS. | SPEC. | F1 |
|---|---|---|---|---|---|
| 1 | EstimatedTotalIntraCranialVol | 0.99 | 0.14 | 1.00 | 0.11 |
| 2 | TotalGrayVol | 0.99 | 0.38 | 1.00 | 0.32 |
| 3 | CSF | 0.99 | 0.48 | 1.00 | 0.41 |
| 4 | WhiteSurfArea_area | 1.00 | 0.71 | 1.00 | 0.64 |
| 5 | Cerebellum-Cortex | 1.00 | 0.78 | 1.00 | 0.73 |
| 6 | SubCortGrayVol | 1.00 | 0.90 | 1.00 | 0.86 |
| 7 | CorticalWhiteMatterVol | 1.00 | 0.89 | 1.00 | 0.86 |
| 8 | WM-hypointensities | 1.00 | 0.94 | 1.00 | 0.92 |
| 9 | Cerebellum-White-Matter | 1.00 | 0.89 | 1.00 | 0.86 |
| 10 | MeanThickness_thickness | 1.00 | 0.93 | 1.00 | 0.91 |
| 11 | CC | 1.00 | 0.93 | 1.00 | 0.90 |
Shown are the anatomical measures contributing to the identification result separately for each step. In addition, the identification results (ACC: accuracy, SENS.: sensitivity, SPEC.: specificity, F1: F1 value as a harmonic mean of specificity and sensitivity) are shown for the test sample.