| Literature DB >> 35642196 |
Savas Okyay1,2, Nihat Adar1.
Abstract
Medical doctors may struggle to diagnose dementia, particularly when clinical test scores are missing or incorrect. In case of any doubts, both morphometrics and demographics are crucial when examining dementia in medicine. This study aims to impute and verify clinical test scores with brain MRI analysis and additional demographics, thereby proposing a decision support system that improves diagnosis and prognosis in an easy-to-understand manner. Therefore, we impute the missing clinical test score values by unsupervised dementia-related user-based collaborative filtering to minimize errors. By analyzing succession rates, we propose a reliability scale that can be utilized for the consistency of existing clinical test scores. The complete base of 816 ADNI1-screening samples was processed, and a hybrid set of 603 features was handled. Moreover, the detailed parameters in use, such as the best neighborhood and input features were evaluated for further comparative analysis. Overall, certain collaborative filtering configurations outperformed alternative state-of-the-art imputation techniques. The imputation system and reliability scale based on the proposed methodology are promising for supporting the clinical tests.Entities:
Keywords: Clinical test scores; Dementia; Imputation; Incomplete data; Missing values; Reliability scale; User-based collaborative filtering
Year: 2022 PMID: 35642196 PMCID: PMC9148556 DOI: 10.7717/peerj.13425
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Reliability scale results example.
| Clinical test type | Range | Clinical test score | Computer-aided prediction | Absolute error percentage | Reliability |
|---|---|---|---|---|---|
| MMSE | [0, 30] | 20 | 22.555 | 8.52 | not trusted |
| MMSE | [0, 30] | 21 | 26.469 | 18.23 | not trusted |
| MMSE | [0, 30] | 28 | 27.977 | 0.08 | trusted |
| GDS | [0, 12] | 0 | 0.662 | 5.52 | trusted |
| GDS | [0, 12] | 2 | 2.01 | 0.08 | trusted |
| GDS | [0, 12] | 5 | 1.335 | 30.54 | not trusted |
| CDR | [−1, 3] | 0.5 | 0.5 | 0 | trusted |
| CDR | [−1, 3] | 0.5 | 0.552 | 1.3 | trusted |
| CDR | [−1, 3] | 1 | 0.166 | 20.85 | not trusted |
Note:
Absolute Error Percentage = |Clinical Test Score − Computer-Aided Prediction| / Range
Figure 1Graphical flowchart of the methodology.
Screening schedule and clinical test score sparsity relationship.
| Screening schedule | Sample count | MMSE | GDS | CDR | NIQ | FAQ |
|---|---|---|---|---|---|---|
| ADNI1 Screening | 828 | ✓ | ✓ | ✓ | ||
| ADNI1/GO Month 6 | 753 | ✓ | ✓ | ✓ | ✓ | |
| ADNI1/GO Month 12 | 710 | ✓ | ✓ | ✓ | ✓ | ✓ |
Figure 2Brief list of the initial feature set.
Similarity weight and prediction equations in practice.
| Stage (type) | Name | Equation |
|---|---|---|
| Similarity (corr.) | Pearson Correlation Coefficient |
|
| Similarity (corr.) | Median-Based Robust Correlation |
|
| Similarity (corr.) | Cosine Similarity |
|
| Similarity (dist.) | Manhattan Distance Similarity |
|
| Similarity (dist.) | Euclidian Distance Similarity |
|
| Prediction (avg.) | Weighted Average |
|
Figure 3Illustration of the data elimination and imputation operations in the test procedure.
The figure represents the actual values of the first 25 samples in a randomly selected independent test scenario. (A) The non-green vertical lines indicate the columns of the non-sparse clinical test score attributes. (B) The zoomed-in representation of corresponding vertical column values is shown. (C) The bordered red cells indicate that the related values are randomly emptied. (D) The bordered light green cells indicate that the missing values are imputed.
Performance metric formulations in practice.
| Name | Formula |
|---|---|
| Mean Absolute Error |
|
| Mean Squared Error |
|
| Root Mean Squared Error |
|
| R-Squared |
|
Figure 4Performance metric plots of the average of multiple individual tests.
Selected best-performing test results ordered by the summation-based combination of the performance metric rankings.
| Imputation technique | Feature vector set | Similarity measurement | BNC | MAE | RMSE | R2 |
|---|---|---|---|---|---|---|
| DUCF | Hybrid Set of Features | MAN |
| 0.06933 | 0.09196 | 0.92235 |
| DUCF | Hybrid Set of Features | MAN |
| 0.06968 | 0.09194 | 0.92232 |
| DUCF | Hybrid Set of Features | MAN |
| 0.06888 | 0.09208 | 0.92225 |
| RegEM | Hybrid Set of Features |
|
| 0.06988 | 0.08956 | 0.92546 |
| DUCF | Morphometrics Features | EUC |
| 0.07000 | 0.09192 | 0.92232 |
| DUCF | Morphometrics Features | EUC |
| 0.06968 | 0.09197 | 0.92228 |
| DUCF | Morphometrics Features | MAN |
| 0.06921 | 0.09211 | 0.92210 |
| DUCF | Hybrid Set of Features | MAN |
| 0.06998 | 0.09194 | 0.92225 |
| LRMC | Hybrid Set of Features |
|
| 0.07109 | 0.09216 | 0.92155 |
| DUCF | Hybrid Set of Features | EUC |
| 0.07155 | 0.09360 | 0.91945 |
| DUCF | Morphometrics Features | EUC |
| 0.06778 | 0.09808 | 0.91280 |
| DUCF | Hybrid Set of Features | MAN |
| 0.06780 | 0.09851 | 0.91236 |
| DUCF | Morphometrics Features | EUC |
| 0.06670 | 0.10956 | 0.89329 |
| DUCF | Morphometrics Features | COS |
| 0.06634 | 0.11000 | 0.89831 |
| DUCF | Hybrid Set of Features | MAN |
| 0.06714 | 0.10987 | 0.89339 |
| DUCF | Hybrid Set of Features | PCC |
| 0.06633 | 0.11064 | 0.89514 |
| DUCF | Hybrid Set of Features | MRC |
| 0.06626 | 0.11107 | 0.89418 |
| Attribute Median | Missing Features |
|
| 0.07683 | 0.10673 | 0.89698 |
| Attribute Mean | Missing Features |
|
| 0.08305 | 0.10335 | 0.90032 |
| Attribute Winsorized Mean | Missing Features |
|
| 0.08325 | 0.10423 | 0.89998 |
| Zero Value | Missing Features |
|
| 0.44131 | 0.54957 |
|
Note:
Existing methods are highlighted with a gray background color.
Predefined thresholds for the reliability scale.
| Clinical test type | ||
| MMSE | 2.162 | 2.945 |
| GDS | 1.179 | 1.606 |
| CDR | 0.393 | 0.535 |
| NIQ | 2.850 | 3.882 |
| FAQ | 2.948 | 4.016 |