| Literature DB >> 33257744 |
Shuqing Chen1, Daniel Stromer2, Harb Alnasser Alabdalrahim2, Stefan Schwab3, Markus Weih3, Andreas Maier2.
Abstract
Dementia is one of the most common neurological syndromes in the world. Usually, diagnoses are made based on paper-and-pencil tests and scored depending on personal judgments of experts. This technique can introduce errors and has high inter-rater variability. To overcome these issues, we present an automatic assessment of the widely used paper-based clock-drawing test by means of deep neural networks. Our study includes a comparison of three modern architectures: VGG16, ResNet-152, and DenseNet-121. The dataset consisted of 1315 individuals. To deal with the limited amount of data, which also included several dementia types, we used optimization strategies for training the neural network. The outcome of our work is a standardized and digital estimation of the dementia screening result and severity level for an individual. We achieved accuracies of 96.65% for screening and up to 98.54% for scoring, overcoming the reported state-of-the-art as well as human accuracies. Due to the digital format, the paper-based test can be simply scanned by using a mobile device and then be evaluated also in areas where there is a staff shortage or where no clinical experts are available.Entities:
Year: 2020 PMID: 33257744 PMCID: PMC7704614 DOI: 10.1038/s41598-020-74710-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Scoring and screening of study CDTs. The CDTs are binary classified into ‘Pass’ or ‘Fail’ for dementia screening. For scoring, six classes/scores exists – ‘Score 1’ to ‘Score 6’. The images show an exemplary CDT from the study for each score where one can see the the clocks’ get worse with increasing scores.
Subject counts for screening and scoring. The total count of subjects is 1315. Screening: The count for ‘Fail’ is larger than ‘Pass’ by about 23%. Scoring: The largest group has ‘Score 3’ while the total count of subjects is 1315.
| Screening: ‘Pass’ | Screening: ‘Fail’ | |||||
|---|---|---|---|---|---|---|
| Score 1 | Score 2 | Score 3 | Score 4 | Score 5 | Score 6 | Total |
| 240 | 351 | 445 | 152 | 92 | 35 | 1315 |
Figure 2(a) VGG16, (b) ResNet-152, and (c) DenseNet-121 neural network architectures with modified classification layer.
Averaged binary classification accuracy results for the experiments with VGG16, ResNet-152 and DenseNet-121.
| Index | Data distribution | VGG16 | ResNet-152 | DenseNet-121 |
|---|---|---|---|---|
| 1 | randomly | 0.9055 | 0.9405 | 0.9601 |
| 2 | t-SNE | 0.8577 | 0.8552 | 0.9521 |
| 3 | PCA | 0.9102 | 0.9422 | 0.9640 |
| 4 | LLE | 0.9359 | 0.9490 |
Figure 3Average confusion matrix visualization of a five-fold cross validation. (a) The resulting confusion matrix of the best performing classifier for screening for dementia—DenseNet-121 with LLE. (b) The confusion matrix for scoring dementia with DenseNet-121 and LLE. (c) In the case where off-diagonal mislabelling is considered to be correct, the accuracy dramatically increases.
Above: dementia scoring results for DenseNet-121 and LLE. Bold denotes the highest result. Below: dementia scoring with off-diagonal acceptance for DenseNet-121 and LLE. The measures show very accurate results.
| Class | Precision | Recall | f1-score | AUC |
|---|---|---|---|---|
| Score 1 | 0.67 | 0.79 | 0.73 | 0.84 |
| Score 2 | 0.67 | 0.51 | 0.58 | 0.72 |
| Score 3 | ||||
| Score 4 | 0.76 | 0.72 | 0.74 | 0.84 |
| Score 5 | 0.64 | 0.63 | 0.63 | 0.80 |
| Score 6 | 0.74 | 0.78 | 0.76 | 0.88 |
| Index | Used parameters | DenseNet-121 | ResNet-152 | VGG16 | Data distribution |
|---|---|---|---|---|---|
| 1 | Optimization algorithm | RMSprop | Adam | SGD | randomly |
| Learning rate | 0.0001 | 0.0005 | 0.001 | ||
| Learning rate scheduler | StepLR | StepLR | StepLR | ||
| Step size in learning rate scheduler | 7 | 7 | 7 | ||
| Batch size | 16 | 16 | 4 | ||
| Loss function | Cross entropy | Cross entropy | Cross entropy | ||
| 2 | Optimization algorithm | RMSprop | Adam | SGD | t-SNE |
| Learning rate | 0.0001 | 0.0005 | 0.001 | ||
| Learning rate scheduler | StepLR | StepLR | StepLR | ||
| Step size in learning rate scheduler | 7 | 7 | 7 | ||
| Batch size | 16 | 16 | 4 | ||
| Loss function | Cross entropy | Cross entropy | Cross entropy | ||
| 3 | Optimization algorithm | RMSprop | Adam | SGD | PCA |
| Learning rate | 0.0001 | 0.0005 | 0.001 | ||
| Learning rate scheduler | StepLR | StepLR | StepLR | ||
| Step size in learning rate scheduler | 7 | 7 | 7 | ||
| Batch size | 16 | 16 | 4 | ||
| Loss function | Cross entropy | Cross entropy | Cross entropy | ||
| 4 | Optimization algorithm | RMSprop | Adam | SGD | LLE |
| Learning rate | 0.0001 | 0.0005 | 0.001 | ||
| Learning rate scheduler | StepLR | StepLR | StepLR | ||
| Step size in learning rate scheduler | 7 | 7 | 7 | ||
| Batch size | 16 | 16 | 4 | ||
| Loss function | Cross entropy | Cross entropy | Cross entropy |