| Literature DB >> 35626392 |
Aiham Taleb1, Csaba Rohrer2, Benjamin Bergner1, Guilherme De Leon3, Jonas Almeida Rodrigues4, Falk Schwendicke2, Christoph Lippert1,5, Joachim Krois2.
Abstract
High annotation costs are a substantial bottleneck in applying deep learning architectures to clinically relevant use cases, substantiating the need for algorithms to learn from unlabeled data. In this work, we propose employing self-supervised methods. To that end, we trained with three self-supervised algorithms on a large corpus of unlabeled dental images, which contained 38K bitewing radiographs (BWRs). We then applied the learned neural network representations on tooth-level dental caries classification, for which we utilized labels extracted from electronic health records (EHRs). Finally, a holdout test-set was established, which consisted of 343 BWRs and was annotated by three dental professionals and approved by a senior dentist. This test-set was used to evaluate the fine-tuned caries classification models. Our experimental results demonstrate the obtained gains by pretraining models using self-supervised algorithms. These include improved caries classification performance (6 p.p. increase in sensitivity) and, most importantly, improved label-efficiency. In other words, the resulting models can be fine-tuned using few labels (annotations). Our results show that using as few as 18 annotations can produce ≥45% sensitivity, which is comparable to human-level diagnostic performance. This study shows that self-supervision can provide gains in medical image analysis, particularly when obtaining labels is costly and expensive.Entities:
Keywords: annotation efficient deep learning; data driven approaches; dental caries classification; representation learning; self-supervised learning; unsupervised methods
Year: 2022 PMID: 35626392 PMCID: PMC9140204 DOI: 10.3390/diagnostics12051237
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart of self-supervised learning stages. First, a deep learning model, e.g., CNN, is trained on unlabeled data using a proxy task. Then, the obtained knowledge (representations) is transferred into a target downstream task.
Figure 2Illustration scheme of the three self-supervised algorithms and how to fine-tune the resulting encoder CNN. (a) SimCLR relies on attracting the views of each image together and repelling them from the views of other images. (b) In BYOL the target network calculates moving averages of the online network, which is updated with loss gradients. (c) Barlow Twins computes the cross-correlation matrix of two batches of image views and minimizes its difference to the identity matrix. (d) The obtained CNN encoder is fine-tuned on input tooth images for caries classification.
Caries classification results when fine-tuning on the full training set. We highlight in bold the best models.
| Method | Sensitivity | Specificity | ROC-AUC |
|---|---|---|---|
| Baseline | 51.80 |
| 71.50 |
| SimCLR | 57.20 | 89.30 | 73.30 |
| BYOL | 54.60 |
| 73.00 |
| Barlow Twins |
| 88.90 |
|
Figure 3Evaluation results for data-efficiency by successively increasing the size of the training set. Each row represents a different caries prevalence group (∼20%, 50%, 75%), and the columns are evaluation metrics (sensitivity, specificity, roc-auc). The plots show the mean for each metric in thick lines and the 95% confidence interval (CI) as bands.
Caries classification results when fine-tuning on varying quantities of labeled samples (numbers of #teeth/#BWRs). The results are grouped by the prevalence of the caries (∼20%, 50%, 75%). We highlight in bold the best models in each row (i.e., for each fine-tuning dataset size).
| Prev. | #Teeth/#BWRs | SimCLR | BYOL | Barlow Twins | Baseline | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sens. | Spc. | Roc | Sens. | Spc. | Roc | Sens. | Spc. | Roc | Sens. | Spc. | Roc | ||
| ∼20% | 152/18 | 40.02 | 64.27 | 52.15 | 44.78 | 60.45 |
|
| 58.43 | 52.35 | 32.87 |
| 52.49 |
| 305/37 |
| 55.39 |
| 48.35 | 56.26 | 52.31 | 41.01 | 62.34 | 51.68 | 41.74 |
| 52.52 | |
| 1.5K/190 | 46.40 | 63.78 | 55.09 |
| 51.16 |
| 45.60 | 63.46 | 54.53 | 42.45 |
| 53.53 | |
| 3K/380 | 52.99 | 59.79 | 56.39 |
| 59.61 |
| 50.08 | 59.29 | 54.69 | 46.61 |
| 55.23 | |
| 15K/1.9K | 48.96 | 75.28 | 62.12 |
| 73.44 |
| 51.34 | 75.40 | 63.37 | 44.78 |
| 61.89 | |
| 30K/3.8K |
| 79.42 | 67.11 | 51.18 | 83.98 | 67.58 | 50.88 |
|
| 50.19 | 81.57 | 65.88 | |
| 50% | 152/18 | 58.94 | 48.19 | 53.56 | 62.09 | 48.28 | 55.19 |
| 48.28 |
| 48.85 |
| 52.47 |
| 305/37 | 59.62 | 48.24 | 53.93 |
| 48.83 | 56.06 | 60.07 |
|
| 56.00 | 49.69 | 52.84 | |
| 1.5K/190 | 62.59 | 49.85 | 56.22 |
| 55.93 |
| 60.92 |
| 58.68 | 56.24 | 54.80 | 55.52 | |
| 3K/380 |
| 49.03 | 58.49 | 65.65 | 60.02 | 62.83 | 65.91 |
|
| 58.00 | 54.43 | 56.21 | |
| 15K/1.9K |
| 63.29 | 64.46 | 60.71 |
|
| 61.34 | 73.01 | 67.17 | 58.99 | 73.28 | 66.13 | |
| 30K/3.8K | 62.33 | 72.40 | 67.37 |
| 72.57 | 68.42 | 59.13 |
|
| 59.86 | 75.90 | 67.88 | |
| 75% | 152/18 | 64.80 | 42.37 | 53.59 | 70.64 | 38.56 | 54.60 |
| 30.20 |
| 57.15 |
| 52.61 |
| 305/37 | 70.07 |
| 55.59 | 74.00 | 35.67 | 54.84 |
| 31.59 |
| 68.68 | 37.58 | 53.13 | |
| 1.5K/190 | 72.42 |
| 55.96 | 77.32 | 37.04 | 57.18 |
| 33.59 |
| 69.32 | 38.10 | 53.71 | |
| 3K/380 | 75.65 | 40.85 | 58.25 | 78.68 | 41.59 | 60.14 |
|
|
| 71.25 | 39.66 | 55.45 | |
| 15K/1.9K | 78.02 | 51.35 | 64.69 |
| 51.16 | 66.39 | 79.62 | 55.13 |
| 74.45 |
| 65.15 | |
| 30K/3.8K | 79.04 | 54.51 | 66.77 |
| 54.14 | 67.72 | 78.66 |
|
| 76.94 | 58.31 | 67.62 | |