| Literature DB >> 35741125 |
Csaba Rohrer1, Joachim Krois1,2, Jay Patel3, Hendrik Meyer-Lueckel4, Jonas Almeida Rodrigues1,5, Falk Schwendicke1,2.
Abstract
Convolutional Neural Networks (CNNs) such as U-Net have been widely used for medical image segmentation. Dental restorations are prominent features of dental radiographs. Applying U-Net on the panoramic image is challenging, as the shape, size and frequency of different restoration types vary. We hypothesized that models trained on smaller, equally spaced rectangular image crops (tiles) of the panoramic would outperform models trained on the full image. A total of 1781 panoramic radiographs were annotated pixelwise for fillings, crowns, and root canal fillings by dental experts. We used different numbers of tiles for our experiments. Five-times-repeated three-fold cross-validation was used for model evaluation. Training with more tiles improved model performance and accelerated convergence. The F1-score for the full panoramic image was 0.7, compared to 0.83, 0.92 and 0.95 for 6, 10 and 20 tiles, respectively. For root canals fillings, which are small, cone-shaped features that appear less frequently on the radiographs, the performance improvement was even higher (+294%). Training on tiles and pooling the results thereafter improved pixelwise classification performance and reduced the time to model convergence for segmenting dental restorations. Segmentation of panoramic radiographs is biased towards more frequent and extended classes. Tiling may help to overcome this bias and increase accuracy.Entities:
Keywords: deep learning; dental restorations; image segmentation; machine learning
Year: 2022 PMID: 35741125 PMCID: PMC9221749 DOI: 10.3390/diagnostics12061316
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Experimental workflow and an example showing processing an image with 12 tiles.
Figure 2Visualization of the five-times-repeated three-fold cross-validation. For each hold-out test set, the remaining data is split into three different train and validation sets. Each datapoint is used as test data at some point.
Figure 3Mean and 95% CI F1-score (a) and sensitivity (b) for each restoration class as well as for all classes jointly.