| Literature DB >> 35626239 |
Sanjeev B Khanagar1,2, Khalid Alfouzan2,3, Mohammed Awawdeh1,2, Lubna Alkadi2,3, Farraj Albalawi1,2, Abdulmohsen Alfadley2,3.
Abstract
Evolution in the fields of science and technology has led to the development of newer applications based on Artificial Intelligence (AI) technology that have been widely used in medical sciences. AI-technology has been employed in a wide range of applications related to the diagnosis of oral diseases that have demonstrated phenomenal precision and accuracy in their performance. The aim of this systematic review is to report on the diagnostic accuracy and performance of AI-based models designed for detection, diagnosis, and prediction of dental caries (DC). Eminent electronic databases (PubMed, Google scholar, Scopus, Web of science, Embase, Cochrane, Saudi Digital Library) were searched for relevant articles that were published from January 2000 until February 2022. A total of 34 articles that met the selection criteria were critically analyzed based on QUADAS-2 guidelines. The certainty of the evidence of the included studies was assessed using the GRADE approach. AI has been widely applied for prediction of DC, for detection and diagnosis of DC and for classification of DC. These models have demonstrated excellent performance and can be used in clinical practice for enhancing the diagnostic performance, treatment quality and patient outcome and can also be applied to identify patients with a higher risk of developing DC.Entities:
Keywords: artificial intelligence; dental caries; detection; diagnosis; prediction
Year: 2022 PMID: 35626239 PMCID: PMC9139989 DOI: 10.3390/diagnostics12051083
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Description of the PICO (P = Population, I = Intervention, C = Comparison, O = Outcome) elements.
|
| What is the performance of AI-based models designed for detection, diagnosis and prediction of DC? |
|
| Patients who underwent investigation for DC |
|
| AI applications for detection, diagnosis and prediction of DC |
|
| Expert/Specialist opinions, Reference standards/models |
|
| Measurable or predictive outcomes such as accuracy, sensitivity, specificity, ROC = receiver operating characteristic curve, AUC = area under the curve, AUROC = area under the receiver operating characteristic, ICC = intraclass correlation coefficient, IOU = intersection-over-union, PRC = precision recall curve, statistical significance, F1 scores, vDSC = volumetric dice similarity coefficient, sDSC = surface dice similarity coefficient, PPV= positive predictive value, NPV = negative predictive value, mean decreased gini (MDG), mean decreased accuracy (MDA) coefficients, intersection over union (IoU), dice coefficient |
Figure 1Flow chart for screening and selection of articles.
Details of the studies that have used AI-based models for detection, diagnosis and prediction of DC.
| Serial No. | Authors | Year of Publication | Study Design | Algorithm Architecture | Objective of the Study | No. of Patients/Images/Photographs for Testing | Study Factor | Modality | Comparison if Any | Evaluation Accuracy/Average Accuracy/Statistical Significance | Results (+)Effective, (−)Non Effective (N) Neutral | Outcomes | Authors Suggestions/Conclusions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Zanella-Calzada et al. [ | 2018 | Prospective cohort | ANNs | To analyze the dietary and demographic factors that determine oral health and DC | 6868 cases for training, 2944 cases for testing | DC lesions | Data sets | National Health and Nutrition Examination Survey Data | Accuracy of 0.69, AUC values of 0.69 and 0.75 | (+)Effective | This ANNs-based model demonstrated high accuracy in diagnosing DC based on dietary and demographic factors | This model can help dentists by providing an easy, free and fast tool for the diagnosis of DC |
| 2 | Lee et al. [ | 2018 | Retrospective cohort | DCNNs | Deep CNN algorithms (GoogLeNet Inception v3) for detection and diagnosis of DC on periapical radiographs | 2400 periapical radiographs for training, 600 periapical radiographs for testing | Gray/white matter and lesions | Periapical radiographs | Not mentioned | Accuracy of 89.0%, 88.0%, 82.0% and AUC of 0.917, 0.890, 0.845 for premolar, molar, and both premolar and molar models respectively. | (+)Effective | This CNNs-based model demonstrated good performance in detecting DC | This models is of potential use for detection and diagnosis of DC |
| 3 | Choi et al. [ | 2018 | Retrospective cohort | CNNs | An automatic model for detection of proximal DC in periapical radiographs | 475 periapical radiographs | DC lesions | Periapical radiographs | Experts and naïve CNN approach as reference models | (+)Effective | This model was superior to the system using a naïve CNN. | This model was successful in detecting proximal DC | |
| 4 | Casalegno et al. [ | 2019 | Retrospective cohort | CNNs | DL model for the automated detection and localization of DC in near-infrared transillumination (TI) images | 217 grayscale images | DC lesions | TI images | Reference deep neural networks models and experts | Mean intersection-over-union (IOU) score of 72.7% on a 5-class segmentation task and IOU score of 49.5% and 49.0% and ROC curve of 83.6% and 85.6% for proximal and occlusal carious lesions, respectively | (+)Effective | This DL approach holds promising results for increasing the speed and accuracy of caries’ detection | This model can support dentists by providing high-throughput diagnostic assistance and improving patient outcomes |
| 5 | Hung et al. [ | 2019 | Retrospective cohort | ANNs | ML model for diagnostic prediction of root caries | 7272 cases for training and 1818 for testing | Variables | Data sets | Trained dental professionals and reference models | Accuracy of 97.1%, precision of 95.1%, sensitivity of 99.6%, specificity of 94.3% and AUC of 0.997 | (+)Effective | This model demonstrated the best performance | This model can be implemented for clinical diagnosis and can be utilized by dental and non-dental professionals |
| 6 | Cantu et al. [ | 2020 | Retrospective cohort | CNNs | To assess the performance pf a DL model for detecting DC on bitewing radiographs | 3293 Bitewing radiographs for training and 252 for testing | DC lesions | Bitewing radiographs | 4 experienced dentists | Accuracy of 0.80; sensitivity of 0.75, specificity of 0.83 | (+)Effective | This CNN-based model was significantly more accurate than the experienced dentists. | This model can assist dentists particularly in the detection of initial caries lesions on bitewings |
| 7 | Geetha et al. [ | 2020 | Cross sectional | ANNs | ANN based model for diagnosing DC in digital radiographs | 145 digital radiographs | DC lesions | Intraoral digital images (digital radiographs) | Experienced dentist | Accuracy of 97.1%, false positive (FP) rate of 2.8%, ROC area of 0.987 and PRC area of 0.987 | (+)Effective | This model based on back-propagation neural network can predict DC more accurately | Improved algorithms and high quantity and quality datasets may demonstrate better results in clinical dental practice |
| 8 | Schwendicke et al. [ | 2020 | Cross sectional | CNNs | CNN based model for detecting DC in near-infrared-light transillumination (NILT) images. | 226 extracted teeth images | DC lesions | NILT images | 2 experienced dentists | Mean AUC was 0.74. Sensitivity of 0.59, specificity of 0.76, PPV was 0.63, NPV 0.73 | (+)Effective | These models (Resnet18 and | These models can be of relevance in settings, like schools, care homes or rural outpost centers |
| 9 | Karhade D.S et al. [ | 2021 | Retrospective cohort | ANNs | Automated MLalgorithm for classification early childhood caries (ECC) | 6040 | Variables | Data sets | External National Health and Nutrition Examination Survey (NHANES) dataset/ 10 trained and calibrated clinical examiners | AUC of (0.74), Sensitivity of (0.67), and PPV of (0.64) | (+)Effective | This ML model’s performance was similar to the reference model | This model is valuable for ECC screening |
| 10 | Duong et al. [ | 2021 | Cross sectional | ANNs | An automated ML for detecting DC using smart phone photographs | 620 teeth | DC lesions | Photos using smartphone | 4 trained and calibrated dentists | Accuracy of 92.37%, sensitivity 88.1% and | (+)Effective | This model demonstrated an auspicious potential for clinical diagnostics with reasonable accuracy and minimal cost | This support vector machine requires further improvement and verification |
| 11 | Duong et al. [ | 2021 | Cross sectional | CNNs | AI based model for detection and classification of DC using smart phone photographs | 587 extracted teeth | DC lesions | Photos using smartphone | Trained dentists | Accuracy of 87.39%, sensitivity of 89.88%, and specificity of | (+)Effective | This model demonstrated good accuracy in the detection of DC. | This model needs to be trained with both in vivo and vitro images. There is a need for developing a good imaging technique for occlusal surfaces |
| 12 | Ramos-Gomez et al. [ | 2021 | Retrospective cohort | ANNs | ML algorithm (Random forest) for identifying survey items that predict DC | 182 subjects | Variables | Data sets | 2 trained dentists | For classifying active caries parent’s age (MDG = 0.84; MDA = 1.97), unmet needs (MDG = 0.71; MDA = 2.06). | (+)Effective | This model showed potential for screening DC | This model showed potential for screening for DC among children using survey answers |
| 13 | Askar et al. [ | 2021 | Cross sectional | CNNs | DL model for detecting white spot lesions using digital camera photographs | 51 patients | White spot Lesions | Digital camera images | Trained dentist | Detecting any lesions (PPV/NPV) between 0.77–0.80. | (+)Effective | This model showed satisfying accuracy to detect white spot lesions, particularly fluorosis | There is a need for more data sets for generalizability |
| 14 | Chen et al. [ | 2021 | Retrospective cohort | CNNs | DL model for detecting dental disease on periapical radiographs | 2900 | DC/ | Digital periapical radiographs | Reference models/trained experts | DC and PDL were detected with precision, recall, and average precision values less than 0.25 for mild level, 0.2–0.3 for moderate level and 0.5–0.6 for severe level | (+)Effective | These model can detect DC using periapical radiographs | These models are best utilized for the detection of lesions with severe levels. Hence the models need more training at each level |
| 15 | Devlin et al. [ | 2021 | Randomized control trial | CNNs | To detect enamel-only proximal DC using AssistDent artificial intelligence (AI) software on bitewing radiographs | 24 patients | DC lesions | Bitewing radiographs | 6 dental specialists (for grading) | High accuracy of diagnosis with sensitivity of 71% and decrease in specificity of 11% are statistically significant ( | (+)Effective | This model significantly improved dentists’ ability to detect enamel-only proximal caries | Can be used as a supportive tool by dentist to practice preventive dentistry |
| 16 | Bayrakdar et al. [ | 2021 | Retrospective cohort | CNNs | DL models (VGG-16 and U-Net) for automatic caries detection and segmentation on bitewing radiographs | 621 patients (2325 images, 2072 for training, 200 for validating and 53 for testing) | DC lesions | Bitewing radiographs | 5 experienced experienced observers | For caries detection sensitivity 0.84, precision 0.81, and F-measure rates 0.84 and for caries segmentation were sensitivity 0.86, precision 0.84, and F-measure rates 0.84 | (+)Effective | These models can accurately detect DC. There were also beneficial in the segmentation of DC | The performance of these models was superior to specialists and can be beneficial for clinicians in clinical decision making |
| 17 | Zaorska et al. [ | 2021 | Prospective cohort | CNNs | AI model for predicting DC based on chosen polymorphisms | 95 patients | DC lesions | Data sets | Logistic regression model | Sensitivity of 90, specificity of 96% overall accuracy of 93% ( | (+)Effective | This model displayed high accuracy in predicting DC | The knowledge of potential risk status could be useful in designing oral hygiene practices and recommending dietery habits for patients |
| 18 | Pang et al. [ | 2021 | Prospective cohort | ANNs | AI based ML model for caries risk prediction based on environmental and genetic factors | 953 patients (633 for training and 320 for testing) | DC lesions | Data sets | Logistic regression model | AUC of 0.73 | (+)Effective | This model could accurately identify individuals at high and very high caries risk | This is a powerful tool for identifying individuals at high caries risk at community-level |
| 19 | Zheng et al. [ | 2021 | Cross sectional | CNNs | To evaluate and compare three CNNs models (VGG19, Inception V3, and ResNet18) for diagnosing deep DC. | 844 (717 for training and 127 for testing) | Deep DC lesions | Radiographs | VGG19, Inception V3, experienced dentists | Accuracy = 0.82, precision = 0.81, sensitivity = 0.85 specificity = 0.82, AUC = 0.89, | (+)Effective | CNN model ResNet18 showed good performance | With clinical parameters this model demonstrated enhanced performance |
| 20 | Lian et al. [ | 2021 | Cross sectional | CNNs | To evaluate DL methods for detecting DC lesions (nnU-Net) and classifying DC (DenseNet121) on panoramic radiographs | 1160 (1071 for training and validating, 89 for testing) | DC lesions | Panoramic radiographs | 6 expert expert dentists | IoU 0.785, Dice coefficient values of 0.663. | (+)Effective | These models displayed similar results to that of expert dentists | These models need to be explored for disease diagnosis and treatment planning |
| 21 | Moran et al. [ | 2021 | Cross sectional | CNNs | CNN model (Inception) for identifying approximal DC in bitewing radiographs | 112 (45 for testing) | DC lesions | Digital bitewing radiographs | ResNet model | Accuracy of 73.3% | (+)Effective | This model demonstrated promising results in comparison with the reference model | This model can be used for assisting clinicians in decision making |
| 22 | Mertens S et al. [ | 2021 | Randomized control trial | CNNs | CNN model for detection of proximal DC using bitewing radiographs | 140 patients (20 testing) | DC lesions | Bitewing radiographs | 5 expert expert dentists | ROC of 0.89 and sensitivity of 0.81 with statistical significance ( | (+)Effective | Dentists using AI model demonstrated statistically significant performance in comparison with other dentists | This model can increase diagnostic accuracy of dentists |
| 23 | Vinayahalingam et al. [ | 2021 | Retrospective cohort | CNNs | To evaluate CNN based model (MobileNet V2) for classifying DC on panoramic radiographs | 500 (320 for training, 80 for validating 100 for testing) | DC lesions | Panoramic radiographs | Reference standards | Accuracy of 0.87, sensitivity of 0.86, specificity of 0.88, AUC of 0.90, F1 score of 0.86 | (+)Effective | This model displayed good performance in detecting DC in third molars | This model is an initiation for developing a model that can assist clinicians in deciding on removal of third molars |
| 24 | Lee et al. [ | 2021 | Cross sectional | CNNs | To evaluate deep CNN (U-Net) models for detection of DC in bitewing radiographs | 304 for training, 50 for testing | DC lesions | Bitewing radiographs | 3 expert dentists | Precision of 63.29%; recall of 65.02%; F1-score of 64.14% | (+)Effective | This model displayed considerable performance in detecting DC | This model can help clinicians in detecting DC more accurately |
| 25 | Hur et al. [ | 2021 | Retrospective cohort | ANNs | MLmodels for predicting DC on second molars associated with impacted third molars in CBCT and panoramic radiographs | 1321 patients (2642 impacted mandibular third molars,1850 for training and 792 for testing) | DC lesions | Panoramic radiographs and CBCT images | Single predictors as reference | ROC of 0.88 to 0.89 | (+)Effective | This ML model demonstrated significantly superior performance in prediction of DC in comparison to other models | This model can be of great value for clinicians for preventive treatment and decision making on third molars |
| 26 | De Araujo Faria et al. [ | 2021 | Retrospective cohort | ANNs | AI based model for prediction and detection of radiation-related caries (RRC) on panoramic radiographs | 15 head and neck cancer (HNC) patients | DC lesions | Digital Panoramic radiographs | 2 Expert dentists | For detection accuracy of 98.8% AUC = 0.9869 and for prediction and accuracy of 99.2%, AUC = 0.9886 | (+)Effective | This model displayed high accuracy in detection and diagnosis of RRC | These models can aid in designing preventive dental care for patients with HNC |
| 27 | Wu et al. [ | 2021 | Prospective cohort | ANNs | MLmodel identifying caries-related oral microbes in cross-sectional mother-child dyads | 37 salivary samples and 36 plaque samples for children DC prediction models. | DC lesions | Data sets | Reference standards | AUC of 0.82 for the child’s saliva model, AUC of 0.78 for the child’s plaque model, and AUC of 0.73 for the mother’s plaque model | (+)Effective | These models achieved desirable results for both mother and children | More variables need to be considered in the future for fine-tuning the models |
| 28 | Mao et al. [ | 2021 | Cross sectional | CNNs | CNN based model for identifying DC and restorations on bitewing radiographs | 278 images (70% for training and 30% for testing) | DC lesions | Bitewing radiographs | Reference models GoogleNet, Vgg19, and ResNet50 | Accuracy of 95.56% for restoration judgment and accuracy of 90.30% for DC judgment | (+)Effective | AlexNet model demonstrated high accuracy in comparison to other models | This model can assist dentists in better decision making and treatment planning |
| 29 | Park et al. [ | 2021 | Prospective cohort | ANNs | ML based AI models (XGBoost, random forest, LightGBM algorithms and Final model) for predicting early childhood caries | 4195 (2936 for training and 1259 for testing) | DC lesions | Data sets | Traditional regression model | AUROC = 0.774–0.785 | (+)Effective | ML-based models showed favorable performance in predicting DC | Can be useful in identifying high risk groups and implementing preventive treatments |
| 30 | Huang et al. [ | 2021 | Cross sectional | CNNs | AI based models AlexNet, VGG-16, ResNet-152, Xception, and ResNext-101 for detecting DC | 748 cross-sectional 2D images(599 for training and 149 for testing) | DC lesions | OCT and micro-CT images | 5 clinicians | ResNet-152 demonstrated highest accuracy rate of 95.21% and sensitivity of 98.85% specificity of 89.83%, and the PPV of 93.48% and NPV was 98.15%,. | (+)Effective | ResNet-152 CNNs models are better than clinicians at distinguishing pathological tooth structures using OCT images | These models can aid clinicians in providing patients with more accurate diagnoses |
| 31 | Bayraktar et al. [ | 2022 | Cross sectional | CNNs | Assess the performance of CNN based model (YOLO) for diagnosis of interproximal caries lesions on bitewing radiographs | 1000 (800 for training and 200 for testing) | DC lesions | Digital bitewing radiographs | 2 experienced dentists | Accuracy of 94.59%, sensitivity was 72.26, specificity was 98.19%, PPV was 86.58%, NPV was 95.64% and overall AUC was 87.19%. | (+)Effective | This CNN-based model showed good performance with high accuracy scores | This model can assist clinicians in diagnosing interproximal DC |
| 32 | Zhang et al. [ | 2022 | Cross sectional | CNNs | To assess the performance of CNN based model (ConvNet) for detecting DC using oral photographs | 625 Subjects (3932) oral photographs(2507 for training and 1125 for testing) | DC lesions | Oral photographs | 3 board certified dentists | AUC of 85.65%, sensitivity of 81.90% | (+)Effective | The DL model displayed promising results in detecting DC on oral photographs | This is a cost-effective tool for screening of DC |
| 33 | Kühnisch et al. [ | 2022 | Retrospective cohort | CNNs | To evaluate a (CNNs) based model for detection and categorization of DC using oral photographs | 2417 photographs (1891 for training and 479 for testing) | DC lesions | Oral photographs | Expert standards | Accuracy of 92.5%, sensitivity of 89.6; specificity of 94.3; AUC was 0.964 | (+)Effective | This DL model displayed promising accuracy in detecting DC using intraoral photographs | This model can be of potential use and feasible in future |
| 34 | Zhu et al. [ | 2022 | Retrospective cohort | CNNs | A CNNs based model CariesNet to delineate different caries degrees on panoramic radiographs | 1159 (900 for training, 135 for validating and 124 for testing) | DC lesions | Panoramic radiographs | Reference models | Mean Dice coefficient of 93.64%, accuracy of 93.61%,F1 score 92.87, precision of 94.09 and recall of 86.01 | (+)Effective | This CNN-based model effectively segmented the DC lesions from panoramic radiographs | This model was successful in segmenting even small lesions from large images |
ML = machine learning, ANNs = artificial neural networks, CNNs = convolutional neural networks, DCNNs = deep neural networks, c-index = concordance index, CT = computed tomography scans, CBCT = cone-beam computed tomography, OCT = optical coherence tomography.
Figure 2QUADAS-2 assessment of the individual risk of bias domains and applicability concerns.
Assessment of Strength of Evidence.
| Outcome | Strength of Evidence (GRADE) |
|---|---|
| Application and performance in AI models in prediction of DC [ | ⨁⨁⨁◯ |
| Application and performance in AI models in detection and diagnosis of DC | ⨁⨁⨁⨁ |
| Application and performance in classification of DC [ | ⨁⨁⨁◯ |
⨁⨁⨁⨁ Strong Evidence; ⨁⨁⨁◯ Moderate Evidence.