| Literature DB >> 35455730 |
Neel Shimpi1, Ingrid Glurich1, Reihaneh Rostami2, Harshad Hegde3, Brent Olson4, Amit Acharya5.
Abstract
Oral cavity cancer (OCC) is associated with high morbidity and mortality rates when diagnosed at late stages. Early detection of increased risk provides an opportunity for implementing prevention strategies surrounding modifiable risk factors and screening to promote early detection and intervention. Historical evidence identified a gap in the training of primary care providers (PCPs) surrounding the examination of the oral cavity. The absence of clinically applicable analytical tools to identify patients with high-risk OCC phenotypes at point-of-care (POC) causes missed opportunities for implementing patient-specific interventional strategies. This study developed an OCC risk assessment tool prototype by applying machine learning (ML) approaches to a rich retrospectively collected data set abstracted from a clinical enterprise data warehouse. We compared the performance of six ML classifiers by applying the 10-fold cross-validation approach. Accuracy, recall, precision, specificity, area under the receiver operating characteristic curve, and recall-precision curves for the derived voting algorithm were: 78%, 64%, 88%, 92%, 0.83, and 0.81, respectively. The performance of two classifiers, multilayer perceptron and AdaBoost, closely mirrored the voting algorithm. Integration of the OCC risk assessment tool developed by clinical informatics application into an electronic health record as a clinical decision support tool can assist PCPs in targeting at-risk patients for personalized interventional care.Entities:
Keywords: machine learning; oral cancer; patient care management; precision medicine; risk assessment
Year: 2022 PMID: 35455730 PMCID: PMC9032985 DOI: 10.3390/jpm12040614
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Shows the features and their associated ranks.
| Feature | Gain Ratio |
|---|---|
| Diseases of lips | 0.1429 |
| Disorders of oral soft tissue | 0.1277 |
| Leukoplakia in oral mucosa | 0.1232 |
| Presence of swelling or lump in mouth | 0.0483 |
| Throat pain | 0.0398 |
| Esophageal reflux | 0.0287 |
| Stomatitis and mucositis | 0.0099 |
| Radiation therapy | 0.0087 |
| Oral aphthae | 0.0032 |
| Oral thrush | 0.0021 |
| Tobacco use | 0.0008 |
| Chemotherapy | 0.0003 |
| Alcohol abuse | 0.0000 |
Figure 1Summary of the performance metrics for six classifiers: (a) Accuracy; (b) Recall/Sensitivity; (c) Precision and (d) Specificity.
Figure 2Performance of the classifiers in terms of four evaluation metrics on third model.
Figure 3Area under RP curve and ROC curve for MLP, AdaBoost, and voting algorithm.
Figure 4Result of applying (a) MLP, (b) AdaBoost, and (c) voting algorithm on third model.
Performance metrics for various studies that used machine learning algorithms.
| Reference No | N | ML Algorithms Used | Sensitivity | Specificity |
|---|---|---|---|---|
| Speight et al. [ | 1662 | Neural network (NN) | 80% | 77% |
| Kent et al. [ | 1939 | Genetic programming (GP) | 73% GP | 65% GP |
| Tseng et al. [ | 1099 | Decision tree (DT) | Total accuracy: DT: 95.8% | |
| Rosma et al. [ | 191 | Fuzzy NNs | 46% fuzzy NN | 85% fuzzy NN |