| Literature DB >> 34885164 |
John Adeoye1, Mohamad Koohi-Moghadam2, Anthony Wing Ip Lo3, Raymond King-Yin Tsang4, Velda Ling Yu Chow5, Li-Wu Zheng1, Siu-Wai Choi1, Peter Thomson6, Yu-Xiong Su1.
Abstract
Machine-intelligence platforms for the prediction of the probability of malignant transformation of oral potentially malignant disorders are required as adjunctive decision-making platforms in contemporary clinical practice. This study utilized time-to-event learning models to predict malignant transformation in oral leukoplakia and oral lichenoid lesions. A total of 1098 patients with oral white lesions from two institutions were included in this study. In all, 26 features available from electronic health records were used to train four learning algorithms-Cox-Time, DeepHit, DeepSurv, random survival forest (RSF)-and one standard statistical method-Cox proportional hazards model. Discriminatory performance, calibration of survival estimates, and model stability were assessed using a concordance index (c-index), integrated Brier score (IBS), and standard deviation of the averaged c-index and IBS following training cross-validation. This study found that DeepSurv (c-index: 0.95, IBS: 0.04) and RSF (c-index: 0.91, IBS: 0.03) were the two outperforming models based on discrimination and calibration following internal validation. However, DeepSurv was more stable than RSF upon cross-validation. External validation confirmed the utility of DeepSurv for discrimination (c-index-0.82 vs. 0.73) and RSF for individual survival estimates (0.18 vs. 0.03). We deployed the DeepSurv model to encourage incipient application in clinical practice. Overall, time-to-event models are successful in predicting the malignant transformation of oral leukoplakia and oral lichenoid lesions.Entities:
Keywords: artificial intelligence; machine learning; oral cancer; oral leukoplakia; oral lichenoid lesions
Year: 2021 PMID: 34885164 PMCID: PMC8657223 DOI: 10.3390/cancers13236054
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Input features, variable category, and missing data.
| Input Feature | Type | Missing Instance | Handling Technique |
|---|---|---|---|
| Age | Continuous | 0 | NA |
| Sex | Binary | 0 | NA |
| Tobacco smoking | Binary | 2 | One-hot transformation |
| Alcohol drinking | Categorical (nominal) | 33 | |
| Patient category | Categorical (nominal) | 0 | NA |
| Risk-habit indulgence | Categorical (nominal) | 0 | NA |
| Previous malignancy | Categorical (nominal) | 0 | NA |
| Charlson Comorbidity Index (CCI) | Continuous | 0 | NA |
| Hypertension status | Binary | 0 | NA |
| Diabetes Mellitus status | Binary | 0 | NA |
| Hyperlipidemia status | Binary | 0 | NA |
| Autoimmune disease status | Binary | 0 | NA |
| Viral hepatitis status | Binary | 0 | NA |
| Family history of malignancy | Binary | 592 | Variable elimination |
| Type of lesion | Binary | 0 | NA |
| Clinical subtype of lichenoid lesion | Categorical (nominal) | 0 | NA |
| Tongue/FOM involved | Binary | 0 | NA |
| Labial/buccal mucosa involved | Binary | 0 | NA |
| Retromolar area involved | Binary | 0 | NA |
| Gingiva involved | Binary | 0 | NA |
| Palate involved | Binary | 0 | NA |
| Number of lesions | Categorical (ordinal) | 0 | NA |
| Lesion size | Continuous | 464 | Variable elimination |
| Presence of ulcers or erosions | Binary | 0 | NA |
| Lesion border status | Binary | 679 | Variable elimination |
| Presence of induration | Binary | 0 | NA |
| Treatment at diagnosis | Categorical (nominal) | 0 | NA |
| Recurrence after surgical excision | Binary | 0 | NA |
| Number of recurrences | Categorical (ordinal) | 0 | NA |
| Oral epithelial dysplasia at diagnosis | Categorical (nominal) | 0 | NA |
| Oral epithelial dysplasia detected during follow-up | Categorical (nominal) | 0 | NA |
NA—Not applicable; FOM—Floor of the mouth.
Demographic, clinical, and pathologic characteristics of all patients with oral leukoplakia and lichenoid lesions used to train learning algorithms.
| Variables | N = 716 | |
|---|---|---|
| N (%) | ||
| Median age (IQR) | 58 (49–67) | |
| Gender | Female | 401 (56.0) |
| Male | 315 (44.0) | |
| Patient category | NSND | 469 (65.5) |
| SD | 247 (34.5) | |
| Continued risk habits following diagnosis | Yes | 14 (2.0) |
| No | 167 (23.3) | |
| Not applicable | 535 (74.7) | |
| Previous malignancy | Head and neck tumors | 21 (2.9) |
| Other tumors | 46 (6.4) | |
| Hematologic malignancies | 23 (3.2) | |
| No malignancy | 626 (87.4) | |
| Charlson comorbidity index—mean (SD) | 0.64 (1.02) | |
| Hypertension | 211 (29.5) | |
| Diabetes mellitus | 111 (15.5) | |
| Hyperlipidemia | 122 (17.0) | |
| Autoimmune disease | 42 (5.9) | |
| Viral hepatitis infection | 69 (9.6) | |
| Lesion | Oral leukoplakia | 389 (54.3) |
| Oral lichen planus/oral lichenoid lesion | 327 (45.7) | |
| Clinical subtype of lichenoid lesion | Reticular/Papular | 100 (14.0) |
| Erosive/Atrophic | 142 (19.8) | |
| Plaque | 85 (11.9) | |
| Tongue/FOM | 245 (34.2) | |
| Buccal/Labial mucosa | 407 (56.8) | |
| Retromolar area | 26 (3.6) | |
| Gingiva | 88 (12.3) | |
| Palate | 23 (3.2) | |
| Number of lesions | Single | 469 (65.5) |
| Bilateral or double | 210 (29.3) | |
| Multiple | 37 (5.2) | |
| Presence of ulcers or erosions | 228 (31.8) | |
| Induration | 47 (6.6) | |
| Treatment | Surgical excision | 221 (30.9) |
| Medical | 195 (27.2) | |
| No treatment | 300 (41.9) | |
| Post-excision recurrence | 42 (19.0) | |
| Number of recurrences | 1 | 30 (4.2) |
| 2 | 7 (1.0) | |
| 3 | 4 (0.6) | |
| 4 | 1 (0.1) | |
| Oral epithelial dysplasia at diagnosis | Absent | 641 (89.5) |
| Mild | 34 (4.7) | |
| Moderate | 27 (3.8) | |
| Severe | 7 (1.0) | |
| Unknown (defaulted biopsy at diagnosis) | 7 (1.0) | |
| Oral epithelial dysplasia at follow-up | Absent | 658 (91.9) |
| Mild | 11 (1.5) | |
| Moderate | 15 (2.1) | |
| Severe | 24 (3.4) | |
| Unknown (defaulted biopsy during follow-up) | 8 (1.1) | |
| Malignant transformation | 76 (10.6) | |
| AJCC TNM stage | Stage I | 47 (6.6) |
| Stage II | 9 (1.3) | |
| Stage III | 6 (0.8) | |
| Stage IV | 12 (1.7) | |
| Tumor grade | Well differentiated | 23 (3.2) |
| Moderately differentiated | 30 (4.2) | |
| Poorly differentiated | 3 (0.4) | |
| Tumor prognosis | Remission | 58 (8.1) |
| Recurrence | 6 (0.8) | |
| Cancer-related death | 6 (0.8) | |
| Second primary tumor | 6 (0.8) | |
Figure 1(a) Concordance indices across the five cross-validation folds for algorithms trained for prediction of malignant transformation. (b) Integrated Brier scores across the five cross-validation folds for algorithms trained for prediction of malignant transformation.
Performance measures of time-to-event algorithms for prediction of malignant transformation of oral leukoplakia and lichenoid lesions.
| Models | Five-Fold | Internal Validation | Repeat Five-Fold | Internal Validation | External Validation | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Concordance Index | Integrated Brier Scores (IBS) | Concordance Index | Integrated Brier Scores (IBS) | Concordance Index | Integrated Brier Scores (IBS) | Concordance Index | Integrated Brier Scores (IBS) | Concordance Index | Integrated Brier Scores (IBS) | |
| Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | |||||||
| Cox-PH | 0.70 (0.098) | 0.03 (0.005) | 0.83 | 0.03 | ||||||
| Cox-Time | 0.88 (0.034) | 0.11 (0.055) | 0.86 | 0.06 | ||||||
| DeepHit | 0.84 (0.061) | 0.17 (0.064) | 0.86 | 0.08 | ||||||
| DeepSurv | 0.88 (0.046) | 0.11 (0.053) | 0.95 | 0.04 | 0.78 (0.097) | 0.13 (0.069) | 0.92 | 0.05 | 0.82 | 0.18 |
| RSF | 0.85 (0.142) | 0.03 (0.007) | 0.91 | 0.03 | 0.89 (0.064) | 0.03 (0.006) | 0.92 | 0.03 | 0.73 | 0.03 |
Figure 2Predicted malignant-transformation-free survival plots generated for 143 patients in the internal validation cohort for (a) DeepSurv, (b) Cox-Time, and (c) DeepHit. DeepHit plots were generated following linear interpolation. The red lines in (a,b) represent the Brier scores plotted at each time point.
Figure 3Preview of web-based prognostic tool generated from the model for optimization. (a,b) HTML page for input of predictive variables; (c) display of output generated upon prediction.