| Literature DB >> 34160618 |
Lebohang Radebe1,2, Daniëlle C M van der Kaay3, Jonathan D Wasserman4,5, Anna Goldenberg1,2,6,7.
Abstract
OBJECTIVE: To develop a machine learning tool to integrate clinical data for the prediction of non-benign thyroid cytology and histology. CONTEXT: Papillary thyroid carcinoma is the most common endocrine malignancy. Since most nodules are benign, the challenge for the clinician is to identify those most likely to harbor malignancy while limiting exposure to surgical risks among those with benign nodules.Entities:
Keywords: thyroid nodule malignancy
Mesh:
Year: 2021 PMID: 34160618 PMCID: PMC8824766 DOI: 10.1210/clinem/dgab435
Source DB: PubMed Journal: J Clin Endocrinol Metab ISSN: 0021-972X Impact factor: 5.958
Historical cytologic outcomes based on predictions made on clinical impressions for all patients
| Prediction based on clinical impression | |||
|---|---|---|---|
| Benign | Nonbenign/Uncertain | ||
|
| Benign | 5 | 23 |
| Nonbenign | 0 | 39 |
If a nodule was suspicious and therefore deemed nonbenign or uncertain (based on clinical suspicion and/or sonographic features), biopsy was performed. If nodules were strongly suspected of being benign, biopsy was deferred. Outcome was then determined by histology and/or clinical follow-up as defined in “Materials and Methods.”
Machine learning prediction of benign and nonbenign cytology
| Accuracy, % (± SD) | False-negative rate, % (± SD) | False-positive rate, % (± SD) | Area under receiver operator curve, % (± SD) | |
|---|---|---|---|---|
|
| 65.67 | 0.00 | 82.14 | 58.93 |
|
| 83.55 ± 1.58 | 12.50 ± 4.79 | 21.43 ± 9.22 | 83.04 ± 2.48 |
|
| 77.57 ± 5.07 | 10.27 ± 6.78 | 14.10 ± 5.43 | 83.78 ± 4.46 |
Compares the results of historical practice to random forest classifier and the simplified rule set using 4 measures of performance.
Example of a final rule set to determine indication for biopsy
| Rule No. | Rule | Decision | Historical No. of patients that correctly satisfy specific rule | Historical No. of patients that correctly satisfy all rules |
|---|---|---|---|---|
| 1 | Composition of nodule is entirely solid | Likely nonbenign—recommend biopsy | 12/12 | 12/12 |
| LNs appear normal | ||||
| Tumor is unifocal | ||||
| 2 | Nodule > 50% cystic | Likely benign-defer biopsy | 7/7 | 19/19 |
| Margin is regular | ||||
| LNs appear normal | ||||
| 3 | Composition of nodule is entirely solid | Likely nonbenign–recommend biopsy | 4/4 | 23/23 |
| Hypoechoic halo is either absent OR complete but not partial | ||||
| Margin is irregular/microlobulated/ spiculated | ||||
| 4 | Composition of nodule is entirely solid | Likely nonbenign–recommend biopsy | 9/10 | 32/33 |
| Margin is indistinct | ||||
| Tumor is unifocal OR multifocal (unilaterally) | ||||
| 5 | Composition of nodule is mixed solid/ cystic < 50% cyst | Likely benign–defer biopsy | 10/11 | 42/44 |
| Hypoechoic halo is absent OR complete (ie, not absent) | ||||
| Tumor is unifocal OR multifocal (bilaterally) | ||||
| 6 | Composition of nodule is entirely solid | Likely nonbenign—recommend biopsy | 6/7 | 48/51 |
| LNs are enlarged but normal appearing OR are suspicious for metastasis | ||||
| 7 | LNs are not visualized or are visualized contralateral to primary tumor (but not ipsilaterally) | Likely benign—defer biopsy | 3/4 | 51/55 |
| Tumor is multifocal (unilaterally) | ||||
| 8 | Composition of nodule is entirely solid | Likely nonbenign—recommend biopsy | 3/4 | 54/59 |
| Margin is indistinct | ||||
| Tumor is multifocal and bilateral | ||||
| 9 | Otherwise | Likely nonbenign—recommend biopsy | 3/8 | 57/67 |
Abbreviation: LN, lymph node.
aThese misclassification are the least acceptable type of error—patients classified as likely benign when they were not.
Comparison of decision making for the need for biopsy according to historical practice, current practice and our biopsy rule set model
| Cytology or 2-year follow-up | Predicted to not need/need biopsy | ||
|---|---|---|---|
| Historical data set | All patients evaluated according to current practice | Rule set model | |
|
| 5 | 13/15 | 20/8 |
|
| 0/39 | 3/36 | 2/37 |
Only patients with minimum 2-year follow-up were included.
Decision to pursue biopsy based on clinical and sonographic features.
Historical outcomes based on predictions made on clinical impressions for patients with nonmalignant cytology (Bethesda 1-5)
| Prediction based on clinical impression | |||
|---|---|---|---|
| Suspected benign—managed nonoperatively | Uncertain (cannot exclude malignancy) —underwent surgery | ||
|
|
| 13 | 29 |
|
| 0 | 11 |
If a nodule was uncertain (based on clinical suspicion and/or sonographic features and/or biopsy results), surgery was performed. If nodules were strongly suspected of being benign, surgery was deferred. Outcome was then determined by histology and/or clinical follow-up as defined in “Materials and Methods”.
Machine learning prediction for benign vs nonbenign histology
| Accuracy, % (± SD) | False-negative rate, % (± SD) | False-positive rate, % (± SD) | Area under receiver operator curve, % (± SD) | |
|---|---|---|---|---|
|
| 45.28 | 0.00 | 69.05 | 65.48 |
|
| 83.24 ± 4.33 | 29.17 ± 17.18 | 14.09 ± 8.79 | 78.37 ± 4.96 |
|
| 77.47 ± 2.71 | 11.67 ± 1.32 | 45.83 ± 20.83 | 61.64 ± 10.28 |
Compares the results of historical practice to random forest classifier and the simplified rule set using 4 measures of performance.
Final surgery decisional rule set (after biopsy)
| Rule No. | Rule | Decision | Historical No. of patients that correctly satisfy specific rule | Historical No. of patients that correctly satisfy all rules |
|---|---|---|---|---|
| 1 | Margin is regular | Likely benign–defer surgery | 27/27 | 27/27 |
| Cytology is benign or inadequate (Bethesda 1 or 2) | ||||
| 2 | There are no echogenic foci | Likely benign-defer surgery | 5/5 | 32/32 |
| Solid component is hypoechoic or markedly hypoechoic | ||||
| LNs are not visualized or are visualized contralateral to the primary tumor (but not ipsilaterally) | ||||
| Cytology is benign (Bethesda 2) | ||||
| 3 | LNs are not visualized or are visualized contralateral to primary tumor (but not ipsilaterally) | Likely benign-defer surgery | 4/4 | 36/36 |
| Cytology is inadequate (Bethesda 1) | ||||
| 4 | Solid component is hypoechoic or markedly hypoechoic | Likely nonbenign–consider surgery | 6/7 | 42/43 |
| LNs are not visualized or are visualized contralateral to the primary tumor (but not ipsilaterally) | ||||
| Cytology is indeterminate (Bethesda 3-5) | ||||
| 5 | Solid component is isoechoic, hyperechoic or mixed echogenicity | Likely nonbenign–consider surgery | 3/5 | 45/48 |
| Hypoechoic halo is absent | ||||
| Margin is irregular/microlobulated/ spiculated OR indistinct | ||||
| Cytology is benign or indeterminate (Bethesda 2-5) | ||||
| 6 | Otherwise | Likely nonbenign–consider surgery | 2/5 | 47/53 |
Abbreviation: LN, lymph node.
Comparison of decision making for the need for surgery according to historical practice, Modified McGill Thyroid Nodule Score, cytology alone, and our surgery rule set model
| Histology or 2-year follow-up | Predicted to not need/need surgery | |||
|---|---|---|---|---|
| All patients evaluated according to historical practice | Modified McGill Thyroid Nodule Score | Cytology alone | Rule set model | |
|
| 13/29 | 36/6 | 40/14 | 36/6 |
| 39/3 | ||||
|
| 0/11 | 3/33 | 2/22 | 0/11 |
| 4/32 |