| Literature DB >> 32287288 |
Theodore Warsavage1, Fuyong Xing1, Anna E Barón1, William J Feser1, Erin Hirsch1, York E Miller2,3, Stephen Malkoski4, Holly J Wolf5, David O Wilson6, Debashis Ghosh1.
Abstract
We present a case study for implementing a machine learning algorithm with an incremental value framework in the domain of lung cancer research. Machine learning methods have often been shown to be competitive with prediction models in some domains; however, implementation of these methods is in early development. Often these methods are only directly compared to existing methods; here we present a framework for assessing the value of a machine learning model by assessing the incremental value. We developed a machine learning model to identify and classify lung nodules and assessed the incremental value added to existing risk prediction models. Multiple external datasets were used for validation. We found that our image model, trained on a dataset from The Cancer Imaging Archive (TCIA), improves upon existing models that are restricted to patient characteristics, but it was inconclusive about whether it improves on models that consider nodule features. Another interesting finding is the variable performance on different datasets, suggesting population generalization with machine learning models may be more challenging than is often considered.Entities:
Year: 2020 PMID: 32287288 PMCID: PMC7156089 DOI: 10.1371/journal.pone.0231468
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Receiver operating characteristic curves and associated AUC are shown for the image model predictions and risk calculator predictions on Colorado Lung Nodule Cohort.
The model Image refers to our algorithm, the Gould and McWilliams are risk calculators and the posterior models are the Gould and McWilliams models updated with information from the image model.
Reclassification of nodules in Colorado Lung Nodule Cohort from updating the McWilliams risk prediction model with the Image model.
The initial model is the McWilliams prediction model, the updated model is the McWilliams prediction score updated with Image model prediction with Bayes’ rule. More improvement is seen in the event class with probabilities 0.4 to 0.8, compared to the non-event class.
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 36 | 12 | 0 | 2 | 0 | 28 |
| [0.2,0.4) | 6 | 6 | 4 | 4 | 3 | 74 |
| [0.4,0.6) | 3 | 3 | 1 | 5 | 3 | 93 |
| [0.6,0.8) | 0 | 0 | 0 | 0 | 3 | 100 |
| [0.8,1] | 0 | 0 | 0 | 0 | 0 | NA |
| Events (n = 87) | ||||||
| Updated model | ||||||
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 10 | 5 | 1 | 1 | 0 | 41 |
| [0.2,0.4) | 4 | 8 | 8 | 3 | 1 | 67 |
| [0.4,0.6) | 2 | 2 | 6 | 6 | 14 | 80 |
| [0.6,0.8) | 0 | 3 | 1 | 5 | 7 | 69 |
| [0.8,1] | 0 | 0 | 0 | 0 | 0 | NA |
Fig 2Receiver operating characteristic curves and associated AUC are shown for the Image model and Gould model applied to the pilot PLuSS dataset.
The Image model refers to our model that only used the CT data for predictors, the Gould model is from the Gould risk calculator and the posterior model is the Gould model updated with information from the Image model.
Reclassification table for the pilot PLuSS datasets.
The initial model is based on the Gould prediction model, the updated model incorporates the predicted probabilities from our Image model into the Gould predictions. More individuals are reclassified to a higher probability class in the event group compared to the nonevent group.
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 0 | 1 | 0 | 0 | 0 | 100 |
| [0.2,0.4) | 5 | 8 | 2 | 0 | 0 | 47 |
| [0.4,0.6) | 1 | 4 | 6 | 2 | 0 | 54 |
| [0.6,0.8) | 0 | 2 | 2 | 4 | 1 | 56 |
| [0.8,1] | 0 | 1 | 1 | 1 | 1 | 75 |
| Events (n = 46) | ||||||
| Updated model | ||||||
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 1 | 0 | 1 | 0 | 0 | 50 |
| [0.2,0.4) | 5 | 4 | 6 | 1 | 0 | 75 |
| [0.4,0.6) | 1 | 2 | 8 | 2 | 1 | 43 |
| [0.6,0.8) | 0 | 0 | 2 | 3 | 3 | 62 |
| [0.8,1] | 0 | 0 | 0 | 2 | 4 | 33 |
Fig 3Receiver operating characteristic curves and associated AUC are shown for the Gould prediction model on its own, the image model on its own and the incremental value of the Image model incorporated into the Gould model which we called the posterior model.
This is a sample of one of five imputed datasets.
Reclassification table on PLuSS validation dataset with Gould prediction model as initial model, and updated model with Image model probability incorporated into the Gould risk prediction score.
Although both non-events and event groups were generally classified to higher probability classes, this effect was more pronounced in the event group. This is a sample of one of five imputed datasets.
| Nonevents (n = 97) | ||||||
| Updated model | ||||||
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 45 | 3 | 0 | 0 | 0 | 6 |
| [0.2,0.4) | 9 | 4 | 4 | 1 | 0 | 78 |
| [0.4,0.6) | 3 | 9 | 6 | 5 | 0 | 74 |
| [0.6,0.8) | 0 | 1 | 2 | 0 | 2 | 100 |
| [0.8,1] | 0 | 0 | 0 | 2 | 1 | 67 |
| Events (n = 99) | ||||||
| Updated model | ||||||
| Initial Model | [0,0.2) | [0.2,0.4) | [0.4,0.6) | [0.6,0.8) | [0.8,1] | % reclassified |
| [0,0.2) | 23 | 6 | 3 | 1 | 0 | 30 |
| [0.2,0.4) | 4 | 6 | 4 | 0 | 0 | 57 |
| [0.4,0.6) | 0 | 1 | 4 | 4 | 9 | 78 |
| [0.6,0.8) | 0 | 0 | 2 | 8 | 15 | 68 |
| [0.8,1] | 0 | 0 | 0 | 0 | 9 | 0 |