| Literature DB >> 32392098 |
Melle S Sieswerda1,2, Inigo Bermejo2, Gijs Geleijnse1, Mieke J Aarts1, Valery E P P Lemmens2, Dirk De Ruysscher2, André L A J Dekker2, Xander A A M Verbeek1.
Abstract
PURPOSE: The TNM classification system is used for prognosis, treatment, and research. Regular updates potentially break backward compatibility. Reclassification is not always possible, is labor intensive, or requires additional data. We developed a Bayesian network (BN) for reclassifying the 5th, 6th, and 7th editions of the TNM and predicting survival for non-small-cell lung cancer (NSCLC) without training data with known classifications in multiple editions.Entities:
Year: 2020 PMID: 32392098 PMCID: PMC7265790 DOI: 10.1200/CCI.19.00136
Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN: 2473-4276
TNM Editions Used by the Netherlands Cancer Registry and No. of Records Available by Period, Split Into Data for Training and Testing
Parameters Obtained From the Netherlands Cancer Registry
FIG 1.Bayesian network (after Expectation-Maximization learning) visualized using BayesiaLab 8. Each node represents a random variable with a (conditional) probability table and shows its state’s prior probabilities in a (rotated) histogram. The arrows indicate the (causal) relations between the nodes. The “_567”-suffix indicate nodes that, in conjunction with the “edition” node, can take on values from all TNM editions. The “_7”-suffixed nodes can take on 7th edition values only.
Macro-Aggregated and Micro-Aggregated Statistics for Predicting 6th Edition Stage Group Using 7th Edition Data, 7th Edition Stage Group Using 6th Edition Data, and Survival Using Data From All Editions
FIG 2.Area under the curve for 2-year survival (0.81), computed using BayesiaLab, and the test set comprising all editions. ROC, receiver operation characteristic.
FIG 3.Visualization of model calibration for predicting survival. The calibration (bubble) plot shows, for each subpopulation, the relation between a survival category’s predicted probability (x-axis) and its observed frequency in the dataset (y-axis). The size of each bubble corresponds to the population’s size in the dataset. The calibration curve shows the same relation, but averages predicted/observed values by applying quantile-based binning with 1,000 samples per bin (21 bins).