BACKGROUND: Tumor control probability (TCP) to radiotherapy is determined by complex interactions between tumor biology, tumor microenvironment, radiation dosimetry, and patient-related variables. The complexity of these heterogeneous variable interactions constitutes a challenge for building predictive models for routine clinical practice. We describe a datamining framework that can unravel the higher order relationships among dosimetric dose-volume prognostic variables, interrogate various radiobiological processes, and generalize to unseen data before when applied prospectively. MATERIAL AND METHODS: Several datamining approaches are discussed that include dose-volume metrics, equivalent uniform dose, mechanistic Poisson model, and model building methods using statistical regression and machine learning techniques. Institutional datasets of non-small cell lung cancer (NSCLC) patients are used to demonstrate these methods. The performance of the different methods was evaluated using bivariate Spearman rank correlations (rs). Over-fitting was controlled via resampling methods. RESULTS: Using a dataset of 56 patients with primary NCSLC tumors and 23 candidate variables, we estimated GTV volume and V75 to be the best model parameters for predicting TCP using statistical resampling and a logistic model. Using these variables, the support vector machine (SVM) kernel method provided superior performance for TCP prediction with an rs=0.68 on leave-one-out testing compared to logistic regression (rs=0.4), Poisson-based TCP (rs=0.33), and cell kill equivalent uniform dose model (rs=0.17). CONCLUSIONS: The prediction of treatment response can be improved by utilizing datamining approaches, which are able to unravel important non-linear complex interactions among model variables and have the capacity to predict on unseen data for prospective clinical applications.
BACKGROUND:Tumor control probability (TCP) to radiotherapy is determined by complex interactions between tumor biology, tumor microenvironment, radiation dosimetry, and patient-related variables. The complexity of these heterogeneous variable interactions constitutes a challenge for building predictive models for routine clinical practice. We describe a datamining framework that can unravel the higher order relationships among dosimetric dose-volume prognostic variables, interrogate various radiobiological processes, and generalize to unseen data before when applied prospectively. MATERIAL AND METHODS: Several datamining approaches are discussed that include dose-volume metrics, equivalent uniform dose, mechanistic Poisson model, and model building methods using statistical regression and machine learning techniques. Institutional datasets of non-small cell lung cancer (NSCLC) patients are used to demonstrate these methods. The performance of the different methods was evaluated using bivariate Spearman rank correlations (rs). Over-fitting was controlled via resampling methods. RESULTS: Using a dataset of 56 patients with primary NCSLC tumors and 23 candidate variables, we estimated GTV volume and V75 to be the best model parameters for predicting TCP using statistical resampling and a logistic model. Using these variables, the support vector machine (SVM) kernel method provided superior performance for TCP prediction with an rs=0.68 on leave-one-out testing compared to logistic regression (rs=0.4), Poisson-based TCP (rs=0.33), and cell kill equivalent uniform dose model (rs=0.17). CONCLUSIONS: The prediction of treatment response can be improved by utilizing datamining approaches, which are able to unravel important non-linear complex interactions among model variables and have the capacity to predict on unseen data for prospective clinical applications.
Authors: Patricia E Lindsay; Issam El Naqa; Andrew J Hope; Milos Vicic; Jing Cui; Jeffrey D Bradley; Joseph O Deasy Journal: Med Phys Date: 2007-01 Impact factor: 4.071
Authors: S Levegrün; A Jackson; M J Zelefsky; E S Venkatraman; M W Skwarchuk; W Schlegel; Z Fuks; S A Leibel; C C Ling Journal: Int J Radiat Oncol Biol Phys Date: 2000-07-15 Impact factor: 7.038
Authors: Alan Pollack; Didier Cowen; Patricia Troncoso; Gunar K Zagars; Andrew C von Eschenbach; Marvin L Meistrich; Timothy McDonnell Journal: Cancer Date: 2003-04-01 Impact factor: 6.860
Authors: Issam El Naqa; Sarah L Kerns; James Coates; Yi Luo; Corey Speers; Catharine M L West; Barry S Rosenstein; Randall K Ten Haken Journal: Phys Med Biol Date: 2017-08-01 Impact factor: 3.609
Authors: Jung Hun Oh; Jeffrey Craft; Rawan Al Lozi; Manushka Vaidya; Yifan Meng; Joseph O Deasy; Jeffrey D Bradley; Issam El Naqa Journal: Phys Med Biol Date: 2011-02-18 Impact factor: 3.609
Authors: Issam El Naqa; Dan Ruan; Gilmer Valdes; Andre Dekker; Todd McNutt; Yaorong Ge; Q Jackie Wu; Jung Hun Oh; Maria Thor; Wade Smith; Arvind Rao; Clifton Fuller; Ying Xiao; Frank Manion; Matthew Schipper; Charles Mayo; Jean M Moran; Randall Ten Haken Journal: Med Phys Date: 2018-08-24 Impact factor: 4.071
Authors: Adriana M De Mendoza; Soňa Michlíková; Johann Berger; Jens Karschau; Leoni A Kunz-Schughart; Damian D McLeod Journal: Sci Rep Date: 2021-03-09 Impact factor: 4.379