Panayiotis Petousis1, Simon X Han2, Denise Aberle2, Alex A T Bui2. 1. Department of Bioengineering, University of California, Los Angeles, CA, USA; Medical Imaging Informatics (MII) Group, Department of Radiological Sciences, University of California, Los Angeles, CA, USA. Electronic address: pp89@ucla.edu. 2. Department of Bioengineering, University of California, Los Angeles, CA, USA; Medical Imaging Informatics (MII) Group, Department of Radiological Sciences, University of California, Los Angeles, CA, USA.
Abstract
INTRODUCTION: Identifying high-risk lung cancer individuals at an early disease stage is the most effective way of improving survival. The landmark National Lung Screening Trial (NLST) demonstrated the utility of low-dose computed tomography (LDCT) imaging to reduce mortality (relative to X-ray screening). As a result of the NLST and other studies, imaging-based lung cancer screening programs are now being implemented. However, LDCT interpretation results in a high number of false positives. A set of dynamic Bayesian networks (DBN) were designed and evaluated to provide insight into how longitudinal data can be used to help inform lung cancer screening decisions. METHODS: The LDCT arm of the NLST dataset was used to build and explore five DBNs for high-risk individuals. Three of these DBNs were built using a backward construction process, and two using structure learning methods. All models employ demographics, smoking status, cancer history, family lung cancer history, exposure risk factors, comorbidities related to lung cancer, and LDCT screening outcome information. Given the uncertainty arising from lung cancer screening, a cancer state-space model based on lung cancer staging was utilized to characterize the cancer status of an individual over time. The models were evaluated on balanced training and test sets of cancer and non-cancer cases to deal with data imbalance and overfitting. RESULTS: Results were comparable to expert decisions. The average area under the curve (AUC) of the receiver operating characteristic (ROC) for the three intervention points of the NLST trial was higher than 0.75 for all models. Evaluation of the models on the complete LDCT arm of the NLST dataset (N=25,486) demonstrated satisfactory generalization. Consensus of predictions over similar cases is reported in concordance statistics between the models' and the physicians' predictions. The models' predictive ability with respect to missing data was also evaluated with the sample of cases that missed the second screening exam of the trial (N=417). The DBNs outperformed comparison models such as logistic regression and naïve Bayes. CONCLUSION: The lung cancer screening DBNs demonstrated high discrimination and predictive power with the majority of cancer and non-cancer cases.
INTRODUCTION: Identifying high-risk lung cancer individuals at an early disease stage is the most effective way of improving survival. The landmark National Lung Screening Trial (NLST) demonstrated the utility of low-dose computed tomography (LDCT) imaging to reduce mortality (relative to X-ray screening). As a result of the NLST and other studies, imaging-based lung cancer screening programs are now being implemented. However, LDCT interpretation results in a high number of false positives. A set of dynamic Bayesian networks (DBN) were designed and evaluated to provide insight into how longitudinal data can be used to help inform lung cancer screening decisions. METHODS: The LDCT arm of the NLST dataset was used to build and explore five DBNs for high-risk individuals. Three of these DBNs were built using a backward construction process, and two using structure learning methods. All models employ demographics, smoking status, cancer history, family lung cancer history, exposure risk factors, comorbidities related to lung cancer, and LDCT screening outcome information. Given the uncertainty arising from lung cancer screening, a cancer state-space model based on lung cancer staging was utilized to characterize the cancer status of an individual over time. The models were evaluated on balanced training and test sets of cancer and non-cancer cases to deal with data imbalance and overfitting. RESULTS: Results were comparable to expert decisions. The average area under the curve (AUC) of the receiver operating characteristic (ROC) for the three intervention points of the NLST trial was higher than 0.75 for all models. Evaluation of the models on the complete LDCT arm of the NLST dataset (N=25,486) demonstrated satisfactory generalization. Consensus of predictions over similar cases is reported in concordance statistics between the models' and the physicians' predictions. The models' predictive ability with respect to missing data was also evaluated with the sample of cases that missed the second screening exam of the trial (N=417). The DBNs outperformed comparison models such as logistic regression and naïve Bayes. CONCLUSION: The lung cancer screening DBNs demonstrated high discrimination and predictive power with the majority of cancer and non-cancer cases.
Authors: Margaret R Spitz; Waun Ki Hong; Christopher I Amos; Xifeng Wu; Matthew B Schabath; Qiong Dong; Sanjay Shete; Carol J Etzel Journal: J Natl Cancer Inst Date: 2007-05-02 Impact factor: 13.506
Authors: Marion Verduijn; Peter M J Rosseel; Niels Peek; Evert de Jonge; Bas A J M de Mol Journal: J Biomed Inform Date: 2007-07-26 Impact factor: 6.317
Authors: Patrick Maisonneuve; Vincenzo Bagnardi; Massimo Bellomi; Lorenzo Spaggiari; Giuseppe Pelosi; Cristiano Rampinelli; Raffaella Bertolotti; Nicole Rotmensz; John K Field; Andrea Decensi; Giulia Veronesi Journal: Cancer Prev Res (Phila) Date: 2011-08-02
Authors: German Cuaya; Angélica Muñoz-Meléndez; Lidia Nuñez Carrera; Eduardo F Morales; Ivett Quiñones; Alberto I Pérez; Aldo Alessi Journal: Med Biol Eng Comput Date: 2012-10-14 Impact factor: 2.602
Authors: Kathleen A Cronin; Mitchell H Gail; Zhaohui Zou; Peter B Bach; Jarmo Virtamo; Demetrius Albanes Journal: J Natl Cancer Inst Date: 2006-05-03 Impact factor: 13.506
Authors: Jung Hun Oh; Jeffrey Craft; Rawan Al Lozi; Manushka Vaidya; Yifan Meng; Joseph O Deasy; Jeffrey D Bradley; Issam El Naqa Journal: Phys Med Biol Date: 2011-02-18 Impact factor: 3.609
Authors: Margaret R Spitz; Carol J Etzel; Qiong Dong; Christopher I Amos; Qingyi Wei; Xifeng Wu; Waun Ki Hong Journal: Cancer Prev Res (Phila) Date: 2008-09
Authors: Shiwen Shen; Simon X Han; Panayiotis Petousis; Robert E Weiss; Frank Meng; Alex A T Bui; William Hsu Journal: Comput Biol Med Date: 2016-12-22 Impact factor: 4.589
Authors: José Carlos R Alcantud; Gonzalo Varela; Beatriz Santos-Buitrago; Gustavo Santos-García; Marcelo F Jiménez Journal: PLoS One Date: 2019-06-19 Impact factor: 3.240