Mucahit Cevik1, Mehmet Ali Ergun1, Natasha K Stout2, Amy Trentham-Dietz3, Mark Craven4, Oguzhan Alagoz1,3. 1. Department of Industrial and Systems Engineering, University of Wisconsin, Madison, WI, USA (MC, MAE, OA) 2. Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, USA (NKS) 3. Department of Population Health Sciences and Carbone Cancer Center, University of Wisconsin, Madison, WI, USA (AT-D, OA) 4. Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA (MC)
Abstract
BACKGROUND: Most cancer simulation models include unobservable parameters that determine disease onset and tumor growth. These parameters play an important role in matching key outcomes such as cancer incidence and mortality, and their values are typically estimated via a lengthy calibration procedure, which involves evaluating a large number of combinations of parameter values via simulation. The objective of this study is to demonstrate how machine learning approaches can be used to accelerate the calibration process by reducing the number of parameter combinations that are actually evaluated. METHODS: Active learning is a popular machine learning method that enables a learning algorithm such as artificial neural networks to interactively choose which parameter combinations to evaluate. We developed an active learning algorithm to expedite the calibration process. Our algorithm determines the parameter combinations that are more likely to produce desired outputs and therefore reduces the number of simulation runs performed during calibration. We demonstrate our method using the previously developed University of Wisconsin breast cancer simulation model (UWBCS). RESULTS: In a recent study, calibration of the UWBCS required the evaluation of 378 000 input parameter combinations to build a race-specific model, and only 69 of these combinations produced results that closely matched observed data. By using the active learning algorithm in conjunction with standard calibration methods, we identify all 69 parameter combinations by evaluating only 5620 of the 378 000 combinations. CONCLUSION: Machine learning methods hold potential in guiding model developers in the selection of more promising parameter combinations and hence speeding up the calibration process. Applying our machine learning algorithm to one model shows that evaluating only 1.49% of all parameter combinations would be sufficient for the calibration.
BACKGROUND: Most cancer simulation models include unobservable parameters that determine disease onset and tumor growth. These parameters play an important role in matching key outcomes such as cancer incidence and mortality, and their values are typically estimated via a lengthy calibration procedure, which involves evaluating a large number of combinations of parameter values via simulation. The objective of this study is to demonstrate how machine learning approaches can be used to accelerate the calibration process by reducing the number of parameter combinations that are actually evaluated. METHODS: Active learning is a popular machine learning method that enables a learning algorithm such as artificial neural networks to interactively choose which parameter combinations to evaluate. We developed an active learning algorithm to expedite the calibration process. Our algorithm determines the parameter combinations that are more likely to produce desired outputs and therefore reduces the number of simulation runs performed during calibration. We demonstrate our method using the previously developed University of Wisconsin breast cancer simulation model (UWBCS). RESULTS: In a recent study, calibration of the UWBCS required the evaluation of 378 000 input parameter combinations to build a race-specific model, and only 69 of these combinations produced results that closely matched observed data. By using the active learning algorithm in conjunction with standard calibration methods, we identify all 69 parameter combinations by evaluating only 5620 of the 378 000 combinations. CONCLUSION: Machine learning methods hold potential in guiding model developers in the selection of more promising parameter combinations and hence speeding up the calibration process. Applying our machine learning algorithm to one model shows that evaluating only 1.49% of all parameter combinations would be sufficient for the calibration.
Authors: Douglas C A Taylor; Vivek Pawar; Denise T Kruzikas; Kristen E Gilmore; Myrlene Sanon; Milton C Weinstein Journal: Pharmacoeconomics Date: 2012-02-01 Impact factor: 4.981
Authors: Nataliya G Batina; Amy Trentham-Dietz; Ronald E Gangnon; Brian L Sprague; Marjorie A Rosenberg; Natasha K Stout; Dennis G Fryback; Oguzhan Alagoz Journal: Breast Cancer Res Treat Date: 2013-02-16 Impact factor: 4.872
Authors: Romain Pirracchio; Maya L Petersen; Marco Carone; Matthieu Resche Rigon; Sylvie Chevret; Mark J van der Laan Journal: Lancet Respir Med Date: 2014-11-24 Impact factor: 30.700
Authors: Clara D M van Karnebeek; Tima Mohammadi; Nicole Tsao; Graham Sinclair; Sandra Sirrs; Sylvia Stockler; Carlo Marra Journal: Mol Genet Metab Date: 2014-07-15 Impact factor: 4.797
Authors: Rafael Meza; Kevin ten Haaf; Chung Yin Kong; Ayca Erdogan; William C Black; Martin C Tammemagi; Sung Eun Choi; Jihyoun Jeon; Summer S Han; Vidit Munshi; Joost van Rosmalen; Paul Pinsky; Pamela M McMahon; Harry J de Koning; Eric J Feuer; William D Hazelton; Sylvia K Plevritis Journal: Cancer Date: 2014-02-27 Impact factor: 6.860
Authors: Oguzhan Alagoz; Mehmet Ali Ergun; Mucahit Cevik; Brian L Sprague; Dennis G Fryback; Ronald E Gangnon; John M Hampton; Natasha K Stout; Amy Trentham-Dietz Journal: Med Decis Making Date: 2018-04 Impact factor: 2.583
Authors: Krishna P Reddy; Alexander J B Bulteel; Douglas E Levy; Pamela Torola; Emily P Hyle; Taige Hou; Benjamin Osher; Liyang Yu; Fatma M Shebl; A David Paltiel; Kenneth A Freedberg; Milton C Weinstein; Nancy A Rigotti; Rochelle P Walensky Journal: BMJ Open Date: 2020-05-12 Impact factor: 2.692