INTRODUCTION: Recurrent cancer is common, costly, and lethal, yet we know little about it in community-based populations. Electronic health records and tumor registries contain vast amounts of data on community-based patients, but usually lack recurrence status. Existing algorithms that use structured data to detect recurrence have limitations.

METHODS: We developed algorithms to detect the presence and timing of recurrence after definitive therapy for stages I-III lung and colorectal cancer using 2 data sources that contain a widely available type of structured data (claims or electronic health record encounters) linked to gold-standard recurrence status: Medicare claims linked to the Cancer Care Outcomes Research and Surveillance study, and the Cancer Research Network Virtual Data Warehouse linked to registry data. Twelve potential indicators of recurrence were used to develop separate models for each cancer in each data source. Detection models maximized area under the ROC curve (AUC); timing models minimized average absolute error. Algorithms were compared by cancer type and data source, and contrasted with an existing binary detection rule.

RESULTS: Detection model AUCs (>0.92) exceeded those of existing prediction rules. Timing models yielded absolute prediction errors that were small relative to follow-up time (<15%). Similar covariates were included in all detection and timing algorithms, though differences by cancer type and dataset challenged efforts to create 1 common algorithm for all scenarios.

CONCLUSIONS: Valid and reliable detection of recurrence using big data is feasible. These tools will enable extensive, novel research on quality, effectiveness, and outcomes for lung and colorectal cancer patients and those who develop recurrence.
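The two evaluation criteria named in the methods, AUC for detection and average absolute error for timing, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the pairwise (Mann-Whitney) form of the AUC, and the toy patient data are all assumptions made for illustration only.

```python
from typing import Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the share of
    (recurrent, non-recurrent) patient pairs ranked correctly."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one patient in each class")
    # A pair counts 1 if the recurrent patient scores higher, 0.5 on a tie.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def mean_absolute_error(
    true_months: Sequence[float], predicted_months: Sequence[float]
) -> float:
    """Average absolute gap between true and predicted recurrence time."""
    if not true_months or len(true_months) != len(predicted_months):
        raise ValueError("inputs must be non-empty and of equal length")
    gaps = [abs(t - p) for t, p in zip(true_months, predicted_months)]
    return sum(gaps) / len(gaps)


# Hypothetical toy data: 1 = gold-standard recurrence, 0 = no recurrence.
labels = [1, 0, 1, 0, 0, 1]
scores = [0.9, 0.2, 0.7, 0.4, 0.8, 0.6]  # model-estimated recurrence scores
detection_auc = auc(labels, scores)

# Hypothetical months from definitive therapy to recurrence.
true_months = [12, 20, 8]
predicted_months = [14, 17, 8]
timing_mae = mean_absolute_error(true_months, predicted_months)

print(f"AUC = {detection_auc:.3f}, MAE = {timing_mae:.2f} months")
```

In this framing, a detection model is preferred when it raises the AUC, and a timing model is preferred when it lowers the mean absolute error; dividing that error by follow-up time gives a relative error comparable to the percentage reported in the results.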