Background: This study developed, validated, and disseminated a generalizable informatics algorithm for detecting breast cancer recurrence and timing using a gold standard measure of recurrence coupled with data derived from a readily available common data model that pools health insurance claims and electronic health records data. Methods: The algorithm has two parts: to detect the presence of recurrence and to estimate the timing of recurrence. The primary data source was the Cancer Research Network Virtual Data Warehouse (VDW). Sixteen potential indicators of recurrence were considered for model development. The final recurrence detection and timing models were determined, respectively, by maximizing the area under the ROC curve (AUROC) and minimizing average absolute error. Detection and timing algorithms were validated using VDW data in comparison with a gold standard recurrence capture from a third site in which recurrences were validated through chart review. Performance of this algorithm, stratified by stage at diagnosis, was compared with other published algorithms. All statistical tests were two-sided. Results: Detection model AUROCs were 0.939 (95% confidence interval [CI] = 0.917 to 0.955) in the training data set (n = 3370) and 0.956 (95% CI = 0.944 to 0.971) and 0.900 (95% CI = 0.872 to 0.928), respectively, in the two validation data sets (n = 3370 and 3961, respectively). Timing models yielded average absolute prediction errors of 12.6% (95% CI = 10.5% to 14.5%) in the training data and 11.7% (95% CI = 9.9% to 13.5%) and 10.8% (95% CI = 9.6% to 12.2%) in the validation data sets, respectively, and were statistically significantly lower by 12.6% (95% CI = 8.8% to 16.5%, P < .001) than those estimated using previously reported timing algorithms. Similar covariates were included in both detection and timing algorithms but differed substantially from previous studies. Conclusions: Valid and reliable detection of recurrence using data derived from electronic medical records and insurance claims is feasible. These tools will enable extensive, novel research on quality, effectiveness, and outcomes for breast cancer patients and those who develop recurrence.
Background: This study developed, validated, and disseminated a generalizable informatics algorithm for detecting breast cancer recurrence and timing using a gold standard measure of recurrence coupled with data derived from a readily available common data model that pools health insurance claims and electronic health records data. Methods: The algorithm has two parts: to detect the presence of recurrence and to estimate the timing of recurrence. The primary data source was the Cancer Research Network Virtual Data Warehouse (VDW). Sixteen potential indicators of recurrence were considered for model development. The final recurrence detection and timing models were determined, respectively, by maximizing the area under the ROC curve (AUROC) and minimizing average absolute error. Detection and timing algorithms were validated using VDW data in comparison with a gold standard recurrence capture from a third site in which recurrences were validated through chart review. Performance of this algorithm, stratified by stage at diagnosis, was compared with other published algorithms. All statistical tests were two-sided. Results: Detection model AUROCs were 0.939 (95% confidence interval [CI] = 0.917 to 0.955) in the training data set (n = 3370) and 0.956 (95% CI = 0.944 to 0.971) and 0.900 (95% CI = 0.872 to 0.928), respectively, in the two validation data sets (n = 3370 and 3961, respectively). Timing models yielded average absolute prediction errors of 12.6% (95% CI = 10.5% to 14.5%) in the training data and 11.7% (95% CI = 9.9% to 13.5%) and 10.8% (95% CI = 9.6% to 12.2%) in the validation data sets, respectively, and were statistically significantly lower by 12.6% (95% CI = 8.8% to 16.5%, P < .001) than those estimated using previously reported timing algorithms. Similar covariates were included in both detection and timing algorithms but differed substantially from previous studies. Conclusions: Valid and reliable detection of recurrence using data derived from electronic medical records and insurance claims is feasible. These tools will enable extensive, novel research on quality, effectiveness, and outcomes for breast cancerpatients and those who develop recurrence.
Authors: Reina Haque; Jiaxiao Shi; Joanne E Schottinger; Syed Ajaz Ahmed; Joanie Chung; Chantal Avila; Valerie S Lee; Thomas Craig Cheetham; Laurel A Habel; Suzanne W Fletcher; Marilyn L Kwan Journal: Med Care Date: 2015-04 Impact factor: 2.983
Authors: Mark C Hornbrook; Gene Hart; Jennifer L Ellis; Donald J Bachman; Gary Ansell; Sarah M Greene; Edward H Wagner; Roy Pardee; Mark M Schmidt; Ann Geiger; Amy L Butani; Terry Field; Hassan Fouayzi; Irina Miroshnik; Liyan Liu; Robert Diseker; Karen Wells; Rick Krajenta; Lois Lamerato; Christine Neslund Dudas Journal: J Natl Cancer Inst Monogr Date: 2005
Authors: Beth L Nordstrom; Joanna L Whyte; Marilyn Stolar; Catherine Mercaldi; Joel D Kallich Journal: Pharmacoepidemiol Drug Saf Date: 2012-05 Impact factor: 2.890
Authors: Michael J Hassett; Debra P Ritzwoller; Nathan Taback; Nikki Carroll; Angel M Cronin; Gladys V Ting; Deb Schrag; Joan L Warren; Mark C Hornbrook; Jane C Weeks Journal: Med Care Date: 2014-10 Impact factor: 2.983
Authors: Marilyn L Kwan; Christine B Ambrosone; Marion M Lee; Janice Barlow; Sarah E Krathwohl; Isaac Joshua Ergas; Christine H Ashley; Julie R Bittner; Jeanne Darbinian; Keren Stronach; Bette J Caan; Warren Davis; Susan E Kutner; Charles P Quesenberry; Carol P Somkin; Barbara Sternfeld; John K Wiencke; Shichun Zheng; Lawrence H Kushi Journal: Cancer Causes Control Date: 2008-05-14 Impact factor: 2.506
Authors: Craig C Earle; Ann B Nattinger; Arnold L Potosky; Kathleen Lang; Rajiv Mallick; Mark Berger; Joan L Warren Journal: Med Care Date: 2002-08 Impact factor: 2.983
Authors: Michael J Hassett; Matthew Banegas; Hajime Uno; Shicheng Weng; Angel M Cronin; Maureen O'Keeffe Rosetti; Nikki M Carroll; Mark C Hornbrook; Debra P Ritzwoller Journal: J Oncol Pract Date: 2019-05-20 Impact factor: 3.840
Authors: Hava Izci; Tim Tambuyzer; Krizia Tuand; Victoria Depoorter; Annouschka Laenen; Hans Wildiers; Ignace Vergote; Liesbet Van Eycken; Harlinde De Schutter; Freija Verdoodt; Patrick Neven Journal: J Natl Cancer Inst Date: 2020-10-01 Impact factor: 13.506
Authors: Hajime Uno; Debra P Ritzwoller; Angel M Cronin; Nikki M Carroll; Mark C Hornbrook; Michael J Hassett Journal: JCO Clin Cancer Inform Date: 2018-12
Authors: Mara Meyer Epstein; Cassandra Saphirak; Yanhua Zhou; Candace LeBlanc; Alan G Rosmarin; Arlene Ash; Sonal Singh; Kimberly Fisher; Brenda M Birmann; Jerry H Gurwitz Journal: Pharmacoepidemiol Drug Saf Date: 2019-11-17 Impact factor: 2.890
Authors: Nikki M Carroll; Debra P Ritzwoller; Matthew P Banegas; Maureen O'Keeffe-Rosetti; Angel M Cronin; Hajime Uno; Mark C Hornbrook; Michael J Hassett Journal: JCO Clin Cancer Inform Date: 2019-03
Authors: Debra P Ritzwoller; Paul A Fishman; Matthew P Banegas; Nikki M Carroll; Maureen O'Keeffe-Rosetti; Angel M Cronin; Hajime Uno; Mark C Hornbrook; Michael J Hassett Journal: Health Serv Res Date: 2018-07-24 Impact factor: 3.402
Authors: Donna R Rivera; Clara J K Lam; Lindsey Enewold; Valentina I Petkov; Quyen Tran; Sean Brennan; Lois Dickie; Timothy S McNeel; Annie M Noone; Bradley Ohm; Dolly P White; Joan L Warren; Angela B Mariotto; Lynne Penberthy Journal: J Natl Cancer Inst Monogr Date: 2020-05-01
Authors: Donna R Rivera; Andrew Grothen; Bradley Ohm; Timothy S McNeel; Sean Brennan; Clara J K Lam; Lynne Penberthy; Lindsey Enewold; Valentina I Petkov Journal: J Natl Cancer Inst Monogr Date: 2020-05-01
Authors: Jennifer L Caswell-Jin; Alison Callahan; Natasha Purington; Summer S Han; Haruka Itakura; Esther M John; Douglas W Blayney; George W Sledge; Nigam H Shah; Allison W Kurian Journal: JCO Clin Cancer Inform Date: 2021-05