Literature DB >> 29706685

Quantifying predictive capability of electronic health records for the most harmful breast cancer.

Yirong Wu1, Jun Fan1, Peggy Peissig2, Richard Berg2, Ahmad Pahlavan Tafti2, Jie Yin3,4, Ming Yuan1, David Page1, Jennifer Cox1, Elizabeth S Burnside1.   

Abstract

Improved prediction of the "most harmful" breast cancers that cause the most substantive morbidity and mortality would enable physicians to target more intense screening and preventive measures at those women who have the highest risk; however, such prediction models for the "most harmful" breast cancers have rarely been developed. Electronic health records (EHRs) represent an underused data source that has great research and clinical potential. Our goal was to quantify the value of EHR variables in the "most harmful" breast cancer risk prediction. We identified 794 subjects who had breast cancer with primary non-benign tumors with their earliest diagnosis on or after 1/1/2004 from an existing personalized medicine data repository, including 395 "most harmful" breast cancer cases and 399 "least harmful" breast cancer cases. For these subjects, we collected EHR data comprised of 6 components: demographics, diagnoses, symptoms, procedures, medications, and laboratory results. We developed two regularized prediction models, Ridge Logistic Regression (Ridge-LR) and Lasso Logistic Regression (Lasso-LR), to predict the "most harmful" breast cancer one year in advance. The area under the ROC curve (AUC) was used to assess model performance. We observed that the AUCs of Ridge-LR and Lasso-LR models were 0.818 and 0.839 respectively. For both the Ridge-LR and Lasso-LR models, the predictive performance of the whole EHR variables was significantly higher than that of each individual component (p<0.001). In conclusion, EHR variables can be used to predict the "most harmful" breast cancer, providing the possibility to personalize care for those women at the highest risk in clinical practice.

Entities:  

Keywords:  breast cancer; electronic health records (EHRs); least absolute shrinkage and selection operator (Lasso); regularized prediction model

Year:  2018        PMID: 29706685      PMCID: PMC5914175          DOI: 10.1117/12.2293954

Source DB:  PubMed          Journal:  Proc SPIE Int Soc Opt Eng        ISSN: 0277-786X


  21 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Performance of common genetic variants in breast-cancer risk models.

Authors:  Sholom Wacholder; Patricia Hartge; Ross Prentice; Montserrat Garcia-Closas; Heather Spencer Feigelson; W Ryan Diver; Michael J Thun; David G Cox; Susan E Hankinson; Peter Kraft; Bernard Rosner; Christine D Berg; Louise A Brinton; Jolanta Lissowska; Mark E Sherman; Rowan Chlebowski; Charles Kooperberg; Rebecca D Jackson; Dennis W Buckman; Peter Hui; Ruth Pfeiffer; Kevin B Jacobs; Gilles D Thomas; Robert N Hoover; Mitchell H Gail; Stephen J Chanock; David J Hunter
Journal:  N Engl J Med       Date:  2010-03-18       Impact factor: 91.245

3.  Comparing Mammography Abnormality Features to Genetic Variants in the Prediction of Breast Cancer in Women Recommended for Breast Biopsy.

Authors:  Elizabeth S Burnside; Jie Liu; Yirong Wu; Adedayo A Onitilo; Catherine A McCarty; C David Page; Peggy L Peissig; Amy Trentham-Dietz; Terrie Kitchner; Jun Fan; Ming Yuan
Journal:  Acad Radiol       Date:  2015-10-26       Impact factor: 3.173

4.  Reduction in late-stage breast cancer incidence in the mammography era: Implications for overdiagnosis of invasive cancer.

Authors:  Mark A Helvie; Joanne T Chang; R Edward Hendrick; Mousumi Banerjee
Journal:  Cancer       Date:  2014-05-19       Impact factor: 6.860

Review 5.  Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review.

Authors:  Benjamin A Goldstein; Ann Marie Navar; Michael J Pencina; John P A Ioannidis
Journal:  J Am Med Inform Assoc       Date:  2016-05-17       Impact factor: 4.497

6.  Projecting individualized probabilities of developing breast cancer for white females who are being examined annually.

Authors:  M H Gail; L A Brinton; D P Byar; D K Corle; S B Green; C Schairer; J J Mulvihill
Journal:  J Natl Cancer Inst       Date:  1989-12-20       Impact factor: 13.506

7.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

8.  A logistic regression model based on the national mammography database format to aid breast cancer diagnosis.

Authors:  Jagpreet Chhatwal; Oguzhan Alagoz; Mary J Lindstrom; Charles E Kahn; Katherine A Shaffer; Elizabeth S Burnside
Journal:  AJR Am J Roentgenol       Date:  2009-04       Impact factor: 3.959

9.  Predicting invasive breast cancer versus DCIS in different age groups.

Authors:  Mehmet U S Ayvaci; Oguzhan Alagoz; Jagpreet Chhatwal; Alejandro Munoz del Rio; Edward A Sickles; Houssam Nassif; Karla Kerlikowske; Elizabeth S Burnside
Journal:  BMC Cancer       Date:  2014-08-11       Impact factor: 4.430

10.  How to control confounding effects by statistical analysis.

Authors:  Mohamad Amin Pourhoseingholi; Ahmad Reza Baghestani; Mohsen Vahedi
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2012
View more
  5 in total

1.  The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities.

Authors:  Lauren J Beesley; Maxwell Salvatore; Lars G Fritsche; Anita Pandit; Arvind Rao; Chad Brummett; Cristen J Willer; Lynda D Lisabeth; Bhramar Mukherjee
Journal:  Stat Med       Date:  2019-12-20       Impact factor: 2.373

2.  A Comparison of Logistic Regression Against Machine Learning Algorithms for Gastric Cancer Risk Prediction Within Real-World Clinical Data Streams.

Authors:  Robert J Huang; Nicole Sung-Eun Kwon; Yutaka Tomizawa; Alyssa Y Choi; Tina Hernandez-Boussard; Joo Ha Hwang
Journal:  JCO Clin Cancer Inform       Date:  2022-06

Review 3.  Leveraging Electronic Health Records to Address Breast Cancer Disparities.

Authors:  Solange Bayard; Genevieve Fasano; Rulla M Tamimi; Pilyung Stephen Oh
Journal:  Curr Breast Cancer Rep       Date:  2022-09-03

4.  Electronic health records and patient registries in medical oncology departments in Spain.

Authors:  N Ribelles; I Alvarez-Lopez; A Arcusa; J I Chacon; J de la Haba; J García-Corbacho; J Garcia-Mata; C Jara; J M Jerez; M Lázaro-Quintela; L Leon-Mateos; N Ramirez-Merino; A Tibau; A Garcia-Palomo
Journal:  Clin Transl Oncol       Date:  2021-04-17       Impact factor: 3.405

5.  Personalized Risk-Based Screening Design for Comparative Two-Arm Group Sequential Clinical Trials.

Authors:  Yeonhee Park
Journal:  J Pers Med       Date:  2022-03-12
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.