Literature DB >> 35763703

A Comparison of Logistic Regression Against Machine Learning Algorithms for Gastric Cancer Risk Prediction Within Real-World Clinical Data Streams.

Robert J Huang1, Nicole Sung-Eun Kwon1, Yutaka Tomizawa2, Alyssa Y Choi3, Tina Hernandez-Boussard4, Joo Ha Hwang1.   

Abstract

PURPOSE: Noncardia gastric cancer (NCGC) is a leading cause of global cancer mortality, and is often diagnosed at advanced stages. Development of NCGC risk models within electronic health records (EHR) may allow for improved cancer prevention. There has been much recent interest in use of machine learning (ML) for cancer prediction, but few studies comparing ML with classical statistical models for NCGC risk prediction.
METHODS: We trained models using logistic regression (LR) and four commonly used ML algorithms to predict NCGC from age-/sex-matched controls in two EHR systems: Stanford University and the University of Washington (UW). The LR model contained well-established NCGC risk factors (intestinal metaplasia histology, prior Helicobacter pylori infection, race, ethnicity, nativity status, smoking history, anemia), whereas ML models agnostically selected variables from the EHR. Models were developed and internally validated in the Stanford data, and externally validated in the UW data. Hyperparameter tuning of models was achieved using cross-validation. Model performance was compared by accuracy, sensitivity, and specificity.
RESULTS: In internal validation, LR performed with comparable accuracy (0.732; 95% CI, 0.698 to 0.764), sensitivity (0.697; 95% CI, 0.647 to 0.744), and specificity (0.767; 95% CI, 0.720 to 0.809) to penalized lasso, support vector machine, K-nearest neighbor, and random forest models. In external validation, LR continued to demonstrate high accuracy, sensitivity, and specificity. Although K-nearest neighbor demonstrated higher accuracy and specificity, this was offset by significantly lower sensitivity. No ML model consistently outperformed LR across evaluation criteria.
CONCLUSION: Drawing data from two independent EHRs, we find LR on the basis of established risk factors demonstrated comparable performance to optimized ML algorithms. This study demonstrates that classical models built on robust, hand-chosen predictor variables may not be inferior to data-driven models for NCGC risk prediction.

Entities:  

Mesh:

Year:  2022        PMID: 35763703      PMCID: PMC9259116          DOI: 10.1200/CCI.22.00039

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  31 in total

1.  Age-specific trends in incidence of noncardia gastric cancer in US adults.

Authors:  William F Anderson; M Constanza Camargo; Joseph F Fraumeni; Pelayo Correa; Philip S Rosenberg; Charles S Rabkin
Journal:  JAMA       Date:  2010-05-05       Impact factor: 56.272

2.  Case-case comparison of smoking and alcohol risk associations with Epstein-Barr virus-positive gastric cancer.

Authors:  M Constanza Camargo; Chihaya Koriyama; Keitaro Matsuo; Woo-Ho Kim; Roberto Herrera-Goepfert; Linda M Liao; Jun Yu; Gabriel Carrasquilla; Joseph J Y Sung; Isabel Alvarado-Cabrero; Jolanta Lissowska; Fernando Meneses-Gonzalez; Yashushi Yatabe; Ti Ding; Nan Hu; Philip R Taylor; Douglas R Morgan; Margaret L Gulley; Javier Torres; Suminori Akiba; Charles S Rabkin
Journal:  Int J Cancer       Date:  2013-08-28       Impact factor: 7.396

3.  Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.

Authors:  Hyuna Sung; Jacques Ferlay; Rebecca L Siegel; Mathieu Laversanne; Isabelle Soerjomataram; Ahmedin Jemal; Freddie Bray
Journal:  CA Cancer J Clin       Date:  2021-02-04       Impact factor: 508.702

Review 4.  An Approach to the Primary and Secondary Prevention of Gastric Cancer in the United States.

Authors:  Robert J Huang; Meira Epplein; Chisato Hamashima; Il Ju Choi; Eunjung Lee; Dennis Deapen; Yanghee Woo; Thuy Tran; Shailja C Shah; John M Inadomi; David A Greenwald; Joo Ha Hwang
Journal:  Clin Gastroenterol Hepatol       Date:  2021-10-06       Impact factor: 13.576

Review 5.  Endoscopic Screening in Asian Countries Is Associated With Reduced Gastric Cancer Mortality: A Meta-analysis and Systematic Review.

Authors:  Xing Zhang; Meng Li; Shuntai Chen; Jiaqi Hu; Qiujun Guo; Rui Liu; Honggang Zheng; Zhichao Jin; Yuan Yuan; Yupeng Xi; Baojin Hua
Journal:  Gastroenterology       Date:  2018-04-30       Impact factor: 22.682

Review 6.  Human gastric carcinogenesis: a multistep and multifactorial process--First American Cancer Society Award Lecture on Cancer Epidemiology and Prevention.

Authors:  P Correa
Journal:  Cancer Res       Date:  1992-12-15       Impact factor: 12.701

7.  Population-Based Analysis of Differences in Gastric Cancer Incidence Among Races and Ethnicities in Individuals Age 50 Years and Older.

Authors:  Shailja C Shah; Meg McKinley; Samir Gupta; Richard M Peek; Maria Elena Martinez; Scarlett L Gomez
Journal:  Gastroenterology       Date:  2020-08-06       Impact factor: 22.682

8.  Analysis of matched case-control studies.

Authors:  Neil Pearce
Journal:  BMJ       Date:  2016-02-25

9.  Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment.

Authors:  Imon Banerjee; Kevin Li; Martin Seneviratne; Michelle Ferrari; Tina Seto; James D Brooks; Daniel L Rubin; Tina Hernandez-Boussard
Journal:  JAMIA Open       Date:  2019-01-04

10.  MINIMAR (MINimum Information for Medical AI Reporting): Developing reporting standards for artificial intelligence in health care.

Authors:  Tina Hernandez-Boussard; Selen Bozkurt; John P A Ioannidis; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2020-12-09       Impact factor: 4.497

View more
  1 in total

1.  Can National Registries Contribute to Predict the Risk of Cancer? The Cancer Risk Assessment Model (CRAM).

Authors:  Dorte E Jarbøl; Nana Hyldig; Sören Möller; Sonja Wehberg; Sanne Rasmussen; Kirubakaran Balasubramaniam; Peter F Haastrup; Jens Søndergaard; Katrine H Rubin
Journal:  Cancers (Basel)       Date:  2022-08-06       Impact factor: 6.575

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.