Literature DB >> 32509383

Predicting long-term multicategory cause of death in patients with prostate cancer: random forest versus multinomial model.

Jianwei Wang1, Fei Deng2, Fuqing Zeng3, Andrew J Shanahan4, Wei Vivian Li5, Lanjing Zhang6,7,8,9.   

Abstract

The majority of patients with prostate cancer die of non-cancer causes of death (COD). It is thus important to accurately predict multi-category COD in these patients. Random forest (RF), a popular machine learning model, has been shown useful for predicting binary cancer-specific deaths. However, its accuracy for predicting multi-category COD in cancer patients is unclear. We included patients in Surveillance, Epidemiology, and End Results-18 cancer registry-program with prostate cancer diagnosed in 2004 (followed-up through 2016). They were randomly divided into training and testing sets with equal sizes. We evaluated prediction accuracies of RF and conventional statistical/multinomial models for 6-category COD by data-encoding types using the 2-fold cross-validation approach. Among 49,864 prostate cancer patients, 29,611 (59.4%) were alive at the end of follow-up, and 5,448 (10.9%) died of cardiovascular disease, 4,607 (9.2%) of prostate cancer, 3,681 (7.4%) of non-prostate cancer, 717 (1.4%) of infection, and 5,800 (11.6%) of other causes. We predicted 6-category COD among these patients with a mean accuracy of 59.1% (n=240, 95% CI, 58.7%-59.4%) in RF models with one-hot encoding, and 50.4% (95% CI, 49.7%-51.0%) in multinomial models. Tumor characteristics, prostate-specific antigen level, and diagnosis confirmation-method were important in RF and multinomial models. In RF models, no statistical differences were found between the accuracies of training versus cross-validation phases, and those of categorical versus one-hot encoding. We here report that RF models can outperform multinomial logistic models (absolute accuracy-difference, 8.7%) in predicting long-term 6-category COD among prostate cancer patients, while pathology diagnosis itself and tumor pathology remain important factors. AJCR
Copyright © 2020.

Entities:  

Keywords:  Prostate cancer; cause-specific mortality; machine learning; prediction; prognosis

Year:  2020        PMID: 32509383

Source DB:  PubMed          Journal:  Am J Cancer Res        ISSN: 2156-6976            Impact factor:   6.166


  9 in total

1.  Performance and efficiency of machine learning algorithms for analyzing rectangular biomedical data.

Authors:  Fei Deng; Jibing Huang; Xiaoling Yuan; Chao Cheng; Lanjing Zhang
Journal:  Lab Invest       Date:  2021-02-11       Impact factor: 5.662

2.  Prediction of Micronucleus Assay Outcome Using In Vivo Activity Data and Molecular Structure Features.

Authors:  Priyanka Ramesh; Shanthi Veerappapillai
Journal:  Appl Biochem Biotechnol       Date:  2021-10-20       Impact factor: 2.926

3.  Multimetric feature selection for analyzing multicategory outcomes of colorectal cancer: random forest and multinomial logistic regression models.

Authors:  Catherine H Feng; Mary L Disis; Chao Cheng; Lanjing Zhang
Journal:  Lab Invest       Date:  2021-09-18       Impact factor: 5.662

4.  A polygenic stacking classifier revealed the complicated platelet transcriptomic landscape of adult immune thrombocytopenia.

Authors:  Chengfeng Xu; Ruochi Zhang; Meiyu Duan; Yongming Zhou; Jizhang Bao; Hao Lu; Jie Wang; Minghui Hu; Zhaoyang Hu; Fengfeng Zhou; Wenwei Zhu
Journal:  Mol Ther Nucleic Acids       Date:  2022-04-06       Impact factor: 10.183

5.  Automated model versus treating physician for predicting survival time of patients with metastatic cancer.

Authors:  Michael F Gensheimer; Sonya Aggarwal; Kathryn R K Benson; Justin N Carter; A Solomon Henry; Douglas J Wood; Scott G Soltys; Steven Hancock; Erqi Pollom; Nigam H Shah; Daniel T Chang
Journal:  J Am Med Inform Assoc       Date:  2021-06-12       Impact factor: 4.497

6.  Classify multicategory outcome in patients with lung adenocarcinoma using clinical, transcriptomic and clinico-transcriptomic data: machine learning versus multinomial models.

Authors:  Fei Deng; Lanlan Shen; He Wang; Lanjing Zhang
Journal:  Am J Cancer Res       Date:  2020-12-01       Impact factor: 6.166

7.  The Challenges and Opportunities of Translational Pathology.

Authors:  Lanjing Zhang
Journal:  J Clin Transl Pathol       Date:  2022-02-23

8.  Variation in and Factors Associated With US County-Level Cancer Mortality, 2008-2019.

Authors:  Weichuan Dong; Wyatt P Bensken; Uriel Kim; Johnie Rose; Qinjin Fan; Nicholas K Schiltz; Nathan A Berger; Siran M Koroukian
Journal:  JAMA Netw Open       Date:  2022-09-01

9.  Predictive modeling of estrogen receptor agonism, antagonism, and binding activities using machine- and deep-learning approaches.

Authors:  Heather L Ciallella; Daniel P Russo; Lauren M Aleksunes; Fabian A Grimm; Hao Zhu
Journal:  Lab Invest       Date:  2020-08-10       Impact factor: 5.662

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.