Literature DB >> 33307409

Predict multicategory causes of death in lung cancer patients using clinicopathologic factors.

Fei Deng1, Haijun Zhou2, Yong Lin3, John A Heim4, Lanlan Shen5, Yuan Li6, Lanjing Zhang7.   

Abstract

BACKGROUND: Random forests (RF) is a widely used machine-learning algorithm, and outperforms many other machine learning algorithms in prediction-accuracy. But it is rarely used for predicting causes of death (COD) in cancer patients. On the other hand, multicategory COD are difficult to classify in lung cancer patients, largely because they have multiple labels (versus binary labels).
METHODS: We tuned RF algorithms to classify 5-category COD among the lung cancer patients in the surveillance, epidemiology and end results-18, whose lung cancers were diagnosed in 2004, for the completeness in their follow-up. The patients were randomly divided into training and validation sets (1:1 and 4:1 sample-splits). We compared the prediction accuracy of the tuned RF and multinomial logistic regression (MLR) models.
RESULTS: We included 42,257 qualified lung cancers in the database. The COD were lung cancer (72.41%), other causes or alive (14.43%), non-lung cancer (6.85%), cardiovascular disease (5.35%), and infection (0.96%). The tuned RF model with 300 iterations and 10 variables outperformed the MLR model (accuracy = 69.8% vs 64.6%, 1:1 sample-split), while 4:1 sample-split produced lower prediction-accuracy than 1:1 sample-split. The top-10 important factors in the RF model were sex, chemotherapy status, age (65+ vs < 65 years), radiotherapy status, nodal status, T category, histology type and laterality, all of which except T category and laterality were also important in MLR model.
CONCLUSION: We tuned RF models to predict 5-category CODs in lung cancer patients, and show RF outperforms MLR in prediction accuracy. We also identified the factors associated with these COD.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Lung cancer; Machine learning; Multi-label classification; Multinomial logistic regression

Mesh:

Year:  2020        PMID: 33307409     DOI: 10.1016/j.compbiomed.2020.104161

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  5 in total

1.  Multimetric feature selection for analyzing multicategory outcomes of colorectal cancer: random forest and multinomial logistic regression models.

Authors:  Catherine H Feng; Mary L Disis; Chao Cheng; Lanjing Zhang
Journal:  Lab Invest       Date:  2021-09-18       Impact factor: 5.662

2.  Classify multicategory outcome in patients with lung adenocarcinoma using clinical, transcriptomic and clinico-transcriptomic data: machine learning versus multinomial models.

Authors:  Fei Deng; Lanlan Shen; He Wang; Lanjing Zhang
Journal:  Am J Cancer Res       Date:  2020-12-01       Impact factor: 6.166

3.  Low PRRX1 expression and high ZEB1 expression are significantly correlated with epithelial-mesenchymal transition and tumor angiogenesis in non-small cell lung cancer.

Authors:  Ruixue Yang; Yuanqun Liu; Yufei Wang; Xiaolin Wang; Hongfei Ci; Chao Song; Shiwu Wu
Journal:  Medicine (Baltimore)       Date:  2021-01-29       Impact factor: 1.817

4.  Predictions of cervical cancer identification by photonic method combined with machine learning.

Authors:  Michał Kruczkowski; Anna Drabik-Kruczkowska; Anna Marciniak; Martyna Tarczewska; Monika Kosowska; Małgorzata Szczerska
Journal:  Sci Rep       Date:  2022-03-08       Impact factor: 4.379

Review 5.  A Powerful Paradigm for Cardiovascular Risk Stratification Using Multiclass, Multi-Label, and Ensemble-Based Machine Learning Paradigms: A Narrative Review.

Authors:  Jasjit S Suri; Mrinalini Bhagawati; Sudip Paul; Athanasios D Protogerou; Petros P Sfikakis; George D Kitas; Narendra N Khanna; Zoltan Ruzsa; Aditya M Sharma; Sanjay Saxena; Gavino Faa; John R Laird; Amer M Johri; Manudeep K Kalra; Kosmas I Paraskevas; Luca Saba
Journal:  Diagnostics (Basel)       Date:  2022-03-16
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.