Literature DB >> 31312809

Lung Cancer Survival Prediction via Machine Learning Regression, Classification, and Statistical Techniques.

James A Bartholomai1, Hermann B Frieboes1.   

Abstract

A regression model is developed to predict survival time in months for lung cancer patients. It was previously shown that predictive models perform accurately for short survival times of less than 6 months; however, model accuracy is reduced when attempting to predict longer survival times. This study employs an approach for which regression models are used in combination with a classification model to predict survival time. A set of de-identified lung cancer patient data was obtained from the Surveillance, Epidemiology, and End Results (SEER) database. The models use a subset of factors selected by ANOVA. Model accuracy is measured by a confusion matrix for classification and by Root Mean Square Error (RMSE) for regression. Random Forests are used for classification, while general Linear Regression, Gradient Boosted Machines (GBM), and Random Forests are used for regression. The regression results show that RF had the best performance for survival times ≤6 and >24 months (RMSE 10.52 and 20.51, respectively), while GBM performed best for 7-24 months (RMSE 15.65). Comparison plots of the results further indicate that the regression models perform better for shorter survival times than the RMSE values are able to reflect.

Entities:  

Keywords:  SEER database; biomedical big data; lung cancer; machine learning; supervised classification

Year:  2019        PMID: 31312809      PMCID: PMC6634305          DOI: 10.1109/ISSPIT.2018.8642753

Source DB:  PubMed          Journal:  Proc IEEE Int Symp Signal Proc Inf Tech


  4 in total

1.  Distinctive egg-laying patterns in terminal versus non-terminal periods in three fruit fly species.

Authors:  Xiang Meng; Junjie Hu; Richard E Plant; Tim E Carpenter; James R Carey
Journal:  Exp Gerontol       Date:  2020-12-11       Impact factor: 4.032

2.  Classify multicategory outcome in patients with lung adenocarcinoma using clinical, transcriptomic and clinico-transcriptomic data: machine learning versus multinomial models.

Authors:  Fei Deng; Lanlan Shen; He Wang; Lanjing Zhang
Journal:  Am J Cancer Res       Date:  2020-12-01       Impact factor: 6.166

3.  When Do We Need Massive Computations to Perform Detailed COVID-19 Simulations?

Authors:  Christopher B Lutz; Philippe J Giabbanelli
Journal:  Adv Theory Simul       Date:  2021-11-23

4.  Machine Learning Using Real-World and Translational Data to Improve Treatment Selection for NSCLC Patients Treated with Immunotherapy.

Authors:  Arsela Prelaj; Mattia Boeri; Alessandro Robuschi; Roberto Ferrara; Claudia Proto; Giuseppe Lo Russo; Giulia Galli; Alessandro De Toma; Marta Brambilla; Mario Occhipinti; Sara Manglaviti; Teresa Beninato; Achille Bottiglieri; Giacomo Massa; Emma Zattarin; Rosaria Gallucci; Edoardo Gregorio Galli; Monica Ganzinelli; Gabriella Sozzi; Filippo G M de Braud; Marina Chiara Garassino; Marcello Restelli; Alessandra Laura Giulia Pedrocchi; Francesco Trovo'
Journal:  Cancers (Basel)       Date:  2022-01-16       Impact factor: 6.639

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.