
CondiS: A conditional survival distribution-based method for censored data imputation overcoming the hurdle in machine learning-based survival analysis.

Yizhuo Wang, Christopher R Flowers, Ziyi Li, Xuelin Huang.

Abstract

Data analyses by machine learning (ML) algorithms are gaining popularity in biomedical research. When time-to-event data are of interest, censoring is common and needs to be properly addressed. Most ML methods cannot conveniently and appropriately take the censoring information into consideration, potentially leading to inaccurate or biased results. We aim to develop a general-purpose method for imputing censored survival data, facilitating downstream ML analysis. In this study, we propose a novel method of imputing the survival times for censored observations. The proposal is based on their conditional survival distributions (CondiS) derived from Kaplan-Meier estimators. CondiS can replace censored observations with their best approximations from the statistical model, allowing for direct application of ML methods. When covariates are available, we extend CondiS by incorporating the covariate information through ML modeling (CondiS-X), which further improves the accuracy of the imputed survival time. Compared with existing methods with similar purposes, the proposed methods achieved smaller prediction errors and higher concordance with the underlying true survival times in extensive simulation studies. We also demonstrated the usage and advantages of the proposed methods through two real-world cancer datasets. The major advantage of CondiS is that it allows for the direct application of standard ML techniques for analysis once the censored survival times are imputed. We present a user-friendly R package to implement our method, which is a useful tool for ML-based biomedical research in this era of big data.
Copyright © 2022 Elsevier Inc. All rights reserved.
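
The abstract describes imputing censored times from conditional survival distributions derived from Kaplan-Meier estimators, but does not give the formula. As a rough illustration of the idea only, the Python sketch below replaces each censored time c with the conditional mean c + (1/S(c)) * integral of S(t) from c to tau, where S is the Kaplan-Meier estimate and tau is the largest observed time. The restricted-mean choice, the function names (kaplan_meier, km_at, impute_conditional_mean), and the toy data are assumptions made for this example; this is not the authors' R package or its API.

```python
import numpy as np


def kaplan_meier(times, events):
    """Return the distinct event times and the KM survival estimate at each."""
    times = np.asarray(times, dtype=float)
    events = np.asarray(events, dtype=bool)
    event_times = np.unique(times[events])
    surv = []
    s = 1.0
    for t in event_times:
        at_risk = np.sum(times >= t)               # subjects still under observation
        deaths = np.sum((times == t) & events)     # events occurring exactly at t
        s *= 1.0 - deaths / at_risk
        surv.append(s)
    return event_times, np.array(surv)


def km_at(t, event_times, surv):
    """Evaluate the right-continuous KM step function at time t."""
    idx = np.searchsorted(event_times, t, side="right")
    return 1.0 if idx == 0 else float(surv[idx - 1])


def impute_conditional_mean(times, events):
    """Replace censored times by an assumed conditional (restricted) mean.

    Observed event times are returned unchanged. The restriction to the
    largest observed time tau is an assumption of this sketch.
    """
    times = np.asarray(times, dtype=float)
    events = np.asarray(events, dtype=bool)
    event_times, surv = kaplan_meier(times, events)
    tau = times.max()
    imputed = times.copy()
    for i in np.where(~events)[0]:
        c = times[i]
        s_c = km_at(c, event_times, surv)
        if s_c <= 0.0 or c >= tau:
            continue  # nothing to integrate; keep the censoring time
        # Integrate the step function S(t) piecewise from c to tau.
        inner = event_times[(event_times > c) & (event_times < tau)]
        grid = np.concatenate(([c], inner, [tau]))
        heights = np.array([km_at(g, event_times, surv) for g in grid[:-1]])
        area = np.sum(heights * np.diff(grid))
        imputed[i] = c + area / s_c
    return imputed


# Toy data (hypothetical): the second subject is censored at time 3.
toy_times = [2, 3, 4, 5, 8]
toy_events = [1, 0, 1, 1, 1]
toy_imputed = impute_conditional_mean(toy_times, toy_events)
```

In this toy example the censored subject is known to have survived past time 3, and the three later event times (4, 5, 8) are equally weighted by the Kaplan-Meier estimate, so the imputed value is their mean, 17/3. The imputed vector can then be passed to a standard regression learner as an ordinary response, which is the use case the abstract motivates.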

Keywords:  Data censoring; Imputation; Kaplan Meier; Machine learning; Survival analysis

Year: 2022    PMID: 35690348    DOI: 10.1016/j.jbi.2022.104117

Source DB: PubMed    Journal: J Biomed Inform    ISSN: 1532-0464    Impact factor: 8.000


  1 in total

1.  CondiS Web App: Imputation of Censored Lifetimes for Machine Learning-Based Survival Analysis.

Authors:  Yizhuo Wang; Christopher R Flowers; Ziyi Li; Xuelin Huang
Journal: Bioinformatics    Date: 2022-07-08    Impact factor: 6.931

