
Impact of censoring on learning Bayesian networks in survival modelling.

Ivan Stajduhar, Bojana Dalbelo-Basić, Nikola Bogunović.

Abstract

OBJECTIVE: Bayesian networks are commonly used for presenting uncertainty and covariate interactions in an easily interpretable way. Because of their efficient inference and ability to represent causal relationships, they are an excellent choice for medical decision support systems in diagnosis, treatment, and prognosis. Although good procedures for learning Bayesian networks from data have been defined, their performance in learning from censored survival data has not been widely studied. In this paper, we explore how to use these procedures to learn about possible interactions between prognostic factors and their influence on the variate of interest. We study how censoring affects the probability of learning correct Bayesian network structures. Additionally, we analyse the potential usefulness of the learnt models for predicting the time-independent probability of an event of interest.
METHODS AND MATERIALS: We analysed the influence of censoring with a simulation on synthetic data sampled from randomly generated Bayesian networks. We used two well-known methods for learning Bayesian networks from data: a constraint-based method and a score-based method. We compared the performance of each method under different levels of censoring with that of the naive Bayes classifier and the proportional hazards model. We did additional experiments on several datasets from real-world medical domains. The machine-learning methods treated censored cases in the data as event-free.
RESULTS: We report and compare results for several commonly used model evaluation metrics. On average, the proportional hazards method outperformed other methods in most censoring setups. As part of the simulation study, we also analysed structural similarities of the learnt networks. Heavy censoring, as opposed to no censoring, produces up to a 5% surplus and up to 10% missing total arcs. It also produces up to 50% missing arcs that should originally be connected to the variate of interest.
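The surplus and missing arc figures above can be illustrated with a minimal sketch. The abstract does not say how the authors counted arcs, so this assumes directed arcs are compared as exact (parent, child) pairs and that percentages are taken relative to the number of arcs in the generating network; the function name `compare_arcs` and the toy networks are hypothetical.

```python
def compare_arcs(true_arcs, learnt_arcs, target=None):
    """Compare a learnt arc set with the generating network's arc set.

    Arcs are (parent, child) tuples. A reversed arc counts as both one
    surplus and one missing arc under this (assumed) convention.
    Percentages are relative to the number of true arcs.
    """
    true_set = set(true_arcs)
    learnt_set = set(learnt_arcs)
    if not true_set:
        raise ValueError("the true network must contain at least one arc")

    surplus = learnt_set - true_set   # learnt but not in the true network
    missing = true_set - learnt_set   # in the true network but not learnt

    result = {
        "surplus_pct": 100.0 * len(surplus) / len(true_set),
        "missing_pct": 100.0 * len(missing) / len(true_set),
    }
    if target is not None:
        # Restrict to true arcs that touch the variate of interest.
        true_target = {arc for arc in true_set if target in arc}
        missing_target = true_target - learnt_set
        result["missing_target_pct"] = (
            100.0 * len(missing_target) / len(true_target) if true_target else 0.0
        )
    return result


# Toy example: "T" stands in for the variate of interest.
true_arcs = [("A", "B"), ("B", "T"), ("C", "T"), ("A", "C")]
learnt_arcs = [("A", "B"), ("C", "T"), ("A", "D")]
summary = compare_arcs(true_arcs, learnt_arcs, target="T")
print(summary)
```

In the toy example, one of four true arcs is spurious in the learnt network and two are missing, one of which touches the target.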
CONCLUSION: Presented methods for learning Bayesian networks from data can be used to learn from censored survival data in the presence of light censoring (up to 20%) by treating censored cases as event-free. Given intermediate or heavy censoring, the learnt models become tuned to the majority class and would thus require a different approach.
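A minimal sketch of the labelling convention described in the abstract, where censored cases are treated as event-free, and of why heavier censoring pushes a classifier towards the majority class. The record layout `(time, event_observed)` and the helper names are assumptions for illustration, not the authors' code.

```python
from collections import Counter


def label_event_free(records):
    """Map (time, event_observed) records to binary class labels.

    Observed events become class 1; censored cases are treated as
    event-free (class 0), so the follow-up time is ignored here.
    """
    return [1 if event_observed else 0 for _time, event_observed in records]


def majority_share(labels):
    """Fraction of cases that fall in the most frequent class."""
    counts = Counter(labels)
    return max(counts.values()) / len(labels)


# Hypothetical follow-up data: three of five cases are censored.
records = [(5.0, True), (12.0, False), (3.2, True), (20.0, False), (8.1, False)]
labels = label_event_free(records)
share = majority_share(labels)
print(labels, share)
```

As the censored fraction grows, more cases collapse into class 0 regardless of their true outcome, which is one way to read the conclusion that models learnt under intermediate or heavy censoring become tuned to the majority class.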

Year:  2009        PMID: 19833488     DOI: 10.1016/j.artmed.2009.08.001

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  10 in total

Review 1.  Application of machine learning algorithms for clinical predictive modeling: a data-mining approach in SCT.

Authors:  R Shouval; O Bondi; H Mishan; A Shimoni; R Unger; A Nagler
Journal:  Bone Marrow Transplant       Date:  2013-10-07       Impact factor: 5.483

2.  A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data.

Authors:  Julian Wolfson; Sunayan Bandyopadhyay; Mohamed Elidrisi; Gabriela Vazquez-Benitez; David M Vock; Donald Musgrove; Gediminas Adomavicius; Paul E Johnson; Patrick J O'Connor
Journal:  Stat Med       Date:  2015-05-18       Impact factor: 2.373

3.  A probabilistic analysis of completely excised high-grade soft tissue sarcomas of the extremity: an application of a Bayesian belief network.

Authors:  Jonathan Agner Forsberg; John H Healey; Murray F Brennan
Journal:  Ann Surg Oncol       Date:  2012-04-20       Impact factor: 5.344

4.  Accurate prediction of coronary artery disease using reliable diagnosis system.

Authors:  Indrajit Mandal; N Sairam
Journal:  J Med Syst       Date:  2012-02-12       Impact factor: 4.460

5.  CondiS Web App: Imputation of Censored Lifetimes for Machine Learning-Based Survival Analysis.

Authors:  Yizhuo Wang; Christopher R Flowers; Ziyi Li; Xuelin Huang
Journal:  Bioinformatics       Date:  2022-07-08       Impact factor: 6.931

6.  Adapting machine learning techniques to censored time-to-event health record data: A general-purpose approach using inverse probability of censoring weighting.

Authors:  David M Vock; Julian Wolfson; Sunayan Bandyopadhyay; Gediminas Adomavicius; Paul E Johnson; Gabriela Vazquez-Benitez; Patrick J O'Connor
Journal:  J Biomed Inform       Date:  2016-03-16       Impact factor: 6.317

7.  Modelling survival data to account for model uncertainty: a single model or model averaging?

Authors:  Sri Astuti Thamrin; James M McGree; Kerrie L Mengersen
Journal:  Springerplus       Date:  2013-12-11

8.  Learning rule sets from survival data.

Authors:  Łukasz Wróbel; Adam Gudyś; Marek Sikora
Journal:  BMC Bioinformatics       Date:  2017-05-30       Impact factor: 3.169

9.  A novel dynamic Bayesian network approach for data mining and survival data analysis.

Authors:  Ali Sheidaei; Abbas Rahimi Foroushani; Kimiya Gohari; Hojjat Zeraati
Journal:  BMC Med Inform Decis Mak       Date:  2022-09-22       Impact factor: 3.298

10.  Application of a novel hybrid algorithm of Bayesian network in the study of hyperlipidemia related factors: a cross-sectional study.

Authors:  Xuchun Wang; Jinhua Pan; Zeping Ren; Mengmeng Zhai; Zhuang Zhang; Hao Ren; Weimei Song; Yuling He; Chenglian Li; Xiaojuan Yang; Meichen Li; Dichen Quan; Limin Chen; Lixia Qiu
Journal:  BMC Public Health       Date:  2021-07-12       Impact factor: 3.295

