Literature DB >> 28961917

Towards clinically more relevant dissection of patient heterogeneity via survival-based Bayesian clustering.

Ashar Ahmad1, Holger Fröhlich1,2.   

Abstract

MOTIVATION: Discovery of clinically relevant disease sub-types is of prime importance in personalized medicine. Disease sub-type identification has in the past often been explored in an unsupervised machine learning paradigm which involves clustering of patients based on available-omics data, such as gene expression. A follow-up analysis involves determining the clinical relevance of the molecular sub-types such as that reflected by comparing their disease progressions. The above methodology, however, fails to guarantee the separability of the sub-types based on their subtype-specific survival curves.
RESULTS: We propose a new algorithm, Survival-based Bayesian Clustering (SBC) which simultaneously clusters heterogeneous-omics and clinical end point data (time to event) in order to discover clinically relevant disease subtypes. For this purpose we formulate a novel Hierarchical Bayesian Graphical Model which combines a Dirichlet Process Gaussian Mixture Model with an Accelerated Failure Time model. In this way we make sure that patients are grouped in the same cluster only when they show similar characteristics with respect to molecular features across data types (e.g. gene expression, mi-RNA) as well as survival times. We extensively test our model in simulation studies and apply it to cancer patient data from the Breast Cancer dataset and The Cancer Genome Atlas repository. Notably, our method is not only able to find clinically relevant sub-groups, but is also able to predict cluster membership and survival on test data in a better way than other competing methods.
AVAILABILITY AND IMPLEMENTATION: Our R-code can be accessed as https://github.com/ashar799/SBC. CONTACT: ashar@bit.uni-bonn.de. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28961917     DOI: 10.1093/bioinformatics/btx464

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Assisted gene expression-based clustering with AWNCut.

Authors:  Yang Li; Ruofan Bie; Sebastian J Teran Hidalgo; Yichen Qin; Mengyun Wu; Shuangge Ma
Journal:  Stat Med       Date:  2018-08-09       Impact factor: 2.373

Review 2.  Multi-omic and multi-view clustering algorithms: review and cancer benchmark.

Authors:  Nimrod Rappoport; Ron Shamir
Journal:  Nucleic Acids Res       Date:  2018-11-16       Impact factor: 16.971

3.  Measuring Patient Similarity on Multiple Diseases by Joint Learning via a Convolutional Neural Network.

Authors:  Sang Ho Oh; Seunghwa Back; Jongyoul Park
Journal:  Sensors (Basel)       Date:  2021-12-25       Impact factor: 3.576

4.  SMRT: Randomized Data Transformation for Cancer Subtyping and Big Data Analysis.

Authors:  Hung Nguyen; Duc Tran; Bang Tran; Monikrishna Roy; Adam Cassell; Sergiu Dascalu; Sorin Draghici; Tin Nguyen
Journal:  Front Oncol       Date:  2021-10-20       Impact factor: 6.244

5.  Combining Pathway Identification and Breast Cancer Survival Prediction via Screening-Network Methods.

Authors:  Antonella Iuliano; Annalisa Occhipinti; Claudia Angelini; Italia De Feis; Pietro Liò
Journal:  Front Genet       Date:  2018-06-14       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.