Literature DB >> 32823283

Large-scale benchmark study of survival prediction methods using multi-omics data.

Moritz Herrmann1, Philipp Probst2, Roman Hornung2, Vindi Jurinovic2, Anne-Laure Boulesteix2.   

Abstract

Multi-omics data, that is, datasets containing different types of high-dimensional molecular variables, are increasingly often generated for the investigation of various diseases. Nevertheless, questions remain regarding the usefulness of multi-omics data for the prediction of disease outcomes such as survival time. It is also unclear which methods are most appropriate to derive such prediction models. We aim to give some answers to these questions through a large-scale benchmark study using real data. Different prediction methods from machine learning and statistics were applied on 18 multi-omics cancer datasets (35 to 1000 observations, up to 100 000 variables) from the database 'The Cancer Genome Atlas' (TCGA). The considered outcome was the (censored) survival time. Eleven methods based on boosting, penalized regression and random forest were compared, comprising both methods that do and that do not take the group structure of the omics variables into account. The Kaplan-Meier estimate and a Cox model using only clinical variables were used as reference methods. The methods were compared using several repetitions of 5-fold cross-validation. Uno's C-index and the integrated Brier score served as performance metrics. The results indicate that methods taking into account the multi-omics structure have a slightly better prediction performance. Taking this structure into account can protect the predictive information in low-dimensional groups-especially clinical variables-from not being exploited during prediction. Moreover, only the block forest method outperformed the Cox model on average, and only slightly. This indicates, as a by-product of our study, that in the considered TCGA studies the utility of multi-omics data for prediction purposes was limited. Contact:moritz.herrmann@stat.uni-muenchen.de, +49 89 2180 3198 Supplementary information: Supplementary data are available at Briefings in Bioinformatics online. All analyses are reproducible using R code freely available on Github.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  benchmark; machine learning; multi-omics data; prediction models; statistics; survival analysis

Year:  2021        PMID: 32823283     DOI: 10.1093/bib/bbaa167

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  16 in total

1.  Benchmark of filter methods for feature selection in high-dimensional gene expression survival data.

Authors:  Andrea Bommert; Thomas Welchowski; Matthias Schmid; Jörg Rahnenführer
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

2.  SurvBenchmark: comprehensive benchmarking study of survival analysis methods using both omics data and clinical data.

Authors:  Yunwei Zhang; Germaine Wong; Graham Mann; Samuel Muller; Jean Y H Yang
Journal:  Gigascience       Date:  2022-07-30       Impact factor: 7.658

3.  Classify multicategory outcome in patients with lung adenocarcinoma using clinical, transcriptomic and clinico-transcriptomic data: machine learning versus multinomial models.

Authors:  Fei Deng; Lanlan Shen; He Wang; Lanjing Zhang
Journal:  Am J Cancer Res       Date:  2020-12-01       Impact factor: 6.166

Review 4.  The Application of Bayesian Methods in Cancer Prognosis and Prediction.

Authors:  Jiadong Chu; N A Sun; Wei Hu; Xuanli Chen; Nengjun Yi; Yueping Shen
Journal:  Cancer Genomics Proteomics       Date:  2022 Jan-Feb       Impact factor: 4.069

Review 5.  State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing.

Authors:  Michal Krassowski; Vivek Das; Sangram K Sahu; Biswapriya B Misra
Journal:  Front Genet       Date:  2020-12-10       Impact factor: 4.599

Review 6.  Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review.

Authors:  Enrico Glaab; Armin Rauschenberger; Rita Banzi; Chiara Gerardi; Paula Garcia; Jacques Demotes
Journal:  BMJ Open       Date:  2021-12-06       Impact factor: 2.692

7.  Comprehensive Analysis of Alteration Landscape and Its Clinical Significance of Mitochondrial Energy Metabolism Pathway-Related Genes in Lung Cancers.

Authors:  Zhen Ye; Huanhuan Zhang; Fanhua Kong; Jing Lan; Shuying Yi; Wenshuang Jia; Shu Zheng; Yuna Guo; Xianquan Zhan
Journal:  Oxid Med Cell Longev       Date:  2021-12-20       Impact factor: 6.543

8.  Exploring Pathway-Based Group Lasso for Cancer Survival Analysis: A Special Case of Multi-Task Learning.

Authors:  Gabriela Malenová; Daniel Rowson; Valentina Boeva
Journal:  Front Genet       Date:  2021-11-29       Impact factor: 4.599

9.  DeepOmix: A scalable and interpretable multi-omics deep learning framework and application in cancer survival analysis.

Authors:  Lianhe Zhao; Qiongye Dong; Chunlong Luo; Yang Wu; Dechao Bu; Xiaoning Qi; Yufan Luo; Yi Zhao
Journal:  Comput Struct Biotechnol J       Date:  2021-05-01       Impact factor: 7.271

10.  A comparison of time to event analysis methods, using weight status and breast cancer as a case study.

Authors:  Georgios Aivaliotis; Jan Palczewski; Rebecca Atkinson; Janet E Cade; Michelle A Morris
Journal:  Sci Rep       Date:  2021-07-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.