Literature DB >> 21328602

An overview of techniques for linking high-dimensional molecular data to time-to-event endpoints by risk prediction models.

Harald Binder1, Christine Porzelius, Martin Schumacher.   

Abstract

Analysis of molecular data promises identification of biomarkers for improving prognostic models, thus potentially enabling better patient management. For identifying such biomarkers, risk prediction models can be employed that link high-dimensional molecular covariate data to a clinical endpoint. In low-dimensional settings, a multitude of statistical techniques already exists for building such models, e.g. allowing for variable selection or for quantifying the added value of a new biomarker. We provide an overview of techniques for regularized estimation that transfer this toward high-dimensional settings, with a focus on models for time-to-event endpoints. Techniques for incorporating specific covariate structure are discussed, as well as techniques for dealing with more complex endpoints. Employing gene expression data from patients with diffuse large B-cell lymphoma, some typical modeling issues from low-dimensional settings are illustrated in a high-dimensional application. First, the performance of classical stepwise regression is compared to stage-wise regression, as implemented by a component-wise likelihood-based boosting approach. A second issues arises, when artificially transforming the response into a binary variable. The effects of the resulting loss of efficiency and potential bias in a high-dimensional setting are illustrated, and a link to competing risks models is provided. Finally, we discuss conditions for adequately quantifying the added value of high-dimensional gene expression measurements, both at the stage of model fitting and when performing evaluation.
Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21328602     DOI: 10.1002/bimj.201000152

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  8 in total

Review 1.  Big data in medical science--a biostatistical view.

Authors:  Harald Binder; Maria Blettner
Journal:  Dtsch Arztebl Int       Date:  2015-02-27       Impact factor: 5.594

2.  Radiomics in nuclear medicine: robustness, reproducibility, standardization, and how to avoid data analysis traps and replication crisis.

Authors:  Alex Zwanenburg
Journal:  Eur J Nucl Med Mol Imaging       Date:  2019-06-25       Impact factor: 9.236

3.  Assessment of performance of survival prediction models for cancer prognosis.

Authors:  Hung-Chia Chen; Ralph L Kodell; Kuang Fu Cheng; James J Chen
Journal:  BMC Med Res Methodol       Date:  2012-07-23       Impact factor: 4.615

4.  Integrated genomic analysis identifies subclasses and prognosis signatures of kidney cancer.

Authors:  Yann Christinat; Wilhelm Krek
Journal:  Oncotarget       Date:  2015-04-30

5.  Transforming RNA-Seq data to improve the performance of prognostic gene signatures.

Authors:  Isabella Zwiener; Barbara Frisch; Harald Binder
Journal:  PLoS One       Date:  2014-01-08       Impact factor: 3.240

6.  A regression-based differential expression detection algorithm for microarray studies with ultra-low sample size.

Authors:  Daniel Vasiliu; Samuel Clamons; Molly McDonough; Brian Rabe; Margaret Saha
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

7.  Identifying Prognostic SNPs in Clinical Cohorts: Complementing Univariate Analyses by Resampling and Multivariable Modeling.

Authors:  Stefanie Hieke; Axel Benner; Richard F Schlenk; Martin Schumacher; Lars Bullinger; Harald Binder
Journal:  PLoS One       Date:  2016-05-09       Impact factor: 3.240

8.  Combining heterogeneous subgroups with graph-structured variable selection priors for Cox regression.

Authors:  Katrin Madjar; Manuela Zucknick; Katja Ickstadt; Jörg Rahnenführer
Journal:  BMC Bioinformatics       Date:  2021-12-11       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.