Literature DB >> 34245143

Performance evaluation of transcriptomics data normalization for survival risk prediction.

Ai Ni1, Li-Xuan Qin2.   

Abstract

One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. However, little research has been done to evaluate their performance in the setting of survival outcome prediction, an important analysis goal for transcriptomics data in biomedical research. Leveraging a unique pair of datasets for the same set of tumor samples-one with handling effects and the other without, we developed a benchmarking tool for conducting such an evaluation in microRNA microarrays. We applied this tool to evaluate the performance of three popular normalization methods-quantile normalization, median normalization and variance stabilizing normalization-in survival prediction using various approaches for model building and designs for sample assignment. We showed that handling effects can have a strong impact on survival prediction and that quantile normalization, a most popular method in current practice, tends to underperform median normalization and variance stabilizing normalization. We demonstrated with a small example the reason for quantile normalization's poor performance in this setting. Our finding highlights the importance of putting normalization evaluation in the context of the downstream analysis setting and the potential of improving the development of survival predictors by applying median normalization. We make available our benchmarking tool for performing such evaluation on additional normalization methods in connection with prediction modeling approaches.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  data normalization; handling effects; microRNA microarray; penalized Cox regression; survival prediction; transcriptomics data

Mesh:

Substances:

Year:  2021        PMID: 34245143      PMCID: PMC8575026          DOI: 10.1093/bib/bbab257

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  37 in total

Review 1.  The functions of animal microRNAs.

Authors:  Victor Ambros
Journal:  Nature       Date:  2004-09-16       Impact factor: 49.962

2.  Prediction of cancer outcome with microarrays.

Authors:  Andrew H S Lee
Journal:  Lancet       Date:  2005 May 14-20       Impact factor: 79.321

3.  Power calculations for preclinical studies using a K-sample rank test and the Lehmann alternative hypothesis.

Authors:  Glenn Heller
Journal:  Stat Med       Date:  2006-08-15       Impact factor: 2.373

4.  Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors:  W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal:  Biostatistics       Date:  2006-04-21       Impact factor: 5.899

5.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

Review 6.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.

Authors:  F E Harrell; K L Lee; D B Mark
Journal:  Stat Med       Date:  1996-02-28       Impact factor: 2.373

7.  MicroRNA array normalization: an evaluation using a randomized dataset as the benchmark.

Authors:  Li-Xuan Qin; Qin Zhou
Journal:  PLoS One       Date:  2014-06-06       Impact factor: 3.240

8.  Novel molecular subgroups for clinical classification and outcome prediction in childhood medulloblastoma: a cohort study.

Authors:  Edward C Schwalbe; Janet C Lindsey; Sirintra Nakjang; Stephen Crosier; Amanda J Smith; Debbie Hicks; Gholamreza Rafiee; Rebecca M Hill; Alice Iliasova; Thomas Stone; Barry Pizer; Antony Michalski; Abhijit Joshi; Stephen B Wharton; Thomas S Jacques; Simon Bailey; Daniel Williamson; Steven C Clifford
Journal:  Lancet Oncol       Date:  2017-05-22       Impact factor: 41.316

9.  Predicting cancer outcomes from histology and genomics using convolutional networks.

Authors:  Pooya Mobadersany; Safoora Yousefi; Mohamed Amgad; David A Gutman; Jill S Barnholtz-Sloan; José E Velázquez Vega; Daniel J Brat; Lee A D Cooper
Journal:  Proc Natl Acad Sci U S A       Date:  2018-03-12       Impact factor: 11.205

10.  Study design and data analysis considerations for the discovery of prognostic molecular biomarkers: a case study of progression free survival in advanced serous ovarian cancer.

Authors:  Li-Xuan Qin; Douglas A Levine
Journal:  BMC Med Genomics       Date:  2016-06-10       Impact factor: 3.063

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.