MOTIVATION: Reproducibility analyses of biologically relevant microarray studies have mostly focused on overlap of detected biomarkers or correlation of differential expression evidences across studies. For clinical utility, direct inter-study prediction (i.e. to establish a prediction model in one study and apply to another) for disease diagnosis or prognosis prediction is more important. Normalization plays a key role for such a task. Traditionally, sample-wise normalization has been a standard for inter-array and inter-study normalization. For gene-wise normalization, it has been implemented for intra-study or inter-study predictions in a few papers while its rationale, strategy and effect remain unexplored. RESULTS: In this article, we investigate the effect of gene-wise normalization in microarray inter-study prediction. Gene-specific intensity discrepancies across studies are commonly found even after proper sample-wise normalization. We explore the rationale and necessity of gene-wise normalization. We also show that the ratio of sample sizes in normal versus diseased groups can greatly affect the performance of gene-wise normalization and an analytical method is developed to adjust for the imbalanced ratio effect. Both simulation results and applications to three lung cancer and two prostate cancer data sets, considering both binary classification and survival risk predictions, showed significant and robust improvement of the new adjustment. A calibration scheme is developed to apply the ratio-adjusted gene-wise normalization for prospective clinical trials. The number of calibration samples needed is estimated from existing studies and suggested for future applications. The result has important implication to the translational research of microarray as a practical disease diagnosis and prognosis prediction tool.
MOTIVATION: Reproducibility analyses of biologically relevant microarray studies have mostly focused on overlap of detected biomarkers or correlation of differential expression evidences across studies. For clinical utility, direct inter-study prediction (i.e. to establish a prediction model in one study and apply to another) for disease diagnosis or prognosis prediction is more important. Normalization plays a key role for such a task. Traditionally, sample-wise normalization has been a standard for inter-array and inter-study normalization. For gene-wise normalization, it has been implemented for intra-study or inter-study predictions in a few papers while its rationale, strategy and effect remain unexplored. RESULTS: In this article, we investigate the effect of gene-wise normalization in microarray inter-study prediction. Gene-specific intensity discrepancies across studies are commonly found even after proper sample-wise normalization. We explore the rationale and necessity of gene-wise normalization. We also show that the ratio of sample sizes in normal versus diseased groups can greatly affect the performance of gene-wise normalization and an analytical method is developed to adjust for the imbalanced ratio effect. Both simulation results and applications to three lung cancer and two prostate cancer data sets, considering both binary classification and survival risk predictions, showed significant and robust improvement of the new adjustment. A calibration scheme is developed to apply the ratio-adjusted gene-wise normalization for prospective clinical trials. The number of calibration samples needed is estimated from existing studies and suggested for future applications. The result has important implication to the translational research of microarray as a practical disease diagnosis and prognosis prediction tool.
Authors: Rafael A Irizarry; Benjamin M Bolstad; Francois Collin; Leslie M Cope; Bridget Hobbs; Terence P Speed Journal: Nucleic Acids Res Date: 2003-02-15 Impact factor: 16.971
Authors: Monica Benito; Joel Parker; Quan Du; Junyuan Wu; Dong Xiang; Charles M Perou; J S Marron Journal: Bioinformatics Date: 2004-01-01 Impact factor: 6.937
Authors: Giovanni Parmigiani; Elizabeth S Garrett-Mayer; Ramaswamy Anbazhagan; Edward Gabrielson Journal: Clin Cancer Res Date: 2004-05-01 Impact factor: 12.531
Authors: T R Golub; D K Slonim; P Tamayo; C Huard; M Gaasenbeek; J P Mesirov; H Coller; M L Loh; J R Downing; M A Caligiuri; C D Bloomfield; E S Lander Journal: Science Date: 1999-10-15 Impact factor: 47.728
Authors: Greg Bloom; Ivana V Yang; David Boulware; Ka Yin Kwong; Domenico Coppola; Steven Eschrich; John Quackenbush; Timothy J Yeatman Journal: Am J Pathol Date: 2004-01 Impact factor: 4.307
Authors: Kimberly J Bussey; David Kane; Margot Sunshine; Sudar Narasimhan; Satoshi Nishizuka; William C Reinhold; Barry Zeeberg; Weinstein Ajay; John N Weinstein Journal: Genome Biol Date: 2003-03-25 Impact factor: 13.583
Authors: Stephanie A Mitchell; Kevin M Brown; Michael M Henry; Michelle Mintz; Daniel Catchpoole; Bonnie LaFleur; Dietrich A Stephan Journal: BMC Genomics Date: 2004-09-23 Impact factor: 3.969
Authors: Zhibao Mi; Kui Shen; Nan Song; Chunrong Cheng; Chi Song; Naftali Kaminski; George C Tseng Journal: Bioinformatics Date: 2010-08-17 Impact factor: 6.937
Authors: Patrick Willems; Amna Mhamdi; Simon Stael; Veronique Storme; Pavel Kerchev; Graham Noctor; Kris Gevaert; Frank Van Breusegem Journal: Plant Physiol Date: 2016-05-31 Impact factor: 8.340
Authors: Apurva Bhargava; Ivory Clabaugh; Jenn P To; Bridey B Maxwell; Yi-Hsuan Chiang; G Eric Schaller; Ann Loraine; Joseph J Kieber Journal: Plant Physiol Date: 2013-03-22 Impact factor: 8.340