Literature DB >> 25788628

Test set bias affects reproducibility of gene signatures.

Prasad Patil1, Pierre-Olivier Bachant-Winner2, Benjamin Haibe-Kains3, Jeffrey T Leek1.   

Abstract

MOTIVATION: Prior to applying genomic predictors to clinical samples, the genomic data must be properly normalized to ensure that the test set data are comparable to the data upon which the predictor was trained. The most effective normalization methods depend on data from multiple patients. From a biomedical perspective, this implies that predictions for a single patient may change depending on which other patient samples they are normalized with. This test set bias will occur when any cross-sample normalization is used before clinical prediction.
RESULTS: We demonstrate that results from existing gene signatures which rely on normalizing test data may be irreproducible when the patient population changes composition or size using a set of curated, publicly available breast cancer microarray experiments. As an alternative, we examine the use of gene signatures that rely on ranks from the data and show why signatures using rank-based features can avoid test set bias while maintaining highly accurate classification, even across platforms.
AVAILABILITY AND IMPLEMENTATION: The code, data and instructions necessary to reproduce our entire analysis is available at https://github.com/prpatil/testsetbias.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25788628      PMCID: PMC4495301          DOI: 10.1093/bioinformatics/btv157

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

Review 2.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

3.  Frozen robust multiarray analysis (fRMA).

Authors:  Matthew N McCall; Benjamin M Bolstad; Rafael A Irizarry
Journal:  Biostatistics       Date:  2010-01-22       Impact factor: 5.899

4.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

5.  Use of proteomic patterns in serum to identify ovarian cancer.

Authors:  Emanuel F Petricoin; Ali M Ardekani; Ben A Hitt; Peter J Levine; Vincent A Fusaro; Seth M Steinberg; Gordon B Mills; Charles Simone; David A Fishman; Elise C Kohn; Lance A Liotta
Journal:  Lancet       Date:  2002-02-16       Impact factor: 79.321

6.  Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Authors:  Robert Tibshirani; Trevor Hastie; Balasubramanian Narasimhan; Gilbert Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-14       Impact factor: 11.205

7.  Taming the dragon: genomic biomarkers to individualize the treatment of cancer.

Authors:  Ian J Majewski; René Bernards
Journal:  Nat Med       Date:  2011-03       Impact factor: 53.440

8.  Converting a breast cancer microarray signature into a high-throughput diagnostic test.

Authors:  Annuska M Glas; Arno Floore; Leonie J M J Delahaye; Anke T Witteveen; Rob C F Pover; Niels Bakx; Jaana S T Lahti-Domenici; Tako J Bruinsma; Marc O Warmoes; René Bernards; Lodewyk F A Wessels; Laura J Van't Veer
Journal:  BMC Genomics       Date:  2006-10-30       Impact factor: 3.969

9.  Challenges in projecting clustering results across gene expression-profiling datasets.

Authors:  Lara Lusa; Lisa M McShane; James F Reid; Loris De Cecco; Federico Ambrogi; Elia Biganzoli; Manuela Gariboldi; Marco A Pierotti
Journal:  J Natl Cancer Inst       Date:  2007-11-13       Impact factor: 13.506

10.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

View more
  38 in total

1.  Low Recombination Proficiency Score (RPS) Predicts Heightened Sensitivity to DNA-Damaging Chemotherapy in Breast Cancer.

Authors:  Sean P Pitroda; Riyue Bao; Jorge Andrade; Ralph R Weichselbaum; Philip P Connell
Journal:  Clin Cancer Res       Date:  2017-03-24       Impact factor: 12.531

2.  Training replicable predictors in multiple studies.

Authors:  Prasad Patil; Giovanni Parmigiani
Journal:  Proc Natl Acad Sci U S A       Date:  2018-03-12       Impact factor: 11.205

3.  Exome-capture RNA sequencing of decade-old breast cancers and matched decalcified bone metastases.

Authors:  Nolan Priedigkeit; Rebecca J Watters; Peter C Lucas; Ahmed Basudan; Rohit Bhargava; William Horne; Jay K Kolls; Zhou Fang; Margaret Q Rosenzweig; Adam M Brufsky; Kurt R Weiss; Steffi Oesterreich; Adrian V Lee
Journal:  JCI Insight       Date:  2017-09-07

4.  A qualitative transcriptional signature to reclassify histological grade of ER-positive breast cancer patients.

Authors:  Jing Li; Wenbin Jiang; Qirui Liang; Guanghao Liu; Yupeng Dai; Hailong Zheng; Jing Yang; Hao Cai; Guo Zheng
Journal:  BMC Genomics       Date:  2020-04-06       Impact factor: 3.969

5.  Development and verification of a personalized immune prognostic feature in breast cancer.

Authors:  HongLei Wang; Li Wu; HongTao Wang
Journal:  Exp Biol Med (Maywood)       Date:  2020-06-29

6.  Consensus on Molecular Subtypes of High-Grade Serous Ovarian Carcinoma.

Authors:  Gregory M Chen; Lavanya Kannan; Ludwig Geistlinger; Victor Kofia; Zhaleh Safikhani; Deena M A Gendoo; Giovanni Parmigiani; Michael Birrer; Benjamin Haibe-Kains; Levi Waldron
Journal:  Clin Cancer Res       Date:  2018-07-03       Impact factor: 12.531

7.  Purity Independent Subtyping of Tumors (PurIST), A Clinically Robust, Single-sample Classifier for Tumor Subtyping in Pancreatic Cancer.

Authors:  Naim U Rashid; Xianlu L Peng; Chong Jin; Richard A Moffitt; Keith E Volmar; Brian A Belt; Roheena Z Panni; Timothy M Nywening; Silvia G Herrera; Kristin J Moore; Sarah G Hennessey; Ashley B Morrison; Ryan Kawalerski; Apoorve Nayyar; Audrey E Chang; Benjamin Schmidt; Hong Jin Kim; David C Linehan; Jen Jen Yeh
Journal:  Clin Cancer Res       Date:  2019-11-21       Impact factor: 12.531

8.  PAM50 Molecular Intrinsic Subtypes in the Nurses' Health Study Cohorts.

Authors:  Kevin H Kensler; Venkat N Sankar; Jun Wang; Xuehong Zhang; Christopher A Rubadue; Gabrielle M Baker; Joel S Parker; Katherine A Hoadley; Andreea L Stancu; Michael E Pyle; Laura C Collins; David J Hunter; A Heather Eliassen; Susan E Hankinson; Rulla M Tamimi; Yujing J Heng
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2018-12-27       Impact factor: 4.254

9.  Modeling Between-Study Heterogeneity for Improved Replicability in Gene Signature Selection and Clinical Prediction.

Authors:  Naim U Rashid; Quefeng Li; Jen Jen Yeh; Joseph G Ibrahim
Journal:  J Am Stat Assoc       Date:  2019-10-29       Impact factor: 5.033

10.  Protein structure-based gene expression signatures.

Authors:  Rayees Rahman; Nicole Zatorski; Jens Hansen; Yuguang Xiong; J G Coen van Hasselt; Eric A Sobie; Marc R Birtwistle; Evren U Azeloglu; Ravi Iyengar; Avner Schlessinger
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-11       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.