Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Test set bias affects reproducibility of gene signatures.

Literature DB >> 25788628

Test set bias affects reproducibility of gene signatures.

Prasad Patil¹, Pierre-Olivier Bachant-Winner², Benjamin Haibe-Kains³, Jeffrey T Leek¹.

Abstract

MOTIVATION: Prior to applying genomic predictors to clinical samples, the genomic data must be properly normalized to ensure that the test set data are comparable to the data upon which the predictor was trained. The most effective normalization methods depend on data from multiple patients. From a biomedical perspective, this implies that predictions for a single patient may change depending on which other patient samples they are normalized with. This test set bias will occur when any cross-sample normalization is used before clinical prediction.
RESULTS: We demonstrate that results from existing gene signatures which rely on normalizing test data may be irreproducible when the patient population changes composition or size using a set of curated, publicly available breast cancer microarray experiments. As an alternative, we examine the use of gene signatures that rely on ranks from the data and show why signatures using rank-based features can avoid test set bias while maintaining highly accurate classification, even across platforms.
AVAILABILITY AND IMPLEMENTATION: The code, data and instructions necessary to reproduce our entire analysis is available at https://github.com/prpatil/testsetbias.

Entities: Disease Species

Mesh：

Substances：

Year: 2015 PMID： 25788628 PMCID： PMC4495301 DOI： 10.1093/bioinformatics/btv157

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

17 in total

1. The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors: Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal: Bioinformatics Date: 2012-01-17 Impact factor: 6.937

Review 2. Microarray data analysis: from disarray to consolidation and consensus.

Authors: David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal: Nat Rev Genet Date: 2006-01 Impact factor: 53.242

3. Frozen robust multiarray analysis (fRMA).

Authors: Matthew N McCall; Benjamin M Bolstad; Rafael A Irizarry
Journal: Biostatistics Date: 2010-01-22 Impact factor: 5.899

4. Gene expression profiling predicts clinical outcome of breast cancer.

Authors: Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal: Nature Date: 2002-01-31 Impact factor: 49.962

5. Use of proteomic patterns in serum to identify ovarian cancer.

Authors: Emanuel F Petricoin; Ali M Ardekani; Ben A Hitt; Peter J Levine; Vincent A Fusaro; Seth M Steinberg; Gordon B Mills; Charles Simone; David A Fishman; Elise C Kohn; Lance A Liotta
Journal: Lancet Date: 2002-02-16 Impact factor: 79.321

6. Diagnosis of multiple cancer types by shrunken centroids of gene expression.

Authors: Robert Tibshirani; Trevor Hastie; Balasubramanian Narasimhan; Gilbert Chu
Journal: Proc Natl Acad Sci U S A Date: 2002-05-14 Impact factor: 11.205

7. Taming the dragon: genomic biomarkers to individualize the treatment of cancer.

Authors: Ian J Majewski; René Bernards
Journal: Nat Med Date: 2011-03 Impact factor: 53.440

8. Converting a breast cancer microarray signature into a high-throughput diagnostic test.

Authors: Annuska M Glas; Arno Floore; Leonie J M J Delahaye; Anke T Witteveen; Rob C F Pover; Niels Bakx; Jaana S T Lahti-Domenici; Tako J Bruinsma; Marc O Warmoes; René Bernards; Lodewyk F A Wessels; Laura J Van't Veer
Journal: BMC Genomics Date: 2006-10-30 Impact factor: 3.969

9. Challenges in projecting clustering results across gene expression-profiling datasets.

Authors: Lara Lusa; Lisa M McShane; James F Reid; Loris De Cecco; Federico Ambrogi; Elia Biganzoli; Manuela Gariboldi; Marco A Pierotti
Journal: J Natl Cancer Inst Date: 2007-11-13 Impact factor: 13.506

10. Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors: Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal: J Clin Oncol Date: 2009-02-09 Impact factor: 44.544

38 in total

1. Low Recombination Proficiency Score (RPS) Predicts Heightened Sensitivity to DNA-Damaging Chemotherapy in Breast Cancer.

Authors: Sean P Pitroda; Riyue Bao; Jorge Andrade; Ralph R Weichselbaum; Philip P Connell
Journal: Clin Cancer Res Date: 2017-03-24 Impact factor: 12.531

2. Training replicable predictors in multiple studies.

Authors: Prasad Patil; Giovanni Parmigiani
Journal: Proc Natl Acad Sci U S A Date: 2018-03-12 Impact factor: 11.205

3. Exome-capture RNA sequencing of decade-old breast cancers and matched decalcified bone metastases.

Authors: Nolan Priedigkeit; Rebecca J Watters; Peter C Lucas; Ahmed Basudan; Rohit Bhargava; William Horne; Jay K Kolls; Zhou Fang; Margaret Q Rosenzweig; Adam M Brufsky; Kurt R Weiss; Steffi Oesterreich; Adrian V Lee
Journal: JCI Insight Date: 2017-09-07

4. A qualitative transcriptional signature to reclassify histological grade of ER-positive breast cancer patients.

Authors: Jing Li; Wenbin Jiang; Qirui Liang; Guanghao Liu; Yupeng Dai; Hailong Zheng; Jing Yang; Hao Cai; Guo Zheng
Journal: BMC Genomics Date: 2020-04-06 Impact factor: 3.969

5. Development and verification of a personalized immune prognostic feature in breast cancer.

Authors: HongLei Wang; Li Wu; HongTao Wang
Journal: Exp Biol Med (Maywood) Date: 2020-06-29

6. Consensus on Molecular Subtypes of High-Grade Serous Ovarian Carcinoma.

Authors: Gregory M Chen; Lavanya Kannan; Ludwig Geistlinger; Victor Kofia; Zhaleh Safikhani; Deena M A Gendoo; Giovanni Parmigiani; Michael Birrer; Benjamin Haibe-Kains; Levi Waldron
Journal: Clin Cancer Res Date: 2018-07-03 Impact factor: 12.531

7. Purity Independent Subtyping of Tumors (PurIST), A Clinically Robust, Single-sample Classifier for Tumor Subtyping in Pancreatic Cancer.

Authors: Naim U Rashid; Xianlu L Peng; Chong Jin; Richard A Moffitt; Keith E Volmar; Brian A Belt; Roheena Z Panni; Timothy M Nywening; Silvia G Herrera; Kristin J Moore; Sarah G Hennessey; Ashley B Morrison; Ryan Kawalerski; Apoorve Nayyar; Audrey E Chang; Benjamin Schmidt; Hong Jin Kim; David C Linehan; Jen Jen Yeh
Journal: Clin Cancer Res Date: 2019-11-21 Impact factor: 12.531

8. PAM50 Molecular Intrinsic Subtypes in the Nurses' Health Study Cohorts.

Authors: Kevin H Kensler; Venkat N Sankar; Jun Wang; Xuehong Zhang; Christopher A Rubadue; Gabrielle M Baker; Joel S Parker; Katherine A Hoadley; Andreea L Stancu; Michael E Pyle; Laura C Collins; David J Hunter; A Heather Eliassen; Susan E Hankinson; Rulla M Tamimi; Yujing J Heng
Journal: Cancer Epidemiol Biomarkers Prev Date: 2018-12-27 Impact factor: 4.254

9. Modeling Between-Study Heterogeneity for Improved Replicability in Gene Signature Selection and Clinical Prediction.

Authors: Naim U Rashid; Quefeng Li; Jen Jen Yeh; Joseph G Ibrahim
Journal: J Am Stat Assoc Date: 2019-10-29 Impact factor: 5.033

10. Protein structure-based gene expression signatures.

Authors: Rayees Rahman; Nicole Zatorski; Jens Hansen; Yuguang Xiong; J G Coen van Hasselt; Eric A Sobie; Marc R Birtwistle; Evren U Azeloglu; Ravi Iyengar; Avner Schlessinger
Journal: Proc Natl Acad Sci U S A Date: 2021-05-11 Impact factor: 11.205