Lei Bao1, Minya Pu1, Karen Messer1. 1. Division of Biostatistics, Moores Cancer Center, University of California-San Diego, La Jolla, CA 92093, USA.
Abstract
MOTIVATION: Detection and quantification of the absolute DNA copy number alterations in tumor cells is challenging because the DNA specimen is extracted from a mixture of tumor and normal stromal cells. Estimates of tumor purity and ploidy are necessary to correctly infer copy number, and ploidy may itself be a prognostic factor in cancer progression. As deep sequencing of the exome or genome has become routine for characterization of tumor samples, in this work, we aim to develop a simple and robust algorithm to infer purity, ploidy and absolute copy numbers in whole numbers for tumor cells from sequencing data. RESULTS: A simulation study shows that estimates have reasonable accuracy, and that the algorithm is robust against the presence of segmentation errors and subclonal populations. We validated our algorithm against a panel of cell lines with experimentally determined ploidy. We also compared our algorithm with the well-established single-nucleotide polymorphism array-based method called ABSOLUTE on three sets of tumors of different types. Our method had good performance on these four benchmark datasets for both purity and ploidy estimates, and may offer a simple solution to copy number alteration quantification for cancer sequencing projects. AVAILABILITY AND IMPLEMENTATION: The R package absCNseq is available from http://biostats.mcc.ucsd.edu/files/absCNseq_1.0.tar.gz CONTACT: kmesser@ucsd.edu Supplementary information: Supplementary data are available at Bioinformatics online.
MOTIVATION: Detection and quantification of the absolute DNA copy number alterations in tumor cells is challenging because the DNA specimen is extracted from a mixture of tumor and normal stromal cells. Estimates of tumor purity and ploidy are necessary to correctly infer copy number, and ploidy may itself be a prognostic factor in cancer progression. As deep sequencing of the exome or genome has become routine for characterization of tumor samples, in this work, we aim to develop a simple and robust algorithm to infer purity, ploidy and absolute copy numbers in whole numbers for tumor cells from sequencing data. RESULTS: A simulation study shows that estimates have reasonable accuracy, and that the algorithm is robust against the presence of segmentation errors and subclonal populations. We validated our algorithm against a panel of cell lines with experimentally determined ploidy. We also compared our algorithm with the well-established single-nucleotide polymorphism array-based method called ABSOLUTE on three sets of tumors of different types. Our method had good performance on these four benchmark datasets for both purity and ploidy estimates, and may offer a simple solution to copy number alteration quantification for cancer sequencing projects. AVAILABILITY AND IMPLEMENTATION: The R package absCNseq is available from http://biostats.mcc.ucsd.edu/files/absCNseq_1.0.tar.gz CONTACT: kmesser@ucsd.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Daniel C Koboldt; Qunyuan Zhang; David E Larson; Dong Shen; Michael D McLellan; Ling Lin; Christopher A Miller; Elaine R Mardis; Li Ding; Richard K Wilson Journal: Genome Res Date: 2012-02-02 Impact factor: 9.043
Authors: Anna V Roschke; Giovanni Tonon; Kristen S Gehlhaus; Nicolas McTyre; Kimberly J Bussey; Samir Lababidi; Dominic A Scudiero; John N Weinstein; Ilan R Kirsch Journal: Cancer Res Date: 2003-12-15 Impact factor: 12.701
Authors: Thomas LaFramboise; Barbara A Weir; Xiaojun Zhao; Rameen Beroukhim; Cheng Li; David Harrington; William R Sellers; Matthew Meyerson Journal: PLoS Comput Biol Date: 2005-11-25 Impact factor: 4.475
Authors: Yichen Cheng; James Y Dai; Thomas G Paulson; Xiaoyu Wang; Xiaohong Li; Brian J Reid; Charles Kooperberg Journal: Ann Appl Stat Date: 2017-07-20 Impact factor: 2.083
Authors: Isidro Cortés-Ciriano; Doga C Gulhan; Jake June-Koo Lee; Giorgio E M Melloni; Peter J Park Journal: Nat Rev Genet Date: 2021-12-08 Impact factor: 53.242
Authors: Yi Fu; Guoqiang Yu; Douglas A Levine; Niya Wang; Ie-Ming Shih; Zhen Zhang; Robert Clarke; Yue Wang Journal: Sci Rep Date: 2015-09-09 Impact factor: 4.379