Liangcai Zhang1, Li Zhang. 1. Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77230, USA and Department of Biophysics, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China.
Abstract
MOTIVATION: Data quality is a critical issue in the analyses of DNA copy number alterations obtained from microarrays. It is commonly assumed that copy number alteration data can be modeled as piecewise constant and the measurement errors of different probes are independent. However, these assumptions do not always hold in practice. In some published datasets, we find that measurement errors are highly correlated between probes that interrogate nearby genomic loci, and the piecewise-constant model does not fit the data well. The correlated errors cause problems in downstream analysis, leading to a large number of DNA segments falsely identified as having copy number gains and losses. METHOD: We developed a simple tool, called autocorrelation scanning profile, to assess the dependence of measurement error between neighboring probes. RESULTS: Autocorrelation scanning profile can be used to check data quality and refine the analysis of DNA copy number data, which we demonstrate in some typical datasets. CONTACT: lzhangli@mdanderson.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Data quality is a critical issue in the analyses of DNA copy number alterations obtained from microarrays. It is commonly assumed that copy number alteration data can be modeled as piecewise constant and the measurement errors of different probes are independent. However, these assumptions do not always hold in practice. In some published datasets, we find that measurement errors are highly correlated between probes that interrogate nearby genomic loci, and the piecewise-constant model does not fit the data well. The correlated errors cause problems in downstream analysis, leading to a large number of DNA segments falsely identified as having copy number gains and losses. METHOD: We developed a simple tool, called autocorrelation scanning profile, to assess the dependence of measurement error between neighboring probes. RESULTS: Autocorrelation scanning profile can be used to check data quality and refine the analysis of DNA copy number data, which we demonstrate in some typical datasets. CONTACT: lzhangli@mdanderson.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Kai Wang; Mingyao Li; Dexter Hadley; Rui Liu; Joseph Glessner; Struan F A Grant; Hakon Hakonarson; Maja Bucan Journal: Genome Res Date: 2007-10-05 Impact factor: 9.043
Authors: Jarupon Fah Sathirapongsasuti; Hane Lee; Basil A J Horst; Georg Brunner; Alistair J Cochran; Scott Binder; John Quackenbush; Stanley F Nelson Journal: Bioinformatics Date: 2011-08-09 Impact factor: 6.937
Authors: Jennifer Barrow; Martyna Adamowicz-Brice; Maria Cartmill; Donald MacArthur; James Lowe; Keith Robson; Marie-Anne Brundler; David A Walker; Beth Coyle; Richard Grundy Journal: Neuro Oncol Date: 2010-12-07 Impact factor: 12.300
Authors: Jun Wang; Min Lin; Andrew Crenshaw; Amy Hutchinson; Belynda Hicks; Meredith Yeager; Sonja Berndt; Wen-Yi Huang; Richard B Hayes; Stephen J Chanock; Robert C Jones; Ramesh Ramakrishnan Journal: BMC Genomics Date: 2009-11-28 Impact factor: 3.969
Authors: Cheng Li; Rameen Beroukhim; Barbara A Weir; Wendy Winckler; Levi A Garraway; William R Sellers; Matthew Meyerson Journal: BMC Bioinformatics Date: 2008-04-21 Impact factor: 3.169