BACKGROUND: Large-scale high throughput studies using microarray technology have established that copy number variation (CNV) throughout the genome is more frequent than previously thought. Such variation is known to play an important role in the presence and development of phenotypes such as HIV-1 infection and Alzheimer's disease. However, methods for analyzing the complex data produced and identifying regions of CNV are still being refined. RESULTS: We describe the presence of a genome-wide technical artifact, spatial autocorrelation or 'wave', which occurs in a large dataset used to determine the location of CNV across the genome. By removing this artifact we are able to obtain both a more biologically meaningful clustering of the data and an increase in the number of CNVs identified by current calling methods without a major increase in the number of false positives detected. Moreover, removing this artifact is critical for the development of a novel model-based CNV calling algorithm - CNVmix - that uses cross-sample information to identify regions of the genome where CNVs occur. For regions of CNV that are identified by both CNVmix and current methods, we demonstrate that CNVmix is better able to categorize samples into groups that represent copy number gains or losses. CONCLUSION: Removing artifactual 'waves' (which appear to be a general feature of array comparative genomic hybridization (aCGH) datasets) and using cross-sample information when identifying CNVs enables more biological information to be extracted from aCGH experiments designed to investigate copy number variation in normal individuals.
BACKGROUND: Large-scale high throughput studies using microarray technology have established that copy number variation (CNV) throughout the genome is more frequent than previously thought. Such variation is known to play an important role in the presence and development of phenotypes such as HIV-1 infection and Alzheimer's disease. However, methods for analyzing the complex data produced and identifying regions of CNV are still being refined. RESULTS: We describe the presence of a genome-wide technical artifact, spatial autocorrelation or 'wave', which occurs in a large dataset used to determine the location of CNV across the genome. By removing this artifact we are able to obtain both a more biologically meaningful clustering of the data and an increase in the number of CNVs identified by current calling methods without a major increase in the number of false positives detected. Moreover, removing this artifact is critical for the development of a novel model-based CNV calling algorithm - CNVmix - that uses cross-sample information to identify regions of the genome where CNVs occur. For regions of CNV that are identified by both CNVmix and current methods, we demonstrate that CNVmix is better able to categorize samples into groups that represent copy number gains or losses. CONCLUSION: Removing artifactual 'waves' (which appear to be a general feature of array comparative genomic hybridization (aCGH) datasets) and using cross-sample information when identifying CNVs enables more biological information to be extracted from aCGH experiments designed to investigate copy number variation in normal individuals.
Authors: Donald F Conrad; T Daniel Andrews; Nigel P Carter; Matthew E Hurles; Jonathan K Pritchard Journal: Nat Genet Date: 2005-12-04 Impact factor: 38.330
Authors: Heike Fiegler; Richard Redon; Dan Andrews; Carol Scott; Robert Andrews; Carol Carder; Richard Clark; Oliver Dovey; Peter Ellis; Lars Feuk; Lisa French; Paul Hunt; Dimitrios Kalaitzopoulos; James Larkin; Lyndal Montgomery; George H Perry; Bob W Plumb; Keith Porter; Rachel E Rigby; Diane Rigler; Armand Valsesia; Cordelia Langford; Sean J Humphray; Stephen W Scherer; Charles Lee; Matthew E Hurles; Nigel P Carter Journal: Genome Res Date: 2006-11-22 Impact factor: 9.043
Authors: Mark A van de Wiel; Kyung In Kim; Sjoerd J Vosse; Wessel N van Wieringen; Saskia M Wilting; Bauke Ylstra Journal: Bioinformatics Date: 2007-01-31 Impact factor: 6.937
Authors: Sharon J Diskin; Thomas Eck; Joel Greshock; Yael P Mosse; Tara Naylor; Christian J Stoeckert; Barbara L Weber; John M Maris; Gregory R Grant Journal: Genome Res Date: 2006-08-09 Impact factor: 9.043
Authors: Joris A Veltman; Eric F P M Schoenmakers; Bert H Eussen; Irene Janssen; Gerard Merkx; Brigitte van Cleef; Conny M van Ravenswaaij; Han G Brunner; Dominique Smeets; Ad Geurts van Kessel Journal: Am J Hum Genet Date: 2002-04-09 Impact factor: 11.025
Authors: Richard Redon; Shumpei Ishikawa; Karen R Fitch; Lars Feuk; George H Perry; T Daniel Andrews; Heike Fiegler; Michael H Shapero; Andrew R Carson; Wenwei Chen; Eun Kyung Cho; Stephanie Dallaire; Jennifer L Freeman; Juan R González; Mònica Gratacòs; Jing Huang; Dimitrios Kalaitzopoulos; Daisuke Komura; Jeffrey R MacDonald; Christian R Marshall; Rui Mei; Lyndal Montgomery; Kunihiro Nishimura; Kohji Okamura; Fan Shen; Martin J Somerville; Joelle Tchinda; Armand Valsesia; Cara Woodwark; Fengtang Yang; Junjun Zhang; Tatiana Zerjal; Jane Zhang; Lluis Armengol; Donald F Conrad; Xavier Estivill; Chris Tyler-Smith; Nigel P Carter; Hiroyuki Aburatani; Charles Lee; Keith W Jones; Stephen W Scherer; Matthew E Hurles Journal: Nature Date: 2006-11-23 Impact factor: 49.962
Authors: Yoon-ha Lee; Michael Ronemus; Jude Kendall; B Lakshmi; Anthony Leotta; Dan Levy; Diane Esposito; Vladimir Grubor; Kenny Ye; Michael Wigler; Boris Yamrom Journal: Proc Natl Acad Sci U S A Date: 2011-12-29 Impact factor: 11.205
Authors: Martin J Aryee; Zhijin Wu; Christine Ladd-Acosta; Brian Herb; Andrew P Feinberg; Srinivasan Yegnasubramanian; Rafael A Irizarry Journal: Biostatistics Date: 2010-09-21 Impact factor: 5.899
Authors: Lachlan J M Coin; Julian E Asher; Robin G Walters; Julia S El-Sayed Moustafa; Adam J de Smith; Rob Sladek; David J Balding; Philippe Froguel; Alexandra I F Blakemore Journal: Nat Methods Date: 2010-05-30 Impact factor: 28.547
Authors: Clara Bodelon; Svetlana Vinokurova; Joshua N Sampson; Johan A den Boon; Joan L Walker; Mark A Horswill; Keegan Korthauer; Mark Schiffman; Mark E Sherman; Rosemary E Zuna; Jason Mitchell; Xijun Zhang; Joseph F Boland; Anil K Chaturvedi; S Terence Dunn; Michael A Newton; Paul Ahlquist; Sophia S Wang; Nicolas Wentzensen Journal: Carcinogenesis Date: 2015-12-09 Impact factor: 4.944
Authors: Iuliana Ionita-Laza; Nan M Laird; Benjamin A Raby; Scott T Weiss; Christoph Lange Journal: Bioinformatics Date: 2008-08-08 Impact factor: 6.937