Gary K Chen1, Xiao Chang, Christina Curtis, Kai Wang. 1. Department of Preventive Medicine, Zilkha Neurogenetic Institute and Department of Psychiatry, University of Southern California, Los Angeles, CA 90089, USA.
Abstract
MOTIVATION: The accurate detection of copy number alterations (CNAs) in human genomes is important for understanding susceptibility to cancer and mechanisms of tumor progression. CNA detection in tumors from single nucleotide polymorphism (SNP) genotyping arrays is a challenging problem due to phenomena such as aneuploidy, stromal contamination, genomic waves and intra-tumor heterogeneity, issues that leading methods do not optimally address. RESULTS: Here we introduce methods and software (PennCNV-tumor) for fast and accurate CNA detection using signal intensity data from SNP genotyping arrays. We estimate stromal contamination by applying a maximum likelihood approach over multiple discrete genomic intervals. By conditioning on signal intensity across the genome, our method accounts for both aneuploidy and genomic waves. Finally, our method uses a hidden Markov model to integrate multiple sources of information, including total and allele-specific signal intensity at each SNP, as well as physical maps to make posterior inferences of CNAs. Using real data from cancer cell-lines and patient tumors, we demonstrate substantial improvements in accuracy and computational efficiency compared with existing methods.
MOTIVATION: The accurate detection of copy number alterations (CNAs) in human genomes is important for understanding susceptibility to cancer and mechanisms of tumor progression. CNA detection in tumors from single nucleotide polymorphism (SNP) genotyping arrays is a challenging problem due to phenomena such as aneuploidy, stromal contamination, genomic waves and intra-tumor heterogeneity, issues that leading methods do not optimally address. RESULTS: Here we introduce methods and software (PennCNV-tumor) for fast and accurate CNA detection using signal intensity data from SNP genotyping arrays. We estimate stromal contamination by applying a maximum likelihood approach over multiple discrete genomic intervals. By conditioning on signal intensity across the genome, our method accounts for both aneuploidy and genomic waves. Finally, our method uses a hidden Markov model to integrate multiple sources of information, including total and allele-specific signal intensity at each SNP, as well as physical maps to make posterior inferences of CNAs. Using real data from cancer cell-lines and patienttumors, we demonstrate substantial improvements in accuracy and computational efficiency compared with existing methods.
Authors: Helena Carén; Hanna Kryh; Maria Nethander; Rose-Marie Sjöberg; Catarina Träger; Staffan Nilsson; Jonas Abrahamsson; Per Kogner; Tommy Martinsson Journal: Proc Natl Acad Sci U S A Date: 2010-02-09 Impact factor: 11.205
Authors: Peter Van Loo; Silje H Nordgard; Ole Christian Lingjærde; Hege G Russnes; Inga H Rye; Wei Sun; Victor J Weigman; Peter Marynen; Anders Zetterberg; Bjørn Naume; Charles M Perou; Anne-Lise Børresen-Dale; Vessela N Kristensen Journal: Proc Natl Acad Sci U S A Date: 2010-09-13 Impact factor: 11.205
Authors: Graham R Bignell; Chris D Greenman; Helen Davies; Adam P Butler; Sarah Edkins; Jenny M Andrews; Gemma Buck; Lina Chen; David Beare; Calli Latimer; Sara Widaa; Jonathon Hinton; Ciara Fahey; Beiyuan Fu; Sajani Swamy; Gillian L Dalgliesh; Bin T Teh; Panos Deloukas; Fengtang Yang; Peter J Campbell; P Andrew Futreal; Michael R Stratton Journal: Nature Date: 2010-02-18 Impact factor: 49.962
Authors: Nic Waddell; Jeremy Arnold; Sibylle Cocciardi; Leonard da Silva; Anna Marsh; Joan Riley; Cameron N Johnstone; Mohammed Orloff; Guillaume Assie; Charis Eng; Lynne Reid; Patricia Keith; Max Yan; Stephen Fox; Peter Devilee; Andrew K Godwin; Frans B L Hogervorst; Fergus Couch; Sean Grimmond; James M Flanagan; Kumkum Khanna; Peter T Simpson; Sunil R Lakhani; Georgia Chenevix-Trench Journal: Breast Cancer Res Treat Date: 2009-12-04 Impact factor: 4.872
Authors: Christopher Yau; Dmitri Mouradov; Robert N Jorissen; Stefano Colella; Ghazala Mirza; Graham Steers; Adrian Harris; Jiannis Ragoussis; Oliver Sieber; Christopher C Holmes Journal: Genome Biol Date: 2010-09-21 Impact factor: 13.583
Authors: Chris D Greenman; Graham Bignell; Adam Butler; Sarah Edkins; Jon Hinton; Dave Beare; Sajani Swamy; Thomas Santarius; Lina Chen; Sara Widaa; P Andy Futreal; Michael R Stratton Journal: Biostatistics Date: 2009-10-15 Impact factor: 5.899
Authors: Rameen Beroukhim; Craig H Mermel; Dale Porter; Guo Wei; Soumya Raychaudhuri; Jerry Donovan; Jordi Barretina; Jesse S Boehm; Jennifer Dobson; Mitsuyoshi Urashima; Kevin T Mc Henry; Reid M Pinchback; Azra H Ligon; Yoon-Jae Cho; Leila Haery; Heidi Greulich; Michael Reich; Wendy Winckler; Michael S Lawrence; Barbara A Weir; Kumiko E Tanaka; Derek Y Chiang; Adam J Bass; Alice Loo; Carter Hoffman; John Prensner; Ted Liefeld; Qing Gao; Derek Yecies; Sabina Signoretti; Elizabeth Maher; Frederic J Kaye; Hidefumi Sasaki; Joel E Tepper; Jonathan A Fletcher; Josep Tabernero; José Baselga; Ming-Sound Tsao; Francesca Demichelis; Mark A Rubin; Pasi A Janne; Mark J Daly; Carmelo Nucera; Ross L Levine; Benjamin L Ebert; Stacey Gabriel; Anil K Rustgi; Cristina R Antonescu; Marc Ladanyi; Anthony Letai; Levi A Garraway; Massimo Loda; David G Beer; Lawrence D True; Aikou Okamoto; Scott L Pomeroy; Samuel Singer; Todd R Golub; Eric S Lander; Gad Getz; William R Sellers; Matthew Meyerson Journal: Nature Date: 2010-02-18 Impact factor: 49.962
Authors: Wei Sun; Fred A Wright; Zhengzheng Tang; Silje H Nordgard; Peter Van Loo; Tianwei Yu; Vessela N Kristensen; Charles M Perou Journal: Nucleic Acids Res Date: 2009-07-06 Impact factor: 16.971
Authors: Alex J Cornish; Phuc H Hoang; Sara E Dobbins; Philip J Law; Daniel Chubb; Giulia Orlando; Richard S Houlston Journal: Blood Adv Date: 2019-01-08