Xi Chen1, Xu Shi1, Leena Hilakivi-Clarke2, Ayesha N Shajahan-Haq2, Robert Clarke2, Jianhua Xuan1. 1. Bradley Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, VA 22203, USA. 2. Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC 20057, USA.
Abstract
MOTIVATION: Whole genome DNA-sequencing (WGS) of paired tumor and normal samples has enabled the identification of somatic DNA changes in an unprecedented detail. Large-scale identification of somatic structural variations (SVs) for a specific cancer type will deepen our understanding of driver mechanisms in cancer progression. However, the limited number of WGS samples, insufficient read coverage, and the impurity of tumor samples that contain normal and neoplastic cells, limit reliable and accurate detection of somatic SVs. RESULTS: We present a novel pattern-based probabilistic approach, PSSV, to identify somatic structural variations from WGS data. PSSV features a mixture model with hidden states representing different mutation patterns; PSSV can thus differentiate heterozygous and homozygous SVs in each sample, enabling the identification of those somatic SVs with heterozygous mutations in normal samples and homozygous mutations in tumor samples. Simulation studies demonstrate that PSSV outperforms existing tools. PSSV has been successfully applied to breast cancer data to identify somatic SVs of key factors associated with breast cancer development. AVAILABILITY AND IMPLEMENTATION: An R package of PSSV is available at http://www.cbil.ece.vt.edu/software.htm CONTACT: xuan@vt.eduSupplementary information: Supplementary data are available at Bioinformatics online.
MOTIVATION: Whole genome DNA-sequencing (WGS) of paired tumor and normal samples has enabled the identification of somatic DNA changes in an unprecedented detail. Large-scale identification of somatic structural variations (SVs) for a specific cancer type will deepen our understanding of driver mechanisms in cancer progression. However, the limited number of WGS samples, insufficient read coverage, and the impurity of tumor samples that contain normal and neoplastic cells, limit reliable and accurate detection of somatic SVs. RESULTS: We present a novel pattern-based probabilistic approach, PSSV, to identify somatic structural variations from WGS data. PSSV features a mixture model with hidden states representing different mutation patterns; PSSV can thus differentiate heterozygous and homozygous SVs in each sample, enabling the identification of those somatic SVs with heterozygous mutations in normal samples and homozygous mutations in tumor samples. Simulation studies demonstrate that PSSV outperforms existing tools. PSSV has been successfully applied to breast cancer data to identify somatic SVs of key factors associated with breast cancer development. AVAILABILITY AND IMPLEMENTATION: An R package of PSSV is available at http://www.cbil.ece.vt.edu/software.htm CONTACT: xuan@vt.eduSupplementary information: Supplementary data are available at Bioinformatics online.
Authors: Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo Journal: Genome Res Date: 2010-07-19 Impact factor: 9.043
Authors: Jennifer L Freeman; George H Perry; Lars Feuk; Richard Redon; Steven A McCarroll; David M Altshuler; Hiroyuki Aburatani; Keith W Jones; Chris Tyler-Smith; Matthew E Hurles; Nigel P Carter; Stephen W Scherer; Charles Lee Journal: Genome Res Date: 2006-06-29 Impact factor: 9.043
Authors: Jianmin Wang; Charles G Mullighan; John Easton; Stefan Roberts; Sue L Heatley; Jing Ma; Michael C Rusch; Ken Chen; Christopher C Harris; Li Ding; Linda Holmfeldt; Debbie Payne-Turner; Xian Fan; Lei Wei; David Zhao; John C Obenauer; Clayton Naeve; Elaine R Mardis; Richard K Wilson; James R Downing; Jinghui Zhang Journal: Nat Methods Date: 2011-06-12 Impact factor: 28.547
Authors: Geòrgia Escaramís; Cristian Tornador; Laia Bassaganyas; Raquel Rabionet; Jose M C Tubio; Alexander Martínez-Fundichely; Mario Cáceres; Marta Gut; Stephan Ossowski; Xavier Estivill Journal: PLoS One Date: 2013-05-21 Impact factor: 3.240
Authors: Alexis Christoforides; John D Carpten; Glen J Weiss; Michael J Demeure; Daniel D Von Hoff; David W Craig Journal: BMC Genomics Date: 2013-05-04 Impact factor: 3.969
Authors: B Hirsch; V Endris; S Lassmann; W Weichert; N Pfarr; P Schirmacher; V Kovaleva; M Werner; I Bonzheim; F Fend; J Sperveslage; K Kaulich; A Zacher; G Reifenberger; K Köhrer; S Stepanow; S Lerke; T Mayr; D E Aust; G Baretton; S Weidner; A Jung; T Kirchner; M L Hansmann; L Burbat; E von der Wall; M Dietel; M Hummel Journal: Virchows Arch Date: 2018-01-27 Impact factor: 4.064