Camir Ricketts1,2, Daniel Seidman1, Victoria Popic3, Fereydoun Hormozdiari4, Serafim Batzoglou3, Iman Hajirasouliha2. 1. Tri-Institutional Training Program in Computational Biology & Medicine, New York, NY 10065, USA. 2. Department of Physiology and Biophysics, Institute for Computational Biomedicine, Englander Institute for Precision Medicine, The Meyer Cancer Center, Weill Cornell Medicine of Cornell University, New York, NY 10021, USA. 3. Department of Computer Science, Stanford University, Stanford, CA 94305, USA. 4. Department of Biochemistry and Molecular Medicine, MIND Institute and Genome Center, University of California, Davis, CA 95616, USA.
Abstract
MOTIVATION: We propose Meltos, a novel computational framework to address the challenging problem of building tumor phylogeny trees using somatic structural variants (SVs) among multiple samples. Meltos leverages the tumor phylogeny tree built on somatic single nucleotide variants (SNVs) to identify high confidence SVs and produce a comprehensive tumor lineage tree, using a novel optimization formulation. While we do not assume the evolutionary progression of SVs is necessarily the same as SNVs, we show that a tumor phylogeny tree using high-quality somatic SNVs can act as a guide for calling and assigning somatic SVs on a tree. Meltos utilizes multiple genomic read signals for potential SV breakpoints in whole genome sequencing data and proposes a probabilistic formulation for estimating variant allele fractions (VAFs) of SV events. RESULTS: In order to assess the ability of Meltos to correctly refine SNV trees with SV information, we tested Meltos on two simulated datasets with five genomes in both. We also assessed Meltos on two real cancer datasets. We tested Meltos on multiple samples from a liposarcoma tumor and on a multi-sample breast cancer data (Yates et al., 2015), where the authors provide validated structural variation events together with deep, targeted sequencing for a collection of somatic SNVs. We show Meltos has the ability to place high confidence validated SV calls on a refined tumor phylogeny tree. We also showed the flexibility of Meltos to either estimate VAFs directly from genomic data or to use copy number corrected estimates. AVAILABILITY AND IMPLEMENTATION: Meltos is available at https://github.com/ih-lab/Meltos. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: We propose Meltos, a novel computational framework to address the challenging problem of building tumor phylogeny trees using somatic structural variants (SVs) among multiple samples. Meltos leverages the tumor phylogeny tree built on somatic single nucleotide variants (SNVs) to identify high confidence SVs and produce a comprehensive tumor lineage tree, using a novel optimization formulation. While we do not assume the evolutionary progression of SVs is necessarily the same as SNVs, we show that a tumor phylogeny tree using high-quality somatic SNVs can act as a guide for calling and assigning somatic SVs on a tree. Meltos utilizes multiple genomic read signals for potential SV breakpoints in whole genome sequencing data and proposes a probabilistic formulation for estimating variant allele fractions (VAFs) of SV events. RESULTS: In order to assess the ability of Meltos to correctly refine SNV trees with SV information, we tested Meltos on two simulated datasets with five genomes in both. We also assessed Meltos on two real cancer datasets. We tested Meltos on multiple samples from a liposarcoma tumor and on a multi-sample breast cancer data (Yates et al., 2015), where the authors provide validated structural variation events together with deep, targeted sequencing for a collection of somatic SNVs. We show Meltos has the ability to place high confidence validated SV calls on a refined tumor phylogeny tree. We also showed the flexibility of Meltos to either estimate VAFs directly from genomic data or to use copy number corrected estimates. AVAILABILITY AND IMPLEMENTATION: Meltos is available at https://github.com/ih-lab/Meltos. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Srinivas R Viswanathan; Gavin Ha; Andreas M Hoff; Jeremiah A Wala; Jian Carrot-Zhang; Christopher W Whelan; Nicholas J Haradhvala; Samuel S Freeman; Sarah C Reed; Justin Rhoades; Paz Polak; Michelle Cipicchio; Stephanie A Wankowicz; Alicia Wong; Tushar Kamath; Zhenwei Zhang; Gregory J Gydush; Denisse Rotem; J Christopher Love; Gad Getz; Stacey Gabriel; Cheng-Zhong Zhang; Scott M Dehm; Peter S Nelson; Eliezer M Van Allen; Atish D Choudhury; Viktor A Adalsteinsson; Rameen Beroukhim; Mary-Ellen Taplin; Matthew Meyerson Journal: Cell Date: 2018-06-18 Impact factor: 41.582
Authors: A Sorana Morrissy; Livia Garzia; David J H Shih; Scott Zuyderduyn; Xi Huang; Patryk Skowron; Marc Remke; Florence M G Cavalli; Vijay Ramaswamy; Patricia E Lindsay; Salomeh Jelveh; Laura K Donovan; Xin Wang; Betty Luu; Kory Zayne; Yisu Li; Chelsea Mayoh; Nina Thiessen; Eloi Mercier; Karen L Mungall; Yusanne Ma; Kane Tse; Thomas Zeng; Karey Shumansky; Andrew J L Roth; Sohrab Shah; Hamza Farooq; Noriyuki Kijima; Borja L Holgado; John J Y Lee; Stuart Matan-Lithwick; Jessica Liu; Stephen C Mack; Alex Manno; K A Michealraj; Carolina Nor; John Peacock; Lei Qin; Juri Reimand; Adi Rolider; Yuan Y Thompson; Xiaochong Wu; Trevor Pugh; Adrian Ally; Mikhail Bilenky; Yaron S N Butterfield; Rebecca Carlsen; Young Cheng; Eric Chuah; Richard D Corbett; Noreen Dhalla; An He; Darlene Lee; Haiyan I Li; William Long; Michael Mayo; Patrick Plettner; Jenny Q Qian; Jacqueline E Schein; Angela Tam; Tina Wong; Inanc Birol; Yongjun Zhao; Claudia C Faria; José Pimentel; Sofia Nunes; Tarek Shalaby; Michael Grotzer; Ian F Pollack; Ronald L Hamilton; Xiao-Nan Li; Anne E Bendel; Daniel W Fults; Andrew W Walter; Toshihiro Kumabe; Teiji Tominaga; V Peter Collins; Yoon-Jae Cho; Caitlin Hoffman; David Lyden; Jeffrey H Wisoff; James H Garvin; Duncan S Stearns; Luca Massimi; Ulrich Schüller; Jaroslav Sterba; Karel Zitterbart; Stephanie Puget; Olivier Ayrault; Sandra E Dunn; Daniela P C Tirapelli; Carlos G Carlotti; Helen Wheeler; Andrew R Hallahan; Wendy Ingram; Tobey J MacDonald; Jeffrey J Olson; Erwin G Van Meir; Ji-Yeoun Lee; Kyu-Chang Wang; Seung-Ki Kim; Byung-Kyu Cho; Torsten Pietsch; Gudrun Fleischhack; Stephan Tippelt; Young Shin Ra; Simon Bailey; Janet C Lindsey; Steven C Clifford; Charles G Eberhart; Michael K Cooper; Roger J Packer; Maura Massimino; Maria Luisa Garre; Ute Bartels; Uri Tabori; Cynthia E Hawkins; Peter Dirks; Eric Bouffet; James T Rutka; Robert J Wechsler-Reya; William A Weiss; Lara S Collier; Adam J Dupuy; Andrey Korshunov; David T W Jones; Marcel Kool; Paul A Northcott; Stefan M Pfister; David A Largaespada; Andrew J Mungall; Richard A Moore; Nada Jabado; Gary D Bader; Steven J M Jones; David Malkin; Marco A Marra; Michael D Taylor Journal: Nature Date: 2016-01-13 Impact factor: 49.962
Authors: Marek Cmero; Ke Yuan; Cheng Soon Ong; Jan Schröder; Niall M Corcoran; Tony Papenfuss; Christopher M Hovens; Florian Markowetz; Geoff Macintyre Journal: Nat Commun Date: 2020-02-05 Impact factor: 14.919