Brian K Mannakee1,2, Uthra Balaji3, Agnieszka K Witkiewicz2,4,5, Ryan N Gutenkunst6, Erik S Knudsen2,4. 1. Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health. 2. University of Arizona Cancer Center, University of Arizona, Tucson, AZ 85721, USA. 3. McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA. 4. Department of Medicine. 5. Department of Pathology. 6. Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA.
Abstract
Motivation: Tumor genome sequencing offers great promise for guiding research and therapy, but spurious variant calls can arise from multiple sources. Mouse contamination can generate many spurious calls when sequencing patient-derived xenografts. Paralogous genome sequences can also generate spurious calls when sequencing any tumor. We developed a BLAST-based algorithm, Mouse And Paralog EXterminator (MAPEX), to identify and filter out spurious calls from both these sources. Results: When calling variants from xenografts, MAPEX has similar sensitivity and specificity to more complex algorithms. When applied to any tumor, MAPEX also automatically flags calls that potentially arise from paralogous sequences. Our implementation, mapexr, runs quickly and easily on a desktop computer. MAPEX is thus a useful addition to almost any pipeline for calling genetic variants in tumors. Availability and implementation: The mapexr package for R is available at https://github.com/bmannakee/mapexr under the MIT license. Contact: mannakee@email.arizona.edu or rgutenk@email.arizona.edu or eknudsen@email.arizona.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: Tumor genome sequencing offers great promise for guiding research and therapy, but spurious variant calls can arise from multiple sources. Mouse contamination can generate many spurious calls when sequencing patient-derived xenografts. Paralogous genome sequences can also generate spurious calls when sequencing any tumor. We developed a BLAST-based algorithm, Mouse And Paralog EXterminator (MAPEX), to identify and filter out spurious calls from both these sources. Results: When calling variants from xenografts, MAPEX has similar sensitivity and specificity to more complex algorithms. When applied to any tumor, MAPEX also automatically flags calls that potentially arise from paralogous sequences. Our implementation, mapexr, runs quickly and easily on a desktop computer. MAPEX is thus a useful addition to almost any pipeline for calling genetic variants in tumors. Availability and implementation: The mapexr package for R is available at https://github.com/bmannakee/mapexr under the MIT license. Contact: mannakee@email.arizona.edu or rgutenk@email.arizona.edu or eknudsen@email.arizona.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo Journal: Genome Res Date: 2010-07-19 Impact factor: 9.043
Authors: Daniel C Koboldt; Qunyuan Zhang; David E Larson; Dong Shen; Michael D McLellan; Ling Lin; Christopher A Miller; Elaine R Mardis; Li Ding; Richard K Wilson Journal: Genome Res Date: 2012-02-02 Impact factor: 9.043
Authors: Garima Khandelwal; María Romina Girotti; Christopher Smowton; Sam Taylor; Christopher Wirth; Marek Dynowski; Kristopher K Frese; Ged Brady; Caroline Dive; Richard Marais; Crispin Miller Journal: Mol Cancer Res Date: 2017-04-25 Impact factor: 5.852
Authors: Claudia I Müller; Carl W Miller; Wolf-K Hofmann; Mitchell E Gross; Christine S Walsh; Norihiko Kawamata; Quang T Luong; H Phillip Koeffler Journal: Leuk Res Date: 2006-06-09 Impact factor: 3.156
Authors: Shinichi Yachida; Siân Jones; Ivana Bozic; Tibor Antal; Rebecca Leary; Baojin Fu; Mihoko Kamiyama; Ralph H Hruban; James R Eshleman; Martin A Nowak; Victor E Velculescu; Kenneth W Kinzler; Bert Vogelstein; Christine A Iacobuzio-Donahue Journal: Nature Date: 2010-10-28 Impact factor: 49.962
Authors: Thomas Conway; Jeremy Wazny; Andrew Bromage; Martin Tymms; Dhanya Sooraj; Elizabeth D Williams; Bryan Beresford-Smith Journal: Bioinformatics Date: 2012-06-15 Impact factor: 6.937
Authors: Michael D Kessler; Nicholas W Bateman; Thomas P Conrads; George L Maxwell; Julie C Dunning Hotopp; Timothy D O'Connor Journal: Cancer Date: 2019-03-13 Impact factor: 6.860
Authors: Marcello Stanzione; Jun Zhong; Edmond Wong; Thomas J LaSalle; Jillian F Wise; Antoine Simoneau; David T Myers; Sarah Phat; Moshe Sade-Feldman; Michael S Lawrence; M Kyle Hadden; Lee Zou; Anna F Farago; Nicholas J Dyson; Benjamin J Drapkin Journal: Sci Adv Date: 2022-05-13 Impact factor: 14.957