Milovan Suvakov (1), Arijit Panda (1), Colin Diesh (2), Ian Holmes (2), Alexej Abyzov (1).
(1) Department of Quantitative Health Sciences, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA.
(2) Department of Bioengineering, University of California, Berkeley, CA 94720, USA.
Abstract
BACKGROUND: Detecting copy number variations (CNVs) and copy number alterations (CNAs) based on whole-genome sequencing data is important for personalized genomics and treatment. CNVnator is one of the most popular tools for CNV/CNA discovery and analysis based on read depth.

FINDINGS: Herein, we present CNVpytor, an extension of CNVnator developed in Python. CNVpytor inherits the reimplemented core engine of its predecessor and extends visualization, modularization, performance, and functionality. Additionally, CNVpytor uses B-allele frequency likelihood information from single-nucleotide polymorphisms and small indels as additional evidence for CNVs/CNAs and as primary information for copy number-neutral losses of heterozygosity.

CONCLUSIONS: CNVpytor is significantly faster than CNVnator, particularly for parsing alignment files (2-20 times faster), and its intermediate files are 20-50 times smaller. CNV calls can be filtered using several criteria, annotated, and merged over multiple samples. The modular architecture allows it to be used in shared and cloud environments such as Google Colab and Jupyter Notebook. Data can be exported into JBrowse, while a lightweight plugin version of CNVpytor for JBrowse enables nearly instant and GUI-assisted analysis of CNVs by any user. The CNVpytor release and source code are available on GitHub at https://github.com/abyzovlab/CNVpytor under the MIT license.
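To illustrate how read depth and B-allele frequency (BAF) can complement each other, the following is a minimal sketch and not CNVpytor's actual algorithm or API. It assumes a diploid baseline, per-SNP `(ref_count, alt_count)` read counts at sites expected to be heterozygous, and two hypothetical thresholds (`cn_tol`, `baf_tol`) chosen only for illustration. CNVpytor itself uses a likelihood model for BAF; the simple mean deviation from 0.5 used here is a stand-in.

```python
from statistics import mean


def copy_number_from_depth(region_depth, baseline_depth, ploidy=2):
    """Estimate copy number as ploidy scaled by the read-depth ratio."""
    return ploidy * region_depth / baseline_depth


def mean_baf_deviation(allele_counts):
    """Mean |BAF - 0.5| over SNPs given as (ref_count, alt_count) pairs."""
    deviations = []
    for ref, alt in allele_counts:
        total = ref + alt
        if total == 0:
            continue  # no coverage at this site, so no BAF estimate
        baf = alt / total
        deviations.append(abs(baf - 0.5))
    return mean(deviations) if deviations else 0.0


def classify_region(region_depth, baseline_depth, allele_counts,
                    cn_tol=0.25, baf_tol=0.15):
    """Toy classifier; cn_tol and baf_tol are illustrative assumptions."""
    cn = copy_number_from_depth(region_depth, baseline_depth)
    if cn < 2 - cn_tol:
        return "deletion"
    if cn > 2 + cn_tol:
        return "duplication"
    # Depth looks diploid: only BAF can reveal a loss of heterozygosity.
    if mean_baf_deviation(allele_counts) > baf_tol:
        return "copy-number-neutral LOH"
    return "normal"


balanced = [(15, 15), (14, 16)]   # BAF near 0.5
skewed = [(27, 3), (2, 28)]       # BAF far from 0.5
print(classify_region(15, 30, balanced))
print(classify_region(30, 30, skewed))
```

The design point is that a region with normal depth but strongly skewed BAF is invisible to a depth-only caller, which is why BAF serves as the primary signal for copy number-neutral events.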