Abbas A Rizvi (1), Ezgi Karaesmen (1), Martin Morgan (2), Leah Preus (3), Junke Wang (3), Michael Sovic (3), Theresa Hahn (4), Lara E Sucheston-Campbell (3, 5).
1. Division of Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, OH, USA.
2. Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA.
3. Division of Pharmacy Practice and Science, College of Pharmacy, The Ohio State University, Columbus, OH, USA.
4. Department of Medicine, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA.
5. Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, OH 43210, USA.
Abstract
SUMMARY: To address the limited software options for performing survival analyses with millions of SNPs, we developed gwasurvivr, an R/Bioconductor package with a simple interface for conducting genome-wide survival analyses using VCF (output from the Michigan or Sanger imputation servers), IMPUTE2 or PLINK files. To decrease the number of iterations needed for convergence when optimizing the parameter estimates in the Cox model, we modified the R package survival: the covariates in the model are first fit without the SNP, and those parameter estimates are then used as the initial points when each SNP is added to the model. We benchmarked gwasurvivr against other software capable of conducting genome-wide survival analysis (genipe, SurvivalGWAS_SV and GWASTools). gwasurvivr is significantly faster and scales better as the sample size, number of SNPs and number of covariates increase.
AVAILABILITY AND IMPLEMENTATION: gwasurvivr, including source code, documentation and a vignette, is available at: http://bioconductor.org/packages/gwasurvivr.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
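The initial-point strategy described in the summary can be illustrated with a minimal sketch. This is not the gwasurvivr or survival source code: it is a plain-Python Newton-Raphson fit of a Cox model on simulated data, assuming no tied event times and a single covariate. All names here (`cox_derivatives`, `solve`, `fit_cox`, the simulated `age` and `snp` variables) are hypothetical and chosen only for illustration. The covariate-only model is fit once, and its estimate is reused as the starting value when the SNP is added.

```python
import math
import random


def cox_derivatives(beta, times, events, X):
    """Log partial likelihood, gradient and Hessian (assumes no tied event times)."""
    p = len(beta)
    # Visit subjects from latest to earliest time so the risk set accumulates.
    order = sorted(range(len(times)), key=lambda i: -times[i])
    s0, s1 = 0.0, [0.0] * p
    s2 = [[0.0] * p for _ in range(p)]
    loglik, grad = 0.0, [0.0] * p
    hess = [[0.0] * p for _ in range(p)]
    for i in order:
        eta = sum(b * x for b, x in zip(beta, X[i]))
        w = math.exp(eta)
        s0 += w
        for j in range(p):
            s1[j] += w * X[i][j]
            for k in range(p):
                s2[j][k] += w * X[i][j] * X[i][k]
        if events[i]:
            loglik += eta - math.log(s0)
            for j in range(p):
                grad[j] += X[i][j] - s1[j] / s0
                for k in range(p):
                    hess[j][k] -= s2[j][k] / s0 - (s1[j] / s0) * (s1[k] / s0)
    return loglik, grad, hess


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x


def fit_cox(times, events, X, init, tol=1e-8, max_iter=50):
    """Undamped Newton-Raphson from `init`; returns (estimates, iterations used)."""
    beta = list(init)
    for iteration in range(1, max_iter + 1):
        _, grad, hess = cox_derivatives(beta, times, events, X)
        step = solve(hess, grad)  # Newton update: beta - H^{-1} g
        beta = [b - s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < tol:
            break
    return beta, iteration


# Simulated data (illustrative assumption): one covariate and one SNP dosage.
random.seed(1)
n = 300
age = [random.gauss(0.0, 1.0) for _ in range(n)]
snp = [float((random.random() < 0.3) + (random.random() < 0.3)) for _ in range(n)]
event_time = [random.expovariate(math.exp(0.5 * a + 0.2 * g)) for a, g in zip(age, snp)]
censor_time = [random.expovariate(0.3) for _ in range(n)]
times = [min(t, c) for t, c in zip(event_time, censor_time)]
events = [t <= c for t, c in zip(event_time, censor_time)]

# Step 1: fit the covariates once, without any SNP.
beta_cov, _ = fit_cox(times, events, [[a] for a in age], init=[0.0])

# Step 2: for each SNP, start from the covariate estimates and 0 for the SNP.
X_full = [[a, g] for a, g in zip(age, snp)]
cold_beta, cold_iters = fit_cox(times, events, X_full, init=[0.0, 0.0])
warm_beta, warm_iters = fit_cox(times, events, X_full, init=[beta_cov[0], 0.0])

print("cold start:", cold_beta, "iterations:", cold_iters)
print("warm start:", warm_beta, "iterations:", warm_iters)
```

Both starting points should reach the same estimates; the point of the warm start is that step 1 is done once and then shared across millions of SNP-specific fits, each of which begins close to its optimum. The sketch omits safeguards such as step halving and tie handling that a production Cox routine would include.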