Michał Stolarczyk (1), Vincent P Reuter (1), Jason P Smith (1,2), Neal E Magee (3), Nathan C Sheffield (1,2,4,5)
1. Center for Public Health Genomics, University of Virginia, PO Box 800717, Charlottesville, VA, 22908, USA.
2. Department of Biochemistry and Molecular Genetics, University of Virginia, PO Box 800733, Charlottesville, VA, 22908, USA.
3. Research Computing, University of Virginia, 560 Ray C. Hunt Drive, Charlottesville, VA, 22903, USA.
4. Department of Public Health Sciences, University of Virginia, PO Box 800717, Charlottesville, VA, 22908, USA.
5. Department of Biomedical Engineering, University of Virginia, PO Box 400259, Charlottesville, VA, 22904, USA.
Abstract
BACKGROUND: Reference genome assemblies are essential for high-throughput sequencing analysis projects. Typically, genome assemblies are stored on disk alongside related resources; e.g., many sequence aligners require the assembly to be indexed. The resulting indexes are broadly applicable for downstream analysis, so it makes sense to share them. However, there is no simple tool to do this.
RESULTS: Here, we introduce refgenie, a reference genome assembly asset manager. Refgenie makes it easier to organize, retrieve, and share genome analysis resources. In addition to genome indexes, refgenie can manage any files related to reference genomes, including sequences and annotation files. Refgenie includes a command line interface and a server application that provides a RESTful API, so it is useful for both tool development and analysis.
CONCLUSIONS: Refgenie streamlines sharing genome analysis resources among groups and across computing environments. Refgenie is available at https://refgenie.databio.org.