Literature DB >> 31995185

Refgenie: a reference genome resource manager.

Michał Stolarczyk1, Vincent P Reuter1, Jason P Smith1,2, Neal E Magee3, Nathan C Sheffield1,2,4,5.   

Abstract

BACKGROUND: Reference genome assemblies are essential for high-throughput sequencing analysis projects. Typically, genome assemblies are stored on disk alongside related resources; e.g., many sequence aligners require the assembly to be indexed. The resulting indexes are broadly applicable for downstream analysis, so it makes sense to share them. However, there is no simple tool to do this.
RESULTS: Here, we introduce refgenie, a reference genome assembly asset manager. Refgenie makes it easier to organize, retrieve, and share genome analysis resources. In addition to genome indexes, refgenie can manage any files related to reference genomes, including sequences and annotation files. Refgenie includes a command line interface and a server application that provides a RESTful API, so it is useful for both tool development and analysis.
CONCLUSIONS: Refgenie streamlines sharing genome analysis resources among groups and across computing environments. Refgenie is available at https://refgenie.databio.org.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  data management; data portability; reference assemblies; reference genomes

Year:  2020        PMID: 31995185      PMCID: PMC6988606          DOI: 10.1093/gigascience/giz149

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  20 in total

1.  Indexing huge genome sequences for solving various problems.

Authors:  K Sadakane; T Shibuya
Journal:  Genome Inform       Date:  2001

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

3.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

4.  GENCODE: the reference human genome annotation for The ENCODE Project.

Authors:  Jennifer Harrow; Adam Frankish; Jose M Gonzalez; Electra Tapanari; Mark Diekhans; Felix Kokocinski; Bronwen L Aken; Daniel Barrell; Amonida Zadissa; Stephen Searle; If Barnes; Alexandra Bignell; Veronika Boychenko; Toby Hunt; Mike Kay; Gaurab Mukherjee; Jeena Rajan; Gloria Despacio-Reyes; Gary Saunders; Charles Steward; Rachel Harte; Michael Lin; Cédric Howald; Andrea Tanzer; Thomas Derrien; Jacqueline Chrast; Nathalie Walters; Suganthi Balasubramanian; Baikang Pei; Michael Tress; Jose Manuel Rodriguez; Iakes Ezkurdia; Jeltje van Baren; Michael Brent; David Haussler; Manolis Kellis; Alfonso Valencia; Alexandre Reymond; Mark Gerstein; Roderic Guigó; Tim J Hubbard
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

5.  Modernizing reference genome assemblies.

Authors:  Deanna M Church; Valerie A Schneider; Tina Graves; Katherine Auger; Fiona Cunningham; Nathan Bouk; Hsiu-Chuan Chen; Richa Agarwala; William M McLaren; Graham R S Ritchie; Derek Albracht; Milinn Kremitzki; Susan Rock; Holland Kotkiewicz; Colin Kremitzki; Aye Wollam; Lee Trani; Lucinda Fulton; Robert Fulton; Lucy Matthews; Siobhan Whitehead; Will Chow; James Torrance; Matthew Dunn; Glenn Harden; Glen Threadgold; Jonathan Wood; Joanna Collins; Paul Heath; Guy Griffiths; Sarah Pelan; Darren Grafham; Evan E Eichler; George Weinstock; Elaine R Mardis; Richard K Wilson; Kerstin Howe; Paul Flicek; Tim Hubbard
Journal:  PLoS Biol       Date:  2011-07-05       Impact factor: 8.029

6.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.

Authors:  Kim D Pruitt; Tatiana Tatusova; Garth R Brown; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2011-11-24       Impact factor: 16.971

7.  Wrangling Galaxy's reference data.

Authors:  Daniel Blankenberg; James E Johnson; James Taylor; Anton Nekrutenko
Journal:  Bioinformatics       Date:  2014-02-28       Impact factor: 6.937

8.  Eleven quick tips to build a usable REST API for life sciences.

Authors:  Aleksandra Tarkowska; Denise Carvalho-Silva; Charles E Cook; Edd Turner; Robert D Finn; Andrew D Yates
Journal:  PLoS Comput Biol       Date:  2018-12-13       Impact factor: 4.475

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  Dissemination of scientific software with Galaxy ToolShed.

Authors:  Daniel Blankenberg; Gregory Von Kuster; Emil Bouvier; Dannon Baker; Enis Afgan; Nicholas Stoler; James Taylor; Anton Nekrutenko
Journal:  Genome Biol       Date:  2014-02-20       Impact factor: 13.583

View more
  10 in total

1.  Expanding the Galaxy's reference data.

Authors:  Nagampalli VijayKrishna; Jayadev Joshi; Nate Coraor; Jennifer Hillman-Jackson; Dave Bouvier; Marius van den Beek; Ignacio Eguinoa; Frederik Coppens; John Davis; Michał Stolarczyk; Nathan C Sheffield; Simon Gladman; Gianmauro Cuccuru; Björn Grüning; Nicola Soranzo; Helena Rasche; Bradley W Langhorst; Matthias Bernt; Dan Fornika; David Anderson de Lima Morais; Michel Barrette; Peter van Heusden; Mauro Petrillo; Antonio Puertas-Gallardo; Alex Patak; Hans-Rudolf Hotz; Daniel Blankenberg
Journal:  Bioinform Adv       Date:  2022-04-29

2.  Chromatin conformation capture (Hi-C) sequencing of patient-derived xenografts: analysis guidelines.

Authors:  Mikhail G Dozmorov; Katarzyna M Tyc; Nathan C Sheffield; David C Boyd; Amy L Olex; Jason Reed; J Chuck Harrell
Journal:  Gigascience       Date:  2021-04-21       Impact factor: 6.524

3.  Streamlining differential exon and 3' UTR usage with diffUTR.

Authors:  Stefan Gerber; Gerhard Schratt; Pierre-Luc Germain
Journal:  BMC Bioinformatics       Date:  2021-04-13       Impact factor: 3.169

4.  PEPPRO: quality control and processing of nascent RNA profiling data.

Authors:  Jason P Smith; Arun B Dutta; Kizhakke Mattada Sathyan; Michael J Guertin; Nathan C Sheffield
Journal:  Genome Biol       Date:  2021-05-15       Impact factor: 17.906

5.  Linking big biomedical datasets to modular analysis with Portable Encapsulated Projects.

Authors:  Nathan C Sheffield; Michał Stolarczyk; Vincent P Reuter; André F Rendeiro
Journal:  Gigascience       Date:  2021-12-06       Impact factor: 6.524

6.  PEPATAC: an optimized pipeline for ATAC-seq data analysis with serial alignments.

Authors:  Jason P Smith; M Ryan Corces; Jin Xu; Vincent P Reuter; Howard Y Chang; Nathan C Sheffield
Journal:  NAR Genom Bioinform       Date:  2021-11-23

7.  From biomedical cloud platforms to microservices: next steps in FAIR data and analysis.

Authors:  Nathan C Sheffield; Vivien R Bonazzi; Philip E Bourne; Tony Burdett; Timothy Clark; Robert L Grossman; Ola Spjuth; Andrew D Yates
Journal:  Sci Data       Date:  2022-09-08       Impact factor: 8.501

8.  Alignment and mapping methodology influence transcript abundance estimation.

Authors:  Avi Srivastava; Laraib Malik; Hirak Sarkar; Mohsen Zakeri; Fatemeh Almodaresi; Charlotte Soneson; Michael I Love; Carl Kingsford; Rob Patro
Journal:  Genome Biol       Date:  2020-09-07       Impact factor: 13.583

9.  Tximeta: Reference sequence checksums for provenance identification in RNA-seq.

Authors:  Michael I Love; Charlotte Soneson; Peter F Hickey; Lisa K Johnson; N Tessa Pierce; Lori Shepherd; Martin Morgan; Rob Patro
Journal:  PLoS Comput Biol       Date:  2020-02-25       Impact factor: 4.475

10.  Histone H3 lysine 27 acetylation profile undergoes two global shifts in undernourished children and suggests altered one-carbon metabolism.

Authors:  Kristyna Kupkova; Savera J Shetty; Rashidul Haque; William A Petri; David T Auble
Journal:  Clin Epigenetics       Date:  2021-09-26       Impact factor: 6.551

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.