Literature DB >> 33500774

Reproducibly sampling SARS-CoV-2 genomes across time, geography, and viral diversity.

Evan Bolyen1,2, Matthew R Dillon1, Nicholas A Bokulich3, Jason T Ladner4, Brendan B Larsen5, Crystal M Hepp2,4, Darrin Lemmer6, Jason W Sahl4,7, Andrew Sanchez1, Chris Holdgraf8, Chris Sewell9, Aakash G Choudhury10, John Stachurski10, Matthew McKay10, Anthony Simard1, David M Engelthaler6, Michael Worobey5, Paul Keim4,6,7, J Gregory Caporaso1,7.   

Abstract

The COVID-19 pandemic has led to a rapid accumulation of SARS-CoV-2 genomes, enabling genomic epidemiology on local and global scales. Collections of genomes from resources such as GISAID must be subsampled to enable computationally feasible phylogenetic and other analyses. We present genome-sampler, a software package that supports sampling collections of viral genomes across multiple axes including time of genome isolation, location of genome isolation, and viral diversity. The software is modular in design so that these or future sampling approaches can be applied independently and combined (or replaced with a random sampling approach) to facilitate custom workflows and benchmarking. genome-sampler is written as a QIIME 2 plugin, ensuring that its application is fully reproducible through QIIME 2's unique retrospective data provenance tracking system. genome-sampler can be installed in a conda environment on macOS or Linux systems. A complete default pipeline is available through a Snakemake workflow, so subsampling can be achieved using a single command. genome-sampler is open source, free for all to use, and available at https://caporasolab.us/genome-sampler. We hope that this will facilitate SARS-CoV-2 research and support evaluation of viral genome sampling approaches for genomic epidemiology. Copyright:
© 2020 Bolyen E et al.

Entities:  

Keywords:  QIIME 2; SARS-CoV-2; bioinformatics; genome-sampler; genomics

Year:  2020        PMID: 33500774      PMCID: PMC7814287          DOI: 10.12688/f1000research.24751.2

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  9 in total

1.  Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984.

Authors:  A Cornish-Bowden
Journal:  Nucleic Acids Res       Date:  1985-05-10       Impact factor: 16.971

2.  Snakemake--a scalable bioinformatics workflow engine.

Authors:  Johannes Köster; Sven Rahmann
Journal:  Bioinformatics       Date:  2012-08-20       Impact factor: 6.937

3.  Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.

Authors:  Evan Bolyen; Jai Ram Rideout; Matthew R Dillon; Nicholas A Bokulich; Christian C Abnet; Gabriel A Al-Ghalith; Harriet Alexander; Eric J Alm; Manimozhiyan Arumugam; Francesco Asnicar; Yang Bai; Jordan E Bisanz; Kyle Bittinger; Asker Brejnrod; Colin J Brislawn; C Titus Brown; Benjamin J Callahan; Andrés Mauricio Caraballo-Rodríguez; John Chase; Emily K Cope; Ricardo Da Silva; Christian Diener; Pieter C Dorrestein; Gavin M Douglas; Daniel M Durall; Claire Duvallet; Christian F Edwardson; Madeleine Ernst; Mehrbod Estaki; Jennifer Fouquier; Julia M Gauglitz; Sean M Gibbons; Deanna L Gibson; Antonio Gonzalez; Kestrel Gorlick; Jiarong Guo; Benjamin Hillmann; Susan Holmes; Hannes Holste; Curtis Huttenhower; Gavin A Huttley; Stefan Janssen; Alan K Jarmusch; Lingjing Jiang; Benjamin D Kaehler; Kyo Bin Kang; Christopher R Keefe; Paul Keim; Scott T Kelley; Dan Knights; Irina Koester; Tomasz Kosciolek; Jorden Kreps; Morgan G I Langille; Joslynn Lee; Ruth Ley; Yong-Xin Liu; Erikka Loftfield; Catherine Lozupone; Massoud Maher; Clarisse Marotz; Bryan D Martin; Daniel McDonald; Lauren J McIver; Alexey V Melnik; Jessica L Metcalf; Sydney C Morgan; Jamie T Morton; Ahmad Turan Naimey; Jose A Navas-Molina; Louis Felix Nothias; Stephanie B Orchanian; Talima Pearson; Samuel L Peoples; Daniel Petras; Mary Lai Preuss; Elmar Pruesse; Lasse Buur Rasmussen; Adam Rivers; Michael S Robeson; Patrick Rosenthal; Nicola Segata; Michael Shaffer; Arron Shiffer; Rashmi Sinha; Se Jin Song; John R Spear; Austin D Swafford; Luke R Thompson; Pedro J Torres; Pauline Trinh; Anupriya Tripathi; Peter J Turnbaugh; Sabah Ul-Hasan; Justin J J van der Hooft; Fernando Vargas; Yoshiki Vázquez-Baeza; Emily Vogtmann; Max von Hippel; William Walters; Yunhu Wan; Mingxun Wang; Jonathan Warren; Kyle C Weber; Charles H D Williamson; Amy D Willis; Zhenjiang Zech Xu; Jesse R Zaneveld; Yilong Zhang; Qiyun Zhu; Rob Knight; J Gregory Caporaso
Journal:  Nat Biotechnol       Date:  2019-08       Impact factor: 54.908

4.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

5.  VSEARCH: a versatile open source tool for metagenomics.

Authors:  Torbjørn Rognes; Tomáš Flouri; Ben Nichols; Christopher Quince; Frédéric Mahé
Journal:  PeerJ       Date:  2016-10-18       Impact factor: 2.984

6.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.

Authors:  Enis Afgan; Dannon Baker; Bérénice Batut; Marius van den Beek; Dave Bouvier; Martin Cech; John Chilton; Dave Clements; Nate Coraor; Björn A Grüning; Aysam Guerler; Jennifer Hillman-Jackson; Saskia Hiltemann; Vahid Jalili; Helena Rasche; Nicola Soranzo; Jeremy Goecks; James Taylor; Anton Nekrutenko; Daniel Blankenberg
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

7.  Nextstrain: real-time tracking of pathogen evolution.

Authors:  James Hadfield; Colin Megill; Sidney M Bell; John Huddleston; Barney Potter; Charlton Callender; Pavel Sagulenko; Trevor Bedford; Richard A Neher
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.931

8.  Data, disease and diplomacy: GISAID's innovative contribution to global health.

Authors:  Stefan Elbe; Gemma Buckland-Merrett
Journal:  Glob Chall       Date:  2017-01-10

9.  GenBank.

Authors:  Dennis A Benson; Mark Cavanaugh; Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

  9 in total
  6 in total

1.  Genomic Epidemiology of SARS-CoV-2 in Tocantins State and the Diffusion of P.1.7 and AY.99.2 Lineages in Brazil.

Authors:  Ueric José Borges de Souza; Raíssa Nunes Dos Santos; Fernando Lucas de Melo; Aline Belmok; Jucimária Dantas Galvão; Tereza Cristina Vieira de Rezende; Franciano Dias Pereira Cardoso; Rogério Fernandes Carvalho; Monike da Silva Oliveira; Jose Carlos Ribeiro Junior; Evgeni Evgeniev Gabev; Ester Cerdeira Sabino; Clarice Weis Arns; Bergmann Morais Ribeiro; Fernando Rosado Spilki; Fabrício Souza Campos
Journal:  Viruses       Date:  2022-03-23       Impact factor: 5.818

2.  Comparative phylodynamics reveals the evolutionary history of SARS-CoV-2 emerging variants in the Arabian Peninsula.

Authors:  Moh A Alkhamis; Nicholas M Fountain-Jones; Mohammad M Khajah; Mohammad Alghounaim; Salman K Al-Sabah
Journal:  Virus Evol       Date:  2022-05-18

3.  Applying Genomic Epidemiology to Characterize a COVID-19 Outbreak in a Developmentally Disabled Adult Group Home Setting, Arizona.

Authors:  Hayley D Yaglom; Marette Gebhardt; Ashlyn Pfeiffer; Mary Ellen Ormsby; Daniel E Jasso-Selles; Darrin Lemmer; Megan L Folkerts; Chris French; Matthew Maurer; Jolene R Bowers; David M Engelthaler
Journal:  Front Public Health       Date:  2021-05-13

4.  Methods for sequencing the pandemic: benefits of rapid or high-throughput processing.

Authors:  Megan L Folkerts; Darrin Lemmer; Ashlyn Pfeiffer; Danielle Vasquez; Chris French; Amber Jones; Marjorie Nguyen; Brendan Larsen; W Tanner Porter; Krystal Sheridan; Jolene R Bowers; David M Engelthaler
Journal:  F1000Res       Date:  2021-01-26

5.  Population Genomics Approaches for Genetic Characterization of SARS-CoV-2 Lineages.

Authors:  Fatima Mostefai; Isabel Gamache; Arnaud N'Guessan; Justin Pelletier; Jessie Huang; Carmen Lia Murall; Ahmad Pesaranghader; Vanda Gaonac'h-Lovejoy; David J Hamelin; Raphaël Poujol; Jean-Christophe Grenier; Martin Smith; Etienne Caron; Morgan Craig; Guy Wolf; Smita Krishnaswamy; B Jesse Shapiro; Julie G Hussin
Journal:  Front Med (Lausanne)       Date:  2022-02-21

6.  Multiple expansions of globally uncommon SARS-CoV-2 lineages in Nigeria.

Authors:  Egon A Ozer; Lacy M Simons; Olubusuyi M Adewumi; Adeola A Fowotade; Ewean C Omoruyi; Johnson A Adeniji; Oluseyi A Olayinka; Taylor J Dean; Janet Zayas; Pavan P Bhimalli; Michelle K Ash; Almoustapha I Maiga; Anou M Somboro; Mamoudou Maiga; Adam Godzik; Jeffrey R Schneider; João I Mamede; Babafemi O Taiwo; Judd F Hultquist; Ramon Lorenzo-Redondo
Journal:  Nat Commun       Date:  2022-02-03       Impact factor: 14.919

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.