BlastFrost: fast querying of 100,000s of bacterial genomes in Bifrost graphs.

Nina Luhmann, Guillaume Holley, Mark Achtman.

Abstract

BlastFrost is a highly efficient method for querying 100,000s of genome assemblies, building on Bifrost, a dynamic data structure for compacted and colored de Bruijn graphs. BlastFrost queries a Bifrost data structure for sequences of interest and extracts local subgraphs, enabling the identification of the presence or absence of individual genes or single nucleotide sequence variants. We show two examples using Salmonella genomes: finding within minutes the presence of genes in the SPI-2 pathogenicity island in a collection of 926 genomes and identifying single nucleotide polymorphisms associated with fluoroquinolone resistance in three genes among 190,209 genomes. BlastFrost is available at https://github.com/nluhmann/BlastFrost/tree/master/data.
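
The idea of testing gene presence through a colored k-mer index can be illustrated with a small sketch. This is not BlastFrost's implementation or the Bifrost API: it uses a plain Python dictionary in place of a compacted de Bruijn graph, and the function names, the toy genomes, the value of k, and the `min_fraction` threshold are all assumptions made for illustration. Each k-mer is mapped to the set of genomes ("colors") that contain it, and a query is called present in a genome when a sufficient fraction of its k-mers carry that color.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmers(seq, k):
    """Yield the canonical k-mers of seq from left to right."""
    for i in range(len(seq) - k + 1):
        yield canonical(seq[i:i + k])


def build_colored_index(genomes, k):
    """Map each canonical k-mer to the set of genome names containing it."""
    index = defaultdict(set)
    for name, seq in genomes.items():
        for kmer in kmers(seq, k):
            index[kmer].add(name)
    return index


def query_presence(index, genome_names, query, k, min_fraction=1.0):
    """Report, per genome, whether enough query k-mers carry its color."""
    query_kmers = list(kmers(query, k))
    if not query_kmers:
        raise ValueError("query must be at least k bases long")
    hits = {name: 0 for name in genome_names}
    for kmer in query_kmers:
        # .get avoids creating empty entries in the defaultdict
        for name in index.get(kmer, ()):
            hits[name] += 1
    return {
        name: count / len(query_kmers) >= min_fraction
        for name, count in hits.items()
    }


# Hypothetical toy data: g2 differs from g1 by a single base.
toy_genomes = {
    "g1": "ACGTACGTTGCA",
    "g2": "ACGTACCTTGCA",
    "g3": "TTTTTTTTTTTT",
}
toy_index = build_colored_index(toy_genomes, k=5)
toy_result = query_presence(toy_index, toy_genomes, "GTACGTTG", k=5)
print(toy_result)
```

With the strict default threshold, only the genome containing every query k-mer is reported; the single-base difference in g2 removes some of the query k-mers, which is the kind of signal a variant search would build on. Lowering `min_fraction` trades that sensitivity for tolerance to mismatches.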

Year:  2021        PMID: 33430919      PMCID: PMC7798312          DOI: 10.1186/s13059-020-02237-3

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  34 in total

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  Search and clustering orders of magnitude faster than BLAST.

Authors:  Robert C Edgar
Journal:  Bioinformatics       Date:  2010-08-12       Impact factor: 6.937

Review 3.  The microbial pan-genome.

Authors:  Duccio Medini; Claudio Donati; Hervé Tettelin; Vega Masignani; Rino Rappuoli
Journal:  Curr Opin Genet Dev       Date:  2005-09-26       Impact factor: 5.578

4.  GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.

Authors:  Zhemin Zhou; Nabil-Fareed Alikhan; Martin J Sergeant; Nina Luhmann; Cátia Vaz; Alexandre P Francisco; João André Carriço; Mark Achtman
Journal:  Genome Res       Date:  2018-07-26       Impact factor: 9.043

5.  Phylogenetic modeling of lateral gene transfer reconstructs the pattern and relative timing of speciations.

Authors:  Gergely J Szöllosi; Bastien Boussau; Sophie S Abby; Eric Tannier; Vincent Daubin
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-04       Impact factor: 11.205

Review 6.  Salmonella pathogenicity island 2.

Authors:  M Hensel
Journal:  Mol Microbiol       Date:  2000-06       Impact factor: 3.501

7.  Multilocus sequence typing as a replacement for serotyping in Salmonella enterica.

Authors:  Mark Achtman; John Wain; François-Xavier Weill; Satheesh Nair; Zhemin Zhou; Vartul Sangal; Mary G Krauland; James L Hale; Heather Harbottle; Alexandra Uesbeck; Gordon Dougan; Lee H Harrison; Sylvain Brisse
Journal:  PLoS Pathog       Date:  2012-06-21       Impact factor: 6.823

8.  CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database.

Authors:  Baofeng Jia; Amogelang R Raphenya; Brian Alcock; Nicholas Waglechner; Peiyao Guo; Kara K Tsang; Briony A Lago; Biren M Dave; Sheldon Pereira; Arjun N Sharma; Sachin Doshi; Mélanie Courtot; Raymond Lo; Laura E Williams; Jonathan G Frye; Tariq Elsayegh; Daim Sardar; Erin L Westman; Andrew C Pawlowski; Timothy A Johnson; Fiona S L Brinkman; Gerard D Wright; Andrew G McArthur
Journal:  Nucleic Acids Res       Date:  2016-10-26       Impact factor: 16.971

9.  Bifrost: highly parallel construction and indexing of colored and compacted de Bruijn graphs.

Authors:  Guillaume Holley; Páll Melsted
Journal:  Genome Biol       Date:  2020-09-17       Impact factor: 13.583

Review 10.  Data structures based on k-mers for querying large collections of sequencing data sets.

Authors:  Camille Marchet; Christina Boucher; Simon J Puglisi; Paul Medvedev; Mikaël Salson; Rayan Chikhi
Journal:  Genome Res       Date:  2020-12-16       Impact factor: 9.043

  6 in total

1.  Lossless indexing with counting de Bruijn graphs.

Authors:  Mikhail Karasikov; Harun Mustafa; Gunnar Rätsch; André Kahles
Journal:  Genome Res       Date:  2022-05-24       Impact factor: 9.438

Review 2.  Methods and Developments in Graphical Pangenomics.

Authors:  Joseph Outten; Andrew Warren
Journal:  J Indian Inst Sci       Date:  2021-08-24

Review 3.  Population-scale genotyping of structural variation in the era of long-read sequencing.

Authors:  Cheng Quan; Hao Lu; Yiming Lu; Gangqiao Zhou
Journal:  Comput Struct Biotechnol J       Date:  2022-05-27       Impact factor: 6.155

Review 4.  Data structures based on k-mers for querying large collections of sequencing data sets.

Authors:  Camille Marchet; Christina Boucher; Simon J Puglisi; Paul Medvedev; Mikaël Salson; Rayan Chikhi
Journal:  Genome Res       Date:  2020-12-16       Impact factor: 9.043

5.  Role of mobile genetic elements in the global dissemination of the carbapenem resistance gene blaNDM.

Authors:  Mislav Acman; Ruobing Wang; Lucy van Dorp; Liam P Shaw; Qi Wang; Nina Luhmann; Yuyao Yin; Shijun Sun; Hongbin Chen; Hui Wang; Francois Balloux
Journal:  Nat Commun       Date:  2022-03-03       Impact factor: 14.919

6.  EnteroBase: hierarchical clustering of 100 000s of bacterial genomes into species/subspecies and populations.

Authors:  Mark Achtman; Zhemin Zhou; Jane Charlesworth; Laura Baxter
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671
