Literature DB >> 31508216

Large-scale sequence comparisons with sourmash.

N Tessa Pierce1, Luiz Irber1, Taylor Reiter1, Phillip Brooks1, C Titus Brown1.   

Abstract

The sourmash software package uses MinHash-based sketching to create "signatures", compressed representations of DNA, RNA, and protein sequences, that can be stored, searched, explored, and taxonomically annotated. sourmash signatures can be used to estimate sequence similarity between very large data sets quickly and in low memory, and can be used to search large databases of genomes for matches to query genomes and metagenomes. sourmash is implemented in C++, Rust, and Python, and is freely available under the BSD license at http://github.com/dib-lab/sourmash.

Entities:  

Keywords:  MinHash; bioinformatics; k-mer; sequence analysis; sourmash

Mesh:

Year:  2019        PMID: 31508216      PMCID: PMC6720031          DOI: 10.12688/f1000research.19675.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  13 in total

1.  Evolutionary implications of microbial genome tetranucleotide frequency biases.

Authors:  David T Pride; Richard J Meinersmann; Trudy M Wassenaar; Martin J Blaser
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

2.  Fast search of thousands of short-read sequencing experiments.

Authors:  Brad Solomon; Carl Kingsford
Journal:  Nat Biotechnol       Date:  2016-02-08       Impact factor: 54.908

3.  Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities.

Authors:  Migun Shakya; Christopher Quince; James H Campbell; Zamin K Yang; Christopher W Schadt; Mircea Podar
Journal:  Environ Microbiol       Date:  2013-02-06       Impact factor: 5.491

4.  Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes.

Authors:  Mads Albertsen; Philip Hugenholtz; Adam Skarshewski; Kåre L Nielsen; Gene W Tyson; Per H Nielsen
Journal:  Nat Biotechnol       Date:  2013-05-26       Impact factor: 54.908

5.  How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?

Authors:  Nicholas J Schurch; Pietá Schofield; Marek Gierliński; Christian Cole; Alexander Sherstnev; Vijender Singh; Nicola Wrobel; Karim Gharbi; Gordon G Simpson; Tom Owen-Hughes; Mark Blaxter; Geoffrey J Barton
Journal:  RNA       Date:  2016-03-28       Impact factor: 4.942

6.  Mash: fast genome and metagenome distance estimation using MinHash.

Authors:  Brian D Ondov; Todd J Treangen; Páll Melsted; Adam B Mallonee; Nicholas H Bergman; Sergey Koren; Adam M Phillippy
Journal:  Genome Biol       Date:  2016-06-20       Impact factor: 13.583

7.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

8.  The khmer software package: enabling efficient nucleotide sequence analysis.

Authors:  Michael R Crusoe; Hussien F Alameldin; Sherine Awad; Elmar Boucher; Adam Caldwell; Reed Cartwright; Amanda Charbonneau; Bede Constantinides; Greg Edvenson; Scott Fay; Jacob Fenton; Thomas Fenzl; Jordan Fish; Leonor Garcia-Gutierrez; Phillip Garland; Jonathan Gluck; Iván González; Sarah Guermond; Jiarong Guo; Aditi Gupta; Joshua R Herr; Adina Howe; Alex Hyer; Andreas Härpfer; Luiz Irber; Rhys Kidd; David Lin; Justin Lippi; Tamer Mansour; Pamela McA'Nulty; Eric McDonald; Jessica Mizzi; Kevin D Murray; Joshua R Nahum; Kaben Nanlohy; Alexander Johan Nederbragt; Humberto Ortiz-Zuazaga; Jeramia Ory; Jason Pell; Charles Pepe-Ranney; Zachary N Russ; Erich Schwarz; Camille Scott; Josiah Seaman; Scott Sievert; Jared Simpson; Connor T Skennerton; James Spencer; Ramakrishnan Srinivasan; Daniel Standage; James A Stapleton; Susan R Steinman; Joe Stein; Benjamin Taylor; Will Trimble; Heather L Wiencko; Michael Wright; Brian Wyss; Qingpeng Zhang; En Zyme; C Titus Brown
Journal:  F1000Res       Date:  2015-09-25

9.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

Review 10.  A survey of best practices for RNA-seq data analysis.

Authors:  Ana Conesa; Pedro Madrigal; Sonia Tarazona; David Gomez-Cabrero; Alejandra Cervera; Andrew McPherson; Michał Wojciech Szcześniak; Daniel J Gaffney; Laura L Elo; Xuegong Zhang; Ali Mortazavi
Journal:  Genome Biol       Date:  2016-01-26       Impact factor: 13.583

View more
  30 in total

1.  Critical Assessment of Metagenome Interpretation: the second round of challenges.

Authors:  Fernando Meyer; Adrian Fritz; Zhi-Luo Deng; David Koslicki; Till Robin Lesker; Alexey Gurevich; Gary Robertson; Mohammed Alser; Dmitry Antipov; Francesco Beghini; Denis Bertrand; Jaqueline J Brito; C Titus Brown; Jan Buchmann; Aydin Buluç; Bo Chen; Rayan Chikhi; Philip T L C Clausen; Alexandru Cristian; Piotr Wojciech Dabrowski; Aaron E Darling; Rob Egan; Eleazar Eskin; Evangelos Georganas; Eugene Goltsman; Melissa A Gray; Lars Hestbjerg Hansen; Steven Hofmeyr; Pingqin Huang; Luiz Irber; Huijue Jia; Tue Sparholt Jørgensen; Silas D Kieser; Terje Klemetsen; Axel Kola; Mikhail Kolmogorov; Anton Korobeynikov; Jason Kwan; Nathan LaPierre; Claire Lemaitre; Chenhao Li; Antoine Limasset; Fabio Malcher-Miranda; Serghei Mangul; Vanessa R Marcelino; Camille Marchet; Pierre Marijon; Dmitry Meleshko; Daniel R Mende; Alessio Milanese; Niranjan Nagarajan; Jakob Nissen; Sergey Nurk; Leonid Oliker; Lucas Paoli; Pierre Peterlongo; Vitor C Piro; Jacob S Porter; Simon Rasmussen; Evan R Rees; Knut Reinert; Bernhard Renard; Espen Mikal Robertsen; Gail L Rosen; Hans-Joachim Ruscheweyh; Varuni Sarwal; Nicola Segata; Enrico Seiler; Lizhen Shi; Fengzhu Sun; Shinichi Sunagawa; Søren Johannes Sørensen; Ashleigh Thomas; Chengxuan Tong; Mirko Trajkovski; Julien Tremblay; Gherman Uritskiy; Riccardo Vicedomini; Zhengyang Wang; Ziye Wang; Zhong Wang; Andrew Warren; Nils Peder Willassen; Katherine Yelick; Ronghui You; Georg Zeller; Zhengqiao Zhao; Shanfeng Zhu; Jie Zhu; Ruben Garrido-Oter; Petra Gastmeier; Stephane Hacquard; Susanne Häußler; Ariane Khaledi; Friederike Maechler; Fantin Mesny; Simona Radutoiu; Paul Schulze-Lefert; Nathiana Smit; Till Strowig; Andreas Bremges; Alexander Sczyrba; Alice Carolyn McHardy
Journal:  Nat Methods       Date:  2022-04-08       Impact factor: 28.547

2.  CMash: fast, multi-resolution estimation of k-mer-based Jaccard and containment indices.

Authors:  Shaopeng Liu; David Koslicki
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

3.  The minimizer Jaccard estimator is biased and inconsistent.

Authors:  Mahdi Belbasi; Antonio Blanca; Robert S Harris; David Koslicki; Paul Medvedev
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

4.  Highly Specialized Carbohydrate Metabolism Capability in Bifidobacterium Strains Associated with Intestinal Barrier Maturation in Early Preterm Infants.

Authors:  Bing Ma; Sripriya Sundararajan; Gita Nadimpalli; Michael France; Elias McComb; Lindsay Rutt; Jose M Lemme-Dumit; Elise Janofsky; Lisa S Roskes; Pawel Gajer; Li Fu; Hongqiu Yang; Mike Humphrys; Luke J Tallon; Lisa Sadzewicz; Marcela F Pasetti; Jacques Ravel; Rose M Viscardi
Journal:  mBio       Date:  2022-06-13       Impact factor: 7.786

5.  To Petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics.

Authors:  R A Leo Elworth; Qi Wang; Pavan K Kota; C J Barberan; Benjamin Coleman; Advait Balaji; Gaurav Gupta; Richard G Baraniuk; Anshumali Shrivastava; Todd J Treangen
Journal:  Nucleic Acids Res       Date:  2020-06-04       Impact factor: 16.971

6.  Minimizer-space de Bruijn graphs: Whole-genome assembly of long reads in minutes on a personal computer.

Authors:  Barış Ekim; Bonnie Berger; Rayan Chikhi
Journal:  Cell Syst       Date:  2021-09-14       Impact factor: 10.304

Review 7.  Metagenomics: a path to understanding the gut microbiome.

Authors:  Sandi Yen; Jethro S Johnson
Journal:  Mamm Genome       Date:  2021-07-14       Impact factor: 2.957

8.  LINbase: a web server for genome-based identification of prokaryotes as members of crowdsourced taxa.

Authors:  Long Tian; Chengjie Huang; Reza Mazloom; Lenwood S Heath; Boris A Vinatzer
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

9.  Viromes outperform total metagenomes in revealing the spatiotemporal patterns of agricultural soil viral communities.

Authors:  Christian Santos-Medellin; Laura A Zinke; Anneliek M Ter Horst; Danielle L Gelardi; Sanjai J Parikh; Joanne B Emerson
Journal:  ISME J       Date:  2021-02-21       Impact factor: 10.302

10.  Cartography of opportunistic pathogens and antibiotic resistance genes in a tertiary hospital environment.

Authors:  Kern Rei Chng; Chenhao Li; Denis Bertrand; Amanda Hui Qi Ng; Junmei Samantha Kwah; Hwee Meng Low; Chengxuan Tong; Maanasa Natrajan; Michael Hongjie Zhang; Licheng Xu; Karrie Kwan Ki Ko; Eliza Xin Pei Ho; Tamar V Av-Shalom; Jeanette Woon Pei Teo; Chiea Chuen Khor; Swaine L Chen; Christopher E Mason; Oon Tek Ng; Kalisvar Marimuthu; Brenda Ang; Niranjan Nagarajan
Journal:  Nat Med       Date:  2020-06-08       Impact factor: 53.440

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.