cognac: rapid generation of concatenated gene alignments for phylogenetic inference from large, bacterial whole genome sequencing datasets.

Ryan D Crawford, Evan S Snitkin.

Abstract

BACKGROUND: The quantity of genomic data is expanding at an increasing rate. Tools for phylogenetic analysis that scale to the quantity of available data are required. To address this need, we present cognac, a user-friendly software package to rapidly generate concatenated gene alignments for phylogenetic analysis.
RESULTS: We illustrate that cognac is able to rapidly identify phylogenetic marker genes using a data-driven approach and efficiently generate concatenated gene alignments for very large genomic datasets. To benchmark our tool, we generated core gene alignments for eight genera of bacteria, including a dataset of over 11,000 genomes from the genus Escherichia, which produced an alignment of 1353 genes and was constructed in less than 17 h.
CONCLUSIONS: We demonstrate that cognac presents an efficient method for generating concatenated gene alignments for phylogenetic analysis. We have released cognac as an R package (https://github.com/rdcrawford/cognac) with customizable parameters for adaptation to diverse applications.

Keywords:  Concatenated gene tree; Core genome; Multiple sequence alignment; Phylogenetics

Year:  2021        PMID: 33588753      PMCID: PMC7885345          DOI: 10.1186/s12859-021-03981-4

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


