Literature DB >> 22174280

SEPP: SATé-enabled phylogenetic placement.

S Mirarab1, N Nguyen, T Warnow.   

Abstract

We address the problem of Phylogenetic Placement, in which the objective is to insert short molecular sequences (called query sequences) into an existing phylogenetic tree and alignment on full-length sequences for the same gene. Phylogenetic placement has the potential to provide information beyond pure "species identification" (i.e., the association of metagenomic reads to existing species), because it can also give information about the evolutionary relationships between these query sequences and to known species. Approaches for phylogenetic placement have been developed that operate in two steps: first, an alignment is estimated for each query sequence to the alignment of the full-length sequences, and then that alignment is used to find the optimal location in the phylogenetic tree for the query sequence. Recent methods of this type include HMMALIGN+EPA, HMMALIGN+pplacer, and PaPaRa+EPA.We report on a study evaluating phylogenetic placement methods on biological and simulated data. This study shows that these methods have extremely good accuracy and computational tractability under conditions where the input contains a highly accurate alignment and tree for the full-length sequences, and the set of full-length sequences is sufficiently small and not too evolutionarily diverse; however, we also show that under other conditions accuracy declines and the computational requirements for memory and time exceed acceptable limits. We present SEPP, a general "boosting" technique to improve the accuracy and/or speed of phylogenetic placement techniques. The key algorithmic aspect of this booster is a dataset decomposition technique in SATé, a method that utilizes an iterative divide-and-conquer technique to co-estimate alignments and trees on large molecular sequence datasets. We show that SATé-boosting improves HMMALIGN+pplacer, placing short sequences more accurately when the set of input sequences has a large evolutionary diameter and produces placements of comparable accuracy in a fraction of the time for easier cases. SEPP software and the datasets used in this study are all available for free at http://www.cs.utexas.edu/users/phylo/software/sepp/submission.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22174280     DOI: 10.1142/9789814366496_0024

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  57 in total

Review 1.  Analytical tools and databases for metagenomics in the next-generation sequencing era.

Authors:  Mincheol Kim; Ki-Hyun Lee; Seok-Whan Yoon; Bong-Soo Kim; Jongsik Chun; Hana Yi
Journal:  Genomics Inform       Date:  2013-09-30

2.  Large-scale multiple sequence alignment and tree estimation using SATé.

Authors:  Kevin Liu; Tandy Warnow
Journal:  Methods Mol Biol       Date:  2014

3.  Is there convergence of gut microbes in blood-feeding vertebrates?

Authors:  Se Jin Song; Jon G Sanders; Daniel T Baldassarre; Jaime A Chaves; Nicholas S Johnson; Antoinette J Piaggio; Matthew J Stuckey; Eva Nováková; Jessica L Metcalf; Bruno B Chomel; Alvaro Aguilar-Setién; Rob Knight; Valerie J McKenzie
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2019-06-03       Impact factor: 6.237

4.  Feasibility of collection and analysis of microbiome data in a longitudinal randomized trial of community gardening.

Authors:  Mireia Gascon; Kylie K Harrall; Alyssa W Beavers; Deborah H Glueck; Maggie A Stanislawski; Katherine Alaimo; Angel Villalobos; James R Hebert; Kelsey Dexter; Kaigang Li; Jill Litt
Journal:  Future Microbiol       Date:  2020-06-04       Impact factor: 3.165

5.  Adding unaligned sequences into an existing alignment using MAFFT and LAST.

Authors:  Kazutaka Katoh; Martin C Frith
Journal:  Bioinformatics       Date:  2012-09-27       Impact factor: 6.937

6.  Suppression of the gut microbiome ameliorates age-related arterial dysfunction and oxidative stress in mice.

Authors:  Vienna E Brunt; Rachel A Gioscia-Ryan; James J Richey; Melanie C Zigler; Lauren M Cuevas; Antonio Gonzalez; Yoshiki Vázquez-Baeza; Micah L Battson; Andrew T Smithson; Andrew D Gilley; Gail Ackermann; Andrew P Neilson; Tiffany Weir; Kevin P Davy; Rob Knight; Douglas R Seals
Journal:  J Physiol       Date:  2019-02-27       Impact factor: 5.182

7.  Differential Activation of Hepatic Invariant NKT Cell Subsets Plays a Key Role in Progression of Nonalcoholic Steatohepatitis.

Authors:  Igor Maricic; Idania Marrero; Akiko Eguchi; Ryota Nakamura; Casey D Johnson; Suryasarathi Dasgupta; Carolyn D Hernandez; Phirum Sam Nguyen; Austin D Swafford; Rob Knight; Ariel E Feldstein; Rohit Loomba; Vipin Kumar
Journal:  J Immunol       Date:  2018-10-15       Impact factor: 5.422

8.  Gut microbiota utilize immunoglobulin A for mucosal colonization.

Authors:  G P Donaldson; M S Ladinsky; K B Yu; J G Sanders; B B Yoo; W-C Chou; M E Conner; A M Earl; R Knight; P J Bjorkman; S K Mazmanian
Journal:  Science       Date:  2018-05-03       Impact factor: 47.728

9.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

10.  Phylogeny Estimation Given Sequence Length Heterogeneity.

Authors:  Vladimir Smirnov; Tandy Warnow
Journal:  Syst Biol       Date:  2021-02-10       Impact factor: 15.683

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.