Literature DB >> 17118958

PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences.

Kazutaka Katoh1, Hiroyuki Toh.   

Abstract

MOTIVATION: To construct a multiple sequence alignment (MSA) of a large number (> approximately 10,000) of sequences, the calculation of a guide tree with a complexity of O(N2) to O(N3), where N is the number of sequences, is the most time-consuming process.
RESULTS: To overcome this limitation, we have developed an approximate algorithm, PartTree, to construct a guide tree with an average time complexity of O(N log N). The new MSA method with the PartTree algorithm can align approximately 60,000 sequences in several minutes on a standard desktop computer. The loss of accuracy in MSA caused by this approximation was estimated to be several percent in benchmark tests using Pfam. AVAILABILITY: The present algorithm has been implemented in the MAFFT sequence alignment package (http://align.bmr.kyushu-u.ac.jp/mafft/software/). SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.

Mesh:

Year:  2006        PMID: 17118958     DOI: 10.1093/bioinformatics/btl592

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  Large-scale multiple sequence alignment and tree estimation using SATé.

Authors:  Kevin Liu; Tandy Warnow
Journal:  Methods Mol Biol       Date:  2014

2.  FastTree 2--approximately maximum-likelihood trees for large alignments.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  PLoS One       Date:  2010-03-10       Impact factor: 3.240

3.  Multiple Sequence Alignment Computation Using the T-Coffee Regressive Algorithm Implementation.

Authors:  Edgar Garriga; Paolo Di Tommaso; Cedrik Magis; Ionas Erb; Leila Mansouri; Athanasios Baltzis; Evan Floden; Cedric Notredame
Journal:  Methods Mol Biol       Date:  2021

4.  Identification of a dual-targeted protein belonging to the mitochondrial carrier family that is required for early leaf development in rice.

Authors:  Jiming Xu; Jian Yang; Zhongchang Wu; Huili Liu; Fangliang Huang; Yunrong Wu; Chris Carrie; Reena Narsai; Monika Murcha; James Whelan; Ping Wu
Journal:  Plant Physiol       Date:  2013-02-14       Impact factor: 8.340

5.  Simple chained guide trees give high-quality protein multiple sequence alignments.

Authors:  Kieran Boyce; Fabian Sievers; Desmond G Higgins
Journal:  Proc Natl Acad Sci U S A       Date:  2014-07-07       Impact factor: 11.205

6.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

7.  Sequence embedding for fast construction of guide trees for multiple sequence alignment.

Authors:  Gordon Blackshields; Fabian Sievers; Weifeng Shi; Andreas Wilm; Desmond G Higgins
Journal:  Algorithms Mol Biol       Date:  2010-05-14       Impact factor: 1.405

8.  Evidence for a stable association of Psb30 (Ycf12) with photosystem II core complex in the cyanobacterium Synechocystis sp. PCC 6803.

Authors:  Natsuko Inoue-Kashino; Takeshi Takahashi; Akiko Ban; Miwa Sugiura; Yuichiro Takahashi; Kazuhiko Satoh; Yasuhiro Kashino
Journal:  Photosynth Res       Date:  2008-08-08       Impact factor: 3.573

9.  Characterization of the DraT/DraG system for posttranslational regulation of nitrogenase in the endophytic betaproteobacterium Azoarcus sp. strain BH72.

Authors:  Janina Oetjen; Barbara Reinhold-Hurek
Journal:  J Bacteriol       Date:  2009-04-03       Impact factor: 3.490

10.  Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features.

Authors:  Timo Lassmann; Oliver Frings; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2008-12-22       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.