Literature DB >> 16954142

Probalign: multiple sequence alignment using partition function posterior probabilities.

Usman Roshan1, Dennis R Livesay.   

Abstract

MOTIVATION: The maximum expected accuracy optimization criterion for multiple sequence alignment uses pairwise posterior probabilities of residues to align sequences. The partition function methodology is one way of estimating these probabilities. Here, we combine these two ideas for the first time to construct maximal expected accuracy sequence alignments.
RESULTS: We bridge the two techniques within the program Probalign. Our results indicate that Probalign alignments are generally more accurate than other leading multiple sequence alignment methods (i.e. Probcons, MAFFT and MUSCLE) on the BAliBASE 3.0 protein alignment benchmark. Similarly, Probalign also outperforms these methods on the HOMSTRAD and OXBENCH benchmarks. Probalign ranks statistically highest (P-value < 0.005) on all three benchmarks. Deeper scrutiny of the technique indicates that the improvements are largest on datasets containing N/C-terminal extensions and on datasets containing long and heterogeneous length proteins. These points are demonstrated on both real and simulated data. Finally, our method also produces accurate alignments on long and heterogeneous length datasets containing protein repeats. Here, alignment accuracy scores are at least 10% and 15% higher than the other three methods when standard deviation of length is >300 and 400, respectively. AVAILABILITY: Open source code implementing Probalign as well as for producing the simulated data, and all real and simulated data are freely available from http://www.cs.njit.edu/usman/probalign

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16954142     DOI: 10.1093/bioinformatics/btl472

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  76 in total

1.  MTRAP: pairwise sequence alignment algorithm by a new measure based on transition probability between two consecutive pairs of residues.

Authors:  Toshihide Hara; Keiko Sato; Masanori Ohya
Journal:  BMC Bioinformatics       Date:  2010-05-08       Impact factor: 3.169

2.  MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts.

Authors:  Xin Deng; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2011-12-14       Impact factor: 3.169

Review 3.  A classification of bioinformatics algorithms from the viewpoint of maximizing expected accuracy (MEA).

Authors:  Michiaki Hamada; Kiyoshi Asai
Journal:  J Comput Biol       Date:  2012-02-07       Impact factor: 1.479

4.  LocARNA-P: accurate boundary prediction and improved detection of structural RNAs.

Authors:  Sebastian Will; Tejal Joshi; Ivo L Hofacker; Peter F Stadler; Rolf Backofen
Journal:  RNA       Date:  2012-03-26       Impact factor: 4.942

5.  Comparative genomic analysis of the genus Staphylococcus including Staphylococcus aureus and its newly described sister species Staphylococcus simiae.

Authors:  Haruo Suzuki; Tristan Lefébure; Paulina Pavinski Bitar; Michael J Stanhope
Journal:  BMC Genomics       Date:  2012-01-24       Impact factor: 3.969

6.  Uncertainty in homology inferences: assessing and improving genomic sequence alignment.

Authors:  Gerton Lunter; Andrea Rocco; Naila Mimouni; Andreas Heger; Alexandre Caldeira; Jotun Hein
Journal:  Genome Res       Date:  2007-12-11       Impact factor: 9.043

7.  PSAR-align: improving multiple sequence alignment using probabilistic sampling.

Authors:  Jaebum Kim; Jian Ma
Journal:  Bioinformatics       Date:  2013-11-12       Impact factor: 6.937

8.  PLAST-ncRNA: Partition function Local Alignment Search Tool for non-coding RNA sequences.

Authors:  Satish Chikkagoudar; Dennis R Livesay; Usman Roshan
Journal:  Nucleic Acids Res       Date:  2010-06-03       Impact factor: 16.971

9.  Improving accuracy of multiple sequence alignment algorithms based on alignment of neighboring residues.

Authors:  Yue Lu; Sing-Hoi Sze
Journal:  Nucleic Acids Res       Date:  2008-12-04       Impact factor: 16.971

10.  Quality measures for protein alignment benchmarks.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2010-01-04       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.