Literature DB >> 10786304

Rapid assessment of extremal statistics for gapped local alignment.

R Olsen1, R Bundschuh, T Hwa.   

Abstract

The statistical significance of gapped local alignments is characterized by analyzing the extremal statistics of the scores obtained from the alignment of random amino acid sequences. By identifying a complete set of linked clusters, "islands," we devise a method which accurately predicts the extremal score statistics by using only one to a few pairwise alignments. The success of our method relies crucially on the link between the statistics of island scores and extremal score statistics. This link is motivated by heuristic arguments, and firmly established by extensive numerical simulations for a variety of scoring parameter settings and sequence lengths. Our approach is several orders of magnitude faster than the widely used shuffling method, since island counting is trivially incorporated into the basic Smith-Waterman alignment algorithm with minimal computational cost, and all islands are counted in a single alignment. The availability of a rapid and accurate significance estimation method gives one the flexibility to fine tune scoring parameters to detect weakly homologous sequences and obtain optimal alignment fidelity.

Mesh:

Year:  1999        PMID: 10786304

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  11 in total

1.  PhyLAT: a phylogenetic local alignment tool.

Authors:  Hongtao Sun; Jeremy D Buhler
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

2.  Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution.

Authors:  Gelio Alves; Yi-Kuo Yu
Journal:  Bioinformatics       Date:  2016-04-29       Impact factor: 6.937

3.  ESTIMATING THE GUMBEL SCALE PARAMETER FOR LOCAL ALIGNMENT OF RANDOM SEQUENCES BY IMPORTANCE SAMPLING WITH STOPPING TIMES.

Authors:  Yonil Park; Sergey Sheetlin; John L Spouge
Journal:  Ann Stat       Date:  2009-12-01       Impact factor: 4.028

4.  Accelerating pairwise statistical significance estimation for local alignment by harvesting GPU's power.

Authors:  Yuhong Zhang; Sanchit Misra; Ankit Agrawal; Md Mostofa Ali Patwary; Wei-Keng Liao; Zhiguang Qin; Alok Choudhary
Journal:  BMC Bioinformatics       Date:  2012-04-12       Impact factor: 3.169

5.  Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty.

Authors:  Ankit Agrawal; Xiaoqiu Huang
Journal:  BMC Bioinformatics       Date:  2009-03-19       Impact factor: 3.169

6.  Powerful fusion: PSI-BLAST and consensus sequences.

Authors:  Dariusz Przybylski; Burkhard Rost
Journal:  Bioinformatics       Date:  2008-08-04       Impact factor: 6.937

7.  The Gumbel pre-factor k for gapped local alignment can be estimated from simulations of global alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Nucleic Acids Res       Date:  2005-09-06       Impact factor: 16.971

8.  The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search.

Authors:  Jakob H Havgaard; Rune B Lyngsø; Jan Gorodkin
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

9.  Back-translation for discovering distant protein homologies in the presence of frameshift mutations.

Authors:  Marta Girdea; Laurent Noe; Gregory Kucherov
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405

10.  Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix.

Authors:  Jakob H Havgaard; Elfar Torarinsson; Jan Gorodkin
Journal:  PLoS Comput Biol       Date:  2007-08-20       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.