Literature DB >> 15817690

Convergent Island Statistics: a fast method for determining local alignment score significance.

Aleksandar Poleksic1, Joseph F Danzer, Kevin Hambly, Derek A Debe.   

Abstract

MOTIVATION: Background distribution statistics for profile-based sequence alignment algorithms cannot be calculated analytically, and hence such algorithms must resort to measuring the significance of an alignment score by assessing its location among a distribution of background alignment scores. The Gumbel parameters that describe this background distribution are usually pre-computed for a limited number of scoring systems, gap schemes, and sequence lengths and compositions. The use of such look-ups is known to introduce errors, which compromise the significance assessment of a remote homology relationship. One solution is to estimate the background distribution for each pair of interest by generating a large number of sequence shuffles and use the distribution of their scores to approximate the parameters of the underlying extreme value distribution. This is computationally very expensive, as a large number of shuffles are needed to precisely estimate the score statistics.
RESULTS: Convergent Island Statistics (CIS) is a computationally efficient solution to the problem of calculating the Gumbel distribution parameters for an arbitrary pair of sequences and an arbitrary set of gap and scoring schemes. The basic idea behind our method is to recognize the lack of similarity for any pair of sequences early in the shuffling process and thus save on the search time. The method is particularly useful in the context of profile-profile alignment algorithms where the normalization of alignment scores has traditionally been a challenging task. CONTACT: aleksandar@eidogen.com SUPPLEMENTARY INFORMATION: http://www.eidogen-sertanty.com/Documents/convergent_island_stats_sup.pdf.

Mesh:

Year:  2005        PMID: 15817690     DOI: 10.1093/bioinformatics/bti433

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Ligand-binding pocket shape differences between sphingosine 1-phosphate (S1P) receptors S1P1 and S1P3 determine efficiency of chemical probe identification by ultrahigh-throughput screening.

Authors:  Stephan C Schürer; Steven J Brown; Pedro J Gonzalez-Cabrera; Marie-Therese Schaeffer; Jacqueline Chapman; Euijung Jo; Peter Chase; Tim Spicer; Peter Hodder; Hugh Rosen
Journal:  ACS Chem Biol       Date:  2008-07-01       Impact factor: 5.100

2.  Accelerating pairwise statistical significance estimation for local alignment by harvesting GPU's power.

Authors:  Yuhong Zhang; Sanchit Misra; Ankit Agrawal; Md Mostofa Ali Patwary; Wei-Keng Liao; Zhiguang Qin; Alok Choudhary
Journal:  BMC Bioinformatics       Date:  2012-04-12       Impact factor: 3.169

3.  Island method for estimating the statistical significance of profile-profile alignment scores.

Authors:  Aleksandar Poleksic
Journal:  BMC Bioinformatics       Date:  2009-04-20       Impact factor: 3.169

4.  Lovastatin lactone may improve irritable bowel syndrome with constipation (IBS-C) by inhibiting enzymes in the archaeal methanogenesis pathway.

Authors:  Steven M Muskal; Joe Sliman; John Kokai-Kun; Mark Pimentel; Vince Wacher; Klaus Gottlieb
Journal:  F1000Res       Date:  2016-04-08
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.