Literature DB >> 11139604

The estimation of statistical parameters for local alignment score distributions.

S F Altschul1, R Bundschuh, R Olsen, T Hwa.   

Abstract

The distribution of optimal local alignment scores of random sequences plays a vital role in evaluating the statistical significance of sequence alignments. These scores can be well described by an extreme-value distribution. The distribution's parameters depend upon the scoring system employed and the random letter frequencies; in general they cannot be derived analytically, but must be estimated by curve fitting. For obtaining accurate parameter estimates, a form of the recently described 'island' method has several advantages. We describe this method in detail, and use it to investigate the functional dependence of these parameters on finite-length edge effects.

Mesh:

Year:  2001        PMID: 11139604      PMCID: PMC29669          DOI: 10.1093/nar/29.2.351

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  25 in total

1.  Accurate formula for P-values of gapped local sequence and profile alignments.

Authors:  R Mott
Journal:  J Mol Biol       Date:  2000-07-14       Impact factor: 5.469

2.  Distribution of glutamine and asparagine residues and their near neighbors in peptides and proteins.

Authors:  A B Robinson; L R Robinson
Journal:  Proc Natl Acad Sci U S A       Date:  1991-10-15       Impact factor: 11.205

3.  Empirical statistical estimates for sequence similarity searches.

Authors:  W R Pearson
Journal:  J Mol Biol       Date:  1998-02-13       Impact factor: 5.469

4.  Local alignment statistics.

Authors:  S F Altschul; W Gish
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

5.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

Authors:  S Karlin; S F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  1990-03       Impact factor: 11.205

6.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

7.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

8.  The significance of protein sequence similarities.

Authors:  J F Collins; A F Coulson; A Lyall
Journal:  Comput Appl Biosci       Date:  1988-03

9.  Identification of protein coding regions by database similarity search.

Authors:  W Gish; D J States
Journal:  Nat Genet       Date:  1993-03       Impact factor: 38.330

10.  An improved algorithm for matching biological sequences.

Authors:  O Gotoh
Journal:  J Mol Biol       Date:  1982-12-15       Impact factor: 5.469

View more
  48 in total

Review 1.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

2.  The compositional adjustment of amino acid substitution matrices.

Authors:  Yi-Kuo Yu; John C Wootton; Stephen F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-08       Impact factor: 11.205

3.  The CATH database: an extended protein family resource for structural and functional genomics.

Authors:  F M G Pearl; C F Bennett; J E Bray; A P Harrison; N Martin; A Shepherd; I Sillitoe; J Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

4.  Sequence similarities of protein kinase substrates and inhibitors with immunoglobulins and model immunoglobulin homologue: cell adhesion molecule from the living fossil sponge Geodia cydonium. Mapping of coherent database similarities and implications for evolution of CDR1 and hypermutation.

Authors:  J Kubrycht; J Borecký; P Soucek; P Jezek
Journal:  Folia Microbiol (Praha)       Date:  2004       Impact factor: 2.099

5.  Microbial community succession during lactate amendment and electron acceptor limitation reveals a predominance of metal-reducing Pelosinus spp.

Authors:  Jennifer J Mosher; Tommy J Phelps; Mircea Podar; Richard A Hurt; James H Campbell; Meghan M Drake; James G Moberly; Christopher W Schadt; Steven D Brown; Terry C Hazen; Adam P Arkin; Anthony V Palumbo; Boris A Faybishenko; Dwayne A Elias
Journal:  Appl Environ Microbiol       Date:  2012-01-20       Impact factor: 4.792

6.  PhyLAT: a phylogenetic local alignment tool.

Authors:  Hongtao Sun; Jeremy D Buhler
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

7.  Objective method for estimating asymptotic parameters, with an application to sequence alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-09-13

8.  ALP & FALP: C++ libraries for pairwise local alignment E-values.

Authors:  Sergey Sheetlin; Yonil Park; Martin C Frith; John L Spouge
Journal:  Bioinformatics       Date:  2015-10-01       Impact factor: 6.937

Review 9.  Protein database searches using compositionally adjusted substitution matrices.

Authors:  Stephen F Altschul; John C Wootton; E Michael Gertz; Richa Agarwala; Aleksandr Morgulis; Alejandro A Schäffer; Yi-Kuo Yu
Journal:  FEBS J       Date:  2005-10       Impact factor: 5.542

10.  Identifying the conserved network of cis-regulatory sites of a eukaryotic genome.

Authors:  Ting Wang; Gary D Stormo
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-21       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.