Literature DB >> 2075189

Tests for the statistical significance of protein sequence similarities in data-bank searches.

R F Mott1, T B Kirkwood, R N Curnow.   

Abstract

A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.

Mesh:

Substances:

Year:  1990        PMID: 2075189     DOI: 10.1093/protein/4.2.149

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  1 in total

1.  Conditioning on the number of bands in interpreting matches of multilocus DNA profiles.

Authors:  R N Curnow
Journal:  Genetica       Date:  1995       Impact factor: 1.082

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.