R F Mott, T B Kirkwood, R N Curnow.
Abstract
A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.
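As a rough illustration of the idea, and not the authors' tests or program, the sketch below counts shared k-letter words along each diagonal of two sequences, in the spirit of a word search, and then estimates significance empirically by shuffling one sequence, which preserves its length and composition. The function names, the word length `k=2`, the number of trials, and the shuffle-based p-value are all assumptions made for this example; the paper itself works with statistical distributions, which are not reproduced here.

```python
import random
from collections import defaultdict


def best_diagonal_word_count(a, b, k=2):
    """Largest number of shared k-letter words found on any single diagonal.

    A diagonal is identified by the offset i - j between the word's
    start position i in `a` and its start position j in `b`.
    """
    # Index every k-letter word of b by its start positions.
    positions = defaultdict(list)
    for j in range(len(b) - k + 1):
        positions[b[j:j + k]].append(j)

    # Tally word matches per diagonal.
    diagonals = defaultdict(int)
    for i in range(len(a) - k + 1):
        for j in positions.get(a[i:i + k], ()):
            diagonals[i - j] += 1

    return max(diagonals.values(), default=0)


def shuffle_p_value(a, b, k=2, trials=200, seed=0):
    """Empirical p-value for the observed score (hypothetical helper).

    Shuffling b keeps its length and amino acid composition fixed, so the
    null scores reflect those two properties. This is a Monte Carlo
    stand-in, not the analytical test described in the abstract.
    """
    rng = random.Random(seed)
    observed = best_diagonal_word_count(a, b, k)
    letters = list(b)
    hits = 0
    for _ in range(trials):
        rng.shuffle(letters)
        if best_diagonal_word_count(a, "".join(letters), k) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (trials + 1)
```

Calling `shuffle_p_value(seq_a, seq_b)` returns the observed best-diagonal score together with the estimated p-value. The sketch ignores match weighting and insertions/deletions, both of which the abstract says the published method can handle.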
Year: 1990 PMID: 2075189 DOI: 10.1093/protein/4.2.149
Source DB: PubMed Journal: Protein Eng ISSN: 0269-2139