| Literature DB >> 8819157 |
Abstract
We have implemented an iterative algorithm for the identification of diagnostic patterns from sets of multiple-domain proteins, where domains need not be common to all the proteins in the defining set. Our algorithm was applied to sequences gathered using a variety of methods, including BLAST, common keywords, and common E.C. numbers. In all cases, useful diagnostic patterns were obtained, possessing both high sensitivity and specificity. The patterns were found to correlate in several cases with both functional and structural domains. Patterns generated from a large number of sequence families were analyzed for probable multiple-domain structure.Mesh:
Substances:
Year: 1996 PMID: 8819157 PMCID: PMC2143450 DOI: 10.1002/pro.5560050703
Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725