| Literature DB >> 11473020 |
M J Wise1.
Abstract
Low complexity proteins and protein domains have sequences which appear highly non-random. Over the years, these sequences have been routinely filtered out during sequence similarity searches because interest has been focused on globular proteins, and inclusion of these domains can severely skew search results. However, early work on these proteins and more recent studies of the related area of repeated protein sequences suggests that low complexity protein domains have function and therefore are in need of further investigation. 0j.py is a new tool for demarcating low complexity protein domains more accurately than has been possible to date. The paper describes 0j.py and its use in revealing proteins with repeated and poly-amino-acid peptides. Statistical methods are then employed to to examine the distribution of these proteins across species, while keyword clustering is used to suggest roles performed by proteins through the use of low complexity domains.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11473020 DOI: 10.1093/bioinformatics/17.suppl_1.s288
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937