| Literature DB >> 23161676 |
Christian J A Sigrist1, Edouard de Castro, Lorenzo Cerutti, Béatrice A Cuche, Nicolas Hulo, Alan Bridge, Lydie Bougueleret, Ioannis Xenarios.
Abstract
PROSITE (http://prosite.expasy.org/) consists of documentation entries describing protein domains, families and functional sites, as well as associated patterns and profiles to identify them. It is complemented by ProRule a collection of rules, which increases the discriminatory power of these profiles and patterns by providing additional information about functionally and/or structurally critical amino acids. PROSITE signatures, together with ProRule, are used for the annotation of domains and features of UniProtKB/Swiss-Prot entries. Here, we describe recent developments that allow users to perform whole-proteome annotation as well as a number of filtering options that can be combined to perform powerful targeted searches for biological discovery. The latest version of PROSITE (release 20.85, of 30 August 2012) contains 1308 patterns, 1039 profiles and 1041 ProRules.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23161676 PMCID: PMC3531220 DOI: 10.1093/nar/gks1067
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Results of the ScanProsite search of the 16 569 predicted Solenopsis invicta proteins against the complete set of PROSITE patterns and profiles
| Patterns | Profiles | |
|---|---|---|
| Total number of PROSITE signature matches in all proteins | 4903 | 9664 |
| Number of distinct proteins matching PROSITE signatures | 2696 | 4349 |
| Number of distinct PROSITE signatures matched | 626 | 622 |
| Number of proteins annotated with one or more functional sites | 520 | 1693 |
| Total number of functional sites annotated | 744 | 7022 |
| Number of distinct PROSITE signatures providing annotation for functional sites | 74 | 148 |
| Total number of detected domains annotated with functional sites | 606 | 3397 |
aPattern hits are validated by automatically generated ‘miniprofiles’ that assign a status to pattern matches (8).
Figure 1.The use of logical operators in ScanProsite. The PROSITE profiles used are PS50122 (CHEB), PS50123 (CHER) and PS50110 (RESPONSE_REGULATORY). The matched architectures correspond to the following UniProtKB/Swiss-Prot entries: Q02998 (YH19_RHOCA), A1SMR4 (CHEB_NOCSJ), P31758 (FRZG_MYXXA), P31759 (FRZF_MYXXA) and A1VZQ6 (CHER_CAMJJ). Single-asterisk symbol denotes that ‘not’ has to be used with another operator (‘and’ or ‘or’). Double-asterisk symbol denotes that parentheses have to be preceded and followed by a space.