| Literature DB >> 21609959 |
H B Rao1, F Zhu, G B Yang, Z R Li, Y Z Chen.
Abstract
Sequence-derived structural and physicochemical features have been extensively used for analyzing and predicting structural, functional, expression and interaction profiles of proteins and peptides. PROFEAT has been developed as a web server for computing commonly used features of proteins and peptides from amino acid sequence. To facilitate more extensive studies of protein and peptides, numerous improvements and updates have been made to PROFEAT. We added new functions for computing descriptors of protein-protein and protein-small molecule interactions, segment descriptors for local properties of protein sequences, topological descriptors for peptide sequences and small molecule structures. We also added new feature groups for proteins and peptides (pseudo-amino acid composition, amphiphilic pseudo-amino acid composition, total amino acid properties and atomic-level topological descriptors) as well as for small molecules (atomic-level topological descriptors). Overall, PROFEAT computes 11 feature groups of descriptors for proteins and peptides, and a feature group of more than 400 descriptors for small molecules plus the derived features for protein-protein and protein-small molecule interactions. Our computational algorithms have been extensively tested and used in a number of published works for predicting proteins of specific structural or functional classes, protein-protein interactions, peptides of specific functions and quantitative structure activity relationships of small molecules. PROFEAT is accessible free of charge at http://bidd.cz3.nus.edu.sg/cgi-bin/prof/protein/profnew.cgi.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21609959 PMCID: PMC3125735 DOI: 10.1093/nar/gkr284
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PROFEAT new web page.
List of PROFEAT computed features for proteins, peptides and protein–protein interactions
| Feature group | Features | No. of descriptors | No. of descriptor values |
|---|---|---|---|
| Composition-1 | Amino acid composition | 1 | 20 |
| Composition-2 | Dipeptide composition | 1 | 400 |
| Autocorrelation 1 | Normalized Moreau–Broto autocorrelation | ||
| Autocorrelation 2 | Moran autocorrelation | ||
| Autocorrelation 3 | Geary autocorrelation | ||
| Composition, Transition, Distribution | Composition | 7 | 21 |
| Transition | 7 | 21 | |
| Distribution | 7 | 105 | |
| Quasi-sequence order descriptors | Sequence order coupling number | 2 | 90 |
| Quasi-sequence order descriptors | 2 | 150 | |
| PAAC | PAAC | ||
| APAAC | APAAC | ||
| Topological descriptors | Topological descriptors | 405 | |
| TAAPs | TAP |
aThe number depends on the choice of the number of properties of amino acid and the choice of the maximum values of the lag.
bThe number depends on the choice of the number of the set of amino acid properties and the choice of the λ value.
cThe number depends on the choice of the λ value.
dThe numbers depend on the choice of the number of properties of amino acid.