| Literature DB >> 11565901 |
Abstract
An improved multiple linear regression (MLR) method is proposed to predict a protein's secondary structural content based on its primary sequence. The amino acid composition, the autocorrelation function, and the interaction function of side-chain mass derived from the primary sequence are taken into account. The average absolute errors of prediction over 704 unrelated proteins with the jackknife test are 0.088, 0.081, and 0.059 with standard deviations 0.073, 0.066, and 0.055 for alpha-helix, beta-sheet, and coil, respectively. That the sum of predicted secondary structure content should be close to 1.0 was introduced as a criterion to evaluate whether the prediction is acceptable. While only the predictions with the sum of predicted secondary structure content between 0.99 and 1.01 are accepted (about 11% of all proteins), the absolute errors are 0.058 for alpha-helix, 0.054 for beta-sheet, and 0.045 for coil.Mesh:
Substances:
Year: 2001 PMID: 11565901 DOI: 10.1023/a:1010967008838
Source DB: PubMed Journal: J Protein Chem ISSN: 0277-8033