Songyot Nakariyakul, Zhi-Ping Liu, Luonan Chen.
Abstract
Detecting thermophilic proteins is an important task for engineering proteins that remain stable at temperatures of interest. In this work, we develop a simple but efficient method to discriminate thermophilic proteins from mesophilic ones using amino acid and dipeptide compositions. Since most of the amino acid and dipeptide compositions are redundant, we propose a new forward floating selection technique to select only a useful subset of these compositions as features for support vector machine-based classification. We test the proposed method on a benchmark data set of 915 thermophilic and 793 mesophilic proteins. The results show that our method, using 28 amino acid and dipeptide compositions, achieves an accuracy of 93.3% as evaluated by the jackknife cross-validation test, which is higher than both the existing methods and the accuracy obtained using all amino acid and dipeptide compositions.
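As an illustration of the feature representation mentioned in the abstract, the following minimal sketch computes the 20 amino acid compositions and 400 dipeptide compositions of a sequence. It assumes the 20 standard residues and normalization by the number of valid residues and valid adjacent pairs; the abstract does not give the authors' exact preprocessing, and the names `composition_features`, `AMINO_ACIDS`, and `DIPEPTIDES` are hypothetical.

```python
from itertools import product

# Assumption: only the 20 standard amino acids are counted.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def composition_features(sequence):
    """Return 20 amino acid fractions followed by 400 dipeptide fractions."""
    seq = sequence.upper()

    # Count single residues, skipping non-standard symbols such as X or B.
    aa_counts = {aa: 0 for aa in AMINO_ACIDS}
    for residue in seq:
        if residue in aa_counts:
            aa_counts[residue] += 1

    # Count overlapping adjacent pairs made of two standard residues.
    dp_counts = {dp: 0 for dp in DIPEPTIDES}
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in dp_counts:
            dp_counts[pair] += 1

    n_aa = sum(aa_counts.values())
    n_dp = sum(dp_counts.values())

    # Normalize counts to fractions; an empty input yields all zeros.
    aa_part = [aa_counts[aa] / n_aa if n_aa else 0.0 for aa in AMINO_ACIDS]
    dp_part = [dp_counts[dp] / n_dp if n_dp else 0.0 for dp in DIPEPTIDES]
    return aa_part + dp_part


features = composition_features("ACAC")
print(len(features))  # 420 values: 20 amino acid + 400 dipeptide fractions
```

A feature selection step such as the one described above would then choose a small subset of these 420 values (28 in the reported result) as input to the classifier.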
Year: 2011 PMID: 21547362 DOI: 10.1007/s00726-011-0923-1
Source DB: PubMed Journal: Amino Acids ISSN: 0939-4451 Impact factor: 3.520