Literature DB >> 21787282

Predicting thermophilic proteins with pseudo amino acid composition:approached from chaos game representation and principal component analysis.

Xiao-Lei Liu1, Jin-Long Lu, Xue-Hai Hu.   

Abstract

Comprehensive knowledge of thermophilic mechanisms about some organisms whose optimum growth temperature (OGT) ranges from 50 to 80 °C degree plays a major role for helping to design stable proteins. How to predict function-unknown proteins to be thermophilic is a long but not fairly resolved problem. Chaos game representation (CGR) can investigate hidden patterns in protein sequences, and also can visually reveal their previously unknown structures. In this paper, using the general form of pseudo amino acid composition to represent protein samples, we proposed a novel method for presenting protein sequence to a CGR picture using CGR algorithm. A 24-dimensional vector extracted from these CGR segments and the first two PCA features are used to classify thermophilic and mesophilic proteins by Support Vector Machine (SVM). Our method is evaluated by the jackknife test. For the 24-dimensional vector, the accuracy is 0.8792 and Matthews Correlation Coefficient (MCC) is 0.7587. The 26-dimensional vector by hybridizing with PCA components performs highly satisfaction, in which the accuracy achieves 0.9944 and MCC achieves 0.9888. The results show the effectiveness of the new hybrid method.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21787282     DOI: 10.2174/092986611797642661

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  4 in total

1.  A Method for Prediction of Thermophilic Protein Based on Reduced Amino Acids and Mixed Features.

Authors:  Changli Feng; Zhaogui Ma; Deyun Yang; Xin Li; Jun Zhang; Yanjuan Li
Journal:  Front Bioeng Biotechnol       Date:  2020-05-05

2.  Prediction of RNA-protein interactions using conjoint triad feature and chaos game representation.

Authors:  Hongchu Wang; Pengfei Wu
Journal:  Bioengineered       Date:  2018       Impact factor: 3.269

3.  Accurate prediction of nuclear receptors with conjoint triad feature.

Authors:  Hongchu Wang; Xuehai Hu
Journal:  BMC Bioinformatics       Date:  2015-12-03       Impact factor: 3.169

4.  iThermo: A Sequence-Based Model for Identifying Thermophilic Proteins Using a Multi-Feature Fusion Strategy.

Authors:  Zahoor Ahmed; Hasan Zulfiqar; Abdullah Aman Khan; Ijaz Gul; Fu-Ying Dao; Zhao-Yue Zhang; Xiao-Long Yu; Lixia Tang
Journal:  Front Microbiol       Date:  2022-02-22       Impact factor: 5.640

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.