Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.

Literature DB >> 18726140

Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.

Zhan-Chao Li¹, Xi-Bin Zhou, Zong Dai, Xiao-Yong Zou.

Abstract

A prior knowledge of protein structural classes can provide useful information about its overall structure, so it is very important for quick and accurate determination of protein structural class with computation method in protein science. One of the key for computation method is accurate protein sample representation. Here, based on the concept of Chou's pseudo-amino acid composition (AAC, Chou, Proteins: structure, function, and genetics, 43:246-255, 2001), a novel method of feature extraction that combined continuous wavelet transform (CWT) with principal component analysis (PCA) was introduced for the prediction of protein structural classes. Firstly, the digital signal was obtained by mapping each amino acid according to various physicochemical properties. Secondly, CWT was utilized to extract new feature vector based on wavelet power spectrum (WPS), which contains more abundant information of sequence order in frequency domain and time domain, and PCA was then used to reorganize the feature vector to decrease information redundancy and computational complexity. Finally, a pseudo-amino acid composition feature vector was further formed to represent primary sequence by coupling AAC vector with a set of new feature vector of WPS in an orthogonal space by PCA. As a showcase, the rigorous jackknife cross-validation test was performed on the working datasets. The results indicated that prediction quality has been improved, and the current approach of protein representation may serve as a useful complementary vehicle in classifying other attributes of proteins, such as enzyme family class, subcellular localization, membrane protein types and protein secondary structure, etc.

Entities: Chemical Gene

Mesh：

Substances：
Amino Acids
Proteins

Year: 2008 PMID： 18726140 DOI： 10.1007/s00726-008-0170-2

Source DB: PubMed Journal: Amino Acids ISSN： 0939-4451 Impact factor: 3.520

Keyword Cloud
Cited

17 in total

Review 1. Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors: Kuo-Chen Chou
Journal: Mol Genet Genomics Date: 2020-01-01 Impact factor: 3.291

2. Prediction of antimicrobial peptides based on sequence alignment and feature selection methods.

Authors: Ping Wang; Lele Hu; Guiyou Liu; Nan Jiang; Xiaoyun Chen; Jianyong Xu; Wen Zheng; Li Li; Ming Tan; Zugen Chen; Hui Song; Yu-Dong Cai; Kuo-Chen Chou
Journal: PLoS One Date: 2011-04-13 Impact factor: 3.240

3. A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.

Authors: Xiao Wang; Guo-Zheng Li
Journal: PLoS One Date: 2012-05-22 Impact factor: 3.240

4. Accurate prediction of protein structural class.

Authors: Xia-Yu Xia; Meng Ge; Zhi-Xin Wang; Xian-Ming Pan
Journal: PLoS One Date: 2012-06-19 Impact factor: 3.240

5. Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses.

Authors: Marie Lisandra Zepeda Mendoza; Thomas Sicheritz-Pontén; M Thomas P Gilbert
Journal: Brief Bioinform Date: 2015-02-11 Impact factor: 11.622

6. Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.

Authors: Yunyun Liang; Sanyang Liu; Shengli Zhang
Journal: Comput Math Methods Med Date: 2015-12-15 Impact factor: 2.238

7. PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

Authors: Liqi Li; Xiang Cui; Sanjiu Yu; Yuan Zhang; Zhong Luo; Hua Yang; Yue Zhou; Xiaoqi Zheng
Journal: PLoS One Date: 2014-03-27 Impact factor: 3.240

8. Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

Authors: Marcin J Mizianty; Lukasz Kurgan
Journal: BMC Bioinformatics Date: 2009-12-13 Impact factor: 3.169

9. PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.

Authors: Pufeng Du; Shuwang Gu; Yasen Jiao
Journal: Int J Mol Sci Date: 2014-02-26 Impact factor: 5.923

10. iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.

Authors: Wei Chen; Peng-Mian Feng; Hao Lin; Kuo-Chen Chou
Journal: Biomed Res Int Date: 2014-05-21 Impact factor: 3.411