
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.

Zhan-Chao Li, Xi-Bin Zhou, Zong Dai, Xiao-Yong Zou.

Abstract

Prior knowledge of a protein's structural class can provide useful information about its overall structure, so quick and accurate computational determination of protein structural class is very important in protein science. One key to any computational method is an accurate representation of the protein sample. Here, based on the concept of Chou's pseudo-amino acid composition (AAC, Chou, Proteins: Structure, Function, and Genetics, 43:246-255, 2001), a novel feature extraction method that combined continuous wavelet transform (CWT) with principal component analysis (PCA) was introduced for the prediction of protein structural classes. First, a digital signal was obtained by mapping each amino acid to its values for various physicochemical properties. Second, CWT was used to extract a new feature vector based on the wavelet power spectrum (WPS), which carries richer sequence-order information in both the frequency domain and the time domain, and PCA was then used to reorganize the feature vector to reduce information redundancy and computational complexity. Finally, a pseudo-amino acid composition feature vector was formed to represent the primary sequence by coupling the AAC vector with the set of new WPS feature vectors projected into an orthogonal space by PCA. As a showcase, the rigorous jackknife cross-validation test was performed on the working datasets. The results indicated that prediction quality was improved, and the current approach to protein representation may serve as a useful complementary vehicle for classifying other attributes of proteins, such as enzyme family class, subcellular localization, membrane protein type, and protein secondary structure.
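
The feature-extraction steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: the Kyte-Doolittle hydropathy index as the single physicochemical property, a Mexican-hat (Ricker) mother wavelet, the scales 1, 2, 4 and 8, and the function names. The PCA step is omitted because it has to be fitted across the wavelet features of a whole dataset rather than a single sequence.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index (assumed example property; the paper
# uses "various physicochemical properties" without naming them here).
KD_HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def amino_acid_composition(seq):
    """Return the occurrence frequencies of the 20 standard residues."""
    return [seq.count(aa) / len(seq) for aa in AMINO_ACIDS]


def to_signal(seq, scale=KD_HYDROPATHY):
    """Map each residue to its property value and subtract the mean."""
    values = [scale[aa] for aa in seq]
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def ricker(t, a):
    """Mexican-hat wavelet at offset t and scale a, with 1/sqrt(a) scaling."""
    x = t / a
    return (1.0 - x * x) * math.exp(-0.5 * x * x) / math.sqrt(a)


def wavelet_power(signal, scales):
    """Return the mean squared CWT coefficient at each scale."""
    n = len(signal)
    power = []
    for a in scales:
        total = 0.0
        for b in range(n):  # b is the translation along the sequence
            coeff = sum(signal[k] * ricker(k - b, a) for k in range(n))
            total += coeff * coeff
        power.append(total / n)
    return power


def feature_vector(seq, scales=(1, 2, 4, 8)):
    """Concatenate the AAC terms with the per-scale wavelet power terms."""
    return amino_acid_composition(seq) + wavelet_power(to_signal(seq), scales)


# 20 AAC terms plus one power term per scale gives 24 features.
print(len(feature_vector("MKTAYIAKQR")))
```

In a fuller pipeline under the same assumptions, the wavelet power terms of all training sequences would be stacked into a matrix and reduced with PCA before being appended to the AAC terms.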

Year:  2008        PMID: 18726140     DOI: 10.1007/s00726-008-0170-2

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  17 in total

1.  Some illuminating remarks on molecular genetics and genomics as well as drug development. (Review)

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

2.  Prediction of antimicrobial peptides based on sequence alignment and feature selection methods.

Authors:  Ping Wang; Lele Hu; Guiyou Liu; Nan Jiang; Xiaoyun Chen; Jianyong Xu; Wen Zheng; Li Li; Ming Tan; Zugen Chen; Hui Song; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-04-13       Impact factor: 3.240

3.  A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.

Authors:  Xiao Wang; Guo-Zheng Li
Journal:  PLoS One       Date:  2012-05-22       Impact factor: 3.240

4.  Accurate prediction of protein structural class.

Authors:  Xia-Yu Xia; Meng Ge; Zhi-Xin Wang; Xian-Ming Pan
Journal:  PLoS One       Date:  2012-06-19       Impact factor: 3.240

5.  Environmental genes and genomes: understanding the differences and challenges in the approaches and software for their analyses.

Authors:  Marie Lisandra Zepeda Mendoza; Thomas Sicheritz-Pontén; M Thomas P Gilbert
Journal:  Brief Bioinform       Date:  2015-02-11       Impact factor: 11.622

6.  Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.

Authors:  Yunyun Liang; Sanyang Liu; Shengli Zhang
Journal:  Comput Math Methods Med       Date:  2015-12-15       Impact factor: 2.238

7.  PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

Authors:  Liqi Li; Xiang Cui; Sanjiu Yu; Yuan Zhang; Zhong Luo; Hua Yang; Yue Zhou; Xiaoqi Zheng
Journal:  PLoS One       Date:  2014-03-27       Impact factor: 3.240

8.  Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

Authors:  Marcin J Mizianty; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2009-12-13       Impact factor: 3.169

9.  PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.

Authors:  Pufeng Du; Shuwang Gu; Yasen Jiao
Journal:  Int J Mol Sci       Date:  2014-02-26       Impact factor: 5.923

10.  iSS-PseDNC: identifying splicing sites using pseudo dinucleotide composition.

Authors:  Wei Chen; Peng-Mian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2014-05-21       Impact factor: 3.411
