Literature DB >> 28886267

The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins.

Lei Wang1,2, Yaou Zhao1,2, Yuehui Chen1,2, Dong Wang1,2.   

Abstract

Experimental methods play a crucial role in identifying the subcellular localization of proteins and building high-quality databases. However, more efficient, automated computational methods are required to predict the subcellular localization of proteins on a large scale. Various efficient feature extraction methods have been proposed to predict subcellular localization, but challenges remain. In this paper, three novel feature extraction methods are established to improve multi-site prediction. The first novel feature extraction method utilizes repetitive information via moving windows based on a dipeptide pseudo amino acid composition method (R-Dipeptide). The second novel feature extraction method utilizes the impact of each amino acid residue on its following residues based on pseudo amino acids (I-PseAAC). The third novel feature extraction method provides local information about protein sequences that reflects the strength of the physicochemical properties of residues (PseAAC2). The multi-label k-nearest neighbor algorithm (MLKNN) is used to predict the subcellular localization of multi-site virus proteins. The best overall accuracy values of R-Dipeptide, I-PseAAC, and PseAAC2 when applied to dataset S from Virus-mPloc are 59.92%, 59.13%, and 57.94% respectively.

Entities:  

Keywords:  I-PseAAC; PseAAC2; R-Dipeptide; feature extraction; subcellular localization

Mesh:

Substances:

Year:  2017        PMID: 28886267      PMCID: PMC5972939          DOI: 10.1080/21655979.2017.1373536

Source DB:  PubMed          Journal:  Bioengineered        ISSN: 2165-5979            Impact factor:   3.269


  17 in total

1.  A neural network method for identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites.

Authors:  H Nielsen; J Engelbrecht; S Brunak; G von Heijne
Journal:  Int J Neural Syst       Date:  1997 Oct-Dec       Impact factor: 5.866

2.  Virus-mPLoc: a fusion classifier for viral protein subcellular location prediction by incorporating multiple sites.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  J Biomol Struct Dyn       Date:  2010-10

3.  PairProSVM: protein subcellular localization based on local pairwise profile alignment and SVM.

Authors:  Man-Wai Mak; Jian Guo; Sun-Yuan Kung
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2008 Jul-Sep       Impact factor: 3.710

4.  mLASSO-Hum: A LASSO-based interpretable human-protein subcellular localization predictor.

Authors:  Shibiao Wan; Man-Wai Mak; Sun-Yuan Kung
Journal:  J Theor Biol       Date:  2015-07-09       Impact factor: 2.691

5.  Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou׳s general PseAAC.

Authors:  Abdollah Dehzangi; Rhys Heffernan; Alok Sharma; James Lyons; Kuldip Paliwal; Abdul Sattar
Journal:  J Theor Biol       Date:  2014-09-28       Impact factor: 2.691

6.  Effect of different drying methods on the myosin structure, amino acid composition, protein digestibility and volatile profile of squid fillets.

Authors:  Yun Deng; Yali Luo; Yuegang Wang; Yanyun Zhao
Journal:  Food Chem       Date:  2014-09-10       Impact factor: 7.514

7.  Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies.

Authors:  H Nakashima; K Nishikawa
Journal:  J Mol Biol       Date:  1994-04-22       Impact factor: 5.469

8.  Prediction of protein subcellular locations by GO-FunD-PseAA predictor.

Authors:  Kuo-Chen Chou; Yu-Dong Cai
Journal:  Biochem Biophys Res Commun       Date:  2004-08-06       Impact factor: 3.575

9.  A multi-label classifier for predicting the subcellular localization of gram-negative bacterial proteins with both single and multiple sites.

Authors:  Xuan Xiao; Zhi-Cheng Wu; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-06-17       Impact factor: 3.240

10.  Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier.

Authors:  Xiao Wang; Hui Li; Qiuwen Zhang; Rong Wang
Journal:  Biomed Res Int       Date:  2016-04-24       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.