Literature DB >> 20047490

The accurate prediction of protein family from amino acid sequence by measuring features of sequence fragments.

Huixiao Hong1, Qilong Hong, Roger Perkins, Leming Shi, Hong Fang, Zhenqiang Su, Yvonne Dragan, James C Fuscoe, Weida Tong.   

Abstract

The rapid advances in proteomic analyses coupled with the completion of multiple genomes have led to an increased demand for determining protein functions. The first step is classification or prediction into families. A method was developed for the prediction of protein family based only on protein sequence using support vector machine (SVM) models. In these models, the amino acids were classified into three categories (apolar, polar, and charged). Consecutive fragments ranging from one to five were annotated by amino acid type to define the protein features of each protein. SVM models were constructed based on the protein features of a training set of proteins and then examined with an independent set of proteins. The approach was tested for 20 protein families from the iProClass database of Protein Information Resources (PIR). For two-class SVM models, an average prediction accuracy of 0.9985 was achieved, while for multi-class SVM models an accuracy of 0.9941 was achieved. This study demonstrates that SVM based methods can accurately recognize and predict the protein family to which a sequence belongs based solely on its primary amino acid sequence.

Mesh:

Substances:

Year:  2009        PMID: 20047490     DOI: 10.1089/cmb.2008.0115

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  6 in total

1.  Classification of nucleotide sequences using support vector machines.

Authors:  Tae-Kun Seo
Journal:  J Mol Evol       Date:  2010-08-26       Impact factor: 2.395

2.  Understanding and predicting binding between human leukocyte antigens (HLAs) and peptides by network analysis.

Authors:  Heng Luo; Hao Ye; Hui Ng; Leming Shi; Weida Tong; William Mattes; Donna Mendrick; Huixiao Hong
Journal:  BMC Bioinformatics       Date:  2015-09-25       Impact factor: 3.169

3.  sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides.

Authors:  Heng Luo; Hao Ye; Hui Wen Ng; Sugunadevi Sakkiah; Donna L Mendrick; Huixiao Hong
Journal:  Sci Rep       Date:  2016-08-25       Impact factor: 4.379

4.  Competitive docking model for prediction of the human nicotinic acetylcholine receptor α7 binding of tobacco constituents.

Authors:  Hui Wen Ng; Carmine Leggett; Sugunadevi Sakkiah; Bohu Pan; Hao Ye; Leihong Wu; Chandrabose Selvaraj; Weida Tong; Huixiao Hong
Journal:  Oncotarget       Date:  2018-02-08

5.  A Rat α-Fetoprotein Binding Activity Prediction Model to Facilitate Assessment of the Endocrine Disruption Potential of Environmental Chemicals.

Authors:  Huixiao Hong; Jie Shen; Hui Wen Ng; Sugunadevi Sakkiah; Hao Ye; Weigong Ge; Ping Gong; Wenming Xiao; Weida Tong
Journal:  Int J Environ Res Public Health       Date:  2016-03-25       Impact factor: 3.390

6.  Pathway Analysis Revealed Potential Diverse Health Impacts of Flavonoids that Bind Estrogen Receptors.

Authors:  Hao Ye; Hui Wen Ng; Sugunadevi Sakkiah; Weigong Ge; Roger Perkins; Weida Tong; Huixiao Hong
Journal:  Int J Environ Res Public Health       Date:  2016-03-26       Impact factor: 3.390

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.