Literature DB >> 11836223

Classifying G-protein coupled receptors with support vector machines.

Rachel Karchin1, Kevin Karplus, David Haussler.   

Abstract

MOTIVATION: The enormous amount of protein sequence data uncovered by genome research has increased the demand for computer software that can automate the recognition of new proteins. We discuss the relative merits of various automated methods for recognizing G-Protein Coupled Receptors (GPCRs), a superfamily of cell membrane proteins. GPCRs are found in a wide range of organisms and are central to a cellular signalling network that regulates many basic physiological processes. They are the focus of a significant amount of current pharmaceutical research because they play a key role in many diseases. However, their tertiary structures remain largely unsolved. The methods described in this paper use only primary sequence information to make their predictions. We compare a simple nearest neighbor approach (BLAST), methods based on multiple alignments generated by a statistical profile Hidden Markov Model (HMM), and methods, including Support Vector Machines (SVMs), that transform protein sequences into fixed-length feature vectors.
RESULTS: The last is the most computationally expensive method, but our experiments show that, for those interested in annotation-quality classification, the results are worth the effort. In two-fold cross-validation experiments testing recognition of GPCR subfamilies that bind a specific ligand (such as a histamine molecule), the errors per sequence at the Minimum Error Point (MEP) were 13.7% for multi-class SVMs, 17.1% for our SVMtree method of hierarchical multi-class SVM classification, 25.5% for BLAST, 30% for profile HMMs, and 49% for classification based on nearest neighbor feature vector Kernel Nearest Neighbor (kernNN). The percentage of true positives recognized before the first false positive was 65% for both SVM methods, 13% for BLAST, 5% for profile HMMs and 4% for kernNN.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11836223     DOI: 10.1093/bioinformatics/18.1.147

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  73 in total

1.  Construction of a sequence motif characteristic of aminergic G protein-coupled receptors.

Authors:  Enoch S Huang
Journal:  Protein Sci       Date:  2003-07       Impact factor: 6.725

2.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

4.  MotifPrototyper: a Bayesian profile model for motif families.

Authors:  Eric P Xing; Richard M Karp
Journal:  Proc Natl Acad Sci U S A       Date:  2004-07-13       Impact factor: 11.205

5.  Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Authors:  Lian Yi Han; Cong Zhong Cai; Siew Lin Lo; Maxey C M Chung; Yu Zong Chen
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

6.  Availability of short amino acid sequences in proteins.

Authors:  Joji M Otaki; Shunsuke Ienaka; Tomonori Gotoh; Haruhiko Yamamoto
Journal:  Protein Sci       Date:  2005-02-02       Impact factor: 6.725

7.  EHPred: an SVM-based method for epoxide hydrolases recognition and classification.

Authors:  Jia Jia; Liang Yang; Zi-Zhang Zhang
Journal:  J Zhejiang Univ Sci B       Date:  2006-01       Impact factor: 3.066

8.  The prediction of human oral absorption for diffusion rate-limited drugs based on heuristic method and support vector machine.

Authors:  H X Liu; R J Hu; R S Zhang; X J Yao; M C Liu; Z D Hu; B T Fan
Journal:  J Comput Aided Mol Des       Date:  2005-01       Impact factor: 3.686

9.  Discriminative prediction of mammalian enhancers from DNA sequence.

Authors:  Dongwon Lee; Rachel Karchin; Michael A Beer
Journal:  Genome Res       Date:  2011-08-29       Impact factor: 9.043

10.  Cloning and characterization of an orphan seven transmembrane receptor from Schistosoma mansoni.

Authors:  M S Pearson; D P McManus; D J Smyth; M K Jones; A M Sykes; A Loukas
Journal:  Parasitology       Date:  2007-08-23       Impact factor: 3.234

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.