Literature DB >> 19394310

Prediction of functionally important sites from protein sequences using sparse kernel least squares classifiers.

Ke Tang1, Ganesan Pugalenthi, P N Suganthan, Christopher J Lanczycki, Saikat Chakrabarti.   

Abstract

Identification of functionally important sites (FIS) in proteins is a critical problem and can have profound importance where protein structural information is limited. Machine learning techniques have been very useful in successful classification of many important biological problems. In this paper, we adopt the sparse kernel least squares classifiers (SKLSC) approach for classification and/or prediction of FIS using protein sequence derived features. The SKLSC algorithm was applied to 5435 FIS that have been extracted from 312 reliable alignments for a wide range of protein families. We obtained 68.28% sensitivity and 68.66% specificity for training dataset and 65.34% sensitivity and 66.88% specificity for testing dataset. Further, large scale benchmarking study using alignments of 101 protein families containing 1899 FIS showed that our method achieved an average approximately 70% sensitivity in predicting different types of FIS, such as active sites, metal, ligand or protein binding sites. Our findings also indicate that active sites and metal binding sites are comparably easier to predict compared to the ligand and protein binding sites. Despite moderate success, our results suggest the usefulness and potential of SKLSC approach in prediction of FIS using only protein sequence derived information.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19394310      PMCID: PMC2750782          DOI: 10.1016/j.bbrc.2009.04.096

Source DB:  PubMed          Journal:  Biochem Biophys Res Commun        ISSN: 0006-291X            Impact factor:   3.575


  24 in total

1.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

2.  Analysis and prediction of functional sub-types from protein sequence alignments.

Authors:  S S Hannenhalli; R B Russell
Journal:  J Mol Biol       Date:  2000-10-13       Impact factor: 5.469

3.  Characterizing proteolytic cleavage site activity using bio-basis function neural networks.

Authors:  Rebecca Thomson; T Charles Hodgman; Zheng Rong Yang; Austin K Doyle
Journal:  Bioinformatics       Date:  2003-09-22       Impact factor: 6.937

4.  Using a neural network and spatial clustering to predict the location of active sites in enzymes.

Authors:  Alex Gutteridge; Gail J Bartlett; Janet M Thornton
Journal:  J Mol Biol       Date:  2003-07-18       Impact factor: 5.469

5.  Remote homology detection: a motif based approach.

Authors:  Asa Ben-Hur; Douglas Brutlag
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

6.  AAindex: Amino Acid Index Database.

Authors:  S Kawashima; H Ogata; M Kanehisa
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

7.  From fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions.

Authors:  B Zhang; L Rychlewski; K Pawłowski; J S Fetrow; J Skolnick; A Godzik
Journal:  Protein Sci       Date:  1999-05       Impact factor: 6.725

8.  Gene selection algorithms for microarray data based on least squares support vector machine.

Authors:  E Ke Tang; P N Suganthan; Xin Yao
Journal:  BMC Bioinformatics       Date:  2006-02-27       Impact factor: 3.169

9.  A tool for the prediction of functionally important sites in proteins using a library of functional templates.

Authors:  Christopher J Lanczycki; Saikat Chakrabarti
Journal:  Bioinformation       Date:  2008-02-22

10.  A machine learning approach for the identification of odorant binding proteins from sequence-derived properties.

Authors:  Ganesan Pugalenthi; Ke Tang; P N Suganthan; G Archunan; R Sowdhamini
Journal:  BMC Bioinformatics       Date:  2007-09-19       Impact factor: 3.169

View more
  2 in total

1.  CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.

Authors:  Jan-Oliver Janda; Markus Busch; Fabian Kück; Mikhail Porfenenko; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2012-04-05       Impact factor: 3.169

2.  Non-negative discriminative brain functional connectivity for identifying schizophrenia on resting-state fMRI.

Authors:  Qi Zhu; Jiashuang Huang; Xijia Xu
Journal:  Biomed Eng Online       Date:  2018-03-13       Impact factor: 2.819

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.