Literature DB >> 14980014

Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships.

Li Liao1, William Stafford Noble.   

Abstract

One key element in understanding the molecular machinery of the cell is to understand the structure and function of each protein encoded in the genome. A very successful means of inferring the structure or function of a previously unannotated protein is via sequence similarity with one or more proteins whose structure or function is already known. Toward this end, we propose a means of representing proteins using pairwise sequence similarity scores. This representation, combined with a discriminative classification algorithm known as the support vector machine (SVM), provides a powerful means of detecting subtle structural and evolutionary relationships among proteins. The algorithm, called SVM-pairwise, when tested on its ability to recognize previously unseen families from the SCOP database, yields significantly better performance than SVM-Fisher, profile HMMs, and PSI-BLAST.

Mesh:

Substances:

Year:  2003        PMID: 14980014     DOI: 10.1089/106652703322756113

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  61 in total

1.  Descriptor-based protein remote homology identification.

Authors:  Ziding Zhang; Sunil Kochhar; Martin G Grigorov
Journal:  Protein Sci       Date:  2005-01-04       Impact factor: 6.725

2.  Framework for kernel regularization with application to protein clustering.

Authors:  Fan Lu; Sündüz Keles; Stephen J Wright; Grace Wahba
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-18       Impact factor: 11.205

3.  Improved prediction of malaria degradomes by supervised learning with SVM and profile kernel.

Authors:  Rui Kuang; Jianying Gu; Hong Cai; Yufeng Wang
Journal:  Genetica       Date:  2008-12-06       Impact factor: 1.082

4.  Gene function prediction based on genomic context clustering and discriminative learning: an application to bacteriophages.

Authors:  Jason Li; Saman K Halgamuge; Christopher I Kells; Sen-Lin Tang
Journal:  BMC Bioinformatics       Date:  2007-05-22       Impact factor: 3.169

Review 5.  Machine learning for in silico virtual screening and chemical genomics: new strategies.

Authors:  Jean-Philippe Vert; Laurent Jacob
Journal:  Comb Chem High Throughput Screen       Date:  2008-09       Impact factor: 1.339

6.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

7.  Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information.

Authors:  Wenwei Xiong; Tonghua Li; Kai Chen; Kailin Tang
Journal:  Nucleic Acids Res       Date:  2009-08-03       Impact factor: 16.971

8.  Protein-protein interaction based on pairwise similarity.

Authors:  Nazar Zaki; Sanja Lazarova-Molnar; Wassim El-Hajj; Piers Campbell
Journal:  BMC Bioinformatics       Date:  2009-05-17       Impact factor: 3.169

9.  A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2008-12-01       Impact factor: 3.169

10.  Enhanced protein fold recognition through a novel data integration approach.

Authors:  Yiming Ying; Kaizhu Huang; Colin Campbell
Journal:  BMC Bioinformatics       Date:  2009-08-26       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.