Literature DB >> 16706721

Finding motifs in protein secondary structure for use in function prediction.

Sébastien Ferré1, Ross D King.   

Abstract

This paper presents a novel algorithm for the discovery of biological sequence motifs. Our motivation is the prediction of gene function. We seek to discover motifs and combinations of motifs in the secondary structure of proteins for application to the understanding and prediction of functional classes. The motifs found by our algorithm allow both flexible length structural elements and flexible length gaps and can be of arbitrary length. The algorithm is based on neither top-down nor bottom-up search, but rather is dichotomic. It is also "anytime," so that fixed termination of the search is not necessary. We have applied our algorithm to yeast sequence data to discover rules predicting function classes from secondary structure. These resultant rules are informative, consistent with known biology, and a contribution to scientific knowledge. Surprisingly, the rules also demonstrate that secondary structure prediction algorithms are effective for membrane proteins and suggest that the association between secondary structure and function is stronger in membrane proteins than globular ones. We demonstrate that our algorithm can successfully predict gene function directly from predicted secondary structure; e.g., we correctly predict the gene YGL124c to be involved in the functional class "cytoplasmic and nuclear degradation." Datasets and detailed results (generated motifs, rules, evaluation on test dataset, and predictions on unknown dataset) are available at www.aber.ac.uk/compsci/Research/bio/dss/yeast.ss.mips/, and www.genepredictions.org.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16706721     DOI: 10.1089/cmb.2006.13.719

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  Structural fragment clustering reveals novel structural and functional motifs in alpha-helical transmembrane proteins.

Authors:  Annalisa Marsico; Andreas Henschel; Christof Winter; Anne Tuukkanen; Boris Vassilev; Kerstin Scheubert; Michael Schroeder
Journal:  BMC Bioinformatics       Date:  2010-04-26       Impact factor: 3.169

2.  Bayesian Markov Random Field analysis for protein function prediction based on network data.

Authors:  Yiannis A I Kourmpetis; Aalt D J van Dijk; Marco C A M Bink; Roeland C H J van Ham; Cajo J F ter Braak
Journal:  PLoS One       Date:  2010-02-24       Impact factor: 3.240

3.  Improving protein secondary structure prediction based on short subsequences with local structure similarity.

Authors:  Hsin-Nan Lin; Ting-Yi Sung; Shinn-Ying Ho; Wen-Lian Hsu
Journal:  BMC Genomics       Date:  2010-12-02       Impact factor: 3.969

4.  From E-MAPs to module maps: dissecting quantitative genetic interactions using physical interactions.

Authors:  Igor Ulitsky; Tomer Shlomi; Martin Kupiec; Ron Shamir
Journal:  Mol Syst Biol       Date:  2008-07-15       Impact factor: 11.429

5.  Functionally important segments in proteins dissected using Gene Ontology and geometric clustering of peptide fragments.

Authors:  Karuppasamy Manikandan; Debnath Pal; Suryanarayanarao Ramakumar; Nathan E Brener; Sitharama S Iyengar; Guna Seetharaman
Journal:  Genome Biol       Date:  2008-03-10       Impact factor: 13.583

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.