| Literature DB >> 17945916 |
Themis P Exarchos1, Costas Papaloukas, Christos Lampros, Dimitrios I Fotiadis.
Abstract
Protein classification in terms of fold recognition can be employed to determine the structural and functional properties of a newly discovered protein. In this work sequential pattern mining (SPM) is utilized for sequence-based fold recognition. One of the most efficient SPM algorithms, cSPADE, is employed for protein primary structure analysis. Then a classifier uses the extracted sequential patterns for classifying proteins of unknown structure in the appropriate fold category. The proposed methodology exhibited an overall accuracy of 36% in a multi-class problem of 17 candidate categories. The classification performance reaches up to 65% when the three most probable protein folds are considered.Mesh:
Substances:
Year: 2006 PMID: 17945916 DOI: 10.1109/IEMBS.2006.260336
Source DB: PubMed Journal: Conf Proc IEEE Eng Med Biol Soc ISSN: 1557-170X