Konstantinos Blekas, Dimitrios I Fotiadis, Aristidis Likas.
Abstract
We present a system for multi-class protein classification based on neural networks. The basic issue in constructing neural network systems for protein classification is the sequence encoding scheme used to feed the neural network. To deal with this problem, we propose a method that maps a protein sequence into a numerical feature space using the matching scores of the sequence against groups of conserved patterns (called motifs) in protein families. We consider two alternative ways of identifying the motifs to be used for feature generation and provide a comparative evaluation of the two schemes. We also evaluate the impact of incorporating background features (2-grams) on the performance of the neural system. Experimental results on real datasets indicate that the proposed method is highly efficient and is superior to other well-known methods for protein classification.
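The encoding idea described in the abstract, one feature per motif plus optional 2-gram background features, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how motifs are represented or how matching scores are computed, so the sketch assumes motifs are plain strings and scores each one by its best ungapped identity match along the sequence. The names `motif_score`, `two_gram_features`, and `encode` are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs of the 20 standard amino acids (the 2-grams).
TWO_GRAMS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def two_gram_features(seq):
    """Relative frequency of each 2-gram in the sequence (background features)."""
    counts = dict.fromkeys(TWO_GRAMS, 0)
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
    total = max(len(seq) - 1, 1)  # avoid division by zero on short input
    return [counts[g] / total for g in TWO_GRAMS]


def motif_score(seq, motif):
    """Assumed scoring: best fraction of identical positions over all
    ungapped windows of the sequence that have the motif's length."""
    m = len(motif)
    if m == 0 or len(seq) < m:
        return 0.0
    best = 0
    for start in range(len(seq) - m + 1):
        window = seq[start:start + m]
        matches = sum(1 for a, b in zip(window, motif) if a == b)
        best = max(best, matches)
    return best / m


def encode(seq, motifs, use_two_grams=True):
    """Map a protein sequence to a fixed-length numeric feature vector:
    one matching score per motif, optionally followed by 400 2-gram features."""
    features = [motif_score(seq, motif) for motif in motifs]
    if use_two_grams:
        features += two_gram_features(seq)
    return features
```

The resulting vector has a fixed length regardless of sequence length, which is what allows it to serve as input to a neural network classifier. A call such as `encode("ACDEFG", ["CDE", "WWW"], use_two_grams=False)` yields one score per motif.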
Year: 2005 PMID: 15725734 DOI: 10.1089/cmb.2005.12.64
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479