| Literature DB >> 22296788 |
Abstract
SUMMARY: We present an accurate and fast web server, WegoLoc for predicting subcellular localization of proteins based on sequence similarity and weighted Gene Ontology (GO) information. A term weighting method in the text categorization process is applied to GO terms for a support vector machine classifier. As a result, WegoLoc surpasses the state-of-the-art methods for previously used test datasets. WegoLoc supports three eukaryotic kingdoms (animals, fungi and plants) and provides human-specific analysis, and covers several sets of cellular locations. In addition, WegoLoc provides (i) multiple possible localizations of input protein(s) as well as their corresponding probability scores, (ii) weights of GO terms representing the contribution of each GO term in the prediction, and (iii) a BLAST E-value for the best hit with GO terms. If the similarity score does not meet a given threshold, an amino acid composition-based prediction is applied as a backup method. AVAILABILITY: WegoLoc and User's guide are freely available at the website http://www.btool.org/WegoLoc CONTACT: smchiks@ks.ac.kr; dougnam@unist.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data is available at http://www.btool.org/WegoLoc.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22296788 DOI: 10.1093/bioinformatics/bts062
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937