| Literature DB >> 27323846 |
Xiaotong Guo1, Fulin Liu1, Ying Ju2, Zhen Wang2, Chunyu Wang3.
Abstract
Predicting protein subcellular location is necessary for understanding cell function. Several machine learning methods have been developed for computational prediction of primary protein sequences because wet experiments are costly and time consuming. However, two problems still exist in state-of-the-art methods. First, several proteins appear in different subcellular structures simultaneously, whereas current methods only predict one protein sequence in one subcellular structure. Second, most software tools are trained with obsolete data and the latest new databases are missed. We proposed a novel multi-label classification algorithm to solve the first problem and integrated several latest databases to improve prediction performance. Experiments proved the effectiveness of the proposed method. The present study would facilitate research on cellular proteomics.Entities:
Year: 2016 PMID: 27323846 PMCID: PMC4914962 DOI: 10.1038/srep28087
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379