| Literature DB >> 28350835 |
Xiaogeng Wan1, Xin Zhao1, Stephen S T Yau1.
Abstract
Protein classification is one of the critical problems in bioinformatics. Early studies used geometric distances and polygenetic-tree to classify proteins. These methods use binary trees to present protein classification. In this paper, we propose a new protein classification method, whereby theories of information and networks are used to classify the multivariate relationships of proteins. In this study, protein universe is modeled as an undirected network, where proteins are classified according to their connections. Our method is unsupervised, multivariate, and alignment-free. It can be applied to the classification of both protein sequences and structures. Nine examples are used to demonstrate the efficiency of our new method.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28350835 PMCID: PMC5370107 DOI: 10.1371/journal.pone.0174386
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240