| Literature DB >> 28393857 |
Lulu Yu1, Yusen Zhang2, Ivan Gutman3, Yongtang Shi4, Matthias Dehmer5,6.
Abstract
We develop a novel position-feature-based model for protein sequences by employing physicochemical properties of 20 amino acids and the measure of graph energy. The method puts the emphasis on sequence order information and describes local dynamic distributions of sequences, from which one can get a characteristic B-vector. Afterwards, we apply the relative entropy to the sequences representing B-vectors to measure their similarity/dissimilarity. The numerical results obtained in this study show that the proposed methods leads to meaningful results compared with competitors such as Clustal W.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28393857 PMCID: PMC5385872 DOI: 10.1038/srep46237
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Matrix mapped to the bipartite graph.
Figure 2Phylogenetic tree of the 9 ND5 proteins constructed by our method.
Figure 3(a) Phylogenetic tree of 24 TFs constructed by our method. (b) Phylogenetic tree of 24 TFs constructed by Clustal W.
Figure 4(a) Phylogenetic tree of 27 AFPs constructed by our method. (b) Phylogenetic tree of 27 AFPs constructed by Clustal W.
Figure 5Phylogenetic tree of 50 beta-globin proteins constructed by our method.