Ping-An He, Hong Tao, Tingting Ma, Qi Dai, Yuhua Yao.
Abstract
BACKGROUND AND OBJECTIVE: The rapidly growing volume of available protein data creates a need for low-complexity computational methods that can accurately infer protein structure, function, and evolution.
METHOD: A new description of proteins, based on five topological indices of a star-like graph representation and the occurrence frequencies of the 20 amino acids, was proposed to compare the similarity of proteins.
RESULTS: A phylogenetic tree of eight ND6 proteins was constructed to demonstrate the effectiveness and rationality of our approach. We then applied the same method to RNA polymerase proteins of several influenza virus subtypes to infer their phylogenetic relationships. The results showed that the phylogenetic relationships among influenza virus RNA polymerases are closely related to the host species and the geographical distribution of the viruses.
CONCLUSION: This novel approach is based on a mapping which can be recaptured mathematically without loss of information. Copyright© Bentham Science Publishers.
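The alignment-free comparison described in the METHOD can be illustrated with a minimal sketch covering only the amino acid frequency part of the descriptor. The abstract does not specify the five topological indices or the distance measure, so this sketch leaves the indices out and assumes a Euclidean distance. The names `aa_frequencies`, `euclidean`, and `distance_matrix` are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import sqrt

# The 20 standard amino acids, in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_frequencies(seq):
    """Return the 20-dimensional occurrence-frequency vector of a protein."""
    counts = Counter(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[a] / total for a in AMINO_ACIDS]


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors (an assumption;
    the abstract does not state which distance the authors used)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_matrix(seqs):
    """Pairwise distance matrix over the frequency vectors of `seqs`."""
    vecs = [aa_frequencies(s) for s in seqs]
    return [[euclidean(u, v) for v in vecs] for u in vecs]
```

A matrix of this kind could be passed to a standard tree-building procedure such as UPGMA or neighbor joining. The full descriptor in the paper would extend each vector with the five topological indices before distances are computed.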
Keywords:
RNA polymerase; alignment-free; phylogenetic tree; pseudo amino acids composition; star-like graph; topological indices
Year: 2017
PMID: 28215145 DOI: 10.2174/1386207320666170217152811
Source DB: PubMed Journal: Comb Chem High Throughput Screen ISSN: 1386-2073 Impact factor: 1.339