Literature DB >> 33646531

Analysis of protein determinants of host-specific infection properties of polyomaviruses using machine learning.

Myeongji Cho1,2, Hayeon Kim3, Hyeon S Son4,5,6.   

Abstract

BACKGROUND: The large tumor antigen (LT-Ag) and major capsid protein VP1 are known to play important roles in determining the host-specific infection properties of polyomaviruses (PyVs).
OBJECTIVE: The objective of this study was to investigate the physicochemical properties of amino acids of LT-Ag and VP1 that have important effects on host specificity, as well as classification techniques used to predict PyV hosts.
METHODS: We collected and used reference sequences of 86 viral species for analysis. Based on the clustering pattern of the reconstructed phylogenetic tree, the dataset was divided into three groups: mammalian, avian, and fish. We then used random forest (RF), naïve Bayes (NB), and k-nearest neighbors (kNN) algorithms for host classification.
RESULTS: Among the three algorithms, classification accuracy using kNN was highest in both LT-Ag (ACC = 98.83) and VP1 (ACC = 96.51). The amino acid physicochemical property most strongly correlated with host classification was charge, followed by solvent accessibility, polarity, and hydrophobicity in LT-Ag. However, in VP1, amino acid composition showed the highest correlation with host classification, followed by charge, normalized van der Waals volume, and solvent accessibility.
CONCLUSIONS: The results of the present study suggest the possibility of determining or predicting the host range and infection properties of PyVs at the molecular level by identifying the host species of active and emerging PyVs that exhibit different infection properties among diverse host species. Structural and biochemical differences of LT-Ag and VP1 proteins in host species that reflect these amino acid properties can be considered primary factors that determine the host specificity of PyV.

Entities:  

Keywords:  Classification; Evolutionary relationship; Host specificity; Machine learning; Polyomavirus

Mesh:

Substances:

Year:  2021        PMID: 33646531     DOI: 10.1007/s13258-021-01059-2

Source DB:  PubMed          Journal:  Genes Genomics        ISSN: 1976-9571            Impact factor:   1.839


  27 in total

1.  Prediction of protein cellular attributes using pseudo-amino acid composition.

Authors:  K C Chou
Journal:  Proteins       Date:  2001-05-15

2.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses.

Authors:  Maryam Esmaeili; Hassan Mohabatkar; Sasan Mohsenzadeh
Journal:  J Theor Biol       Date:  2009-12-02       Impact factor: 2.691

4.  Protein folding class predictor for SCOP: approach based on global descriptors.

Authors:  I Dubchak; I Muchnik; S H Kim
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1997

5.  Urinary MHPG: improved tricyclic antidepressant drug selection in clinical practice.

Authors: 
Journal:  Med J Aust       Date:  1979-09-08       Impact factor: 7.738

6.  Prediction of protein folding class using global description of amino acid sequence.

Authors:  I Dubchak; I Muchnik; S R Holbrook; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1995-09-12       Impact factor: 11.205

Review 7.  A cornucopia of human polyomaviruses.

Authors:  James A DeCaprio; Robert L Garcea
Journal:  Nat Rev Microbiol       Date:  2013-03-11       Impact factor: 60.633

8.  High-affinity Rb binding, p53 inhibition, subcellular localization, and transformation by wild-type or tumor-derived shortened Merkel cell polyomavirus large T antigens.

Authors:  Sophie Borchert; Manja Czech-Sioli; Friederike Neumann; Claudia Schmidt; Peter Wimmer; Thomas Dobner; Adam Grundhoff; Nicole Fischer
Journal:  J Virol       Date:  2013-12-26       Impact factor: 5.103

9.  Codon usage patterns of LT-Ag genes in polyomaviruses from different host species.

Authors:  Myeongji Cho; Hayeon Kim; Hyeon S Son
Journal:  Virol J       Date:  2019-11-14       Impact factor: 4.099

10.  The Ancient Evolutionary History of Polyomaviruses.

Authors:  Christopher B Buck; Koenraad Van Doorslaer; Alberto Peretti; Eileen M Geoghegan; Michael J Tisza; Ping An; Joshua P Katz; James M Pipas; Alison A McBride; Alvin C Camus; Alexa J McDermott; Jennifer A Dill; Eric Delwart; Terry F F Ng; Kata Farkas; Charlotte Austin; Simona Kraberger; William Davison; Diana V Pastrana; Arvind Varsani
Journal:  PLoS Pathog       Date:  2016-04-19       Impact factor: 6.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.