| Literature DB >> 34898980 |
R Shyama Prasad Rao1, Nagib Ahsan2,3, Chunhui Xu4, Lingtao Su4, Jacob Verburgt5, Luca Fornelli2,6, Daisuke Kihara5,7, Dong Xu4.
Abstract
SARS-CoV-2, responsible for the current COVID-19 pandemic that claimed over 5.0 million lives, belongs to a class of enveloped viruses that undergo quick evolutionary adjustments under selection pressure. Numerous variants have emerged in SARS-CoV-2, posing a serious challenge to the global vaccination effort and COVID-19 management. The evolutionary dynamics of this virus are only beginning to be explored. In this work, we have analysed 1.79 million spike glycoprotein sequences of SARS-CoV-2 and found that the virus is fine-tuning the spike with numerous amino acid insertions and deletions (indels). Indels seem to have a selective advantage as the proportions of sequences with indels steadily increased over time, currently at over 89%, with similar trends across countries/variants. There were as many as 420 unique indel positions and 447 unique combinations of indels. Despite their high frequency, indels resulted in only minimal alteration of N-glycosylation sites, including both gain and loss. As indels and point mutations are positively correlated and sequences with indels have significantly more point mutations, they have implications in the evolutionary dynamics of the SARS-CoV-2 spike glycoprotein.Entities:
Keywords: COVID-19; Computational proteomics; N-glycosylation sites; SARS-CoV-2; indels; molecular evolution; selection; sequence analysis
Year: 2021 PMID: 34898980 PMCID: PMC8655444 DOI: 10.1177/11769343211064616
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625