Literature DB >> 36247800

Evolution of SARS-CoV-2 virus and assessment of the effectiveness of COVID-19 vaccine.

Veljko Veljkovic1, Vladimir Perovic1, Isabelle Chambers2, Slobodan Paessler1,2,3.   

Abstract

A safe and effective vaccine is urgently needed to bring the current SARS-CoV-2 pandemic under control. The spike protein (SP) of SARS-CoV-2 represents the principal target for most vaccines currently under development. Despite the presence of a CoV proof-reading function in viral replication, SP protein from SARS-CoV still extensively mutates, which might have an impact on current and future vaccine development. Here, we present analysis of more than 1600 SP unique variants suggesting that vaccine candidates based on the Wuhan-Hu-1 reference strain would be effective against most of currently circulated SARS-CoV-2 viruses, but that further monitoring of the evolution of this virus is important for identification of other mutations, which could affect the effectiveness of vaccines. Copyright:
© 2021 Veljkovic V et al.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; mutations; vaccine effectiveness

Year:  2021        PMID: 36247800      PMCID: PMC9490287          DOI: 10.12688/f1000research.28215.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


Introduction

The current Coronavirus Disease 2019 (COVID-19) pandemic, caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), represents an unprecedented health threat resulting in over 1.1 million deaths globally ( www.worldometers.info/coronavirus) and significant economic damage. There is a consensus that development of an effective and safe vaccine represents the best strategy to control the COVID-19 pandemic, and consequently, this endeavor remains a top public health priority. Researchers and drug companies around the world are working hard to develop a COVID-19 vaccine, and currently 282 vaccine candidates are in clinical trials or will enter clinical investigation soon (clinicaltrial.gov). The spike glycoprotein (SP) of SARS-CoV-2 is the principal target for most these vaccine candidates. Despite the presence of a CoV proof-reading function in viral replication , SARS-CoV still extensively mutates, which might have an impact on current and future vaccine development. In the GISAID database more than 1,600 SARS-CoV-2 strains with unique combinations of mutations in SP are deposited. In order to fully understand the impact of the mutations on the SP properties (pathogenesis, virulence, transmissibility, immunogenicity), advanced laboratory studies are required. These investigations take time and are done in close collaboration between different research groups, institutions and academia. However, some important information about the biological effects of mutations could and should also be obtained by the in silico analysis of SP, which can be performed quickly and with a minimal experimental foreknowledge at the time of analysis. Previously, a novel bioinformatics approach, which is based on electronic biology, was used in the assessment of the effect of mutations on the vaccine effectiveness (VE) for seasonal flu vaccines. Most recently, this approach allowed successful prediction of VE for two successive flu seasons . Here, we used this bioinformatics approach for the analysis of the effect(s) of mutations in SP from SARS-CoV-2 virus on the effectiveness of the COVID-19 vaccines and/or vaccine candidates. The presented results showed that VE of vaccine candidates that are based on SP from the reference SARS-CoV-2 strain YP_009724390 remains between 76% and 94%, despite an extensive mutation rate in this important and likely protective antigen. Our analysis also suggests that further close monitoring of SP mutations might be essential in order to be able to respond to changes that may influence VE in the future.

Methods

Viruses

We analyzed the non-redundant subunit 1 of S proteins (SP1) from human SARS-CoV-2 viruses deposited in the GISAID ( https://www.epicov.org/epi3/cfrontend#18c7c7) from January to October, 2020, and SP1 from viruses isolated from mink that are presented in the same database. Prototype virus Wuhan-HU-1 (YP_009724390) is used in the analysis as the wild type (WT) and as the vaccine virus.

Informational spectrum method (ISM)

The ISM is the virtual spectroscopy method for the analysis of the protein biological properties . This bioinformatics approach encompasses three basic steps: (i) the representation of the primary structure of the protein as the numerical sequence by the assignment to each amino acid of the corresponding value of the electron-ion interaction potential (EIIP), (ii) the transformation of the obtained numerical sequence into the informational spectrum (IS), and (iii) the calculation of the cross-spectrum (CS) between interacting proteins. The EIIP is the physical parameter that determines the long-range interactions of biological molecules (interactions at distances >5Å) . This molecular descriptor is defined by the following equation : where Z* is the average quasivalence number (AQVN): where N is the total number of atoms, n is the number of atoms of the i-th component, Z is the valence number of the i-th atomic component in the molecule and m is the number of components. The EIIP values calculated according to the Equation (1) are given in Rydbergs (Ry). The numerical sequence, representing the primary structure of protein, is transformed into the IS by the discrete Fourier transformation where x(m) is the m-th member of a given numerical series, N is the total number of points in this series, and X(n) are discrete Fourier transformation coefficients. In this way, the information defined by the sequence of amino acids is represented as the series of frequencies and their amplitudes. The frequencies in IS correspond to the distribution of the structural motifs with defined physicochemical properties determining a biological characteristics of a protein. When comparing proteins, which share the same biological or biochemical function, the ISM technique allows detection of code/frequency pairs that are specific for their common biological properties, or which correlate with their specific interactions.

Phylogenetic analysis

The algorithm of the ISM-based phylogenetic analysis, which was used for the assessment of the biological effect of mutations in SP1 proteins from SARS-CoV-2, was previously described in detail . Figure 1 provides the schematic presentation of this algorithm. Here, we used an ISM distance measure d defined on the ratio between specific frequencies F(0.257) and F(0.479) in IS of SP1 protein, which characterize its interaction with host factors .
Figure 1.

The schematic representation of the ISM-based phylogenetic algorithm ISTREE.

Results and discussion

Figure 2 presents the ISM-based phylogenetic tree (the high resolution tree is given in Extended data: Figure 1S ) calculated for 1643 nonredundant SP1 proteins from SARS-CoV-2 viruses collected from January to October, 2020. Analyzed SP1s in this tree are grouped into four separated clusters. The largest cluster A, encompassing 78% of analyzed proteins, contains two sub-clusters, which includes the Wuhan-Hu-1 reference sequence for the Spike protein that is also the basis for different vaccine candidates (sub-cluster A2) and viruses with SP1 with the most abundant mutation D614G (sub-cluster A1). According to the IS concept, proteins that are grouped in the same cluster in the ISM-base phylogenetic tree have similar interacting and immunological profiles . This indicates that a vaccine based on SP from the Wuhan-Hu-1 reference strain will be effective against viruses that are grouped together with this virus in the sub-cluster A2. It was suggested that a single vaccine candidate based on the reference SARS-CoV-2 strain would likely match most of currently circulating variants, including the mutant D614G . Recently, it also showed that sera from D614-infected hamsters exhibit modestly higher neutralization titers against G614 virus than against D614 virus, indicating that this mutation may not reduce the ability of vaccines in clinical trials to protect against COVID-19 . This suggests that viruses which are grouped together with the mutant D614G in the sub-cluster A1 would also be responsive to the vaccine candidates that are based on the Wuhan-Hu-1 reference strain. In contrast to the cluster A, viruses that are grouped into other three small clusters B, C and D do not perfectly match the vaccine virus and might be resistant to the vaccine.
Figure 2.

The ISM-based phylogenetic tree of the non-redundant SP1 proteins from SARS-CoV-2 viruses deposited in GISAID from January to October 2020.

In Table 1, the distribution by months of non-redundant SP1 from potentially vaccine responsive SARS-CoV-2 is reported. These data suggest that VE of the vaccine candidates, estimated on the base of the IS parameters would be between 76% and 95%. This estimation of VE is limited with the assumption that viruses from the cluster A dominate among currently circulating SARS-CoV-2. Despite this limitation, the predicted VE is in accord with the results of the recently completed Phase-3 clinical trial of the candidate COVID-19 vaccine. This study, which enrolled 43,538 subjects, showed that the tested vaccine candidate to be more than 90% effective in preventing COVID-19 in participants.
Table 1.

Distribution by months of the non-redundant SP1 proteins from SARS-CoV-2 viruses deposited in GISAID from January to October, 2020.

MonthNumber of virusesVaccine responsive viruses [%]
All nonredundant SP1164378
February4295
March43186
April52583
May36086
June37076
July32278
August32987
September33392
October12176
Although potentially vaccine non-responsive viruses in the clusters B, C and D represent only 22% of all 1,643 viruses with non-redundant SP1, these viruses could negatively affect VE. Mass vaccination, which is expected in the near future, creates an immune pressure in the population that would lead to increase in mutations in SP1 as well as in natural selection and preservation of escape mutants. Moreover, this fraction of mutant viruses that can escape vaccine protection and be efficiently transmitted could significantly increase among circulating SARS-CoV-2 once vaccination starts. For this reason, real-time monitoring of virus evolution with different methods, including the ISM-based phylogenetic tool for detection of “functional” changes in the SP1 of SARS-CoV-2 is needed. In September 2020 in Denmark, 12 human cases of COVID-19 have been identified with SARS-CoV-2 unique variants associated with farmed minks. Viruses isolated from these patients had a combination of mutations, or changes that have not been previously observed and the implications of the identified changes in this variant are not yet well understood. Preliminary findings indicate that this particular mink-associated variant identified in both minks and the 12 human cases could negatively affect VE because mutations in these viruses moderately decreased sensitivity to neutralizing antibodies. Danish authorities have undertaken different actions to limit the further spread of this variant of the virus among mink and human populations, including culling of all farmed mink in Denmark (more than 17 million). In order to assess the effect of these mutations on VE we calculated the ISM-phylogenetic tree for 5 non-redundant SP1 sequences from SARS-CoV-2 viruses from mink deposited in GISAID ( Figure 3). As presented, four of five viruses are grouped with the vaccine virus suggesting that these variants would not significantly affect VE of the COVID-19 vaccines that are based on the SP from the Wuhan-Hu-1 reference strain. The virus with the combination of mutations G261D and Y453F is out of this cluster and it could be potentially resistant to these vaccines. This suggests that further spread of this variant should be carefully monitored. Of note is that the virus with single mutation Y453F is grouped with the vaccine virus indicating that this variant will not significantly affect VE although this mutation is located in RBD. Contrary, this mutation in combination with G261D could decrease responsiveness to vaccine. This result also points out that the assessment of biological effect of mutations should be included in analysis all mutations that are present in protein.
Figure 3.

The ISM-based phylogenetic tree of the non-redundant SP1 proteins from SARS-CoV-2 isolated from minks (GISAID).

Conclusions

The presented analysis of SP1 proteins, performed by the electronic biology tool, suggest (i) that vaccine candidates based on the Wuhan-Hu-1 reference strain would be effective against most of currently circulated SARS-CoV-2 variants, and ii) further ISM-based monitoring of the evolution of SARS-CoV-2 is important for identification of other mutations, which could affect the effectiveness of vaccines against this virus. Therefore, the scientific community needs to be proactive in submitting genetic sequences into databases such as GSAID. For example, we have States such as Wisconsin, South and North Dakota with low mortality rate 0.8, 1.2, 1.2, respectively (GISAID accessed on November 39, 2020), but no ability to check genetic sequences of viruses from those States as none were deposited. It is very important to support genetic data collection and to share that data in real time in order to organize successful monitoring using different approaches including the one published in this paper.

Data availability

Sequence data of the viruses were obtained from the GISAID Database. To access the database each individual user should complete the “ Registration Form For Individual Users”. This form, together with detailed instructions, are available on the website. After submission of the Registration form, the user will receive a password. There are no other restrictions for access to GISAID. Conditions of access to, and use of, the GISAID Database and Data are defined by Terms of Use.

Extended data

Harvard Dataverse: The ISM-based phylogenetic tree of the non-redundant SP1 proteins from SARS-CoV-2 viruses deposited in GISAID from January to October 2020, https://doi.org/10.7910/DVN/JLXWWP . This project contains the following extended data: Figure 1S Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication). Dear Editor, I want to inform you that this article is an exciting piece of work. The study presented the analysis of SP unique variants using the electronic biology tool whereas SP is an important target for developing most of the SARS-CoV-2 vaccines. The authors found that the vaccine candidates based on the Wuhan-Hu-1 reference strain would be effective against most of the currently circulated SARS-CoV-2 variants and that ISM-based monitoring of the evolution of SARS-CoV-2 is important for the identification of other mutations, which could affect the effectiveness of vaccines against this virus. Is the work clearly and accurately presented and does it cite the current literature? Yes If applicable, is the statistical analysis and its interpretation appropriate? Yes Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Yes Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: Clinical Pharmacy, Respiratory system diseases and COVID-19 I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. The article by Veljkovic et al. describes the application of an in silico method previously developed by the group to monitor the "functional" evolution of viral sequences based on the informational spectrum method (ISM). This method is based on the primary amino acid sequences of proteins, from which informational spectra are being derived that ultimately inform about protein-protein interactions. This method allows a rapid assessment of functional alterations in evolving proteins like changes in antigenicity or receptor usage and has been validated in previous studies of the Veljkovic group. A key study was the validation of mutations in the hemagglutinin sequences of H5N1 Influenza viruses predicted by ISM to alter receptor specificity in functional in vitro studies by Schmier et al. (2015), which should be mentioned in the introduction. Here this method is applied to analyze more than 1600 non-redundant sequences of the spike glycoprotein of the SARS-CoV-2 actually circulating and deposited in the GISAID database over time to derive their immunological profiles and to detect potential escape from the actual vaccine strains. The derived phylogenetic tree uncovers 2 major clusters corresponding to the Wuhan-Hu1 reference group and a derived D614G mutant covered by the actual vaccines and 3 minor clusters representing 22% of sequences, of which vaccine coverage is not known. These results nicely show how potential candidates for vaccine escape mutants can rapidly be identified by the ISM method, which can then be proven experimentally. The authors also identified by ISM a subgroup of mink SARS-CoV-2 sequences transmitted to humans in Denmark, which represent outliers compared to the actual vaccine strains and should thus be watched carefully in the future. Overall, this paper nicely underlines the value of the ISM method, which can rapidly be performed on a huge set of related sequences to identify potentially "dangerous" outliers, which may be functionally altered in terms of virulence or antigenic escape. This sets the ground to focus experimentally on these outlier strains for functional implications of the predicted mutations, thus limiting long-lasting experimental work to essential strains. As such, the manuscript potentially contributes essentially to the control of the most acute viral health problem we are actually facing. The paper is generally well written, minor linguistic issues should be corrected in the manuscript. Minor issues: Abstract: "....most vaccines currently in clinical applications or under development"; "Here we present computational analysis of more than...."; "effective against most of currently circulating SARS-CoV-2 viruses. However, further monitoring....". Introduction:  "...in silico analysis of SP, which can be performed much more rapidly and with minimal experimental knowledge at the time of analysis. Furthermore, in a proof of principle study, it was shown that HA mutations predicted in silico to alter avian receptor specificity of H5N1 Influenza A viruses indeed result in enhanced specificity for human receptors (Schmier et al., Scientific Reports 2015). Results: In Fig. 2 it should be indicated, what percentage of sequences has been assigned to the different groups (A1, A2, B, C, D). Also in Table 1, it should be indicated in an additional column, what percentage of sequences belongs to the minor groups (B, C, D) over the months to detect a potential increase over time. Is the work clearly and accurately presented and does it cite the current literature? Yes If applicable, is the statistical analysis and its interpretation appropriate? Not applicable Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Yes Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: Virology, HIV-1, Influenza Viruses, vaccine development, virus entry, antigenicity I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  12 in total

1.  Is it possible to analyze DNA and protein sequences by the methods of digital signal processing?

Authors:  V Veljković; I Cosić; B Dimitrijević; D Lalović
Journal:  IEEE Trans Biomed Eng       Date:  1985-05       Impact factor: 4.538

2.  Spike mutation D614G alters SARS-CoV-2 fitness.

Authors:  Jessica A Plante; Yang Liu; Jianying Liu; Hongjie Xia; Bryan A Johnson; Kumari G Lokugamage; Xianwen Zhang; Antonio E Muruato; Jing Zou; Camila R Fontes-Garfias; Divya Mirchandani; Dionna Scharton; John P Bilello; Zhiqiang Ku; Zhiqiang An; Birte Kalveram; Alexander N Freiberg; Vineet D Menachery; Xuping Xie; Kenneth S Plante; Scott C Weaver; Pei-Yong Shi
Journal:  Nature       Date:  2020-10-26       Impact factor: 49.962

Review 3.  Coronaviruses: an RNA proofreading machine regulates replication fidelity and diversity.

Authors:  Mark R Denison; Rachel L Graham; Eric F Donaldson; Lance D Eckerle; Ralph S Baric
Journal:  RNA Biol       Date:  2011-03-01       Impact factor: 4.652

4.  In Silico Prediction and Experimental Confirmation of HA Residues Conferring Enhanced Human Receptor Specificity of H5N1 Influenza A Viruses.

Authors:  Sonja Schmier; Ahmed Mostafa; Thomas Haarmann; Norbert Bannert; John Ziebuhr; Veljko Veljkovic; Ursula Dietrich; Stephan Pleschka
Journal:  Sci Rep       Date:  2015-06-19       Impact factor: 4.379

5.  Prediction of influenza vaccine effectiveness for the influenza season 2017/18 in the US.

Authors:  Slobodan Paessler; Veljko Veljkovic
Journal:  F1000Res       Date:  2017-11-29

6.  Identification of hemagglutinin structural domain and polymorphisms which may modulate swine H1N1 interactions with human receptor.

Authors:  Veljko Veljkovic; Henry L Niman; Sanja Glisic; Nevena Veljkovic; Vladimir Perovic; Claude P Muller
Journal:  BMC Struct Biol       Date:  2009-09-28

7.  Characterization of conserved properties of hemagglutinin of H5N1 and human influenza viruses: possible consequences for therapy and infection control.

Authors:  Veljko Veljkovic; Nevena Veljkovic; Claude P Muller; Sybille Müller; Sanja Glisic; Vladimir Perovic; Heinz Köhler
Journal:  BMC Struct Biol       Date:  2009-04-07

8.  Predicted Enhanced Human Propensity of Current Avian-Like H1N1 Swine Influenza Virus from China.

Authors:  Veljko Veljkovic; Nevena Veljkovic; Slobodan Paessler; Marco Goeijenbier; Vladimir Perovic; Sanja Glisic; Claude P Muller
Journal:  PLoS One       Date:  2016-11-09       Impact factor: 3.240

9.  Use of the informational spectrum methodology for rapid biological analysis of the novel coronavirus 2019-nCoV: prediction of potential receptor, natural reservoir, tropism and therapeutic/vaccine target.

Authors:  Veljko Veljkovic; Júlia Vergara-Alert; Joaquim Segalés; Slobodan Paessler
Journal:  F1000Res       Date:  2020-01-27

10.  A SARS-CoV-2 vaccine candidate would likely match all currently circulating variants.

Authors:  Bethany Dearlove; Eric Lewitus; Hongjun Bai; Yifan Li; Daniel B Reeves; M Gordon Joyce; Paul T Scott; Mihret F Amare; Sandhya Vasan; Nelson L Michael; Kayvon Modjarrad; Morgane Rolland
Journal:  Proc Natl Acad Sci U S A       Date:  2020-08-31       Impact factor: 12.779

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.