| Literature DB >> 36034694 |
Dana Ashoor1, Maryam Marzouq1, Khaled Trabelsi1, Sadok Chlif2, Nasser Abotalib1, Noureddine Ben Khalaf1, Ahmed R Ramadan1, M-Dahmani Fathallah1.
Abstract
In this study, we evaluated the use of a predictive computational approach for SARS-CoV-2 genetic variations analysis in improving the current variant labeling system. First, we reviewed the basis of the system developed by the World Health Organization (WHO) for the labeling of SARS-CoV-2 genetic variants and the derivative adapted by the United States Centers for Disease Control and Prevention (CDC). Both labeling systems are based on the virus' major attributes. However, we found that the labeling criteria of the SARS-CoV-2 variants derived from these attributes are not accurately defined and are used differently by the two agencies. Consequently, discrepancies exist between the labels given by WHO and the CDC to the same variants. Our observations suggest that giving the variant of concern (VOC) label to a new variant is premature and might not be appropriate. Therefore, we used a comparative computational approach to predict the effects of the mutations on the virus structure and functions of five VOCs. By linking these data to the criteria used by WHO/CDC for variant labeling, we ascertained that a predictive computational comparative approach of the genetic variations is a good way for rapid and more accurate labeling of SARS-CoV-2 variants. We propose to label all emergent variants, variant under monitoring or variant being monitored (VUM/VBM), and to carry out computational predictive studies with thorough comparison to existing variants, upon which more appropriate and informative labels can be attributed. Furthermore, harmonization of the variant labeling system would be globally beneficial to communicate about and fight the COVID-19 pandemic.Entities:
Keywords: SARS-CoV-2; computational prediction; variant of interest; variant under monitoring; variants of concern
Mesh:
Year: 2022 PMID: 36034694 PMCID: PMC9399656 DOI: 10.3389/fcimb.2022.868205
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 6.073
Figure 1SARS-CoV-2 variant attributes and derived criteria used by the World Health Organization (WHO) and the Centers for Disease Control and Prevention (CDC) for the labeling of emergent variants.
Figure 2Matrix showing the commonalities and discrepancies of the criteria used by WHO and the CDC to label SARS CoV-2 variants. The green and yellow triangles indicate the criteria used by WHO and the CDC, respectively. The red squares indicate the criteria used by both agencies for a same label. The numbers correspond to the labeling criteria displayed in .
Global spreading of SARS-CoV-2 omicron variant (B.1.1.529).
| Date | Transmissibility | Incidence | Country | Reference |
|---|---|---|---|---|
| 9–14 November, | – | 70 | First 3 |
|
| 26 November | High to very high | 8 |
| |
| 27 November | Prediction: 100-fold | 113 | 8 |
|
| 2 December | – | 390 | 31 |
|
| 3 December | Three times higher than other variants | 486 | 38 |
|
| 4 December | – | 689 | 42 |
|
| 7 December | Three to six times higher than | 959 | 42 |
|
| 21 December | Six times | 17,514 | 78 |
|
| 22 December | Six times | 20,322 | 79* |
|
*Currently, the omicron has gained global presence, and other subvariants have emerged.
Actual application of the criteria formulated by WHO and the CDC to the five major SARS-CoV-2 variants.
| Labeling Criteria | |||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Variants | |||||||||||||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
| |||||||||||||||||||||||||
|
| |||||||||||||||||||||||||
|
| |||||||||||||||||||||||||
|
| |||||||||||||||||||||||||
|
| # | # | # | # | # | ||||||||||||||||||||
Figure 3Outline of the computational approach used to predict the viral attributes of SARS-CoV-2 emerging variants. This two-step approach includes a number of computed tasks from which various predictions are retrieved. The predictions are linked to the viral attributes and the criteria that apply are used to label a given variant.
Figure 4Mapping of the nonsynonymous mutations’ characteristics of the SARS-CoV-2 four variants. A typical genomic organization of SARS-CoV-2 contains the following: 5’ end UTR; open reading frames: ORF 1a and ORF 1b; the structural genes coding for the Spike (S) protein, the Envelope (E), the Membrane (M), and the Nucleocapsid. The accessory genes such as 3a, 3b, 6, 7a, 7b, 8, 10, and 14 are distributed among the structural genes. The 3’ end UTR follows the poly (A) tail. The green, yellow, blue, purple, and red show, respectively, the synonymous mutations characteristic of the alpha (α) variant (B.1.1.7), beta (β) variant (B.1.351), omicron (o) variant (B.1.1.529), gamma (γ) variant (P.1), and delta (δ) variant (B.1.617.2). The (-) represents the deletion, (°) represents the insertion, (*) represents the stop codon, magenta triangles indicate variations in the receptor-binding domain (RBD), and cyan triangles denote variations in the receptor-binding motif (RBM). The NCBI reference sequence for the surface glycoprotein of SARS-CoV-2 is YP_009724390.1.
Figure 5Representation of the surface models of SARS CoV-2 variant S spike protein (monomer). Side views (upper and middle rows) and top view (third row). Color coding is as follows: mutations (red), S1 subunit (cyan), S2 subunit (green), furin cleavage site (yellow), fusion peptides FP1 and FP2 (black), and the arrows show the TMPRSS2 cleavage site (orange).
Figure 6Percentage of solvent-exposed residues in the S protein of the five SARS-CoV-2 VOCs. Solvent-exposed residues at the surface of the S protein are potential epitopes. Variation of the percentage of solvent-exposed residues provide a useful hint for the prediction of immunogenicity changes between variants.
Figure 7This figure represents the 3D models generated for the α, β, γ, δ, and O variants, their structural analysis, and the quality assessment by superimposition with chain B of the crystallized model PBD 6VXX. (A) Superimposition RMSD values. (B) Calculated common contact percentages and Tm scores.
Comparison of contact map percentage (red) and TM-align (blue) scoring between the different SARS CoV-2 VOCs. The arrowheads denote the variant used as a reference when calculating the data.
| TM-align | |||||||
|---|---|---|---|---|---|---|---|
|
| |||||||
| Contact map % |
|
|
|
|
|
| |
|
| 0.99321 | 0.99811 | 0.99374 | 0.99392 | |||
|
| 96.1 | 0.99361 | 0.98905 | 0.99169 | |||
|
| 98.4 | 96.9 | 0.99547 | 0.99155 | |||
|
| 95.8 | 94.8 | 97.1 | 0.99041 | |||
|
| 96.1 | 94.6 | 96.0 | 94.6 | |||
Comparison of the mutation profiles of five SARS-CoV-2 VOCs’ S protein in the contact residues with the ACE2 receptor.
| 6LZGContact residues |
|
|
|
|
| Single mutations’ ΔΔG | Effect of single mutation |
|---|---|---|---|---|---|---|---|
| K417 | K417N | K417N | 0.75 | Destabilizing | |||
| K417T | 0.18 | Destabilizing | |||||
| G446 | G446S | -0.18 | Stabilizing | ||||
| G447 | |||||||
| Y449 | |||||||
| – | L452R | -0.57 | Stabilizing | ||||
| Y453 | |||||||
| L455 | |||||||
| F456 | |||||||
| Y473 | |||||||
| A475 | |||||||
| G476 | |||||||
| S477 | S477N | -0.15 | Stabilizing | ||||
| – | T478K | 0.19 | Destabilizing | ||||
| E484 | E484K | E484K | E484A | 0.17 | Destabilizing | ||
| G485 | |||||||
| F486 | |||||||
| N487 | |||||||
| Y489 | |||||||
| F490 | |||||||
| L492 | |||||||
| Q493 | Q493R | 0.25 | Destabilizing | ||||
| Y495 | |||||||
| G496 | G496S | 1.2 | Destabilizing | ||||
| F497 | |||||||
| Q498 | Q498R | 2.34 | Destabilizing | ||||
| P499 | |||||||
| T500 | |||||||
| N501 | N501Y | N501Y | N501Y | N501Y | 1.53 | Destabilizing | |
| G502 | |||||||
| V503 | |||||||
| Y505 | Y505H | 1.33 | Destabilizing | ||||
| Q506 | |||||||
|
| 1.53 | 1.79 | 1.07 | -0.33 | 0.18 | NA | NA |
|
| Destabilizing | Destabilizing | Destabilizing | Stabilizing | Destabilizing | NA | NA |
Positive and negative signs correspond respectively to destabilizing (decreases in binding affinity) and stabilizing mutations (increases in binding affinity).
S protein/ACE2 complex contact residues’ interactions pattern of different SARS-CoV-2 variants.
| Spike RBD residue(PDB 6LZG-version 2.4) | Interaction residue on ACE2 isoform 1 | SARS CoV-2 variants | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
| Lys417 | 1X Asp30(P) | 1X Asp30(P) | Missing | Missing | 1X Asp30(P) | Missing |
| Tyr449 | 1X Gln42(P) | Missing | Missing | Missing | Missing | Missing |
| Tyr453 | 1X His34(H) | 1X His34(H) | 1X His34(P) | 1X His34(P) | 1X His34(H) | 1X His34(P) |
| Leu455 | 1X His34(H) | 1X His34(H) | 3X His34(H) | 3X His34(H) | 1X His34(H) | 3X His34(H) |
| Phe456 | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) |
| Ala475 | 1X Ser19(P) | 1X Ser19(P) | 1X Ser19(P) | 1X Ser19(P) | 1X Ser19(P) | 1X Ser19(P) |
| Gly476 | 1X Ser19(H) | 1X Ser19(H) | 1X Ser19(H) | 1X Ser19(H) | 1X Ser19(H) | 1X Ser19(H) |
| Glu484 | Salt Lys31 | Salt Lys31 | ||||
| Phe486 | 1X Met82(H) | 1X Met82(H) | 1X Met82(H) | 1X Met82(H) | 1X Met82(H) | 1X Met82(H) |
| Asn487 | 1X Gln24(P) | 1X Gln24(P) | 1X Gln24(P) | 1X Gln24(P) | 1X Gln24(P) | 1X Gln24(P) |
| Tyr489 | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) | 1X Thr27(H) |
| Gln493 | 1X His34(H) | 1X Glu35(P) | 1X Glu35(P) | 1X Glu35(P) | 1X Glu35(P) | 1X His34(P) |
| Gly496 | 1X Lys353(P) | Missing | Missing | Missing | 1X Lys353(P) | 1X Lys353(P) |
| Gln498 | 1X Gln42(P) | 1X Gln42(P) | 1X Gln42(P) | 1X Gln42(P) | 1X Gln42(P) | Missing |
| Thr500 | 1X Tyr41(P) | 1X Tyr41(P) | 1X Tyr41(P) | 1X Tyr41(P) | 1X Tyr41(P) | 1X Tyr41(P) |
| Asn501 | 3X Tyr41(H) | 5X Tyr41(H) | 1X Asp38(P) | 1X Asp38(P) | 4X Tyr41(H) | 1X Lys353(P) |
| Gly502 | 1X Lys353(P) | 1X Lys353(P) | 1X Lys353(P) | 1X Lys353(P) | 1X Lys353(P) | 1X Lys353(P) |
| Tyr505 | 5X Lys353(H) | 4X Lys353(H) | 1X Glu37(P) | 1X Glu37(P) | 1X Glu37(P) | 4X Lys353(H) |
|
| 10P/56H | 8P/1 salt/63H | 10P/83H | 10P/80H | 10P/1 salt/72H | 10P/1 salt/85H |
|
| 1.53 | 1.79 | 1.07 | −0.33 | 0.18 | |
|
|
|
|
|
|
| |
Polar interactions (P), hydrophobic interactions (H), Missing interactions are highlighted in red, new interactions highlighted in blue, salt bridges are highlighted in green. SARS-CoV-2 Wuhan strain sequence was used as reference sequence.