| Literature DB >> 34281387 |
Jobin John Jacob1, Karthick Vasudevan1,2, Agila Kumari Pragasam1, Karthik Gunasekaran3, Balaji Veeraraghavan1, Ankur Mutreja4.
Abstract
The currently ongoing COVID-19 pandemic caused by SARS-CoV-2 has accounted for millions of infections and deaths across the globe. Genome sequences of SARS-CoV-2 are being published daily in public databases and the availability of these genome data sets has allowed unprecedented access to the mutational patterns of SARS-CoV-2 evolution. We made use of the same genomic information for conducting phylogenetic analysis and identifying lineage-specific mutations. The catalogued lineage-defining mutations were analyzed for their stabilizing or destabilizing impact on viral proteins. We recorded persistence of D614G, S477N, A222V, and V1176F variants and a global expansion of the PANGOLIN variant B.1. In addition, a retention of Q57H (B.1.X), R203K/G204R (B.1.1.X), T85I (B.1.2-B.1.3), G15S+T428I (C.X), and I120F (D.X) variations was observed. Overall, we recorded a striking balance between stabilizing and destabilizing mutations, therefore leading to well-maintained protein structures. With selection pressures in the form of newly developed vaccines and therapeutics to mount in the coming months, the task of mapping viral mutations and recording their impact on key viral proteins should be crucial to preemptively catch any escape mechanism for which SARS-CoV-2 may evolve. IMPORTANCE Since its initial isolation in Wuhan, China, large numbers of SARS-CoV-2 genome sequences have been shared in publicly accessible repositories, thus enabling scientists to do detailed evolutionary analysis. We investigated the evolutionarily associated mutational diversity overlaid on the major phylogenetic lineages circulating globally, using 513 representative genomes. We detailed the phylogenetic persistence of key variants facilitating global expansion of the PANGOLIN variant B.1, including the recent, fast-expanding, B.1.1.7 lineage. The stabilizing or destabilizing impact of the catalogued lineage-defining mutations on viral proteins indicates their possible involvement in balancing the protein function and structure. A clear understanding of this mutational profile is of high clinical significance to catch any vaccine escape mechanism, as the same proteins make crucial components of vaccines that have recently been approved or are in development. In this vein, our study provides an imperative framework and baseline data upon which further analysis could be built as newer variants of SARS-CoV-2 continue to appear.Entities:
Keywords: SARS-CoV-2; evolution; mutation; stability; vaccine
Mesh:
Substances:
Year: 2021 PMID: 34281387 PMCID: PMC8406184 DOI: 10.1128/mBio.01188-21
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1Maximum likelihood phylogenetic tree inferred from 513 SARS-CoV-2 genomes. The tree was constructed using multiple genome sequence alignment (MAFFT) by mapping against the Wuhan-Hu-1 strain (accession NC_045512). Tips are colored with the major lineages assigned by PANGOLIN. Respective lineages assigned by GISAID and origin of sequence are labeled as color strips. The scale bar indicates the distance corresponding to substitution per site.
FIG 2Schematic representation of the major evolutionary events/amino acid substitutions that gave rise to SARS-CoV-2 variants in sequential order.
Lineage-defining SNPs and their impact on protein structures
| Protein | Lineage-defining mutation | ΔΔS in kcal mol−1 K−1 | Change in dynamics | ΔΔG in kcal mol−1 (DUET) | ΔΔG in kcal mol−1 (SDM) | Stability |
|---|---|---|---|---|---|---|
| Nsp12 | P323L | −0.33 | Decreasing flexibility | 0.43 (stabilizing) | 1.57 (stabilizing) | Stabilizing |
| Spike | D614G | −0.01 | Decreasing flexibility | 0.46 (stabilizing) | 2.33 (stabilizing) | Stabilizing |
| Orf3a | G251V | −0.39 | Decreasing flexibility | −0.6 (destabilizing) | −2.19 (destabilizing) | Destabilizing |
| Q57H | 0.44 | Increasing flexibility | −1.25 (destabilizing) | 0.87 (stabilizing) | Inconclusive | |
| Orf8 | L84S | 0.30 | Increasing flexibility | −1.41 (destabilizing) | −1.41 (destabilizing) | Destabilizing |
| Nsp2 | T85I | 0.07 | Increasing flexibility | 0.54 (stabilizing) | 1.93 (stabilizing) | Stabilizing |
| I120F | −1.30 | Decreasing flexibility | −1.04 (destabilizing) | −0.21 (destabilizing) | Destabilizing | |
| Nsp6 | L37F | −0.29 | Decreasing flexibility | −0.72 (Destabilizing) | −0.04 (neutral) | Inconclusive |
| Nucleocapsid | R203K | −0.98 | Decreasing flexibility | −1.57 (destabilizing) | −0.48 (destabilizing) | Destabilizing |
| protein (N) | G204R | −0.16 | Decreasing flexibility | −1.06 (destabilizing) | −1.95 (destabilizing) | Destabilizing |
| Nsp5 | G15S | −0.31 | Decreasing flexibility | −0.98 (destabilizing) | −0.79 (destabilizing) | Destabilizing |
Predicted effect of protein stability in the presence of amino acid mutations in the SARS-COV-2 genomes
| Protein | Mutation | ΔΔG SDM (kcal/mol) | ΔΔG DUET (kcal/mol) | Stability |
|---|---|---|---|---|
| Spike | A222V | 0.95 | 0.91 | Stabilizing |
| S477N | 0.31 | 0.02 | Stabilizing | |
| L18F | −0.801 | −0.46 | Destabilizing | |
| N439K | −0.29 | 0.3 | Inconclusive | |
| L5F | −0.801 | −0.1 | Destabilizing | |
| W1214G | −1.913 | −0.28 | Destabilizing | |
| R21I | −0.856 | 0.46 | Inconclusive | |
| A262S | −2.13 | −1.66 | Destabilizing | |
| S98F | 1.23 | −0.58 | Inconclusive | |
| D1163Y | 0.21 | 0.26 | Stabilizing | |
| G1167V | −0.58 | −2.25 | Destabilizing | |
| D936Y | −0.13 | −0.32 | Destabilizing | |
| P272L | 2.12 | 0.36 | Stabilizing | |
| D80Y | 0.77 | −3.08 | Inconclusive | |
| E583D | −0.69 | −0.86 | Destabilizing | |
| P1263L | −0.231 | 1.29 | Inconclusive | |
| K1073N | −0.48 | −0.45 | Destabilizing | |
| D253G | −0.19 | 0.04 | Inconclusive | |
| T723I | 0.64 | 0.21 | Stabilizing | |
| A688V | −0.18 | 0.03 | Inconclusive | |
| A626S | −2.6 | −1.66 | Destabilizing | |
| L54F | −0.61 | −1.33 | Destabilizing | |
| H655Y | 0.6 | 1.44 | Stabilizing | |
| G769V | 0.6 | 0.14 | Stabilizing | |
| L176F | 0.12 | −0.95 | Inconclusive | |
| G1124V | −1.52 | −0.14 | Destabilizing | |
| V622F | −0.2 | −0.67 | Destabilizing | |
| S255F | 0.94 | −0.8 | Inconclusive | |
| H49Y | 0.4 | 1.14 | Stabilizing | |
| D839Y | −0.389 | −1.08 | Destabilizing | |
| V1176F | −0.92 | −0.55 | Destabilizing | |
| D215H | 0.8 | 1.35 | Stabilizing | |
| H146Y | 1.139 | −0.29 | Inconclusive | |
| A879S | −2.57 | −1.69 | Destabilizing | |
| Q677H | 0.98 | −0.48 | Inconclusive | |
| D1084Y | −0.43 | −0.03 | Destabilizing | |
| V1068F | −1.05 | −1.15 | Destabilizing | |
| P25S | −0.392 | 0.93 | Inconclusive | |
| A520S | −1.23 | −0.15 | Destabilizing | |
| G261V | 0.16 | 0.16 | Stabilizing | |
| D574Y | −0.56 | −0.45 | Destabilizing | |
| T29I | 0.48 | 0.51 | Stabilizing | |
| Y453F | −0.17 | −0.48 | Destabilizing | |
| N501Y | 0.41 | −0.42 | Inconclusive | |
| S939F | 0.76 | −0.71 | Inconclusive | |
| T95I | 1.91 | 0.37 | Stabilizing | |
| Q675H | 0.8 | −0.4 | Inconclusive | |
| Nucleocapsid | A220V | −0.51 | −1.13 | Destabilizing |
| S194L | 1.15 | −0.02 | Inconclusive | |
| D103Y | 1.45 | 0.55 | Stabilizing | |
| P13L | 0.84 | 0.23 | Stabilizing | |
| S197L | 1.27 | 0.26 | Stabilizing | |
| A398V | −0.98 | −1.03 | Destabilizing | |
| P199L | 1.27 | −0.21 | Inconclusive | |
| M234I | 0.69 | 0.41 | Stabilizing | |
| S188L | 1.21 | −0.06 | Inconclusive | |
| S183Y | 0.05 | −0.71 | Inconclusive | |
| Membrane | T175M | 0.69 | −0.26 | Inconclusive |
| Envelope | P71S | −0.03 | −2.35 | Destabilizing |
FIG 3Schematic representation of SARS-CoV-2 genome organization, the major amino acid substitutions, and stability of amino acid changes. Stabilizing mutations are colored green, destabilizing mutations are colored red, and mutations that neither stabilize nor destabilize are colored yellow.
Impact of independent, double and triple mutations in the spike protein
| Protein | Combinations | Mutations | ΔΔG (pred) | C (pred) |
|---|---|---|---|---|
| Spike | Independent | D614G | 0.422 | 0.892 |
| Double | D614G+S477N | 0.228 | 0.896 | |
| D614G+A222V | 0.195 | 0.889 | ||
| Triple | D614G+S477N+A222V | 0.129 | 0.129 |