| Literature DB >> 34696404 |
Johan Lennerstrand1, Navaneethan Palanisamy2.
Abstract
Several vaccines with varying efficacies have been developed and are currently administered globally to minimize the spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Despite having an RNA-dependent RNA polymerase with a proofreading activity, new variants of SARS-CoV-2 are on the rise periodically. Some of the mutations in these variants, especially mutations on the spike protein, aid the virus in transmission, infectivity and host immune evasion. Further, these mutations also reduce the effectiveness of some of the current vaccines and monoclonal antibodies (mAbs). In the present study, using the available 984,769 SARS-CoV-2 nucleotide sequences on the NCBI database from the end of 2019 till 28 July 2021, we have estimated the global prevalence of so-called 'adaptive mutations' and 'mutations identified in the prolonged infections', in the receptor-binding domain (RBD) of the spike (S) protein. Irrespective of the geographical region, in the case of the adaptive mutations, N501Y (48.38%) was found to be the dominant mutation followed by L452R (17.52%), T478K (14.31%), E484K (4.69%), S477N (3.29%), K417T (1.64%), N439K (0.7%) and S494P (0.7%). Other mutations were found to be less prevalent (less than 0.7%). Since the last two months, there has been a massive increase of L452R and T478K mutations (delta variant) in certain areas. In the case of prolonged infections' mutations (long-term SARS-CoV-2 infections), V483A (0.009%) was found to be dominant followed by Q493R (0.009%), while other mutations were found in less than 0.007% of the studied sequences. The data obtained in this study will aid in the development of better infection control policies, thereby curbing the spread of this virus.Entities:
Keywords: COVID-19; SARS-CoV-2; coronavirus; prevalence; spike
Mesh:
Substances:
Year: 2021 PMID: 34696404 PMCID: PMC8538289 DOI: 10.3390/v13101974
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
List of adaptive and prolonged infections’ mutations in the RBD of the SARS-CoV-2 S protein studied and their resistance to mAbs that are in emergency use and clinical trials ⁑.
| Type | Mutation | Resistance to mAbs # (Fold Change Compared to SARS-CoV-2 Wuhan-Hu-1 Isolate) |
|---|---|---|
| Adaptive | R346K | C135 (≥100-fold) PVH, PVV [ |
| K417N | casirivimab (43-fold) PVM [ | |
| K417T | data not available | |
| N439K | C135 (≥100-fold) PVH [ | |
| L452Q | data not available | |
| L452R | bamlanivimab (≥100-fold) PVV [ | |
| Y453F | casirivimab (≥100-fold) PVV [ | |
| S477N | no resistance to current mAbs | |
| T478K | no resistance to current mAbs | |
| T478R | data not available | |
| E484K | bamlanivimab (≥100-fold) PVV [ | |
| E484Q | bamlanivimab (≥100-fold) PVV [ | |
| F490S | bamlanivimab (≥100-fold) PVV [ | |
| S494P | bamlanivimab (>100-fold) PVV [ | |
| N501Y | etesevimab (20-fold) PVM [ | |
| Prolonged infections | T415A | data not available |
| T470N | data not available | |
| V483A | bamlanivimab (48-fold) PVV [ | |
| E484A | C144 (≥100-fold) PVH, PVV [ | |
| F486I | data not available | |
| Q493K | C144 (≥100-fold) PVH, PVV [ | |
| Q493R | bamlanivimab (≥100-fold) PVV [ |
⁑ Residue positions and resistance data were taken from Stanford University’s coronavirus antiviral and resistance database (CoV-RDB) using the following link: https://covdb.stanford.edu/ (accessed on 29 May 2021) [45]. # Resistance ≤3.4-fold is not shown here. PVH = pseudovirus (HIV), PVM = pseudovirus (MLV), PVV = pseudovirus (VSV).
Figure 1Global prevalence of adaptive and prolonged infections’ mutations in the receptor-binding domain (RBD) of the SARS-CoV-2 spike (S) protein until 29 May 2021. (A) SARS-CoV-2 virion (left) and schematics of the S protein (right). The S protein consists of two subunits, namely S1 and S2. The S1 subunit comprises of N-terminal domain (NTD) and receptor-binding domain (RBD). The receptor-binding motif (RBM) is the region within RBD that binds to the angiotensin-converting enzyme 2 (ACE2) receptor. Numbers in black indicate the residue index of the S protein in the SARS-CoV-2 reference genome (NCBI accession number: NC_045512.2). ROI = region of interest. Adaptive mutations are represented in red, while prolonged infections’ mutations are represented in blue. (B) Bar graph showing the continent/country-wise distribution of SARS-CoV-2 sequences. For the percentage calculation, the total number of sequences (n = 524,124) was used. € = Europe without Switzerland and the UK. ‡ = North America without the USA. (C) Bar graph showing the percentages (%) of the respective adaptive mutations. (D) Bar graph showing the percentages (%) of the respective prolonged infections’ mutations.
Continent/country-wise prevalence of adaptive mutations in the receptor-binding domain of the SARS-CoV-2 spike protein until 29 May 2021.
| Continent/Country | No. of Studied Sequences | Mutation (%) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R346K | K417N | K417T | N439K | L452Q | L452R | Y453F | S477N | T478K | T478R | E484K | E484Q | F490S | S494P | N501Y | ||
|
| 1728 | 0.347 | 0.289 | 5.845 | 0.058 | 1.678 | 0.058 | 4.456 | 0.174 | 0.058 | 9.028 | |||||
| Asia | 3703 | 0.135 | 1.242 | 0.351 | 0.270 | 0.027 | 1.836 | 0.513 | 1.053 | 1.782 | 0.702 | 0.189 | 7.399 | |||
| Europe € | 6092 | 0.033 | 0.509 | 0.312 | 1.346 | 0.575 | 0.230 | 5.368 | 0.066 | 1.625 | 0.033 | 0.657 | 26.133 | |||
| Switzerland | 18,234 | 0.362 | 0.516 | 0.126 | 4.442 | 0.005 | 1.475 | 0.016 | 21.213 | 0.801 | 0.856 | 0.038 | 0.033 | 0.104 | 17.906 | |
| UK | 178,878 | 0.003 | 0.084 | 0.004 | 1.348 | 0.002 | 0.070 | 0.004 | 1.094 | 0.010 | 0.007 | 0.214 | 0.013 | 0.051 | 0.323 | 51.055 |
| North America ‡ | 99 | 2.020 | 9.091 | 1.010 | ||||||||||||
| USA | 300,280 | 0.041 | 0.424 | 3.147 | 0.034 | 0.119 | 11.556 | 0.010 | 2.836 | 3.067 | 0.040 | 9.005 | 0.089 | 0.162 | 1.721 | 39.631 |
| Oceania | 13,299 | 75.848 | 0.008 | 0.278 | ||||||||||||
| South America | 603 | 0.663 | 4.312 | 1.990 | 4.312 | 0.995 | ||||||||||
€ = Europe excluding Switzerland and the UK; ‡ = North America excluding the USA; empty cell = 4 or more decimal places or 0%.
Continent/country-wise prevalence of prolonged infections’ mutations in the receptor-binding domain of the SARS-CoV-2 spike protein until 29 May 2021.
| Continent/Country | No. of Studied Sequences | Mutation (%) | ||||||
|---|---|---|---|---|---|---|---|---|
| T415A | T470N | V483A | E484A | F486I | Q493K | Q493R | ||
|
| 1728 | |||||||
| Asia | 3703 | 0.027 | ||||||
| Europe € | 6092 | 0.033 | 0.016 | 0.016 | ||||
| Switzerland | 18,234 | 0.005 | ||||||
| UK | 178,878 | 0.001 | 0.001 | 0.006 | 0.002 | 0.007 | 0.006 | |
| North America ‡ | 99 | |||||||
| USA | 300,280 | 0.007 | 0.001 | 0.021 | 0.001 | 0.005 | 0.008 | |
| Oceania | 13,299 | |||||||
| South America | 603 | |||||||
€ = Europe excluding Switzerland and the UK; ‡ = North America excluding the USA; empty cell = 4 or more decimal places or 0%.
Figure 2Global prevalence of adaptive and prolonged infections’ mutations in the receptor-binding domain (RBD) of the SARS-CoV-2 spike (S) protein from 30 May 2021 till 28 July 2021. (A) Bar graph showing the continent/country-wise distribution of SARS-CoV-2 sequences. For the percentage calculation, the total number of sequences (n = 460,645) was used. § = Europe without Germany, Switzerland and the UK. ‡ = North America without the USA. (B) Bar graph showing the percentages (%) of the respective adaptive mutations. (C) Bar graph showing the percentages (%) of the respective prolonged infections’ mutations.
Continent/country-wise prevalence of adaptive mutations in the receptor-binding domain of the SARS-CoV-2 Scheme 30 May 2021 till 28 July 2021.
| Continent/ | No. of Studied Sequences | Mutation (%) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R346K | K417N | K417T | N439K | L452Q | L452R | Y453F | S477N | T478K | T478R | E484K | E484Q | F490S | S494P | N501Y | ||
|
| 271 | 11.808 | 11.439 | 1.107 | 20.664 | 0.369 | 0.738 | 3.321 | 49.815 | |||||||
| Asia | 1354 | 7.090 | 0.148 | 0.074 | 57.903 | 0.148 | 50.591 | 0.074 | 6.499 | 6.721 | 0.074 | 23.560 | ||||
| Europe § | 7400 | 0.014 | 0.311 | 0.041 | 3.608 | 0.027 | 0.122 | 1.041 | 0.027 | 0.554 | 0.014 | 0.041 | 0.027 | 17.270 | ||
| Germany | 120,429 | 0.033 | 1.476 | 0.453 | 2.148 | 0.121 | 3.310 | 0.002 | 2.352 | 2.819 | 0.011 | 3.582 | 0.134 | 0.542 | 0.174 | 79.022 |
| Switzerland | 17,578 | 0.233 | 0.688 | 0.484 | 2.463 | 0.017 | 2.890 | 9.148 | 2.020 | 0.006 | 1.911 | 0.091 | 0.307 | 0.165 | 58.095 | |
| UK | 229,321 | 0.007 | 0.256 | 0.065 | 0.090 | 0.003 | 49.075 | 0.025 | 48.633 | 0.666 | 0.171 | 0.112 | 0.054 | 48.698 | ||
| North America ‡ | 66 | 3.030 | 1.515 | 6.061 | 1.515 | |||||||||||
| USA | 83,122 | 0.066 | 0.929 | 6.950 | 0.011 | 0.217 | 23.336 | 0.018 | 3.621 | 18.640 | 0.017 | 14.132 | 0.109 | 0.393 | 0.829 | 50.363 |
| Oceania | 634 | 0.946 | ||||||||||||||
| South America | 293 | 37.201 | 0.341 | 1.024 | 62.457 | 1.024 | 0.341 | 39.249 | ||||||||
§ = Europe excluding Germany, Switzerland and the UK; ‡ = North America excluding the USA; empty cell = 4 or more decimal places or 0%. Prevalence of prolonged infections’ mutations from 30 May 2021 till 28 July 2021.
Continent/country-wise prevalence of prolonged infections’ mutations in the receptor-binding domain of the SARS-CoV-2 spike protein from 30 May 2021 till 28 July 2021.
| Continent/ | No. of Studied Sequences | Mutation (%) | ||||||
|---|---|---|---|---|---|---|---|---|
| T415A | T470N | V483A | E484A | F486I | Q493K | Q493R | ||
|
| 271 | 1.107 | ||||||
| Asia | 1354 | |||||||
| Europe § | 7400 | 0.068 | ||||||
| Germany | 120,429 | 0.001 | 0.004 | 0.002 | 0.002 | 0.003 | ||
| Switzerland | 17,578 | 0.006 | 0.057 | 0.011 | ||||
| UK | 229,321 | 0.002 | 0.001 | 0.003 | 0.010 | 0.010 | ||
| North America ‡ | 66 | |||||||
| USA | 83,122 | 0.002 | 0.010 | 0.006 | 0.001 | 0.006 | 0.029 | |
| Oceania | 634 | |||||||
| South America | 293 | |||||||
§ = Europe excluding Germany, Switzerland and the UK; ‡ = North America excluding the USA; empty cell = 4 or more decimal places or 0%.