| Literature DB >> 35862764 |
Jonathan T Lee1, Qingyi Yang2, Alexey Gribenko1, B Scott Perrin3, Yuao Zhu1, Rhonda Cardin1, Paul A Liberator1, Annaliesa S Anderson1, Li Hao1.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) continues to represent a global health emergency as a highly transmissible, airborne virus. An important coronaviral drug target for treatment of COVID-19 is the conserved main protease (Mpro). Nirmatrelvir is a potent Mpro inhibitor and the antiviral component of Paxlovid. The significant viral sequencing effort during the ongoing COVID-19 pandemic represented a unique opportunity to assess potential nirmatrelvir escape mutations from emerging variants of SARS-CoV-2. To establish the baseline mutational landscape of Mpro prior to the introduction of Mpro inhibitors, Mpro sequences and its cleavage junction regions were retrieved from ~4,892,000 high-quality SARS-CoV-2 genomes in the open-access Global Initiative on Sharing Avian Influenza Data (GISAID) database. Any mutations identified from comparison to the reference sequence (Wuhan-Hu-1) were catalogued and analyzed. Mutations at sites key to nirmatrelvir binding and protease functionality (e.g., dimerization sites) were still rare. Structural comparison of Mpro also showed conservation of key nirmatrelvir contact residues across the extended Coronaviridae family (α-, β-, and γ-coronaviruses). Additionally, we showed that over time, the SARS-CoV-2 Mpro enzyme remained under purifying selection and was highly conserved relative to the spike protein. Now, with the emergency use authorization (EUA) of Paxlovid and its expected widespread use across the globe, it is essential to continue large-scale genomic surveillance of SARS-CoV-2 Mpro evolution. This study establishes a robust analysis framework for monitoring emergent mutations in millions of virus isolates, with the goal of identifying potential resistance to present and future SARS-CoV-2 antivirals. IMPORTANCE The recent authorization of oral severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) antivirals, such as Paxlovid, has ushered in a new era of the COVID-19 pandemic. The emergence of new variants, as well as the selective pressure imposed by antiviral drugs themselves, raises concern for potential escape mutations in key drug binding motifs. To determine the potential emergence of antiviral resistance in globally circulating isolates and its implications for the clinical response to the COVID-19 pandemic, sequencing of SARS-CoV-2 viral isolates before, during, and after the introduction of new antiviral treatments is critical. The infrastructure built herein for active genetic surveillance of Mpro evolution and emergent mutations will play an important role in assessing potential antiviral resistance as the pandemic progresses and Mpro inhibitors are introduced. We anticipate our framework to be the starting point in a larger effort for global monitoring of the SARS-CoV-2 Mpro mutational landscape.Entities:
Keywords: 3CLpro; Mpro; Paxlovid; SARS-CoV-2; mutation; nirmatrelvir; purifying selection; surveillance
Mesh:
Substances:
Year: 2022 PMID: 35862764 PMCID: PMC9426535 DOI: 10.1128/mbio.00869-22
Source DB: PubMed Journal: mBio Impact factor: 7.786
FIG 1Active site conservation of coronavirus (CoV) main proteases. (A) Sequence alignment of the 26 binding site amino acids. The key amino acids with relative positions (P) are indicated by color-coded arrows based on their interaction with the inhibitor, nirmatrelvir. (B) severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Mpro-binding pocket of nirmatrelvir. The pocket surface is colored based on the inhibitor’s interaction shown in panel A.
FIG 2Comparison of structure and sequence identity across 12 CoV main proteases. (A) Superposition of 12 CoV main proteases based on the 26-amino acid backbone heavy atoms at the active site. The proteases are represented by colored lines, with β-CoV proteases in yellow, α-CoV proteases in blue, and γ-CoV protease in magenta. The complete list of CoV proteases can be found in Table S1. (B) Percent sequence identity, similarity, and root mean square deviation (RMSD) (Cα, alpha-Carbon) of 26 amino acids at the nirmatrelvir-binding site for β-CoVs, α-CoVs, and IBV-CoV (γ-CoV). Identity and similarity values range from 50 to 100, and RMSD (Cα) values range from 0.30 to 1.02 in their respective color-mapping scales.
FIG 3Dynamic change in amino acid mutation rate of Mpro compared to Spike protein (S) and RNA-dependent RNA polymerase (RdRp). (A) Average amino acid changes per residue in Mpro, S protein, and RdRp among isolates collected from January 2020 through January 2022. (B) Relative distribution of variants of concern (VOCs)/variants of interest (VOIs) based on collection date. The rapid rise in amino acid changes found in S protein and Mpro near the end of 2021 corresponds to the emergence and takeover of Omicron.
FIG 4Prevalent mutations in Mpro and their position relative to nirmatrelvir binding. Only P132H, characteristic of the Omicron variant, exceeds 100,000 cases, and no residues interact with nirmatrelvir (shown in red). The full geographic and lineage breakdown of these mutations can be found in Table S2.
FIG 5Mpro mutations within VOC/VOI populations. (A) The five most prevalent sequences for each lineage are shown as colored bars (blue, gold, red, purple, and green), with the cumulative remaining sequences are in gray. The most prevalent sequence (blue) corresponds to the Wuhan-Hu-1 sequence (wild type [WT]) and is found in all but three lineages. For these remaining lineages (Omicron, Lambda, and Beta), each characteristic nonsynonymous substitution is assigned a pattern: P132H (stripes), G15S (diamonds), and K90R (squares). (B) Relative mutation frequency among Delta variant isolates. The positions of the four most prevalent mutation sites found in this variant (K88, K90, I259, and A260) are shown on the protein structure (WT). (C) Relative mutation frequency among Omicron variant isolates. The positions of the three most prevalent mutation sites (K90, P132, and T169) are shown on the protein structure.
FIG 6Mpro mutation breakdown at nirmatrelvir contact and catalytic residues. (A) Mutations identified at residues directly interacting with nirmatrelvir and/or substrate peptide. (B) Three-dimensional structural model of Mpro (PDB ID 7RFS), with residues from panel A highlighted in “stick” representation and shown in individual colors. The protein backbone is shown in ribbon representation. AA, amino acid. Stop codons are donoted as (*).
Mutation breakdown at Mpro dimerization interface residues
| Residue position | Reference AA | Mutations | No. of subjects | No. of countries | No. of lineages | Countries | Lineages | Characteristics |
|---|---|---|---|---|---|---|---|---|
| 1 | Ser (S) | S1C, S1G, S1N | 135 | 4 | 6 | UK (90.37%), Australia (6.67%) | B.1.617.2 (88.89%), D.2 (6.67%) | Side chain hydrogen bond to the side chain of E166 |
| 4 | Arg (R) | R4K, R4I, R4S, R4G | 593 | 27 | 47 | USA (34.74%), Poland (20.24%), Germany (11.3%), UK (7.93%), Sweden (6.58%) | B.1.617.2 (20.41%), AY.122 (18.03%), AY.100 (10.71%), AY.25.1 (6.8%), B.1.1.7 (5.95%), B.1.177 (5.78%) | Salt bridge to E290 |
| 6 | Met (M) | M6L, M6I, M6T, M6V, M6R | 158 | 18 | 35 | USA (39.24%), UK (18.99%), France (18.35%), Switzerland (5.06%) | B.1 (9.49%), B.1.160 (9.49%), BA.1 (9.49%), AY.4 (8.86%), B.1.1.7 (8.23%), B.1.617.2 (6.96%), AY.44 (6.33%), AY.12 (5.7%) | van der Waals/hydrophobic interaction with the side chain of Y126 |
| 7 | Ala (A) | A7V, A7T, A7S, A7G, A7P | 1,053 | 42 | 79 | USA (63.06%), Mexico (6.74%), UK (6.55%) | AY.25 (21.51%), AY.44 (14.44%), B.1.617.2 (14.44%), B.1.632 (6.31%), AY.4 (5.83%), B.1.1.7 (5.26%) | van der Waals/hydrophobic interaction with the side chain of V125 |
| 9 | Pro (P) | P9S | 45 | 2 | 3 | South Korea (97.78%) | B.1.497 (95.56%) | van der Waals/hydrophobic interaction with the side chain of P122 |
| 12 | Lys (K) | K12R, K12N | 338 | 13 | 23 | USA (71.01%), UK (21.3%) | B.1.617.2 (45.24%), AY.103 (20.54%), AY.4 (11.61%), B.1.1.7 (8.63%) | Electrostatic interaction with the side chain of E14 |
| 14 | Glu (E) | E14D, E14* | 6 | 3 | 4 | USA (66.67%), Sweden (16.67%), UK (16.67%) | B.1.617.2 (50.0%), AY.100 (16.67%), AY.4 (16.67%), AY.9.1 (16.67%) | Side chain hydrogen bond to backbone amide of G11; electrostatic interaction with the side chain of K12 |
| 122 | Pro (P) | P122S, P122L, P122I, P122A | 121 | 20 | 31 | UK (41.32%), USA (25.62%), France (9.92%) | B.1.617.2 (23.14%), AY.4 (20.66%), AY.118 (6.61%), B.1.1.7 (5.79%) | van der Waals/hydrophobic interaction with the side chain of P9 |
| 125 | Val (V) | V125I, V125A, V125L | 361 | 26 | 40 | UK (34.9%), USA (29.36%), Canada (9.42%), Germany (6.65%) | AY.4 (24.93%), B.1.617.2 (21.05%), AY.25 (9.42%), AY.98 (6.37%) | van der Waals/hydrophobic interaction with the side chain of A7 |
| 126 | Tyr (Y) | Y126C, Y126P | 4 | 2 | 4 | Turkey (50.0%), USA (50.0%) | B.1.1.7 (25.0%), B.1.177.86 (25.0%), B.1.351 (25.0%), B.1.400 (25.0%) | van der Waals/hydrophobic interaction with the side chain of M6 |
| 139 | Ser (S) | S139A, S139T | 6 | 2 | 2 | UK (66.67%), USA (33.33%) | BA.1 (66.67%), B.1.399 (33.33%) | Side chain hydrogen bond to the side chain of Q299 |
| 166 | Glu (E) | E166G, E166D | 5 | 4 | 5 | USA (40.0%), Finland (20.0%), Nigeria (20.0%), Switzerland (20.0%) | AY.107 (20.0%), AY.39 (20.0%), B.1.177.23 (20.0%), B.1.525 (20.0%), B.1.617.2 (20.0%) | Side chain hydrogen bond to the side chain of S1 |
| 285 | Ala (A) | A285V, A285P, A285T, A285D, A285S, A285E, A285G | 1,426 | 57 | 115 | USA (25.74%), Switzerland (18.37%), UK (10.87%), Brazil (6.1%) | B.1.1.29 (17.21%), B.1.617.2 (11.8%), B.1.1.7 (7.37%), AY.4 (7.09%) | van der Waals/hydrophobic interaction with the side chains of A285 and L286 |
| 286 | Leu (L) | L286I, L286F | 6 | 4 | 5 | USA (50.0%), Egypt (16.67%), Netherlands (16.67%), UK (16.67%) | B.1.2 (33.33%), AY.3 (16.67%), AY.4.2 (16.67%), B.1 (16.67%), B.1.617.2 (16.67%) | van der Waals/hydrophobic interaction with the side chain of A285 |
| 290 | Glu (E) | 0 | – | – | – | – | Salt bridge to R4 | |
| 298 | Arg (R) | R298K, R298G, R298I, R298S, R298T | 582 | 34 | 52 | UK (58.25%), USA (20.96%) | AY.4 (43.47%), B.1.617.2 (24.4%) | Side chain hydrogen bond to the backbone of S123 |
| 299 | Gln (Q) | Q299H | 3 | 1 | 1 | Nigeria (100.0%) | B.1.1.7 (100.0%) | Side chain hydrogen bond to the side chain of S139 |
AA, amino acid.
Side chains of K12 and E14 are over 5 Å apart. Hence, an actual salt bridge is not likely to form, although a relatively weak ionic attraction cannot be ruled out.
Stop codons are denoted with (*).
Dashes (-) indicate that no data was available at the time of this study.