| Literature DB >> 33609027 |
Ashley E Nazario-Toole1, Hui Xia1, Thomas F Gibbons1.
Abstract
INTRODUCTION: The outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has created a global pandemic resulting in over 1 million deaths worldwide. In the Department of Defense (DoD), over 129,000 personnel (civilians, dependents, and active duty) have been infected with the virus to date. Rapid estimations of transmission and mutational patterns of virus outbreaks can be accomplished using whole-genome viral sequencing. Deriving interpretable and actionable results from pathogen sequence data is accomplished by the construction of phylogenetic trees (from local and global virus sequences) and by the creation of protein maps, to visualize and predict the effects of structural protein amino acid mutations.Entities:
Mesh:
Substances:
Year: 2022 PMID: 33609027 PMCID: PMC7928784 DOI: 10.1093/milmed/usab031
Source DB: PubMed Journal: Mil Med ISSN: 0026-4075 Impact factor: 1.437
FIGURE 1.Genomic and epidemiological diversity of 59MDW SARS-CoV-2 sequences. The SARS-CoV-2 phylogeny was constructed in a local version of Nextstrain (Auspice.us) and shows the evolutionary relationship of 59MDW SARS-CoV-2 genomes (black dots) and a subsample of publicly available SARSCoV-2 genomes collected worldwide (n = 301, colored by viral clade [see legend]). This phylogeny contextualizes mutation rates and transmission patterns of SARS-CoV-2 strains collected at JBSA/Lackland and can be viewed locally using Auspice.us and a provided .json file. The tree is a time-resolved display rooted relative to early samples collected in Wuhan, China, and site numbering and genome structure uses Wuhan-Hu-1/2019 as reference. Positive control samples are shown in red and mutations in the spike protein are indicated in 59MDW samples.
Summary of Nucleotide and Non-synonymous Amino Acid Mutations in 59MDW SARS-CoV-2 Sequences
| Gene | Number of nucleotide mutations | Nucleotide mutation frequency | Number of non-synonymous amino acid changes |
|---|---|---|---|
|
| 44 | 0.404 | 23 |
|
| 21 | 0.193 | 13 |
|
| 14 | 0.128 | 7 |
|
| 11 | 0.101 | 9 |
|
| 1 | 0.009 | 1 |
|
| 0 | 0.000 | 0 |
|
| 1 | 0.009 | 0 |
|
| 2 | 0.018 | 1 |
|
| 5 | 0.046 | 4 |
|
| 9 | 0.083 | 4 |
|
| 1 | 0.009 | 1 |
109 nucleotide mutations were identified in coding regions of SARS-CoV-2 genomes sequenced from 59MDW samples. The mutation frequency per gene was calculated by dividing the number of mutations per gene by number of genome-wide mutations. Column 4 lists the number of non-synonymous amino acid changes per gene.
Non-synonymous Mutations in SARS-CoV-2 Structural Proteins
| Gene | Nucleotide mutation | Amino acid mutation | Biochemical property | 59MDW sample(s) | Frequency |
|---|---|---|---|---|---|
|
| 21,642 G > C | R21T | Positively charged to polar but neutral | 3829 | 2.70% |
|
| 22,245 G > T | D228Y | Negatively charged to polar but neutral | 3680 | 2.70% |
|
| 23,403 A > G | D614G | Negatively charged to nonpolar | All | 100.00% |
|
| 23,625 C > T | A688V | Both nonpolar | 3829, 3831, 3784, 2940, 2913, 3014 | 16.22% |
|
| 2,375 A > G | T732A | Polar but neutral to nonpolar | 3,786 | 2.70% |
|
| 24,236 G > T | A892S | Nonpolar to polar but neutral | 3807, 3723, 3804, 3776, 3808 | 13.51% |
|
| 24,770 G > T | A1070S | Nonpolar to polar but neutral | 3784 | 2.70% |
|
| 26,455 C > T | P71S | Nonpolar to polar but neutral | 2973, 3488, 2956 | 8.11% |
|
| 28,362 C > G | G30A | Both nonpolar | 3654, 3681, 3096, 3709, 3680 | 13.51% |
|
| 28,845 C > T | R191L | Positively charged to nonpolar | 3654, 3681, 3096, 3709, 3680 | 13.51% |
|
| 28,775 C > T | P168S | Nonpolar to polar but neutral | 3488 | 2.70% |
|
| 28,821 C > A | S183Y | Both polar but neutral | 3014, 2913, 3784, 2940, 3829, 3831, 3717, 3535, 3687, 3724, 3540, 3537, 3802, 3542, 3667, 3808, 3776, 3804, 3273, 3807 | 54.05% |
|
| 28,881 G > A | R203K | Both positively charged | 2576, 3786, 3634 | 8.11% |
|
| 28,882 G > A | G204R | Nonpolar to positively charged | 2576, 3786, 3634 | 8.11% |
SARS-CoV-2 whole-genome sequencing revealed 14 non-synonymous mutations in genes encoding structural proteins (S = spike; E = envelope; N = nucleocapsid) in 59MDW specimens. The biochemical properties of each non-synonymous change was characterized using the NCBI structure program, and the mutation frequency was calculated by dividing the number of specimens with a mutation by the total number of sequenced clinical specimens (N = 37).
FIGURE 2.59MDW amino acid mutations overlaid on spike structure in prefusion conformation. (A-D) Spike protein amino acid mutations were mapped onto the prefusion SARS-CoV-2 spike glycoprotein structure (PDB: 6VSB; Wrapp et al.[19]) using PyMOL. Predicted B-cell epitopes (Grifoni et al.[20]) are shown in black, and 59MDW amino acid changes are shown in gray. Spike protein domains: Yellow = S1 (Note, blue = N-terminal domain [NTD] and green = receptor-binding domain [RBD] are also part of S1); cyan = fusion peptide (FP); red = S2 domain; purple = connector domain; (CD). (E) Diversity of the S protein of SARS-CoV-2 sequenced from 59MDW specimens.