| Literature DB >> 36194597 |
Kristen J Wade1, Samantha Tisa1, Chloe Barrington1, Jesslyn C Henriksen1, Kristy R Crooks2, Christopher R Gignoux2, Austin T Almand3, J Jordan Steel3, John C Sitko3, Joseph W Rohrer2, Douglas P Wickert3, Erin A Almand3, David D Pollock1, Olivia S Rissland1.
Abstract
Since the initial reported discovery of SARS-CoV-2 in late 2019, genomic surveillance has been an important tool to understand its transmission and evolution. Here, we sought to describe the underlying regional phylodynamics before and during a rapid spreading event that was documented by surveillance protocols of the United States Air Force Academy (USAFA) in late October-November of 2020. We used replicate long-read sequencing on Colorado SARS-CoV-2 genomes collected July through November 2020 at the University of Colorado Anschutz Medical campus in Aurora and the United States Air Force Academy in Colorado Springs. Replicate sequencing allowed rigorous validation of variation and placement in a phylogenetic relatedness network. We focus on describing the phylodynamics of a lineage that likely originated in the local Colorado Springs community and expanded rapidly over the course of two months in an outbreak within the well-controlled environment of the United States Air Force Academy. Divergence estimates from sampling dates indicate that the SARS-CoV-2 lineage associated with this rapid expansion event originated in late October 2020. These results are in agreement with transmission pathways inferred by the United States Air Force Academy, and provide a window into the evolutionary process and transmission dynamics of a potentially dangerous but ultimately contained variant.Entities:
Mesh:
Year: 2022 PMID: 36194597 PMCID: PMC9531818 DOI: 10.1371/journal.pone.0274050
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Flow chart of SARS-CoV-2 sequencing, variant prediction, variant quality analysis and relatedness network generation.
Blue boxes describe the purpose of each stage of analysis, free text designates the specific method used at each stage and orange arrows indicate points where steps were repeated iteratively until sufficient quality metrics were fulfilled.
Fig 2Major phylogroup clustering of the CUAF SARS-CoV-2 genomes.
Tip labels indicate how many CU Anschutz (CU) and USAFA (AF) genomes are associated with ancestor. Color shading indicates which main phylogroup genomes belong to, A1-A3, corresponding to NextStrain clades 20A, 20C and 20G, respectively.
Fig 3Closely related subset of Air Force samples suggests a novel, rapidly transmitting strain with low average mutation rate, the “Colorado Springs Variant” (A12-A13).
Numbers at tips indicate average number of mutations per genome in each ancestral grouping.
Lineage-defining mutations in a set of 44 Colorado SARS-CoV-2 genomes.
| Mutation frequency in US [NextStrain] | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Position | ORF | REF | ALT | Amino acid change | August 2020 [8–14] | November 2020 [11–14] | February 2021 [2–13] | May 2021 [5–5] | Ancestor introduced | NextStrain clade |
| 241 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A1 | 20A | |
| 829 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A13 | NA | |
| 1059 | ORF1A | C | T | T265I | 25% | 36% | 27% | 12% | A2 | 20C |
| 1927 | ORF1A | T | C | <<1% | <<1% | <<1% | <<1% | A11 | NA | |
| 2668 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A7 | NA | |
| 3037 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A1 | 20A | |
| 4021 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A12 | NA | |
| 7006 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A8 | NA | |
| 7086 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A12 | sporadic | |
| 10319 | ORF1A | C | T | L3352F | 5% | 26% | 11% | <1% | A3 | 20G |
| 11824 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A9 | NA | |
| 12295 | ORF1A | C | T | <<1% | <<1% | <<1% | <<1% | A12 | NA | |
| 13216 | ORF1A | T | C | <<1% | <<1% | <<1% | <<1% | A12 | NA | |
| 14187 | ORF1B | G | A | <<1% | <<1% | <<1% | <<1% | A12 | NA | |
| 14408 | ORF1B | C | T | P314L | 90% | 94% | 91% | 97% | A1 | 20A |
| 15766 | ORF1B | G | T | V767L | 3% | 9% | 2% | <<1% | A11 | 20G |
| 18424 | ORF1B | A | G | N1653D | 3% | 24% | 11% | <1% | A9 | 20G |
| 18486 | ORF1B | C | T | <<1% | <<1% | <<1% | <<1% | A6 | NA | |
| 18538 | ORF1B | G | T | V1691L | <<1% | 1% | 1% | <<1% | A11 | sporadic |
| 19180 | ORF1B | G | T | <<1% | <<1% | <<1% | <<1% | A8 | sporadic | |
| 19891 | ORF1B | G | T | <<1% | <<1% | <<1% | <<1% | A9 | NA | |
| 20268 | ORF1B | A | G | <<1% | <<1% | <<1% | <<1% | A5 | NA | |
| 21304 | ORF1B | C | T | R2613C | 3% | 24% | 10% | <1% | A10 | 20G |
| 21390 | ORF1B | A | G | <<1% | <<1% | <<1% | <<1% | A12 | NA | |
| 21830 | S | G | T | <<1% | <<1% | <<1% | <<1% | A13 | NA | |
| 22162 | S | T | C | <<1% | <<1% | <<1% | <<1% | A6 | NA | |
| 22255 | S | A | T | <<1% | <<1% | <<1% | <<1% | A11 | NA | |
| 22687 | S | C | T | <<1% | <<1% | <<1% | <<1% | A11 | NA | |
| 23403 | S | A | G | D614G | 98% | 98% | 98% | 100% | A1 | 20A |
| 25563 | ORF3A | G | T | Q57H | 31% | 49% | 31% | 13% | A2 | 20C |
| 25593 | ORF3A | G | C | K67N | <<1% | 1% | <<1% | <<1% | A12 | sporadic |
| 25907* | ORF3A | G | T | G172V | 4% | 25% | 11% | <1% | A10 | 20G |
| 25930 | ORF3A | T | C | S180P | <<1% | 1% | 1% | <<1% | A11 | sporadic |
| 26040 | ORF3A | A | T | <<1% | <<1% | <<1% | <<1% | A9 | sporadic | |
| 27964 | ORF8 | C | T | S24L | 6% | 27% | 11% | <1% | A3 | 20G |
| 28472 | N | C | T | P67S | 3% | 24% | 11% | <1% | A10 | 20G |
| 28655 | N | G | T | <<1% | <<1% | <<1% | <<1% | A7 | NA | |
| 28854 | N | C | T | S194L | 16% | 15% | 4% | 2% | A4 | 20A |
| 28869 | N | C | T | P199L | 5% | 28% | 12% | 3% | A10 | 20G |
| 28887 | N | C | T | T205I | 1% | 10% | 17% | 9% | A9 | 21C |
| 29439 | N | A | T | Q389H | <1% | <1% | <<1% | <<1% | A11 | sporadic |
* Mutation is found at frequencies > 1% in the ncov North America dataset
1positions refer to aligned location in the Wuhan reference genome
2REF and ALT refer to the nucleotide state at that position in the reference genome and the corresponding alternative allele at that position in comparison genomes, respectively.
3Amino acid replacement standard notation shows the reference amino acid, the amino acid position in the corresponding gene, and then the inferred amino acid in the alternative genomes, all as determined from ncov.
4The clade corresponding to an ancestral sequence was determined as the named clade in the NextStrain database at the time of evaluation with the most mutations shared with the “ancestor introduced” prior to that clade on the NextStrain phylogenetic tree.
5Mutation is not associated with a phylogenetic grouping, but occurs at seemingly random branches throughout the entire tree date accessed: 7/12/21
Number of genomes: Showing 4000 of 4043 genomes sampled between Mar 2020 and June 2021
Fig 4Frequency of a CUAF clade 20G mutation in North America between March 2020 and June 2021.
The mutation pictured here is from ORF1A, genomic position 10319, amino acid position 3352.
Fig 5Divergence estimates indicate that the CO Springs lineage originated around October 26th, 2020.
The red arrow indicates the node where the CO Springs lineage branches off from ancestral nodes. 95% HPD and tree distances are in units of “days before November 19, 2020”.
Lineage-defining mutations contributing to the Colorado Springs variant [A12-A13].
| Ancestor | Mutation position | ORF | Protein | REF allele | ALT allele | Amino acid position | Codon change | Amino acid change | Biochem property change? |
|---|---|---|---|---|---|---|---|---|---|
| A13 | 829 | ORF1A | Nsp2 | C | T | 188 | AAC->AAT | N->N | N |
| A12 | 4021 | ORF1A | Macro Domain | C | T | 1252 | AAC->AAT | N->N | N |
| A12 | 7086 | ORF1A | Nsp3_C | C | T | 2274 | ACT->ATT | T-> I | Y |
| A12 | 12295 | ORF1A | Nsp8 | C | T | 4010 | ACT->ACC | T->T | N |
| A12 | 13216 | ORF1A | Nsp10 | T | C | 4317 | GAT->GAC | D->D | N |
| A12 | 14187 | ORF1B | RdRp | G | A | 250 | AGG->AGA | R->R | N |
| A12 | 21390 | ORF1B | Methyltransferase | A | G | 244 | TTA->TTG | L->L | N |
| A13 | 21830 | S | Spike | G | T | 268 | GTT->TTT | V->F | Y |
| A12 | 25593 | ORF3a | Orf3a | G | C | 67 | AAG->AAC | K->N | Y |