| Literature DB >> 33861950 |
Nicole L Washington1, Karthik Gangavarapu2, Mark Zeller3, Alexandre Bolze4, Elizabeth T Cirulli4, Kelly M Schiabor Barrett4, Brendan B Larsen5, Catelyn Anderson3, Simon White4, Tyler Cassens4, Sharoni Jacobs4, Geraint Levan4, Jason Nguyen4, Jimmy M Ramirez4, Charlotte Rivera-Garcia4, Efren Sandoval4, Xueqing Wang4, David Wong4, Emily Spencer3, Refugio Robles-Sikisaka3, Ezra Kurzban3, Laura D Hughes6, Xianding Deng7, Candace Wang7, Venice Servellita7, Holly Valentine8, Peter De Hoff8, Phoebe Seaver8, Shashank Sathe8, Kimberly Gietzen9, Brad Sickler9, Jay Antico9, Kelly Hoon9, Jingtao Liu9, Aaron Harding10, Omid Bakhtar10, Tracy Basler11, Brett Austin11, Duncan MacCannell12, Magnus Isaksson4, Phillip G Febbo9, David Becker4, Marc Laurent4, Eric McDonald11, Gene W Yeo8, Rob Knight8, Louise C Laurent8, Eileen de Feo9, Michael Worobey5, Charles Y Chiu13, Marc A Suchard14, James T Lu4, William Lee4, Kristian G Andersen15.
Abstract
The highly transmissible B.1.1.7 variant of SARS-CoV-2, first identified in the United Kingdom, has gained a foothold across the world. Using S gene target failure (SGTF) and SARS-CoV-2 genomic sequencing, we investigated the prevalence and dynamics of this variant in the United States (US), tracking it back to its early emergence. We found that, while the fraction of B.1.1.7 varied by state, the variant increased at a logistic rate with a roughly weekly doubling rate and an increased transmission of 40%-50%. We revealed several independent introductions of B.1.1.7 into the US as early as late November 2020, with community transmission spreading it to most states within months. We show that the US is on a similar trajectory as other countries where B.1.1.7 became dominant, requiring immediate and decisive action to minimize COVID-19 morbidity and mortality.Entities:
Keywords: 501Y.V1; B.1.1.7; COVID-19; SARS-CoV-2; VOC-202012/01; genomic epidemiology; variant of concern
Mesh:
Year: 2021 PMID: 33861950 PMCID: PMC8009040 DOI: 10.1016/j.cell.2021.03.052
Source DB: PubMed Journal: Cell ISSN: 0092-8674 Impact factor: 41.582
Figure 1Estimated proportion of B.1.1.7 in SARS-CoV-2 tests at Helix since December 15, 2020
(A) Map of contiguous states in the US with each bubble representing the number of positive Helix COVID-19 tests from each state.
(B) Estimated proportion of B.1.1.7 in total number of positive tests with Cq(N gene) <27, in the US overall, California, Florida, and Georgia from December 15th, 2020 to February 11th, 2021. The proportion of B.1.1.7 samples was estimated using: . There is an ∼2 week lag between sequence data and testing data. We had sequence data until February 2nd, but we had testing data until February 19th. To fully utilize the testing data, we used the average proportion of B.1.1.7 sequences in sequenced samples with SGTF from the last 5 days of available sequence data in each location to infer the proportion of B.1.1.7 cases in total positive tests for an additional 2 week period (February 3 to February 19). The black line shows the 5-day rolling average of the estimated proportion of B.1.1.7 in total positives. The inverted bar chart shows the temporal distribution of the B.1.1.7 genomes sequenced and the number of sequenced samples with SGTF.
(C) Logistic growth curves fit to the rolling average of the estimated proportion of B.1.1.7 in total positives for the US, Florida, California, and Georgia. The shaded area represents the 95% CI for each fit. The inset shows the zoomed in view of the curve fit. The predicted time when the estimated proportion of B.1.1.7 cases crosses 0.5 is indicated in red. See also Data S1.
Figure 2Phylogenetic analysis of B.1.1.7 lineage in the US
(A) Maximum clade credibility (MCC) tree of the time resolved phylogenetic analysis of B.1.1.7 sequences in the US in the context of sequences sampled globally. The gradient represents uncertainty in the tree topology and is used to mask internal nodes with very low support (posterior probability ≪0.1). Clades that consist primarily of sequences sampled in the US supported by a basal node with posterior probability ≥0.98 are highlighted in the tree with the posterior probability annotated at the basal node. The closest ancestral node to each clade with a posterior probability ≥0.98 is highlighted in black. In this phylogeny, we only show clades with ≥20 sequences and independent introductions into Pennsylvania (PA) and Georgia (GA). Please see Figure S1 for a phylogeny with all the annotations of the MCC tree.
(B) The color scheme of terminal nodes sampled in the MCC tree. Sequences sampled outside the US are colored in light gray. US states with no B.1.1.7 sequence sampling in the dataset are shown in light gray in the map.
(C) The TMRCA of each clade highlighted in the MCC tree.
(D) The proportion of the geographic sampling of sequences within each clade (singletons have been excluded, including those in Texas, Pennsylvania, and Massachusetts). The colors follow the same scheme as shown in (B).
See also Figure S1.
Figure S1Phylogenetic analysis of B.1.1.7 lineage in the US, related to Figure 2
(A) Maximum clade credibility (MCC) tree of the time resolved phylogenetic analysis of B.1.1.7 sequences in the U.S. in the context of sequences sampled globally. The gradient represents uncertainty in the tree topology and is used to mask internal nodes with very low support (posterior probability ≪ 0.1). Clades that consist primarily of sequences sampled in the U.S. supported by a basal node with posterior probability ≥ 0.98 are highlighted in the tree with the posterior probability annotated at the basal node. The closest ancestral node to each clade with a posterior probability ≥ 0.98 is highlighted in black.
(B) The color scheme of terminal nodes sampled in the MCC tree. Sequences sampled outside the U.S. are colored in light gray. U.S. States with no B.1.1.7 sequence sampling in the dataset are shown in light-gray in the map.
(C) The TMRCA of each clade highlighted in the MCC tree.
(D) The proportion of the geographic sampling of sequences within each clade (singletons have been excluded, including those in Texas, Pennsylvania, and Massachusetts).
Mean proportion of B.1.1.7 sequences in sequence SGTF samples from the last 5 days with sequence data in each location used to infer the proportion of B.1.1.7 cases in total positive tests from February 3 to February 19, 2021
| Location | Mean |
|---|---|
| California | 1 |
| Florida | 0.897 |
| Georgia | 0.916 |
| USA | 0.81 |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Helix ® COVID-19 Test | Helix | EUA201636 |
| TaqPath COVID-19 Combo Kit | Thermo Fisher | Cat#A47814 |
| Illumina CovidSeq Test | Illumina | 1000000128490 v01 |
| MagMAX Viral/Pathogen II Nucleic Acid Isolation Kit | Thermo Fisher | Cat#A48383 |
| Omega BioTek MagBind Viral DNA/RNA Kit | Omega Biotek | Cat#M6246-03 |
| Nextera XT | Illumina | Cat#FC-131-1096 |
| Illumina NextSeq with 500/550 Mid Output Kit v2.5 | Illumina | Cat#20024908 |
| Illumina NovaSeq 6000 with SP Reagent Kit v1.5 | Illumina | Cat#20028400 |
| Nanopore MinION KAPA HyperPrep kit | Roche | Cat#07962363001 |
| End-repair and Ligation Sequencing Kit | Oxford Nanopore | Cat#SQK-LSK109 |
| Quick-DNA/RNA Viral MagBead kit | Zymo Research | Cat#R2141 |
| Illumina NovaSeq 6000 S4 flow cell & Reagent Kit v1.5 | Illumina | Cat#20044417 |
| Illumina NovaSeq Xp 4-lane Kit v1.5 | Illumina | Cat#20042337 |
| Omega BioTex MagBind Viral DNA/RNA Kit | Omega Biotek | Cat#M6246-03 |
| KindFisher Flex Purification System | Thermo Fisher | Cat#5400630 |
| Protoscript First Strand cDNA Synthesis Kit | NEB | Cat#E6560L |
| Q5 Hot Start High-Fidelity DNA Polymerase Kit | NEB | Cat #0493L |
| NEBNext Ultra II DNA Library Kit for Illumina | NEB | Cat#E7645L |
| MiSeq Reagent Kit V2 | Illumina | Cat#MS-102-2002 |
| SARS-CoV-2 reference genome | NCBI | NCBI: NC_045512.2 |
| B.1.1.7 first identified sequence | GISAID | GISAID: EPI_ISL_601443 |
| B.1.1.7 outgroup | NCBI | NCBI: NC_045512 |
| BEAST XML and log files | This paper | |
| Non-US B.1.1.7 and additional B.1.1 lineage sequences | GISAID | |
| US B.1.1.7 sequences | This paper; GISAID | |
| SGTF and B117 ongoing Summary Level Data | This paper | |
| SARS-CoV-2 primers | N/A | |
| NEBNext Multiplex Oligos for Illumina | NEB | Cat#E6440L |
| ARTIC Network n-CoV-19 V3 primers | ARTIC Network | |
| DRAGEN COVIDSeq Test Pipeline v.1.3.0.28 | Illumina | N/A |
| Pangolin v2.0 | ||
| NextClade v0.12.0 | ||
| Iqtree2 | ||
| BEASTv1.10.5pre | ||
| BEAGLE | ||
| baltic | ||
| Chiu lab genome assembly & variant calling | N/A | |
| Andersen Lab consensus calling for nanopore data | ||
| Snakemake | ||
| bwa-mem | ||
| iVar v1.2.2 | ||
| Transmissibility estimation | N/A | |
| Conditional reference prior for overall clock rate | N/A | |