| Literature DB >> 34103497 |
Matthew M Hernandez1,2, Ana S Gonzalez-Reiche3, Hala Alshammary1, Shelcie Fabre4, Zenab Khan3, Adriana van De Guchte3, Ajay Obla3, Ethan Ellis3,5, Mitchell J Sullivan3, Jessica Tan1,6, Bremy Alburquerque3,6, Juan Soto3,5, Ching-Yi Wang4, Shwetha Hara Sridhar3,5, Ying-Chih Wang3,5, Melissa Smith3,5, Robert Sebra3,5,7,8, Alberto E Paniz-Mondolfi2,4, Melissa R Gitman2,4, Michael D Nowak2,4, Carlos Cordon-Cardo2, Marta Luksza3,9, Florian Krammer1, Harm van Bakel10,11, Viviana Simon12,13,14, Emilia Mia Sordillo15,16.
Abstract
Numerous reports document the spread of SARS-CoV-2, but there is limited information on its introduction before the identification of a local case. This may lead to incorrect assumptions when modeling viral origins and transmission. Here, we utilize a sample pooling strategy to screen for previously undetected SARS-CoV-2 in de-identified, respiratory pathogen-negative nasopharyngeal specimens from 3,040 patients across the Mount Sinai Health System in New York. The patients had been previously evaluated for respiratory symptoms or influenza-like illness during the first 10 weeks of 2020. We identify SARS-CoV-2 RNA from specimens collected as early as 25 January 2020, and complete SARS-CoV-2 genome sequences from multiple pools of samples collected between late February and early March, documenting an increase prior to the later surge. Our results provide evidence of sporadic SARS-CoV-2 infections a full month before both the first officially documented case and emergence of New York as a COVID-19 epicenter in March 2020.Entities:
Year: 2021 PMID: 34103497 PMCID: PMC8187428 DOI: 10.1038/s41467-021-23688-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Detection of SARS-CoV-2 nucleic acids in nasopharyngeal specimens collected in the first 10 weeks of 2020.
a Schematic representation of the study design. Nasopharyngeal swab specimens that tested negative for respiratory pathogens (RPN) were pooled. Each pool consisted of ten specimens from the same week from one of five hospital sites. Nucleic acid amplification testing (NAAT) was performed and RNA was processed for SARS-CoV-2 genome assembly. b Select events and responses to the evolving SARS-CoV-2 pandemic are annotated over the timeframe surveyed. Confirmed cases in NYC for the last 2 weeks are noted. Absolute counts of pools that tested positive or negative for RT-PCR targets (ORF1ab+E+ (magenta), ORF1ab+ (yellow), E+ (cyan), Negative (dark purple)) are depicted by week collected. c Distribution of pools with RT-PCR target results (ORF1ab+E+ (solid), ORF1ab+ (dotted), E+ (cross-hatched)) across the five different hospital sites in NYC (Hospital A, blue; B, red; C, green; D, purple; E, orange). d Distribution of SARS-CoV-2 sequences recovered by collection week and hospital site of RPN pools. Filled points reflect complete SARS-CoV-2 consensus genomes recovered and points with X’s reflect partial genomes recovered (e.g., incomplete genomes and those validated by SARS-CoV-2 reads). Colors denote hospital sites as indicated by the legend in (c).
Fig. 2Phylogenetic relationships of previously undetected SARS-CoV-2 and other NY and global isolates.
a Multiple sequence alignment of >95% complete SARS-CoV-2 genome sequences obtained from RPN pools relative to Wuhan-Hu-1 (RefSeq: NC_045512). RPN pools are ordered by date and PANGO lineage as displayed in (a). The SARS-CoV-2 genome coordinates and gene annotations are shown above. Single- nucleotide variations (SNVs) are depicted with vertical lines in red (clade defining) or black (other). b Coverage for pools with detectable RT-PCR targets (ORF1ab+E+ (magenta), ORF1ab+ (yellow), E+ (cyan)) collected prior to the first confirmed case in NY (NY1) with detectable SARS-CoV-2 reads that could not be assembled to complete genomes (>Q30 reads are shown). Nextera XT comprises data from both whole-genome and targeted amplicon sequencing library preparations. c Maximum-likelihood (ML) phylogenetic inference shown as a time tree of seven SARS-CoV-2 genome sequences from this surveillance study in a global background of 2993. Tip circles indicate the position of the respiratory pathogen-negative (RPN) pools (red) described in this report, the first reported COVID-19 case in NYC (green) from 29 February, later NYC cases from MSHS (yellow) and other institutions (dark gray), and US (blue) early isolates prior to 1 March. Tips without circles correspond to the background global isolates. The yellow box delineates the position of the clade containing the majority of NYC sequences detected during the early spread. The PANGO lineage classification of the RPN pools is indicated on the right, and the NextStrain clades are shown as node labels. The specimen identifier is indicated for RPN pools detected earlier than NY1. The time tree was inferred under a strict clock model with a nucleotide substitution rate of 0.80 × 10−3.
Lineage classification of previously undetected SARS-CoV-2 in NYC.
| Sample | Week ending | Ct values ( | Genome completeness (%) | NextStrain clade | Clade-defining mutations | PANGO lineage | Lineage detection prior to 1 March |
|---|---|---|---|---|---|---|---|
| Pool-134 | 7-Mar-2020 | 18.62, 18.63 | 99.9 | 20C | S:D614G, ORF1b:P314L, OFR3a:Q57H, ORF1a:T265I | B.1 | Mainly Europe, linked to Italian outbreak, only a few North American (non-US) isolates |
| Pool-41 | 7-Mar-2020 | 28.94, 29.36 | 99.7 | 20C | S:D614G, ORF1b:P314L, OFR3a:Q57H, ORF1a:T265I | B.1 | |
| Pool-58 | 7-Mar-2020 | 31.66, 33.65 | 99.8 | 20C | S:D614G, ORF1b:P314L, OFR3a:Q57H, ORF1a:T265I | B.1 | |
| Pool-53 | 7-Mar-2020 | 20.64, 21.07 | 99.8 | 20C | S:D614G, ORF1b:P314L, OFR3a:Q57H, ORF1a:T265I | B.1 | |
| Pool-38 | 7-Mar-2020 | 28.91, 29.43 | 98.9 | 20A | S:D614G, ORF1b:P314L | B.1.5 | Europe/South America/Asia |
| Pool-95 | 29-Feb-2020 | 20.80, 21.11 | 99.8 | 19A | T14408C | B.2.12 | Asia/Europe/Oceania |
| Pool-56 | 7-Mar-2020 | 25.37, 25.65 | 99.7 | 19B | C8782T ORF8:L84S | A.2 | Europe |