Sushama Telwatte1, Nitasha Kumar1, Albert Vallejo-Gracia2, G Renuka Kumar2, Chuanyi M Lu3, Melanie Ott4, Joseph K Wong1, Steven A Yukl5. 1. Department of Medicine, University of California, San Francisco, CA, USA; Department of Medicine, San Francisco VA Health Care System, San Francisco, CA, USA. 2. Gladstone Institute of Virology, Gladstone Institutes, San Francisco, CA, USA. 3. Department of Laboratory Medicine, University of California, San Francisco, CA, USA; Department of Laboratory Medicine, San Francisco VA Health Care System, San Francisco, CA, USA. 4. Department of Medicine, University of California, San Francisco, CA, USA; Gladstone Institute of Virology, Gladstone Institutes, San Francisco, CA, USA. 5. Department of Medicine, University of California, San Francisco, CA, USA; Department of Medicine, San Francisco VA Health Care System, San Francisco, CA, USA. Electronic address: steven.yukl@ucsf.edu.
Abstract
A hallmark of coronavirus transcription is the generation of negative-sense RNA intermediates that serve as the templates for the synthesis of positive-sense genomic RNA (gRNA) and an array of subgenomic mRNAs (sgRNAs) encompassing sequences arising from discontinuous transcription. Existing PCR-based diagnostic assays for SAR-CoV-2 are qualitative or semi-quantitative and do not provide the resolution needed to assess the complex transcription dynamics of SARS-CoV-2 over the course of infection. We developed and validated a novel panel of sensitive, quantitative RT-ddPCR assays designed to target regions spanning the genome of SARS-CoV-2. Our assays target untranslated regions (5', 3') as well as different coding regions, including non-structural genes that are only found in full length (genomic) RNA and structural genes that are found in genomic as well as different subgenomic RNAs. Application of these assays to clinically relevant samples will enhance our understanding of SARS-CoV-2 gene expression and may also inform the development of improved diagnostic tools and therapeutics. Published by Elsevier B.V.
A hallmark of coronavirus transcription is the generation of negative-sense RNA intermediates that serve as the templates for the synthesis of positive-sense genomic RNA (gRNA) and an array of subgenomic mRNAs (sgRNAs) encompassing sequences arising from discontinuous transcription. Existing PCR-based diagnostic assays for SAR-CoV-2 are qualitative or semi-quantitative and do not provide the resolution needed to assess the complex transcription dynamics of SARS-CoV-2 over the course of infection. We developed and validated a novel panel of sensitive, quantitative RT-ddPCR assays designed to target regions spanning the genome of SARS-CoV-2. Our assays target untranslated regions (5', 3') as well as different coding regions, including non-structural genes that are only found in full length (genomic) RNA and structural genes that are found in genomic as well as different subgenomic RNAs. Application of these assays to clinically relevant samples will enhance our understanding of SARS-CoV-2 geneexpression and may also inform the development of improved diagnostic tools and therapeutics. Published by Elsevier B.V.
Theetiologic agent responsible for the ongoing COVID-19 pandemic, identified as Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), (Zhu et al., 2020; Kimet al., 2020a) is an enveloped virus with a positive-sense, single-stranded RNA genome of ∼30 kb. SARS-CoV-2, which is a member of the β-coronavirus genus, is the seventh coronavirus known to infect humans. It shares approximately 50 % sequence homology with MERS and 79 % sequence homology with SARS-CoV (Lu et al., 2020) but appears to bemore closely related to other coronaviruses from animals. SARS-CoV-2 shares extensive sequence homology with theSARS-likebat coronaviruses RmYN02 fromR. malayanus and RaTG13 fromR. affinis (93.3 % and 96.1 % sequence identity, respectively) (Zhou et al., 2020a), though other animals have also been proposed as sources for the virus (Andersenet al., 2020; Malaiyan et al., 2020).Theexact mechanism of SARS-CoV-2 replication and transcription is not fully understood (V’Kovski et al., 2020). However, a hallmark of coronavirus transcription and other viruses of the order Nidovirales is the generation of negative-sense RNA intermediates that serve as the templates for the synthesis of positive-sense genomic RNA (gRNA) and an array of subgenomic RNAs (sgRNAs). Subgenomic RNAs result from discontinuous transcription and encompass sequences from both ends of the genome (Sawicki et al., 2007; Kimet al., 2020b) (Fig. 1
). Following cell entry, SARS-CoV-2 genomic RNA is transcribed and translated to generate the non-structural proteins (NSP) from the two open reading frames (ORF), ORF1a and ORF1b (Kimet al., 2020b). Viral transcription is thought to involve the following components: (i) virus replication-transcription complex [RTC] consisting of RNA-dependent RNA polymerase [RdRp, Nsp12] (Cheng et al., 2005); the zinc-binding helicase [Nsp13] (Ivanov et al., 2004) and other enzymes responsible for viral RNA modification and proofreading) (Romano et al., 2020); (ii) AU-rich transcription-regulating sequences (TRSs); (iii) the N protein; and (iv) doublemembrane vesicles in the cytoplasm of infected cells (Sola et al., 2015; Snijder et al., 2020; Wu et al., 2020). During the synthesis of thenegative strand RNA, the replication-transcription complex encounters TRS elements located upstream of ‘body’ genes in the 3′ portion of the genome, referred to as ‘body’ TRS elements. It is hypothesized that a fixed proportion of replication-transcription complexes will either continueelongation past the body TRS or else pause, move, and re-initiate transcription adjacent to another TRS in the 5′ untranslated region, at theend of the leader sequence (‘leader TRS’) (Sethna et al., 1991; Sawicki and Sawicki, 1995; Woo et al., 2005). This discontinuous transcription results in a nested set of negative-strand subgenomic RNA (sgRNA) transcripts that feature a common 5′ leader sequence fused to one of various ‘body’ genes from the 3′ part of the genome (Fig. 1). Transcription of the sgRNAs is likely regulated by TRS sequences in the leader sequence and upstream of 3′ genes (Sola et al., 2015), and may allow variation in viral geneexpression. Notably, although some sgRNAs are structurally polycistronic, it is believed that only the first ORF at the 5′ end is translated fromeach sgRNA (Sawicki and Sawicki, 1995; Viehweger et al., 2019).
Fig. 1
Schematic presentation of SARS-CoV-2 genome organization, virion structure and canonical sgRNAs. SARS-CoV-2 encodes two large genes, ORF1a (yellow) and ORF1b (blue), which encode 16 non-structural proteins (NSP1–NSP16). The structural genes encode the structural proteins, spike (S; green), envelope (E; blue), membrane (M; purple), and nucleocapsid (N; gold). Primer locations for each gene target are indicated. Virion structure and canonical subgenomic (sg) RNAs produced by SARS-CoV-2 are shown in the lower panel (S, 3a, E, M, 6, 7a, 7b, 8 and N). The 5′ leader TRS found in each canonical sgRNA is depicted in grey. Figure is adapted from Sawicki et al. 2007 and Kim et al. 2020.
Schematic presentation of SARS-CoV-2 genome organization, virion structure and canonical sgRNAs. SARS-CoV-2encodes two large genes, ORF1a (yellow) and ORF1b (blue), which encode 16 non-structural proteins (NSP1–NSP16). The structural genes encode the structural proteins, spike (S; green), envelope (E; blue), membrane (M; purple), and nucleocapsid (N; gold). Primer locations for each gene target are indicated. Virion structure and canonical subgenomic (sg) RNAs produced by SARS-CoV-2 are shown in the lower panel (S, 3a, E, M, 6, 7a, 7b, 8 and N). The 5′ leader TRS found in each canonical sgRNA is depicted in grey. Figure is adapted from Sawicki et al. 2007 and Kimet al. 2020.A recently published study confirms that a similar mechanismexists for SARS-CoV-2 to generate nine canonical sgRNAs that join the 5′ UTR to genes in ORF 2–9 (Kimet al., 2020b) (Fig. 1). For other coronaviruses, sgRNAs encode virulence factors such as proteins that directly cause lesions (Cowley et al., 2010) or indirectly inhibit immune responses (Kopecky-Bromberg et al., 2007). Incorporation of 5′UTR sequences into the capped subgenomic mRNA templates of SARS-CoVmay confer resistance to cleavage by viral nsp1 protein (Huang et al., 2011), which typically inhibits host geneexpression by degradation of host mRNA (Kamitani et al., 2006; Narayanan et al., 2008; Kamitani et al., 2009). For positive-sense RNA viruses, sgRNAs act as messengers for expression of structural proteins or proteins related to pathogenesis and can regulate the transition between translation and virion production (Sztuba-Solińska et al., 2011). The various roles of sgRNAs in SARS-CoV-2 infection and pathogenesis remain to beelucidated, but the rapid accumulation and persistence of sgRNAs following infectionmay also contribute to disease progression.Understanding the viral dynamics of SARS-CoV-2 and the host response areessential in devising strategies to develop antiviral treatments or vaccines and curb new infections. Existing PCR-based diagnostic assays for SAR-CoV-2, which are interpreted in a qualitative or semi-quantitativemanner (positive, negative or indeterminate) and target only 1–2 viral regions, do not account for possible variation in RNA copy numbers due to subgenomic transcription. For example, themore 5′ genes in ORF1a and ORF1b areexcluded from subgenomic transcripts and thereforemay be present at lower levels than genes in ORF 2–10, which are present in subgenomic as well as genomic transcripts. Among the ‘body’ genes found in sgRNAs, those at the 3′ end [ORF 9–10] would beexpected to present in all sgRNAs and thereforemight be present at higher copy numbers than more 5′ body genes, such as ORF2, which is only present in one type of subgenomic transcript. Therefore, RNA levels of a given genemay depend on the degree to which it is transcribed as various sgRNAs (Kimet al., 2020b), the degree to which the sample includes virion or cell-associated RNAs, and the degree to which sgRNAs may persist in doublemembraned vesicles even after replication has ceased (van Hemert et al., 2008; Alexandersenet al., 2020).Molecular assays that can quantify different genes present in sgRNA and/or gRNA species may prove useful for improving clinical diagnostic tests and for research in understanding how the regulation of viral geneexpression contributes to clinical disease. To help investigate these questions, we devised a novel panel of seven ddPCR-based assays that target various conserved regions of SARS-CoV-2 RNA, including the 5′ and 3′ untranslated regions, non-structural genes that are only found in full length (genomic) RNA, and structural genes that are also contained in different sgRNAs (Fig.1 and Table 1
).
Table 1
SARS-Cov2 ddPCR assay panel for assessing patient samples.
Assay Name
RNA Target
Detects
5′UTR
5′ untranslated region
Genomic RNA
Main Proteinase-NSP5
Main Proteinase
Genomic RNA
RdRp
RNA-dependent RNA Polymerase
Genomic RNA
S-PBCS
Polybasic cleavage site of the surface (S) glycoprotein
Genomic/subgenomic
M-ORF5
Membrane glycoprotein
Genomic/subgenomic
N-ORF9
Nucleocapsid
Genomic/subgenomic
3′UTR
3′ untranslated region
Genomic/subgenomic
SARS-Cov2 ddPCR assay panel for assessing patient samples.We selected genes encoding two non-structural proteins [Main Proteinase (NSP5) and RNA dependent RNA polymerase (RdRp-NSP12)] and four major structural proteins [Spike glycoprotein (S), envelope (E), membrane (M), and nucleocapsid (N)] that are known to serve critical functions in SARS-CoV-2 infection. For thespike protein, in which notablemutations haveemerged (Korber et al., 2020; du Plessis et al., 2020; Fiorentini et al., 2021), we designed a primer/probe set to target the short, highly-conserved ‘polybasic cleavage site’ (‘S-PBCS’) of SARS-CoV-2, which is functionally cleaved to yield the S1 and S2 subunits (Walls et al., 2020), in a manner similar to the hemagglutinin (HA) protein of avian influenza viruses (AIVs) (Böttcher-Friebertshäuser et al., 2014). In AIVs, the insertion or substitution of basic amino acids at the HA cleavage site is associated with enhanced pathogenicity (Monneet al., 2014; Horimoto et al., 1995).TheSARS-CoV-2PBCS allows effective cleavage by host furin and other proteases (Andersenet al., 2020), and may potentially enhance its infectivity in humans and distinguish it from related animal coronaviruses (Zhou et al., 2020a; Andersenet al., 2020; Nao et al., 2017). Elucidating the granular detail of SARS-CoV-2 transcription could help us to understand how the virus replicates and how it may evadehuman immune defenses. Detailed mapping of theexpressed viral transcripts across times and cell types is essential for further studies of viral geneexpression, mechanisms of replication, and probing host-viral interactions involved in pathogenicity.
Materials and methods
Primer design and selection
Multiple primer and probe sets were designed to target various regions of SAR-CoV-2, including untranslated regions that likely play an important role in regulating transcription (5′ and 3′ untranslated regions [UTR]), non-structural genes found only in genomic RNA (main protease [NSP5; ORF1a], RNA-dependent RNA polymerase [RdRp; ORF1b]), and structural genes that may also be found in various sgRNAs (spike [S] protein [ORF2] polybasic cleavage site [PBCS], membrane [M] glycoprotein [ORF5], and nucleocapsid [N] protein [ORF9]). Primers/probes were designed using the Primer Quest® Tool (Integrated DNA Technologies, Coralville, IA). A multiple sequence alignment was performed using Clustal Omega (Madeira et al., 2019), encompassing complete sequences of 86 SARS-CoV-2 isolates from all geographical locations and all sequences available from the US on 3/14/2020. Reference sequences of other coronaviruses, including SARS-CoV (NC_004718.3), MERS-CoV (NC_019843.3), HCoV-229E (NC_002645.1), HcoV-NL63 (NC_005831.2), HcoV−OC43 (NC_006213.1), and HcoV-HKU1 (NC_006577.2), were included in the alignment to exclude primer sets with significant overlap with non-SARS-CoV-2 sequences. For each region, we selected at least one primer/probe set (and where possible, an alternative set) that aligned to all SARS-CoV-2 isolates but had 1 or moremismatch with SARS-CoV and greater than 5 mismatches with MERS-CoV, HCoV-229E, HCoV-NL63, HCoV−OC43, and HCoV-HKU1 (Table 1). A sequence similarity analysis using Basic Local Alignment Search Tool (BLAST) (Altschul et al., 1997) found no significant similarity in any primer or probe to human sequences.
Validations using plasmid DNA
Plasmid constructs containing the regions of interest (5′UTR, 3′UTR, Main Proteinase, M gene, N gene, S protein, and a 528 nt fragment of RdRp) were designed in pBluescript KS(+) (Bio Basic Inc., Ontario, Canada) to enable assay validations using DNA and for use in in vitro transcription reactions to generate viral RNA for standards. Plasmid concentrations were quantified using ultraviolet (UV) spectrophotometry (NanoDrop ND-1000 instrument, Thermo Fisher) and themolecular weights were used to calculate the number of molecules per μL. Extracted PBMC from a healthy donor (150−200 ng/well) and H2O were included as negative controls for each assay.Each primer/probe set was tested using droplet digital PCR (ddPCR), as performed using the QX100 system (Bio-Rad). Droplet digital PCR was chosen because it enables “absolute” quantification, it is relatively less dependent on PCR efficiency (which may be reduced by sequencemismatches or inhibitors), and it may bemore precise than quantitative PCR (qPCR) at low copy numbers (Yukl et al., 2018). Plasmid DNA was added to ddPCR wells at expected inputs of 1–103 copies/well in duplicate (1000 and 100 copies) or quadruplicate (10 and 1 copy). Each reaction consisted of 20 μL per well containing 10 μL of ddPCR Probe Supermix (no deoxyuridine triphosphate), 900 nM of primers, 250 nM of probe, and 5 μL of plasmid DNA. Droplets wereamplified using a Mastercycler® nexus (Eppendorf, Hamburg, Germany) with the following cycling conditions: 10 min at 95 °C, 45 cycles of 30 s at 95 °C and 59 °C for 60 s, and a final droplet cure step of 10 min at 98 °C. Droplets were read and analyzed using the QuantaSoft software in the absolute quantification mode.
Validations using synthetic RNA
In vitro transcribed (IVT) RNA standards were generated from the aforementioned plasmids using the T7 RiboMAX™ Express Large Scale RNA Production System (Promega, Madison, WI). The concentration of each IVT RNA standard was measured by Nanodrop and themolecular weight was used to calculate theexpected number of molecules per μL. The length, integrity, and concentration of each IVT standard were confirmed using the Agilent Bioanalyzer RNA 6000 Nano assay (Agilent, Santa Clara, CA) prior to dilution in nuclease-freewater to working concentrations.A reverse transcription (RT) reaction was performed in 50 μL containing 5 μL of 10× SuperScript III buffer (Invitrogen), 5 μL of 50 mMMgCl2, 2.5 μL of random hexamers (50 ng/μL; Invitrogen), 2.5 μL of 50 μMpoly-dT15, 2.5 μL of 10 mMdeoxynucleoside triphosphates (dNTPs), 1.25 μL of RNAseOUT (40 U/μL; Invitrogen), and 2.5 μL of SuperScript III RT (200 U/μL; Invitrogen). Although the IVT standards were not polyadenylated, reverse transcription was performed with both random hexamers and poly-dT because we anticipated that these assays would be applied to clinical samples containing long polyadenylated SARS-CoV-2 RNAs, for which the combination of poly-dT plus random hexamers may reduce bias towards reverse transcription of any one region (as can be seen with specific reverse primers), the 5′ end (as would beexpected with random hexamers), or the 3′ end (as would beexpected with poly-dT).IVT RNA standards were added to RT reactions at inputs of 1, 10, 102, 103, and 104 copies per 5 μL (2 replicate RT reactions for each input). RT reactions were performed in a conventional thermocycler at 25.0 °C for 10 min, 50.0 °C for 50 min, followed by an inactivation step at 85.0 °C for 5 min. Undiluted RT product (5 μL) was added to ddPCR reactions (total volume of 20 μL) and ddPCR was performed as described in Section 2.2. Alternative primer/probe sets for a given region were compared head-to-head using this approach. Based on performance of each primer/probe set using plasmid DNA and IVT RNA, one primer/probe set for each region was selected for further testing.To determine the robustness of our approach, in addition to testing each assay with varying RNA copy inputs (each with two replicate RT reactions per input and replicate ddPCR wells for each RT), we performed repeat, independent experiments using the same parameters to confirmeach assay’s efficiency and sensitivity (n = 4 for N-ORF9, CDC_N1, and CDC_N2; n = 3 for 5′UTR, 3′UTR; and n = 2 for all others). No data wereexcluded as outliers.
Validations using SARS-CoV-2 supernatant viral RNA
Vero CCL-81 kidney epithelial cells, derived fromCercopithecus aethiops, wereinfected with SARS-CoV-2 (Isolate: USA-WA1/2020) at an MOI of 0.003 (250 000 cells/well). Cells were incubated for 72 h at 37 °C/5% CO2 and harvested. The supernatant virus was clarified by 2 centrifugation steps (180 xg, 5 min) and added directly to 1 mL TRI reagent (Molecular Research Center Inc.). Total RNA was extracted using TRI reagent, including the addition of polyacryl carrier (2.5 μL). Extracted RNA was then subjected to two rounds of DNase I treatment as follows to ensure degradation and removal of contaminating DNA. First, eluted RNA was added to a DNase Reaction Mix containing 40 mM Tris−HCL (pH 7.9; Invitrogen), 6 mMMgCl2 (Ambion), 10 mMCaCl2 (Sigma) and 1 U DNase RQ1 (Promega) and incubated at 37 °C for 15 min. Next, supernatant RNA was purified using the RNeasy Mini Kit with on-column DNase digestion with RNase-Free DNase I (Qiagen). The copies/μL in the supernatant standard wereestimated by triplicatemeasurements using the Abbott RealTimeSARS-CoV-2 assay (Abbott m2000 Molecular Platform). Dilutions of the supernatant standard were added to RT reactions to achieveexpected inputs of 1–70,000 copies per 5 μL RT (the input into each ddPCR well). RT reactions were performed as above, with random hexamers and poly-dT, except that the total volume of the RT was scaled up so that two replicate 5 μL aliquots of cDNA could then be used to test each assay in parallel using replicate 20 μL ddPCR reactions (as described in Sections 2.2 and 2.3) containing primers/probe specific for a given region.To determine whether a 1-step dd-RT-PCR approach could be adopted for each SARS-CoV-2 assay, a total of 1000 RNA copies/final ddPCR well (supernatant standard) were added to 20 μL reactions containing: 1 × 1-step RT-ddPCR Supermix (Bio-Rad), 1 mMmanganese acetate, 900 nM of primers, and 250 nM of probe. Droplets wereamplified using a Mastercycler® nexus (Eppendorf, Hamburg, Germany) with the following cycling conditions: a reverse transcription reaction at 65 °C for 30 min, inactivation at 95 °C for 5 min, followed by 40 cycles of 30 s at 94 °C and 60 °C for 60 s, and a final droplet cure step of 10 min at 98 °C. Droplets were read and analyzed using the QuantaSoft software in the absolute quantification mode and compared to samples run in parallel with 1000 copies/final ddPCR well using the 2-step approach, in which the RT reaction was performed separately and undiluted RT product (5 μL from the same RT reaction) was added to ddPCR reactions (total volume of 20 μL).
Assay efficiency in presence of background RNA
Further validations were performed to determineeach assay’s sensitivity to inhibition by “background” cellular RNA, as would beexpected in clinical samples containing cells. The supernatant standard (1000 copies per 5 μL RT) was added to RT reactions with or without cellular RNA fromA549 cells (lung epithelial cell line) or donor PBMC (both added at 100 ng/μl per RT, or 500 ng per ddPCR well). RT reactions contained a total of 125 μL with 12.5 μL of 10× SuperScript III buffer (Invitrogen), 12.5 μL of 50 mMMgCl2, 6.25 μL of random hexamers (50 ng/μL; Invitrogen), 6.25 μL of 50 μMdT15, 6.25 μL of 10 mMdeoxynucleoside triphosphates (dNTPs), 3.125 μL of RNAseOUT (40 U/μL; Invitrogen), and 6.25 μL of SuperScript III RT (200 U/μL; Invitrogen). RT reactions were incubated at 25.0 °C for 10 min, 50.0 °C for 50 min, followed by an inactivation step at 85.0 °C for 5 min. Undiluted cDNA (5 μL) was added to each 20 μL ddPCR reaction and replicate ddPCR reactions were performed for each assay.
Assay validations in clinical diagnostic samples from SARS-CoV-2 infected individuals
To investigate the variations in geneexpression in clinical samples and determine whether our RT-ddPCR assays correlate with a clinical test, we obtained unused nucleic acid (ranging from 8.25 to 16.8 μL) that remained after extraction by the Abbott m2000 instrument from nasopharyngeal swabs from 3 individuals who tested positive with the Abbott m2000 Real TimeSARS-CoV-2 assay. Nucleic acid from these 3 individuals, who had Ct values of 11.59, 15.81, and 19.14 (respectively) on the Abbot assay, was tested using our SARS-CoV-2 primer/probe sets targeted to the 5′UTR, Main Proteinase, RdRp, S, M, N and 3′UTR regions. The available volume of nucleic acid was added into 85 μL RT reactions containing 1× SuperScript III buffer, 5 mMMgCl2, 2.5 ng of random hexamers, 2.5 μMdT15, 0.5 mMdeoxynucleoside triphosphates (dNTPs), 1U/μL of RNAseOUT, and 10U/μL of SuperScript III RT. RT reactions were performed as described in Section 2.5. Undiluted cDNA was divided evenly across wells (5 μL input into each ddPCR well, tested in duplicate) and ddPCR reactions were performed as described in Section 2.2. Absolute values obtained by ddPCR were adjusted to account for differing input volume of nucleic acid to yield theSARS-CoV-2 copies/μL extract. The log-linear relationship between viral load measured by RT-PCR (Abbott m2000 Real TimeSARS-CoV-2 assay) and RT-ddPCR was determined using GraphPad Prism (version 8.4.1).
Results
Detection limit, linearity, and efficiency of primer/probes sets using plasmid DNA
Two primer/probe sets were designed for each region (indicated in Fig. 1; ‘Primer locations’) except thespike protein polybasic cleavage site, for which only one primer/probe set was designed. To evaluate the performance of each primer/probe set at the PCR stage, each set was first tested on plasmid DNA. Since no commercially available plasmid contains the wholeSARS-CoV-2 genome, and construction of such a plasmid is technically challenging (due to the 30 kb length) and subject to higher biosafety restrictions, we constructed or purchased plasmids containing individual genes or regions. For each plasmid, the DNA concentration was measured by UV spectroscopy (NanoDrop) and the number of molecules (expected copies) was calculated using themolecular weight.Each primer/probe set was assessed for detection limit, dynamic range, linearity, and efficiency by measuring the absolute number of copies detected using droplet digital PCR (ddPCR) fromexpected inputs of serially diluted plasmid DNA. All primer/probes sets could detect as few as 1–10 copies and were linear over at least 3 orders of magnitude (R2>0.99 for all; Fig. 2
). Assay efficiencies (measured by the slope) ranged from 0.67 (“N-ORF9_8″) to 1.1 (“M-ORF5″). One primer/probe set for each region was selected for further study (Table 2
; rejected primer/probe sets are listed in Table S1) based on the overall efficiency (Fig. 2), separation between the positive and negative droplets (signal to noise ratio; Fig. S1), and specificity (Table S2). For the chosen assays, no positive droplets were detected with water or DNA from peripheral mononuclear blood cells (PBMC) from uninfected blood donors.
Fig. 2
Efficiency and linearity of SARS-CoV-2 panel of ddPCR primer/probe sets determined using plasmid DNA. Plasmids containing individual SARS-CoV-2 genes or regions were quantified by UV spectroscopy and diluted (expected copies) to test the absolute number of copies detected by each primer/probe set using duplicate ddPCR reactions (measured copies). Two primer/probe sets were tested for each region except the S-PBCS. One primer/probe set from each region (indicated by coloured symbol) was selected for subsequent experiments.
Table 2
SARS-CoV-2 primer/probe sets selected for validation using IVT and supernatant viral RNA.
Target Region
Primer Namea
SARS-CoV-2 coordinatesb
Sequence (5′-3′)
5′UTR
5′UTR_F
152−171
GTTGACAGGACACGAGTAAC
5′UTR_P
175−197
TCTATCTTCTGCAGGCTGCTTAC
5′UTR_R
220−241
GAAACCTAGATGTGCTGATGAT
Main proteinase/NSP5 (ORF1a)
NSP5_F
10366−10387
TCGCATTCAACCAGGACAGACT
NSP5_P
10399−10425
AGCTTGTTACAATGGTTCACCATCTGG
NSP5_R
10426−10450
GGGCCTCATAGCACATTGGTAAACA
RNA-dependent RNA polymerase / NSP12 (ORF1b)
RDRP_F
15341−15364
CCTCACTTGTTCTTGCTCGCAAAC
RDRP_P
15370−15393
ACGTGTTGTAGCTTGTCACACCGT
RDRP_R
15437−15456
TGAACCGCCACACATGACCA
S protein/ polybasic cleavage site (ORF 2)
S_PBCS_F
23554−23576
ACCCATTGGTGCAGGTATATGCG
S_PBCS_P
23603−23622
ACACTACGTGCCCGCCGAGG*
S_PBCS_R
23641−23664
GCACCAAGTGACATAGTGTAGGCA
M protein (ORF 5)
M-ORF5_F
26768−26789
CGCAATGGCTTGTCTTGTAGGC
M-ORF5_P
26794−26816
TGTGGCTCAGCTACTTCATTGCT
M-ORF5_R
26821−26840
CGTACGCGCAAACAGTCTGA
N protein (ORF 9)
N-ORF9_F
28833−28851
CATCACGTAGTCGCAACAG
N-ORF9_P
28885−28907
AACTTCTCCTGCTAGAATGGCTG
N-ORF9_R
28917−28934
AAGCAAGAGCAGCATCAC
3′UTR
3′UTR_F
29702−29723
GGAGGACTTGAAAGAGCCACCA
3′UTR_P
29727−29746
TTTCACCGAKGCCACRCGGA
3′UTR_R
29768−29788
GGCAGCTCTCCCTAGCATTGT
Reverse complement.
‘F’ = forward primer, ‘R’ = reverse primer, ‘P’= probe (fluorophore/quencher: FAM, MGB).
SARS-CoV2 coordinates indicated are based on the SARS-CoV2 reference sequence (NC_045512.2).
Efficiency and linearity of SARS-CoV-2 panel of ddPCR primer/probe sets determined using plasmid DNA. Plasmids containing individual SARS-CoV-2 genes or regions were quantified by UV spectroscopy and diluted (expected copies) to test the absolute number of copies detected by each primer/probe set using duplicate ddPCR reactions (measured copies). Two primer/probe sets were tested for each region except the S-PBCS. One primer/probe set fromeach region (indicated by coloured symbol) was selected for subsequent experiments.SARS-CoV-2 primer/probe sets selected for validation using IVT and supernatant viral RNA.Reverse complement.‘F’ = forward primer, ‘R’ = reverse primer, ‘P’= probe (fluorophore/quencher: FAM, MGB).SARS-CoV2 coordinates indicated are based on theSARS-CoV2 reference sequence (NC_045512.2).
Detection limit, linearity, and efficiency using in vitro transcribed and supernatant viral RNA
Selected primer/probe sets for each region were tested using standards prepared from in vitro transcribed (IVT) RNA generated from the designed plasmids (5′UTR, Main Proteinase, RDRP, S, M, N and 3′UTR; Fig. 3
and Table 2). Theexpected copy numbers were calculated using the RNA concentration (as measured by UV spectroscopy [NanoDrop] and confirmed by the Agilent Bioanalyzer) and themolecular weight. Using RT-ddPCR, all assays could detect as few as 10 copies of RNA and demonstrated linearity over 3–4 orders of magnitude (R2>0.999 for all; Fig. 3). Theefficiencies for detecting IVT RNA standards, which ranged from 0.18 (for Main Proteinase) to 0.96 (S-PBCS), weremore variable than those observed for plasmid DNA. No amplification was detected in ‘No RT’ control reactions containing 10,000 IVT RNA copies/well, confirming the absence of any contaminating plasmid DNA. However, it is worth noting that none of these IVT standards were polyadenylated (so they should not be reverse-transcribed by poly-dT) and some of the standards were very short (<300 base pairs), which would likely limit theefficiency with which they were reverse transcribed by random hexamers. In addition, some of themeasured differences in efficiency could reflect actual differences in the copy numbers present in the various IVT standards, which are difficult to determine precisely.
Fig. 3
Efficiency and linearity of SARS-CoV-2 panel of RT-ddPCR primer/probe sets determined using in vitro transcribed (IVT) RNA. RNA standards containing a given region or gene of SARS-CoV-2 were prepared by in vitro transcription from plasmids and quantified by independent means (UV spectroscopy and the Agilent Bioanalyzer). Various inputs of each IVT RNA standard (which were used to calculate ‘Expected Copies’ per ddPCR well) were reverse transcribed and replicate aliquots of cDNA were used to measure the absolute number of copies detected by each ddPCR assay (‘Measured Copies’). Each primer/probe set was tested using expected inputs of 1-104 copies per ddPCR well (except S-PBCS, which was tested at inputs of 2-2100 copies). Data represent average of duplicate wells from a representative experiment. S=slope, indicating assay efficiency. Each primer/probe set was tested in at least two independent experiments.
Efficiency and linearity of SARS-CoV-2 panel of RT-ddPCR primer/probe sets determined using in vitro transcribed (IVT) RNA. RNA standards containing a given region or gene of SARS-CoV-2 were prepared by in vitro transcription from plasmids and quantified by independent means (UV spectroscopy and the Agilent Bioanalyzer). Various inputs of each IVT RNA standard (which were used to calculate ‘Expected Copies’ per ddPCR well) were reverse transcribed and replicate aliquots of cDNA were used to measure the absolute number of copies detected by each ddPCR assay (‘Measured Copies’). Each primer/probe set was tested using expected inputs of 1-104 copies per ddPCR well (except S-PBCS, which was tested at inputs of 2-2100 copies). Data represent average of duplicate wells from a representativeexperiment. S=slope, indicating assay efficiency. Each primer/probe set was tested in at least two independent experiments.To circumvent these limitations, we prepared oneSARS-CoV-2 supernatant viral RNA standard containing all of the target regions by extracting RNA from cell-free supernatant from a cell line (Vero CCL81) infected in vitro with a SARS-CoV-2patient isolate (USA-WA1/2020). Theexpected copies in this supernatant standard were calculated using the Ct valuemeasured by the Abbott m2000 Real TimeSARS-CoV-2 viral load assay, which targets the N and RdRp genes using probes labelled with the same fluorophore. This supernatant virus standard enabled the preparation of common RT reactions containing specific inputs of SARS-CoV-2 RNA, from which aliquots of cDNA could be divided evenly across our panel of assays for simultaneous assessment of all target regions in ddPCR reactions (Fig. 4
).
Fig. 4
Efficiency and linearity of SARS-CoV-2 panel of ddPCR assays determined using SARS-CoV-2 supernatant viral RNA. A SARS-CoV-2 “supernatant” standard was prepared by extracting the RNA from the supernatant of an in vitro infection and quantified using the Abbott m2000 Real Time SARS-CoV-2 assay. Various inputs of the supernatant standard (which were used to calculate ‘Expected Copies’ per ddPCR well) were applied to a common reverse transcription reaction, from which aliquots of cDNA were used to measure the absolute number of copies detected by each ddPCR assay (measured copies). Each assay was tested with expected inputs of 10-104 copies/ddPCR well in duplicate. S (slope) and R2 are indicated for each assay. Representative data for n = 2 independent experiments are shown.
Efficiency and linearity of SARS-CoV-2 panel of ddPCR assays determined using SARS-CoV-2 supernatant viral RNA. A SARS-CoV-2 “supernatant” standard was prepared by extracting the RNA from the supernatant of an in vitro infection and quantified using the Abbott m2000 Real TimeSARS-CoV-2 assay. Various inputs of the supernatant standard (which were used to calculate ‘Expected Copies’ per ddPCR well) were applied to a common reverse transcription reaction, from which aliquots of cDNA were used to measure the absolute number of copies detected by each ddPCR assay (measured copies). Each assay was tested with expected inputs of 10-104 copies/ddPCR well in duplicate. S (slope) and R2 are indicated for each assay. Representative data for n = 2 independent experiments are shown.Expected inputs of 10–70,000 copies per well were used to measure the absolute copies of 5′UTR, Main Proteinase, RdRp, S, M, N and 3′UTR regions. All assays detected as few as 10 copies of the supernatant standard and were linear over four orders of magnitude (R2>0.999 for all). No amplification was detected in ‘No RT’ control reactions containing 10,000 RNA copies/well. Assay efficiencies (slope or regression coefficient derived from the relationship betweenmeasured and expected copies) were all greater than 1.0 (range: 1.05–2.46), likely because theestimate from the Abbott assay was lower than the true value and/or the RT-ddPCR assays aremoreefficient. In addition, themeasured copy numbers tended to increase from 5′ to 3′ targets (slopes: 5′UTR=1.05, RdRp = 1.20, S-PBCS = 1.32, M-ORF5 = 1.51, N=1.85−2.45, and 3′UTR = 2.26; Fig. 4). Since primer/probe sets for S-PBCS, M, N and 3′UTR were designed to detect both gRNA and sgRNA (Table 1), this observation could reflect the presence of 3′ subgenomic RNAs in the supernatant standard and/or greater efficiency of reverse transcription from the 3′ end of the genome.
Primer/probe specificity and false positive rate
To determine the non-specific reactivity of oligonucleotides (false positive rate) for each primer/probe set, we performed a median of 26 [range 18–32] ‘no template’ controls (NTC). These reactions were performed with both water (water NTC) and DNA or RNA isolated fromSARS-CoV-2-negative donor PBMC (DNA/RNA NTC) (Table S2). Except for oneexperiment using IVT RNA, where a total of three droplets were detected across duplicate NTC wells containing donor PBMC tested for Main Proteinase-NSP5, no other false positives were observed.
Comparison of new and existing SARS-CoV-2 primer/probe sets in ddPCR platform
Our assay panel included new primers/probes for thenucleocapsid (N-ORF9), which is targeted by existing diagnostic real-time PCR assays. We compared the performance of our ‘N-ORF9′ primers/probe to the primers/probes from the U.S. Center for Disease Control assays for thenucleocapsid (CDC-N1 and CDC-N2) (-Novel Coronavirus, 2020) using ddPCR. The N-ORF9 assay efficiency was similar to that of CDC-N1 and CDC-N2 for plasmid DNA, in between that of CDC-N1 and CDC-N2 for IVT RNA, and similar to CDC-N1 for the supernatant standard (Fig. 2, Fig. 3, Fig. 4).In addition, we compared our primers/probes for theRdRp to published primers/probes for the “IP2″ assay (Protocol, 2020) (which targets ORF1a) and “E-Sarbeco” (Corman et al., 2020) assay (which targets theE gene) using RT-ddPCR and the supernatant standard (Fig. 5
; Table 3
). TheIP2 (ORF1a) assay efficiency was 1.11, compared to 1.20–1.28 for our RdRp (ORF1b) and 1.36 for our main protease (ORF1a) assays (Fig. 4, Fig. 5). TheE-Sarbeco [ORF4] assay efficiency (1.08) was similar to theIP2, but may have been lower efficiency than our primer/probe sets targeting adjacent genes (S-PBCS [ORF2]: 1.32; M-ORF5: 1.51).
Fig. 5
Comparison of assay efficiency and linearity of published assays, ORF1a “nCoV_IP2″ and E gene and novel RDRP-NSP12 assay. The performance of our RDRP-NSP12 assay was compared to published primers/probes for ORF1a and the E gene in the ddPCR platform using common RT reactions containing supernatant viral RNA inputs of 2-2 × 104 copies/ddPCR well. S (slope) and R2 are indicated for each assay.
Table 3
SARS-CoV-2 assays from other sources.
Target Region
Primer Namea
SARS-CoV-2 coordinatesb
Sequence (5−3’)
Reference
N protein/ ORF 9
CDC_N1_F
28287−28306
GAC CCC AAA ATC AGC GAA AT
CDC_N1_P
28309−28330
ACC CCG CAT TAC GTT TGG ACC
CDC_N1_R
28335−28358
TCT GGT TAC TGC CAG TTG AAT CTG
40
CDC_N2_F
29164−29183
TTACAAACATTGGCCGCAAA
CDC_N2_P
29188−29210
ACAATTTGCCCCCAGCGCTTCAG
CDC_N2_R
29213−29230
GCG CGA CAT TCC GAA
ORF1a
nCoV_IP2−12669Fw
12690−12707
ATGAGCTTAGTCCTGTTG
nCoV_IP2−12696bProbe(+)
12717−12737
AGATGTCTTGTGCTGCCGGTA
41
nCoV_IP2−12759Rv
12780−12797
CTCCCTTTGTTGTGTTGT
E gene
E_Sarbeco_F1
26269−26394
ACAGGTACGTTAATAGTTAATAGCGT
E_Sarbeco_P1
26,332- 26,357
ACACTAGCCATCCTTACTGCGCTTCG
42
E_Sarbeco_R2
26360−26381
ATATTGCAGCAGTACGCACACA
= known mismatches as reported in 43 and in-house SARS-CoV2 multiple sequence alignment (mismatches identified were relative to sequence MT825091.1 from Iran).
‘F’ = forward primer, ‘R’ = reverse primer, ‘P’= probe (fluorophore/quencher: FAM, MGB).
SARS-CoV-2 coordinates indicated are based on the SARS-CoV-2 reference sequence (NC_045512.2).
Comparison of assay efficiency and linearity of published assays, ORF1a “nCoV_IP2″ and E gene and novel RDRP-NSP12 assay. The performance of our RDRP-NSP12 assay was compared to published primers/probes for ORF1a and theE gene in the ddPCR platform using common RT reactions containing supernatant viral RNA inputs of 2-2 × 104 copies/ddPCR well. S (slope) and R2 are indicated for each assay.SARS-CoV-2 assays from other sources.= known mismatches as reported in 43 and in-houseSARS-CoV2multiple sequence alignment (mismatches identified were relative to sequenceMT825091.1 from Iran).‘F’ = forward primer, ‘R’ = reverse primer, ‘P’= probe (fluorophore/quencher: FAM, MGB).SARS-CoV-2 coordinates indicated are based on theSARS-CoV-2 reference sequence (NC_045512.2).Finally, we assessed the performance of our SARS-CoV-2 primer/probe sets, 5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, CDC_N2 and 3′UTR, in a ‘1-step’ dd-RT-PCR (where RNA is encapsulated in droplets and then reverse transcription occurs in each droplet using the reverse primer for PCR) compared to the ‘2-step’ approach (reverse transcription using random hexamers and poly-dT, followed by droplet encapsulation and ddPCR). We observed that the 2-step approach was similar to or outperformed the 1-step dd-RT-PCR method in terms of detection at the same input (1000 copies) for 5′UTR, RdRp, S-PBCS, M-ORF5, and 3′UTR (Fig. S2). The 1-step method enabled higher detection of ProNSP5, N-ORF9 and CDC_N2 relative to the 2-step approach, but the 1-step showed more false positives (N-ORF9 and CDC_N2) or poorer signal to noise ratio (ProNSP5). Overall, the performance of theSARS-CoV-2 panel of ddPCR assays was superior using a 2-step approach compared to a 1-step dd-RT-PCR approach.
Lower limit of detection of SARS-CoV-2 in RNA
Our validation studies included SARS-CoV-2 RNA inputs of as few as 1 copy per ddPCR reaction (Fig. 2, Fig. 3). Weestimated the lower limit of detection (LLOD) for each assay in our panel based on data for all replicates tested at 10 copy and 1 copy inputs (Table S3). At 10 copies, all of our assays detected SARS-CoV-2 in ≥85.7 % of tests (range = 85.7–100 %). At 1 copy input, our assays detected SARS-CoV-2 in ≥25 % of tests (range = 25–88 %), underscoring the high sensitivity of our assays.
Effect of background RNA on assay efficiencies
Next, we assessed theefficiencies of the primer/probe sets in our panel (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) and that of reported assays (IP2_ORF1a, CDC_N1, CDC_N2, and E_Sarbeco) in the presence of “background” RNA from uninfected cells (Fig. 6
). At a constant input of 1000 copies of theSARS-CoV-2 viral supernatant RNA, we determined theeffect of adding cellular RNA (100 ng per μL of RT) extracted from PBMC or a lung epithelial cell line (A549 cells). All assays tested (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, 3′UTR, IP2_ORF1a, CDC_N1, CDC_N2, and E_Sarbeco) showed slightly greater efficiency in the presence of 100 ng/μL background RNA fromeither PBMC or A549 cells compared to the supernatant standard with no background RNA (Fig. 6). No false positives were detected with 100 ng/μL RT RNA from PBMC, while 1–4 droplets were sometimes detected in the RNA fromA549 cells using some assays (Main Proteinase, RdRp, S-PBCS). Overall, these data suggest that in samples derived from individuals infected with SARS-CoV-2, the primer/probe sets for 5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR are likely to beminimally inhibited by background RNA, making them ideally suited to a diverse range of clinical samples.
Fig. 6
Effect of background RNA on ddPCR assay performance. We simultaneously tested primer/probe sets in our panel (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) against reported assays (CDC_N1, CDC_N2, E_Sarbeco, and IP2_ORF1a) in the presence and absence of background RNA. Each primer/probe set was tested with a constant input of SARS-CoV-2 supernatant virus standard (predicted to yield 1000 copies/ddPCR well) in the presence or absence of background RNA from PBMC or a lung epithelial cell line (A549) added at a concentration of 100 ng/μL of RT reaction (500 ng/ddPCR well, or 1 μg for the 2 replicate wells used to test each assay). Negative controls included water, 1 μg/assay PBMC RNA, and 1 μg/assay A549 RNA. Primer/probe sets are indicated on x-axis in order from 5′ to 3′ and dotted line indicates 1000 SARS-CoV-2 RNA copy input. Error bars represent standard deviation from duplicate wells.
Effect of background RNA on ddPCR assay performance. We simultaneously tested primer/probe sets in our panel (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) against reported assays (CDC_N1, CDC_N2, E_Sarbeco, and IP2_ORF1a) in the presence and absence of background RNA. Each primer/probe set was tested with a constant input of SARS-CoV-2 supernatant virus standard (predicted to yield 1000 copies/ddPCR well) in the presence or absence of background RNA from PBMC or a lung epithelial cell line (A549) added at a concentration of 100 ng/μL of RT reaction (500 ng/ddPCR well, or 1 μg for the 2 replicate wells used to test each assay). Negative controls included water, 1 μg/assay PBMC RNA, and 1 μg/assay A549 RNA. Primer/probe sets are indicated on x-axis in order from 5′ to 3′ and dotted line indicates 1000 SARS-CoV-2 RNA copy input. Error bars represent standard deviation from duplicate wells.
Strong correlation between viral loads measured by RT-ddPCR and real-time PCR in clinical diagnostic samples
To compare our panel of SARS-CoV-2 primer/probe sets (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) with a clinical test, we obtained unused nucleic acid that had beenextracted by the Abbott m2000 molecular platform from nasopharyngeal swabs from threeSARS-CoV-2-infected individuals and remained after clinical testing using the Abbott Real TimeSARS-CoV-2 assay. Using this nucleic acid, wemeasured RNA levels of the 5′UTR, Main Proteinase, RdRp, S, M, N and 3′UTR regions using our RT-ddPCR assays. (Fig. 7
). As observed with the supernatant standard, transcripts containing themost 3′ regions (N-ORF9 and 3′UTR) tended to be present at higher copy numbers, while those containing the 5′UTR tended to be present at lower levels. However, as expected given the small sample size (n = 3), we observed variation in transcript levels both between individuals and relative to the supernatant virus standard. For example, levels of S-PBCS RNA tended to be lower than those of themore 5′ Main Protease (NSP5) transcripts. These potential differences in SARS-CoV-2 transcription profilemay reflect changes in viral dynamics over the course of infection or inter-individual variability in viral sequences or host responses, and should be confirmed in future studies using longitudinal samples frommore individuals.
Fig. 7
Measurement of SARS-CoV-2 targets in three SARS-CoV-2 infected individuals determined using RT-ddPCR. Unused nucleic acid (ranging from 8.6-16.8 μL) extracted by the Abbott m2000 platform from nasopharyngeal swabs from SARS-CoV-2 infected individuals (n = 3) was used in a common RT reaction for each individual. Resulting cDNA was divided evenly across reactions for the seven assays in our panel and targets were measured using ddPCR. Colored symbols indicate SARS-CoV-2 target region. Copy numbers from each assay are expressed as SARS-CoV-2 copies per μL of nucleic acid and grouped for each individual (x-axis). Threshold cycle (Ct) values, as determined by Abbott Real Time SARS-CoV-2 viral load assay, are indicated above each individual’s dataset.
Measurement of SARS-CoV-2 targets in threeSARS-CoV-2 infected individuals determined using RT-ddPCR. Unused nucleic acid (ranging from 8.6-16.8 μL) extracted by the Abbott m2000 platform from nasopharyngeal swabs fromSARS-CoV-2 infected individuals (n = 3) was used in a common RT reaction for each individual. Resulting cDNA was divided evenly across reactions for the seven assays in our panel and targets weremeasured using ddPCR. Colored symbols indicateSARS-CoV-2 target region. Copy numbers fromeach assay areexpressed as SARS-CoV-2 copies per μL of nucleic acid and grouped for each individual (x-axis). Threshold cycle (Ct) values, as determined by Abbott Real TimeSARS-CoV-2 viral load assay, are indicated aboveeach individual’s dataset.Next, we determined the correlation between the Ct value as measured by the Abbott assay and SARS-CoV-2 copy numbers as determined by RT-ddPCR. For each target, this relationship was modelled using linear regression following log transformation of SARS-CoV-2 copies/μL extract [where y = Log2(x)] (Fig S3). The coefficient of determination (R2) for each model was ≥0.93 for all targets, underscoring the log-linear relationship between ddPCR-based SARS-CoV-2 transcript levels and Ct values in diagnostic specimens. Taken together, these data strongly underscore the sensitivity of our primer/probe sets (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR), demonstrating the ability to detect all targets using minimal RNA inputs (effectively 1.2−2.4 μL RNA input per assay), and their strong correlation with Ct values obtained by real-time PCR using clinical assays. Furthermore, these data highlight that delineation of theSARS-CoV-2 transcription profile in samples across differing timepoints within and betweenparticipantsmay yield valuable insight into viral transcription dynamics across the course of SARS-CoV-2 infection.
Discussion
The 2019 SARS-CoV-2 outbreak has heralded the development of an array of diagnostic molecular tools to study this novel coronavirus. However, currently described PCR-based diagnostic assays are qualitative or semi-quantitative, are limited to the simultaneous detection of one or two regions (typically RdRp and N), and do not account for variation in gene copy numbers due to subgenomic transcription. Here, we report a panel of new primer/probe sets that span theSARS-CoV-2 genome and target important nongenic regions, non-structural genes found only in genomic RNA, and structural genes that are also found in different subgenomic RNAs.We used thesenew primers/probes for RT-ddPCR rather than qRT-PCR because ddPCR provides absolute quantification (does not require an external calibrator), tends to tolerate sequencemismatches in primer/probe sequences better than qRT-PCR, and may bemore precise at low copies, while providing similar sensitivity and reproducibility (Yukl et al., 2018; Arvia et al., 2021). During validation of these assays with multiple different standards, we sometimes found that theefficiency of the same assay varied across different standards. The greatest variation was observed using in vitro transcribed standards, whose varying length and lack of a poly-A tail may affect RT efficiency (Fig. 3). Together, these observations may reflect differences in the nature of the standards (DNA, short in vitro transcribed non-polyadenylated RNA, or supernatant RNA) as well as the difficulty in determining theexact number of copies in an external standard; the latter issue highlights a major advantage of the absolute quantification provided by ddPCR.On all standards tested, the sevenSARS-CoV-2 RT-ddPCR assays wereextremely sensitive (down to 1–10 copies) and linear over 3–4 orders of magnitude. All seven assays showed no inhibition by up to 500,000 cell equivalents of RNA per ddPCR well, suggesting that these assays could beextremely useful for SARS-CoV-2 research. Whilemost existing clinical assays for SARS-CoV-2 use qPCR because it is less expensive and may have fewer false positives than ddPCR, it is likely that the primer/probe sets described here would also work well in qPCR assays for research or clinical testing.The utility of assays that target multiple genomic regions is supported by studies demonstrating loss in sensitivity of published assays owing to mutations that could affect primer annealing. For instance, a recent study found that 34.38 % (11,627) of SARS-CoV-2 genomes featured a singlemutation capable of affecting annealing of a PCR primer in tested assays from the World Health Organization, Centers for Disease Control and Prevention, National Microbiology Data Center, and Hong Kong University (Penarrubia et al., 2020). Another study found single nucleotidemismatches in 0.2 % and 0.4 % of the surveyed SARS-CoV-2 sequences compared to the CDC-N1 probe and reverse primer, respectively, and 0.4 % of those sequences compared to Charité’s E_Sarbeco_R primer (Vogels et al., 2020). Therefore, a strategy that can target multiple genomic regions may have utility in sensitive detection of SARS-CoV-2.Extensive, well-designed studies have assessed the analytical sensitivity and efficiency of existing RT-qPCR primer-probes sets (Vogels et al., 2020; Uhteg et al., 2020; Etievant et al., 2020; Kudo et al., 2020) and explored adaptation of such assays to the ddPCR platform (Suo et al., 2020). In this study, we describe how some of the available diagnostic assays compare to our novel SARS-CoV-2 assays and report how a multi-assay approach using the ddPCR platform could significantly advance our understanding of SARS-CoV-2 transcription and replication. While highly-sensitive PCR-based assays might not beessential to identify SARS-CoV-2-infected individuals in the transmissible/contagious phase of infection, quantitative assays capable of detecting very low copies of SARS-CoV-2 will be particularly useful in understanding the course of infection and correlates of disease progression. Existing clinical assays are quite sensitive for detecting COVID-19 during the first 1–3 weeks of infection, but often becomenegative thereafter (Xiao et al., 2020; Zhou et al., 2020b; Zheng et al., 2020). However, in some cases, individuals who test positivemay have a subsequent negative test followed by another positive or alternating positive and negative tests (Lan et al., 2020; Alvarez-Moreno and Rodríguez-Morales, 2020; Huang et al., 2020). Some individuals may also have prolonged viral shedding after symptomatic relief, with one study noting a patient with qRT-PCR positivity detected in upper respiratory tract samples 83 days post-symptom onset (Li et al., 2020). Therefore, sensitive RT-ddPCR assays such as those described in this study could be of great utility in studying the course of infection two or more weeks after the resolution of acute symptoms. Another advantage of the approach described here is that it permits a single sample to be simultaneously assayed for multiple targets, which may increase sensitivity and specificity compared to measuring only 1–2 targets while helping to delineate the levels of various SARS-CoV-2 coding regions in infectedpatient samples from cross-sectional or longitudinal studies. As such, this panel of RT-ddPCR assays can be applied to a diverse range of clinically relevant samples in which SARS-CoV-2 RNA may be present in low or high abundance.Using both the supernatant virus standard and clinical samples from the nasopharynx, we tended to observe higher copy numbers for targets at the 3′ end of the genome (N, 3′UTR) compared to the 5′ end (5′UTR, main protease) (Figs. 4,6,7 and S2). This discrepancy is not explained by differences in PCR efficiency, since theefficiency of the N assay on plasmid DNA was actually lower than that of assays for the 5′UTR or main protease. It is possible that reverse transcription is moreefficient for assays at the 3′ end (perhaps due to moreefficient reverse transcription from thepoly-dT), although random hexamers should bias towards the 5′ end and the combination has been used to prevent bias towards either the 5′ or 3′ end of the 9.6 kb genome of HIV-1 (Yukl et al., 2018; Telwatteet al., 2018). Moreover, the supernatant levels of 3′ RNA regions also tended to be higher than those of 5′ regions whenmeasured using a “1-step” approach in which reverse transcription was performed using specific reverse primers. It seems likely that the 3′ assays measure higher copies because they are detecting subgenomic RNAs, which are reported to beexcluded from virions (Wölfel et al., 2020) but may have been present in the supernatant if they were released from dying cells or actively secreted (for example, in exosomes).This excess of targets corresponding to sgRNAs may bemuch greater in samples that contain more cells or cell-associated RNA, and it has important implications for clinical testing and research. For targets in the 3′ third of the genome that are transcribed as sgRNAs, regions that are further downstream (3′) may be incorporated into a greater variety of sgRNAs and therefore should be present at higher copy numbers, so assays targeting these regions may bemore sensitive to detect infection (Kimet al., 2020b). On the other hand, sgRNAs are not infectious, so assays targeting more 5′ regions that are transcribed only as genomic RNA (ORF 1a and 1b) may correlate better with infectivity.The clinical implications of SARS-CoV-2 subgenomic RNA transcription are currently unknown. The synthesis of subgenomic RNAs is a common strategy employed by positive-sense RNA viruses to transcribe their 3′ proximal genes that encode products essential for particle formation and pathogenesis (Koev and Miller, 2000; Pasternak et al., 2006; Gorbalenya et al., 2006). In coronaviruses such as mouse hepatitis virus (MHV), the synthesis of subgenomic RNAs may function as important mediators of positive strand synthesis (Baric and Yount, 2000), and more broadly, members of the order Nidovirales (including Coronaviridae) feature high levels of redundancy to ensure continued protein synthesis even in theevent of point mutations in regulatory sequences (Di et al., 2017). The characterization of theSARS-CoV-2 transcription profile in differing patient samples over the course of infectionmay provide insight into themolecular mechanisms by which SARS-CoV-2 regulates geneexpression through differential transcription of genomic and subgenomic RNAs, and how this differential geneexpression may contribute to pathogenesis.We found that our panel of primer/probe sets (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) performed better in the presence of background RNA, irrespective of origin (blood or epithelial cells, Fig. 6). This finding accords with other studies that haveextensively validated theeffect of differing variables on RT efficiency and suggest that the presence of some background RNA may increaseefficiency of the reverse transcription step (Kuang et al., 2018; Miranda and Steward, 2017; Levesque-Sergerieet al., 2007). While theefficiency of our panel of RT-ddPCR assays (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR) tended to decrease with RNA concentrations above 100 ng/μL RT, even at 500 ng RNA/μL RT, these assays still performed better than in the absence of any background RNA, suggesting that they are ideally suited for testing samples from different tissues where the levels of genomic RNA may differ considerably. Furthermore, our comparison of viral loads obtained by RT-ddPCR and qRT-PCR demonstrates the strong correlation between data obtained from these two platforms and theminimal RNA input required to yield robust data using our RT-ddPCR assays.Limitations of this study should be acknowledged. In order to test our primer/probe sets in parallel with published assays (total of 11 assays) in background RNA experiments (Fig. 6), we increased RT reaction volumes from 50−70 μL to 125 μL to accommodate the additional assays. In the absence of background RNA, theefficiency appeared to be higher in the 50−70 μL RT reactions (Fig. 4, Fig. 5, >100 % efficiency for all assays) than the 125 μL reactions (Fig. 7; median efficiency = 88 % [range: 60–133 %]). If the discrepancy is not due to a difference in the actual input of the standard, it is possible that larger reaction volumes lead to lower efficiency in reverse transcription. However, for application to patient samples, our core panel of 7 RT-ddPCR assays (5′UTR, Pro-NSP5, RdRp-NSP12, S-PBCS, M-ORF5, N-ORF9, and 3′UTR; Table 1) is sufficient to provide a detailed view of the transcription profile of SARS-CoV-2, so preparation of RT reactions >70 μL will likely be unnecessary.For our study of the levels of SARS-CoV-2 coding regions and correlation with the Ct value as determined by the Abbott m2000 SARS-CoV-2 Real Time Assay, a limited amount of nucleic acid was available from only a small number (n = 3) of de-identified individuals. Despite this small sample size, we demonstrated both the sensitivity of all assays in our panel and their strong correlation with Ct values in diagnostic specimens. These data allude to potential differences in the transcription dynamics of SAR-CoV-2 during the course of infection and merit further investigation.
Conclusions
We developed a panel of sensitive, quantitative RT-ddPCR-based SARS-CoV-2 assays that collectively span the genome and target nongenic and genic regions, including important enzymes transcribed only as genomic RNA and structural genes that are also transcribed as different subgenomic RNAs. These assays can serve as novel molecular tools to investigateSARS-CoV-2 infection, replication dynamics, and geneexpression to better understand the viral dynamics and pathogenesis of SARS-CoV-2 over the course of infection. Future studies employing these assays will enhance our understanding of SARS-CoV-2 replication and transcription and may also inform the development of improved diagnostic tools and therapeutics.
Author_statement
The study authors had no subject contact or access to any personally-identifiable information (Category 4, IRB exempt). The data generated and/or analyzed during the current study are included in themanuscript or supplements. All authors have seen and approved the content and have contributed to the work. The authors declare no conflicts of interest. None of the content has been published or is under consideration for publication elsewhere.
Funding
This research was supported by funds from theEmergency COVID-19 Research Seed Funding of the University of California (Grant Number R00RG3113 [ST]). The investigators received salary support from the U.S. Department of Veterans Affairs (SAY and JKW), the National Institute of Diabetes and Digestive and Kidney Diseases at the NIH (R01DK108349 [SAY, JKW], R01DK120387 [SAY]), the National Institute of Allergy and Infectious Diseases at the NIH (R01AI132128 [SAY, JKW]), the UCSF/GIVI Center for AIDS Research (CFAR; Grant# P30 AI027763 [ST]; award #A120163 [PI: Paul Volberding]), and the California HIV/AIDS Research Program (Grant number BB19-SF-009 [ST]). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of themanuscript.
Ethics
This study included the use of de-identified nucleic acid from threeSARS-CoV-2-infected individuals. The study authors had no subject contact or access to any personally-identifiable information (Category 4, IRB exempt).
CRediT authorship contribution statement
Sushama Telwatte: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Supervision, Validation, Visualization, Writing - original draft. Nitasha Kumar: Investigation, Writing - review & editing. Albert Vallejo-Gracia: Investigation, Writing - review & editing. G. Renuka Kumar: Investigation, Writing - review & editing. Chuanyi M. Lu: Resources, Writing - review & editing. Melanie Ott: Resources, Supervision, Writing - review & editing. Joseph K. Wong: Resources, Supervision, Writing - review & editing. Steven A. Yukl: Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Writing - original draft.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Authors: Sushama Telwatte; Holly Anne Martin; Ryan Marczak; Parinaz Fozouni; Albert Vallejo-Gracia; G Renuka Kumar; Victoria Murray; Sulggi Lee; Melanie Ott; Joseph K Wong; Steven A Yukl Journal: Methods Date: 2021-04-18 Impact factor: 4.647
Authors: Michael D Lu; Sushama Telwatte; Nitasha Kumar; Fernanda Ferreira; Holly Anne Martin; Gayatri Nikhila Kadiyala; Adam Wedrychowski; Sara Moron-Lopez; Tsui-Hua Chen; Erin A Goecker; Robert W Coombs; Chuanyi M Lu; Joseph K Wong; Athe Tsibris; Steven A Yukl Journal: PLoS One Date: 2022-04-27 Impact factor: 3.752
Authors: Laura A E Van Poelvoorde; Mathieu Gand; Marie-Alice Fraiture; Sigrid C J De Keersmaecker; Bavo Verhaegen; Koenraad Van Hoorde; Ann Brigitte Cay; Nadège Balmelle; Philippe Herman; Nancy Roosens Journal: Curr Issues Mol Biol Date: 2021-11-06 Impact factor: 2.976
Authors: Brett A Duguay; Adrian Herod; Eric S Pringle; Susan M A Monro; Marc Hetu; Colin G Cameron; Sherri A McFarland; Craig McCormick Journal: Viruses Date: 2022-01-08 Impact factor: 5.048