| Literature DB >> 33924636 |
Mathieu Gand1, Kevin Vanneste1, Isabelle Thomas2, Steven Van Gucht2, Arnaud Capron3, Philippe Herman4, Nancy H C Roosens1, Sigrid C J De Keersmaecker1.
Abstract
For 1 year now, the world is undergoing a coronavirus disease-2019 (COVID-19) pandemic due to the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The most widely used method for COVID-19 diagnosis is the detection of viral RNA by RT-qPCR with a specific set of primers and probe. It is important to frequently evaluate the performance of these tests and this can be done first by an in silico approach. Previously, we reported some mismatches between the oligonucleotides of publicly available RT-qPCR assays and SARS-CoV-2 genomes collected from GISAID and NCBI, potentially impacting proper detection of the virus. In the present study, 11 primers and probe sets investigated during the first study were evaluated again with 84,305 new SARS-CoV-2 unique genomes collected between June 2020 and January 2021. The lower inclusivity of the China CDC assay targeting the gene N has continued to decrease with new mismatches detected, whereas the other evaluated assays kept their inclusivity above 99%. Additionally, some mutations specific to new SARS-CoV-2 variants of concern were found to be located in oligonucleotide annealing sites. This might impact the strategy to be considered for future SARS-CoV-2 testing. Given the potential threat of the new variants, it is crucial to assess if they can still be correctly targeted by the primers and probes of the RT-qPCR assays. Our study highlights that considering the evolution of the virus and the emergence of new variants, an in silico (re-)evaluation should be performed on a regular basis. Ideally, this should be done for all the RT-qPCR assays employed for SARS-CoV-2 detection, including also commercial tests, although the primer and probe sequences used in these kits are rarely disclosed, which impedes independent performance evaluation.Entities:
Keywords: COVID-19; RT-qPCR; SARS-CoV-2; WGS data; bioinformatics tool; detection; in silico specificity evaluation; mismatches; primers and probes; variants
Mesh:
Substances:
Year: 2021 PMID: 33924636 PMCID: PMC8069896 DOI: 10.3390/genes12040565
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
List of investigated primers and probe sets.
| Assay * | Target | Oligonucleotide Sequence (5’–3’) | Length (bp) | Amplicon | Source |
|---|---|---|---|---|---|
| 1 | N | Fw GGGGAACTTCTCCTGCTAGAAT | 22 | 28880–28979 | China CDC, China [ |
| 2 | RdRp-P2 | Fw GTGARATGGTCATGTGTGGCGG | 22 | 15430–15530 | Charité Hospital, Germany [ |
| 2 *** | E *** | Fw ACAGGTACGTTAATAGTTAATAGCGT | 26 | 26268–26381 | |
| 2 | N | Fw CACATTGGCACCCGCAATC | 19 | 28705–28833 | |
| 3 | RdRp_IP2 | Fw ATGAGCTTAGTCCTGTTG | 18 | 12689–12797 | Institut Pasteur Paris, |
| 3 | RdRp_IP4 | Fw GGTAACTGGTATGATTTCG | 19 | 14079–14186 | |
| 4 | N-1 | Fw GACCCCAAAATCAGCGAAAT | 20 | 28286–28358 | US CDC, |
| 4 | N-2 | Fw TTACAAACATTGGCCGCAAA | 20 | 29163–29230 | |
| 4 | N-3 | Fw GGGAGCCTTGAATACACCAAAA | 22 | 28680–28752 | |
| 8 | S | Fw CCTACTAAATTAAATGATCTCTGCTTTACT | 30 | 22711–22869 | Chan et al. |
| 9 | ORF1a | Fw AGAAGATTGGTTAGATGATGATAGT | 25 | 3192–3310 | Lu et al. |
*: to allow comparisons with the study of Gand et al., 2020 [46], the assay numbering as used in this previous study was conserved. **: starting and ending position of the sequence amplified by the corresponding forward and reverse primers, in the NCBI SARS-CoV-2 reference sequence NC_045512. ***: this primers and probe set is also used in the RT-qPCR test from Institut Pasteur Paris (France) [14]. Fw: forward primer; Rv: reverse primer; P: probe.
Inclusivity evaluation of primers and probe sets using polymeraSe Chain Reaction Evaluation through largE-scale miNing of gEnomic Data (SCREENED).
| Assay | Target | Negative RT-qPCR Result †,* | Inclusivity | Inclusivity |
|---|---|---|---|---|
| 1 | N | 30,445 (13%) | 63.89% | 86.03% |
| 2 | RdRp-P2 | 96 (9.4%) | 99.89% | 100% |
| 2 | E | 8 (0%) | 99.99% | 100% |
| 2 | N | 316 (2.2%) | 99.63% | 99.81% |
| 3 | RdRp_IP2 | 169 (6.5%) | 99.80% | 99.88% |
| 3 | RdRp_IP4 | 44 (0%) | 99.95% | 100% |
| 4 | N-1 | 181 (2.2%) | 99.79% | 99.73% |
| 4 | N-2 | 833 (0%) | 99.01% | 99.96% |
| 4 | N-3 | 100 (1%) | 99.88% | 100% |
| 8 | S | 28 (3.6%) | 99.97% | 100% |
| 9 | ORF1a | 95 (0%) | 99.89% | 100% |
†: number of representative genomes that produced a theoretical negative RT-qPCR signal according to the SCREENED settings (detailed in Section 2.4) and, consequently, considered as false negative (FN). The percentage of the genomes resulting in negative results and belonging to one of the new variants is indicated between brackets. All turned out to belong to the B.1.1.7 lineage (20I/501Y.V1). * results obtained in the present study with 84,305 representative SARS-CoV-2 genomes collected between 7 June 2020 and 7 January 2021. ** results obtained in the previous study with 2569 representative SARS-CoV-2 genomes collected up to 7 April 2020 [46].
Diversity among the sequences amplified by the evaluated primers and probe sets.
| Assay | 1 | 2 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Target | N | RdRp-P2 | E | N | RdRp | RdRp | N-1 | N-2 | N-3 | S | ORF1a |
| Clusters * | 496 | 89 | 116 | 291 | 157 | 124 | 199 | 123 | 152 | 160 | 231 |
| First cluster ** | 30.3% | 97.5% | 98.2% | 94.7% | 98.4% | 98.4% | 96.1% | 95.7% | 97.1% | 98.6% | 91.1% |
| Second cluster ** | 28.8% | 1.3% | 0.5% | 0.7% | 0.2% | 0.3% | 0.5% | 1.8% | 0.7% | 0.2% | 4.8% |
| Third cluster ** | 26.9% | 0.4% | 0.4% | 0.6% | 0.1% | 0.1% | 0.4% | 0.9% | 0.2% | 0.1% | 1.7% |
*: number of amplicon clusters produced by SCREENED for each evaluated primers and probe set. **: Repartition of the number of amplicons among the clusters for the 3 largest clusters for each evaluated primers and probe set.
Presence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants’ mutations in the sequence amplified by the evaluated primers and probe sets.
| Assay | Target | Amplicon Start Pos. * | Amplicon End Pos. * | Nucleotide Change* | Amino Acid Change | Impact on Primer or Probe Sequences (5’–3’) ** | FN Results *** |
|---|---|---|---|---|---|---|---|
| 9 | ORF1a | 3192 | 3310 | C3267T v1 | T1001I | None | No |
| 1 | N | 28880 | 28979 | C28977T v1 | S235F | Rv CA | Depending on other mutations |
| 8 | S | 22711 | 22869 | G22813T v2 | K417N | P CGCTCCAGGGCAAACTGGAAA | No |
| 1 | N | 28880 | 28979 | C28887T v2 | T205I | Fw GGGGAA | No |
| 3 | RdRp_IP2 | 12689 | 12797 | C12778T v3 | Synonymous | None | No |
| 8 | S | 22711 | 22869 | A22812C v3 | K417T | P CGCTCCAGGGCAAACTGGAA | No |
*: positions were determined according to SARS-CoV-2 reference sequence NC_045512.2. **: The nucleotide change is indicated in the primer or probe sequence. If “none” is indicated, the nucleotide change is located in the amplicon sequence but outside primers and probe annealing sites. ***: a “No” is indicated if no mutation was detected in the annealing site or if a mutation in the annealing site would not lead to a FN result according to the criteria applied in SCREENED (see Section 2.4). A “Depending on other mutations” is indicated if a mutation in the annealing site would result in a FN result when combined with another mismatch identified with SCREENED (Supplementary Material File S1). v1: Mutation in 20I/501Y.V1 variant [30,32]; v2: mutation in 20H/501Y.V2 variant [33]; v3: mutation in variant 20J/501Y.V3 [34]; Fw: forward primer; Rv: reverse primer; P: probe.