Literature DB >> 35570946

Impact of SARS-CoV-2 Mutations on PCR Assay Sequence Alignment.

Daniel Antonio Negrón1, June Kang1, Shane Mitchell1, Mitchell Y Holland1, Stephen Wist1, Jameson Voss2, Lauren Brinkac1, Katharine Jennings1, Stephanie Guertin3, Bruce G Goodwin2, Shanmuga Sozhamannan2,4.   

Abstract

Real-time reverse transcription polymerase chain reaction (RT-PCR) assays are the most widely used molecular tests for the detection of SARS-CoV-2 and diagnosis of COVID-19 in clinical samples. PCR assays target unique genomic RNA regions to identify SARS-CoV-2 with high sensitivity and specificity. In general, assay development incorporates the whole genome sequences available at design time to be inclusive of all target species and exclusive of near neighbors. However, rapid accumulation of mutations in viral genomes during sustained growth in the population can result in signature erosion and assay failures, creating situational blind spots during a pandemic. In this study, we analyzed the signatures of 43 PCR assays distributed across the genome against over 1.6 million SARS-CoV-2 sequences. We present evidence of significant signature erosion emerging in just two assays due to mutations, while adequate sequence identity was preserved in the other 41 assays. Failure of more than one assay against a given variant sequence was rare and mostly occurred in the two assays noted to have signature erosion. Assays tended to be designed in regions with statistically higher mutations rates. in silico analyses over time can provide insights into mutation trends and alert users to the emergence of novel variants that are present in the population at low proportions before they become dominant. Such routine assessment can also potentially highlight false negatives in test samples that may be indicative of mutations having functional consequences in the form of vaccine and therapeutic failures. This study highlights the importance of whole genome sequencing and expanded real-time monitoring of diagnostic PCR assays during a pandemic.
Copyright © 2022 Negrón, Kang, Mitchell, Holland, Wist, Voss, Brinkac, Jennings, Guertin, Goodwin and Sozhamannan.

Entities:  

Keywords:  COVID-19; PCR; RT-PCR; SARS-CoV-2; biosurveillance; diagnostics; pandemic; signature

Mesh:

Year:  2022        PMID: 35570946      PMCID: PMC9096222          DOI: 10.3389/fpubh.2022.889973

Source DB:  PubMed          Journal:  Front Public Health        ISSN: 2296-2565


Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), like other Coronaviruses, is an enveloped virus with a linear, positive-sense single-stranded RNA genome. SARS-CoV-2 genomes range from 27 to 32 kb with an arrangement that is co-linear with other Coronaviruses. The genome is flanked by untranslated regions (UTRs) and protected by a 5′ 7-methylguanosine (m7G) cap and 3′ poly-A tail (1, 2). Genomic translation of ORF1ab yields polyproteins pp1a and pp1ab, which undergo autoproteolysis to yield non-structural (nsp) proteins involved in expression and replication (2). Transcription of the S (spike), E (envelope), M (membrane), and N (nucleocapsid) yields structural proteins for dissemination (2). Additional accessory protein genes include ORF 3a, 6, 7a, 7b, 8, and 9b (2, 3). Whole genome sequencing (WGS) based genetic surveillance is critical for tracking and forecasting pathogen evolution. Phylogenetic analyses can estimate the spatiotemporal mobility of pathogens between communities around the world over the course of a pandemic. This has been essential for identifying a circulating Variant Being Monitored (VBM), Variant of Concern (VoC), or Variant of Interest (VoI) of SARS-CoV-2, the causative agent of coronavirus disease 2019 (COVID-19). Since the emergence of SARS-CoV-2 in late 2019, sustained transmission among the human host has generated numerous variants with specific phenotypic attributes. Mutations that potentially confer increased transmission, pathogenicity (4), immune escape, or resistance to therapeutics can be identified when sequenced and interpreted by combining in silico inferences with in vitro functional, animal, and epidemiological studies to link genotypes to phenotypes. Without genomic surveillance, some mutations may go unnoticed if they do not impact diagnostic assays. Polymerase chain reaction (PCR) assays are susceptible to false negatives that result from mutations that weaken primer annealing. For example, a mutation or mutations causing one primer set to fail within a multiplex assay—called a partial assay failure—is best exemplified by the Alpha (B.1.1.7) variant. This variant was discovered first in the UK in December, 2020 by the so-called S gene target failure (SGTF) (aka the S gene dropout) in a Thermofisher TaqPath PCR test kit that targets 3 different regions (ORF1ab, E gene, and S gene) of the viral genome (5). A 6-nt deletion (Δ69–70 aa) resulted in the SGTF while the other two targets of the kit were positive for a given sample (5). After analyses determined the cause of the target failure was a new mutation, SGTF was used as a proxy for the presence of Alpha variant (B.1.1.7) in test samples. There are other examples of less severe assay failures in some commercial SARS-CoV-2 PCR tests (6, 7). Such failures affect biosurveillance network responses and public health policy decisions for disease control, containment, and prevention as they form the basis for our understanding of how an outbreak is progressing. Thus, systematic and periodic assessment of real-time PCR assay performance is critical for maintaining assay specificity and decreasing false negatives (6–10). The Global Initiative on Sharing Avian Influenza Data (GISAID) organization and Nextstrain project provide genomic data, metadata, and phylodynamic analysis for monitoring ongoing outbreaks (11, 12). Continuous rapid sampling and sequencing of SARS-CoV-2 and sharing of data throughout the course of this pandemic provided a unique opportunity for visualizing signature erosion over time. Specifically, since January 20, 2020, we have used the PCR Signature Erosion Tool (PSET) to periodically evaluate assays in silico against all SARS-CoV-2 genome sequences from the GISAID EpiCoV™ database and share results on virological.org (13). PSET has been used previously to evaluate the impact of genomic drift on current and proposed assays for Ebolavirus and Mammarenavirus (14, 15). Here, we present our assessment of diagnostic PCR assays in silico by applying PSET to regularly collected SARS-CoV-2 sequences. Recent alignment-based studies have been conducted with comparable results (8, 9). Our approach significantly increases the number of assays and subject sequences tested, breaks-down alignment rates over time by lineage, compares the mutation rate of assay target regions to the rest of the genome, and calculates the number of sequences producing false negatives for multiple assays.

Materials and Methods

PSET Analysis

In this study we tested the in silico performance of 43 PCR assays (Supplementary Table 1) with different gene targets [19 ORF1ab, 8 spike (S) protein, 1 ORF3a, 3 envelope (E) protein, 1 ORF8, and 11 nucleocapsid (N)] against a set of 1,690,689 SARS-CoV-2 accessions (Supplementary Table 2) and sequences downloaded from the GISAID EpiCoV™ database on July 7, 2021 (11, 13, 16–26). At the time of analysis, the CDC listed Pango lineages P.1, B.1.351, B.1.1.7, B.1.427, and B.1.429 as VoCs and lineages P.2, B.1.525, and B.1.526 as VoIs (27, 28). Additional follow-up analysis was performed on the B.1.1.529 VoC as it emerged later. The PSET definition for each assay target was based on a reference amplicon sequence with delimited primer and probe regions (Figure 1). Twenty nucleotides of additional sequence context outside the amplicon were also included to inform alignment at the 5′ and 3′ ends. Context and inter-primer regions were obtained via global-local alignment of the assay primers to the SARS-CoV-2 reference genome by running the glsearch36 program of the FASTA suite (29).
Figure 1

PSET assay definition. The assay definition includes square brackets and parentheses to delimit the primer sequences. Note that a probe is optional and that any sequence outside of the amplicon region is considered context for alignment purposes. This example corresponds to the cdc_n1 assay.

PSET assay definition. The assay definition includes square brackets and parentheses to delimit the primer sequences. Note that a probe is optional and that any sequence outside of the amplicon region is considered context for alignment purposes. This example corresponds to the cdc_n1 assay. The first phase of PSET analysis queries the assay target definition against a BLAST+ database to search for matching subjects based on local alignment with the blastn program (30). Expansion of ambiguous DNA codes of the query is required for compatibility. For example, “GAWTAYA” has two, representing four expansions: “GAATACA,” “GATTACA,” “GAATATA,” and “GATTATA.” Only the first permutation is queried. An additional step re-evaluates the BLAST+ identity statistics by replacing the expanded query with the original one to account for ambiguous base similarity. Subjects with ≥85% identity to the amplicon region are then extended to cover the query range and extracted. The second phase queries the corresponding primers separately against the library of extracted sequences to search for matching subjects based on global-local alignment with the glsearch36 program, which is compatible with ambiguous DNA codes. Subjects with ≥90% identity are kept and aggregated by subject accession. A true positive (TP) is called if all primers aligned to the subject with the required identity and strand arrangement such that the primer would hypothetically amplify the target. The special case of a perfect true positive (PT) is called when there is 100% identity. Otherwise, a false negative (FN) is called. If the subject contains an N in any of the primer alignment regions, an N is appended to the category name (TPN and FNN). Doing so avoids removing low-quality, yet potentially informative sequences. An additional unknown (UNK) category is included to account for subjects that failed alignment during the first phase. In the case of near-neighbor analysis, a false positive (FP) or true negative (TN) is called when the taxonomy of the subject differs from the assay target. FPs with 100% identity are perfect false positives (PF). Results were filtered to include high-coverage, human-host sequences with assigned Centers for Disease Control (CDC) and Pango lineages corresponding to VoC [Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), Delta (B.1.617), Epsilon (B.1.427/B.1.429)] and VoI [Eta (B1.525), Theta (P.2), and Iota (B.1.526)] (27, 28). Metadata also included VoC and VoI designation times from disease control centers (31–39). Further aggregation and visualization was performed using the R programming language with the tidyverse (v1.3.1), lubridate (1.8.0), tsibble (v1.1.0), and cowplot (v1.1.1) packages (40–43). The PSET workflow itself was implemented using Biopython and Snakemake (44, 45). A summary of results and methodology refinements were posted to virological.org on a near-weekly basis (13).

Variation Analysis

For variant analysis, the local GISAID EpiCoV™ database used for the PSET analysis was filtered to include only unambiguous DNA codes, resulting in a subset of 961,051 sequences. Single nucleotide variations (SNVs) and insertions or deletions (indels) were calculated by running the nucmer and show-snps programs of the MUMmer4 suite (46). Parameters for nucmer included NCBI GenBank accession no. NC_045512.2 (47) as the reference and flags to search the forward strand of each query (-f) with 28 threads (-t). The show-snps program transformed the resulting delta file into a tabular file (-T) of SNP/indel calls. An R script loaded the variants identified via MUMmer4 and split the data into two groups according to overlap with assay primer and probe target regions. The percentage of positions with ≥n mutations was calculated, where n was in the range [0, 500]. A two-sample Kolmogorov-Smirnov procedure tested the null hypothesis that the observed mutation percentages arose from the same distribution for the assay target and non-target genomic regions.

Omicron Analysis

The Omicron (B.1.1.529) wave emerged after the initial analysis was completed. Accordingly, an updated analysis targeted this VoC. On February 16, 2022, genomes were downloaded manually from the EpiCoV™ search page. Filters were enabled to include complete sequences with high coverage and collection dates. The PSET workflow analyzed the final set of 48,358 sequences (Supplementary Table 3) as previously described.

Results

Assay Regions and Genomic Variation

Figure 2 shows the target region of each assay with respect to the reference SARS-CoV-2 genome (NCBI GenBank accession no. NC_045512.2), SNVs, and indels (47). Mutations with <1% prevalence (~99.62% of all positions with observed mutations) were excluded from the figure but remain listed in Supplementary Table 4. The figure also plots the assay target regions on the genome. Plotting all the regions together with respect to mutations helps visualize signature erosion due to genetic drift.
Figure 2

Genomic variation and assay regions. The numbered rectangles and labeled arrows indicate PCR assay and gene regions respectively. Heat values indicate the percentage of SNVs observed at the reference location across all subjects. Percentages for both SNVs and indels are compiled in Supplementary Table 4.

Genomic variation and assay regions. The numbered rectangles and labeled arrows indicate PCR assay and gene regions respectively. Heat values indicate the percentage of SNVs observed at the reference location across all subjects. Percentages for both SNVs and indels are compiled in Supplementary Table 4. On the other hand, when the assay target regions are compared specifically against the non-target regions, assay target regions with mutations are present in higher percentage of genomes (Figure 3). The two-sample Kolmogorov-Smirnov test obtained a p value of 2.2E−16, rejecting the null hypothesis at a significance level of 0.05 in favor of the two-sided alternative.
Figure 3

Regional mutation distributions. The two-sample Kolmogorov-Smirnov test rejected the null hypothesis (α = 0.05), supporting the elevated number of mutations observed in the assay regions (D = 0.408, p < 2.2e−16, alternative hypothesis: two-sided). Supplementary Tables 5, 6 list the counts by position and percentage of mutations by region.

Regional mutation distributions. The two-sample Kolmogorov-Smirnov test rejected the null hypothesis (α = 0.05), supporting the elevated number of mutations observed in the assay regions (D = 0.408, p < 2.2e−16, alternative hypothesis: two-sided). Supplementary Tables 5, 6 list the counts by position and percentage of mutations by region.

Assay Alignment Confusion Matrix

The assays were able to correctly detect the vast majority of the 1,690,689 genome sequences according to the in silico PSET analysis. Of the 43 assays, 34 aligned with 100% identity to over 1.6 million subjects. Otherwise, TP rates exceeded 98%, except for the Young-S and China_N assays, which exhibited high FN rates at ~47% and ~59% respectively. The TP percentage was calculated as the sum of PT, TP, and TPN divided by the total number of subjects. Alignments exceeded identity threshold for most corresponding subject amplicon sequences, only failing on average ~0.74% (UNK%) of the time. Table 1 and Supplementary Table 7 include a full breakdown by assay. In the separate Omicron study, most of the assays (40 / 43) were able to detect sequences with a TP rate of ≥96%, with Young_S, Thailand_WH-NIC_N, and China_N assays having reached TP rates of 75.64%, 2.98%, and 0.34% respectively (Supplementary Table 8). We also looked at the specificity of the assays against near-neighbor sequences (Supplementary Table 9), where 8 showed significant false positives (FP) and perfect FPs (PFs) (those with 100% identity).
Table 1

Confusion matrix of assay calls based on alignment/arrangement.

ID Assay From To Gene PT TP TPN FN FNN UNK
1Japan_NIID_WH-1_F501483896ORF1ab1,657,34620,3691788911612,591
2Japan_NIID_WH-1_F509491837ORF1ab1,640,10637,11242920126812,573
3Japan_NIID_WH-1_Seq_F519501823ORF1ab1,634,10928,5264336,6168,41912,586
4Yip-ORF1ab1,8651,970ORF1ab1,661,32016,2614081843212,250
5Noblis.123,2393,482ORF1ab1,644,65233,3392832416912,105
6C1_COV_ORF1a10,96411,071ORF1ab1,643,24127,7162,061429317,374
7France_nCoV_IP212,68912,797ORF1ab1,647,39930,8042801255912,022
8China_ORF1ab13,34113,460ORF1ab1,664,71013,0373941825512,275
9France_nCoV_IP414,07914,186ORF1ab1,554,134124,0863003493911,781
10Young-ORF1ab14,15414,243ORF1ab1,653,74224,46623811129911,833
11ncov_rdrp_115,43015,530ORF1ab01,676,3131797921,62411,781
12ncov_rdrp_215,43015,530ORF1ab711,677,03423811,56411,781
13Won-ORF1ab15,44015,558ORF1ab1,563,669114,92020223611,860
14Chan-ORF1ab16,21916,353ORF1ab571,677,4052547225012,651
15Noblis.4017,16917,337ORF1ab1,617,53661,0761811106311,723
16Noblis.4418,10218,466ORF1ab1,660,92617,339268684112,047
17Noblis.4218,28418,466ORF1ab1,661,71516,697263226011,932
18HKU-ORF1b-nsp1418,77718,909ORF1ab1,663,43315,101226813011,818
19C2_COV_ORF1b18,97319,082ORF1ab1,632,38945,44089318711,879
20Young-S21,76221,876S775,27162,1502,900798,48013,94237,946
21Chan-S22,71122,869S1,607,06273,7321,3171091,5406,929
22Won-S23,11323,213S1,666,24812,6868088492,2217,877
23C5_COV_S_gene23,99524,134S1,657,63124,567595402167,640
24Noblis.5724,04524,205S1,655,63726,6344061121367,764
25Japan_WuhanCoV-spk124,35324,900S1,664,35417,288234302868,425
26Japan_NIID_WH-1_F2438124,36324,856S1,656,82324,516512349768,413
27Japan_NIID_WH-1_Seq_F2438324,36524,848S1,655,75822,7094673,2351098,411
28C3_COV_ORF3a25,84925,993ORF3a1,513,748160,2681,7574875114,117
29Won-E26,25826,365E51,678,0871107430012,113
30ncov_e_gene26,26826,381E1,671,8636,0991394428812,256
31Niu-E26,30226,391E1,661,88715,8703412626412,301
32C4_COV_ORF827,99928,135ORF8833,817828,3669,4572,86138815,800
33cdc_n128,28628,358N1,621,04356,882407294812,280
34Thailand_WH-NIC_N28,31928,376N1,649,54128,4952421014912,261
35Young-N28,58228,648N11,677,9531533313812,411
36cdc_n328,68028,752N1,614,50663,680429103912,025
37ncov_n_gene28,70528,833N1,633,14142,89139921815313,887
38Won-N28,73128,849N1,634,74141,5922171143413,991
39China_N28,88028,979N401,644258,1025691,010,6326,16813,574
40Japan_NIID_2019-nCOV_N29,12429,282N31,676,34619620017713,767
41HKU-N29,14429,254N1,633,06143,09260819619013,542
42cdc_n229,16329,230N1,622,61753,8995657913013,399
43Chan-N29,20929,306N1,630,59545,5783686522313,860

True positive (TP); false negative (FN); perfect TP (PT); TP/FN aligned with N's (TPN/FNN); unknown (UNK).

Confusion matrix of assay calls based on alignment/arrangement. True positive (TP); false negative (FN); perfect TP (PT); TP/FN aligned with N's (TPN/FNN); unknown (UNK).

Assay Alignment Identity Over Time

Alignment identity of all assays against sequences of different lineages of SARS-CoV-2 was assessed from 2020-03-15 to 2021-07-05. Figure 4 depicts heat maps of the TP rate over time for each VoC/VoI with a corresponding line graph of the cumulative log-total number of sequences. Only GISAID sequences with collection date metadata specifying year, month, and day were included in the heatmap. The graph reveals sudden changes in assay alignment identity with respect to variant abundance.
Figure 4

Cumulative true positive rate. Vertical and horizontal facets divide the graph by Pango lineage and assay target gene. (A) The top line graph shows the logarithm of the cumulative total of subject sequences. (B) Heat map of the current PCR assays with the cumulative conditional true positive rate assessed from Apr 2020 to Feb 2021. The assay targeting specific genes are labeled on the right. White represents the absence of subject sequences of the lineage in the reference database. Vertical lines when each disease control organization called the lineage a VoC/VoI based on available data (compiled in Supplementary Table A).

Cumulative true positive rate. Vertical and horizontal facets divide the graph by Pango lineage and assay target gene. (A) The top line graph shows the logarithm of the cumulative total of subject sequences. (B) Heat map of the current PCR assays with the cumulative conditional true positive rate assessed from Apr 2020 to Feb 2021. The assay targeting specific genes are labeled on the right. White represents the absence of subject sequences of the lineage in the reference database. Vertical lines when each disease control organization called the lineage a VoC/VoI based on available data (compiled in Supplementary Table A). Alignment identity remained constant for most assays, with perfect or near-perfect TP rates. However, some interesting patterns were observed. Some assays targeting the S and N genes performed poorly. For example, Young-S exhibited low TP rates within the B.1.1.7 and B.1.525 lineages and faltered for B.1.258, B.1.427, and B.1.429 isolates as cumulative sampling increased. Other assays targeting the S protein gene performed well, despite some temporary rough patches for Chan-S, Won-S and C5_COV_S_gene. Recently, Chan-S appears to have started failing for the B.1.258 lineage. China_N failed for the B.1.1.7, P.2, and P.1 clades. All other assays targeting the N gene performed well. However, Japan_NIID_2019-nCOV_N, HKU-N, cdc_n2, and Chan-N recently started failing for the B.1.525 lineage. Also, Young-N temporarily exhibited a low TP rate before recovering.

VoC/VoI Call Patterns

Figure 4 displays the timeline for each lineage to be identified as a VoC/VoI by the CDC, European CDC (ECDC), Public Health England (PHE), and World Health Organization (WHO). Most identifications occurred within a month of each other. However, for P.2 and B.1.429, 186 and 118 days passed between the initial and final call. The CDC and PHE called the former within 12 days of each other and the former within 111 days. In both cases, the WHO called last. We found no evidence of a call from the ECDC in these cases. The B.1.429 case shows the initial call occurring during a period of low alignment identity for the Young-S assay and the beginning of a decline for the Japan_NIID_WF-1_Seq_F519 assay. Despite the varying times at which these lineages were identified, when looking at the cumulative increase of the GISAID sequences for each, it is readily apparent that they initially occurred at very low proportions. This increase, and the resulting VoC/VoI designations, underscore the importance of continual monitoring from multiple organizations offering different perspectives and criteria with respect to spatiotemporal trends.

In silico False Negative Distribution by Assay

As assay targets were distributed throughout the genome, we were also interested in seeing whether emerging variants possessed mutations that resulted in more than one assay producing a predicted FN. Supplementary Table SB shows that nearly all (>99%) sequences resulted in a predicted true positive in at least 41 of 43 assays. Also, Supplementary Table SC shows that the majority (97%) of sequences producing a single FN were observed to originate from the China_N (87%) or Young-S (10%) assays. The majority (765,813) of sequences produced FNs in two different assays, China_N and Young-S, while 643,876 sequences did not produce an FN in any of the 43 assays. We observed a substantial drop-off in the number of sequences which caused FNs in 3 or more assays. This suggests that it is unlikely for a sequence to produce FNs in more than 2 assays and that a multiplex panel with any of the other 41 assays evaluated here would likely perform successfully.

Discussion

Signature Erosion Analysis

The rapid increase of novel SARS-CoV-2 variants raises concerns regarding the efficacy of PCR-based diagnostic assays. This study assesses the potential signature erosion of the current COVID-19 real-time diagnostic assays using the GISAID sequence database. We find that 41 out of the 43 assays continue to perform very well throughout the observed timeframe even as new variants arose. Additional analysis completed after the emergence of Omicron revealed that 40 of 43 assays continued to perform well. Variants containing mutations within the target regions for primer and probe hybridization were observed to have a substantial effect on predicted assay alignment identity. The characteristic S:Δ69/70 mutation of the Alpha variant occurs within the forward primer of the Young-S assay target, while mutations N:R203K and N:G204R occurred within the borders of the China_N assay target, resulting in PSET calling significant FN rates of those assays. The China_N assay failed completely for the lineages B.1.1.7, P1, and P2. This may be due to a “GGG” to “AAC” mutation in the forward primer, as observed in previous studies (8, 9). The 3-nt substitution mutation could potentially reduce the China_N forward primer binding affinity, which could significantly increase the FN rate, especially since it occurs at the 5′-end. The B.1.1.529 lineage contains an N:Δ31/33 deletion (48) which falls inside the borders of the Thailand_WH-NIC_N assay and results in PSET predicting significant FN rates. This may be a similar situation as described above where primer binding affinity is affected by this type of mutation. Various point mutations at the primer binding motifs are observed in all the lineages, which did not affect overall performance according to evaluation of in silico alignments. The exception was for lineage B.1.258, which bears a complementary sequence to the China_N primer resulting in perfect alignment. The China_N and Young-S assays experience 56 × and 44 × the FN rate of all other assays combined, respectively. In almost all assays, there is a percentage of mutations in one of the targets of the assay.

Recent Studies and Future Direction

The COVID-19 pandemic has demonstrated the need for the in silico monitoring of PCR-based diagnostic assay performance. Furthermore, given that surveillance network data are widely available to aid in quick-turnaround research and development of replacement assays, in silico monitoring has become an important early warning indicator (10). Our approach is consistent with recent alignment-based studies by confirming the overall high assay target sequence identity and detection of the China_N mutation (8, 9). We also observed potential early signs of signature erosion prior to official variant calls. Future research can further elucidate validation of in vitro and in silico predictions while considering the clinical and public health relevance. For example, the PSET algorithm could incorporate mismatch position, since mismatches near the 3′ end of the probe sequence potentially have a greater impact on assay performance (49, 50). Additional wet lab experiments can help systematically select alignment thresholds and motivate algorithmic refinements.

Conclusion

This study highlights the consequences of mutations in SARS-CoV-2 genomes on PCR-based diagnostic assays. The importance of real-time monitoring of molecular assay alignment in silico is highlighted by the discovery of the Alpha variant. The failure of one primer/probe combination within the multiplex assay targeting a region of a deletion in this variant underscored the need for extensive sequence-based surveillance. Sustained transmission and proliferation of viruses such as SARS-CoV-2 in a global pandemic leads to rapid evolution and accumulation of mutations that confer advantageous phenotypes, such as potentially evading diagnostics, therapeutics, and vaccines. Real-time in silico monitoring of assay signature erosion allows for the redesign and refinement of diagnostic assays to address assay and medical countermeasure failures to avoid dire failures of medical countermeasures.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

SS: conceptualization and supervision. DN, JK, and SW: data curation. DN, JK, SM, and SW: formal analysis. BG: funding acquisition. DN, JK, SM, LB, and SS: investigation. DN, JK, SM, MH, SW, and SS: methodology. BG and SS: project administration. DN, MH, and SW: software. DN, JK, SM, SW, JV, and SS: validation. DN and SW: visualization. DN, JK, SM, and SS: writing—original draft. DN, JK, SM, JV, LB, KJ, SG, and SS: writing—review and editing. All authors contributed to the article and approved the submitted version. The following CRediT (Contributor Roles Taxonomy) terms indicate the corresponding initials of the contributing authors.

Funding

Funding for this work was provided by Joint Program Executive Office, JPL-CBRND-EB, DBPAO under the contract number W911QY-17-C-0016.

Conflict of Interest

DN, JK, SM, MH, SW, LB, and KJ were employed by Noblis, Inc. SG was employed by Noblis ESI. SS was employed by Logistics Management Institute. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  32 in total

1.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

2.  Lassa virus circulating in Liberia: a retrospective genomic characterisation.

Authors:  Michael R Wiley; Lawrence Fakoli; Andrew G Letizia; Stephen R Welch; Jason T Ladner; Karla Prieto; Daniel Reyes; Nicole Espy; Joseph A Chitty; Catherine B Pratt; Nicholas Di Paola; Fahn Taweh; Desmond Williams; Jon Saindon; William G Davis; Ketan Patel; Mitchell Holland; Daniel Negrón; Ute Ströher; Stuart T Nichol; Shanmuga Sozhamannan; Pierre E Rollin; John Dogba; Tolbert Nyenswah; Fatorma Bolay; César G Albariño; Mosoka Fallah; Gustavo Palacios
Journal:  Lancet Infect Dis       Date:  2019-10-03       Impact factor: 25.071

3.  Variants in SARS-CoV-2 associated with mild or severe outcome.

Authors:  Jameson D Voss; Martin Skarzynski; Erin M McAuley; Ezekiel J Maier; Thomas Gibbons; Anthony C Fries; Richard R Chapleau
Journal:  Evol Med Public Health       Date:  2021-06-27

4.  Three Novel Real-Time RT-PCR Assays for Detection of COVID-19 Virus.

Authors:  Peihua Niu; Roujian Lu; Li Zhao; Huijuan Wang; Baoying Huang; Fei Ye; Wenling Wang; Wenjie Tan
Journal:  China CDC Wkly       Date:  2020-06-19

Review 5.  Mismatch amplification mutation assay-polymerase chain reaction: A method of detecting fluoroquinolone resistance mechanism in bacterial pathogens.

Authors:  Vijaya Kumar Deekshit; Kadeeja Jazeela; Gunimala Chakraborty; Anusha Rohit; Anirban Chakraborty; Indrani Karunasagar
Journal:  Indian J Med Res       Date:  2019-02       Impact factor: 2.375

6.  Development of a Novel, Genome Subtraction-Derived, SARS-CoV-2-Specific COVID-19-nsp2 Real-Time RT-PCR Assay and Its Evaluation Using Clinical Specimens.

Authors:  Cyril Chik-Yan Yip; Chi-Chun Ho; Jasper Fuk-Woo Chan; Kelvin Kai-Wang To; Helen Shuk-Ying Chan; Sally Cheuk-Ying Wong; Kit-Hang Leung; Agnes Yim-Fong Fung; Anthony Chin-Ki Ng; Zijiao Zou; Anthony Raymond Tam; Tom Wai-Hin Chung; Kwok-Hung Chan; Ivan Fan-Ngai Hung; Vincent Chi-Chung Cheng; Owen Tak-Yin Tsang; Stephen Kwok Wing Tsui; Kwok-Yung Yuen
Journal:  Int J Mol Sci       Date:  2020-04-08       Impact factor: 5.923

7.  Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR.

Authors:  Victor M Corman; Olfert Landt; Marco Kaiser; Richard Molenkamp; Adam Meijer; Daniel Kw Chu; Tobias Bleicker; Sebastian Brünink; Julia Schneider; Marie Luisa Schmidt; Daphne Gjc Mulders; Bart L Haagmans; Bas van der Veer; Sharon van den Brink; Lisa Wijsman; Gabriel Goderski; Jean-Louis Romette; Joanna Ellis; Maria Zambon; Malik Peiris; Herman Goossens; Chantal Reusken; Marion Pg Koopmans; Christian Drosten
Journal:  Euro Surveill       Date:  2020-01

8.  Deepening of In Silico Evaluation of SARS-CoV-2 Detection RT-qPCR Assays in the Context of New Variants.

Authors:  Mathieu Gand; Kevin Vanneste; Isabelle Thomas; Steven Van Gucht; Arnaud Capron; Philippe Herman; Nancy H C Roosens; Sigrid C J De Keersmaecker
Journal:  Genes (Basel)       Date:  2021-04-13       Impact factor: 4.096

Review 9.  SARS-CoV-2: from its discovery to genome structure, transcription, and replication.

Authors:  Ayslan Castro Brant; Wei Tian; Vladimir Majerciak; Wei Yang; Zhi-Ming Zheng
Journal:  Cell Biosci       Date:  2021-07-19       Impact factor: 7.133

10.  US CDC Real-Time Reverse Transcription PCR Panel for Detection of Severe Acute Respiratory Syndrome Coronavirus 2.

Authors:  Xiaoyan Lu; Lijuan Wang; Senthilkumar K Sakthivel; Brett Whitaker; Janna Murray; Shifaq Kamili; Brian Lynch; Lakshmi Malapati; Stephen A Burke; Jennifer Harcourt; Azaibi Tamin; Natalie J Thornburg; Julie M Villanueva; Stephen Lindstrom
Journal:  Emerg Infect Dis       Date:  2020-05-15       Impact factor: 6.883

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.