| Literature DB >> 35570946 |
Daniel Antonio Negrón1, June Kang1, Shane Mitchell1, Mitchell Y Holland1, Stephen Wist1, Jameson Voss2, Lauren Brinkac1, Katharine Jennings1, Stephanie Guertin3, Bruce G Goodwin2, Shanmuga Sozhamannan2,4.
Abstract
Real-time reverse transcription polymerase chain reaction (RT-PCR) assays are the most widely used molecular tests for the detection of SARS-CoV-2 and diagnosis of COVID-19 in clinical samples. PCR assays target unique genomic RNA regions to identify SARS-CoV-2 with high sensitivity and specificity. In general, assay development incorporates the whole genome sequences available at design time to be inclusive of all target species and exclusive of near neighbors. However, rapid accumulation of mutations in viral genomes during sustained growth in the population can result in signature erosion and assay failures, creating situational blind spots during a pandemic. In this study, we analyzed the signatures of 43 PCR assays distributed across the genome against over 1.6 million SARS-CoV-2 sequences. We present evidence of significant signature erosion emerging in just two assays due to mutations, while adequate sequence identity was preserved in the other 41 assays. Failure of more than one assay against a given variant sequence was rare and mostly occurred in the two assays noted to have signature erosion. Assays tended to be designed in regions with statistically higher mutations rates. in silico analyses over time can provide insights into mutation trends and alert users to the emergence of novel variants that are present in the population at low proportions before they become dominant. Such routine assessment can also potentially highlight false negatives in test samples that may be indicative of mutations having functional consequences in the form of vaccine and therapeutic failures. This study highlights the importance of whole genome sequencing and expanded real-time monitoring of diagnostic PCR assays during a pandemic.Entities:
Keywords: COVID-19; PCR; RT-PCR; SARS-CoV-2; biosurveillance; diagnostics; pandemic; signature
Mesh:
Year: 2022 PMID: 35570946 PMCID: PMC9096222 DOI: 10.3389/fpubh.2022.889973
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1PSET assay definition. The assay definition includes square brackets and parentheses to delimit the primer sequences. Note that a probe is optional and that any sequence outside of the amplicon region is considered context for alignment purposes. This example corresponds to the cdc_n1 assay.
Figure 2Genomic variation and assay regions. The numbered rectangles and labeled arrows indicate PCR assay and gene regions respectively. Heat values indicate the percentage of SNVs observed at the reference location across all subjects. Percentages for both SNVs and indels are compiled in Supplementary Table 4.
Figure 3Regional mutation distributions. The two-sample Kolmogorov-Smirnov test rejected the null hypothesis (α = 0.05), supporting the elevated number of mutations observed in the assay regions (D = 0.408, p < 2.2e−16, alternative hypothesis: two-sided). Supplementary Tables 5, 6 list the counts by position and percentage of mutations by region.
Confusion matrix of assay calls based on alignment/arrangement.
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Japan_NIID_WH-1_F501 | 483 | 896 | ORF1ab | 1,657,346 | 20,369 | 178 | 89 | 116 | 12,591 |
| 2 | Japan_NIID_WH-1_F509 | 491 | 837 | ORF1ab | 1,640,106 | 37,112 | 429 | 201 | 268 | 12,573 |
| 3 | Japan_NIID_WH-1_Seq_F519 | 501 | 823 | ORF1ab | 1,634,109 | 28,526 | 433 | 6,616 | 8,419 | 12,586 |
| 4 | Yip-ORF1ab | 1,865 | 1,970 | ORF1ab | 1,661,320 | 16,261 | 408 | 18 | 432 | 12,250 |
| 5 | Noblis.12 | 3,239 | 3,482 | ORF1ab | 1,644,652 | 33,339 | 283 | 241 | 69 | 12,105 |
| 6 | C1_COV_ORF1a | 10,964 | 11,071 | ORF1ab | 1,643,241 | 27,716 | 2,061 | 4 | 293 | 17,374 |
| 7 | France_nCoV_IP2 | 12,689 | 12,797 | ORF1ab | 1,647,399 | 30,804 | 280 | 125 | 59 | 12,022 |
| 8 | China_ORF1ab | 13,341 | 13,460 | ORF1ab | 1,664,710 | 13,037 | 394 | 18 | 255 | 12,275 |
| 9 | France_nCoV_IP4 | 14,079 | 14,186 | ORF1ab | 1,554,134 | 124,086 | 300 | 349 | 39 | 11,781 |
| 10 | Young-ORF1ab | 14,154 | 14,243 | ORF1ab | 1,653,742 | 24,466 | 238 | 111 | 299 | 11,833 |
| 11 | ncov_rdrp_1 | 15,430 | 15,530 | ORF1ab | 0 | 1,676,313 | 179 | 792 | 1,624 | 11,781 |
| 12 | ncov_rdrp_2 | 15,430 | 15,530 | ORF1ab | 71 | 1,677,034 | 238 | 1 | 1,564 | 11,781 |
| 13 | Won-ORF1ab | 15,440 | 15,558 | ORF1ab | 1,563,669 | 114,920 | 202 | 2 | 36 | 11,860 |
| 14 | Chan-ORF1ab | 16,219 | 16,353 | ORF1ab | 57 | 1,677,405 | 254 | 72 | 250 | 12,651 |
| 15 | Noblis.40 | 17,169 | 17,337 | ORF1ab | 1,617,536 | 61,076 | 181 | 110 | 63 | 11,723 |
| 16 | Noblis.44 | 18,102 | 18,466 | ORF1ab | 1,660,926 | 17,339 | 268 | 68 | 41 | 12,047 |
| 17 | Noblis.42 | 18,284 | 18,466 | ORF1ab | 1,661,715 | 16,697 | 263 | 22 | 60 | 11,932 |
| 18 | HKU-ORF1b-nsp14 | 18,777 | 18,909 | ORF1ab | 1,663,433 | 15,101 | 226 | 81 | 30 | 11,818 |
| 19 | C2_COV_ORF1b | 18,973 | 19,082 | ORF1ab | 1,632,389 | 45,440 | 893 | 1 | 87 | 11,879 |
| 20 | Young-S | 21,762 | 21,876 | S | 775,271 | 62,150 | 2,900 | 798,480 | 13,942 | 37,946 |
| 21 | Chan-S | 22,711 | 22,869 | S | 1,607,062 | 73,732 | 1,317 | 109 | 1,540 | 6,929 |
| 22 | Won-S | 23,113 | 23,213 | S | 1,666,248 | 12,686 | 808 | 849 | 2,221 | 7,877 |
| 23 | C5_COV_S_gene | 23,995 | 24,134 | S | 1,657,631 | 24,567 | 595 | 40 | 216 | 7,640 |
| 24 | Noblis.57 | 24,045 | 24,205 | S | 1,655,637 | 26,634 | 406 | 112 | 136 | 7,764 |
| 25 | Japan_WuhanCoV-spk1 | 24,353 | 24,900 | S | 1,664,354 | 17,288 | 234 | 302 | 86 | 8,425 |
| 26 | Japan_NIID_WH-1_F24381 | 24,363 | 24,856 | S | 1,656,823 | 24,516 | 512 | 349 | 76 | 8,413 |
| 27 | Japan_NIID_WH-1_Seq_F24383 | 24,365 | 24,848 | S | 1,655,758 | 22,709 | 467 | 3,235 | 109 | 8,411 |
| 28 | C3_COV_ORF3a | 25,849 | 25,993 | ORF3a | 1,513,748 | 160,268 | 1,757 | 48 | 751 | 14,117 |
| 29 | Won-E | 26,258 | 26,365 | E | 5 | 1,678,087 | 110 | 74 | 300 | 12,113 |
| 30 | ncov_e_gene | 26,268 | 26,381 | E | 1,671,863 | 6,099 | 139 | 44 | 288 | 12,256 |
| 31 | Niu-E | 26,302 | 26,391 | E | 1,661,887 | 15,870 | 341 | 26 | 264 | 12,301 |
| 32 | C4_COV_ORF8 | 27,999 | 28,135 | ORF8 | 833,817 | 828,366 | 9,457 | 2,861 | 388 | 15,800 |
| 33 | cdc_n1 | 28,286 | 28,358 | N | 1,621,043 | 56,882 | 407 | 29 | 48 | 12,280 |
| 34 | Thailand_WH-NIC_N | 28,319 | 28,376 | N | 1,649,541 | 28,495 | 242 | 101 | 49 | 12,261 |
| 35 | Young-N | 28,582 | 28,648 | N | 1 | 1,677,953 | 153 | 33 | 138 | 12,411 |
| 36 | cdc_n3 | 28,680 | 28,752 | N | 1,614,506 | 63,680 | 429 | 10 | 39 | 12,025 |
| 37 | ncov_n_gene | 28,705 | 28,833 | N | 1,633,141 | 42,891 | 399 | 218 | 153 | 13,887 |
| 38 | Won-N | 28,731 | 28,849 | N | 1,634,741 | 41,592 | 217 | 114 | 34 | 13,991 |
| 39 | China_N | 28,880 | 28,979 | N | 401,644 | 258,102 | 569 | 1,010,632 | 6,168 | 13,574 |
| 40 | Japan_NIID_2019-nCOV_N | 29,124 | 29,282 | N | 3 | 1,676,346 | 196 | 200 | 177 | 13,767 |
| 41 | HKU-N | 29,144 | 29,254 | N | 1,633,061 | 43,092 | 608 | 196 | 190 | 13,542 |
| 42 | cdc_n2 | 29,163 | 29,230 | N | 1,622,617 | 53,899 | 565 | 79 | 130 | 13,399 |
| 43 | Chan-N | 29,209 | 29,306 | N | 1,630,595 | 45,578 | 368 | 65 | 223 | 13,860 |
True positive (TP); false negative (FN); perfect TP (PT); TP/FN aligned with N's (TPN/FNN); unknown (UNK).
Figure 4Cumulative true positive rate. Vertical and horizontal facets divide the graph by Pango lineage and assay target gene. (A) The top line graph shows the logarithm of the cumulative total of subject sequences. (B) Heat map of the current PCR assays with the cumulative conditional true positive rate assessed from Apr 2020 to Feb 2021. The assay targeting specific genes are labeled on the right. White represents the absence of subject sequences of the lineage in the reference database. Vertical lines when each disease control organization called the lineage a VoC/VoI based on available data (compiled in Supplementary Table A).