| Literature DB >> 36046362 |
Simone Maestri1,2, Valentina Grosso1, Massimiliano Alfano1, Denise Lavezzari1, Chiara Piubelli3, Zeno Bisoffi3,4, Marzia Rossato1,5, Massimo Delledonne1,5.
Abstract
Diagnostic tests based on reverse transcription-quantitative polymerase chain reaction (RT-qPCR) are the gold standard approach to detect severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection from clinical specimens. However, unless specifically optimized, this method is usually unable to recognize the specific viral strain responsible of coronavirus disease 2019, a crucial information that is proving increasingly important in relation to virus spread and treatment effectiveness. Even if some RT-qPCR commercial assays are currently being developed for the detection of viral strains, they focus only on single/few genetic variants that may not be sufficient to uniquely identify a specific strain. Therefore, genome sequencing approaches remain the most comprehensive solution for virus genotyping and to recognize viral strains, but their application is much less widespread due to higher costs. Starting from the well-established ARTIC protocol coupled to nanopore sequencing, in this work, we developed STArS (STrain-Amplicon-Seq), a cost/time-effective sequencing-based workflow for both SARS-CoV-2 diagnostics and genotyping. A set of 10 amplicons was initially selected from the ARTIC tiling panel, to cover: (i) all the main biologically relevant genetic variants located on the Spike gene; (ii) a minimal set of variants to uniquely identify the currently circulating strains; (iii) genomic sites usually amplified by RT-qPCR method to identify SARS-CoV-2 presence. PCR-amplified clinical samples (both positive and negative for SARS-CoV-2 presence) were pooled together with a serially diluted exogenous amplicon at known concentration and sequenced on a MinION device. Thanks to a scoring rule, STArS had the capability to accurately classify positive samples in agreement with RT-qPCR results, both at the qualitative and quantitative level. Moreover, the method allowed to effectively genotype strain-specific variants and thus also return the phylogenetic classification of SARS-CoV-2-postive samples. Thanks to the reduced turnaround time and costs, the proposed approach represents a step towards simplifying the clinical application of sequencing for viral genotyping, hopefully aiding in combatting the global pandemic.Entities:
Year: 2022 PMID: 36046362 PMCID: PMC9422081 DOI: 10.1093/biomethods/bpac020
Source DB: PubMed Journal: Biol Methods Protoc ISSN: 2396-8923
ARTIC amplicons selected for STArS protocol
| ARTIC amplicon | Genomic coordinates | SARS-CoV-2 Gene | Common mutations |
|---|---|---|---|
| 51 | 15 171–15 560 | ORF1b | − (detection amplicon) |
| 71 | 21 357–21 743 | ORF1b/S | L18F, T19R, T19I, T20N, L24S, del25/27, P26S |
| 72 | 21 658–22 038 | S | Q52R, del21765:6, D80A, D138Y, G142D, del21991:3 |
| 73 | 21 961–22 346 | S | E156G, del157/158, R190S, V231G, D215G |
| 75 | 22 516–22 903 | S | G339D, S371F, S373P, S375F, T376A, D405N, R408S, K417T, K417N, N439K, N440K |
| 76 | 22 797–23 214 | S | L452R, S477N, T478K, E484K, E484A, Q493R, Q498R, N501Y, Y505H |
| 77 | 23 122–23 522 | S | A570D, D614G |
| 78 | 23 443–23 847 | S | H655Y, Q677H, N679K, P681H, P681R, A701V, T716I |
| 81 | 24 391–24 789 | S | D950N, Q954H, N969K, S982A, T1027I |
| 93 | 28 081–28 464 | ORF8/N | Y73C, S84L, E92K, del119/120, del31/33, D3L, del : 28278:3 (detection amplicon) |
ARTIC amplicons selected for identifying and genotyping SARS-CoV-2 virus are reported, together with genomic positions and annotated gene.
SARS-CoV-2 RT–qPCR Ct values of clinical samples
| Sequencing experiment | Sample ID | Ct N gene | Date of collection |
|---|---|---|---|
| Run 1 | 18 | 28.1 | 9 February 2021 |
| 326 | 30.6 | 8 February 2021 | |
| 128 | 31.6 | 9 February 2021 | |
| 241 | 33.9 | 9 February 2021 | |
| 331 | 34.3 | 8 February 2021 | |
| 325 | 30.6 | 8 February 2021 | |
| 282 | 38.4 | 9 February 2021 | |
| 172 | 40.5 | 5 February 2021 | |
| 41 | 33.4 | 28 January 2021 | |
| 123 | 33.2 | 11 February 2021 | |
| 80 | 27.2 | 11 February 2021 | |
| Run 2 | 08 | 25.0 | 20 February 2021 |
| 09 | 30.0 | 20 February 2021 | |
| 10 | ND | 20 February 2021 |
For each sample, the cycle threshold (Ct) values, as detected on N gene is reported. ND: not detectable.
RT–qPCR and sequencing results of PCs
| Sequencing experiment | Sample ID | Ct | Sample barcode | Num. PASS reads | Num. PASS reads mapped to SARS-CoV-2 genome | Mean PASS filtered read length (SD) |
|---|---|---|---|---|---|---|
| Run 1 | Ampl_96_10_copies_amplified_rep1 | 5.0 | BC13 | 10 685 | 10 668 | 511 (24) |
| Ampl_96_10_copies_amplified_rep2 | 5.0 | BC14 | 7707 | 7696 | 511 (23) | |
| Ampl_96_10^9_copies_rep1 | 11.3 | BC21 | 1809 | 1799 | 508 (13) | |
| Ampl_96_10^9_copies_rep2 | 11.3 | BC22 | 1950 | 1943 | 508 (12) | |
| Ampl_96_10^7_copies_rep1 | 17.3 | BC15 | 48 | 44 | 520 (42) | |
| Ampl_96_10^7_copies_rep2 | 17.3 | BC16 | 22 | 22 | 503 (23) | |
| Ampl_96_10^5_copies_rep1 | 23.7 | BC17 | 9 | 9 | 520 (37) | |
| Ampl_96_10^5_copies_rep2 | 23.7 | BC18 | 6 | 5 | 546 (80) | |
| Ampl_96_10^3_copies_rep1 | 30.8 | BC19 | 8 | 8 | 516 (24) | |
| Ampl_96_10^3_copies_rep2 | 30.8 | BC20 | 7 | 7 | 534 (72) | |
| Run 2 | Ampl_96_10_copies_amplified | 5.0 | BC12 | 434 | 429 | 502 (25) |
For each sample, the Ct, the length, the number of reads and number of reads mapped to SARS-CoV-2 genome are reported.
Figure 1:Correlation between sequencing and RT–qPCR results for PC. The number of reads assigned to each dilution of amplicon 96 of the PC and their Ct is reported. Linear regression lines and confidence intervals are also shown.
Sequencing results of clinical samples
| Sequencing experiment | Sample ID | Sample barcode | Num. PASS reads | Num. PASS reads mapped to SARS-CoV-2 genome | Mean PASS filtered read length (SD) | Score | Outcome |
|---|---|---|---|---|---|---|---|
| Run 1 | 18 | BC01 | 35 450 | 35 369 | 509 (15) | 10 | + |
| 326 | BC02 | 87 785 | 87 675 | 504 (12) | 10 | + | |
| 128 | BC03 | 41 116 | 41 092 | 505 (17) | 10 | + | |
| 241 | BC04 | 42 388 | 42 330 | 508 (15) | 10 | + | |
| 331 | BC05 | 4396 | 4341 | 515 (21) | 4 | + | |
| 325 | BC06 | 2096 | 2031 | 504 (18) | 7 | + | |
| 282 | BC07 | 50 | 30 | 520 (53) | 3 | + | |
| 172 | BC08 | 55 | 6 | 526 (78) | 3 | + | |
| 41 | BC09 | 2448 | 2268 | 509 (26) | 4 | + | |
| 123 | BC10 | 235 674 | 235 483 | 508 (15) | 10 | + | |
| 80 | BC11 | 170 356 | 170 018 | 502 (18) | 10 | + | |
| NTC | BC12 | 717 | 2 | 490 (75) | NA | NA | |
| Run 2 | 08 | BC08 | 456 | 448 | 503 (16) | 9 | + |
| 09 | BC09 | 13 | 13 | 501 (15) | 3 | + | |
| 10 | BC10 | 1 | 0 | 478 (NA) | 0 | − | |
| NTC | BC11 | 0 | 0 | − | NA | NA |
For each sample, the number and length of reads are reported. Moreover, the total Score is reported, together with the predicted outcome. ‘+’: positive classification; ‘−’: negative classification; ‘?’: inconclusive test. NTC: no template control.
Figure 2:Correlation between sequencing and RT–qPCR results. The number of reads assigned to each sample and their Ct on N gene is reported. Linear regression lines and confidence intervals are also shown.
Figure 3:SARS-CoV-2 genome sequencing coverage for Run 1. For each sample, the number of reads mapped to each amplicon is reported in log10 scale.
Figure 4:Identification of SARS-CoV-2 strains using STArS. The heatmap represents the distribution of a subset of mutations of interest (rows) across the samples (columns). Blue stands for reference genotype, red stands for variant genotype, while grey stands for not genotypable position. Sample ‘172’ was excluded from the heatmap, since no position was genotypable.