| Literature DB >> 35746701 |
Maria Szargut1, Sandra Cytacka1, Karol Serwin2, Anna Urbańska2, Romain Gastineau3, Miłosz Parczewski2, Andrzej Ossowski1.
Abstract
The COVID-19 pandemic demonstrated how rapidly various molecular methods can be adapted for a Public Health Emergency. Whether a need arises for whole-genome studies (next-generation sequencing), fast and high-throughput diagnostics (reverse-transcription real-time PCR) or global immunization (construction of mRNA or viral vector vaccines), the scientific community has been able to answer all these calls. In this study, we aimed at the assessment of effectiveness of the commercially available solution for full-genome SARS-CoV-2 sequencing (AmpliSeq™ SARS-CoV-2 Research Panel and Ion AmpliSeq™ Library Kit Plus, Thermo Fisher Scientific). The study is based on 634 samples obtained from patients from Poland, with varying viral load, assigned to a number of lineages. Here, we also present the results of protocol modifications implemented to obtain high-quality genomic data. We found that a modified library preparation protocol required less viral RNA input in order to obtain the optimal library quantity. Concurrently, neither concentration of cDNA nor reamplification of libraries from low-template samples improved the results of sequencing. On the basis of the amplicon success rates, we propose one amplicon to be redesigned, namely, the r1_1.15.1421280, for which less than 50 reads were produced by 44% of samples. Additionally, we found several mutations within different SARS-CoV-2 lineages that cause the neighboring amplicons to underperform. Therefore, due to constant SARS-CoV-2 evolution, we support the idea of conducting ongoing sequence-based surveillance studies to continuously validate commercially available RT-PCR and whole-genome sequencing solutions.Entities:
Keywords: COVID-19; NGS; SARS-CoV-2; library preparation; molecular epidemiology; variants of concern; whole-genome sequencing
Mesh:
Year: 2022 PMID: 35746701 PMCID: PMC9227152 DOI: 10.3390/v14061230
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1Correlation between initial RNA input and library construction with regard to the number of amplification cycles (no-normalization group and no library reamplification).
Overview of samples subjected to library construction, amplification protocols, and quality of constructed libraries.
| Sequencing Batch | Number of Samples | Samples with Optimal Viral Copy Number per Reaction (≥200) | Samples that Yielded | Reamplified Libraries | Reamplified | Number of Fasta Files Obtained | Amplification Protocol |
|---|---|---|---|---|---|---|---|
| R1 | 80 | 78 | 65 | 9 | 8 | 80 | NORMALIZATION PROTOCOL |
| R2 | 80 | 73 | 50 | 30 | 29 | 79 | |
| SUBTOTAL 1 (R1–R2) | 160 | 151 | 115 | 39 | 37 | 159 | |
| R3 | 79 | 47 | 54 | 25 | 21 | 79 | NO-NORMALIZATION PROTOCOL |
| R4 | 79 | 78 | 78 | 1 | 0 | 78 | |
| R5 | 77 | 51 | 75 | 2 | 0 | 77 | |
| R6 | 77 | 76 | 77 | 0 | - | 77 | |
| R7 | 81 | 81 | 81 | 0 | - | 81 | |
| R8 | 81 | 81 | 81 | 0 | - | 81 | |
| SUBTOTAL 2 (R3–R8) | 474 | 414 | 446 | 28 | 21 | 473 | |
| TOTAL | 634 | 565 | 561 | 67 | 58 | 632 |
1 normalization protocol: samples were divided into three groups on the basis of the CT value: ≤22 samples were normalized to 20,000 viral copy number/reaction and amplified for 17 cycles; 23–25 samples were normalized to 2500 viral copy number/reaction and amplified for 20 cycles; 26–31 samples were normalized to 78 viral copy number/reaction and amplified for 25 cycles; 2 no-normalization protocol: samples were divided into four groups on the basis of the CT value: 12–17 samples were amplified for 13 cycles; 18–22 samples were amplified for 18 cycles, 23–27 samples were amplified for 23 cycles; >27 samples were concentrated for 15 min in 45 °C, afterwards water was added for a final volume of 10 μL, and samples were amplified for 27 cycles (conc.–concentration).
Results of concentration of low-template RNA isolates (conc.–concentration).
| N GENE CT | VIRAL RNA COPY PER REACTION | LIBRARY CONC. (pM) | REAMPLI-FIED | READS | ON TARGET | MEAN DEPTH | UNIFORMITY | ||
|---|---|---|---|---|---|---|---|---|---|
| LOW-TEMPLATE | MEAN | 30.1 | 99.3 | 340.1 | 73.0 | 525,759.0 | 72.06% | 2380.7 | 42.96% |
| MEDIAN | 29.0 | 78.0 | 86.6 | 31.6 | 347,048.0 | 84.94% | 503.9 | 38.67% | |
| SD | 2.0 | 50.9 | 721.1 | 106.3 | 560,685.2 | 26.48% | 3126.2 | 30.29% | |
| LOW-TEMPLATE SAMPLES, NOT | MEAN | 29.0 | 104.0 | 100.2 | 105.9 | 566,962.3 | 74.48% | 2174.1 | 43.41% |
| MEDIAN | 29.0 | 78.0 | 74.7 | 91.4 | 580,470.0 | 77.98% | 718.1 | 38.91% | |
| SD | 1.1 | 48.6 | 90.3 | 68.5 | 440,854.7 | 16.89% | 2370.6 | 26.86% | |
Mean parameters of samples and libraries constructed within the study; basic quality metrics of sequencing runs (conc.–concentration).
| N GENE CT | VIRAL RNA COPY PER REACTION | LIBRARY CONC. (pM) (TARGET 70 pM) | REAMPLIFIED | READS | ON TARGET | MEAN DEPTH | UNIFORMITY | ||
|---|---|---|---|---|---|---|---|---|---|
| SUMMARIZED | OVERALL | 20.7 | 517,043.1 | 1432.7 | 72.0 | 1,096,133.6 | 94.74% | 6866.4 | 80.66% |
| R1–R2 | 19.5 | 103,894.7 | 283.6 | 72.4 | 1,053,337.7 | 96.28% | 6400.0 | 73.60% | |
| R3–R8 | 21.8 | 685,312.1 | 1817.4 | 71.5 | 1,110,459.0 | 93.97% | 7023.5 | 84.18% | |
| OPTIMAL CONC. | OVERALL | 20.4 | 578,483.9 | 1616.2 | - | 1,209,977.6 | 97.18% | 7618.0 | 88.93% |
| R1–R2 | 18.7 | 122,172.9 | 379.4 | - | 1,348,146.2 | 97.64% | 8315.9 | 85.08% | |
| R3–R8 | 21.5 | 713,803.7 | 1919.5 | - | 1,176,098.3 | 97.01% | 7446.2 | 90.34% | |
| OPTIMAL INITIAL RNA INPUT | OVERALL | 19.7 | 574,396.9 | 1570.4 | 66.3 | 1,165,060.8 | 98.38% | 7411.8 | 86.80% |
| R1–R2 | 19.0 | 108,337.3 | 292.7 | 66.3 | 1,078,694.5 | 97.28% | 6639.7 | 74.95% | |
| R3–R8 | 20.3 | 775,134.2 | 2036.4 | - | 1,196,561.5 | 99.02% | 7694.9 | 93.68% | |
| OPTIMAL CONC. AND INITIAL RNA INPUT | OVERALL | 20.1 | 737,264.0 | 1772.5 | - | 1,213,934.0 | 98.71% | 7750.6 | 91.40% |
| R1–R2 | 19.5 | 124,233.1 | 398.5 | - | 1,311,516.7 | 97.72% | 8094.1 | 84.08% | |
| R3–R8 | 20.3 | 924,833.2 | 2044.0 | - | 1,194,652.6 | 99.03% | 7682.4 | 93.70% | |
| REAMPLIFIED | OVERALL | 23.1 | 33,212.4 | 29.4 | 72.0 | 167,032.7 | 80.00% | 723.8 | 32.84% |
| R1–R2 | 22.3 | 45,257.7 | 30.9 | 72.1 | 248,339.7 | 90.58% | 1142.5 | 37.93% | |
| R3–R8 | 31.0 | 87.8 | 27.4 | 72.0 | 53,783.7 | 64.71% | 119.1 | 25.48% | |
| REAMPLIFIED WITH OPTIMAL CONC. | OVERALL | 22.6 | 35,579.1 | 29.5 | 81.5 | 184,595.0 | 79.81% | 801.0 | 35.36% |
| R1–R2 | 22.3 | 45,257.7 | 30.9 | 72.1 | 248,339.7 | 90.58% | 1142.5 | 37.93% | |
| R3–R8 | 33.0 | 91.0 | 26.5 | 100.9 | 53,750.7 | 57.71% | 100.0 | 30.07% |
Amplicons with low mean percentage of reads within total sample’s reads for specific SARS-CoV-2 lineages and potential SNPs associated with amplification failure.
| LINEAGE | NO OF SAMPLES | TARGET REGION | RANGE | RANGE LOST IF AMPLICON FAILS | MEAN % READS WITHIN ALL SAMPLES | CHANGE | POSITION OF VARIANT IN RELATION TO AMPLICON RANGE 1 | VARIANT | % READS | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| 3U | ||
| B.1.1.44 | N = 1 | r1_1.10.67083 | 9179 | 9390 | 9199 | 9251 | 0.199% | 9166C>A | 13U |
| 0.005% |
| B.1.389 | N = 1 | r1_1.10.711902 | 9432 | 9643 | 9477 | 9561 | 0.377% | 9430C>T | 2U | T | 0.004% |
| T | 0.002% | ||||||||||
| B.1.1.317 | N = 1 | r1_1.11.376074 | 9857 | 10,071 | 9906 | 9952 | 0.388% | 9857C>T | 0U | T | 0.008% |
| AY.9 | N = 2 | r1_1.12.1117806 | 11,523 | 11,730 | 11,634 | 11,684 | 0.344% | 11514C>T | 9U | T | 0.002% |
| B.1.160 | N = 2 | r1_1.12.539895 | 11,260 | 11,477 | 11,366 | 11,410 | 0.434% | 11497C>T | 20D | T | 0.082% |
| B.1.1.306 | N = 1 | r1_1.13.620498 | 12,563 | 12,790 | 12,711 | 12,772 | 0.630% | 12805T>C | 15D | ||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
| ||||||||||
|
|
| ||||||||||
|
|
| ||||||||||
| B.1.389 | N = 1 | r1_1.16.1212393 | 15,560 | 15,741 | 15,583 | 15,624 | 0.301% | 15543G>T | 17U | ||
| B.1.258 | N = 12 | r1_1.16.534874 | 15,387 | 15,582 | 15,522 | 15,559 | 0.879% | 15598G>A | 16D | ||
| B.1.389 | N = 1 | r1_1.21.272458 | 20,380 | 20,604 | 20,457 | 20,497 | 0.372% | 20622A>T 20623G>T 20624A>T | 18D | ||
| B.1.1.398 | N = 4 | r1_1.22.1029456 | 21,187 | 21,255 | 21,199 | 21,245 | 0.409% | 21178C>T | 9U | ||
| B.1.2 | N = 5 | r1_1.22.906171 | 21,309 | 21,531 | 21,346 | 21,450 | 0.351% | 21304C>T | 5U |
| 0.002% |
|
| 0.004% | ||||||||||
| AY.9 | N = 2 | r1_1.23.1186794 | 22,246 | 22,446 | 22,320 | 22,367 | 0.407% | 22227C>T | 19U | T | 0.006% |
| B.1.177 | N = 9 | T | 0.004% | ||||||||
| B.1.177.8 | N = 1 | T | 0.003% | ||||||||
| T | 0.009% | ||||||||||
| T | 0.006% | ||||||||||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
| ||||||||||
| AY.1 | N = 2 | r1_1.23.86525 | 21,847 | 22,058 | 21,974 | 22,024 | 0.325% | 21846C>T | 1U | ||
| B.1.617.2 | N = 1 | ||||||||||
| B.1.389 | N = 1 | r1_1.24.394902 | 23,580 | 23,724 | 23,643 | 23,682 | 0.601% | 23730C>T | 6D | ||
| AY.9 | N = 2 | r1_1.24.626090 | 23,008 | 23,224 | 23,049 | 23,088 | 0.380% | 22995C>A | 13U | A | 0.022% |
| AY.1 | N = 2 | ||||||||||
| B.1.221 | N = 32 | r1_1.26.1209362 | 25,666 | 25,887 | 25,797 | 25,844 | 0.249% | 25906G>C | 19D |
| 0.008% |
| C/G (1382/3601) | 0.284% | ||||||||||
| B.1.1.121 | N = 3 | r1_1.27.410513 | 26,651 | 26,847 | 26,801 | 26,838 | 0.424% | 26645C>T | 6U | ||
| AY.9 | N = 2 | r1_1.3.760885 | 2398 | 2616 | 2300 | 2463 | 0.263% | 2388C>T | 10U | ||
| B.1.258 | N = 12 | r1_1.30.1041188 | 29,498 | 29,727 | 29,530 | 29,592 | 0.328% | 29734G>C | 7D | ||
| AY.1 | N = 2 | 29742G>T | 15D | T | 0.013% | ||||||
| AY.9 | N = 2 | T | 0.029% | ||||||||
| T | 0.100% | ||||||||||
| B.1.1.159 | N = 2 | r1_1.5.75163 | 3891 | 4110 | 3973 | 4018 | 0.240% | 4114T>C | 4D | ||
| B.1.1.306 | N = 1 | 3879–3899del | 12U | ||||||||
| B.1.1.44 | N = 1 | r1_1.6.1402513 | 4920 | 5129 | 5028 | 5073 | 0.385% | 5147C>T | 18D | T | 0.024% |
| AY.9 | N = 2 | r1_1.6.888565 | 5587 | 5810 | 5671 | 5716 | 0.481% | 5584A>G | 3U | ||
| B.1.1.317 | N = 1 | r1_1.7.488632 | 6293 | 6512 | 6339 | 6379 | 0.561% | 6536G>A | 24D | ||
| B.1.177 | N = 9 | 6286C>T | 7U | ||||||||
| B.1.177.8 | N = 1 | ||||||||||
| B.1.1.44 | N = 1 | r1_1.8.592180 | 6804 | 7017 | 6830 | 6882 | 0.210% | - | |||
| B.1.1.159 | N = 2 | r1_1.8.816048 | 7149 | 7360 | 7284 | 7331 | 0.149% | 7379G>A | 19D | ||
1 U—upstream, D—downstream. Underperforming amplicons are marked red. Variants that differ to those found in lineages are bolded and underlined. % READS—percentage of amplicon’s reads within total reads for the sample that shared the variant.
Prediction of amplicons with low mean percentage of reads within total sample’s reads for SARS-CoV-2 VOC B.1.1.529 and potential SNPs associated with amplification failure.
| LINEAGE | NO OF SAMPLES | TARGET REGION | RANGE | RANGE LOST IF | MEAN % READS WITHIN ALL | CHANGE | POSITION OF VARIANT IN | ||
|---|---|---|---|---|---|---|---|---|---|
| B.1.1.529 | N = 5 | r1_1.11.528369 | 10,453 | 10,679 | 10,603 | 10,655 | 0.197% | 10449C>A | 4U |
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
| ||
| r1_1.23.474025 | 22,494 | 22,685 | 22,582 | 22,622 | 0.436% | 22686C>T | 1D | ||
| r1_1.24.942468 | 23,089 | 23,290 | 23,225 | 23,268 | 0.256% | 23075T>C | 14U | ||
| r1_1.26.781963 | 25,372 | 25,566 | 25,478 | 25,511 | 0.785% | 25584C>T | 18D | ||
| r1_1.27.993816 | 26,588 | 26,800 | 26,606 | 26,650 | 0.332% | 26577C>G | 11U | ||
| r1_1.29.497787 | 28,374 | 28,606 | 28,463 | 28,512 | 0.521% | 28,362–28,370del * | 4U | ||
1 U—upstream, D—downstream. Underperforming amplicons are marked red. * Deletion present in four out of five samples tested.