| Literature DB >> 36056719 |
Florence Nicot1, Pauline Trémeaux1, Justine Latour1, Nicolas Jeanne1, Noémie Ranger1, Stéphanie Raymond1,2, Chloé Dimeglio1,2, Gérald Salin3, Cécile Donnadieu3, Jacques Izopet1,2.
Abstract
Fast, accurate sequencing methods are needed to identify new variants and genetic mutations of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome. Single-molecule real-time (SMRT) Pacific Biosciences (PacBio) provides long, highly accurate sequences by circular consensus reads. This study compares the performance of a target capture SMRT PacBio protocol for whole-genome sequencing (WGS) of SARS-CoV-2 to that of an amplicon PacBio SMRT sequencing protocol. The median genome coverage was higher (p < 0.05) with the target capture protocol (99.3% [interquartile range, IQR: 96.3-99.5]) than with the amplicon protocol (99.3% [IQR: 69.9-99.3]). The clades of 65 samples determined with both protocols were 100% concordant. After adjusting for Ct values, S gene coverage was higher with the target capture protocol than with the amplicon protocol. After stratification on Ct values, higher S gene coverage with the target capture protocol was observed only for samples with Ct > 17 (p < 0.01). PacBio SMRT sequencing protocols appear to be suitable for WGS, genotyping, and detecting mutations of SARS-CoV-2.Entities:
Keywords: SARS-CoV-2 genotyping; SMRT sequencing; long read sequencing; whole-genome sequencing
Year: 2022 PMID: 36056719 PMCID: PMC9539136 DOI: 10.1002/jmv.28123
Source DB: PubMed Journal: J Med Virol ISSN: 0146-6615 Impact factor: 20.693
Figure 1Target capture and amplicon protocol workflows for SARS‐CoV‐2 sequencing. cDNA, complementary DNA; PCR, polymerase chain reaction; SMRT, single‐molecule real‐time.
Figure 2Median read depth (blue line) and maximum coverage (blue area) along the entire SARS‐CoV‐2 genome (reference NC‐045512.2) (A) target capture or (B) amplicon. 3′‐UTR, 3′‐untranslated region; ORF, open reading frame.
Figure 3Whole genome (A) and S gene (B) coverage depending on the N gene cycle threshold with the two protocols.
Spike gene mutations detected by target capture and amplicon protocols in samples gave discrepant results (n = 21)
| Sample | Clade |
| Target capture | Amplicon | ||||
|---|---|---|---|---|---|---|---|---|
| S gene coverage (%) | Missing position in spike (AA) | Mutation detected | S gene coverage (%) | Missing position in spike (AA) | Mutation detected | |||
| 1 | 21J (Delta) | 25 | 99.6 | 522–526 | T19R, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G, | 51.98 | 683–1274 | T19R, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G |
| 2 | 21J (Delta) | 21 | 100 | _ | T19R, G142D, E156‐, F157‐, R158G, | 45.88 | 343–1032 | T19R, G142D, E156‐, F157‐, R158G |
| 3 | 21J (Delta) | 24.7 | 3 | 1–1236 | 51.98 | 683–1274 |
| |
| 4 | 21J (Delta) | 25 | 7.7 | 1–1176 | 45.88 | 343–1032 |
| |
| 9 | 21J (Delta) | 17.6 | 100 | _ | T19R, G142D, E156‐, F157‐, R158G, | 45.88 | 343–1032 | T19R, G142D, E156‐, F157‐, R158G |
| 10 | 21J (Delta) | 22.3 | 89.7 | 255‐296,522‐607 | T19R, G142D, E156‐, F157‐, R158G, P251L, L452R, T478K, D614G, | 51.98 | 683–1274 | T19R, G142D, E156‐, F157‐, R158G, P251L, L452R, T478K, D614G |
| 13 | 20I (Alpha, V1) | 18.3 | 100 | _ | H69‐, V70‐, Y144‐, | 26.46 | 343–1032, 1051–1274 | H69‐, V70‐, Y144‐ |
| 14 | 21J (Delta) | 21.6 | 85.9 | 22–26, 252–296, 512–577, 803–866 | T19R, T95I, G142D, E156‐, F157‐, R158G, | 47.58 | 343–661, 683–1032 | T19R, T95I, E156‐, F157‐, G142D, R158G |
| 19 | 21J (Delta) | 24 | 99.6 | 22–26 | T19R, T95I, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G, | 51.98 | 683–1274 | T19R, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G |
| 23 | 21J (Delta) | 19.9 | 99.2 | 252–256, 292–296 | T19R, G142D, E156‐, F157‐, R158G, | 53.44 | 683–1032, 1051–‐1274 | T19R, T95I, G142D, E156‐, F157‐, R158G |
| 24 | 21J (Delta) | 22.5 | 100 | _ | T19R, T29A, G142D, E156‐, F157‐, R158G, T250I, T299I, | 25.31 | 343–1274 | T19R, T29A, G142D, E156‐, F157‐, R158G, T250I, T299I |
| 29 | 21J (Delta) | 21.1 | 100 | _ | E156‐, F157‐, T19R, G142D, R158G, | 47.58 | 343–661, 683–1032 | T19R, G142D, E156‐, F157‐, R158G |
| 32 | 19A | 27.0 | NA | 1–1273 | 51.98 | 683–1274 |
| |
| 35 | 21J (Delta) | 25.0 | 94.2 | 22–26, 522–526 | T19R, T95I, G142D, E156‐, F157‐, R158G, | 47.58 | 343–661, 683–1032 | T19R,T95I,G142D,E156‐,F157‐,R158G |
| 37 | 21J (Delta) | 22.3 | 97.6 | 262–286, 292–296 | T19R, G142D, E156‐, F157‐, R158G, | 47.58 | 343–661, 683–1032 | T19R, G142D, E156‐, F157‐, R158G |
| 44 | 21J (Delta) | 22.6 | 100 | _ | T19R, T95I, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G, | 53.47 | 683–1032, 1051–1274 | T19R, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G |
| 46 | 21J (Delta) | 24 | 64.8 | 1–26, 252–296, 522‐866, 1093‐1126 | G142D, E156‐, F157‐, R158G, | 45.88 | 343–1032 | L5F, T19R, G142D, E156‐, F157‐, R158G |
| 47 | 21J (Delta) | 25 | 87.6 | 1–26, 252–296, 522–526, 752–786, 799, 801, 803–816, 833–866 | G142D, E156‐, F157‐, R158G, | 28.32 | 343–661, 681–1032, 1051–1274 | T19R, G142D, E156‐, F157‐, R158G |
| 63 | 20A | 22.7 | 7.7 | 1–1176 | 48.6 | 20–325, 683–1032 |
| |
| 71 | 20E (EU1) | 17 | 100 | _ | A222V, | 57.5 | 343–661, 1051–1274 | A222V |
| 77 | 20I (Alpha, V1) | 22.5 | 100 | _ |
| 48.1 | 1–661 | P681H, T716I, S982A, A1020S, D1118H |
Note: Mutations in bold were detected by only one of the protocols.
Abbreviations: AA, amino acid; NA, not applicable.
Clade based on target capture or amplicon analysis (if not determined with the target capture sequence).
Amplicons covering S gene with 1.2 kb protocol: NC_045512.2: 21 563–22 612 (spike AA mutations: 1–350), NC_045512.2: 22 538–23 631 (spike AA mutations, covering the receptor binding domain (RBD): 325–690), NC_045512.2: 23 545–24 736 (spike AA mutations: 661–1058) and NC_045512.2: 24 659–25 790 (spike AA mutations: 1032–1409).