| Literature DB >> 34960813 |
Sébastien Lhomme1,2, Justine Latour2, Nicolas Jeanne2, Pauline Trémeaux2, Noémie Ranger2, Marion Migueres1,2, Gérald Salin3, Cécile Donnadieu3, Jacques Izopet1,2.
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is the causal agent of the COVID-19 pandemic that emerged in late 2019. The outbreak of variants with mutations in the region encoding the spike protein S1 sub-unit that can make them more resistant to neutralizing or monoclonal antibodies is the main point of the current monitoring. This study examines the feasibility of predicting the variant lineage and monitoring the appearance of reported mutations by sequencing only the region encoding the S1 domain by Pacific Bioscience Single Molecule Real-Time sequencing (PacBio SMRT). Using the PacBio SMRT system, we successfully sequenced 186 of the 200 samples previously sequenced with the Illumina COVIDSeq (whole genome) system. PacBio SMRT detected mutations in the S1 domain that were missed by the COVIDseq system in 27/186 samples (14.5%), due to amplification failure. These missing positions included mutations that are decisive for lineage assignation, such as G142D (n = 11), N501Y (n = 6), or E484K (n = 2). The lineage of 172/186 (92.5%) samples was accurately determined by analyzing the region encoding the S1 domain with a pipeline that uses key positions in S1. Thus, the PacBio SMRT protocol is appropriate for determining virus lineages and detecting key mutations.Entities:
Keywords: Illumina sequencing; PacBio SMRT sequencing; S1 domain; SARS-CoV-2; clade; lineage
Mesh:
Substances:
Year: 2021 PMID: 34960813 PMCID: PMC8707593 DOI: 10.3390/v13122544
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Key mutations used for clade/lineage determination.
| Clade | 20I/ | 20I/ | 20I/ | 20H/ | 21A/ | 21A/ | 20J/ | 21H | 20D/ | 20B/ | 20A/ | 19B/ | 19B/ | 20I/ | 20B/P.3 | 20A/ | 20E/ | 20A/ | 20A/ | 21C/ | 21D/ | 21F/ | 20C/ | 20C/ | 21B/ | 20A/ | 20A/ | 21G/ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Lineage | B.1.1.7 | B.1.1.7+ | B.1.1.7+ | B.1.351 | B.1.617.2 | AY.1/AY.2 | P.1 | B.1.621 | C.36.3 | B.1.1.318 | B.1.620 | A.27 | A.28 | B.1.1.7+ | B.1.1.28.3 | B.1.160 | B.1.177 | B.1.214.2 | B.1.221 | B.1.427/ | B.1.525 | B.1.526 | B.1.526.1 | B.1.616 | B.1.617.1 | B.1.617.3 | B.1.619 | C.37 |
| WHO | Alpha | Alpha | Alpha | Beta | Delta | Delta | Gamma | Mu | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | Epsilon | Eta | Iota | N/A | N/A | Kappa | N/A | N/A | Lambda |
| L5F | F | |||||||||||||||||||||||||||
| P9L | ||||||||||||||||||||||||||||
| S12F | F | |||||||||||||||||||||||||||
| S13I | I | |||||||||||||||||||||||||||
| L18F | (F) | F | F | |||||||||||||||||||||||||
| T19R | (R) | (R) | (R) | |||||||||||||||||||||||||
| T20N | N | |||||||||||||||||||||||||||
| P26S | S | S | ||||||||||||||||||||||||||
| Q52R | R | |||||||||||||||||||||||||||
| H66D | D | |||||||||||||||||||||||||||
| A67V | V | |||||||||||||||||||||||||||
| 69del | del | del | del | del | del | del | del | del | ||||||||||||||||||||
| 70del | del | del | del | del | del | del | del | del | ||||||||||||||||||||
| G75V | V | V | ||||||||||||||||||||||||||
| T76I | I | |||||||||||||||||||||||||||
| D80A/G | A | G | ||||||||||||||||||||||||||
| T95I | (I) | (I) | I | I | I | (I) | ||||||||||||||||||||||
| S98F | F | |||||||||||||||||||||||||||
| V126A | A | |||||||||||||||||||||||||||
| C136F | ||||||||||||||||||||||||||||
| D138Y | Y | |||||||||||||||||||||||||||
| 141del | del | |||||||||||||||||||||||||||
| G142D/del/V | (D) | (D) | del | V | (D) | |||||||||||||||||||||||
| 143del | del | |||||||||||||||||||||||||||
| 144del/T | del | del | del | (T) | (del) | del | del | del | del | del | ||||||||||||||||||
| Y145S/insN | (S) | (insN) | ||||||||||||||||||||||||||
| W152R/C | R | C | ||||||||||||||||||||||||||
| E154K | K * | |||||||||||||||||||||||||||
| E156G | (G) | (G) | ||||||||||||||||||||||||||
| 157del/S | (del) | (del) | S | (del) | ||||||||||||||||||||||||
| 158del | (del) | (del) | (del) | |||||||||||||||||||||||||
| R190S | S | |||||||||||||||||||||||||||
| I210T | T | |||||||||||||||||||||||||||
| 214insTDR | insTDR * | |||||||||||||||||||||||||||
| D215G | (G) | G | ||||||||||||||||||||||||||
| A222V | V | (V) | ||||||||||||||||||||||||||
| 242del | (del) | (del) | ||||||||||||||||||||||||||
| 243del | (del) | del | del | |||||||||||||||||||||||||
| 244del | (del) | del | del | |||||||||||||||||||||||||
| H245Y | Y | |||||||||||||||||||||||||||
| 247del | del | |||||||||||||||||||||||||||
| 248del | del | |||||||||||||||||||||||||||
| 249del | del | |||||||||||||||||||||||||||
| 250del | del | |||||||||||||||||||||||||||
| 251del | del | |||||||||||||||||||||||||||
| 252del | del | |||||||||||||||||||||||||||
| D253G/del | G | del | ||||||||||||||||||||||||||
| R346K/S | K | S | ||||||||||||||||||||||||||
| Q414K | K | |||||||||||||||||||||||||||
| K417N/T | (N) | N * | T | |||||||||||||||||||||||||
| N440K | K * | |||||||||||||||||||||||||||
| Y449H | ||||||||||||||||||||||||||||
| N450K | K | |||||||||||||||||||||||||||
| L452R/Q | R * | (R) | R * | R * | R * | R * | R * | R * | R * | Q * | ||||||||||||||||||
| S477N | N * | N | N^ | |||||||||||||||||||||||||
| T478K | K * | (K) | ||||||||||||||||||||||||||
| V483A | A | |||||||||||||||||||||||||||
| E484K/Q | K * | Q * | K * | K * | (K) | K * | K * | K * | K * | K^ | Q * | Q * | K * | |||||||||||||||
| F490S | S | |||||||||||||||||||||||||||
| N501Y/T | Y * | Y * | Y * | Y * | Y * | Y | Y * | T * | Y * | Y * | ||||||||||||||||||
| A570D | D | D | D | D | ||||||||||||||||||||||||
| D614G | G | G | G | G | G | (G) | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | |||
| A653V | V | |||||||||||||||||||||||||||
| H655Y | Y | Y | Y | Y * | ||||||||||||||||||||||||
| G669S | S | |||||||||||||||||||||||||||
| Q677H | H | (H) | H | |||||||||||||||||||||||||
| N679K | ||||||||||||||||||||||||||||
| P681H/R | H | H | H | R | (R) | H | H * | H | H | H | R | R | ||||||||||||||||
| A701V | V | V | ||||||||||||||||||||||||||
| T716I | I | I | I | I | I |
*: Essential mutation to validate the profile. ^: At least one mutation required to validate the profile. (): Possible mutation, not required to validate the profile but improves discrimination. Only mutations detected in the S1 domain were covered by the fragment amplified by PacBio SMRT sequencing.
Sample lineages and clades.
| Clade | Lineage | Month of Collection | Total | |
|---|---|---|---|---|
| January | July | |||
| 20I | B.1.1.7 | 48 | 3 | 51 |
| 20A | B.1.160 | 37 | 37 | |
| 20E(EU1) | B.1.177 | 37 | 37 | |
| 21A | B.1.617.2 | 20 | 20 | |
| 21A | AY.4 | 19 | 19 | |
| 20A | B.1.221 | 8 | 8 | |
| 20A | B.1 | 4 | 4 | |
| 21A | AY.34 | 6 | 6 | |
| 21A | AY.9 | 4 | 4 | |
| 20B | B.1.1.241 | 3 | 3 | |
| 20B | B.1.1.317 | 2 | 2 | |
| 20H | B.1.351 | 2 | 2 | |
| 20C | B.1 | 1 | 1 | |
| 20B | B.1.1.186 | 1 | 1 | |
| 20B | B.1.1.28 | 1 | 1 | |
| 20G | B.1.2 | 1 | 1 | |
| 20A | B.1.214 | 1 | 1 | |
| 21A | AY.23 | 1 | 1 | |
| 21A | AY.7.1 | 1 | 1 | |
| Total | 146 | 54 | 200 | |
List of mutations detected with the Illumina and PacBio SMRT systems in 14 patient samples giving discrepant results.
| Sample | Clade | Lineage | Mutations Detected with Illumina | Mutations Detected with Pacbio |
|---|---|---|---|---|
| 20 | 20A | B.1.214 | T95I, D614G, T716I | T95I, |
| 33 | 20A | B.1.160 | D614G | |
| 62 | 20A | B.1.160 | D614G | |
| 79 | 20A | B.1.160 | D614G | |
| 110 | 20A | B.1.160 | D614G | |
| 111 | 20A | B.1.160 | D614G | |
| 136 | 20B | B.1.1.186 | L18F, D614G | L18F, |
| 30 | 20H (Beta) | B.1.351 | L18F, D80A, D215G, 241del, 242del, 243del, K417N, N501Y, D614G, A701V | L18F, D80A, D215G, 241del, 242del, 243del, K417N, |
| 134 | 20H (Beta) | B.1.351 | D80A, D215G, 241del, 242del, 243del, K417N, D614G, A701V | D80A, D215G, 241del, 242del, 243del, K417N, |
| 15 | 20I (Alpha) | B.1.1.7 | 69del, 70del, 144del, A570D, D614G, P681H, T716I | 69del, 70del, 144del, |
| 27 | 20I (Alpha) | B.1.1.7 | 69del, 70del, 144del, A570D, D614G, P681H, T716I | 69del, 70del, 144del, |
| 56 | 20I (Alpha) | B.1.1.7 | 69del, 70del, 144del, A570D, D614G, P681H, T716I | 69del, 70del, 144del, |
| 73 | 20I (Alpha) | B.1.1.7 | 69del, 70del, 144del, A570D, D614G, P681H, T716I | 69del, 70del, 144del, |
| 116 | 20I (Alpha) | B.1.1.7 | 69del, 70del, 144del, A570D, D614G, P681H, T716I | 69del, 70del, 144del, |
| 143 | 21A (Delta) | AY.4 | T19R, E156G, 157del, 158del, L452R, T478K, D614G, P681R | T19R, |
| 153 | 21A (Delta) | AY.4 | T19R, E156G, 157del, 158del, L452R, T478K, D614G, P681R, D950N | T19R, |
| 163 | 21A (Delta) | AY.4 | T19R, L452R, T478K, D614G, P681R, D950N | T19R, |
| 167 | 21A (Delta) | AY.4 | T19R, L452R, T478K, D614G, P681R, D950N | T19R, |
| 169 | 21A (Delta) | AY.4 | T19R, E156G, 157del, 158del, L452R, T478K, D614G, P681R | T19R, |
| 170 | 21A (Delta) | AY.4 | T19R, L452R, T478K, D614G, P681R | T19R, |
| 189 | 21A (Delta) | AY.4 | T19R, E156G, 157del, 158del, L452R, T478K, D614G, P681R | T19R, |
| 186 | 21A (Delta) | AY.7.1 | T19R, L452R, T478K, D614G, P681R | T19R, |
| 142 | 21A (Delta) | AY.9 | T19R, E156G, 157del, 158del, A222V, L452R, T478K, D614G, P681R | T19R, |
| 183 | 21A (Delta) | AY.9 | L452R, T478K, D614G, P681R, D950N | |
| 185 | 21A (Delta) | AY.9 | T19R, E156G, 157del, 158del, A222V, L452R, T478K, D614G, P681R | T19R, |
| 157 | 21A (Delta) | AY.23 | T19R, L452R, T478K, D614G, P681R, D950N, V1264L | T19R, |
| 154 | 21A (Delta) | AY.34 | T19R, E156G, 157del, 158del, L452R, T478K, D614G, Q677H, P681R, D950N | T19R, |
The mutations listed above were detected using the Wuhan-Hu-1genome sequence (NC_045512.2) as reference. In bold: mutations detected by the PacBio SMRT platform.
Figure 1Sequencing depth over the spike gene with Illumina COVIDseq protocol. Logarithmic representation of the number of reads mapped on the reference genome, zoomed on the spike region. The x-axis was labeled from 21563 and 25384, corresponding to the positions on the complete genome. (A). Sequencing depth over the spike gene with Illumina COVIDseq protocol with singled-end 35 nt read-length sequencing and (B) paired-end 150 nt read-length sequencing. (C) Sequencing depth over the spike gene with PacBio protocol.
Comparison of the lineages determined with full-length genomes and the S1-encoding region based on key positions.
| Full Length Genome (Illumina) | Spike S1 Region (PacBio) | |||
|---|---|---|---|---|
| Clade | Lineage |
| Lineage |
|
| 20A | B.1.221 | 8 | B.1.221 | 8 |
| 20A | B.1 | 4 | NA | 4 |
| 20A | B.1.214 | 1 | NA | 1 |
| 20A | B.1.160 | 37 | B.1.160 | 37 |
| 20B | B.1.1.186 | 1 | NA | 1 |
| 20B | B.1.1.241 | 3 | NA | 3 |
| 20B | B.1.1.28 | 1 | NA | 1 |
| 20B | B.1.1.317 | 2 | NA | 2 |
| 20C | B.1 | 1 | NA | 1 |
| 20E (EU1) | B.1.177 | 37 | B.1.177 | 37 |
| 20G | B.1.2 | 1 | NA | 1 |
| 20H | B.1.351 | 2 | B.1.351 | 2 |
| 20I | B.1.1.7 | 45 | B.1.1.7 | 45 |
| 21A | B.1.617.2 | 17 | B.1.617.2 | 43 |
| 21A | AY.4 | 16 | ||
| 21A | AY.34 | 4 | ||
| 21A | AY.9 | 4 | ||
| 21A | AY.23 | 1 | ||
| 21A | AY.7.1 | 1 | ||
NA: not assigned.