| Literature DB >> 34756072 |
Wasfi Fares1, Kais Ghedira2, Mariem Gdoura1,3, Anissa Chouikha1, Sondes Haddad-Boubaker1, Marwa Khedhiri1, Kaouthar Ayouni1, Asma Lamari1, Henda Touzi1, Walid Hammemi1, Zina Medeb1, Amel Sadraoui1, Nahed Hogga1, Nissaf Ben Alaya4,5, Henda Triki1,5.
Abstract
Recent efforts have reported numerous variants that influence severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) viral characteristics, including pathogenicity, transmission rate, and detectability by molecular tests. Whole-genome sequencing based on next-generation sequencing technologies is the method of choice to identify all viral variants; however, the resources needed to use these techniques for a representative number of specimens remain limited in many low- and middle-income countries. To decrease sequencing costs, we developed a primer set allowing partial sequences to be generated in the viral S gene, enabling rapid detection of numerous variants of concern (VOCs) and variants of interest (VOIs); whole-genome sequencing is then performed on a selection of viruses based on partial sequencing results. Two hundred one nasopharyngeal specimens collected during the decreasing phase of a high-transmission COVID-19 wave in Tunisia were analyzed. The results reveal high genetic variability within the sequenced fragment and allow the detection of first introductions in the country of already-known VOCs and VOIs, as well as other variants that have interesting genomic mutations and need to be kept under surveillance. IMPORTANCE The method of choice for SARS-CoV-2 variant detection is whole-genome sequencing using next-generation sequencing (NGS) technologies. Resources for this technology remain limited in many low- and middle-income countries, where it is not possible to perform whole-genome sequencing for representative numbers of SARS-CoV-2-positive cases. In the present work, we developed a novel strategy based on a first partial Sanger screening in the S gene, which includes key mutations of the already known VOCs and VOIs, for rapid identification of these VOCs and VOIs and to help better select specimens that need to be sequenced by NGS technologies. The second step consists of whole-genome sequencing to allow a holistic view of all variants within the selected viral strains and confirm the initial classification of the strains based on partial S gene sequencing.Entities:
Keywords: COVID-19; SARS-CoV-2; Tunisia; VOCs; VOIs; spike protein; whole-genome sequencing
Mesh:
Substances:
Year: 2021 PMID: 34756072 PMCID: PMC8579926 DOI: 10.1128/Spectrum.00639-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Sample collection period investigated in the present study. The graph displays the numbers of cases and the numbers of deaths in Tunisia since the declaration of the pandemic in March 2020. The x axis represents the number of weeks from March 2020 until May 2021. Weeks highlighted in red represent the sample collection period investigated in the present study.
FIG 2Phylogenetic tree of 201 SARS-CoV-2 sequences based on partial S gene nucleotide sequencing. The phylogenetic tree includes 201 Tunisian sequences compared to 9 representative reference sequences of SARS-CoV-2 clades. The tree was constructed using the neighbor-joining method and the Tamura 3-parameter (T92) model. Topology was supported by 1,000 bootstrap replicates. The sequences reported in this study are indicated by the laboratory code. The sequences downloaded from GISAID are indicated by their accession numbers. Cluster 1, marked in purple, includes sequences presenting the D614G substitution and the lack of the amino acid substitution N501Y. Cluster 2, marked in blue, includes sequences having the N501Y, A570D, D614G, and P681H substitutions. Cluster 3, marked in red, groups sequences with the N501Y, A653V, and H655Y substitutions and lacking the amino acid substitution D614G.
Amino acid substitution profile in the sequenced fragment of the S gene of the samples investigated in the present study
| Mutation profile | No. of samples with mutation(s) in cluster: | ||
|---|---|---|---|
| 1 ( | 2 ( | 3 ( | |
| E484K, D614G | 2 | 0 | 0 |
| E484K, D614G, Q677H | 1 | 0 | 0 |
| D614G, S637L, A647S | 1 | 0 | 0 |
| D614G, I666L | 1 | 0 | 0 |
| D614G, Q675L | 1 | 0 | 0 |
| D574Y, D614G, A626S | 1 | 0 | 0 |
| D614G, A626S | 2 | 0 | 0 |
| D614G, V622F | 1 | 0 | 0 |
| D614G, E619Q | 1 | 0 | 0 |
| D614G, D627E | 16 | 0 | 0 |
| D614G | 147 | 0 | 0 |
| N501Y, A570D, D614G, P681H | 0 | 15 | 0 |
| N501Y, A653V, Q655H | 0 | 0 | 11 |
| N501Y, A653V, Q655H, Q677H | 0 | 0 | 1 |
Mutation profile by WGS
Amino acid substitutions previously found by partial sequencing in the S gene are marked with a star.
FIG 3Phylogenetic tree of 18 SARS-CoV-2 whole-genome sequences circulating in Tunisia compared to 9 reference strain genomes. The phylogenetic tree includes 18 Tunisian sequences compared to 9 representative reference sequences of SARS-Cov-2 clades. The tree was constructed using the neighbor-joining method and the Tamura 3-parameter (T92) model. Topology was supported by 1,000 bootstrap replicates. The sequences reported in this study are indicated by the laboratory code. The sequences downloaded from GISAID database are indicated by their accession numbers. Cluster 1 is marked in purple, clade 2 is marked in blue, and clade 3 is marked in red.