| Literature DB >> 36146662 |
Ivan Kotov1,2, Valeriia Saenko1, Nadezhda Borisova1, Anton Kolesnikov1, Larisa Kondrasheva1, Elena Tivanova1, Kamil Khafizov1, Vasily Akimkin1.
Abstract
Significant efforts are being made in many countries around the world to respond to the COVID-19 pandemic by developing diagnostic reagent kits, identifying infected people, determining treatment methods, and finally producing effective vaccines. However, novel coronavirus variants may potentially reduce the effectiveness of all these efforts, demonstrating increased transmissibility and abated response to therapy or vaccines, as well as the possibility of false negative results in diagnostic procedures based on nucleic acid amplification methods. Since the end of 2020, several variants of concern have been discovered around the world. When information about a new, potentially more dangerous strain of pathogen appears, it is crucial to determine the moment of its emergence in a region. Eventually, that permits taking timely measures and minimizing new risks associated with the spreading of the virus. Therefore, numerous nations have made tremendous efforts to identify and trace these virus variants, which necessitates serious technological processes to sequence a large number of viral genomes. Here, we report on our experience as one of the primary laboratories involved in monitoring SARS-CoV-2 variants in Russia. We discuss the various approaches used, describe effective protocols, and outline a potential technique combining several methods to increase the ability to trace genetic variants while minimizing financial and labor costs.Entities:
Keywords: NGS; SARS-CoV-2; bioinformatics; coronavirus; multiplex PCR; sequencing
Mesh:
Substances:
Year: 2022 PMID: 36146662 PMCID: PMC9504788 DOI: 10.3390/v14091855
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1Timeline of coverage quality for S-protein gene sequencing from March of 2021 to July of 2022. The orange line represents the average number of 10+ read-covered regions (out of 20) in a single run. Yellow region shows 95%-confidence region for runs (2.5% of the best and worst results are discarded). The red line demonstrates the level of 10 regions covered by at least 10 reads, which is already a sample sequencing failure.
Figure 2Schematic diagram of the pipeline for updating primer pools.
Figure 3Coverage rates for 20 target regions of SARS-CoV-2 S-protein gene. Two series of 180 samples are presented. The upper color mesh (A) shows the coverage drop after the Delta variant became widespread. Several primer pairs became unstable-working, especially pair 7, which demonstrated zero coverage in the majority of cases. After the primer replacement, lower color mesh (B) was obtained. More uniform coverage allowed for an increase in the number of samples per sequencing run, and also made it possible to obtain high-quality consensus sequences.
Figure 4(A) A scheme of the iterative algorithm for optimizing large primer pools. (B) The graph shows the relationship between the number of optimization steps and the total of all generated dimers across all pools. One optimization step means “fitting” procedure for a single primer pair with its possible transfer between pools.
Figure 5Coverage rates for 98 target regions of SARS-CoV-2 whole genome. Two series of 47 and 24 samples are presented for old (A) and new (B) poolings, respectively.