| Literature DB >> 35358277 |
Mirzakamol S Ayubov1, Zabardast T Buriev1, Mukhammadjon K Mirzakhmedov1, Abdurakhmon N Yusupov1, Dilshod E Usmanov1, Shukhrat E Shermatov1, Khurshida A Ubaydullaeva1, Ibrokhim Y Abdurakhmonov1.
Abstract
Due to rapid mutations in the coronavirus genome over time and re-emergence of multiple novel variants of concerns (VOC), there is a continuous need for a periodic genome sequencing of SARS-CoV-2 genotypes of particular region. This is for on-time development of diagnostics, monitoring and therapeutic tools against virus in the global pandemics condition. Toward this goal, we have generated 18 high-quality whole-genome sequence data from 32 SARS-CoV-2 genotypes of PCR-positive COVID-19 patients, sampled from the Tashkent region of Uzbekistan. The nucleotide polymorphisms in the sequenced sample genomes were determined, including nonsynonymous (missense) and synonymous mutations in coding regions of coronavirus genome. Phylogenetic analysis grouped fourteen whole genome sample sequences (1, 2, 4, 5, 8, 10-15, 17, 32) into the G clade (or GR sub-clade) and four whole genome sample sequences (3, 6, 25, 27) into the S clade. A total of 128 mutations were identified, consisting of 45 shared and 83 unique mutations. Collectively, nucleotide changes represented one unique frameshift mutation, four upstream region mutations, six downstream region mutations, 50 synonymous mutations, and 67 missense mutations. The sequence data, presented herein, is the first coronavirus genomic sequence data from the Republic of Uzbekistan, which should contribute to enrich the global coronavirus sequence database, helping in future comparative studies. More importantly, the sequenced genomic data of coronavirus genotypes of this study should be useful for comparisons, diagnostics, monitoring, and therapeutics of COVID-19 disease in local and regional levels.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35358277 PMCID: PMC8970392 DOI: 10.1371/journal.pone.0266417
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary data for the COVID-19 samples used in this study.
| ID | Collection date | Sex | Age | Coverage | GISAID acccession # | NCBI accession # | Clade |
|---|---|---|---|---|---|---|---|
| 1 | 2020-10-14 | Male | 46 | 39785 | EPI_ISL_1402423 | MW853559.1 | GR |
| 2 | 2020-10-14 | Female | 36 | 27370 | EPI_ISL_1477044 | -* | GR |
| 3 | 2020-10-14 | Female | 69 | 33746 | EPI_ISL_1439578 | MW853560.1 | S |
| 4 | 2020-10-14 | Male | 69 | 24333 | EPI_ISL_1402424 | MW853561.1 | GR |
| 5 | 2020-10-14 | Male | 39 | 30782 | EPI_ISL_1402425 | MW853562.1 | GR |
| 6 | 2020-10-14 | Male | 56 | 29427 | EPI_ISL_1402426 | - | S |
| 7 | 2020-10-14 | Male | 39 | 24940 | EPI_ISL_1439580 | MW853563.1 | GR |
| 8 | 2020-10-14 | Female | 22 | 30129 | EPI_ISL_1402427 | MW853564.1 | GR |
| 10 | 2020-10-14 | Male | 36 | 8732 | EPI_ISL_1477045 | - | GR |
| 11 | 2020-10-14 | Female | 47 | 33539 | EPI_ISL_1402428 | MW853565.1 | GR |
| 12 | 2020-10-14 | Female | 41 | 33025 | EPI_ISL_1439584 | MW853566.1 | GR |
| 13 | 2020-10-14 | Female | 52 | 26892 | EPI_ISL_1477046 | - | GR |
| 14 | 2020-10-14 | Female | 67 | 18178 | EPI_ISL_1439589 | MW853567.1 | GR |
| 15 | 2020-10-14 | Female | 71 | 19556 | EPI_ISL_1439593 | MW853568.1 | GR |
| 17 | 2020-12-07 | Female | 20 | 444.5 | EPI_ISL_1477047 | - | GR |
| 25 | 2020-12-07 | Female | 63 | 6348 | EPI_ISL_1439597 | MW853569.1 | S |
| 27 | 2020-12-07 | Male | 36 | 3380 | EPI_ISL_1477048 | - | S |
| 32 | 2020-12-07 | Female | 37 | 21180 | EPI_ISL_1477049 | - | GR |
*—These samples only have accession numbers in GISAID but not in NCBI. All sequence reads (EPI_ISL_1402423 to EPI_ISL_1477049; S1 File) can be retrieved from the GISAID database (http://www.gisaid.org/) searching with a key word of ‘hCoV-19/Uzbekistan/Tashkent-CGB’, under the “Virus name” icon of the GISAID search panel.
Fig 1The Uzbek SARS-CoV-2 sequences clustered into 2 clades.
A phylogenetic tree was generated in MEGA X based on the 18 viral sequences of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in samples collected from COVID-19 patients in the Tashkent region using the maximum likelihood method. This maximum likelihood tree was rooted by the Wuhan reference strain NC_045512.2 and produced using 128 mutations discovered in the 18 sequences (see S1 Table). The distribution of mutations shared by two or more sequences. A CA→TC substitution is located at position 28253*, and a GGG → AAC substitution is at 28881**. Bootstrap values are shown.
Fig 2Number of mutations in COVID-19 samples sequenced.
Note that the number of mutations is calculated per individual samples.
Fig 3Global phylogenetic tree provided by nextstrain.org including COVID-19 samples from Uzbekistan.
*Uzbekistan samples are shown with arrows, see S2 Table for variant and origin of Uzbekistan samples.