Literature DB >> 33795344

Genome Sequence of a SARS-CoV-2 Strain from a COVID-19 Clinical Sample from the Khagrachari District of Bangladesh.

M Imranul Hoq1, Robiul Hasan Bhuiyan2, M Khondakar Raziur Rahman3, Imam Hossen3, Sajib Rudra3, M Arif Hossain3, Shanta Paul1, M Omer Faruq2, Mohammad Omar Faruque4, H M Abdullah Al Masud5.   

Abstract

This study describes the genome sequence of a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strain detected in the nasopharyngeal swab sample of a coronavirus disease 2019 (COVID-19) patient from the southeastern Khagrachari District of Bangladesh.
Copyright © 2021 Hoq et al.

Entities:  

Year:  2021        PMID: 33795344      PMCID: PMC8104052          DOI: 10.1128/MRA.00189-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The ongoing pandemic of coronavirus disease 2019 (COVID-19), which was first reported in Wuhan, China, in December 2019 (1), is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a Betacoronavirus in the Coronaviridae family (2). In Bangladesh, the first COVID-19 case was detected on 8 March 2020 (3). We report here the genome sequence of SARS-CoV-2 from a 43-year-old male from the Khagrachari District of Bangladesh, who was hospitalized with fever, joint ache, diarrhea and difficulty breathing, and tested positive for COVID-19 by reverse transcriptase PCR (RT-PCR) (4). All the protocols were approved by the Ethical Review Board of the University of Chittagong (reference no. CUBIO0001). Informed consent was obtained from the patient, and the sample was collected with the permission of the Directorate General of Health Services of the Government of Bangladesh. Viral RNA was extracted from the nasopharyngeal swab using a PureLink viral DNA/RNA minikit (catalog no. 12280050; Thermo Fisher Scientific). cDNA synthesis and library preparation were carried out using an Illumina RNA prep with enrichment (L) tagmentation kit (catalog no. 20040537) combined with Illumina respiratory virus oligonucleotide panel v2 (catalog no. 20044311), and the prepared libraries were sequenced on an Illumina MiniSeq system in the paired-end format (read length, 74 bp) according to the manufacturer’s instructions. Duplicates and low-quality reads were removed, and coverage plots were created using Illumina DRAGEN RNA pathogen detection version 3.5.15. The library generated a total of 3,625,976 reads, of which 2,165,542 reads mapped to the reference sequence (GenBank accession no. MN908947.3) using human (hg38) and the Illumina respiratory virus panel, with the human control option of the DRAGEN software, and 1,709,268 reads were found unique after exclusion of the duplicates. The FASTQ data files were exported from the Illumina local run manager to the BaseSpace Sequence Hub; a consensus FASTA file was generated using k-mer analysis (GenBank accession no. NC_045512.2) of the DRAGEN software; and it was revealed that the genome of this strain has 29,856 bp which starts and ends at positions 7 and 29862, respectively, of the reference sequence (29,903 bp). The whole-genome comparison, using the DRAGEN software, of the strain revealed 99.85% identity, with the reference sequence having a mean coverage depth of 303×, whereas no indel was detected. The sequence displays a GC content of 38%. The consensus genome and related sample data were uploaded to the Global Initiative on Sharing All Influenza Data (GISAID) database (accession no. EPI_ISL_735496) on 25 December 2020 (5). Phylogenetic analysis using Nextcladebeta version 0.13.0 (clades.nextstrain.org) assigned the new genome to the SARS-CoV-2 clade 20A (Fig. 1) (6). According to the GISAID database basic local alignment search tool (BLAST), the genome shares the highest levels of similarity with sequences uncovered from Saudi Arabia (GISAID accession no. EPI_ISL_678004, EPI_ISL_513151, EPI_ISL_513149, EPI_ISL_437736, and EPI_ISL_437723) and India (GISAID accession no. EPI_ISL_1073009, EPI_ISL_1073011, EPI_ISL_1073010, and EPI_ISL_1073014) (5).
FIG 1

Phylogenetic tree of a SARS-CoV-2 strain from the Khagrachari District of Bangladesh. The tree was constructed on 19 February 2021 using the Nextcladebeta version 0.13.0 (clades.nextstrain.org), in which the red circle represents the position of hCoV-19/Bangladesh/CU-CTG-24/2020 (GISAID accession no. EPI_ISL_735496).

Phylogenetic tree of a SARS-CoV-2 strain from the Khagrachari District of Bangladesh. The tree was constructed on 19 February 2021 using the Nextcladebeta version 0.13.0 (clades.nextstrain.org), in which the red circle represents the position of hCoV-19/Bangladesh/CU-CTG-24/2020 (GISAID accession no. EPI_ISL_735496). An analysis of the variations, using Genome Detective Virus Tools version 1.132 (7), indicates several changes in the sequence of this strain exhibiting 11 synonymous and 11 nonsynonymous mutations relative to the Wuhan-Hu-1 reference sequence (GenBank accession no. NC_045512.2) (Table 1). According to the GISAID database, 2 mutations, namely, P681H and V1122L, of the spike glycoprotein of this virus were rare among the SARS-CoV-2 strains recovered in Bangladesh (5). As of 19 February 2021, the mutation P681H was also observed in 5 other strains of SARS-CoV-2 recovered in Bangladesh (GISAID accession no. EPI_ISL_906098, EPI_ISL_906091, EPI_ISL_890237, EPI_ISL_890188, and EPI_ISL_774976), whereas the mutation V1122L is still unique in Bangladesh.
TABLE 1

Mutations observed in hCoV-19/Bangladesh/CU-CTG-24/2020 compared with SARS-CoV-2 isolate Wuhan-Hu-1

Nucleotide positionReference nucleotideMutated nucleotideGeneAmino acid change
241CT5′-UTRbNoncoding
1006GTORF1abK247N
1853CTORF1abNone (synonymous mutation)
2836CTORF1abNone (synonymous mutation)
3037CTORF1abNone (synonymous mutation)
4331CTORF1abNone (synonymous mutation)
4755CTORF1abP1497L
6472CTORF1abNone (synonymous mutation)
7119CTORF1abS2285F
7247TCORF1abF2328L
14408CTORF1abP4715L
17056AGORF1abM5598V
18877CTORF1abNone (synonymous mutation)
22444CTSNone (synonymous mutation)
23403AGSD614G
23604CASP681H
24130CTSNone (synonymous mutation)
24926GTSV1122L
25563GTORF3aQ57H
26735CTMNone (synonymous mutation)
28854CTNS194L
29260GTNNone (synonymous mutation)

GenBank accession no. NC_045512.2.

UTR, untranslated region.

Mutations observed in hCoV-19/Bangladesh/CU-CTG-24/2020 compared with SARS-CoV-2 isolate Wuhan-Hu-1 GenBank accession no. NC_045512.2. UTR, untranslated region.

Data availability.

The sequence has been deposited in the GISAID database (accession no. EPI_ISL_735496) and GenBank (accession no. MW599343). The accession number for the raw sequence reads in the NCBI Sequence Read Archive (SRA) is SRR13718002. The BioProject and BioSample accession numbers are PRJNA701790 and SAMN17911680, respectively.
  5 in total

1.  GISAID: Global initiative on sharing all influenza data - from vision to reality.

Authors:  Yuelong Shu; John McCauley
Journal:  Euro Surveill       Date:  2017-03-30

2.  Nextstrain: real-time tracking of pathogen evolution.

Authors:  James Hadfield; Colin Megill; Sidney M Bell; John Huddleston; Barney Potter; Charlton Callender; Pavel Sagulenko; Trevor Bedford; Richard A Neher
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.931

3.  Genome Detective: an automated system for virus identification from high-throughput sequencing data.

Authors:  Michael Vilsker; Yumna Moosa; Sam Nooij; Vagner Fonseca; Yoika Ghysens; Korneel Dumon; Raf Pauwels; Luiz Carlos Alcantara; Ewout Vanden Eynden; Anne-Mieke Vandamme; Koen Deforche; Tulio de Oliveira
Journal:  Bioinformatics       Date:  2019-03-01       Impact factor: 6.937

4.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

5.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2.

Authors: 
Journal:  Nat Microbiol       Date:  2020-03-02       Impact factor: 17.745

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.