Literature DB >> 34236232

Genome Sequences of Two Novel Coronavirus (SARS-CoV-2) Isolates from Dhaka, Bangladesh.

M Abdul Matin1, Abdullah Al Nahid2,3, Bishajit Sarkar1,4, M Asad Ullah1,4, Yusha Araf4,5, Nihad Adnan6, Rashedul Islam3, Mohammad Shahedur Rahman1,4.   

Abstract

This study reports the genome sequences of two severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) strains detected in the nasopharyngeal swab specimens of two coronavirus disease 2019 (COVID-19) patients from Dhaka, Bangladesh.

Entities:  

Year:  2021        PMID: 34236232      PMCID: PMC8265230          DOI: 10.1128/MRA.00511-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The ongoing coronavirus disease 2019 (COVID-19) pandemic is caused by the viral pathogen severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), a member of the Betacoronavirus in the Coronaviridae family (1). The virus was first detected in late December 2019 in Wuhan, Hubei Province, China (2). On 8 March 2020, the first three cases of COVID-19 in Bangladesh were identified. As of 18 May 2021, a total of 780,857 confirmed cases of the disease with 12,181 deaths had been reported in the country (3). We announce here two genome sequences of SARS-CoV-2 from the collected samples of two female patients, 43 and 28 years old. The samples were collected on 2 and 4 December 2020 during the time period of a relatively reduced number of confirmed COVID-19 cases in Bangladesh (https://www.worldometers.info/coronavirus), prior to the second wave of the pandemic in the country from late March to early April 2021 (4). Both patients provided informed consent and were found to be COVID-19 positive by reverse transcription-PCR (RT-PCR) (5). The Ethical Review Board of Jahangirnagar University [reference no. BBEC, JU/M 2020/COVID-19/(8)3] approved all the protocols used to carry out this study. The viral nucleic acids were extracted from the collected nasopharyngeal swab specimens using the ReliaPrep viral TNA miniprep system (catalog no. AX4820; Promega) according to the manufacturer’s protocol. Library preparation was performed using an Illumina RNA prep kit with RNA enrichment Respiratory Virus Oligo Panel following the manufacturer’s instructions. Thereafter, an Illumina MiniSeq system was used according to the manufacturer’s protocol to sequence the prepared libraries in the paired-end format with a read length of 74 bp. A total of 7,170,754 and 150,934 reads, respectively, from the SARS-CoV-2 samples were generated and quality assessed with FastQC version 0.11.8 (6). The raw reads were adapter trimmed with Trimmomatic version 0.39 (7), and the surviving 6,716,784 and 142,054 reads were subsequently mapped to the SARS-CoV-2 reference genome (GenBank accession no. MN908947.3) using the Burrows-Wheeler Aligner (BWA) version 0.7.17 (8). Ultimately, the consensus genomes were generated from the mapped reads using SAMtools version 1.11 (9). The genomes of the two Bangladeshi SARS-CoV-2 samples, i.e., hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 and hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020, were found to have base pair lengths of 29,888 and 29,848, respectively, with an average coverage of over 4,313× and 215×, respectively. No indels were found, and the GC content was measured to be 38% for both genomes. The consensus genomes along with the related sample metadata were uploaded to the Global Initiative on Sharing All Influenza Data (GISAID) database (10) on 27 January 2021. Nextclade beta version 0.14.2 (clades.nextstrain.org) by Nextstrain (11) was utilized in the phylogenetic analysis, which assigned both sequences to SARS-CoV-2 clade 20B (Fig. 1). The closest ancestral strains of the genomes include sequences from Bangladesh (GISAID accession no. EPI_ISL_959367, EPI_ISL_943554, and EPI_ISL_959386) and New Zealand (GISAID accession no. EPI_ISL_682280) according to Nextclade.
FIG 1

The genomes isolated from the two SARS-CoV-2 samples from the Dhaka District of Bangladesh are presented on a phylogenetic tree. The number of mutations in comparison to the reference genome of the Wuhan-Hu-1 isolate (GenBank accession no. MN908947.3) is represented by the x axis. Nextclade beta version 0.14.2 (clades.nextstrain.org) was used to construct the tree on 4 May 2021, with the red and dark blue circles representing the positions of the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 (GISAID accession no. EPI_ISL_884088) and hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020 (GISAID accession no. EPI_ISL_884087) sequences, respectively.

The genomes isolated from the two SARS-CoV-2 samples from the Dhaka District of Bangladesh are presented on a phylogenetic tree. The number of mutations in comparison to the reference genome of the Wuhan-Hu-1 isolate (GenBank accession no. MN908947.3) is represented by the x axis. Nextclade beta version 0.14.2 (clades.nextstrain.org) was used to construct the tree on 4 May 2021, with the red and dark blue circles representing the positions of the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 (GISAID accession no. EPI_ISL_884088) and hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020 (GISAID accession no. EPI_ISL_884087) sequences, respectively. Genome variation analysis of the consensus genomes was performed using the Genome Detective Virus Tool version 1.132 (12), which indicated several changes in the studied genomes relative to the SARS-CoV-2 reference genome sequence of the Wuhan-Hu-1 isolate (GenBank accession no. NC_045512.2). The sequence of the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 strain exhibited 6 synonymous and 9 nonsynonymous mutations, whereas 7 synonymous and 10 nonsynonymous mutations were observed in the sequence of the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020 strain (Table 1). Except for the Q94L mutation in the ORF7a gene, all nonsynonymous mutations were in both strains. According to the Nextstrain community build of Bangladesh (https://nextstrain.org/community/CHRF-Genomics/ncovBangladesh@main), the 9 nonsynonymous mutations common to the two studied genomes are also prevalent in other Bangladeshi SARS-CoV-2 genome sequences. However, the ORF7a Q94L mutation could not be detected in any other Bangladeshi genomes as of 4 May 2021. The most common mutations observed in SARS-CoV-2 genomes from Bangladesh include 5 nonsynonymous mutations that were also found in both studied genomes—I300F and P4715L in the ORF1ab gene, D614G in the S gene, and R203K and G204R in the N gene (13).
TABLE 1

Mutations detected in both the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 (strain A) and hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020 (strain B) sequences compared to the reference genome of the Wuhan-Hu-1 isolate (GenBank accession no. MN908947.3)

Strain(s)Nucleotide positionReference nucleotideMutated nucleotideGeneAmino acid change
A, B241CT5′-UTRNoncoding
A, B335CTORF1abR24C
A, B434GAORF1abE57K
A, B1163ATORF1abI300F
B1333TGORF1abNone (synonymous mutation)
A, B3037CTORF1abNone (synonymous mutation)
A, B3652AGORF1abNone (synonymous mutation)
A6229AGORF1abNone (synonymous mutation)
A, B9436AGORF1abNone (synonymous mutation)
A, B14408CTORF1abP4715L
B18744CTORF1abNone (synonymous mutation)
A21907GASNone (synonymous mutation)
B22801GTSNone (synonymous mutation)
A, B23403AGSD614G
A, B23604CGSP681R
B27674ATORF7aQ94L
A, B28881GANR203K
A, B28882GANR203K
A, B28883GCNG204R
Mutations detected in both the hCoV-19/Bangladesh/JU-WMSRC-Dhaka-5/2020 (strain A) and hCoV-19/Bangladesh/JU-WMSRC-Dhaka-8/2020 (strain B) sequences compared to the reference genome of the Wuhan-Hu-1 isolate (GenBank accession no. MN908947.3) As more genomes of COVID-19 patients with various clinical features from various regions of the country are being sequenced on a regular basis, the evolution of the virus must be further investigated and closely monitored in order to track COVID-19 progression in Bangladesh.

Data availability.

The SARS-CoV-2 genome sequences from Bangladesh were deposited in the GISAID database (accession no. EPI_ISL_884087 and EPI_ISL_884088), and the raw reads have been submitted to the NCBI Sequence Read Archive (SRA). The BioProject accession no. is PRJNA701872 for both samples. The BioSample and SRA accession no. for the samples are SAMN17915311 and SAMN17915310 (BioSample) and SRR13722033 and SRR13721921 (SRA).
  10 in total

1.  GISAID: Global initiative on sharing all influenza data - from vision to reality.

Authors:  Yuelong Shu; John McCauley
Journal:  Euro Surveill       Date:  2017-03-30

2.  Nextstrain: real-time tracking of pathogen evolution.

Authors:  James Hadfield; Colin Megill; Sidney M Bell; John Huddleston; Barney Potter; Charlton Callender; Pavel Sagulenko; Trevor Bedford; Richard A Neher
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.931

3.  Genome Detective: an automated system for virus identification from high-throughput sequencing data.

Authors:  Michael Vilsker; Yumna Moosa; Sam Nooij; Vagner Fonseca; Yoika Ghysens; Korneel Dumon; Raf Pauwels; Luiz Carlos Alcantara; Ewout Vanden Eynden; Anne-Mieke Vandamme; Koen Deforche; Tulio de Oliveira
Journal:  Bioinformatics       Date:  2019-03-01       Impact factor: 6.937

4.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

5.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2.

Authors: 
Journal:  Nat Microbiol       Date:  2020-03-02       Impact factor: 17.745

6.  Global and local mutations in Bangladeshi SARS-CoV-2 genomes.

Authors:  Md Mahbub Hasan; Rasel Das; Md Rasheduzzaman; Md Hamed Hussain; Nazmul Hasan Muzahid; Asma Salauddin; Meheadi Hasan Rumi; S M Mahbubur Rashid; Amam Zonaed Siddiki; Adnan Mannan
Journal:  Virus Res       Date:  2021-03-15       Impact factor: 3.303

7.  Twelve years of SAMtools and BCFtools.

Authors:  Petr Danecek; James K Bonfield; Jennifer Liddle; John Marshall; Valeriu Ohan; Martin O Pollard; Andrew Whitwham; Thomas Keane; Shane A McCarthy; Robert M Davies; Heng Li
Journal:  Gigascience       Date:  2021-02-16       Impact factor: 6.524

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

  10 in total
  1 in total

Review 1.  A Systematic Review on COVID-19 Vaccine Strategies, Their Effectiveness, and Issues.

Authors:  Shahad Saif Khandker; Brian Godman; Md Irfan Jawad; Bushra Ayat Meghla; Taslima Akter Tisha; Mohib Ullah Khondoker; Md Ahsanul Haq; Jaykaran Charan; Ali Azam Talukder; Nafisa Azmuda; Shahana Sharmin; Mohd Raeed Jamiruddin; Mainul Haque; Nihad Adnan
Journal:  Vaccines (Basel)       Date:  2021-11-24
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.