Literature DB >> 32527780

Complete Genome Sequence of a Novel Coronavirus (SARS-CoV-2) Isolate from Bangladesh.

Senjuti Saha1, Roly Malaker2, Mohammad Saiful Islam Sajib2, Md Hasanuzzaman2, Hafizur Rahman2, Zabed B Ahmed2, Mohammad Shahidul Islam2, Maksuda Islam2, Yogesh Hooda2,3, Vida Ahyong4, Manu Vanaerschot4, Joshua Batson4, Samantha Hao4, Jack Kamm4, Amy Kistler4, Cristina M Tato4, Joseph L DeRisi4,5, Samir K Saha2,6.   

Abstract

The complete genome sequence of a novel coronavirus (severe acute respiratory syndrome coronavirus 2 [SARS-CoV-2]) isolate obtained from a nasopharyngeal swab from a patient with COVID-19 in Bangladesh is reported.
Copyright © 2020 Saha et al.

Entities:  

Year:  2020        PMID: 32527780      PMCID: PMC7291105          DOI: 10.1128/MRA.00568-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Coronavirus disease 2019 (COVID-19) is an infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which belongs to the family Coronaviridae and the genus Betacoronavirus. In Bangladesh, the first three cases were detected on 8 March 2020. By 14 May 2020, more than 18,000 cases and 280 deaths had been reported in the country. The Child Health Research Foundation (CHRF) has been a testing center for COVID-19 since 29 March 2020 and uses quantitative PCR (qPCR)-based SARS-CoV-2 detection in nasopharyngeal or oropharyngeal swab samples (1). Here, we report the complete sequence of a SARS-CoV-2 isolate from a patient who tested positive in a qPCR test. All protocols were approved by the National Research Ethics Committee, Bangladesh Medical Research Council, and the ethical review board of the Bangladesh Institute of Child Health. Samples from suspected COVID-19 patients were collected for clinical care and diagnostic testing at the discretion of the attending health care providers and were received and tested at the CHRF with the permission of the Director General Health Services, Government of Bangladesh. For this study, a nasopharyngeal specimen from a symptomatic patient with COVID-19 was collected on 18 April 2020. The specimen was tested for SARS-CoV-2 using qPCR at the CHRF and had a low cycle threshold value (C < 15). Extraction of the viral nucleic acid from the nasopharyngeal specimen was performed using the Quick-RNA/DNA microprep extraction kit (product no. D7005; Zymo) according to the manufacturer’s protocol. cDNA was converted to Illumina libraries using the NEBNext Ultra II RNA library preparation kit (product no. E7770; NEB) according to the manufacturer’s protocol. Targeted enrichment of the SARS-CoV-2 sequence using 73 tiling primers at a 20:1 molar ratio of random primers to tiled primers was adapted from viral genome recovery methods described previously (2, 3). External RNA Controls Consortium Collection spike-in control mix (product no. 4456740; Thermo Fisher Scientific) was used as a marker of potential library preparation errors and for input RNA mass calculation. The library was sequenced on an Illumina iSeq100 sequencer using 150-nucleotide paired-end sequencing present at CHRF. The library generated 15,376,000 reads, of which 11,747,398 passed the default Illumina quality filter in BaseSpace. Raw fastq files were uploaded to the IDSeq portal for host subtraction (4). SARS-CoV-2 reads were recovered by mapping the raw reads against the reference sequence (GenBank accession no. MN908947.3) using minimap2 (v2.17) (5, 6) and filtering the reads that mapped against a database of human and other viral genomes using Kraken2 (v2.0.8_beta) (7). The surviving reads were adapter trimmed with Trim Galore and remapped to the same reference using minimap2. Enrichment primers were trimmed with iVar (v1.2), and the consensus genome was called using iVar (8, 9). The full pipeline is available online (https://github.com/czbiohub/sc2-msspe-bioinfo). The complete genome of the Bangladeshi SARS-CoV-2 strain (CHRF_nCoV19_0001) has 29,903 bp, with an average coverage of over 3,000×; no indels were detected, and the GC content was 38%. The sample was uploaded to the Global Initiative on Sharing All Influenza Data (GISAID) database (10) on 12 May 2020. The genome was subsequently contextualized among 5,265 other genomes available in the GISAID database in the Nextstrain Asia build of 15 May 2020 (www.nextstrain.org/ncov/asia) (11). Phylogenetic analysis of this virus genome showed that it was grouped in SARS-CoV-2 clade A2a. The genome contained two mutations not observed in the closest previously observed genotype, which occurred in samples from Ireland, Italy, Greece, Hungary, the Czech Republic, Denmark, the United States, Turkey, Jordan, Russia, Georgia, Vietnam, Argentina, Switzerland, Spain, New Zealand, India, Taiwan, Singapore, and Japan (Fig. 1). Because of the wide geographic spread of that closest ancestral genotype, it is difficult to infer the specific route by which the strain came to Bangladesh. As more SARS-CoV-2 genomes from Bangladesh and around the world are sequenced, we will be able to better monitor new introductions into and transmission dynamics within Bangladesh.
FIG 1

Phylogenetic tree of SARS-CoV-2 in the neighborhood of the first genome from Bangladesh. The x axis represents the number of mutations from the Wuhan strain (GenBank accession no. MN908947.3). The large red circle represents the position of CHRF_nCoV19_0001. Its closest ancestral genotype includes sequences from North America, Europe, the Middle East, East Asia, and Southeast Asia. The figure was rendered using Nextstrain (nextstrain.org).

Phylogenetic tree of SARS-CoV-2 in the neighborhood of the first genome from Bangladesh. The x axis represents the number of mutations from the Wuhan strain (GenBank accession no. MN908947.3). The large red circle represents the position of CHRF_nCoV19_0001. Its closest ancestral genotype includes sequences from North America, Europe, the Middle East, East Asia, and Southeast Asia. The figure was rendered using Nextstrain (nextstrain.org). In total, 9 mutations were observed in CHRF_nCoV19_0001, compared to the reference genome for the Wuhan strain (GenBank accession no. MN908947.3) from December 2019 (Table 1). These mutations included the spike protein D614G mutation that is enriched in recent SARS-CoV-2 isolates, especially from Europe and North America (12). Other mutations of interest included the position 28881 to 28883 GGG→AAC mutation, which changes the sequence of nucleoprotein N at positions 203 and 204 from RG to KR. In the Nextstrain global build of 15 May 2020, this mutation is predicted to have emerged at the end of January 2020 and is present in many isolates from clade A2a (11). Finally, one mutation observed is unique to CHRF_nCoV19_0001, i.e., A1163T. This mutation leads to a I300F change in open reading frame 1a (ORF1a) (on the nsp2 protein), and its prevalence in Bangladesh should be monitored closely.
TABLE 1

Mutations present in CHRF_nCoV19_0001 in relation to the ancestral Wuhan strain (GenBank accession no. MN908947.3)

Nucleotide positionReference nucleotideMutated nucleotideAmino acid changeComments
241CTNoncoding
1163ATORF1a, I300Fnsp2, function unclear
3037CTNo change
14408CTORF1b, P314Lnsp13, RNA-dependent RNA polymerase
17019GTORF1b, E1184Dnsp14, helicase
23403AGS, D614GS (spike protein), mutation may affect transmission/virulence
28881GAN, R203KN, involved in RNA packaging
28882GANo change
28883GCN, G204RN, involved in RNA packaging
Mutations present in CHRF_nCoV19_0001 in relation to the ancestral Wuhan strain (GenBank accession no. MN908947.3) We are currently sequencing more genomes from different regions of Bangladesh and from patients with different clinical features to further investigate the spread of COVID-19 and to monitor the evolution of SARS-CoV-2 in Bangladesh.

Data availability.

The SARS-CoV-2 genome from Bangladesh was deposited in the GISAID database (accession no. EPI_ISL_437912) and GenBank (accession no. MT476385). The raw reads have been deposited in the NCBI Sequence Read Archive (SRA accession no. SRR11801823). The BioProject and BioSample accession no. are PRJNA633241 and SAMN14938301, respectively.
  7 in total

1.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

2.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

3.  GISAID: Global initiative on sharing all influenza data - from vision to reality.

Authors:  Yuelong Shu; John McCauley
Journal:  Euro Surveill       Date:  2017-03-30

4.  Nextstrain: real-time tracking of pathogen evolution.

Authors:  James Hadfield; Colin Megill; Sidney M Bell; John Huddleston; Barney Potter; Charlton Callender; Pavel Sagulenko; Trevor Bedford; Richard A Neher
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.931

5.  An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar.

Authors:  Nathan D Grubaugh; Karthik Gangavarapu; Joshua Quick; Nathaniel L Matteson; Jaqueline Goes De Jesus; Bradley J Main; Amanda L Tan; Lauren M Paul; Doug E Brackney; Saran Grewal; Nikos Gurfield; Koen K A Van Rompay; Sharon Isern; Scott F Michael; Lark L Coffey; Nicholas J Loman; Kristian G Andersen
Journal:  Genome Biol       Date:  2019-01-08       Impact factor: 13.583

6.  Metagenomic sequencing with spiked primer enrichment for viral diagnostics and genomic surveillance.

Authors:  Xianding Deng; Asmeeta Achari; Scot Federman; Guixia Yu; Sneha Somasekar; Inês Bártolo; Shigeo Yagi; Placide Mbala-Kingebeni; Jimmy Kapetshi; Steve Ahuka-Mundeke; Jean-Jacques Muyembe-Tamfum; Asim A Ahmed; Vijay Ganesh; Manasi Tamhankar; Jean L Patterson; Nicaise Ndembi; Dora Mbanya; Lazare Kaptue; Carole McArthur; José E Muñoz-Medina; Cesar R Gonzalez-Bonilla; Susana López; Carlos F Arias; Shaun Arevalo; Steve Miller; Mars Stone; Michael Busch; Kristina Hsieh; Sharon Messenger; Debra A Wadford; Mary Rodgers; Gavin Cloherty; Nuno R Faria; Julien Thézé; Oliver G Pybus; Zoraima Neto; Joana Morais; Nuno Taveira; John R Hackett; Charles Y Chiu
Journal:  Nat Microbiol       Date:  2020-01-13       Impact factor: 17.745

7.  Improved metagenomic analysis with Kraken 2.

Authors:  Derrick E Wood; Jennifer Lu; Ben Langmead
Journal:  Genome Biol       Date:  2019-11-28       Impact factor: 17.906

  7 in total
  10 in total

1.  Metagenomic Pathogen Sequencing in Resource-Scarce Settings: Lessons Learned and the Road Ahead.

Authors:  Christina Yek; Andrea R Pacheco; Manu Vanaerschot; Jennifer A Bohl; Elizabeth Fahsbender; Andrés Aranda-Díaz; Sreyngim Lay; Sophana Chea; Meng Heng Oum; Chanthap Lon; Cristina M Tato; Jessica E Manning
Journal:  Front Epidemiol       Date:  2022-08-15

2.  COVID-19 rise in Bangladesh correlates with increasing detection of B.1.351 variant.

Authors:  Senjuti Saha; Arif M Tanmoy; Yogesh Hooda; Afroza Akter Tanni; Sharmistha Goswami; Syed Muktadir Al Sium; Mohammad Saiful Islam Sajib; Roly Malaker; Shuborno Islam; Hafizur Rahman; Ataul Mustufa Anik; Nikkon Sarker; Mohammad Shahidul Islam; Kinkar Ghosh; Probir Kumar Sarkar; Mohammed Rizwanul Ahsan Bipul; Syed Shafi Ahmed; Mohammod Shahidullah; Samir K Saha
Journal:  BMJ Glob Health       Date:  2021-05

3.  Genome-wide in silico identification and characterization of Simple Sequence Repeats in diverse completed SARS-CoV-2 genomes.

Authors:  Rasel Siddiqe; Ajit Ghosh
Journal:  Gene Rep       Date:  2021-01-26

Review 4.  Landscape of humoral immune responses against SARS-CoV-2 in patients with COVID-19 disease and the value of antibody testing.

Authors:  Sundarasamy Mahalingam; John Peter; Ziyang Xu; Devivasha Bordoloi; Michelle Ho; Vaniambadi S Kalyanaraman; Alagarsamy Srinivasan; Kar Muthumani
Journal:  Heliyon       Date:  2021-04-17

5.  Reactivity of human antisera to codon optimized SARS-CoV2 viral proteins expressed in Escherichia coli.

Authors:  Yee-Huan Toh; Yu-Weng Huang; Yo-Chen Chang; Yi-Ting Chen; Ya-Ting Hsu; Guang-Huey Lin
Journal:  Tzu Chi Med J       Date:  2021-03-09

6.  Invasive Bacterial Vaccine-Preventable Disease Surveillance: Successes and Lessons Learned in Bangladesh for a Sustainable Path Forward.

Authors:  Senjuti Saha; Samir K Saha
Journal:  J Infect Dis       Date:  2021-09-01       Impact factor: 5.226

7.  Development of an in silico multi-epitope vaccine against SARS-COV-2 by précised immune-informatics approaches.

Authors:  Saad Al Zamane; Fahim Alam Nobel; Ruksana Akter Jebin; Mohammed Badrul Amin; Pratul Dipta Somadder; Nusrat Jahan Antora; Md Imam Hossain; Mohammod Johirul Islam; Kawsar Ahmed; Mohammad Ali Moni
Journal:  Inform Med Unlocked       Date:  2021-11-03

8.  Travel ban effects on SARS-CoV-2 transmission lineages in the UAE as inferred by genomic epidemiology.

Authors:  Andreas Henschel; Samuel F Feng; Rifat A Hamoudi; Gihan Daw Elbait; Ernesto Damiani; Fathimathuz Waasia; Guan K Tay; Bassam H Mahboub; Maimunah Hemayet Uddin; Juan Acuna; Eman Alefishat; Rabih Halwani; Herbert F Jelinek; Farah Mustafa; Nawal Alkaabi; Habiba S Alsafar
Journal:  PLoS One       Date:  2022-03-02       Impact factor: 3.240

9.  Genomic surveillance unfolds the SARS-CoV-2 transmission and divergence dynamics in Bangladesh.

Authors:  Tushar Ahmed Shishir; Taslimun Jannat; Iftekhar Bin Naser
Journal:  Front Genet       Date:  2022-09-26       Impact factor: 4.772

10.  The Direct and Indirect Impact of SARS-CoV-2 Infections on Neonates: A Series of 26 Cases in Bangladesh.

Authors:  Senjuti Saha; Asm Nawshad Uddin Ahmed; Probir Kumar Sarkar; Mohammed Rizwanul Ahsan Bipul; Kinkar Ghosh; Sheikh Wasik Rahman; Hafizur Rahman; Yogesh Hooda; Nafiz Ahsan; Roly Malaker; Mohammad Saiful Islam Sajib; Mohammad Shahidul Islam; Ataul Mustufa Anik; Sudipta Saha; Naito Kanon; Maksuda Islam; Davidson H Hamer; Ruhul Amin; Mohammod Shahidullah; Samir K Saha
Journal:  Pediatr Infect Dis J       Date:  2020-12       Impact factor: 3.806

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.