Literature DB >> 26981383

Genome Sequencing and Annotation of Mycobacterium tuberculosis PR08 strain.

Mohammad Maaruf Jaafar1, Mohd Zakihalani A Halim1, Mohamad Izwan Ismail1, Lee Lian Shien1, Teh Lay Kek2, Ngeow Yun Fong3, Norazmi Mohd Nor4, Zainul Fadziruddin Zainuddin5, Tang Thean Hock6, Mohd Nazalan Mohd Najimudin7, Mohd Zaki Salleh1.   

Abstract

Mycobacterium tuberculosis is an acid fast bacterial species in the family Mycobacteriaceae and is the causative agent of most cases of tuberculosis. Here, we report the genomic features of Mycobacterium tuberculosis isolated from the cerebrospinal fluid (CSF) of a patient diagnosed with both pulmonary and extrapulmonary tuberculosis (TB). The isolated strain was identified as Mycobacterium tuberculosis PR08 (MTB PR08). Genomic DNA of the MTB PR08 strain was extracted and subjected to whole genome sequencing using MiSeq (Illumina, CA,USA). The draft genome size of MTB PR08 strain is 4,292,364 bp with a G + C content of 65.2%. This strain was annotated to have 4723 genes and 48 RNAs. This whole genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number CP010895.

Entities:  

Keywords:  CSF; Extrapulmonary; Genome; Mycobacterium tuberculosis

Year:  2015        PMID: 26981383      PMCID: PMC4778648          DOI: 10.1016/j.gdata.2015.12.030

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data [provide URL below]

http://www.ncbi.nlm.nih.gov/bioproject/PRJNA196391. (Biosample: SAMN03290698).

Experimental design, materials and methods

Mycobacterium tuberculosis PR08 (MTB PR08) was isolated from the cerebrospinal fluid (CSF) of a patient diagnosed with both pulmonary and extrapulmonary tuberculosis at a local hospital. The sample was cultured in BBL™ MGIT™ Mycobacterial Growth Indicator Tube supplemented with BBL™ MGIT™ OADC enrichment and BBL™ MGIT™ PANTA™ antibiotic mixture (Becton–Dickinson, Oxford, United Kingdom). Genomic DNA was extracted from MTB PR08 and was sequenced using MiSeq (Illumina, CA, USA), generating a total of 46,013,686 reads in a 300-cycle run. Raw reads were trimmed and assembled de novo using CLCbio (CLC Genomics Workbench version 7.0.3) (CLCbio, Aarhus, Denmark), producing an average coverage of 378 ×. Annotation was performed using the Bacterial Annotation System (BASys) [1] and Rapid Annotation using Subsystem Technology (RAST) [2] online services, and the pathogenicity and virulence genes were determined. The genes were validated using the following external gene annotation databases: TubercuList (http://tuberculist.epfl.ch), UniProtKB (http://www.ebi.ac.uk/uniprot), Virulence Factor Database (VFDB) (http://www.mgc.ac.cn), and TBDatabase (TBDB) (http://www.tbdb.org). The size of the draft genome of MTB PR08 is 4,292,364 bp with a G + C content of 65.2%. It is composed of 214 contigs with 4723 predicted genes of which 4203 were protein coding genes and 48 RNA-encoding genes. A total of 2295 (54.6%) of the protein coding genes were assigned into the Cluster of Orthologous Group (COG) [2]. Using RAST, a total of 393 subsystems were annotated in the MTB PR08 genome (Fig. 1).
Fig 1

Subsystem distribution of Mycobacterium tuberculosis strain PR08 (based on RAST annotation server).

Comparative analysis of MTB PR08 was performed against two other genomes; PR05 [3] and the reference genome H37Rv. Annotation and comparative genomics analysis of MTB PR08 and the selected reference genomes were carried out using RAST as shown in Table 1. In order to identify the functions of the genes that contributed to extrapulmonary TB, the genes were annotated using BASys. Based on the analysis, a putative gene (opcA gene) which may have been involved in extrapulmonary infection was identified. It has been reported to play a role in meningococcal adhesion, invasion of epithelial and endothelial cells and in assembly of Glucose-6-Phosphate-Dehydrogenase (G6PD) [4], [5].
Table 1

Comparative analysis between MTB PR05, MTB PR08 and the reference genome H37Rv.

GenomePR08PR05H37Rv (reference)
Genome size (bp)4,292,3644,419,5014,411,532
Number of subsystems393403390
Number of coding sequences420344374360
Number of genes472347394644
Number of RNAs484848
Comparison of genome sequences using RAST revealed that the closest strains of MTB PR08 are Mycobacterium tuberculosis NCGM2209 (score 521), Mycobacterium tuberculosis UM 1072388579 (score 473) and Mycobacterium tuberculosis NA-A0008 (score 454). This Whole Genome Shotgun project has been deposited at GenBank under the accession number CP010895.

Nucleotide sequence accession number

The whole genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number CP010895.

Conflict of interest

The authors declare that there is no conflict of interests with respect to the work published in this paper.
Specifications
Organism/cell line/tissueMycobacterium tuberculosis
StrainPR08
Sequencer or array typeIllumina MiSeq sequencer
Data formatProcessed
Experimental factorsMicrobial strain
Experimental featuresDraft genome sequence of Mycobacterium tuberculosis PR08assembly and annotation
ConsentN/A
Sample source locationKuala Lumpur, Malaysia4.1936°N103.7249°E
  5 in total

1.  Comparison of interferon-γ-, interleukin (IL)-17- and IL-22-expressing CD4 T cells, IL-22-expressing granulocytes and proinflammatory cytokines during latent and active tuberculosis infection.

Authors:  J Cowan; S Pandey; L G Filion; J B Angel; A Kumar; D W Cameron
Journal:  Clin Exp Immunol       Date:  2012-02       Impact factor: 4.330

2.  Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence.

Authors:  S T Cole; R Brosch; J Parkhill; T Garnier; C Churcher; D Harris; S V Gordon; K Eiglmeier; S Gas; C E Barry; F Tekaia; K Badcock; D Basham; D Brown; T Chillingworth; R Connor; R Davies; K Devlin; T Feltwell; S Gentles; N Hamlin; S Holroyd; T Hornsby; K Jagels; A Krogh; J McLean; S Moule; L Murphy; K Oliver; J Osborne; M A Quail; M A Rajandream; J Rogers; S Rutter; K Seeger; J Skelton; R Squares; S Squares; J E Sulston; K Taylor; S Whitehead; B G Barrell
Journal:  Nature       Date:  1998-06-11       Impact factor: 49.962

3.  BASys: a web server for automated bacterial genome annotation.

Authors:  Gary H Van Domselaar; Paul Stothard; Savita Shrivastava; Joseph A Cruz; AnChi Guo; Xiaoli Dong; Paul Lu; Duane Szafron; Russ Greiner; David S Wishart
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

4.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

5.  Draft Genome Sequence of a Clinical Isolate of Mycobacterium tuberculosis Strain PR05.

Authors:  Azwin Ismail; L K Teh; Y F Ngeow; M N Norazmi; Zainuddin F Zainul; T H Tang; N Najimudin; M Z Salleh
Journal:  Genome Announc       Date:  2013-06-20
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.