| Literature DB >> 23991261 |
Mohamed Sassi1, Catherine Robert, Didier Raoult, Michel Drancourt.
Abstract
Mycobacterium simiae is a non-tuberculosis mycobacterium causing pulmonary infections in both immunocompetent and imunocompromized patients. We announce the draft genome sequence of M. simiae DSM 44165(T). The 5,782,968-bp long genome with 65.15% GC content (one chromosome, no plasmid) contains 5,727 open reading frames (33% with unknown function and 11 ORFs sizing more than 5000 -bp), three rRNA operons, 52 tRNA, one 66-bp tmRNA matching with tmRNA tags from Mycobacterium avium, Mycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium microti, Mycobacterium marinum, and Mycobacterium africanum and 389 DNA repetitive sequences. Comparing ORFs and size distribution between M. simiae and five other Mycobacterium species M. simiae clustered with M. abscessus and M. smegmatis. A 40-kb prophage was predicted in addition to two prophage-like elements, 7-kb and 18-kb in size, but no mycobacteriophage was seen after the observation of 10(6) M. simiae cells. Fifteen putative CRISPRs were found. Three genes were predicted to encode resistance to aminoglycosides, betalactams and macrolide-lincosamide-streptogramin B. A total of 163 CAZYmes were annotated. M. simiae contains ESX-1 to ESX-5 genes encoding for a type-VII secretion system. Availability of the genome sequence may help depict the unique properties of this environmental, opportunistic pathogen.Entities:
Keywords: Mycobacterium simiae draft genome; SOLiD; non-tuberculous mycobacteria
Year: 2013 PMID: 23991261 PMCID: PMC3746425 DOI: 10.4056/sigs.3707349
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of DSM44165T [22].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Subclass | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Weakly positive | TAS [ | |
| Motility | Non motile | TAS [ | |
| Sporulation | nonsporulating | NAS | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| Salinity | normal | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| MIGS-6 | Habitat | Soil | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 2 | NAS | |
| Isolation | Macacus rhesus | TAS [ | |
| MIGS-4 | Geographic location | Country India | TAS [ |
| MIGS-5 | Sample collection time | 1965 | TAS [ |
| MIGS-4.1 | Latitude | 20.593684 | NAS [ |
| MIGS-4.2 | Longitude | 78.96288 | NAS [ |
| MIGS-4.3 | Depth | Not reported | TAS [ |
| MIGS-4.4 | Altitude | Not reported | TAS [ |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32].
Figure 1rpoB gene sequence based phylogenetic tree highlighting the position of DSM 44165 relative to other type strains within the genus. Phylogenetic inferences obtained using the neighbor-joining method within MEGA. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree. P14 was used as an outgroup.
Figure 2Electron microscopy graph of DSM 44165T
Figure 3Reference mass spectrum from strain DSM 44165. Spectra from 5 individual colonies were compared and a reference spectrum was generated.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One 454 paired end 3-kb library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 15.33 |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal |
| EMBL-EBI/NCBI project ID | PRJEB1560 | |
| EMBL-EBI/Genbank ID | CBMJ020000001-CBMJ020000359 | |
| EMBL-EBI Date of Release | June 27, 20113 | |
| MIGS-13 | Source material identifier | DSM 44165T |
| Project relevance | Pangenome of opportunistic mycobacteria |
Figure 4Graphical circular map of the chromosome. From outside to the center: Genes on the forward strand (colored by COG categories), genes on the reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red), GC content, and GC skew.
Nucleotide content and gene count levels of the genome
| | | |
|---|---|---|
| Genome size (bp) | 5,782,968 | 100 |
| DNA coding region (bp) | 5,072,379 | 87.71 |
| DNA G+C content (bp) | 3,767,609 | 65.15 |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 5,782 | 100 |
| RNA genes | 55 | 0.95 |
| Protein-coding genes | 5,727 | 99.04 |
| Genes with function prediction | 4,673 | 81.6 |
| Genes assigned to COGs | 4,105 | 71,67 |
| Genes with peptide signals | 377 | 6.58 |
| Genes with transmembrane helices | 1,144 | 19.97 |
| CRISPR repeats | 15 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the 25 general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 157 | 2.74 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 410 | 7,16 | Transcription |
| L | 171 | 2.99 | Replication, recombination and repair |
| B | 2 | 0.03 | Chromatin structure and dynamics |
| D | 34 | 0.59 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 41 | 0.72 | Defense mechanisms |
| T | 169 | 2.95 | Signal transduction mechanisms |
| M | 162 | 2.83 | Cell wall/membrane biogenesis |
| N | 48 | 0.84 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 23 | 0.40 | Intracellular trafficking and secretion |
| O | 132 | 2.30 | Posttranslational modification, protein turnover, chaperones |
| C | 400 | 6.98 | Energy production and conversion |
| G | 212 | 3.70 | Carbohydrate transport and metabolism |
| E | 151 | 2.64 | Amino acid transport and metabolism |
| F | 11 | 0.19 | Nucleotide transport and metabolism |
| H | 158 | 2,76 | Coenzyme transport and metabolism |
| I | 418 | 7,30 | Lipid transport and metabolism |
| P | 192 | 3,35 | Inorganic ion transport and metabolism |
| Q | 433 | 7,56 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 656 | 11,45 | General function prediction only |
| S | 291 | 5,08 | Function unknown |
| 1622 | 1,25 | Not in COGs | |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 5Heatmap of the ORFs size distribution of compared with 5 other genomes.