| Literature DB >> 27486485 |
Yunyoung Kwak1, Qing X Li2, Jae-Ho Shin1.
Abstract
Mycobacterium rufum JS14(T) (=ATCC BAA-1377(T), CIP 109273(T), JCM 16372(T), DSM 45406(T)), a type strain of the species Mycobacterium rufum sp. . belonging to the family Mycobacteriaceae, was isolated from polycyclic aromatic hydrocarbon (PAH)-contaminated soil in Hilo (HI, USA) because it harbors the capability of degrading PAH. Here, we describe the first genome sequence of strain JS14(T), with brief phenotypic characteristics. The genome is composed of 6,176,413 bp with 69.25 % G + C content and contains 5810 protein-coding genes with 54 RNA genes. The genome information on M. rufum JS14(T) will provide a better understanding of the complexity of bacterial catabolic pathways for degradation of specific chemicals.Entities:
Keywords: Biodegradation; Mycobacterium; Polycyclic aromatic hydrocarbon
Year: 2016 PMID: 27486485 PMCID: PMC4969647 DOI: 10.1186/s40793-016-0167-5
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1A neighbor-joining phylogenetic tree depicting the position of M. rufum JS14T [10] (shown in boldface with an asterisk) relative to the other species within the genus Mycobacterium. In this genus, species carrying the full length of 16S rRNA gene sequence were selected from the NCBI database [45]. The collected nucleotide sequences were aligned using ClustalW [46], and the phylogenetic tree was constructed using software MEGA version 6 [47] by the neighbor-joining method with 1000 bootstrap replicates [18]. The generated bootstrap values for each species are presented at the nodes, and the scale bar indicates 0.005 nucleotide changes per nucleotide position. The strains under study and their corresponding GenBank accession numbers for 16S rRNA genes are as follows: M. chlorophenolicum PCP-I [14, 15] (NR_119093); M. gilvum Spyr1 [37, 48] (NR_074644); M. gilvum PYR-GCK [37, 48] (NR_074553); M. rhodesiae NBB3 [49] (NR_102870); M. vanbaalenii PYR-1 [16] (NR_074572); M. fluoranthenivorans FA4 [17, 50] (NR_042224); M. wolinskyi 700010 [51] (NR_119253); M. mageritense 938 [52] (NR_042265); M. smegmatis str. MC2 155 [37, 53] (NR_074726); M. flavescens ATCC 14474 [37, 54] (NR_044815); M. novocastrense 73 [55] (NR_029208); M. insubricum FI-06250 [56] (NR_125525); M. florentinum FI-93171 [57] (NR_042223); M. montefiorense ATCC BAA-256 [58, 59] (NR_028808); M. confluentis 1389/90 [60] (NR_042245); M. holsaticum 1406 [61] (NR_028945); M. elephantis DSM 44368 [62] (NR_025296); M. marinum M [37, 63] (NR_074864); M. ulcerans Agy99 [37, 64] (NR_074861); M. bovis BCG str. Pasteur 1173P2 [37, 65] (NR_074838); M. canettii CIPT 140010059 [66] (NR_074836); M. africanum GM041182 [37, 67] (NR_074835)
Fig. 2A scanning electron micrograph of M. rufum JS14T. The image was taken using a Field Emission Scanning Electron Microscope (SU8220; Hitachi, Japan) at an operating voltage of 10.0 kV. The scale bar represents 5.0 μm
Classification and general features of M. rufum JS14T [22]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain: JS14T (=ATCC BAA-1377T, CIP 109273T, JCM 16372T, DSM 45406T) | TAS [ | ||
| Gram stain | Positive: weak uptake of Gram stain | TAS [ | |
| Cell shape | Medium to long thin rods | TAS [ | |
| Colony pigmentation | Orange | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Not reported | NAS | |
| Temperature range | Mesophile | NAS | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | 7.0–8.0; 7.5 | NAS | |
| Carbon source | Fluoranthene, glucose, fructose, mannitol, trehalose, xylose, others | TAS [ | |
| Energy source | Fluoranthene | TAS [ | |
| MIGS-6 | Habitat | Soil | TAS [ |
| MIGS-6.3 | Salinity | Not tolerant salinity (2.5–5.0 % NaCl, w/v) | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationships | Free living | NAS |
| MIGS-14 | Pathogenicity | None | NAS |
| MIGS-4 | Geographic location | Hawaii, United States | TAS [ |
| MIGS-5 | Sample collection | February, 2003 | NAS |
| MIGS-4.1 | Latitude | 19° 49′ 20″ N | TAS [ |
| MIGS-4.2 | Longitude | 155° 05′ 01″ W | TAS [ |
| MIGS-4.3 | Depth | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
a Evidence codes. IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | 20 kb SMRT-bell library |
| MIGS-29 | Sequencing platforms | PacBio RS II |
| MIGS-31.2 | Fold coverage | 113.03× |
| MIGS-30 | Assemblers | RS HGAP Assembly Protocol [ |
| MIGS-32 | Gene-calling method | NCBI Prokaryotic Genome Annotation Pipeline [ |
| Locus Tag | EU78 | |
| INSDC ID | JROA00000000 | |
| GenBank Date of Release | October 2, 2014 | |
| GOLD ID | Gi0074119 | |
| BIOPROJECT | PRJNA247390 | |
| MIGS-13 | Source Material Identifier | ATCC BAA-1377T, CIP 109273T, JCM 16372T, DSM 45406T |
| Project relevance | Environmental |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 6,176,413 | 100.00 |
| DNA coding (bp) | 5,622,516 | 91.03 |
| DNA G + C (bp) | 4,277,025 | 69.25 |
| DNA scaffolds | 4 | 100.00 |
| Total genes | 5864 | 100.00 |
| Protein-coding genes | 5810 | 99.08 |
| RNA genes | 54 | 0.92 |
| Pseudogenes | 367 | 6.26 |
| Genes in internal clusters | 944 | 16.10 |
| Genes with function prediction | 4498 | 76.71 |
| Genes assigned to COGs | 3669 | 62.57 |
| Genes with Pfam domains | 4544 | 77.49 |
| Genes with signal peptides | 314 | 5.35 |
| Genes with transmembrane helices | 1227 | 20.92 |
| CRISPR repeats | 0 | 0.00 |
Fig. 3A graphical circular map of the M. rufum JS14T genome. The circular map was generated using the BLAST Ring Image Generator software [68]. From the inner circle to the outer circle: Genetic regions; GC content (black), and GC skew (purple/green), respectively
Numbers of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 181 | 4.25 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.02 | RNA processing and modification |
| K | 353 | 8.29 | Transcription |
| L | 118 | 2.77 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 32 | 0.75 | Cell cycle control, cell division, chromosome partitioning |
| V | 98 | 2.30 | Defense mechanisms |
| T | 173 | 4.06 | Signal transduction mechanisms |
| M | 210 | 4.93 | Cell wall/membrane/envelope biogenesis |
| N | 12 | 0.28 | Cell motility |
| U | 22 | 0.52 | Intracellular trafficking, secretion, and vesicular transport |
| O | 142 | 3.34 | Post-translational modification, protein turnover, chaperones |
| C | 312 | 7.33 | Energy production and conversion |
| G | 245 | 5.76 | Carbohydrate transport and metabolism |
| E | 333 | 7.82 | Amino acid transport and metabolism |
| F | 89 | 2.09 | Nucleotide transport and metabolism |
| H | 266 | 6.25 | Coenzyme transport and metabolism |
| I | 422 | 9.91 | Lipid transport and metabolism |
| P | 224 | 5.26 | Inorganic ion transport and metabolism |
| Q | 264 | 6.20 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 516 | 12.12 | General function prediction only |
| S | 209 | 4.91 | Function unknown |
| W | 2 | 0.05 | Extracellular structures |
| X | 33 | 0.78 | Mobilome: prophages, transposons |
| - | 2195 | 37.43 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome
Comparison of the functional gene counts in the function profile of genome sequences
| Function ID | Name |
|
|
|
|
|
|---|---|---|---|---|---|---|
| KO:K00448 | protocatechuate 3,4-dioxygenase, alpha subunit [EC:1.13.11.3] ( | 1 | 1 | 1 | 1 | 3 |
| KO:K00449 | protocatechuate 3,4-dioxygenase, beta subunit [EC:1.13.11.3] ( | 1 | 1 | 1 | 1 | 2 |
| KO:K18253 | phthalate 3,4-dioxygenase ferredoxin subunit ( | 0 | 2 | 1 | 2 | 1 |
| KO:K18254 | phthalate 3,4-dioxygenase ferredoxin reductase subunit [EC:1.18.1.3] ( | 1 | 2 | 1 | 0 | 1 |
| KO:K00517 | E1.14.-.- (cytochrome P450) | 12 | 10 | 10 | 5 | 6 |
| KO:K18256 | 3,4-dihydroxyphthalate decarboxylase [EC:4.1.1.69] ( | 1 | 2 | 1 | 0 | 1 |
| KO:K11943 | PAH dioxygenase large subunit [EC:1.13.11.-] ( | 1 | 2 | 1 | 1 | 0 |
| KO:K11944 | PAH dioxygenase small subunit [EC:1.13.11.-] ( | 2 | 4 | 2 | 4 | 0 |
| KO:K11948 | 1-hydroxy-2-naphthoate dioxygenase [EC:1.13.11.38] ( | 1 | 2 | 1 | 0 | 0 |
| KO:K11949 | 4-(2-carboxyphenyl)-2-oxobut-3-enoate aldolase [EC:4.1.2.34] ( | 1 | 2 | 1 | 1 | 0 |
| KO:K18251 | phthalate 3,4-dioxygenase alpha subunit [EC:1.14.12.-] ( | 1 | 2 | 1 | 0 | 0 |
| KO:K18252 | phthalate 3,4-dioxygenase beta subunit [EC:1.14.12.-] ( | 1 | 2 | 1 | 1 | 0 |
| KO:K18255 | phthalate 3,4-cis-dihydrodiol dehydrogenase [EC:1.3.1.-] ( | 1 | 2 | 1 | 1 | 0 |
| KO:K18257 | cis-3,4-dihydrophenanthrene-3,4-diol dehydrogenase [EC:1.3.1.49] ( | 1 | 2 | 1 | 1 | 0 |
| KO:K18275 | 2-formylbenzoate dehydrogenase [EC:1.2.1.78] ( | 1 | 1 | 1 | 1 | 0 |
Comparison of the selected five genome sequences was conducted using function profile categories in the IMG-ER pipeline [27], and the genome sequences analyzed are as follows: M. van, M. vanbaalenii PYR-1 (IMG Genome ID 639633044) [6]; M. gil GCK, M. gilvum PYR-GCK (IMG Genome ID 640427122) [8]; M. gil Sp1, M. gilvum Spyr1 IMG Genome ID 649633070) [7]; M. aro, M. aromaticivorans JS19b1 (whole Genome Sequencing) (IMG Genome ID 2558309009) [9]; M. ruf, M. rufum JS14 (whole Genome Sequencing) (IMG Genome ID 2593339261)
Reported sequencing status for the individual genome set: a Complete genome sequence; b Draft whole-genome sequence