| Literature DB >> 26594308 |
Dong Hee Choi1, Yong Min Kwon1, Kae Kyoung Kwon1,2, Sang-Jin Kim1,2,3.
Abstract
Novosphingobium pentaromativorans US6-1(T) is a species in the family Sphingomonadaceae. According to the phylogenetic analysis based on 16S rRNA gene sequence of the N. pentaromativorans US6-1(T) and nine genome-sequenced strains in the genus Novosphingobium, the similarity ranged from 93.9 to 99.9 % and the highest similarity was found with Novosphingobium sp. PP1Y (99.9 %), whereas the ANI value based on genomes ranged from 70.9 to 93 % and the highest value was 93 %. This microorganism was isolated from muddy coastal bay sediments where the environment is heavily polluted by polycyclic aromatic hydrocarbons (PAHs). It was previously shown to be capable of degrading multiple PAHs, including benzo[a]pyrene. To further understand the PAH biodegradation pathways the previous draft genome of this microorganism was revised to obtain a complete genome using Illumina MiSeq and PacBio platform. The genome of strain US6-1(T) consists of 5,457,578 bp, which includes the 3,979,506 bp chromosome and five megaplasmids. It comprises 5110 protein-coding genes and 82 RNA genes. Here, we provide an analysis of the complete genome sequence which enables the identification of new characteristics of this strain.Entities:
Keywords: Extradiol dioxygenase; Megaplasmids; Novosphingobium; Polycyclic aromatic hydrocarbon
Year: 2015 PMID: 26594308 PMCID: PMC4653889 DOI: 10.1186/s40793-015-0102-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree highlighting the position of Novosphingobium pentaromativorans US6-1T (in bold) relative to the other validly published 28 type strains, and 4 non-type strains that have their whole genome sequences (indicated with *) within genus Novosphingobium. A total of 1305 unambiguously aligned sequences were compared and phylogenetic trees were reconstructed using the neighbor-joining [26], maximum-likelihood [27] and maximum-parsimony [28] methods. Bootstrap values (%) are based on 1000 replicates and are indicated at the nodes when they are higher than 50 % [29]. The evolutionary distances were calculated by the Jukes-Cantor method [30] using MEGA5 [31]. The nodes are marked with filled or open circles when the node was recovered by all three or by two treeing methods, respectively. Sphingosinicella microcystinivorans Y2T was used as an outgroup. Scale bar; 0.005 changes per nucleotide position
Classification and general features of N. pentaromativorans US6-1T
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain US6-1T | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | not reported | NAS | |
| Temperature range | 15-40 °C | IDA [ | |
| Optimum temperature | 30 °C | TAS [ | |
| pH range; Optimum | 6–9; 6.5 | TAS [ | |
| Carbon source | cyclodextrin, dextrin, glucose, maltose, sucrose, psicose, propionic acid,alanine, glutamic acid, proline | TAS [ | |
| MIGS-6 | Habitat | muddy sediment | TAS [ |
| MIGS-6.3 | Salinity | requires (2.5 %) | TAS [ |
| MIGS-22 | Oxygen requirement | Facultative anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | non-pathogen | TAS [ |
| MIGS-4 | Geographic location | Ulsan Bay, Republic of Korea | TAS [ |
| MIGS-5 | Sample collection time | 2000 | NAS |
| MIGS-4.1 | Latitude | 129°23′14″ | NAS |
| MIGS-4.2 | Longitude | 35°29′48.5″N | NAS |
| MIGS-4.4 | Altitude | −8 m | NAS |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina MiSeq, PacBio 10 K |
| MIGS-29 | Sequencing platforms | Illumina MiSeq, PacBio 10 K |
| MIGS-31.2 | Fold coverage | 395.08 × Illumina, 128.82 × PacBio |
| MIGS-30 | Assemblers | Roche gsAssembler 2.6, PacBio SMRT |
| Analysis 2.2.0, CLCbio CLC Genomics | ||
| Workbench version 7.0.4 | ||
| MIGS-32 | Gene calling method | Prodigal, tRNA-Scan-SE, HMMER |
| Locus Tag | JI59 | |
| GenBank ID | CP009291-6 | |
| GenBank Date of Release | October 10, 2014 | |
| GOLD ID | Gs0114422 | |
| BIOPROJECT | PRJNA257352 | |
| MIG-13 | Source Material Identifier | KCTC 10454T |
| Project relevance | Bioremediation, PAHs biodegradation pathway, Environmental |
Summary of genome: one chromosome and five plasmids
| Label | Size (Mb) | GC (%) | No. genes | Topology | INSDC identifier |
|---|---|---|---|---|---|
| Chromosome | 3.98 | 63.5 | 3811 | circular | CP009291 |
| pLA1 | 0.18 | 62.6 | 191 | circular | CP009294 |
| pLA2 | 0.06 | 60.29 | 85 | circular | CP009296 |
| pLA3 | 0.75 | 61.44 | 654 | circular | CP009292 |
| pLA4 | 0.33 | 62.4 | 326 | circular | CP009293 |
| pLA5 | 0.13 | 61.06 | 125 | circular | CP009295 |
Genome statistics
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 5,457,578 | 100.00 |
| DNA coding (bp) | 4,910,346 | 89.97 |
| DNA G + C (bp) | 3,439,297 | 63.02 |
| DNA scaffolds | 6 | 100.00 |
| Total genes | 5192 | 100.00 |
| Protein coding genes | 5110 | 98.42 |
| RNA genes | 82 | 1.58 |
| Pseudo genes | 59 | 1.14 |
| Genes in internal clusters | 4183 | 80.57 |
| Genes with function prediction | 4036 | 77.73 |
| Genes assigned to COGs | 3787 | 72.94 |
| Genes with Pfam domains | 4124 | 79.43 |
| Genes with signal peptides | 486 | 9.36 |
| Genes with transmembrane helices | 1073 | 20.67 |
| CRISPR repeats | 0 | 0 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Fig. 2Circular maps and genetic features of the chromosome and its plasmids of N. pentaromativorans US6-1T displaying relevant genome features. From outside to center; Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), GC content and GC skew. Order and size counterclockwise from an upper map: Chr, 3.98 Mb; pLA 1, 0.18 Mb; pLA 2, 0.06 Mb; pLA 3, 0.75 Mb; pLA 4, 0.33 Mb; pLA 5, 0.13 Mb
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 167 | 3.1 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 267 | 4.9 | Transcription |
| L | 289 | 5.3 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 36 | 0.7 | Cell cycle control, Cell division, chromosome partitioning |
| V | 50 | 0.9 | Defense mechanisms |
| T | 122 | 2.2 | Signal transduction mechanisms |
| M | 245 | 4.5 | Cell wall/membrane/envelope biogenesis |
| N | 64 | 1.2 | Cell motility |
| U | 76 | 1.4 | Intracellular trafficking and secretion |
| O | 172 | 3.1 | Posttranslational modification, protein turnover, chaperones |
| C | 294 | 5.4 | Energy production and conversion |
| G | 177 | 3.2 | Carbohydrate transport and metabolism |
| E | 272 | 5.4 | Amino acid transport and metabolism |
| F | 67 | 3.2 | Nucleotide transport and metabolism |
| H | 131 | 5.0 | Coenzyme transport and metabolism |
| I | 260 | 4.8 | Lipid transport and metabolism |
| P | 264 | 4.8 | Inorganic ion transport and metabolism |
| Q | 100 | 1.8 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 382 | 7.0 | General function prediction only |
| S | 351 | 6.4 | Function unknown |
| - | 1676 | 30.7 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome
Fig. 3The relationship between 16S rRNA gene sequence similarities and ANI values for strains in the genus of Novosphingobium. The species boundary of 16S rRNA gene sequence similarity and ANI value are indicated at 97–98.65 % [24] and 95–96 % [32], respectively. 1, N. acidiphilum DSM 19966T; 2, N. tardaugens NBRC 16725T; 3, N. aromaticivorans DSM 12444T; 4, Novosphingobium sp. B-7; 5, N. nitrogenifigens DSM 19370T; 6, Novosphingobium sp. Rr 2-17; 7, N. lindaniclasticum LE124T; 8, Novosphingobium sp. AP12; 9, Novosphingobium sp. PP1Y