Literature DB >> 28770030

The complete genome sequence of the cold adapted crude-oil degrader: Pedobacter steynii DX4.

Sijing Chang1,2,3, Gaosen Zhang3,4, Ximing Chen3,4, Haozhi Long4,5, Yilin Wang5, Tuo Chen2,3, Guangxiu Liu3,4.   

Abstract

Pedobacter steynii DX4 was isolated from the soil of Tibetan Plateau and it can use crude oil as sole carbon and energy source at 15 °C. The genome of Pedobacter steynii DX4 has been sequenced and served as basis for analysis its metabolic mechanism. It is the first genome of crude oil degrading strain in Pedobacter genus. The 6.58 Mb genome has an average G + C content of 41.31% and encodes 5464 genes. In addition, annotation revealed that Pedobacter steynii DX4 has cold shock proteins, abundant response regulators for cell motility, and enzymes involved in energy conversion and fatty acid metabolism. The genomic characteristics could provide information for further study of oil-degrading microbes for recovery of crude oil polluted environment.

Entities:  

Keywords:  Crude oil; Degradation; Genome; Pedobacter

Year:  2017        PMID: 28770030      PMCID: PMC5531107          DOI: 10.1186/s40793-017-0249-z

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The crude oil spills occur frequently and they bring serious pollution to the terrestrial and marine environments [1, 2]. In the bioremediation of crude oil contamination, bacteria work as primary degraders [3-5]. Numerous strains be capable of degrading hydrocarbons have been singled out and identified from marine and terrestrial environments [6-8]. It was also reported that in oil polluted areas, is one of the major members of alkane degrading bacterial communities [9-11]. For the first time in genus, a cultured strain was described to have the ability to degrade crude oil [12]. The strain DX4 was isolated from frozen soil of Tibetan Plateau permafrost region. This organism was selected for genome sequencing for it exhibited the capability to utilize and degrade crude oil at a cold temperature (15 °C). In this paper, our aim was to identify genomic signatures for petroleum degradation in this strain, and investigate its application in bioremediation in cold environments.

Organism information

Classification and features

The soil sample was collected from the Dangxiong County (30.5633°N, 91.4221°E, 4488 m ASL) in the Tibetan Plateau, in 2013. The soil sample was preserved at −20 °C immediately after collection and sent to the State Key Laboratory of Cryospheric Sciences, CAS. The soil type belongs to alpine meadow soil. Crude-oil degrading strains were enriched in liquid MM medium added 2% crude oil (v/v) and incubated for 2 weeks at 20 °C [13]. The suspension of culture collection was surface spread onto the 216 L agar plates and cultivated for 5 days at 20 °C [14]. DX4 colonies on 216 L agar plates are light yellow, slightly domed mucoid and circular with smooth margins. DX4 cells are Gram negative rods, motile, non-spore-forming. The scanning electron micrograph is shown in Fig. 1. Additional characteristics of DX4 are shown in Table 1. Growth experiment was carried out in 216 L liquid medium at 20 °C and the OD600 of strain DX4 is shown in Fig. 2. In addition, Fig. 3 shows the crude oil degradation rates of the strain DX4. The degradation was carried out in liquid MM medium added 2% crude oil (v/v) at 15 °C for 2 weeks and crude oil was quantified by using gas chromatography and mass spectrometric detector [15].
Fig. 1

Scanning electron micrograph of P. steynii DX4

Table 1

Classification and general features of Pedobacter steynii DX4

MIGS IDPropertyTermEvidence code
ClassificationDomain BacteriaTAS [41]
Phylum Bacteroidetes TAS [42, 43]
Class Sphingobacteriia TAS [4446]
Order Sphingobacteriales TAS [44]
Family Sphingobacteriaceae TAS [47, 48]
Genus Pedobacter TAS [49]
Species Pedobacter steynii TAS [49]
Strain DX4
Gram stainNegativeTAS [49]
Cell shapeRodIDA
MotilityMotileTAS [49]
SporulationNon-sporulatingTAS [49]
Temperature range4-25 °CIDA
Optimum temperature20 °CTAS [50]
pH range; Optimum5-10; 7.5;IDA
Carbon sourceYeast extract, pyruvate, crude oilIDA
MIGS-6HabitatFrozen soilIDA
MIGS-6.3Salinity0.5-4.5% NaCl (w/v)TAS [51]
MIGS-22Oxygen requirementAerobicNAS
MIGS-15Biotic relationshipFree-livingIDA
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationChina: Tibetan Plateau, Dangxiong CountyIDA
MIGS-5Sample collection2013IDA
MIGS-4.1Latitude30.5633°NNAS
MIGS-4.2Longitude91.4221°ENAS
MIGS-4.4Altitude4488 mNAS

aEvidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable. Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project

Fig. 2

Growth curve of P. steynii DX4 in 216 L liquid medium at 20 °C. The absortance at 600 nm was measured every 4 h

Fig. 3

Degrading rates of crude oil by P. steynii DX4. H1 - H16: serial n-alkanes, from Undecane to Hexacosane. H17- H32: branched alkanes and cycloalkanes, in accordance with the order: Undecane,2,6-dimethyl; Dodecane,2-methyl; Dodecane,2,6,11-trimethyl; Pentadecane, 7-methyl; Octane, 2,3,7-trimethyl; Dodecane, 3-methyl; Dodecane, 2,6,10-trimethyl; 1H-Indene, octahydro-2,2,4,4,7,7-hexamethyl-, trans; Undecane, 5-cyclohexyl; H26:Undecane, 4,8-dimethyl; Decahydro-4,4,8,9,10-pentamethylnaphthalene; Pentadecane, 2-methyl; Pentadecane, 2,6,10-trimethyl; Pentadecane, 8-hexyl; Hexadecane; 2,6,10,14-tetramethyl; Ethyl iso-allocholate

Scanning electron micrograph of P. steynii DX4 Classification and general features of Pedobacter steynii DX4 aEvidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable. Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project Growth curve of P. steynii DX4 in 216 L liquid medium at 20 °C. The absortance at 600 nm was measured every 4 h Degrading rates of crude oil by P. steynii DX4. H1 - H16: serial n-alkanes, from Undecane to Hexacosane. H17- H32: branched alkanes and cycloalkanes, in accordance with the order: Undecane,2,6-dimethyl; Dodecane,2-methyl; Dodecane,2,6,11-trimethyl; Pentadecane, 7-methyl; Octane, 2,3,7-trimethyl; Dodecane, 3-methyl; Dodecane, 2,6,10-trimethyl; 1H-Indene, octahydro-2,2,4,4,7,7-hexamethyl-, trans; Undecane, 5-cyclohexyl; H26:Undecane, 4,8-dimethyl; Decahydro-4,4,8,9,10-pentamethylnaphthalene; Pentadecane, 2-methyl; Pentadecane, 2,6,10-trimethyl; Pentadecane, 8-hexyl; Hexadecane; 2,6,10,14-tetramethyl; Ethyl iso-allocholate The molecular identification was performed with the 27F-1492R primer to amplify the 16S rRNA sequence. The 16S rRNA from DX4 was 99.64% similar to the WB2.3-45T (AM491372) thus DX4 was identified as a strain of . Figure 4 shows the phylogenetic tree constructed from the 16S rRNA sequence together with other related species using MEGA 5.0 software suite. The evolutionary history was inferred by using Neighbor-joining method based on the maximum composite likehood substitution model [16, 17].
Fig. 4

Rooted phylogenetic tree of the 16 S rRNA sequences of Pedobacter steynii strain DX4 and relative species. The 16 S rRNA sequences of Pedobacter species were aligned, and the phylogenetic tree was constructed by using Neighbor-joining method based on the maximum composite likehood substitution model

Rooted phylogenetic tree of the 16 S rRNA sequences of Pedobacter steynii strain DX4 and relative species. The 16 S rRNA sequences of Pedobacter species were aligned, and the phylogenetic tree was constructed by using Neighbor-joining method based on the maximum composite likehood substitution model

Genome sequencing information

Genome project history

The strain DX4 was selected for sequencing on the basis of its potential biodegradation capability. The initial Illumina sequencing was performed in April 2016 and the genome was closed by PacBio sequencing in August 2016. The genome project is deposited in the online genome database (NCBI-Genome) and the sequence was released for public access on September 9, 2016. A summary of the project information is shown in the Table 2.
Table 2

Project information of the whole genome sequence of P. steynii DX4

MIGS IDPropertyTerm
MIGS-31Finishing qualityFinished
MIGS-28Libraries usedPaired-end (average 500 bp)PacBio (2075 and 2775 kbp)
MIGS-29Sequencing platformsIllumina Hiseq 2000 and PacBio
MIGS-31.2Fold coverageIllumina paired-end:86×PacBio: 153×
MIGS-30AssemblersSOAPdenovo 2.3,GapCloser v1.12HGAP
MIGS-32Gene calling methodGlimmer3.02
Locus TagBFS30
GenBank IDCP017141
GenBank Date of ReleaseSeptember 9, 2016
GOLD IDGp0156107
BIOPROJECTPRJNA339039
MIGS-13Source Material IdentifierDX4
Project relevanceBiodegrading
Project information of the whole genome sequence of P. steynii DX4

Growth conditions and genomic DNA preparation

DX4 was inoculated into 216 L liquid medium and grown on a shaker (200 rpm) at 20 °C, until the cells OD600nm > 1.0. Genomic DNA was extracted from freshly grown cells using the E.Z.N.A.® Bacterial DNA Kit following the standard protocol prescribed by the manufacturer.

Genome sequencing and assembly

The complete genome sequence of DX4 was sequenced using Illumina HiSeq2000 for the initial sequencing and assembly, followed by PacBio sequencing to fully close the genome sequence [18, 19]. The Illumina platform generated 1,864,026 reads totaling 561,071,826 bp, and the data were assembled into 9 scaffolds by using SOAP denovo V2.3 [20]. The coverage of the paired-end reads was 86×. For gap closure, sequencing was performed using a PacBio SMRT cell, which resulted in 198,008 reads with an average read length of 4973 bp and a coverage of 153×. The alignment of the PacBio reads were assembled with HGAP [21]. Gap closure was managed using the Gap Closer 1.12 and resulting in the final genome of one complete chromosome. This finished genome was deposited in IMG Database with the Project ID: Gp0156107. And this whole-genome project (BioProject ID: PRJNA339039) has also been registered and assembled sequence data submitted at NCBI GenBank under the accession no.CP017141. The Average Nucleotide Identity (ANI) analysis has been carried out by using a online tool [22].

Genome annotation

Glimmer 3.0 was used to predict open reading frames (ORFs) [23]. The rRNA and tRNA gene predictions and the ORFs annotation were conducted by using BLASTp against NCBI-NR database [24], the COG database [25] and the KEGG database [26]. Genes function annotations were assigned when blastp E-values were ≤0.001 [27]. If there was no significant similarity to protein in other organisms, the gene production was described as hypothetical protein.

Genome properties

The genome statistics is shown in Table 3. The genome of DX4 is 6,581,659 base pairs in size, and has a GC content of 41.31%. Out of the total 5464 genes, 23 genes are pseudogenes and 63 are tRNAs, 13 are rRNA genes, 3 are ncRNA genes, 5362 are coding sequences CDSs. Of the total CDSs, 307 are functioning unknown (5.7%), 414 are general function prediction only (7.7%) and the remaining had a defined function. The COG-distribution of genes is shown in Table 4. The genome map (Fig. 5) was visualized by CG view server. The ANI analysis showed DX4 had 83.33% nucleotide identity with DSM 19110. Comparative analysis between strains isolated from polar region was also performed. The DX4 presented 79.03% nucleotide identity with PAMC 27485 (isolated from Antarctica), 78.42% with 4BY and 76.39% with A12, revealing the great genetic distance between these strains.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)6,581,659100
DNA coding (bp)6,033,40291.67
DNA G + C (bp)2,718,88341.31
DNA scaffolds1
Total genes5464100
Protein coding genes536298.13
RNA genes791.44
Pseudo genes230.42
Genes in internal clustersNA
Genes with function prediction4147.58
Genes assigned to COGs372068.01
Genes with Pfam domains426478.04
Genes with signal peptides80414.71
Genes with transmembrane helices1783.26
CRISPR repeats1
Table 4

Number of genes of Pedobacter steynii DX4 with the general COG functional categories

CodeValue% of totala Description
J1552.9Translation, ribosomal structure and biogenesis
A00RNA processing and modification
K4177.8Transcription
L1462.7Replication, recombination and repair
B10Chromatin structure and dynamics
D200.4Cell cycle control, Cell division, chromosome partitioning
V941.8Defense mechanisms
T2614.9Signal transduction mechanisms
M3065.7Cell wall/membrane biogenesis
N170.3Cell motility
U290.5Intracellular trafficking and secretion
O1582.9Posttranslational modification, protein turnover, chaperones
C1502.8Energy production and conversion
G2554.8Carbohydrate transport and metabolism
E2574.8Amino acid transport and metabolism
F741.4Nucleotide transport and metabolism
H1252.3Coenzyme transport and metabolism
I1542.9Lipid transport and metabolism
P2905.4Inorganic ion transport and metabolism
Q901.7Secondary metabolites biosynthesis, transport and catabolism
R4147.7General function prediction only
S3075.7Function unknown
-164230.6Not in COGs

aThe total is based on the total number of protein coding genes in the genome

Fig. 5

The genome map of Pedobacter steynii strain DX4.Circle 1: Base pair numbers; Circle 2 and Circle 3:Forward and reverse coding domain sequences, the color coding of the CDS represent different Clusters of Orthologous Groups categories; Circle 4: rRNA and tRNA; Circle 5: % GC plot; Circle 6:GC skew [(GC)/(G + C)]

Genome statistics Number of genes of Pedobacter steynii DX4 with the general COG functional categories aThe total is based on the total number of protein coding genes in the genome The genome map of Pedobacter steynii strain DX4.Circle 1: Base pair numbers; Circle 2 and Circle 3:Forward and reverse coding domain sequences, the color coding of the CDS represent different Clusters of Orthologous Groups categories; Circle 4: rRNA and tRNA; Circle 5: % GC plot; Circle 6:GC skew [(GC)/(G + C)]

Insights from the genome sequence

Genome annotation predicted many traits support the adaptability of DX4 to cold and crude oil-contaminated environment. The Five cold shock proteins were predicted (NCBI Protein database: WP_069377418.1, WP_062548063.1, WP_048905418.1, WP_008241764.1 and AOM75720.1). These proteins are supposed to play important roles in low temperature conditions [28]. The related strians isolated from antarctic regions, 4BY and PAMC 27485, respectively encoded four cold shock proteins. Based on the COG analysis, 261 genes in total were assigned to the signal transduction category. Among them, 22 genes were predicted to encode the response regulators and 6 were found to encode chemotaxis protein CheY [29]. These genes could play regulatory role in environment sensing and cell motility towards the crude oil. As for aerobic alkane degradation, alkB gene has been considered as a functional biomarker for alkane-degrading bacterial populations in environmental [30-32]. But in DX4 genome, no alkB homolog coding genes were found. A gene coding for haloalkane dehalogenase (WP_069382597.1, EC 3.8.1.5) was annotated. Haloalkane dehalogenase (HLD) has considerable environmental significance because it converts haloalkanes to corresponding alcohol and hydrogen halide (KEGG database: RN: R02337,) [33, 34]. In addition to that, three luciferase proteins were identified (WP_069377707.1, WP_069380456.1 and WP_069377640.1). Research showed that the bacteria luciferase can utilize reduced FMN in the oxidation of alkanes with the emission of blue-green light [35, 36]. Figure 6 shows the genes coding for HLD and luciferase protein and adjacent genes upstream and downstream, which may be relevant genes participating in the metabolism of crude oil. In addition, the presence of 19 alcohol dehydrogenase and 23 aldehyde dehydrogenase necessary for alkane degradation as well as 11 fatty acid transport and metabolism genes suggest a complete alkane degradation pathway [37, 38].
Fig. 6

Organization of Genes coding for HLD and luciferase and their adjacent genes in P. steynii strain DX4 genome

Organization of Genes coding for HLD and luciferase and their adjacent genes in P. steynii strain DX4 genome The antibiotics and secondary metabolite analysis was done using the anti-SMASH platform [39]. In total, 12 secondary metabolite clusters were identified and 11 of them were related to antibiotics. A resorcinol metabolite cluster was identified and this cluster may play important role in the degradation of resorcinol and other aromatic compounds [40]. Interestingly, the 12 secondary metabolite clusters had no similarity with the known clusters, suggesting that the strain DX4 may possess novel secondary metabolic pathways.

Conclusions

DX4 was isolated from a cold environment and could utilize crude oil as sole carbon source. The genome of DX4 reported here provides the genetic basis of its crude oil biodegrading mechanism. Genes involved in cold shock, energy conversion and response regulators for cell motility point to the unique abilities of DX4 in oil degradation and cold environment adaptation. Genomic research on DX4 would also provide a blueprint for the application of bioremediation and recovery in cold oil-polluted environments.
  39 in total

Review 1.  Evolving haloalkane dehalogenases.

Authors:  Dick B Janssen
Journal:  Curr Opin Chem Biol       Date:  2004-04       Impact factor: 8.822

2.  List of new names and new combinations previously effectively, but not validly, published--Validation List no. 118.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2007-11       Impact factor: 2.747

3.  Protein structure prediction on the Web: a case study using the Phyre server.

Authors:  Lawrence A Kelley; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

4.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

5.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

6.  Conserved aspartate residues and phosphorylation in signal transduction by the chemotaxis protein CheY.

Authors:  R B Bourret; J F Hess; M I Simon
Journal:  Proc Natl Acad Sci U S A       Date:  1990-01       Impact factor: 11.205

7.  Abundance and diversity of soil petroleum hydrocarbon-degrading microbial communities in oil exploring areas.

Authors:  Yuyin Yang; Jie Wang; Jingqiu Liao; Shuguang Xie; Yi Huang
Journal:  Appl Microbiol Biotechnol       Date:  2014-09-20       Impact factor: 4.813

8.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.

Authors:  Michael A Quail; Miriam Smith; Paul Coupland; Thomas D Otto; Simon R Harris; Thomas R Connor; Anna Bertoni; Harold P Swerdlow; Yong Gu
Journal:  BMC Genomics       Date:  2012-07-24       Impact factor: 3.969

Review 9.  Microbial transformation of the Deepwater Horizon oil spill-past, present, and future perspectives.

Authors:  Nikole E Kimes; Amy V Callaghan; Joseph M Suflita; Pamela J Morris
Journal:  Front Microbiol       Date:  2014-11-18       Impact factor: 5.640

10.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

View more
  1 in total

1.  Understanding Microbial Community Dynamics in Up-Flow Bioreactors to Improve Mitigation Strategies for Oil Souring.

Authors:  Avishek Dutta; Ben Smith; Thomas Goldman; Leanne Walker; Matthew Streets; Bob Eden; Reinhard Dirmeier; Jeff S Bowman
Journal:  Front Microbiol       Date:  2020-12-03       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.