| Literature DB >> 28439121 |
Arghya Mukherjee1, Bobby Chettri2, James S Langpoklakpam2, Pijush Basak3, Aravind Prasad4, Ashis K Mukherjee5, Maitree Bhattacharyya3, Arvind K Singh2, Dhrubajyoti Chattopadhyay6.
Abstract
Microbial remediation of oil polluted habitats remains one of the foremost methods for restoration of petroleum hydrocarbon contaminated environments. The development of effective bioremediation strategies however, require an extensive understanding of the resident microbiome of these habitats. Recent developments such as high-throughput sequencing has greatly facilitated the advancement of microbial ecological studies in oil polluted habitats. However, effective interpretation of biological characteristics from these large datasets remain a considerable challenge. In this study, we have implemented recently developed bioinformatic tools for analyzing 65 16S rRNA datasets from 12 diverse hydrocarbon polluted habitats to decipher metagenomic characteristics of the resident bacterial communities. Using metagenomes predicted from 16S rRNA gene sequences through PICRUSt, we have comprehensively described phylogenetic and functional compositions of these habitats and additionally inferred a multitude of metagenomic features including 255 taxa and 414 functional modules which can be used as biomarkers for effective distinction between the 12 oil polluted sites. Additionally, we show that significantly over-represented taxa often contribute to either or both, hydrocarbon degradation and additional important functions. Our findings reveal significant differences between hydrocarbon contaminated sites and establishes the importance of endemic factors in addition to petroleum hydrocarbons as driving factors for sculpting hydrocarbon contaminated bacteriomes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28439121 PMCID: PMC5430712 DOI: 10.1038/s41598-017-01126-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of datasets used in the study. (For additional details, refer to Supplementary Data Table S1).
| Biome Type | ID | Sequencing Platform | Location | Depth of sample collection (cm below surface) | Source material for sequencing | Predominant contaminant/hydrocarbon | Reference |
|---|---|---|---|---|---|---|---|
| Urban | I1-I4 | 454 GS Junior | Assam, India | 0–10 and 20–30 |
| Crude oil | This study |
| Arctic | A1-A12 | Ion Torrent PGM | USA, Canada, Norway, Russia, and Greenland | 0–15 | Treated microcosm sediment | Diesel oil | Bell |
| Urban | C1-C9 | Illumina Miseq | Changqing and Daqing, China | 2–10 |
| Crude oil | Sun |
| Mangrove | M1-M3 | 454 GS FLX | Restinga da Marambaia, Rio de Janeiro, Brazil | 0–20 | Treated microcosm sediment | Crude oil | dos Santos |
| Marine sediment | DWH1-DWH7 | Illumina | Gulf of Mexico | 0–1# |
| Crude oil | Mason |
| Taiga | Tu1-Tu4 | 454 GS FLX | Walagan, Walagan North, Taiyuan, and Jiagedaqi, China | 20–30 | Treated microcosm sediment | Crude oil | Yang |
| Taiga | Tb1-Tb4 | 454 GS FLX | Walagan, Walagan North, Taiyuan, and Jiagedaqi, China | 70–80 | Treated microcosm sediment | Crude oil | Yang |
| Taiga | Tp1-Tp4 | 454 GS FLX | Walagan, Walagan North, Taiyuan, and Jiagedaqi, China | 140–150 | Treated microcosm sediment | Crude oil | Yang |
| Arctic | OSC1, OSC3-5, OSC7, OSC9, OSC12 | 454 GS FLX | Alberta, Canada | 2,985–2,990 |
| Oil sands bitumen | An |
| Arctic | OSTPu1-OSTPu4 | 454 GS FLX | Alberta, Canada | 100–240 |
| Bitumen and various other hydrocarbons | An |
| Arctic | OSTPm2, OSTPm4, OSTPm6 | 454 GS FLX | Alberta, Canada | 610–750 |
| Bitumen and various other hydrocarbons | An |
| Arctic | OSTPd1-OSTPd4 | 454 GS FLX | Alberta, Canada | 1220–1370 |
| Bitumen and various other hydrocarbons | An |
#All samples collected at an average of ~1500 metres below sea level, depth given is from the surface of the ocean floor.
Figure 1Taxonomic distribution of bacterial communities in oil contaminated environments. Taxonomic clades detected at an average relative abundance ≥2% in at least one of 12 oil contaminated habitats, (A) at the phylum level, and (B) at the order level.
Similarities of bacterial community structure within a habitat and between pairs of habitats expressed as Bray-Curtis distances.
|
| India oil refineries | Arctic | China oil refineries | Mangrove | Marine sediments | Taiga upper active layer | Taiga bottom active layer | Taiga permafrost layer | Oil sands core | Oil sands tailings pond upper | Oil sands tailings pond median | Oil sands tailings pond deep |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| India oil refineries (I) |
| 0.51 ± 0.07 | 0.39 ± 0.03 | 0.44 ± 0.04 | 0.47 ± 0.04 | 0.41 ± 0.06 | 0.41 ± 0.04 | 0.36 ± 0.05 | 0.48 ± 0.04 | 0.46 ± 0.08 | 0.46 ± 0.08 | 0.48 ± 0.08 |
| Arctic (A) | 0.51 ± 0.07 |
| 0.48 ± 0.05 | 0.46 ± 0.03 | 0.42 ± 0.06 | 0.45 ± 0.07 | 0.41 ± 0.05 | 0.39 ± 0.05 | 0.49 ± 0.05 | 0.39 ± 0.05 | 0.39 ± 0.04 | 0.41 ± 0.06 |
| China oil refineries (C) | 0.39 ± 0.03 | 0.48 ± 0.05 |
| 0.48 ± 0.02 | 0.40 ± 0.05 | 0.38 ± 0.06 | 0.35 ± 0.05 | 0.34 ± 0.06 | 0.37 ± 0.06 | 0.32 ± 0.02 | 0.35 ± 0.04 | 0.34 ± 0.05 |
| Mangrove (M) | 0.44 ± 0.04 | 0.46 ± 0.03 | 0.48 ± 0.02 |
| 0.54 ± 0.05 | 0.34 ± 0.03 | 0.34 ± 0.02 | 0.31 ± 0.02 | 0.36 ± 0.02 | 0.41 ± 0.02 | 0.43 ± 0.04 | 0.41 ± 0.05 |
| Marine sediments (DWH) | 0.47 ± 0.04 | 0.42 ± 0.06 | 0.40 ± 0.05 | 0.54 ± 0.05 |
| 0.35 ± 0.05 | 0.35 ± 0.02 | 0.33 ± 0.05 | 0.43 ± 0.02 | 0.38 ± 0.03 | 0.39 ± 0.03 | 0.42 ± 0.04 |
| Taiga upper active layer (Tu) | 0.41 ± 0.06 | 0.45 ± 0.07 | 0.38 ± 0.06 | 0.34 ± 0.03 | 0.35 ± 0.05 |
| 0.59 ± 0.17 | 0.52 ± 0.18 | 0.45 ± 0.09 | 0.34 ± 0.06 | 0.35 ± 0.05 | 0.37 ± 0.07 |
| Taiga bottom active layer (Tb) | 0.41 ± 0.04 | 0.41 ± 0.05 | 0.35 ± 0.05 | 0.34 ± 0.02 | 0.35 ± 0.02 | 0.59 ± 0.17 |
| 0.55 ± 0.20 | 0.45 ± 0.05 | 0.33 ± 0.04 | 0.34 ± 0.04 | 0.37 ± 0.06 |
| Taiga permafrost layer (Tp) | 0.36 ± 0.05 | 0.39 ± 0.05 | 0.34 ± 0.06 | 0.31 ± 0.02 | 0.33 ± 0.05 | 0.52 ± 0.18 | 0.55 ± 0.20 |
| 0.42 ± 0.10 | 0.32 ± 0.06 | 0.33 ± 0.05 | 0.36 ± 0.08 |
| Oil sands core (OSC) | 0.48 ± 0.04 | 0.49 ± 0.05 | 0.37 ± 0.06 | 0.36 ± 0.02 | 0.43 ± 0.02 | 0.45 ± 0.09 | 0.45 ± 0.05 | 0.42 ± 0.10 |
| 0.45 ± 0.05 | 0.45 ± 0.04 | 0.56 ± 0.10 |
| Oil sands tailings pond upper (OSTPu) | 0.46 ± 0.08 | 0.39 ± 0.05 | 0.32 ± 0.02 | 0.41 ± 0.02 | 0.38 ± 0.03 | 0.34 ± 0.06 | 0.33 ± 0.04 | 0.32 ± 0.06 | 0.45 ± 0.05 |
| 0.64 ± 0.09 | 0.63 ± 0.13 |
| Oil sands tailings pond median (OSTPm) | 0.46 ± 0.08 | 0.39 ± 0.04 | 0.35 ± 0.04 | 0.43 ± 0.04 | 0.39 ± 0.03 | 0.35 ± 0.05 | 0.34 ± 0.04 | 0.33 ± 0.05 | 0.45 ± 0.04 | 0.64 ± 0.09 |
| 0.61 ± 0.12 |
| Oil sands tailings pond deep (OSTPd) | 0.48 ± 0.08 | 0.41 ± 0.06 | 0.34 ± 0.05 | 0.41 ± 0.05 | 0.42 ± 0.04 | 0.37 ± 0.07 | 0.37 ± 0.06 | 0.36 ± 0.08 | 0.56 ± 0.10 | 0.63 ± 0.13 | 0.61 ± 0.12 |
|
Figure 2Non-metric multidimensional scaling (NMDS) plot of taxonomic composition of all oil contaminated samples of all habitats. NMDS ordination of 65 oil contaminated samples across 12 habitats was carried out based on Bray-Curtis similarity distances calculated from pairwise taxonomic profile comparisons between all samples. Taxonomic clades present in at least one sample at a relative abundance ≥0.5% were used as input. A shorter linear distance between two samples denote greater similarity between the corresponding samples. Samples from 12 environments are depicted by different colors (see legend).
Summary table showing differentially abundant bacterial clades at the Family level detected by LEfSe.
|
|
|
|
|---|---|---|
|
| A | Bacteria|Actinobacteria|Acidimicrobiia|Acidimicrobiales|Iamiaceae |
|
| A | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Microbacteriaceae |
|
| C | Bacteria|Acidobacteria|Acidobacteria 6|iii1 15|mb2424 |
|
| C | Bacteria|Acidobacteria|Chloracidobacteria|RB41|Ellin6075 |
|
| C | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Dietziaceae |
|
| C | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Geodermatophilaceae |
|
| C | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Micromonosporaceae |
|
| C | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Mycobacteriaceae |
|
| C | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Nocardiaceae |
|
| C | Bacteria|Actinobacteria|Thermoleophilia|Solirubrobacterales|Solirubrobacteraceae |
|
| C | Bacteria|Bacteroidetes|Cytophagia|Cytophagales|Cytophagaceae |
|
| C | Bacteria|Chloroflexi|Chloroflexi|Roseiflexales|Kouleothrixaceae |
|
| C | Bacteria|Chloroflexi|TK10|AKYG885|Dolo 23 |
|
| C | Bacteria|Planctomycetes|Planctomycetia|Gemmatales|Gemmataceae |
|
| C | Bacteria|Planctomycetes|Planctomycetia|Pirellulales|Pirellulaceae |
|
| C | Bacteria|Planctomycetes|Planctomycetia|Planctomycetales|Planctomycetaceae |
|
| C | Bacteria|Proteobacteria|Deltaproteobacteria|Myxococcales|Myxococcaceae |
|
| C | Bacteria|Verrucomicrobia|Opitutae|Opitutales|Opitutaceae |
|
| C | Bacteria|Verrucomicrobia|Spartobacteria|Chthoniobacterales|Chthoniobacteraceae |
|
| DWH | Bacteria|Bacteroidetes|Flavobacteriia|Flavobacteriales|Flavobacteriaceae |
|
| DWH | Bacteria|Bacteroidetes|Flavobacteriia|Flavobacteriales|Weeksellaceae |
|
| DWH | Bacteria|Proteobacteria|Alphaproteobacteria|Rhodobacterales|Rhodobacteraceae |
|
| DWH | Bacteria|Proteobacteria|Gammaproteobacteria|Alteromonadales|Alteromonadaceae |
|
| DWH | Bacteria|Proteobacteria|Gammaproteobacteria|Alteromonadales|Colwelliaceae |
|
| DWH | Bacteria|Proteobacteria|Gammaproteobacteria|Marinicellales|Marinicellaceae |
|
| DWH | Bacteria|Proteobacteria|Gammaproteobacteria|Xanthomonadales|Xanthomonadaceae |
|
| DWH | Bacteria|Verrucomicrobia|Verrucomicrobiae|Verrucomicrobiales|Verrucomicrobiaceae |
|
| I | Bacteria|Acidobacteria|Acidobacteriia|Acidobacteriales|Acidobacteriaceae |
|
| I | Bacteria|Acidobacteria|Acidobacteriia|Acidobacteriales|Koribacteraceae |
|
| I | Bacteria|Bacteroidetes|Saprospirae|Saprospirales|Chitinophagaceae |
|
| I | Bacteria|Chlorobi|Ignavibacteria|Ignavibacteriales|Ignavibacteriaceae |
|
| I | Bacteria|Proteobacteria|Alphaproteobacteria|Rhodospirillales|Acetobacteraceae |
|
| I | Bacteria|Proteobacteria|Alphaproteobacteria|Rhodospirillales|Rhodospirillaceae |
|
| I | Bacteria|Proteobacteria|Betaproteobacteria|Hydrogenophilales|Hydrogenophilaceae |
|
| I | Bacteria|Proteobacteria|Gammaproteobacteria|Xanthomonadales|Sinobacteraceae |
|
| M | Bacteria|Planctomycetes|Phycisphaerae|Phycisphaerales|Phycisphaeraceae |
|
| M | Bacteria|Proteobacteria|Alphaproteobacteria|Sphingomonadales|Erythrobacteraceae |
|
| M | Bacteria|Proteobacteria|Deltaproteobacteria|Desulfuromonadales|Desulfuromonadaceae |
|
| M | Bacteria|Spirochaetes|Spirochaetes|Spirochaetales|Spirochaetaceae |
|
| OSC | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Propionibacteriaceae |
|
| OSC | Bacteria|Proteobacteria|Alphaproteobacteria|Rhizobiales|Brucellaceae |
|
| OSC | Bacteria|Proteobacteria|Alphaproteobacteria|Rhizobiales|Methylobacteriaceae |
|
| OSC | Bacteria|Proteobacteria|Betaproteobacteria|Burkholderiales|Oxalobacteraceae |
|
| OSC | Bacteria|Proteobacteria|Gammaproteobacteria|Enterobacteriales|Enterobacteriaceae |
|
| OSC | Bacteria|Proteobacteria|Gammaproteobacteria|Pseudomonadales|Moraxellaceae |
|
| OSTPd | Bacteria|Proteobacteria|Betaproteobacteria|Rhodocyclales|Rhodocyclaceae |
|
| OSTPm | Bacteria|Chloroflexi|Anaerolineae|Anaerolineales|Anaerolinaceae |
|
| OSTPm | Bacteria|Proteobacteria|Deltaproteobacteria|Desulfobacterales|Desulfobulbaceae |
|
| OSTPm | Bacteria|Proteobacteria|Deltaproteobacteria|Syntrophobacterales|Syntrophaceae |
|
| OSTPm | Bacteria|Proteobacteria|Gammaproteobacteria|Pseudomonadales|Pseudomonadaceae |
|
| OSTPu | Bacteria|Firmicutes|Clostridia|Clostridiales|Peptococcaceae |
|
| OSTPu | Bacteria|Proteobacteria|Betaproteobacteria|Burkholderiales|Comamonadaceae |
|
| OSTPu | Bacteria|Proteobacteria|Deltaproteobacteria|Desulfuromonadales|Geobacteraceae |
|
| OSTPu | Bacteria|Proteobacteria|Deltaproteobacteria|Syntrophobacterales|Syntrophorhabdaceae |
|
| Tb | Bacteria|Actinobacteria|Thermoleophilia|Gaiellales|Gaiellaceae |
|
| Tb | Bacteria|Proteobacteria|Alphaproteobacteria|Caulobacterales|Caulobacteraceae |
|
| Tb | Bacteria|Proteobacteria|Alphaproteobacteria|Rhizobiales|Bradyrhizobiaceae |
|
| Tb | Bacteria|Proteobacteria|Alphaproteobacteria|Rhizobiales|Hyphomicrobiaceae |
|
| Tp | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Sporichthyaceae |
|
| Tp | Bacteria|Chloroflexi|Ktedonobacteria|Thermogemmatisporales|Thermogemmatisporaceae |
|
| Tp | Bacteria|Proteobacteria|Alphaproteobacteria|Sphingomonadales|Sphingomonadaceae |
|
| Tp | Bacteria|Proteobacteria|Betaproteobacteria|Burkholderiales|Alcaligenaceae |
|
| Tu | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Intrasporangiaceae |
|
| Tu | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Micrococcaceae |
|
| Tu | Bacteria|Actinobacteria|Actinobacteria|Actinomycetales|Nocardioidaceae |
|
| Tu | Bacteria|Proteobacteria|Betaproteobacteria|Burkholderiales|Burkholderiaceae |
†Column labelled “Habitat” represents the petroleum contaminated environment in which the corresponding taxa (as presented in column labelled “Differentially abundant Taxa”), was found to be significantly differentially abundant by LEfSe using the one class, non-strict test (Please refer to Materials and methods, and Supplementary Table S2 for details). Acronyms represent the following habitats: A: Arctic, C: China oil refineries, I: India oil refineries, M: Mangrove, DWH: Marine sediments, OSC: Oil sands core, OSTPu: Oil sands tailings pond upper, OSTPm: Oil sands tailings pond median, OSTPd: Oil sands tailings pond deep, Tb: Taiga bottom active layer, Tu: Taiga upper active layer, Tp: Taiga permafrost layer.
‡Taxonomy is described using the following hierarchy: Kingdom|Phylum|Class|Order|Family|Genus|species.
Core modules shared between habitats as detected by HUMAnN2.
| Module ID | Definition of modules in KEGG |
|---|---|
| M00005 | PRPP biosynthesis, ribose 5 P = > PRPP |
| M00020 | Serine biosynthesis, glycerate-3P = > serine |
| M00149 | Succinate dehydrogenase, prokaryotes |
| M00153 | Cytochrome d ubiquinol oxidase |
| M00157 | F-type ATPase, prokaryotes and chloroplasts |
| M00178 | Ribosome, bacteria |
| M00188 | NitT/TauT family transport system |
| M00222 | Phosphate transport system |
| M00223 | Phosphonate transport system |
| M00236 | Putative polar amino acid transport system |
| M00237 | Branched-chain amino acid transport system |
| M00239 | Peptides/nickel transport system |
| M00240 | Iron complex transport system |
| M00250 | Lipopolysaccharide transport system |
| M00254 | ABC-2 type transport system |
| M00255 | Lipoprotein-releasing system |
| M00256 | Cell division transport system |
| M00258 | Putative ABC transport system |
| M00320 | Lipopolysaccharide export system |
Figure 3Metabolic reconstruction and functional biomarkers of metagenomes from oil polluted habitats. Cladogram showing a subset of the 4-level KEGG BRITE hierarchical structure denoted by four rings, as inferred against KEGG metabolic modules detected by HUMAnN2 from metagenomic gene family abundance data produced by PICRUSt for all oil contaminated samples. The outermost ring represents KEGG functional modules that have been detected in at least one of the 65 PICRUSt predicted metagenomes as reconstructed by HUMAnN2, while the innermost ring represents the Level 1 KEGG BRITE clades. Differentially abundant KEGG metabolic modules inferred by LEfSe using KEGG module abundance data generated by HUMAnN2 are colored corresponding to the oil contaminated habitat they have been identified to be differentially abundant in (see legend). Circles not differentially abundant in any habitat are colorless. Brackets represent a single KEGG BRITE clade at that Level from which daughter clades originate. KEGG BRITE clades with a single daughter clade are joined using regular branches. Annotations for the KEGG BRITE hierarchy follow an outside-in pattern, wherein Level 1 KEGG BRITE clades are annotated in the outermost section of the cladogram with lower clades annotated further inside ending at the outermost circle in that section of the cladogram. More information on this style of representation can be found elsewhere[28, 36, 83].
Figure 4Subset of significant correlations exhibited between KEGG orthologous gene families and bacterial clade abundances. Spearman correlations were calculated between KEGG orthologous gene families and phylotypes at any taxonomic level from phylum to OTU within 4 oil polluted habitats (habitats with six or more samples). A subset of significant associations with correlation >0.7 and p-value < 0.001 reaching a Benjamini-Hochberg false discovery rate < 0.01 are shown here. Taxonomic clades are represented in rectangles with a light purple background and KEGG orthologs are depicted in rectangles with white background (see legend). KEGG orthologs are colored according to corresponding KEGG modules, wherever applicable (see legend). Correlations for each habitat is depicted using different colors (red, Arctic; blue, China oil refineries; turquoise, Marine sediments; midnight blue, Oil sands core) with positive and negative associations represented by continuous and broken arrow lines respectively (see legend).
Figure 5SparCC network plot of global bacterial interactions in individual oil polluted habitats. Significant bacterial associations captured by SparCC (p-value < 0.01) with an absolute correlation magnitude of ≥0.6 are presented. Nodes represent detected phylotypes (OTU clustered at 97% similarity) involved in either significant co-occurrence (green edges) or co-exclusion (red edges) relationships. Border coloration depicts taxonomic affiliation of nodes at the phylum level (see legend). Node size is proportional to the connectivity of the node (both positive and negative relationships).