| Literature DB >> 27681901 |
Abstract
Hydrocarbon compounds can be biodegraded by anaerobic microorganisms to form methane through an energetically interdependent metabolic process known as syntrophy. The microorganisms that perform this process as well as the energy transfer mechanisms involved are difficult to study and thus are still poorly understood, especially on an environmental scale. Here, metagenomic data was analyzed for specific clusters of orthologous groups (COGs) related to key energy transfer genes thus far identified in syntrophic bacteria, and principal component analysis was used in order to determine whether potentially syntrophic environments could be distinguished using these syntroph related COGs as opposed to universally present COGs. We found that COGs related to hydrogenase and formate dehydrogenase genes were able to distinguish known syntrophic consortia and environments with the potential for syntrophy from non-syntrophic environments, indicating that these COGs could be used as a tool to identify syntrophic hydrocarbon biodegrading environments using metagenomic data.Entities:
Keywords: hydrocarbon biodegradation; hydrocarbon metagenomics project; metagenomics; methanogenesis; microbial interactions; principal component analysis; syntrophy
Year: 2016 PMID: 27681901 PMCID: PMC5029510 DOI: 10.3390/microorganisms4010005
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Metagenomes analyzed in this study. All metagenomes were publically available on the IMG database. Source indicates whether metagenome was originally obtained through the Hydrocarbon Metagenomics Project (HMP) or through searching the IMG database [9,10,16]. Classification was performed according to sampling location information as well as microbial community composition (Figure 1). Community information not available.
| # | Metagenome Name | IMG Genome ID | Source | Gene Count | Classification |
|---|---|---|---|---|---|
| 1 | Marine microbial communities from Deepwater Horizon subsurface plume in Gulf of Mexico, 16-4 Below Plume (16-4 Below Plume) | 3300005379 | IMG | 112790 | Non-Syntrophic Hydrocarbon |
| 2 | Marine microbial communities from Deepwater Horizon subsurface plume in Gulf of Mexico, 16-5 In Plume (16-5 In Plume) | 3300005380 | IMG | 114085 | Non-Syntrophic Hydrocarbon |
| 3 | Marine microbial communities from Deepwater Horizon subsurface plume in Gulf of Mexico, 52-1 Below Plume (52-1 Below Plume) | 2149837027 | IMG | 60113 | Non-Syntrophic Hydrocarbon |
| 4 | Marine microbial communities from Deepwater Horizon subsurface plume in Gulf of Mexico, 52-4 In plume (52-4 In Plume) | 3300005378 | IMG | 102800 | Non-Syntrophic Hydrocarbon |
| 5 | Oil sands microbial communities from Horse River, Alberta, Canada—outcrops (H1C: 454 sequencing assembly) | 3300001422 | HMP | 570427 | Non-Syntrophic Hydrocarbon |
| 6 | Oil sands microbial communities from Horse River, Alberta, Canada—outcrops collected from inside the river (H1R: 454 sequencing assembly) | 3300001393 | HMP | 559882 | Non-Syntrophic Hydrocarbon |
| 7 | Syncrude MLSB Tailings Pond Water Surface—Tailings pond microbial communities from Northern Alberta—Syncrude Mildred Lake Settling Basin (PDSYNTPWS: 454+illumina sequencing assembly) | 3300001605 | HMP | 3740874 | Non-Syntrophic Hydrocarbon |
| 8 | Syncrude MLSB WIP Surface + Isolates—Tailings pond microbial communities from Northern Alberta—Syncrude Mildred Lake Settling Basin (WIP-PD_SYN_TP_WS_002_003_071511 and isolates PD8, PD9 joint assembly) | 3300001239 | HMP | 768174 | Non-Syntrophic Hydrocarbon |
| 9 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—Microbes from Oil-contaminated ecosystem in Alberta, Canada Inniskillen 604.3 (Inniskillen 604.3: 454 sequencing assembly) | 3300001190 | HMP | 25694 | Non-Syntrophic Hydrocarbon |
| 10 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—Microbes from Sediment core from a heavy oil reservoir, Alberta Canada Inniskillen 614.3 (Inniskillen 614.3: 454+illumina sequencing assembly) | 3300001197 | HMP | 204944 | Non-Syntrophic Hydrocarbon |
| 11 | Arctic peat soil from Barrow, Alaska—NGEE Surface sample 210-1 shallow-072012 (NGEE Surface sample 210-1 shallow-072012, ASSEMBLY_DATE=20130514) | 3300001414 | IMG | 14379538 | Non-Syntrophic |
| 12 | Arctic peat soil from Barrow, Alaska—NGEE Surface sample 210-2 deep-072012 (NGEE Surface sample 210-2 deep-072012, ASSEMBLY_DATE=20130514) | 3300001396 | IMG | 8697097 | Non-Syntrophic |
| 13 | Arctic peat soil from Barrow, Alaska—NGEE Surface sample 210-2 deep-092012 (NGEE Surface sample 210-2 deep-092012, ASSEMBLY_DATE=20130516) | 3300001385 | IMG | 6241552 | Non-Syntrophic |
| 14 | Arctic peat soil from Barrow, Alaska—NGEE Surface sample 210-2 shallow-072012 (NGEE Surface sample 210-2 shallow-072012, ASSEMBLY_DATE=20130514) | 3300001416 | IMG | 14687361 | Non-Syntrophic |
| 15 | Freshwater microbial communities from Lake Mendota, WI—02JUN2012 deep hole epilimnion (Lake Mendota Deep Hole Epilimnion 02Jun12, ASSEMBLY_DATE=20140125) | 3300002296 | IMG | 1990049 | Non-Syntrophic |
| 16 | Human retroauricular crease microbial communities from NIH, USA—visit 1, subject 338793263 | 7000000458 | IMG | 36795 | Non-Syntrophic |
| 17 | Human right retroauricular crease microbial communities from NIH, USA—visit 2 of subject 763961826 replicate 2 | 7000000031 | IMG | 39970 | Non-Syntrophic |
| 18 | Marine microbial communities from expanding oxygen minimum zones in Line P, North Pacific Ocean—August 2009 P16 10m (Line P August 2009 P16 10m, March 2012 Assem) | 3300000149 | IMG | 238270 | Non-Syntrophic |
| 19 | Marine microbial communities from expanding oxygen minimum zones in Line P, North Pacific Ocean—June 2008 P4 1300m (Line P June 2008 P4 1300m, March 2012 Assem) | 3300000141 | IMG | 256292 | Non-Syntrophic |
| 20 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet—54 02/08/11 120m (Saanich Inlet 54 02/08/11 120m, March 2012 Assem) | 3300000146 | IMG | 169407 | Non-Syntrophic |
| 21 | Soil microbial communities from Great Prairies—Iowa, Native Prairie soil (Iowa, Native Prairie soil, Feb 2012 Assem MSU hiseq + gaii) | 3300000364 | IMG | 8508638 | Non-Syntrophic |
| 22 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA—GMW60B uncontaminated upgradient, 5.4 m (Subsurface groundwater monitoring well GMW60B uncontam upgr,5.4m, Oct 2012 Assem) | 3300000571 | IMG | 1391570 | Non-Syntrophic |
| 23 | Switchgrass and industrial compost incubating bioreactor microbial communities from the Joint BioEnergy Institute, California, USA, that is aerobic and thermophilic—SG0.5JP960 (454-Illumina assembly) —version 2 (454-Illumina assembly v2) | 3300005442 | IMG | 62968 | Non-Syntrophic |
| 24 | Wastewater treatment Type I Accumulibacter community from EBPR Bioreactor in Madison, WI—N_134min_Aerobic | 3300002344 | IMG | 367402 | Non-Syntrophic |
| 25 | Coal bed methane well microbial communities from Alberta, Canada (CO182: coal bed cutting Illumina Assembly) | 3300000052 | HMP | 665055 | Other |
| 26 | Coal-degrading lab enrichment microbial communities from Bowden, Alberta, Canada—QSAFCN5 (QSAFCN5: 454 assembly) | 3300000507 | HMP | 257387 | Other |
| 27 | Coal-degrading lab enrichment microbial communities from Bowden, Alberta, Canada—methanogenic culture: QSAFCN2 (QSAFCN2 454 assembly) | 3300000408 | HMP | 223117 | Other |
| 43 | Sheep rumen microbial communities from New Zealand—Rank43_high (high_rank43) | 3300001531 | IMG | 911348 | Other |
| 28 | Benzene-degrading bioreactor microbial communities from Toronto, Ontario, Canada, that are methanogenic—September 2009 gDNA_4 (Assembly with PE data) | 2061766000 | IMG | 207753 | Syntrophic Culture |
| 29 | Hydrocarbon resource environments microbial communities from Canada and USA—Toluene degrading community from Alberta, Canada (Toluene: 454+illumina+illuminaFosmid sequencing assembly) | 3300001567 | HMP | 1184637 | Syntrophic Culture |
| 30 | Oil sands microbial community from Northern Alberta which degrade Naphthaline (NapDC: 454 and illumina hybrid assembly) | 3300000032 | HMP | 749231 | Syntrophic Culture |
| 31 | Tailings pond microbial communities from Northern Alberta—Short chain hydrocarbon degrading methanogenic enrichment culture SCADC: (SCADC: 454+illumina assembly) | 3300000568 | HMP | 1513645 | Syntrophic Culture |
| 32 | Wastewater bioreactor microbial communities from Singapore—TA reactor DNA contigs from 4 sample (re-annotation) (MER-FS) (assembled) | 3300005443 | IMG | 95700 | Syntrophic Culture |
| 33 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA—GMW46 contaminated, 5.4 m (Subsurface groundwater monitoring well GMW46 contaminated, 5.4m, Oct 2012 Assem) | 3300000574 | IMG | 1250031 | Potentially Syntrophic Hydrocarbon |
| 34 | Tailings pond microbial communities from Northern Alberta—TP6_2008_30ft: | 2228664008 | HMP | 389458 | Potentially Syntrophic Hydrocarbon |
| 35 | Tar lake microbial communities from La Brea, Trinidad and Tobago | 2228664012 | HMP | 195772 | Potentially Syntrophic Hydrocarbon |
| 36 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—Microbes from Suncor taillings pond 6 2012TP6_6 (2012TP6_6m: illumina sequencing assembly) | 3300001580 | HMP | 3205687 | Potentially Syntrophic Hydrocarbon |
| 37 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—Microbes in water sample from Medicine Hat oil field -PW_MHGC_2012April2: (PW_MHGC_2012April2: 454+illumina sequencing assembly final) | 3300001444 | HMP | 2443755 | Potentially Syntrophic Hydrocarbon |
| 38 | Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—Microbes in water sample from Medicine Hat oil field -PW_MHGC_2012April2: (PW_MHGC_2012April2: 454 + illumina sequencing assembly) | 3300001592 | HMP | 2251644 | Potentially Syntrophic Hydrocarbon |
| 39 | Tailings Pipe from MSLB 2011 -Wastewater microbial communities from Syncrude, Ft. McMurray, Alberta—West In Pit SyncrudeMLSB2011 (SyncrudeMLSB2011: 454+illumina assembly) | 3300000558 | HMP | 3021632 | Potentially Syntrophic Hydrocarbon |
| 42 | Marine sediment microbial communities from Arctic Ocean, off the coast from Alaska—sample from high methane PC12-225-485cm (High methane PC12-225-485cm Jan 2011 assembly) | 2140918005 | IMG | 674403 | Potentially Syntrophic |
Figure 1(A) Microbial community composition of metagenomes sequenced as part of the Hydrocarbon Metagenomics Project (HMP) [10]. Relevant community members of importance to this study are colored; all others are grey. Community composition was determined using the HMP database 454 single end data 16S rRNA based prediction (total community composition not shown) [9]. Based on sample location as well as microbial community, samples were grouped into categories for use in further analysis. (B) Microbial community composition of metagenomes obtained from the IMG database using the distribution by BLAST percent identities (cumulative) with percent hits 30%+ (total community composition not shown) [16]. Based on sample location as well as microbial community, samples were grouped into categories for use in further analysis.
Categories of clusters of orthologous groups (COGs) searched in the metagenomes. Universally present COGs were obtained from previously published information [17]. Syntroph associated COG categories were obtained from the annotations in the IMG database for each of the genes for each respective category from previously published information [8,15]. Further information on the specific COGs searched in each category can be found in Supplemental Tables S1 and S2.
| Universally Present COG Categories | G1-Ribosome and Translation Initiation |
| G2-Ribosome Associated/ Protein Modification | |
| G3-Transcription/DNA Replication | |
| G4-Unknown | |
| Syntroph Associated COG Categories | FeS Oxidoreductases |
| Fnr | |
| Fix | |
| Confurcating Hydrogenases | |
| Other Hydrogenases | |
| Membrane Hydrogenases | |
| NADH Linked Formate Dehydrogenases | |
| Other Formate Dehydrogenases | |
| Membrane Formate Dehydrogenases |
Figure 2Principal component analysis plot generated from the normalized number of universally present COGs detected in each metagenome. Categories for the universally present COGs are listed in Table 2, with the individual COGs for each category listed in Supplemental Table S1. Numbers of individual COGs found in each metagenome were summed for each COG category, divided by the total number of COGs for each respective category, and the sum was normalized against the total number of genes detected in each metagenome (Table 1 and Table 2). Principal component analysis was performed using R [18]. 95% Confidence ellipses were drawn for all metagenomes classified as syntrophic/potentially syntrophic, and for all metagenomes classified as non-syntrophic. The first two principal components are shown. The corresponding statistics and scree plot are shown in Supplemental Figure S1 and Table S3.
Figure 3Circle of correlations generated for the principal component analysis plot of the number of universally present COGs detected in each metagenome, showing the variables that make up each of the first two principal components (Figure 2).
Figure 4Principal component analysis plot generated from the normalized number of universally present COGs and normalized number of syntroph associated COGs detected in each metagenome. Categories for the universally present and syntroph associated COGs are listed in Table 2, with the individual COGs for each category listed in Supplemental Table S1 (universally present) and Table S2 (syntroph associated). The numbers of individual COGs found in each metagenome were summed for each COG category, divided by the total number of COGs for each respective category, and the sum was normalized against the total number of genes detected in each metagenome (Table 1 and Table 2). Principal component analysis was performed using R [18]. 95% Confidence ellipses were drawn for all metagenomes classified as syntrophic/potentially syntrophic, and for all metagenomes classified as non-syntrophic. The first two principal components are shown. Corresponding scree plot and statistics for each principal component are shown in Supplemental Figure S2 and Table S4.
Figure 5Circle of correlations for the variables in the principal component analysis plot of the number of universally present COGs and number of syntrophic gene associated COGs detected in the metagenomes, showing the contribution of each variable to the first two principal components (Figure 4). Hyd. = hydrogenase, FDH = formate dehydrogenase.
Figure 6Abundance of each syntroph associated COG category in each metagenome. Numbers indicated in the legend refer to the sum total of COGs found for each category in a metagenome, normalized to the number of total genes detected in that metagenome. Individual COGs comprising each category are listed in Supplemental Table S2. The total number of COGs detected in the metagenomes were summed for each category, divided by the total number of COGs for each respective category, and normalized against the number of total genes detected in each metagenome (Table 1 and Table 2). Hyd. = hydrogenase, FDH = formate dehydrogenase.