| Literature DB >> 27980706 |
Rebecca A Reiss1, Peter Guerra2, Oleg Makhnin1.
Abstract
Chlorinated solvent contamination of potable water supplies is a serious problem worldwide. Biostimulation protocols can successfully remediate chlorinated solvent contamination through enhanced reductive dechlorination pathways, however the process is poorly understood and sometimes stalls creating a more serious problem. Whole metagenome techniques have the potential to reveal details of microbial community changes induced by biostimulation. Here we compare the metagenome of a tetrachloroethene contaminated Environmental Protection Agency Superfund Site before and after the application of biostimulation protocols. Environmental DNA was extracted from uncultured microbes that were harvested by on-site filtration of groundwater one month prior to and five months after the injection of emulsified vegetable oil, nutrients, and hydrogen gas bioamendments. Pair-end libraries were prepared for high-throughput DNA sequencing and 90 basepairs from both ends of randomly fragmented 400 basepair DNA fragments were sequenced. Over 31 millions reads were annotated with Metagenome Rapid Annotation using Subsystem Technology representing 32 prokaryotic phyla, 869 genera, and 3,181 species. A 3.6 log2 fold increase in biomass as measured by DNA yield per mL water was measured, but there was a 9% decrease in the number of genera detected post-remediation. We apply Bayesian statistical methods to assign false discovery rates to fold-change abundance data and use Zipf's power law to filter genera with low read counts. Plotting the log-rank against the log-fold-change facilitates the visualization of the changes in the community in response to the enhanced reductive dechlorination protocol. Members of the Archaea domain increased 4.7 log2 fold, dominated by methanogens. Prior to remediation, classes Alphaproteobacteria and Betaproteobacteria dominated the community but exhibit significant decreases five months after biostimulation. Geobacter and Sulfurospirillum replace "Sideroxydans" and Burkholderia as the most abundant genera. As a result of biostimulation, Deltaproteobacteria and Epsilonproteobacteria capable of dehalogenation, iron and sulfate reduction, and sulfur oxidation increase. Matches to thermophilic, haloalkane respiring archaea is evidence for additional species involved in biodegradation of chlorinated solvents. Additionally, potentially pathogenic bacteria increase, indicating that there may be unintended consequences of bioremediation.Entities:
Keywords: Enhanced reductive dechlorination; Metagenomics
Year: 2016 PMID: 27980706 PMCID: PMC5131427 DOI: 10.1186/s40793-016-0209-z
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Geochemical Parameters and Contamination Concentrations
| Classification | Parameter | Units | SAE3-0 a
| SAE3-5 b
| Log2 Fold-Change |
|---|---|---|---|---|---|
| Dissolved Gas by Headspace | Methane | μg/L | BDLc (<1.0) | 6540.0 ± 4292.8 | 13.7 |
| Ethene | μg/L | BDL (<2.0) | BDL (<2.0) | 0 | |
| Ethane | μg/L | BDL (<2.0) | BDL (<2.0) | 0 | |
| Anions | Chloride | mg/L | 73.7 ± 1.4 | 83.4 ± 4.7 | 0.2 |
| Bromide | mg/L | 2.66 ± 2.21 | 2.48 ± 0.37 | −0.1 | |
| Nitrogen, Nitrate (as N) | mg/L | 2.05 ± 0.15 | BDL (<0.26) | −4.0 | |
| Sulfate | mg/L | 213.3 ± 5.2 | 27.5 ± 30.5 | −3.0 | |
| Metals | Dissolved Fe | mg/L | 0.22 ± 0.08 | 9.66 ± 9.93 | 5.5 |
| Total Mn | mg/L | 0.19 ± 0.04 | 7.54 ± 0.48 | 5.3 | |
| Volatiles | PCE | μg/L | 9966.7 ± 1672.9 | 157.2 ± 196.3 | −5.9 |
| TCE | μg/L | 74.7 ± 38.5 | 223.4 ± 249.6 | 1.6 | |
| Cis-1,2-DCE | μg/L | 390.0 ± 114.7 | 7840.0 ± 3266.2 | 4.3 | |
| Trans-1,2-DCE | μg/L | BDL (<125) | 143.40 ± 62.90 | 6.8 | |
| Vinyl chloride | μg/L | BDL (<50) | 53.6 ± 21.5 | 6.7 | |
| Total VOCs | μM | 61.248 ± 10.298 | 85.892 ± 32.489 | 0.5 | |
| TOC | TOC | mg/L | 2.13 ± 0.08 | 104.40 ± 28.95 | 5.6 |
| Alkalinity | Total (as CaCO3) | mg/L | 311.7 ± 4.1 | 684.0 ± 89.9 | 1.1 |
| Carbonate | mg/L | BDL (<2.0) | BDL (<2.0) | 0 | |
| Bicarbonate | mg/L | 311.7 ± 4.1 | 684.0 ± 89.9 | 1.1 | |
| Hydroxide | mg/L | BDL (<2.0) | BDL (<2.0) | 0 | |
| Sulfide | mg/L | BDL (<1.0) | 3.28 ± 1.79 | 2.7 | |
| CO2 | Total CO2 | mg/L | 291.7 ± 7.5 | 716.0 ± 103.6 | 1.3 |
| Field Data | Temperature (°C) | °C | 16.723 ± 1.276 | 20.118 ± 1.844 | 0.3 |
| Dissolved O2 | mg/L | 0.227 ± 0.166 | 0.074 ± 0.057 | −1.6 | |
| pH | 7.250 ± 0.301 | 7.270 ± 0.334 | 0.004 | ||
| Oxidation-reduction potential | −12.97 ± 46.96 | −247.72 ± 44.07 | −4.3 | ||
| Conductivity | ms/cm3 | 1.1923 ± 0.1222 | 1.6318 ± 0.1367 | 0.5 |
aEach value is the average of six geochemical sampling events prior to sampling.
bEach value is the average of five geochemical sampling events in Nov. 2007, prior to sampling.
cOne-half of the BDL divided by the dilution factor was used to calculate fold-change. For trans-1,2,-DCE and VC the sample dilution factor was 50. No dilution was used for all other BDL sample results (dilution factor = 1)
Fig. 1Heat map of TOC and VOCs during pilot scale operations at NRAP. Up- arrows indicate when microbial samples were collected; down arrows indicate when bioamendments were injected into the aquifer
Study information
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| MG-RAST ID | EW3Pre1 (4447797.3) | EW3Post1 (4447837.3) |
| SRA ID or ENA ID | NA | NA |
| Study | North Railroad Avenue Plume EPA Superfund Site | North Railroad Avenue Plume EPA Superfund Site |
| GOLD ID (sequencing project) | NA | NA |
| GOLD ID (analysis project) | NA | NA |
| NCBI BIOPROJECT | NA | NA |
| Relevance | Bioremediation of contaminated groundwater | Bioremediation of contaminated groundwater |
Sample information
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| GOLD ID (biosample) | NA | NA |
| Biome | Freshwater | Freshwater |
| Feature | Superfund site | Superfund site |
| Material | Ground water | Ground water |
| Latitude and Longitude | 35.992056, −106.07975 | 35.992056, −106.07975 |
| Vertical distance | −2.9 m | −2.9 m |
| Geographic location | Española, New Mexico, USA | Española, New Mexico, USA |
| Collection date and time | 13/06/07, 14 h (UTC-6) | 27/11/07, 14 h (UTC-7) |
Library information
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| Sample Label(s) | SAE3-0 | SAE3-5 |
| Sample prep method | G-nome® DNA isolation kit, | G-nome® DNA isolation kit, |
| Library prep method | Illumina Paired-End DNA Sample Prep Kit | Illumina Paired-End DNA Sample Prep Kit |
| Sequencing platform | Illumina IIx | Illumina IIx |
| Sequencing chemistry | TruSeq SBS v3 | TruSeq SBS v3 |
| Sequence size (Gbp) | 1.4 | 1.4 |
| Number of reads | 15,503,268 | 15,877,664 |
| Single-read or paired-end sequencing? | Paired-end | Paired-end |
| Sequencing library insert size | 400 | 400 |
| Average read length | 90 | 90 |
| Standard deviation for read length | 1 | 1 |
Sequence processing
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| Tool(s) used for quality control | MG-RAST (default) | MG-RAST (default) |
| Number of sequences removed by quality control procedures | 1,870,805 | 1,240,627 |
| Number of sequences that passed quality control procedures | 13,632,463 | 14,637,037 |
| Number of artificial duplicate reads | 1,646,292 | 1,013,551 |
Fig. 2Color shift in microbial samples. The tube on the left is the sample collected one month prior to application of remediation protocols, the tube on the right is the sample collected five months after the addition of EVO, nutrients, and hydrogen gas to the aquifer
Metagenome statistics
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| Libraries used | SAE3-0 | SAE3-5 |
| Assembly tool(s) used | NA | NA |
| Number of contigs after assembly | NA | NA |
| Number of singletons after assembly | NA | NA |
| minimal contig length | NA | NA |
| Total bases assembled | NA | NA |
| Contig n50 | NA | NA |
| % of Sequences assembled | NA | NA |
| Measure for % assembled | NA | NA |
Annotation parameters
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| Annotation system | MG-RAST | MG-RAST |
| Gene calling program | FraGeneScan | FraGeneScan |
| Annotation algorithm | MG-RAST | MG-RAST |
| Database(s) used | MNR5 | MNR5 |
Metagenome properties
| Label | SAE3-0 | SAE3-5 |
|---|---|---|
| Number of contigs | NA | NA |
| Gbp | 1,226,921,670 | 1,317,333,330 |
| Number of features identified | 6,589,534 | 6,873,193 |
| CDS | 4,786,768 | 5,056,492 |
| rRNA | 1,802,766 | 1,816,701 |
| others | 0 | 0 |
| CDSs with COG | 1,468,398 | 1,648,018 |
| CDSs with Pfam | ||
| CDS with SEED subsystem | ||
| Alpha diversity | 217.523 | 196.901 |
Fig. 3Percent G + C shift at NRAP. A major peak at 60% is observed prior to remediation, but a bimodal distribution is observed with peaks at 45% and 60% G + C after bioamendment application
Taxonomic composition
| Phylum | SAE3-0 | SAE3-5 | Log2 Fold-Change | Total Raw Counts |
|---|---|---|---|---|
|
| 902,173.65 | 777,848.47 | −0.21 | 14,270,944 |
|
| 18,007.05 | 56,618.88 | 1.65 | 665,155 |
|
| 2,431.96 | 64,504.59 | 4.73 | 616,044 |
|
| 14,088.27 | 33,537.14 | 1.25 | 420,818 |
|
| 17,115.85 | 10,802.99 | −0.66 | 234,150 |
|
| 11,765.89 | 11,318.40 | −0.06 | 196,982 |
|
| 4,290.76 | 6,684.93 | 0.64 | 95,503 |
|
| 6,289.51 | 2,718.90 | −1.21 | 74,466 |
|
| 4,178.12 | 4,307.47 | 0.04 | 72,617 |
|
| 3,266.26 | 2,962.45 | −0.14 | 53,021 |
|
| 2,671.54 | 2,687.13 | 0.01 | 45,811 |
| unclassified (derived from | 2,923.61 | 1,836.12 | −0.67 | 39,911 |
|
| 1,307.70 | 3,105.62 | 1.25 | 38,993 |
|
| 952.30 | 2,758.34 | 1.53 | 32,993 |
|
| 1,709.80 | 1,658.48 | −0.04 | 28,752 |
|
| 8,47.05 | 2,259.90 | 1.42 | 27,555 |
|
| 1,631.34 | 1,515.31 | −0.11 | 26,812 |
|
| 497.52 | 2,447.58 | 2.30 | 26,552 |
|
| 430.03 | 2,233.53 | 2.38 | 24,042 |
|
| 482.21 | 2,029.32 | 2.07 | 22,561 |
|
| 301.95 | 1,136.92 | 1.91 | 12,889 |
|
| 257.31 | 1,129.68 | 2.13 | 12,472 |
|
| 514.61 | 618.05 | 0.26 | 9,754 |
|
| 371.35 | 654.25 | 0.82 | 8,966 |
|
| 146.83 | 828.32 | 2.50 | 8,817 |
|
| 471.62 | 239.12 | −0.98 | 5,910 |
|
| 198.75 | 394.06 | 0.99 | 5,205 |
|
| 132.67 | 356.46 | 1.43 | 4,339 |
| unclassified | 72.84 | 344.47 | 2.24 | 3,759 |
|
| 81.77 | 232.31 | 1.51 | 2,791 |
|
| 226.18 | 104.16 | −1.12 | 2,737 |
| Candidatus | 135.61 | 71.53 | −0.92 | 1,725 |
|
| 22.07 | 46.03 | 1.06 | 599 |
|
| 6.00 | 9.08 | 0.60 | 131 |
aCPM: Counts per million. Normalized values were calculated by dividing the abundance for each genera by the column total and multiplying by 106.
Fig. 5Rank abundance fold-change plot of genera detected at the NRAP site. Each of the 569 genera has at least 1000 reads and are ranked by their abundance on the y-axis. On the x-axis, those to the left of zero decrease in abundance after six months, those to the right increase. The number of genera in each category is indicated in parentheses. The labeled genera are discussed in the text. a Selected classes are indicated by marker shape. b The metabolic classification of genera is indicated by color. c Methanogenic and dehalogenic bacteria are highlighted by color. d Oxygen requirements for selected genera are indicated by color and demonstrates that the remediation protocol selects for anaerobic species. The labeled species are potential pathogens
Functional diversity
| COG Category | Metagenome SAE3-0 | Metagenome SAE3-5 |
|---|---|---|
| Cellular processes and signaling | 337,528 | 395,668 |
| Information storage and processing | 303,004 | 291,760 |
| Metabolism | 650,216 | 750,444 |
| Poorly characterized | 177,649 | 210,146 |
Fig. 4Zipf curve. Plotting the log reads per genus against the genus rank reveals an inflection point that separates genera with coverage within a linear from those with limited coverage. Genera with less than 1000 reads were excluded from subsequent analyses