Literature DB >> 35766865

Metagenomes, Metagenome-Assembled Genomes, and Metatranscriptomes from Polychlorinated Biphenyl-Contaminated Sediment Microcosms.

Jessica M Ewald1, Jerald L Schnoor1, Timothy E Mattes1.   

Abstract

We present a comprehensive data set that describes an anaerobic microbial consortium native to polychlorinated biphenyl (PCB)-contaminated sediments. Obtained from sediment microcosms incubated for 200 days, the data set includes 4 metagenomes, 4 metatranscriptomes (in duplicate), and 62 metagenome-assembled genomes and captures microbial community interactions, structure, and function relevant to anaerobic PCB biodegradation.

Entities:  

Year:  2022        PMID: 35766865      PMCID: PMC9302077          DOI: 10.1128/mra.01126-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Polychlorinated biphenyl (PCB)-contaminated sediments threaten human and ecological health but often harbor PCB-transforming bacteria that help detoxify sediments (1–3). We established anaerobic microcosms, as described previously (4, 5), to investigate anaerobic microbiomes native to PCB-contaminated lagoon sediments. Replicate microcosms contained sediments, collected in 2017, from two different locations (four bottles total) with variable PCB concentrations (28.04 ± 2.89 μg/mL [high-PCB microcosms [HPCBM], F4_1 and F4_2] versus 4.28 ± 1.05 μg/mL [low-PCB microcosms [LPCBM], E2_1 and E2_2]; P < 0.0001). After 200 days of incubation, DNA was extracted from single slurry samples (2 mL) with a modified DNeasy PowerWater Sterivex kit protocol (6), and RNA was extracted from duplicate slurry samples (5 mL) with the RNeasy PowerSoil total RNA kit (Thermo Fisher Scientific, Waltham, MA). Contaminating DNA was removed from RNA with the Direct-zol RNA MiniPrep Plus kit (Zymo Research Corp., Irvine, CA) and the TURBO DNA-free kit (Thermo Fisher Scientific). RNA quality was confirmed using a 2100 Bioanalyzer RNA Pico assay (Agilent Technologies, Santa Clara, CA). High-throughput DNA and RNA sequencing (4 metagenomes and 4 metatranscriptomes in duplicate) was performed at the Iowa Institute of Human Genetics (IIHG) (Iowa City, IA, USA). Indexed DNA libraries, prepared with the KAPA HyperPrep kit (Roche Sequencing and Life Science, Indianapolis, IN) using sheared DNA (average size, 550 bp), were pooled and sequenced on separate lanes of an S Prime NovaSeq 6000 flow cell (2 × 150-bp paired-end reads). RNA libraries were indexed and rRNA depleted with the stranded total RNA preparation with Ribo-Zero Plus kit (Illumina, Inc., San Diego, CA). To improve mRNA sequencing efficiency, supplemental rRNA probes developed from Methanosarcina barkeri (GenBank accession number NZ_CP009530.1) and Methanobacterium subterraneum (GenBank accession number NZ_CP017768.1) were added to further deplete methanogenic archaeal rRNA sequences. RNA was sequenced on a single Illumina NovaSeq 6000 flow cell lane (2 × 150-bp paired-end reads). Metagenome sequencing yielded 97,279,994 (E2_1), 105,614,978 (E2_2), 102,655,932 (F4_1), and 103,672, 591 (F4_2) raw reads from each bottle. Metatranscriptome sequencing (RNA-seq) yielded 50,403,972 (E2_1 metatranscriptome-A), 60,356,358 (E2_1-B), 49,353,924 (E2_2-A), 45,023,913 (E2_2-B), 54,860,340 (F4_1-A), 47,603,523 (F4_1-B), 55,440,491 (F4_2-B), and 71,530,818 (F4_2-C) raw reads. After trimming and filtering of unassembled sequence reads with Trimmomatic (v0.39) (7), the 10 most abundant phyla in the metagenomes and the 13 most active phlya in the metatranscriptomes were determined with Kraken2 (v2.0.8) (Fig. 1).
FIG 1

Relative abundance of reads classified at the phylum level in the HPCBM and LPCBM metagenome data sets (top 10 phyla) (A) and HPCBM and LPCBM metatranscriptome data sets (top 13 phyla) (B).

Relative abundance of reads classified at the phylum level in the HPCBM and LPCBM metagenome data sets (top 10 phyla) (A) and HPCBM and LPCBM metatranscriptome data sets (top 13 phyla) (B). Metagenome-assembled genomes (MAGs) were obtained by removing low-abundance k-mers from quality-filtered reads with khmer (v3.0.0) (8), coassembling paired reads into 4,213,733 contigs with Megahit (v1.2.9), and sorting contigs of >1,000 bp into bins with MetaBAT (v2.15) (9, 10). Assembly statistics were quantified with MetaQUAST (v5.0.2) (11). Bin completeness, contamination, and strain heterogeneity statistics were generated by CheckM (v1.1.3), and bin taxonomy was refined with Kraken2 (12–14) (Table 1). Among the MAGs, bin 47 (Dehalococcoides mccartyi) (Table 1), which was previously implicated in PCB dechlorination (4), is expected to harbor PCB dehalogenase genes.
TABLE 1

Assembly statistics and strain heterogeneity parameters for 62 high-quality MAGs

GenBank assembly accession no.Bin no.TaxonomyaCompleteness (%)Contamination (%)Strain heterogeneity (%)No. of contigsN50 (bp)
ASM2137325v1 10G: Pseudomonas95.374.014020134,137
ASM2137313v1 18G: Bacteroides94.620010455,779
ASM2137294v1 19F: Planctomycetaceae95.451.1410014641,542
ASM2137286v1 24G: Proteiniphilum91.440.2710021210,653
ASM2173286v1 47G: Dehalococcoides91.470.991009202,567
ASM2137229v1 50G: Bradyrhizobium93.061.8505675,318
ASM2137227v1 51F: Planctomycetaceae94.323.3390.9114157,735
ASM2137223v1 56C: Spirochaetia91.112.255023713,621
ASM2137189v1 69O: Clostridiales93.151.965034711,116
ASM2137317v1 129F: Microbacteriaceae92.921.74014430,602
ASM2137318v1 132G: Sulfuricella97.034.5338.465593,072
ASM2137315v1 179G: Bacillus94.64008534,100
ASM2137311v1 182G: Christensenella96.431.61503990,408
ASM2137301v1 188G: Nitrosospira95.451.82022130,422
ASM2137302v1 193G: Pseudomonas94.64006439,167
ASM2137293v1 213G: Proteiniphilum990.27035233,944
ASM2137291v1 226G: Cloacibacilus90.681.691009829,445
ASM2137289v1 244G: Cloacibacilus100009737,569
ASM2137285v1 248G: Streptomyces91.671.85070059,520
ASM2137282v1 250G: Streptomyces97.923.614010529,143
ASM2137281v1 256G: Syntrophus904.872532916,903
ASM2137278v1 257C: Negativicutes95.762.2607444,539
ASM2137277v1 270G: Streptomyces93.494.872549411,769
ASM2137272v1 298G: Pelolinea95.254.73020037,672
ASM2137275v1 300G: Bacteroides95.831.282044915,112
ASM2137271v1 336G: Methanobacterium96.291.610022114,601
ASM2137269v1 341G: Geobacter97.423.87024041,457
ASM2137267v1 350G: Christensenella97.582.02012329,890
ASM2137263v1 361G: Syntrophobacter91.224.5361.5445414,033
ASM2137261v1 367G: Streptomyces93.984.17017168,371
ASM2137264v1 369G: Clostridium96.771.69010562,815
ASM2137258v1 371G: Bacillus95.540010436,363
ASM2137257v1 378O: Rhizobiales96.823.2333.3321434,208
ASM2137249v1 379G: Lysobacter97.372.28047126,243
ASM2137251v1 397G: Pseudomonas98.92.25022237,013
ASM2137253v1 417F: Enterobacteriaceae97.740.55023639,130
ASM2137255v1 426F: Planctomycetaceae96.452.377536119,317
ASM2137247v1 428P: Chloroflexi98.171.385025020,510
ASM2137243v1 430G: Streptomyces91.814.7412.519723,332
ASM2137242v1 451G: Treponema91.950.571003608,129
ASM2137241v1 454G: Desulfovibrio93.873.1214.2947714,799
ASM2137237v1 460G: Pelosinus94.251.67019930,621
ASM2137239v1 476G: Desulfovibrio93.552.355030614,167
ASM2137235v1 485G: Rhodococcus97.832.1636.3628142,192
ASM2137233v1 495G: Methanomassiliicoccus96.770.8108225,889
ASM2137231v1 505G: Methanomassiliicoccus97.581.6110012726,633
ASM2137224v1 513G: “Candidatus Solibacter”1002.242536835,769
ASM2137216v1 565F: Flavobacteriaceae92.794.15030310,675
ASM2137218v1 577F: Parachlamydiaceae90.033.04203809,230
ASM2137215v1 582G: “Candidatus Protochlamydia”91.224.7314.2940214,309
ASM2137199v1 599G: Methanoregula99.02004369,042
ASM2137205v1 609G: Syntrophobacter98.484.537.6931820,595
ASM2137200v1 610F: Planctomycetaceae98.783.457510648,160
ASM2137195v1 619G: Bacteroides96.430.95016219,478
ASM2137196v1 625G: Planctomyces91.12.27020150,230
ASM2137193v1 658O: Enterobacterales95.373.09022313,181
ASM2137191v1 662F: Nitrospiraceae94.554.1937.515217,452
ASM2137187v1 687G: Bacteroides90.590.27023318,036
ASM2137183v1 707G: Methanoculleus91.18008850,994
ASM2137179v1 716G: Sedimentisphaera98.861.7028181,921
ASM2137185v1 736G: Flavobacterium1002.62073198,373
ASM2137180v1 739G: Clostridium91.914.55202509,902

Taxonomy was determined by the CheckM lineage workflow and further refined with Kraken2. The letters indicate the phylogenetic rank of taxonomic classification (P, phylum; C, class; F, family; O, order; G, genus).

Assembly statistics and strain heterogeneity parameters for 62 high-quality MAGs Taxonomy was determined by the CheckM lineage workflow and further refined with Kraken2. The letters indicate the phylogenetic rank of taxonomic classification (P, phylum; C, class; F, family; O, order; G, genus).

Data availability.

Raw data files (24 fastq files) and MAGs are available under BioProject accession number PRJNA743546. Metagenomic data are available under SRA accession numbers SRX11347095 to SRX11347098. Metatranscriptomic data are available under accession numbers SRX14430540 to SRX14430547. PCB data are available at Iowa Research Online (https://www.doi.org/10.25820/data.006156).
  13 in total

1.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.

Authors:  Dinghua Li; Chi-Man Liu; Ruibang Luo; Kunihiko Sadakane; Tak-Wah Lam
Journal:  Bioinformatics       Date:  2015-01-20       Impact factor: 6.937

2.  Growth of Dehalococcoides spp. and increased abundance of reductive dehalogenase genes in anaerobic PCB-contaminated sediment microcosms.

Authors:  Jessica M Ewald; Shelby V Humes; Andres Martinez; Jerald L Schnoor; Timothy E Mattes
Journal:  Environ Sci Pollut Res Int       Date:  2019-06-17       Impact factor: 4.223

3.  PCB dechlorination enhancement in Anacostia River sediment microcosms.

Authors:  Valdis Krumins; Joong-Wook Park; Eun-Kyeu Son; Lisa A Rodenburg; Lee J Kerkhof; Max M Häggblom; Donna E Fennell
Journal:  Water Res       Date:  2009-08-06       Impact factor: 11.236

4.  Detection of an alkene monooxygenase in vinyl chloride-oxidizing bacteria with GeneFISH.

Authors:  Patrick M Richards; Timothy E Mattes
Journal:  J Microbiol Methods       Date:  2021-01-22       Impact factor: 2.622

5.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

6.  Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea.

Authors:  Robert M Bowers; Nikos C Kyrpides; Ramunas Stepanauskas; Miranda Harmon-Smith; Devin Doud; T B K Reddy; Frederik Schulz; Jessica Jarett; Adam R Rivers; Emiley A Eloe-Fadrosh; Susannah G Tringe; Natalia N Ivanova; Alex Copeland; Alicia Clum; Eric D Becraft; Rex R Malmstrom; Bruce Birren; Mircea Podar; Peer Bork; George M Weinstock; George M Garrity; Jeremy A Dodsworth; Shibu Yooseph; Granger Sutton; Frank O Glöckner; Jack A Gilbert; William C Nelson; Steven J Hallam; Sean P Jungbluth; Thijs J G Ettema; Scott Tighe; Konstantinos T Konstantinidis; Wen-Tso Liu; Brett J Baker; Thomas Rattei; Jonathan A Eisen; Brian Hedlund; Katherine D McMahon; Noah Fierer; Rob Knight; Rob Finn; Guy Cochrane; Ilene Karsch-Mizrachi; Gene W Tyson; Christian Rinke; Alla Lapidus; Folker Meyer; Pelin Yilmaz; Donovan H Parks; A M Eren; Lynn Schriml; Jillian F Banfield; Philip Hugenholtz; Tanja Woyke
Journal:  Nat Biotechnol       Date:  2017-08-08       Impact factor: 54.908

7.  Improved metagenomic analysis with Kraken 2.

Authors:  Derrick E Wood; Jennifer Lu; Ben Langmead
Journal:  Genome Biol       Date:  2019-11-28       Impact factor: 17.906

8.  The khmer software package: enabling efficient nucleotide sequence analysis.

Authors:  Michael R Crusoe; Hussien F Alameldin; Sherine Awad; Elmar Boucher; Adam Caldwell; Reed Cartwright; Amanda Charbonneau; Bede Constantinides; Greg Edvenson; Scott Fay; Jacob Fenton; Thomas Fenzl; Jordan Fish; Leonor Garcia-Gutierrez; Phillip Garland; Jonathan Gluck; Iván González; Sarah Guermond; Jiarong Guo; Aditi Gupta; Joshua R Herr; Adina Howe; Alex Hyer; Andreas Härpfer; Luiz Irber; Rhys Kidd; David Lin; Justin Lippi; Tamer Mansour; Pamela McA'Nulty; Eric McDonald; Jessica Mizzi; Kevin D Murray; Joshua R Nahum; Kaben Nanlohy; Alexander Johan Nederbragt; Humberto Ortiz-Zuazaga; Jeramia Ory; Jason Pell; Charles Pepe-Ranney; Zachary N Russ; Erich Schwarz; Camille Scott; Josiah Seaman; Scott Sievert; Jared Simpson; Connor T Skennerton; James Spencer; Ramakrishnan Srinivasan; Daniel Standage; James A Stapleton; Susan R Steinman; Joe Stein; Benjamin Taylor; Will Trimble; Heather L Wiencko; Michael Wright; Brian Wyss; Qingpeng Zhang; En Zyme; C Titus Brown
Journal:  F1000Res       Date:  2015-09-25

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

View more
  1 in total

1.  Combined read- and assembly-based metagenomics to reconstruct a Dehalococcoides mccartyi genome from PCB-contaminated sediments and evaluate functional differences among organohalide-respiring consortia in the presence of different halogenated contaminants.

Authors:  Jessica M Ewald; Jerald L Schnoor; Timothy E Mattes
Journal:  FEMS Microbiol Ecol       Date:  2022-07-13       Impact factor: 4.519

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.