Literature DB >> 35862918

Metagenomes and Metagenome-Assembled Genomes from Microbiomes Metabolizing Thin Stillage from an Ethanol Biorefinery.

Nathaniel W Fortney1,2, Kevin S Myers1,2, Abel T Ingle1,2,3, Kevin A Walters1,2,4, Matthew J Scarborough5, Timothy J Donohue1,2,4, Daniel R Noguera1,2,3.   

Abstract

Here, we report the metagenomes from five anaerobic bioreactors, operated under different conditions, that were fed carbohydrate-rich thin stillage from a corn starch ethanol plant. The putative functions of the abundant taxa identified here will inform future studies of microbial communities involved in valorizing this and other low-value agroindustrial residues.

Entities:  

Year:  2022        PMID: 35862918      PMCID: PMC9387277          DOI: 10.1128/mra.00290-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

We are investigating how to use anaerobic microbial communities for valorizing agroindustrial residues (1–5). We reported on fermentation products when thin stillage (TS) from starch bioethanol production was fed to a set of bioreactors (4). In that study, an anaerobic bioreactor (R1TS) was inoculated with acid-phase digester sludge from the Nine Springs Wastewater Treatment Plant (Madison, WI, USA) and provided TS as the feedstock. Four additional bioreactors (R2SR-TS, R3LowSRT, R4T-pH, and R5T-pH-LowSRT), derived from R1TS, were operated with different temperatures, pH values, and solids retention times (SRTs), resulting in diverging microbial communities and different fermentation products (4). Genomic DNA was extracted during bioreactor operation (R1TS, 6 samples; R2SR-TS, 9 samples; R3LowSRT, 6 samples; R4T-pH, 6 samples; R5T-pH-LowSRT, 2 samples) using a phenol-chloroform extraction method (2). DNA quantity and quality were determined using a Qubit 4 fluorometer (Thermo Fisher Scientific, USA) and NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific), respectively. DNA aliquots of 500 ng (25 samples) and 3,000 ng (4 samples) were submitted to the Joint Genome Institute (JGI) for paired-end 2 × 150-bp NovaSeq S4 (Illumina, USA) and Sequel II (Pacific Biosciences [PacBio], USA) sequencing, respectively. Illumina library preparation followed established protocols (6). PacBio sequencing library preparation included shearing of genomic DNA (g-TUBE; Covaris, LLC, USA) to 6 to 10 kb and ligation using the SMRTbell Express template preparation 2.0 kit following the manufacturer’s protocol (PacBio). The resulting Illumina libraries contained between 70 million and 141 million 150-bp reads, and the PacBio libraries contained between 38 thousand and 159 thousand reads 6 to 9 kb in length. Illumina reads were filtered and error corrected using BBMap (v38.86) (mincount=2, highcountfraction=0.6) (7), assembled with metaSPAdes (v3.14.1) (8), and mapped with BBMap (v38.86) (ambiguous=random) (7) following the JGI Metagenomic Workflow (6). PacBio reads were filtered using BBtools (v38.87/38.88) (7), and CCS reads were assembled using metaFlye (v2.8.1-b1676) (3), polished with subreads using GCpp (v1.0.0-SL-release-8.0.0) (https://github.com/PacificBiosciences/gcpp), mapped using minimap2 (v2.17-r941) (4), and then binned with MetaBAT (v2:2.15) (9). The resulting metagenome-assembled genomes (MAGs) were refined by removing contigs deemed to be contaminants by ProDeGe (v2.3) (10) and a custom algorithm that compares tetranucleotide frequency among contigs (run.GC.sh and Calculating_TF_Correlations.R [https://github.com/GLBRC/metagenome_analysis]). The MAGs obtained from individual samples were dereplicated using dRep (v3.2.2) (dereplicate command with –conW 0.5 and –N50W 5 flags for custom weighting) (11). MAG quality parameters were obtained using CheckM (v1.0.11) (12), and taxonomy was assigned using GTDB-tk (v1.5.1, database release 202) (13). MAG phylogeny was visualized using RAxML-NG (v0.9.0) (Fig. 1) (14). MAGs were annotated through the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (15).
FIG 1

Phylogeny of representative MAGs and their presence in the five different bioreactors (R1 to R5), as determined by the dRep analysis (Table 1). R1, reactor 1, TS (R1TS); R2, reactor 2, solids-removed TS (R2SR-TS); R3, reactor 3, low SRT (R3LowSRT); R4, reactor 4, high temperature and low pH (R4T-pH); R5, reactor 5, low SRT, high temperature, and low pH (R5T-pH-LowSRT). The bioreactor operating conditions are described elsewhere (4). ACET, Acetobacter; ACID, Acidaminococcus; ANEURI, Aneurinibacillus; ATO, Atopobiaceae; BAC, Bacillus; BUL, Bulleidia; CLOS, Clostridium; COMLAC, Companilactobacillus; EGG, Eggerthellaceae; EUB, Eubacteriaceae; FURLAC, Furfurilactobacillus; GAST, Gastranaerophilaceae; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LACPLAN, Lactiplantibacillus; LCO, Lachnospiraceae; LEVLAC, Levilactobacillus; LIMLAC, Limosilactobacillus; NIT, Nitrospira; PAENI, Paenibacillus; PED, Pediococcus; PREV, Prevotella; RUM, Ruminococcus; SACCH, Saccharofermentans; SAP, Saprospiraceae; SPHING, Sphingobium; TETRA, Tetrasphaera. Higher taxonomic levels are labeled, from left to right, phylum (P), class (C), and family (F). Fir., Firmicutes_C; Bacteroid., Bacteroidota; Nit., Nitrospirota; Cy., Cyanobacteria; Ery., Erysipelotrichaceae; Bac._C, Bacillaceae_C; Bacter., Bacteroidaceae; Atopo., Atopobiaceae. The phylogenetic tree was generated in RAxML-ng with 500 bootstraps using the housekeeping gene concatenations generated by GTDB-tk, with the default selection of 120 single-copy bacterial housekeeping genes. Bootstrap values greater than 50 are shown. The scale bar indicates the number of nucleotide substitutions per sequence site.

Phylogeny of representative MAGs and their presence in the five different bioreactors (R1 to R5), as determined by the dRep analysis (Table 1). R1, reactor 1, TS (R1TS); R2, reactor 2, solids-removed TS (R2SR-TS); R3, reactor 3, low SRT (R3LowSRT); R4, reactor 4, high temperature and low pH (R4T-pH); R5, reactor 5, low SRT, high temperature, and low pH (R5T-pH-LowSRT). The bioreactor operating conditions are described elsewhere (4). ACET, Acetobacter; ACID, Acidaminococcus; ANEURI, Aneurinibacillus; ATO, Atopobiaceae; BAC, Bacillus; BUL, Bulleidia; CLOS, Clostridium; COMLAC, Companilactobacillus; EGG, Eggerthellaceae; EUB, Eubacteriaceae; FURLAC, Furfurilactobacillus; GAST, Gastranaerophilaceae; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LACPLAN, Lactiplantibacillus; LCO, Lachnospiraceae; LEVLAC, Levilactobacillus; LIMLAC, Limosilactobacillus; NIT, Nitrospira; PAENI, Paenibacillus; PED, Pediococcus; PREV, Prevotella; RUM, Ruminococcus; SACCH, Saccharofermentans; SAP, Saprospiraceae; SPHING, Sphingobium; TETRA, Tetrasphaera. Higher taxonomic levels are labeled, from left to right, phylum (P), class (C), and family (F). Fir., Firmicutes_C; Bacteroid., Bacteroidota; Nit., Nitrospirota; Cy., Cyanobacteria; Ery., Erysipelotrichaceae; Bac._C, Bacillaceae_C; Bacter., Bacteroidaceae; Atopo., Atopobiaceae. The phylogenetic tree was generated in RAxML-ng with 500 bootstraps using the housekeeping gene concatenations generated by GTDB-tk, with the default selection of 120 single-copy bacterial housekeeping genes. Bootstrap values greater than 50 are shown. The scale bar indicates the number of nucleotide substitutions per sequence site.
TABLE 1

Genome accession numbers and statistics

Strain nameaCodebReactor sourcec
Sample age (days)SRA accession no.No. of raw reads per sample (×1,000)GenBank accession no.dANImedRepfGTDBtk classificationReference genomegSequencing platformhCompleteness (%)Contamination (%)MAG size (Mbp)No. of scaffoldsN50 (Mbp)GC content (%)No. of tRNAsNo. of 5S rRNAsNo. of 16S rRNAsNo. of 23S rRNAs
R1R2R3R4R5
UW_TS_ACET1_1ACET1X100 SRX12729178 383 JAKVNI000000000 132.1938d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter fabarum GCF_011516925.1 Sequel II1000.252.89612.89658.359555
UW_TS_ACET1_2X72 SRX12687768 95,453 JALCKF000000000 0.999992123.6701NovaSeq S498.510.252.37250.13858.839000
UW_TS_ACET1_3X24 SRX12665963 80,367 JALCHP000000000 0.999833123.6696NovaSeq S498.510.252.271240.13859.140000
UW_TS_ACET1_4X136 SRX12664702 125,596 JALCGX000000000 0.999986123.5262NovaSeq S4990.252.392260.10258.841000
UW_TS_ACET1_5X13 SRX12687755 74,621 JALCJH000000000 0.999995123.1801NovaSeq S498.010.252.211240.13859.138000
UW_TS_ACET1_6X24 SRX12686847 69,380 JALCJQ000000000 0.999991123.1801NovaSeq S498.010.252.211240.13859.137000
UW_TS_ACET1_7X66 SRX12686873 104,400 JALCIB000000000 0.999984123.1801NovaSeq S498.010.252.212240.13859.137000
UW_TS_ACET1_8X79 SRX12686849 79,309 JALCII000000000 0.999983123.1801NovaSeq S498.010.252.212230.13859.137000
UW_TS_ACET1_9X162 SRX12687754 66,381 JALCKU000000000 0.999983123.1801NovaSeq S498.010.252.225240.13859.137000
UW_TS_ACET1_10X48 SRX12670963 74,191 JALCHU000000000 0.999983123.1801NovaSeq S498.010.252.212220.13859.137000
UW_TS_ACET1_11X64 SRX12658907 90,171 JALCFX000000000 0.999994123.1801NovaSeq S498.010.252.227230.13859.137000
UW_TS_ACET1_12X162 SRX12686848 92,161 JALCIX000000000 0.999971123.1801NovaSeq S498.010.252.213220.13859.138000
UW_TS_ACET1_13X36 SRX12729153 71,598 JAKVLI000000000 0.999989123.1800NovaSeq S498.010.252.225240.13859.138000
UW_TS_ACET1_14X66 SRX12729463 81,580 JAKVLD000000000 0.999974123.1800NovaSeq S498.010.252.226240.13859.138000
UW_TS_ACET1_15X202 SRX12667031 121,391 JALCHJ000000000 0.999985123.0262NovaSeq S498.510.252.304240.1025939000
UW_TS_ACET1_16X62 SRX12687723 84,579 JALCJZ000000000 0.999985122.1801NovaSeq S497.010.252.182230.13859.236000
UW_TS_ACET1_17X39 SRX12657440 113,675 JALCEB000000000 0.999964120.1678NovaSeq S495.520.752.27440.10259.141000
UW_TS_ACET1_18X120 SRX12687759 78,913 JALCKN000000000 0.999977116.7101NovaSeq S491.290.252.182230.13859.135000
UW_TS_ACET1_19X90 SRX12660021 140,410 JALCGK000000000 0.999986116.2101NovaSeq S491.040.252.31250.13858.738000
UW_TS_ACET1_20X63 SRX12729174 1,591 JAKVMC000000000 0.999976112.8698Sequel II82.090.252.26821.5158.238222
UW_TS_ACET1_21X52 SRX12729156 634 JAKVLO000000000 0.999996112.8689Sequel II82.090.252.25521.50958.238222
UW_TS_ACET1_22X166 SRX12729462 793 JAKVMT000000000 0.999093107.3584Sequel II82.590.252.086210.11558.730010
UW_TS_ACET1_23X75 SRX12658615 141,269 JALCEP000000000 0.999838100.6800NovaSeq S477.790.252.174670.04358.635000
UW_TS_ACET2_1ACET2X100 SRX12729178 383 JAKVNJ000000000 132.0261d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter sp012517935 GCA_012517935.1 Sequel II1001.243.36713.36754.460555
UW_TS_ACET2_2X166 SRX12729462 793 JAKVMU000000000 0.999967130.6313Sequel II1001.243.42331.77154.460555
UW_TS_ACET2_3X63 SRX12729174 1,591 JAKVMD000000000 0.999854126.9446Sequel II95.021.243.21313.21354.351444
UW_TS_ACET2_4X62 SRX12687723 84,579 JALCJY000000000 0.999998125.3394NovaSeq S41000.253.014300.12353.945000
UW_TS_ACET2_5X75 SRX12658615 141,269 JALCEO000000000 0.999995125.2687NovaSeq S41000.252.931290.11953.945000
UW_TS_ACET2_6X24 SRX12686847 69,380 JALCJP000000000 0.999860125.2687NovaSeq S41000.253.058300.11953.945000
UW_TS_ACET2_7X90 SRX12660021 140,410 JALCGJ000000000 0.999999125.2685NovaSeq S41000.252.917310.11953.946000
UW_TS_ACET2_8X114 SRX12686875 69,693 JALCIN000000000 0.999939125.1004NovaSeq S41000.253.051370.1153.945000
UW_TS_ACET2_9X136 SRX12664702 125,596 JALCGW000000000 0.999995124.7733NovaSeq S41001.242.906310.11953.845000
UW_TS_ACET2_10X39 SRX12657440 113,675 JALCEA000000000 0.999993124.6916NovaSeq S41001.242.952350.11553.845000
UW_TS_ACET2_11X120 SRX12687759 78,913 JALCKM000000000 0.999992124.5971NovaSeq S41001.243.078330.1153.946000
UW_TS_ACET2_12X28 SRX12658904 101,250 JALCFM000000000 0.999922124.5573NovaSeq S41001.243.06370.10853.846000
UW_TS_ACET2_13X48 SRX12670963 74,191 JALCHT000000000 0.999995114.4437NovaSeq S489.0502.664280.1195438000
UW_TS_ACET2_14X79 SRX12686849 79,309 JALCIH000000000 0.999948109.3823NovaSeq S484.330.252.55350.10853.841000
UW_TS_ACET3_1ACET3X63 SRX12729174 1,591 JAKVME000000000 131.3647d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter; s__NASequel II99.7503.70832.63657.653333
UW_TS_ACET3_2X39 SRX12657440 113,675 JALCDZ000000000 0.999930125.5032NovaSeq S49903.503270.24957.646000
UW_TS_ACET3_3X75 SRX12658615 141,269 JALCEN000000000 0.999949125.3846NovaSeq S49903.445270.23657.746000
UW_TS_ACET4_1ACET4X63 SRX12729174 1,591 JAKVMF000000000 130.1843d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter peroxydans GCF_006539345.1 Sequel II99.50.52.64222.16260.356444
UW_TS_ACET5_1ACET5X90 SRX12660021 140,410 JALCGI000000000 125.3562d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter peroxydans GCF_006539345.1 NovaSeq S410002.559290.11760.745000
UW_TS_ACET5_2X13 SRX12687755 74,621 JALCJG000000000 0.999990125.0534NovaSeq S410002.521340.10260.944000
UW_TS_ACET5_3X162 SRX12687754 66,381 JALCKT000000000 0.999986123.9439NovaSeq S499.502.52430.07760.845000
UW_TS_ACET5_4X66 SRX12729463 81,580 JAKVLE000000000 0.999982122.0555NovaSeq S497.7102.499460.07460.942101
UW_TS_ACET5_5X120 SRX12687759 78,913 JALCKL000000000 0.999971117.5738NovaSeq S494.350.52.272720.03961.145000
UW_TS_ACET5_6X64 SRX12658907 90,171 JALCFW000000000 0.999948117.2484NovaSeq S494.50.172.213830.03761.241000
UW_TS_ACET5_7X72 SRX12687768 95,453 JALCKE000000000 1.000000102.3029NovaSeq S476.1201.815150.17260.831000
UW_TS_ACET5_8X136 SRX12664702 125,596 JALCGV000000000 0.99997999.8305NovaSeq S477.5902.185890.02861.136101
UW_TS_ACET6_1ACET6X120 SRX12687759 78,913 JALCKK000000000 114.6918d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter indonesiensis GCF_000963945.1 NovaSeq S489.0502.337230.13455.140000
UW_TS_ACET6_2X162 SRX12686848 92,161 JALCIW000000000 0.999994114.6918NovaSeq S489.0502.346250.13455.140000
UW_TS_ACET6_3X136 SRX12664702 125,596 JALCGU000000000 0.999984111.8759NovaSeq S487.480.252.267500.0855.239000
UW_TS_ACID1_1ACID1X63 SRX12729174 15,91 JAKVMG000000000 129.8131d__Bacteria;p__Firmicutes_C;c__Negativicutes;o__Acidaminococcales;f__Acidaminococcaceae;g__Acidaminococcus;s__Acidaminococcus provencensis GCF_900291475.1 Sequel II99.381.22.91512.91552.856666
UW_TS_ANEURI1_1ANEURI1X162 SRX12687754 66,381 JALCKS000000000 122.1157d__Bacteria;p__Firmicutes;c__Bacilli;o__Aneurinibacillales;f__Aneurinibacillaceae;g__Aneurinibacillus;s__Aneurinibacillus aneurinilyticus GCF_000466385.1 NovaSeq S499.221.685.042940.08843.685036
UW_TS_ATO1_1ATO1X166 SRX12729462 793 JAKVMV000000000 131.2388d__Bacteria;p__Actinobacteriota;c__Coriobacteriia;o__Coriobacteriales;f__Atopobiaceae;g__CADBMC01;s__CADBMC01 sp902795635 GCA_902795635.1 Sequel II99.8501.88811.88867.445222
UW_TS_ATO1_2X202 SRX12667031 121,391 JALCHI000000000 0.999804129.0235NovaSeq S410001.86440.63567.545000
UW_TS_ATO1_3X136 SRX12664702 125,596 JALCGT000000000 0.999804128.9496NovaSeq S410001.86150.61467.545000
We report a total of 266 MAGs with >75% completion, grouped in 51 clusters that represent the diversity in the individual bioreactors (Table 1). This metagenomic data set adds to the expanding body of knowledge about microorganisms relevant to the valorization of agroindustrial residues by fermentation (2, 16–20). Genome accession numbers and statistics

Data availability.

Raw metagenomic sequence data and MAGs are available in NCBI GenBank under BioProject accession number PRJNA768492. All information on library construction and sequence can be found at https://gold.jgi.doe.gov/study?id=Gs0150020 using JGI GOLD Study identification number Gs0150020. All custom scripts are available at GitHub (https://github.com/GLBRC/metagenome_analysis).
  19 in total

1.  Metabolic Interactions of a Chain Elongation Microbiome.

Authors:  Wenhao Han; Pinjing He; Liming Shao; Fan Lü
Journal:  Appl Environ Microbiol       Date:  2018-10-30       Impact factor: 4.792

2.  A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life.

Authors:  Donovan H Parks; Maria Chuvochina; David W Waite; Christian Rinke; Adam Skarshewski; Pierre-Alain Chaumeil; Philip Hugenholtz
Journal:  Nat Biotechnol       Date:  2018-08-27       Impact factor: 54.908

3.  Microbial Ecological Mechanism for Long-Term Production of High Concentrations of n-Caproate via Lactate-Driven Chain Elongation.

Authors:  Xiaoyu Zhu; Xin Feng; Cheng Liang; Jiabao Li; Jia Jia; Leiyu Feng; Yong Tao; Yinguang Chen
Journal:  Appl Environ Microbiol       Date:  2021-05-11       Impact factor: 4.792

4.  DOE JGI Metagenome Workflow.

Authors:  Alicia Clum; Marcel Huntemann; Brian Bushnell; Brian Foster; Bryce Foster; Simon Roux; Patrick P Hajek; Neha Varghese; Supratim Mukherjee; T B K Reddy; Chris Daum; Yuko Yoshinaga; Ronan O'Malley; Rekha Seshadri; Nikos C Kyrpides; Emiley A Eloe-Fadrosh; I-Min A Chen; Alex Copeland; Natalia N Ivanova
Journal:  mSystems       Date:  2021-05-18       Impact factor: 6.496

5.  Mesophilic and thermophilic conditions select for unique but highly parallel microbial communities to perform carboxylate platform biomass conversion.

Authors:  Emily B Hollister; Andrea K Forrest; Heather H Wilkinson; Daniel J Ebbole; Susannah G Tringe; Stephanie A Malfatti; Mark T Holtzapple; Terry J Gentry
Journal:  PLoS One       Date:  2012-06-22       Impact factor: 3.240

6.  ProDeGe: a computational protocol for fully automated decontamination of genomes.

Authors:  Kristin Tennessen; Evan Andersen; Scott Clingenpeel; Christian Rinke; Derek S Lundberg; James Han; Jeff L Dangl; Natalia Ivanova; Tanja Woyke; Nikos Kyrpides; Amrita Pati
Journal:  ISME J       Date:  2015-06-09       Impact factor: 10.302

7.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

8.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

9.  Increasing the economic value of lignocellulosic stillage through medium-chain fatty acid production.

Authors:  Matthew J Scarborough; Griffin Lynch; Mitch Dickson; Mick McGee; Timothy J Donohue; Daniel R Noguera
Journal:  Biotechnol Biofuels       Date:  2018-07-19       Impact factor: 6.040

10.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.