Literature DB >> 35894622

Metagenome-Assembled Genomes from a Microbiome Grown in Dairy Manure Hydrolysate.

Abel T Ingle1,2,3, Nathaniel W Fortney1,2, Kevin S Myers1,2, Kevin A Walters1,2,4, Matthew J Scarborough5, Timothy J Donohue1,2,4, Daniel R Noguera1,2,3.   

Abstract

Anaerobic microbiomes can be used to recover the chemical energy in agroindustrial and municipal wastes as useful products. Here, we report a total of 109 draft metagenome-assembled genomes from a bioreactor-fed carbohydrate-rich dairy manure hydrolysate. Studying these genomes will aid us in deciphering the metabolic networks in anaerobic microbiomes.

Entities:  

Year:  2022        PMID: 35894622      PMCID: PMC9387265          DOI: 10.1128/mra.00292-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

We are investigating microbial fermentation to valorize agroindustrial residues (1–5). Previously, we reported on the fermentation product profile produced when feeding dairy manure (DM) hydrolysate to an anaerobic bioreactor (3), and here, we report on the microbial community. The bioreactor was inoculated with acid-phase anaerobic digester sludge from the Nine Springs Wastewater Treatment Plant (Madison, WI, USA) (2–5). DNA was extracted from the inoculum and at multiple time points during bioreactor operation using a phenol-chloroform extraction method (4) that excluded a bead-beating step so that the DNA fragment lengths were appropriate for long-read sequencing. The DNA quantity and quality were determined using a Qubit v4.0 fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) and a NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific), respectively. DNA aliquots of 500 ng (6 samples, all from the bioreactor) and 3,000 ng (4 samples, 2 from the bioreactor and 2 from the inoculum) were submitted to the Joint Genome Institute (JGI; https://jgi.doe.gov/; Berkeley, CA, USA) for paired-end 2 × 150-bp sequencing on the NovaSeq S4 platform (Illumina, Inc., San Diego, CA, USA) and long-read sequencing using the PacBio Sequel II platform (Pacific Biosciences, Inc., Menlo Park, CA, USA), respectively. The Illumina libraries were end repaired, A-tailed, and ligated with Illumina-compatible adapters using the KAPA HyperPrep kit (Roche, USA) as described (6). The PacBio sequencing library preparation included shearing genomic DNA (Blue Pippin size selection; Sage Science, USA) to 6 to 10 kb and performing ligation using the SMRTbell Express template prep kit v2.0 following the manufacturer’s protocol (Pacific Biosciences). All software was used with default parameters unless otherwise noted. The Illumina reads were filtered and error corrected using bbcms from BBMap v38.86 (mincount = 2, highcountfraction = 0.6) (7), assembled using metaSPAdes v3.14.1 (8), and mapped using BBMap v38.86 (ambiguous=random) (7) following the JGI metagenomic workflow (6). The PacBio circular consensus sequencing (CCS) reads were assembled using metaFlye v2.8.1-b1676 (–meta) (9), polished using GCpp v1.0.0-SL-release-8.0.0 (Pacific Biosciences), mapped using minimap2 v2.12-r941 (10), and binned using MetaBAT v2:2.15 (11). The resulting Illumina metagenomic libraries contained between 78 and 114 million reads with a targeted length of 150 bp, and the PacBio libraries contained between 132,000 and 537,000 CCS reads with a targeted length of 6 to 10 kb. The resulting metagenome-assembled genomes (MAGs) were annotated using the JGI annotation pipeline v5.0.23 (12) and the NCBI Prokaryotic Genome Annotation Pipeline v6.0 (13). The MAGs were then refined by removing contigs deemed to be contaminants using ProDeGe v2.3 (14) and custom scripts that compare tetranucleotide frequency among contigs (run.GC.sh and Calculating_TF_Correlations.R; https://github.com/GLBRC/metagenome_analysis). All refined MAGs were dereplicated into clusters (i.e., replicate genome sets) using dRep v3.2.2 (15). The MAG quality parameters were obtained using CheckM v1.0.11 (16), and taxonomy was assigned using GTDB-Tk v1.5.1 (database release 202) (17). The MAG phylogeny was visualized using RAxML-NG v0.9.0 (Fig. 1) (18) and Interactive Tree of Life v5 (19).
FIG 1

Phylogeny of dRep-identified representative MAGs. Strain code abbreviations are as follows: ACET, Acetobacter; ATO, Atopobiaceae; ACT, Actinomycetaceae; ACUT, Acutalibacteraceae; ANA, Anaerotignaceae; BAC, Bacillus; BACIL, Bacilli; BACTE, Bacteroidales; BIF, Bifidobacteriaceae; BURK, Burkholderiaceae; CLOS, Clostridium; DIAL, Dialister; ENTER, Enterococcaceae; FLAV, Flavobacteriales; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LCO, Lachnospiraceae; LENLAC, Lentilactobacillus; LEUC, Leuconostoc; LIMLAC, Limosilactobacillus; LIGLAC, Ligilactobacillus; LIQLAC, Liquorilactobacillus; MEG, Megasphaera; PREV, Prevotella; PSEUD, Pseudomonas; SACCH, Saccharofermentans; SCHLAC, Schleiferilactobacillus; SPHING, Sphingobium; SPOR, Sporolactobacillus; XAN, Xanthomonadaceae. The higher taxonomic levels are labeled P, phylum; C, class; and F, family. Phylum abbreviations: Bacter., Bacteroidota; Fir., Firmicutes_C. Class abbreviations: Gamma., Gammaproteobacteria; Alpha., Alphaproteobacteria; Bact., Bacteroidia; Actin., Actinobacteria; Cor., Coriobacteriia; Neg., Negativicutes. Family abbreviations: Xa., Xanthomonadaceae; Ac., Acetobacter; Bifidobac., Bifidobacteriaceae; At., Atopobiaceae; An., Anaerotignaceae; Clo., Clostridiaceae; Acutali., Acutalibacteraceae. The shaded strain code abbreviations indicate representative genomes from the inoculum. The phylogenetic tree was generated in RAxML-NG with 500 bootstraps using the concatenation of 120 bacterial single-copy housekeeping genes generated using GTDB-Tk. Bootstrap values greater than 50 are shown. The scale bar indicates the number of nucleotide substitutions per sequence site.

Phylogeny of dRep-identified representative MAGs. Strain code abbreviations are as follows: ACET, Acetobacter; ATO, Atopobiaceae; ACT, Actinomycetaceae; ACUT, Acutalibacteraceae; ANA, Anaerotignaceae; BAC, Bacillus; BACIL, Bacilli; BACTE, Bacteroidales; BIF, Bifidobacteriaceae; BURK, Burkholderiaceae; CLOS, Clostridium; DIAL, Dialister; ENTER, Enterococcaceae; FLAV, Flavobacteriales; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LCO, Lachnospiraceae; LENLAC, Lentilactobacillus; LEUC, Leuconostoc; LIMLAC, Limosilactobacillus; LIGLAC, Ligilactobacillus; LIQLAC, Liquorilactobacillus; MEG, Megasphaera; PREV, Prevotella; PSEUD, Pseudomonas; SACCH, Saccharofermentans; SCHLAC, Schleiferilactobacillus; SPHING, Sphingobium; SPOR, Sporolactobacillus; XAN, Xanthomonadaceae. The higher taxonomic levels are labeled P, phylum; C, class; and F, family. Phylum abbreviations: Bacter., Bacteroidota; Fir., Firmicutes_C. Class abbreviations: Gamma., Gammaproteobacteria; Alpha., Alphaproteobacteria; Bact., Bacteroidia; Actin., Actinobacteria; Cor., Coriobacteriia; Neg., Negativicutes. Family abbreviations: Xa., Xanthomonadaceae; Ac., Acetobacter; Bifidobac., Bifidobacteriaceae; At., Atopobiaceae; An., Anaerotignaceae; Clo., Clostridiaceae; Acutali., Acutalibacteraceae. The shaded strain code abbreviations indicate representative genomes from the inoculum. The phylogenetic tree was generated in RAxML-NG with 500 bootstraps using the concatenation of 120 bacterial single-copy housekeeping genes generated using GTDB-Tk. Bootstrap values greater than 50 are shown. The scale bar indicates the number of nucleotide substitutions per sequence site. We provide a resource containing a total of 109 MAGs with greater than 75% completion and less than 3% contamination, grouped into 48 clusters that represent the microbial community diversity (Table 1). The putative classifications of these clusters span the phyla Firmicutes, Actinobacteria, Bacteroidota, and Proteobacteria (Fig. 1).
TABLE 1

Statistics and accession numbers of refined metagenome-assembled genomes

CodeaStrain namebSample sourcecGenBank accession no.SRA accession no.dANImedRepfGTDB-Tk classificationgReference genome GenBank accession no.hSequencing platformCompleteness (%)Contamination (%)MAG size (Mbp)No. of scaffoldsN50 (Mbp)%GCNo. of tRNAsNo. of rRNAs:
5S16S23S
ACET1UW_DM_ACET1_1DMB JALCOP000000000.1 SRX12654472 124.3612d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter fabarum GCF_011516925.1 Illumina NovaSeq S495.610.252.324100.59358.9539000
UW_DM_ACET1_2DMB JALCRJ000000000.1 SRX12654480 0.999971123.3789Illumina NovaSeq S495.850.252.372100.33858.8639000
UW_DM_ACET1_3DMB JALCPS000000000.1 SRX12654476 0.999977123.3504Illumina NovaSeq S495.850.252.32890.33358.9239000
UW_DM_ACET1_4DMB JALCQH000000000.1 SRX12654477 0.999979123.3504Illumina NovaSeq S495.850.252.323100.33358.9539000
UW_DM_ACET1_5DMB JALCPA000000000.1 SRX12654474 0.999992123.2260Illumina NovaSeq S495.850.252.323110.31558.9539000
ACET2UW_DM_ACET2_1DMB JALCRI000000000.1 SRX12654480 121.2990d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Acetobacterales;f__Acetobacteraceae;g__Acetobacter;s__Acetobacter peroxydans GCF_006539345.1 Illumina NovaSeq S497.010.032.422430.07260.8143000
UW_DM_ACET2_2DMB JALCQV000000000.1 SRX12654475 0.999847119.2871Illumina NovaSeq S496.180.52.277770.03761.140000
ACT1UW_DM_ACT1_1DMB JALCPR000000000.1 SRX12654476 126.2383d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Actinomycetales;f__Actinomycetaceae;g__Ancrocorticia;s__NAIllumina NovaSeq S498.3202.285150.38258.2448000
UW_DM_ACT1_2DMB JALCQG000000000.1 SRX12654477 0.999918125.5519Illumina NovaSeq S499.1602.157210.18958.1148100
UW_DM_ACT1_3DMB JALCRH000000000.1 SRX12654480 0.999083121.0378Illumina NovaSeq S497.4801.936530.05158.2344000
UW_DM_ACT1_4DMB JALCQU000000000.1 SRX12654475 0.999271120.4459Illumina NovaSeq S496.6401.872480.05758.3140000
UW_DM_ACT1_5DMB JALCOZ000000000.1 SRX12654474 0.997915116.1328Illumina NovaSeq S492.4401.773540.05558.3737100
UW_DM_ACT1_6DMB JALCOO000000000.1 SRX12654472 0.998142114.4529Illumina NovaSeq S490.7601.717490.05558.4436000
ACUT1UW_DM_ACUT1_1DMB JALCRG000000000.1 SRX12654480 125.8525d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Oscillospirales;f__Acutalibacteraceae;g__UBA4871;s__UBA4871 sp902809935 GCF_902809935.1 Illumina NovaSeq S497.850.342.30160.42942.7647300
UW_DM_ACUT1_2DMB JALCQF000000000.1 SRX12654477 0.999982125.8261Illumina NovaSeq S497.850.342.32770.42442.7447300
UW_DM_ACUT1_3DMB JALCQT000000000.1 SRX12654475 1.000000125.8261Illumina NovaSeq S497.850.342.29260.42442.7647300
UW_DM_ACUT1_4DMB JALCPQ000000000.1 SRX12654476 0.999961125.8261Illumina NovaSeq S497.850.342.32870.42442.7447300
ACUT2UW_DM_ACUT2_1DMB JALCPP000000000.1 SRX12654476 123.9766d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Oscillospirales;f__Acutalibacteraceae;g__Caprobacter;s__NAIllumina NovaSeq S497.990.022.551180.25048.6546201
ACUT3UW_DM_ACUT3_1DMB JALCPO000000000.1 SRX12654476 116.5123d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Oscillospirales;f__Acutalibacteraceae;g__Caprobacter;s__NAIllumina NovaSeq S494.070.672.111810.03658.1134110
UW_DM_ACUT3_2DMB JALCRF000000000.1 SRX12654480 0.999724114.4897Illumina NovaSeq S492.110.672.200770.03558.0241100
UW_DM_ACUT3_3DMB JALCQS000000000.1 SRX12654475 0.999801101.0366Illumina NovaSeq S478.9101.933880.02758.0531100
ACUT4UW_DM_ACUT4_1DMB JALCQE000000000.1 SRX12654477 102.1662d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Oscillospirales;f__Acutalibacteraceae;g__Caprobacter;s__Caprobacter sp002407675 GCA_002407675.1 Illumina NovaSeq S480.540.671.727520.03952.0424100
ANA1UW_DM_ANA1_1DMB JALCQR000000000.1 SRX12654475 123.9165d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Lachnospirales;f__Anaerotignaceae;g__HGM11808;s__NAIllumina NovaSeq S497.990.892.610190.18736.6841121
UW_DM_ANA1_2DMB JALCRE000000000.1 SRX12654480 0.999999122.4693Illumina NovaSeq S496.640.892.541200.17936.533020
UW_DM_ANA1_3DMB JALCOY000000000.1 SRX12654474 0.999702102.0096Illumina NovaSeq S480.351.572.007920.02436.3219020
ANA2UW_DM_ANA2_1DMB JALCPN000000000.1 SRX12654476 98.4501d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Lachnospirales;f__Anaerotignaceae;g__HGM11808;s__NAIllumina NovaSeq S477.080.562.332940.03036.2126011
ATO1UW_DM_ATO1_1DMB JALCQQ000000000.1 SRX12654475 125.7981d__Bacteria;p__Actinobacteriota;c__Coriobacteriia;o__Coriobacteriales;f__Atopobiaceae;g__Olsenella;s__Olsenella sp900555995 GCF_009695875.1 Illumina NovaSeq S499.80.082.182190.2546550000
ATO2UW_DM_ATO2_1DMB JALCPM000000000.1 SRX12654476 121.6595d__Bacteria;p__Actinobacteriota;c__Coriobacteriia;o__Coriobacteriales;f__Atopobiaceae;g__Olegusella;s__Olegusella sp002407925 GCA_002407925.1 Illumina NovaSeq S494.8901.61090.35653.8746111
BACIL1UW_DM_BACIL1_1Inoc. JALCLG000000000.1 SRR19542533, SRR19542534127.2301d__Bacteria;p__Firmicutes;c__Bacilli;o__RFN20;f__CAG-288;g__UBA7642;s__NAPacBio Sequel II96.6302.08012.08049.9948222
BACTE1UW_DM_BACTE1_1Inoc. JALCKZ000000000.1 SRR19542531, SRR19542532127.3472d__Bacteria;p__Bacteroidota;c__Bacteroidia;o__Bacteroidales;f__UBA932;g__UBA1232;s__NAPacBio Sequel II95.840.482.22912.22940.6640222
UW_DM_BACTE1_2Inoc. JALCLF000000000.1 SRR19542533, SRR195425340.996672107.4680PacBio Sequel II77.250.481.69221.23140.6630222
BIF1UW_DM_BIF1_1DMB JALCQD000000000.1 SRX12654477 128.1029d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Actinomycetales;f__Bifidobacteriaceae;g__;s__NAIllumina NovaSeq S410002.16260.65957.3251200
BIF2UW_DM_BIF2_1DMB JALCQC000000000.1 SRX12654477 120.0066d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Actinomycetales;f__Bifidobacteriaceae;g__;s__NAIllumina NovaSeq S494.8602.059320.10757.0945000
UW_DM_BIF2_2DMB JALCPL000000000.1 SRX12654476 0.999627114.7139Illumina NovaSeq S491.360.831.886500.05657.2540101
BIF3UW_DM_BIF3_1DMB JALCON000000000.1 SRX12654472 112.6135d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Actinomycetales;f__Bifidobacteriaceae;g__Bifidobacterium;s__Bifidobacterium crudilactis GCF_000738005.1 Illumina NovaSeq S491.120.762.104740.03757.7843000
BIF4UW_DM_BIF4_1DMB JALCOX000000000.1 SRX12654474 111.8635d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Actinomycetales;f__Bifidobacteriaceae;g__;s__NAIllumina NovaSeq S489.5801.703710.02858.3341100
UW_DM_BIF4_2DMB JALCOM000000000.1 SRX12654472 0.99983799.6889Illumina NovaSeq S477.501.570700.02758.1535000
BURK1UW_DM_BURK1_1DMB JAKVPB000000000.1 SRX12687775 128.9295d__Bacteria;p__Proteobacteria;c__Gammaproteobacteria;o__Burkholderiales;f__Burkholderiaceae;g__Mesosutterella;s__Mesosutterella multiformis GCF_003402575.1 PacBio Sequel II97.501.92311.92357.2255666
UW_DM_BURK1_2DMB JAKVPH000000000.1 SRX12687771, SRX126877720.999958128.9240PacBio Sequel II97.501.91811.91857.2355666
CLOS1UW_DM_CLOS1_1DMB JALCQP000000000.1 SRX12654475 123.9345d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Clostridiales;f__Clostridiaceae;g__Clostridium_B;s__Clostridium_B tyrobutyricum GCF_000359585.1 Illumina NovaSeq S498.190.232.862380.14830.7959310
UW_DM_CLOS1_2DMB JALCRD000000000.1 SRX12654480 0.999992123.9345Illumina NovaSeq S498.190.232.913390.14830.7459310
UW_DM_CLOS1_3DMB JALCQB000000000.1 SRX12654477 0.999997123.9345Illumina NovaSeq S498.190.232.873380.14830.7959310
UW_DM_CLOS1_4DMB JALCPK000000000.1 SRX12654476 0.999996105.6595Illumina NovaSeq S479.802.520340.14830.7353310
CLOS2UW_DM_CLOS2_1DMB JALCPJ000000000.1 SRX12654476 123.1268d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Clostridiales;f__Clostridiaceae;g__Clostridium_B;s__Clostridium_B luticellarii GCF_002995845.1 Illumina NovaSeq S499.1403.126750.06235.361300
UW_DM_CLOS2_2DMB JALCQA000000000.1 SRX12654477 0.999892122.7914Illumina NovaSeq S498.4502.960620.07435.2455200
UW_DM_CLOS2_3DMB JALCQO000000000.1 SRX12654475 0.999107122.0420Illumina NovaSeq S498.4502.903760.05235.1963100
UW_DM_CLOS2_4DMB JALCRC000000000.1 SRX12654480 0.999090121.9077Illumina NovaSeq S498.450.693.048780.05735.1462100
DIAL1UW_DM_DIAL1_1DMB JAKVPC000000000.1 SRX12687775 124.9117d__Bacteria;p__Firmicutes_C;c__Negativicutes;o__Veillonellales;f__Dialisteraceae;g__Dialister;s__Dialister hominis GCF_007164725.1 PacBio Sequel II94.210.021.86721.38247.949333
UW_DM_DIAL1_2DMB JAKVPI000000000.1 SRX12687771, SRX126877720.998770124.1539PacBio Sequel II92.950.021.74211.74247.7245434
ENTER1UW_DM_ENTER1_1DMB JALCOL000000000.1 SRX12654472 122.2348d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Enterococcaceae;g__;s__NAIllumina NovaSeq S496.691.662.371260.16847.2259110
UW_DM_ENTER1_2DMB JALCOW000000000.1 SRX12654474 0.999971107.0788Illumina NovaSeq S481.220.551.874210.16847.945110
FLAV1UW_DM_FLAV1_1Inoc. JALCLE000000000.1 SRR19542533, SRR19542534124.5230d__Bacteria;p__Bacteroidota;c__Bacteroidia;o__Flavobacteriales;f__PHOS-HE28;g__PHOS-HE28;s__NAPacBio Sequel II96.730.183.67090.59556.9241011
LAC1UW_DM_LAC1_1DMB JAKVPJ000000000.1 SRX12687771, SRX12687772122.4816d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lactobacillus;s__NAPacBio Sequel II92.180.521.43021.29042.6956444
UW_DM_LAC1_2DMB JAKVPD000000000.1 SRX12687775 0.998540121.7920PacBio Sequel II91.390.521.35111.35142.7457444
UW_DM_LAC1_3DMB JALCOK000000000.1 SRX12654472 0.999299113.7841Illumina NovaSeq S489.560.261.182230.07442.9518000
UW_DM_LAC1_4DMB JALCOV000000000.1 SRX12654474 0.999136112.4220Illumina NovaSeq S488.270.261.190260.07242.8819000
UW_DM_LAC1_5DMB JALCPZ000000000.1 SRX12654477 0.998351107.5929Illumina NovaSeq S485.220.651.164440.03042.7519000
UW_DM_LAC1_6DMB JALCPI000000000.1 SRX12654476 0.998014105.8329Illumina NovaSeq S483.590.591.208450.03042.7620000
LAC2UW_DM_LAC2_1DMB JALCPY000000000.1 SRX12654477 109.0679d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lactobacillus;s__Lactobacillus delbrueckii GCF_001433875.1 Illumina NovaSeq S486.2801.509530.03650.7257351
UW_DM_LAC2_2DMB JALCPH000000000.1 SRX12654476 0.999965108.7292Illumina NovaSeq S485.6301.438470.04150.8854351
LAC3UW_DM_LAC3_1DMB JALCOU000000000.1 SRX12654474 98.8007d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lactobacillus;s__Lactobacillus amylovorus GCF_002706375.1 Illumina NovaSeq S477.520.081.247530.02838.1329000
LACCAS1UW_DM_LACCAS1_1DMB JALCPX000000000.1 SRX12654477 121.7694d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lacticaseibacillus;s__Lacticaseibacillus sp900540605 GCA_900540605.1 Illumina NovaSeq S497.080.522.326320.09753.444110
UW_DM_LACCAS1_2DMB JALCOJ000000000.1 SRX12654472 0.999925119.7601Illumina NovaSeq S495.540.72.305390.07353.4932111
UW_DM_LACCAS1_3DMB JALCPG000000000.1 SRX12654476 0.999962118.9627Illumina NovaSeq S494.240.522.248250.09953.4124210
UW_DM_LACCAS1_4DMB JALCRB000000000.1 SRX12654480 0.999897115.7615Illumina NovaSeq S493.510.522.160820.03253.5818100
UW_DM_LACCAS1_5DMB JALCQN000000000.1 SRX12654475 0.999896114.7656Illumina NovaSeq S492.210.522.092760.03653.7427310
UW_DM_LACCAS1_6DMB JALCOT000000000.1 SRX12654474 0.999964103.8451Illumina NovaSeq S481.940.521.920840.02753.8623100
LACCAS2UW_DM_LACCAS2_1DMB JALCOI000000000.1 SRX12654472 118.8767d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lacticaseibacillus;s__Lacticaseibacillus rhamnosus GCA_000615245.1 Illumina NovaSeq S495.131.632.831740.07146.5621120
LACCAS3UW_DM_LACCAS3_1DMB JALCQM000000000.1 SRX12654475 114.9943d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lacticaseibacillus;s__NAIllumina NovaSeq S491.971.312.771730.05452.7127300
UW_DM_LACCAS3_2DMB JALCRA000000000.1 SRX12654480 0.999829112.0990Illumina NovaSeq S489.181.312.661660.05252.9645300
LACCAS4UW_DM_LACCAS4_1DMB JALCPF000000000.1 SRX12654476 95.9860d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lacticaseibacillus;s__Lacticaseibacillus paracasei GCF_000829035.1 Illumina NovaSeq S475.1801.611830.02346.7744100
LCO1UW_DM_LCO1_1Inoc. JALCKY000000000.1 SRR19542531, SRR19542532118.3603d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Lachnospirales;f__Lachnospiraceae;g__;s__NAPacBio Sequel II91.141.323.027110.37651.9847222
UW_DM_LCO1_2Inoc. JALCLD000000000.1 SRR19542533, SRR195425340.994960106.9934PacBio Sequel II78.480.212.04740.52852.4533111
LENLAC1UW_DM_LENLAC1_1DMB JALCQL000000000.1 SRX12654475 121.1830d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Lentilactobacillus;s__Lentilactobacillus hilgardii GCF_000159315.1 Illumina NovaSeq S496.750.312.943540.08239.9945300
UW_DM_LENLAC1_2DMB JALCPE000000000.1 SRX12654476 0.999777113.1572Illumina NovaSeq S488.630.081.421270.07939.3433200
UW_DM_LENLAC1_3DMB JALCQZ000000000.1 SRX12654480 0.999879113.1317Illumina NovaSeq S488.5601.374250.08239.335100
UW_DM_LENLAC1_4DMB JALCPW000000000.1 SRX12654477 0.999860102.4262Illumina NovaSeq S476.7501.009150.13639.1512000
LEUC1UW_DM_LEUC1_1DMB JALCOH000000000.1 SRX12654472 120.7671d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Leuconostoc;s__Leuconostoc mesenteroides GCF_000014445.1 Illumina NovaSeq S491.40.531.52240.84037.6228100
UW_DM_LEUC1_2DMB JALCOS000000000.1 SRX12654474 0.999988120.7671Illumina NovaSeq S491.40.531.52550.84037.6228100
UW_DM_LEUC1_3DMB JAKVPK000000000.1 SRX12687771, SRX126877720.999972107.4873PacBio Sequel II80.291.061.632110.35037.6965222
LIGLAC1UW_DM_LIGLAC1_1DMB JALCPD000000000.1 SRX12654476 110.5664d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Ligilactobacillus;s__Ligilactobacillus acidipiscis GCF_001435755.1 Illumina NovaSeq S489.011.831.563660.03139.3227100
UW_DM_LIGLAC1_2DMB JALCPV000000000.1 SRX12654477 0.999639110.1389Illumina NovaSeq S488.481.311.526670.02939.1913100
LIMLAC1UW_DM_LIMLAC1_1DMB JAKVPL000000000.1 SRX12687772, SRX12687771130.6667d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Limosilactobacillus;s__NAPacBio Sequel II98.271.092.33512.33552.1971666
UW_DM_LIMLAC1_2DMB JAKVPE000000000.1 SRX12687775 0.999996130.6667PacBio Sequel II98.271.092.33512.33552.1971666
LIMLAC2UW_DM_LIMLAC2_1DMB JAKVPM000000000.1 SRX12687772, SRX12687771130.4626d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Limosilactobacillus;s__Limosilactobacillus oris GCF_001434465.1 PacBio Sequel II98.902.04412.04449.5862555
UW_DM_LIMLAC2_2DMB JAKVPF000000000.1 SRX12687775 1.000000130.4626PacBio Sequel II98.902.04412.04449.5862555
UW_DM_LIMLAC2_3DMB JALCQK000000000.1 SRX12654475 0.999998123.0035Illumina NovaSeq S497.7901.907220.11049.9228110
UW_DM_LIMLAC2_4DMB JALCQY000000000.1 SRX12654480 0.999984106.9135Illumina NovaSeq S481.701.286180.11049.1929100
LIMLAC3UW_DM_LIMLAC3_1DMB JALCQJ000000000.1 SRX12654475 117.6852d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Limosilactobacillus;s__Limosilactobacillus timonensis GCF_900240275.1 Illumina NovaSeq S494.0201.286320.05447.8219000
UW_DM_LIMLAC3_2DMB JALCPU000000000.1 SRX12654477 1.000000117.5831Illumina NovaSeq S494.0201.292350.05147.5918000
UW_DM_LIMLAC3_3DMB JALCPC000000000.1 SRX12654476 1.000000115.9531Illumina NovaSeq S492.3901.261330.05147.5818000
UW_DM_LIMLAC3_4DMB JALCQX000000000.1 SRX12654480 0.999991115.5152Illumina NovaSeq S491.8501.216300.05447.7116000
LIMLAC4UW_DM_LIMLAC4_1DMB JALCQI000000000.1 SRX12654475 114.5738d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Limosilactobacillus;s__NAIllumina NovaSeq S491.3701.531420.04441.2530301
UW_DM_LIMLAC4_2DMB JALCQW000000000.1 SRX12654480 0.999945112.9835Illumina NovaSeq S490.4201.522580.03240.9933302
LIQLACUW_DM_LIQLAC1_1DMB JALCPB000000000.1 SRX12654476 125.1084d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Liquorilactobacillus;s__Liquorilactobacillus nagelii GCF_001434225.1 Illumina NovaSeq S499.480.132.380220.13736.5724210
UW_DM_LIQLAC1_2DMB JALCPT000000000.1 SRX12654477 0.999945124.4405Illumina NovaSeq S499.480.132.518400.10136.5723110
MEG1UW_DM_MEG1_1DMB JAKVPG000000000.1 SRX12687775 129.5279d__Bacteria;p__Firmicutes_C;c__Negativicutes;o__Veillonellales;f__Megasphaeraceae;g__Megasphaera;s__Megasphaera sp000417505 GCF_000417505.1 PacBio Sequel II97.6302.38612.38653.6364777
UW_DM_MEG1_2DMB JAKVPN000000000.1 SRX12687772, SRX126877711.000000129.5279PacBio Sequel II97.6302.38612.38653.6364777
UW_DM_MEG1_3DMB JALCOG000000000.1 SRX12654472 0.999984122.3850Illumina NovaSeq S497.4102.141330.09854.2459200
UW_DM_MEG1_4DMB JALCOR000000000.1 SRX12654474 0.999956122.3849Illumina NovaSeq S497.4102.137330.09854.2859200
PREV1UW_DM_PREV1_1Inoc. JALCLC000000000.1 SRR19542533, SRR19542534114.1849d__Bacteria;p__Bacteroidota;c__Bacteroidia;o__Bacteroidales;f__Bacteroidaceae;g__Prevotella;s__NAPacBio Sequel II89.0702.465180.16637.8953444
PSEUD1UW_DM_PSEUD1_1Inoc. JALCKX000000000.1 SRR19542531, SRR19542532117.1056d__Bacteria;p__Proteobacteria;c__Gammaproteobacteria;o__Pseudomonadales;f__Pseudomonadaceae;g__Pseudomonas_E;s__Pseudomonas_E veronii GCF_001439695.1 PacBio Sequel II91.212.496.322210.42361.3563444
SACCH1UW_DM_SACCH1_1Inoc. JALCKW000000000.1 SRR19542531, SRR19542532111.6234d__Bacteria;p__Firmicutes_A;c__Clostridia;o__Saccharofermentanales;f__Saccharofermentanaceae;g__Saccharofermentans;s__NAPacBio Sequel II86.50.711.640130.14246.4836101
SCHLAC1UW_DM_SCHLAC1_1DMB JALCOF000000000.1 SRX12654472 121.0337d__Bacteria;p__Firmicutes;c__Bacilli;o__Lactobacillales;f__Lactobacillaceae;g__Schleiferilactobacillus;s__Schleiferilactobacillus perolens GCF_001435585.1 Illumina NovaSeq S497.771.052.820760.05749.254100
UW_DM_SCHLAC1_2DMB JALCOQ000000000.1 SRX12654474 0.999809120.1456Illumina NovaSeq S496.860.522.885790.05149.2142101
SPHING1UW_DM_SPHING1_1Inoc. JALCLB000000000.1 SRR19542533, SRR19542534114.1032d__Bacteria;p__Proteobacteria;c__Alphaproteobacteria;o__Sphingomonadales;f__Sphingomonadaceae;g__Sphingobium;s__NAPacBio Sequel II86.540.783.532100.61562.8746111
SPOR1UW_DM_SPOR1_1DMB JALCOE000000000.1 SRX12654472 110.2120d__Bacteria;p__Firmicutes;c__Bacilli;o__Bacillales_G;f__Sporolactobacillaceae;g__Sporolactobacillus;s__Sporolactobacillus sp900543345 GCA_900543345.1 Illumina NovaSeq S488.4802.806950.03549.4928210
XAN1UW_DM_XAN1_1Inoc. JALCLA000000000.1 SRR19542533, SRR19542533124.1187d__Bacteria;p__Proteobacteria;c__Gammaproteobacteria;o__Xanthomonadales;f__Xanthomonadaceae;g__Chiayiivirga;s__NAPacBio Sequel II94.972.713.67890.76266.8752222
XAN2UW_DM_XAN2_1Inoc. JALCKV000000000.1 SRR19542531, SRR19542532113.2098d__Bacteria;p__Proteobacteria;c__Gammaproteobacteria;o__Xanthomonadales;f__Xanthomonadaceae;g__Chiayiivirga;s__NAPacBio Sequel II82.960.863.74721.89166.9446222

Strain code abbreviations are as follows: ACET, Acetobacter; ACT, Actinomycetaceae; ACUT, Acutalibacteraceae; ANA, Anaerotignaceae; ATO, Atopobiaceae; BACIL, Bacilli; BACTE, Bacteroidales; BIF, Bifidobacteriaceae; BURK, Burkholderiaceae; CLOS, Clostridium; DIAL, Dialisteraceae; ENTER, Enterococcaceae; FLAV, Flavobacteriales; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LCO, Lachnospiraceae; LENLAC, Lentilactobacillus; LEUC, Leuconostoc; LIGLAC, Ligilactobacillus; LIMLAC, Limosilactobacillus; LIQLAC, Liquorilactobacillus; MEG, Megasphaera; PREV, Prevotella; PSEUD, Pseudomonadaceae; SACCH, Saccharofermentans; SCHLAC, Schleiferilactobacillus; SPHING, Sphingobium; SPOR, Sporolactobacillus; XAN, Xanthomonadaceae.

Strain name assigned to each reported MAG. The abbreviation UW_DM stands for University of Wisconsin dairy manure bioreactor. The strain name followed by “_1” denotes the representative MAG, as selected using dRep (15) for each cluster; other MAGs in the same cluster use the same strain name, followed by a number in increasing order, and are sorted by dRep score.

The sample source of each MAG is indicated either by DMB (dairy manure bioreactor) or Inoc. (inoculum).

The following are the number of filtered raw reads corresponding to each SRA accession number: SRR19542531, 111,403; SRR19542532, 152,018; SRR19542533, 134,737; SRR19542534, 259,819; SRX12654472, 112,981,324; SRX12654474, 76,248,910; SRX12654475, 99,005,980; SRX12654476, 111,069,392; SRX12654477, 105,196,124; SRX12654480, 95,275,964; SRX12687771, 110,527; SRX12687772, 119,805; SRX12687775, 191,571.

ANIm, average nucleotide identity between the representative MAG and other MAGs included in the same cluster calculated using dRep (15).

dRep scoring calculation: where A through F were weighted with the values 1, 0.5, 1, 5, 0, and 1, respectively.

Classifications made using the Genome Taxonomy Database Toolkit (GTDB-Tk) (17).

GenBank accession number of the reference genome in GTDB-Tk that is closest to the representative MAG, determined using RAxML-NG (18); NA (not applicable) indicates MAGs without a closely matched reference genome when using the default minimum alignment fraction of 0.65.

Statistics and accession numbers of refined metagenome-assembled genomes Strain code abbreviations are as follows: ACET, Acetobacter; ACT, Actinomycetaceae; ACUT, Acutalibacteraceae; ANA, Anaerotignaceae; ATO, Atopobiaceae; BACIL, Bacilli; BACTE, Bacteroidales; BIF, Bifidobacteriaceae; BURK, Burkholderiaceae; CLOS, Clostridium; DIAL, Dialisteraceae; ENTER, Enterococcaceae; FLAV, Flavobacteriales; LAC, Lactobacillus; LACCAS, Lacticaseibacillus; LCO, Lachnospiraceae; LENLAC, Lentilactobacillus; LEUC, Leuconostoc; LIGLAC, Ligilactobacillus; LIMLAC, Limosilactobacillus; LIQLAC, Liquorilactobacillus; MEG, Megasphaera; PREV, Prevotella; PSEUD, Pseudomonadaceae; SACCH, Saccharofermentans; SCHLAC, Schleiferilactobacillus; SPHING, Sphingobium; SPOR, Sporolactobacillus; XAN, Xanthomonadaceae. Strain name assigned to each reported MAG. The abbreviation UW_DM stands for University of Wisconsin dairy manure bioreactor. The strain name followed by “_1” denotes the representative MAG, as selected using dRep (15) for each cluster; other MAGs in the same cluster use the same strain name, followed by a number in increasing order, and are sorted by dRep score. The sample source of each MAG is indicated either by DMB (dairy manure bioreactor) or Inoc. (inoculum). The following are the number of filtered raw reads corresponding to each SRA accession number: SRR19542531, 111,403; SRR19542532, 152,018; SRR19542533, 134,737; SRR19542534, 259,819; SRX12654472, 112,981,324; SRX12654474, 76,248,910; SRX12654475, 99,005,980; SRX12654476, 111,069,392; SRX12654477, 105,196,124; SRX12654480, 95,275,964; SRX12687771, 110,527; SRX12687772, 119,805; SRX12687775, 191,571. ANIm, average nucleotide identity between the representative MAG and other MAGs included in the same cluster calculated using dRep (15). dRep scoring calculation: where A through F were weighted with the values 1, 0.5, 1, 5, 0, and 1, respectively. Classifications made using the Genome Taxonomy Database Toolkit (GTDB-Tk) (17). GenBank accession number of the reference genome in GTDB-Tk that is closest to the representative MAG, determined using RAxML-NG (18); NA (not applicable) indicates MAGs without a closely matched reference genome when using the default minimum alignment fraction of 0.65.

Data availability.

The raw metagenomic sequence data and the refined MAGs are available at NCBI GenBank under BioProject accession number PRJNA768492. All information on the library construction and sequencing can be found at https://gold.jgi.doe.gov/study?id=Gs0150020 using the JGI GOLD study ID Gs0150020. All custom scripts are available on GitHub (https://github.com/GLBRC/metagenome_analysis).
  18 in total

1.  DOE JGI Metagenome Workflow.

Authors:  Alicia Clum; Marcel Huntemann; Brian Bushnell; Brian Foster; Bryce Foster; Simon Roux; Patrick P Hajek; Neha Varghese; Supratim Mukherjee; T B K Reddy; Chris Daum; Yuko Yoshinaga; Ronan O'Malley; Rekha Seshadri; Nikos C Kyrpides; Emiley A Eloe-Fadrosh; I-Min A Chen; Alex Copeland; Natalia N Ivanova
Journal:  mSystems       Date:  2021-05-18       Impact factor: 6.496

2.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

3.  The standard operating procedure of the DOE-JGI Metagenome Annotation Pipeline (MAP v.4).

Authors:  Marcel Huntemann; Natalia N Ivanova; Konstantinos Mavromatis; H James Tripp; David Paez-Espino; Kristin Tennessen; Krishnaveni Palaniappan; Ernest Szeto; Manoj Pillay; I-Min A Chen; Amrita Pati; Torben Nielsen; Victor M Markowitz; Nikos C Kyrpides
Journal:  Stand Genomic Sci       Date:  2016-02-24

4.  ProDeGe: a computational protocol for fully automated decontamination of genomes.

Authors:  Kristin Tennessen; Evan Andersen; Scott Clingenpeel; Christian Rinke; Derek S Lundberg; James Han; Jeff L Dangl; Natalia Ivanova; Tanja Woyke; Nikos Kyrpides; Amrita Pati
Journal:  ISME J       Date:  2015-06-09       Impact factor: 10.302

5.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

6.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

7.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

8.  Increasing the economic value of lignocellulosic stillage through medium-chain fatty acid production.

Authors:  Matthew J Scarborough; Griffin Lynch; Mitch Dickson; Mick McGee; Timothy J Donohue; Daniel R Noguera
Journal:  Biotechnol Biofuels       Date:  2018-07-19       Impact factor: 6.040

9.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

10.  Diverse Profile of Fermentation Byproducts From Thin Stillage.

Authors:  Nathaniel W Fortney; Nathaniel J Hanson; Paula R F Rosa; Timothy J Donohue; Daniel R Noguera
Journal:  Front Bioeng Biotechnol       Date:  2021-07-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.