Literature DB >> 35343806

Metagenome-Assembled Genomes from a Microbiome Converting Xylose to Medium-Chain Carboxylic Acids.

Matthew J Scarborough1, Kevin S Myers2,3, Nathaniel W Fortney2,3, Abel T Ingle2,3,4, Timothy J Donohue2,3,5, Daniel R Noguera2,3,4.   

Abstract

There is growing interest in producing beneficial products from wastes using microbiomes. We previously performed multiomic analyses of a bioreactor microbiome that converted carbohydrate-rich lignocellulosic residues to medium-chain carboxylic acids. Here, we present draft metagenome-assembled genomes from this microbiome, obtained from reactors in which xylose was the primary carbon source.

Entities:  

Year:  2022        PMID: 35343806      PMCID: PMC9022542          DOI: 10.1128/mra.01151-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Using sludge from a well-characterized bioreactor producing hexanoic and octanoic acids from lignocellulosic residues (1–3), we sought to enrich for different community members by operating a new bioreactor with a synthetic medium that contained xylose as the primary carbon source at a concentration of 18,700 mg L−1. Biomass samples were collected after 96 (sample 16X), 102 (sample 17X), and 108 (sample 18X) days of bioreactor operation. DNA was extracted using a phenol-chloroform extraction method described previously (3). DNA aliquots of ∼3,000 ng for each of the three samples were shipped to the Joint Genome Institute (JGI, Berkeley, CA, USA; jgi.doe.gov) for sequencing. The HiSeq 2500 system (Illumina, San Diego, CA) was used to generate 150-bp paired-end reads. Assembly was performed using JGI’s metagenome workflow (4) with jgi_mga_meta_rqc.py version 2.1.0. Trimmed, screened, and paired-end Illumina reads were corrected using BFC (5) version r181. Reads were assembled from single samples with SPAdes (6) version 3.11.1 with “—meta” mode enabled, and binning was performed using MetaBAT (7) version 0.32.5 with default settings to create metagenome-assembled genomes (MAGs). CheckM (8) version 1.0.11 was used with default settings to assess MAG quality, and GTDB-Tk (9) version 0.1.6 was used for taxonomic assignment using the GTDB (10) database release 202. GTDB-Tk was run using the following commands: gtdbtk identify --genome_dir ./--out_dir ./GTDBTk/--extension fasta --cpus 16; gtdbtk align --identify_dir ./GTDBTk/--out_dir ./GTDBTk/align --cpus 16. The software program dRep (11) version 3.2.2 was used to select representative MAGs from 31 MAGs obtained in this work plus 10 MAGs from prior analyses (1, 3). Default dRep settings were used except that the completeness threshold was set to 60%, the contamination weight was set to 0.5, and the N50 weight was set to 5. The phylogenetic tree was created using RAxML-NG version 0.9.0 with the following sequential commands: raxml-ng --parse --msa gtdbtk.bac120.user_msa.fasta --model LG+G8+F –prefix T1; raxml-ng --all --msa T1.raxml.rba --model LG+G8+F --prefix RAxML_ --threads 13 --seed 2. The 16X assembly had an N50 value of 13.3 kb, an L50 value of 288, and an average coverage of 453× and produced 10 MAGs from 118,660,326 filtered reads. The 17X assembly had an N50 value of 5.6 kb, an L50 value of 722, and an average coverage of 402× and produced 9 MAGs from 111,508,430 filtered reads. The 18X assembly had an N50 value of 9.2 kb, an L50 value of 490, and an average coverage of 394× and produced 12 MAGs from 117,409,984 filtered reads. From the three assemblies, we obtained a total of 31 MAGs (completeness, >75%; contamination, <10%). After dereplication, eight MAGs were selected as representative MAGs (Table 1), two of which are improvements to previously published MAGs (1, 3). COR1.1 is an update of COR1 (NCBI BioSample SAMN09651346), and LAC5.1 is an update of LAC5 (NCBI BioSample SAMN09651352). The other six MAGs represent new draft genome sequences of organisms within the GTDB (10) taxonomic orders Acetobacterales (ACET1), Coriobacteriales (COR4, COR5, and COR6), Lactobacillales (LAC6), and Lachnospirales (LCO2) (Fig. 1). Organisms related to Acetobacterales, Coriobacteriales, Lactobacillales, and Lachnospirales are regularly found via 16S rRNA gene amplicon sequencing (12) as abundant organisms during chain elongation. We provide the draft genome sequences of these organisms as a resource to further elucidate metabolic processes in this emerging biotechnological field.
TABLE 1

Summary of recovered metagenome-assembled genomes

MAG IDMAG nameTaxonomic classification (GTDB)aTaxonomic classification (NCBI)Comp. (%)bCont. (%)cSize (Mb)No. of scaffoldsNearest GTDBa genome (NCBI assembly no.)ANI (%)dAF (%)eNCBI genome accession no.
ACET1UW_Xyl_ACET1s__Acetobacter senegalensisAcetobacter sp.99.503.136 GCF_001580995.1 97.5689 JAJGAA000000000
COR1.1UW_Xyl_COR1.1s__UBA7748 sp900314535Atopobium sp.99.193.232.5130 GCA_900314535.1 97.380 JAJGAB000000000
COR4UW_Xyl_COR4s__Olegusella sp002407925Olegusella sp.78.652.821.4165 GCA_002407925.1 98.5372 JAJGAC000000000
COR5UW_Xyl_COR5g__OlsenellaOlsenella sp.1006.452.746 GCF_009695875.1 92.9184 JAJGAD000000000
COR6UW_Xyl_COR6g__RUG013Denitrobacterium sp.93.032.491.9133 GCF_001486445.1 88.7185 JAJGAE000000000
LAC5.1UW_Xyl_LAC5.1s__Limosilactobacillus mucosaeLimosilactobacillus sp.99.1802.017 GCF_001436025.1 96.4988 JAJGAF000000000
LAC6UW_Xyl_LAC6s__Lentilactobacillus buchneriLactobacillus sp.99.0602.522 GCF_001434735.1 97.3494 JAJGAG000000000
LCO2UW_Xyl_LCO2g__Eubacterium_HEubacterium sp.77.142.122.5384NANANA JAJGAH000000000

GTDB, Genome Taxonomy Database (10); g_, genus name; s_, species name.

Comp., completeness percentage estimated with CheckM (8).

Cont., contamination percentage estimated with CheckM (8).

ANI, average nucleotide identity with nearest GTDB genome estimated with GTDB-Tk (9).

AF, alignment fraction with nearest GTDB genome estimated with GTDB-Tk (9).

NA, not applicable. GTDB-Tk did not report a nearest genome because the AF with near genomes was below the default threshold of 65%.

FIG 1

Phylogenetic tree constructed from recovered MAGs and related genomes. For each recovered MAG, three to five of the most closely related genomes were selected based on GTDB-Tk results (e.g., genus, species, average nucleotide identity [ANI], and alignment fraction [AF]). Genomes retrieved from NCBI were processed using GTDB-Tk to create concatenated sequences of 120 bacterial single-copy marker genes. The tree was constructed from the sequences using RAxML-NG (13) with 1,000 bootstraps. Filled circles indicate bootstrap values of 100, and all others are labeled as numbers. ACET, Acetobacterales; COR, Coriobacteriales; LAC, Lactobacillales; LCO2, Lachnospirales.

Phylogenetic tree constructed from recovered MAGs and related genomes. For each recovered MAG, three to five of the most closely related genomes were selected based on GTDB-Tk results (e.g., genus, species, average nucleotide identity [ANI], and alignment fraction [AF]). Genomes retrieved from NCBI were processed using GTDB-Tk to create concatenated sequences of 120 bacterial single-copy marker genes. The tree was constructed from the sequences using RAxML-NG (13) with 1,000 bootstraps. Filled circles indicate bootstrap values of 100, and all others are labeled as numbers. ACET, Acetobacterales; COR, Coriobacteriales; LAC, Lactobacillales; LCO2, Lachnospirales. Summary of recovered metagenome-assembled genomes GTDB, Genome Taxonomy Database (10); g_, genus name; s_, species name. Comp., completeness percentage estimated with CheckM (8). Cont., contamination percentage estimated with CheckM (8). ANI, average nucleotide identity with nearest GTDB genome estimated with GTDB-Tk (9). AF, alignment fraction with nearest GTDB genome estimated with GTDB-Tk (9). NA, not applicable. GTDB-Tk did not report a nearest genome because the AF with near genomes was below the default threshold of 65%.

Data availability.

The raw metagenomic sequencing data are available under NCBI BioProject PRJNA518398 (sample 16X), BioProject PRJNA518399 (sample 17X), and BioProject PRJNA518400 (sample 18X). MAGs are available under NCBI BioProject PRJNA771338. Individual genome accession numbers for the MAGs are provided in Table 1.
  13 in total

1.  BFC: correcting Illumina sequencing errors.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2015-05-06       Impact factor: 6.937

Review 2.  Chain elongators, friends, and foes.

Authors:  Pieter Candry; Ramon Ganigué
Journal:  Curr Opin Biotechnol       Date:  2021-01-30       Impact factor: 9.740

3.  dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication.

Authors:  Matthew R Olm; Christopher T Brown; Brandon Brooks; Jillian F Banfield
Journal:  ISME J       Date:  2017-07-25       Impact factor: 10.302

4.  DOE JGI Metagenome Workflow.

Authors:  Alicia Clum; Marcel Huntemann; Brian Bushnell; Brian Foster; Bryce Foster; Simon Roux; Patrick P Hajek; Neha Varghese; Supratim Mukherjee; T B K Reddy; Chris Daum; Yuko Yoshinaga; Ronan O'Malley; Rekha Seshadri; Nikos C Kyrpides; Emiley A Eloe-Fadrosh; I-Min A Chen; Alex Copeland; Natalia N Ivanova
Journal:  mSystems       Date:  2021-05-18       Impact factor: 6.496

5.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

6.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

7.  Metatranscriptomic and Thermodynamic Insights into Medium-Chain Fatty Acid Production Using an Anaerobic Microbiome.

Authors:  Matthew J Scarborough; Christopher E Lawson; Joshua J Hamilton; Timothy J Donohue; Daniel R Noguera
Journal:  mSystems       Date:  2018-11-20       Impact factor: 6.496

8.  RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference.

Authors:  Alexey M Kozlov; Diego Darriba; Tomáš Flouri; Benoit Morel; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

9.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.

Authors:  Dongwan D Kang; Feng Li; Edward Kirton; Ashleigh Thomas; Rob Egan; Hong An; Zhong Wang
Journal:  PeerJ       Date:  2019-07-26       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.