Literature DB >> 33958410

Metagenome-Assembled Genomes of 12 Bacterial Species from Biofouled Plastic Fabrics Harbor Multiple Genes for Degradation of Hydrocarbons.

Osman Radwan1, Oscar N Ruiz2.   

Abstract

We report the metagenome-assembled genomes (MAGs) of 12 different bacterial species recovered from environmental microbiomes associated with biofouled plastic fabrics. The MAGs have estimated sizes of 2.53 to 7.66 Mb with 3,229 to 9,289 proteins, 26.20% to 99.1% genome completeness, 48.9% to 72.6% G+C content, and multiple genes for hydrocarbon degradation.
Copyright © 2021 Radwan and Ruiz.

Entities:  

Year:  2021        PMID: 33958410      PMCID: PMC8103880          DOI: 10.1128/MRA.01458-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Several bacterial species have been shown to degrade polymers and hydrocarbon fuel (1–7). The fuel and plastic biodegradation activity of these species is attributed to multiple hydrocarbon-degrading enzymes (1–5) and hydrolytic enzymes (6, 7), respectively. In this study, bioinformatic tools were used for genome assembly and annotation of 12 bacterial species recovered from a shotgun metagenomic library of environmental microbiomes associated with biofouled plastic fabrics (8). As explained by Radwan et al., tent shelter plastic fabric samples exposed for 14 months to the Panama jungle were retrieved from the location site and stored refrigerated at 4°C (8). Samples were cut into 0.5-cm2 pieces for DNA extraction using the Qiagen DNeasy UltraClean kit (catalog number 12224-250) (8). A PrepX DNA library kit and an Apollo 324 next-generation sequencing (NGS) automatic library prep system were used to construct the DNA libraries (WaferGen, Fremont, CA). An Illumina HiSeq 2000 instrument was used to sequence the DNA libraries, generating 161,537,275,209 raw reads with an average length of 100 bp (8). For quality control, Trimmomatic 0.36 (9) was used to remove raw reads with average quality below 15 and those with a length less than 50 bp. Sequence assembly and binning of the different population-level genomes were conducted with an in-house bioinformatic pipeline comprising multiple bioinformatic programs (10) with default parameters unless otherwise noted. The bioinformatic pipeline sequentially applied, as described next, the programs BBtools (https://jgi.doe.gov/data-and-tools/bbtools/), MEGAHIT (11), Bowtie 2 (12), SAMtools (13), Pileup (13), AWK (14), MaxBin (15), SSPACE (16), GapFiller (17), RepeatMasker (18), Prodigal (19), ABySS (20), and HMMER (21). BBtools was used for sorting paired-end reads and normalization to ensure compatibility before sequence assembly using MEGAHIT with the options minimum contig length of 200 bp and meta-sensitive. The produced contigs from MEGAHIT were subjected to Bowtie 2 for mapping raw reads to contigs and to create BAM files that were converted to SAM files using SAMtools to generate the coverage matrix and abundance files in Pileup and AWK, which were finally used in MaxBin, with the options minimum contig length of 2,000 and depth of 2, for binning of individual genomes. An assembly improvement process and a sequence gap-filling process for the 12 bacterial genomes (Table 1) were performed using SSPACE and GapFiller, respectively. The masked genome sequences from RepeatMasker were used in Prodigal to annotate the function of genes. The completeness of the different genomes was extracted from the MaxBin output and ranged from 26.20% for Actinomycetospora chiangmaiensis to 99.1% for Mucilaginibacter polytrichastri. ABySS was used to calculate the genome sizes, L50 values, and G+C contents, which ranged from 2.53 to 7.66 Mb, 3 to 510 contigs, and 48.9% to 72.6%, respectively (Table 1). The proteins potentially involved in hydrocarbon degradation and polymer hydrolysis (Table 1) were identified with HMMER against the Pfam database with an E value of 0.001. The number of identified proteins for each pathway varied among species, with the Gordonia polyisoprenivorans and Williamsia herbipolensis genomes containing the highest numbers of protein-coding genes for both alkane and aromatic degradation (Table 1). Similarly, these two genomes contained a large number of hydrolase and efflux pump genes, which have been associated with resistance to toxic compounds (22). In total, 11 of the 12 bacterial genomes, all except Actinomycetospora chiangmaiensis, contained genes encoding hydrolases. The information derived from the assembly and annotation of the 12 metagenome-assembled genomes (MAGs) supports the ability of these bacterial species to degrade hydrocarbons (1–5) and, potentially, plastics (6, 7).
TABLE 1

Accession number, general statistics of metagenomic-assembled bacterial genomes, and number of genes involved in hydrocarbon degradation, polymer hydrolysis, and efflux pumps from each bacterial genome

CodeGenomePhylumGenBank accession no.Size (bp), coverage (×)L50N50 (bp)No. of proteinsCompleteness (%)No. of degradation genesNo. of hydrolase genesNo. of MFS, no. of ABCb
E08Methylobacterium mesophilicumProteobacteriaJADCRT0000000003,992,127, 2003510,1223,87398.108235, 141
E01Williamsia sp.TerrabacteriaJADCRZ0000000007,661,838, 10519127,3005,81195.3013050, 76
F07Mucilaginibacter polytrichastriBacteroidetesJADCRU0000000005,102,915, 1573936,2084,84399.1001039, 62
F05Williamsia herbipolensisTerrabacteriaJADCRY0000000004,041,607, 2791367,9534,07654.20331265, 74
C12Jatrophihabitans endophyticusActinobacteriaJADCRS0000000003,929,516, 2561567,1235,77088.8072100, 75
F04Gordonia polyisoprenivoransTerrabacteriaJADCRR0000000005,331,221, 19815310,2705,33386.005326104, 103
C15Caulobacter sp.ProteobacteriaJADCRP0000000003,614,289, 2782034,9734,18470.104366, 50
C10Gluconacetobacter diazotrophicusProteobacteriaJADCRQ0000000003,858,963, 2602385,0844,15750.502189, 27
F33Acetobacter sp.ProteobacteriaJADCRW0000000002,529,106, 4462452,6722,78032.700135, 23
D02Terriglobus roseusAcidobacteriaJADCRX0000000003,673,883, 3044832,4893,17933.606522, 16
E11Parafilimonas terraeBacteroidetesJADCRV0000000007,593,549, 1054994,4418,69755.10181145, 165
E07Actinomycetospora chiangmaiensisTerrabacteriaJADCRO0000000005,278,631, 1525123,0366,41726.2090118, 111

The Pfam database with an E value of 0.001 was used for functional annotation.

MFS, major facilitator superfamily efflux pumps; ABC, ABC transporters.

Accession number, general statistics of metagenomic-assembled bacterial genomes, and number of genes involved in hydrocarbon degradation, polymer hydrolysis, and efflux pumps from each bacterial genome The Pfam database with an E value of 0.001 was used for functional annotation. MFS, major facilitator superfamily efflux pumps; ABC, ABC transporters.

Data availability.

The raw metagenomic sequence reads and MAGs were deposited at DDBJ/ENA/GenBank under the BioProject accession number PRJNA656514 with BioSample accession numbers SAMN15786575 to SAMN15786586 and SRA accession numbers SRX9364069 to SRX9364074. The individual accession numbers of the MAGs are provided in Table 1.
  16 in total

1.  Scaffolding pre-assembled contigs using SSPACE.

Authors:  Marten Boetzer; Christiaan V Henkel; Hans J Jansen; Derek Butler; Walter Pirovano
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

2.  Identification of nitrogen-incorporating bacteria in petroleum-contaminated arctic soils by using [15N]DNA-based stable isotope probing and pyrosequencing.

Authors:  Terrence H Bell; Etienne Yergeau; Christine Martineau; David Juck; Lyle G Whyte; Charles W Greer
Journal:  Appl Environ Microbiol       Date:  2011-04-15       Impact factor: 4.792

3.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph.

Authors:  Dinghua Li; Chi-Man Liu; Ruibang Luo; Kunihiko Sadakane; Tak-Wah Lam
Journal:  Bioinformatics       Date:  2015-01-20       Impact factor: 6.937

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Toward almost closed genomes with GapFiller.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  Genome Biol       Date:  2012-06-25       Impact factor: 13.583

6.  Draft Genome Sequence of Pseudomonas frederiksbergensis SI8, a Psychrotrophic Aromatic-Degrading Bacterium.

Authors:  Oscar N Ruiz; Lisa M Brown; Richard C Striebich; Susan S Mueller; Thusitha S Gunasekera
Journal:  Genome Announc       Date:  2015-07-16

7.  MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm.

Authors:  Yu-Wei Wu; Yung-Hsu Tang; Susannah G Tringe; Blake A Simmons; Steven W Singer
Journal:  Microbiome       Date:  2014-08-01       Impact factor: 14.650

8.  HMMER web server: 2018 update.

Authors:  Simon C Potter; Aurélien Luciani; Sean R Eddy; Youngmi Park; Rodrigo Lopez; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Draft Genome Sequence of Gordonia sihwensis Strain 9, a Branched Alkane-Degrading Bacterium.

Authors:  Lisa M Brown; Thusitha S Gunasekera; Richard C Striebich; Oscar N Ruiz
Journal:  Genome Announc       Date:  2016-06-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.