| Literature DB >> 30323887 |
Gamaliel López-Leal1, Fernanda Cornejo-Granados1, Juan Manuel Hurtado-Ramírez1, Alfredo Mendoza-Vargas2, Adrian Ochoa-Leyva1.
Abstract
Microbiome sequencing has become the standard procedure in the study of new ecological and human-constructed niches. To our knowledge, this is the first report of a metagenome from the water of a greenhouse drain. We found that the greenhouse is not a diverse niche, mainly dominated by Rhizobiales and Rodobacterales. The analysis of the functions encoded in the metagenome showed enrichment of characteristic features of soil and root-associated bacteria such as ABC-transporters and hydrolase enzymes. Additionally, we found antibiotic resistances genes principally for spectinomycin, tetracycline, and aminoglycosides. This study aimed to identify the bacteria and functional gene composition of a greenhouse water drain sample and also provide a genomic resource to search novel proteins from a previously unexplored niche. All the metagenome proteins and their annotations are available to the scientific community via http://microbiomics.ibt.unam.mx/tools/metagreenhouse/.Entities:
Keywords: Environmental sample; Greenhouse; Metagenome; Shotgun sequencing; Water drain
Year: 2018 PMID: 30323887 PMCID: PMC6173933 DOI: 10.1186/s40793-018-0326-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Study information
| Label | Greenhouse Drain-IBt |
|---|---|
| MG-RAST ID | mgm4717011.3, mgm4717032.3, mgm4716707.3, mgm4716832.3, mgm4716680.3, mgm4716681.3, mgm4716833.3, mgm4717034.3 |
| SRA ID | SRR5689218 (Drain A) |
| Study | NA |
| GOLD ID (sequencing project) | NA |
| GOLD ID (analysis project) | NA |
| NCBI BIOPROJECT | PRJNA390663 |
| Relevance | Water drain sample |
Sample information
| Label | Greenhouse Drain-IBt |
|---|---|
| GOLD ID (biosample) | NA |
| Biome | Culturing environment |
| Feature | Water of greenhouse drain system |
| Material | Water |
| Latitude and Longitude | 18.918611, −99.234167 |
| Vertical distance | 1510 m over sea level |
| Geographic location | Cuernavaca, Morelos. México |
| Collection date and time | 14/09/15, 18:00 h (GMT-5) |
Library information
| Label | Drain-A | Drain-B |
|---|---|---|
| Sample Label(s) | Drain-A | Drain-B |
| Sample prep method | ZR Soil Microbe DNA (Zymo) | ZR Soil Microbe DNA (Zymo) |
| Library prep method(s) | Nextera XT | Nextera XT |
| Sequencing platform(s) | Illumina NextSeq 500 | Illumina NextSeq 500 |
| Sequencing chemistry | V2 SBS Kit | V2 SBS Kit |
| Sequence size (GBp) | 10.4GBp | 0.60GBp |
| Number of reads | 6,976,736 | 401,466 |
| Single-read or paired-end sequencing? | Paired-end | Paired-end |
| Sequencing library insert size | 500 bp | 2000 bp |
| Average read length | 150 bp | 150 bp |
Sequence processing
| Label | Greenhouse Drain-IBt (merged library name) |
|---|---|
| Tool(s) used for quality control | Fast QC, Dynamic Trimm |
| Number of sequences removed by quality control procedures | 169,936 |
| Number of sequences that passed quality control procedures | 7,208,266 |
| Number of artificial duplicate reads | 664,856 |
Metagenome statistics
| Label | Metagenome Label | Comment |
|---|---|---|
| Libraries used | Drain-A and Drain-B | We performed the assembly using all the reads of the two libraries that passed quality filters. |
| Assembly tool(s) used | IDBA-UD and MetaSpades and merged with nucmer | 20–125 of |
| Number of contigs after assembly | 7003 | These numbers correspond to the best assembly merged using nucmer. |
| Number of singletons after assembly | N/A | MetaSpades and IDBA-UD were used in pre-correction mode to discard singletons k-mers. |
| Total bases assembled | 859,091,400 | Total base pairs in the assembly. |
| Contig n50 | 4246 | |
| % of Sequences assembled | 97% | The fraction of the input data in the assembly. |
| Measure for % assembled | 79% | The method used for calculating % assembled was determinate by read mapping using BWA (default parameters) against final assembly and considering the total reads (7,208,266 reads) |
Annotation parameters
| Label | Metagenome Label | Comment |
|---|---|---|
| Annotation system | Drain-IBt | The functional annotation using the reads was obtained using MG-RAST, Kraken, and HUMAnN2, while the functional protein annotation of the assembly was obtained from GO, InterPro, and KEGG using Blast2GO. |
| Gene calling program | Frag Gene Scan | FragGeneScan was training with Illumina reads. |
| Annotation algorithm | ||
| Database(s) used | RefSeq, SEED, ChocoPhlAn, UniRef90, Interpro (data bases) Blast NR data base | |
Metagenome properties
| Label | Metagenome label | Comment |
|---|---|---|
| Number of contigs | 7003 | |
| GBp | 11.0 GBp | |
| Number of features identified | 25,735 | Total number of predicted protein features from the assembly |
| CDS | 21,700 | Total number of proteins annotated by Blast2GO and Interpro. |
| rRNA | 18,612 | Total number of reads determined as ribosomal genes using RiboPicker version 0.4.3. |
| CDSs with GO | 14,328 | Number of proteins with GO terms. |
| CDSs with UniRef90 | 1,619,062 | |
| CDS with SEED subsystem | 786,622 | |
| Alpha diversity | 2.04 and 1.99 | Alpha diversity was determinate at order level comparing MG-RAST and Kraken results. Shannon index was measured using Phyloseq. |
Taxonomic composition
| Phylum | Greenhouse MG-RAST | Greenhouse Kraken |
|---|---|---|
|
| 0.0015405 | 0.0012883 |
|
| 0.0267721 | 0.0316892 |
|
| 0.0003382 | 0.0000208 |
|
| NA | 0.0000139 |
|
| 0.0208696 | 0.0049244 |
|
| 0.0002807 | 0.0000231 |
|
| 0.0012536 | 0.0005910 |
|
| 0.0030123 | 0.0010620 |
|
| 0.0003128 | 0.0001454 |
|
| NA | 0.0001524 |
|
| 0.0047115 | 0.0015376 |
|
| 0.0002573 | 0.0000185 |
|
| 0.0024569 | 0.0021171 |
|
| 0.0000570 | NA |
|
| 0.0000478 | NA |
|
| NA | 0.0005772 |
|
| 0.0000351 | 0.0000277 |
|
| 0.0107219 | 0.0047444 |
|
| 0.0002528 | 0.0000300 |
|
| 0.0003763 | 0.0001316 |
|
| NA | 0.0000162 |
|
| 0.0001322 | NA |
|
| 0.0001816 | 0.0001247 |
|
| 0.0019784 | 0.0006049 |
|
| 0.9125950 | 0.9467869 |
|
| 0.0007344 | 0.0008519 |
|
| 0.0005554 | 0.0000139 |
|
| 0.0000585 | 0.0001893 |
|
| NA | 0.0000231 |
|
| 0.0004699 | 0.0001732 |
|
| 0.0099982 | 0.0021217 |
Relative abundances at phylum level using MG-RAST and Kraken, Relative abundances were determinate using the normalized number of reads of each order divided into the total number of reads
Fig. 1Bacterial abundance at order level. Relative abundance of bacterial orders classified by Kraken and MG-RAST
Functional diversity
| Level 2 category | Relative Abundance |
|---|---|
| Vitamins | 0.071243175 |
| Protein biosynthesis | 0.055205552 |
| Central carbohydrate metabolism | 0.052705254 |
| ABC transporters | 0.046491929 |
| Lipids | 0.043826646 |
| Disease and Defense | 0.037793615 |
| Prophages | 0.037587808 |
| Branched-chain amino acids | 0.033340704 |
| Arginine | 0.032626333 |
| Lysine | 0.031525862 |
Top ten of the most abundant functions annotated by MG-RAST against SEED database. Relative abundances were determined normalizing by the total number of reads
Functional diversity (UniRef90)
| Pathway | Relative abundance |
|---|---|
| Gondoate biosynthesis | 0.071311068 |
| L-isoleucine biosynthesis I | 0.058340078 |
| Adenosine ribonucleotides de novo biosynthesis | 0.05276 |
| Superpathway of guanosine nucleotides de novo biosynthesis I | 0.039785498 |
| L-valine biosynthesis | 0.037358283 |
| Guanosine ribonucleotides de novo biosynthesis | 0.029089431 |
| Superpathway of pyrimidine deoxyribonucleotides de novo biosynthesis | 0.027701595 |
| 5-aminoimidazole ribonucleotide biosynthesis II | 0.026628851 |
| Superpathway of L-threonine biosynthesis | 0.026026654 |
| Mycolate biosynthesis | 0.02416827 |
Top ten of most abundant function annotated by HUMAnN2 against UniRef90 database. Relative abundances were determined normalizing by the total number of reads
Fig. 2Gene Ontology (GO) terms distribution. The pie graphs show the number of genes annotated for cellular component, biological process, and molecular function categories
Fig. 3KEGG pathways associated with ABC-transporters. KEGG pathways of the ABC-transporters with all genes present in the metagenome
Antibiotics resistance genes
| Type | Number of genes | Resistance |
|---|---|---|
| vatb, aad9ib, aph6ic | 5 | Spectinomycin |
| tetpb, tetm, tetx | 4 | Tetracycline |
| emre | 3 | Aminoglycoside |
| aac3ia | 3 | Astromicin, Gentamicin, Sisomicin |
| baca | 3 | Bacitracin |
| ceob, catb1 | 2 | Chloramphenicol |
| cara, tlrc | 2 | Lincosamide, Macrolide, Streptogramin_b |
| acrb | 1 | Acriflavin, Aminoglycoside, beta_lactam, Glycylcycline |
| bl2d_oxa2 | 1 | Cloxacillin, Penicillin |
| mexd | 1 | Erythromycin, Fluoroquinolone, Glycylcycline, Roxithromycin |
| fosa | 1 | Fosfomycin |
| ksga | 1 | Kasugamycin |
| macb | 1 | Macrolide |
| pbp1b | 1 | Penicillin |
| arna | 1 | Polymyxin |
| dfra26 | 1 | Trimethoprim |