Literature DB >> 32923544

Metagenomic data on the composition of bacterial communities in lake environment sediments for fish farming by next generation Illumina sequencing.

María Custodio1, Alberto Ordinola-Zapata2, Ciro Espinoza1, Enedia Vieyra-Peña2, Richard Peñaloza1, Héctor Sánchez-Suárez3, Tessy Peralta-Ortiz2.   

Abstract

This article contains data on the bacterial communities of lagoon sediments with fish potential in the Central Andes of Peru. The surface sediment samples were collected from four lagoons destined for continental water fish farming. DNA extraction was performed from 0.5 g of sample through the Presto™ Soil DNA Extraction Kit. Bacterial sequencing of the 16S rRNA amplicon was performed on the DNA extracted from the sediment. At least 36 Phyla bacteria were detected, the bacterial communities being dominated by Proteobacteria, Cyanobacteria, Actinobacteria, Firmicutes, Chloroflexi. These data can be used for predictive analysis to gain a better understanding of the dynamics of bacterial communities in environments under pressure from fish farming.
© 2020 The Author(s).

Entities:  

Keywords:  Bacterial composition; Fish farming; Gaps; Gen 16S rRNA; Sediment

Year:  2020        PMID: 32923544      PMCID: PMC7475191          DOI: 10.1016/j.dib.2020.106228

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

These data are the first generated using 16S rRNA genes from bacterial communities in lake environments pressured by fish farming in the Peruvian Andes. These metagenomic data may be useful to other researchers to expand molecular studies and compare the composition of bacterial communities under different environmental and anthropogenic factors. These data can be used for predictive analysis to gain a better understanding of the dynamics of bacterial communities in environments under pressure from fish farming.

Data Description

Study area

The study was conducted in the Pomacocha, Habascocha, Tipicocha and Tranca Grande lagoons of glacial origin located in the Central Andes of Peru, in the upper basin of the Perene River, at an altitude between 4310 and 4330 m.a.s.l. [3]. The four lagoons are used for intensive farming of Oncorhynchus mykiss (rainbow trout) in large floating cages (Fig. 1).
Fig. 1

Location map of the study area in the Perene river watershed, Peru.

Location map of the study area in the Perene river watershed, Peru.

Analytical data

The metagenomic data presented in this manuscript provide information on the bacterial communities of lagoon sediments intended for the cultivation of Oncorhynchus mykiss in the Central Andes of Peru. The bacterial taxonomic composition generated through sequencing of the 16S rRNA amplicon using the standard next-generation Illumina MiSeq protocol is shown in Fig. 2. Analysis of the final readings revealed the Bacteria and Archaea domains. In the Habascocha lagoon the readings revealed 33 phyla, 64 classes and 127 orders, in the Pomacocha lagoon 30 phyla, 61 classes and 120 orders, in the Tipicocha lagoon 34 phyla, 61 classes and 130 orders and, in the Tranca Grande lagoon 31 phyla, 55 classes and 127 orders. The readings also revealed 276 bacterial families in the four lakes. However, between 10% and 14% of the total readings were not classified.
Fig. 2

Composition of bacterial communities in lake sediments with fish potential in the Central Andes of Peru.

Composition of bacterial communities in lake sediments with fish potential in the Central Andes of Peru. Table 1 shows the abundance of bacteria in surface sediments of lagoons with fish potential in the Central Andes of Peru, according to phylum, obtained through high performance sequencing. Table 2 shows the mean abundance and percentage contribution of phyla bacteria to the differentiation or similarity between groups, according to the SIMPER analysis. Phylum Actinobacteria presented the highest percentage of contribution to the bacterial communities (29.20%), followed by Cyanobacteria (16.11%) and Proteobacteria (14.66%). The grouping of bacterial orders by SIMPROF analysis, reported five statistically different groups in relation to the number and site of sampling (Fig. 3). The distribution of bacterial families in surface sediments of ponds with fish potential at 70% contribution by SIMPER analysis is shown in Fig. 4.
Table 1

Abundance of bacteria in surface sediment of lagoons with fish potential in the Central Andes of Peru, according to phylum.

PhylumHabascochaPomacochaTipicochaTrancagrande
Acidobacteria2668290132841023
Actinobacteria20,23413,41086683407
Aquificae831197
Armatimonadetes23156618
Bacteroidetes700110,30210,98012,591
Caldiserica20768774
Candidatus Cloacimonetes34227053
Candidatus Korarchaeota0046
Candidatus Saccharibacteria473306578278
Chlamydiae151108161
Chlorobi146914251
Chloroflexi1740223422602964
Chrysiogenetes1000
Crenarchaeota0001
Cyanobacteria13,98620,85510,24522,762
Deferribacteres0440
Deinococcus Thermus190335653250
Dictyoglomi1127587331097
Elusimicrobia2122
Euryarchaeota319253428372668
Fibrobacteres12322541
Firmicutes8616597576134841
Fusobacteria35436555
Gemmatimonadetes20084071818654
Ignavibacteriae442144014181876
Kiritimatiellaeota2322
Nitrospirae653116176182
Planctomycetes455955148
Proteobacteria58,53955,16966,42664,971
Spirochaetes108447469612
Synergistetes16385274
Tenericutes147116198134
Thaumarchaeota72663
Thermodesulfobacteria173329353459
Thermotogae14151319
Verrucomicrobia636516969814
Table 2

Mean abundance and percentage contribution of bacterial phyla in lagoon sediment with fish potential in the Central Andes of Peru, according to SIMPER analysis.

PhylumAv. dissimContrib.%Cumulative%Mean AMean B
Actinobacteria4.9029.2029.2020,2008500
Cyanobacteria2.7016.1145.3014,00018,000
Proteobacteria2.4614.6659.9758,50062,200
Bacteroidetes1.7910.6870.65700011,300
Firmicutes1.036.1776.8186206140
Euryarchaeota0.995.8882.693192680
Ignavibacteriae0.472.8385.524421580
Gemmatimonadetes0.442.6288.142010960
Acidobacteria0.352.0690.2126702400
Dictyoglomi0.311.8792.07112863
Chloroflexi0.311.8693.9317402490
Nitrospirae0.311.8495.77653653
Spirochaetes0.171.0096.77108509
Deinococcus Thermus0.090.5597.32190413
Verrucomicrobia0.090.5297.85636766
Thermodesulfobacteria0.090.5298.36173380
Candidatus Saccharibacteria0.070.3998.75473387
Aquificae0.030.1898.93839
Chlorobi0.030.1899.121487.3
Chlamydiae0.030.1799.291584
Caldiserica0.020.1599.442079
Planctomycetes0.020.1199.544587.3
Synergistetes0.020.1099.641654.7
Tenericutes0.010.0899.72147149
Thaumarchaeota0.010.0699.77723.7
Candidatus Cloacimonetes0.010.0699.833448.3
Fibrobacteres0.010.0599.881232.7
Fusobacteria0.010.0599.933554.3
Armatimonadetes0.010.0599.972333
Candidatus Korarchaeota0.000.0199.9803.33
Deferribacteres0.000.0199.9902.67
Thermotogae0.000.01100.001415.7
Chrysiogenetes0.000.00100.0010
Kiritimatiellaeota0.000.00100.0022.33
Elusimicrobia0.000.00100.0021.67
Crenarchaeota0.000.00100.0000.333
Fig. 3

Dendrogram of similarity of bacterial orders in surface sediment of lagoons with fish potential at 70% accumulated contribution, according to SIMPROF analysis.

Fig. 4

Distribution of bacterial families in surface sediment of ponds with fish potential at 70% contribution.

Abundance of bacteria in surface sediment of lagoons with fish potential in the Central Andes of Peru, according to phylum. Mean abundance and percentage contribution of bacterial phyla in lagoon sediment with fish potential in the Central Andes of Peru, according to SIMPER analysis. Dendrogram of similarity of bacterial orders in surface sediment of lagoons with fish potential at 70% accumulated contribution, according to SIMPROF analysis. Distribution of bacterial families in surface sediment of ponds with fish potential at 70% contribution.

Experimental design, materials and methods

Sediment sampling

Surface sediment samples (10 cm) were collected from four inland water fish (Oncorhynchus mykiss) culture ponds in November 2019. Sediment samples from each lagoon were conditioned in airtight plastic bags and transported on ice to the Universidad Nacional de Tumbes laboratory for analysis [4].

DNA extraction, 16S rRNA genes PCR amplification and sequencing

DNA extraction was performed from 0.5 g sample using the PrestoTM Soil DNA Extraction Kit, in accordance with the manufacturer's instructions and standard protocols. DNA concentration and quality were determined using a NanodropTM ONe quantification spectrophotometer (Thermo Fisher Scientific, Massachusetts, USA) obtaining ranges from 0.3 to 88.5 ng/µl. PCR amplification was performed using the Gene One and GE Healthcare Life Sciences kits by mixing 1 µl of the 16S rRNA F universal primer, 1 µl of the 16S rRNA R universal primer, 22 µl of the PCR mix (containing premix buffer, MgCl2, dNTPs and taqPolymerase) and 1 µl DNA sample obtaining a total reaction volume of 25 µl. Primers 27 F (5′-AGAGTTGATCCTGGCTCAG-3′) and 1392R (5′-GGTACCTTGTACGACTT-3′) were used and amplified for a product of about 1365 bp. Bacterial sequencing of the 16S rRNA amplicon was performed using the standard next-generation Illumina MiSeq [5], [6], [7], [8]. The construction of the library was carried out commercially (ADMERA HEALTH LLC, USA).

Bioinformatic analysis of sequence readings

The FASTQ files generated by the program FASTQC v0.11.9 were processed to know the length of the readings, the quality of the bases and the percentage of nucleotide bases. Subsequently, quality filtering and removal of regions of the primer and adapters present in the readings was performed using the Trimmomatic v0.39 program [9] with minimum trimming values of Q30 and trimming of readings below 30 bp. All individual reads were greater than 150,000 per isolate with a read length of 251 nucleotides and a quality value of each sequenced base greater than 30. The taxonomic analysis was performed using the program [10], based on the database minikraken_20,171,019_4GB. This program also handles multiple scripts for circular representation. Finally, operational taxonomic units were identified and abundances calculated [11,12].

Statistical analysis

Similarity percentage analysis (SIMPER) was performed to calculate the relative contribution of each taxon to the overall average dissimilarity observed between two or more groups of taxonomic assemblages. The groups were defined on the basis of a preliminary similarity profile clustering analysis (SIMPROF) of the same taxonomic occurrence data set [13]. The SIMPROF analysis allowed to test the multivariate structure within groups of samples. Square-root transformed abundances were used to calculate Bray Curtis similarities [14], showing patterns between samples determined by significant similarity measurements (p < 0.05), using clustering and ordering [15]. These analyses were performed in the Primer V7.

Nucleotide sequence access numbers

The 16S rRNA gene sequences reported in this study were sent to the GenBank database with the access number PRJNA657251 (https://www.ncbi.nlm.nih.gov/sra/PRJNA657251).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectBiology
Specific subject areaMicrobial ecology
Type of dataTables, figures, FASTQ
How data was acquiredHigh performance sequencing data of the 16S rRNA gene amplicon using Illumina MiSeq sequencing [1].
Data formatRaw and analyzed
Parameters for data collectionIdentification of ponds with fish activity and sediment collection.
Description of data collectionExtraction and amplification of bacterial DNA by PCR and sequencing of 16S bacterial rRNA amplicon [2].
Data source locationLagoons with fish potential located in the Central Andes of Peru, between latitude −11.7808°, longitude −75.2454° and latitude −11.7198, longitude −75.2311 (Fig. 1).
Data accessibilityData is available in the article.
  12 in total

1.  Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.

Authors:  James J Kozich; Sarah L Westcott; Nielson T Baxter; Sarah K Highlander; Patrick D Schloss
Journal:  Appl Environ Microbiol       Date:  2013-06-21       Impact factor: 4.792

2.  UPARSE: highly accurate OTU sequences from microbial amplicon reads.

Authors:  Robert C Edgar
Journal:  Nat Methods       Date:  2013-08-18       Impact factor: 28.547

3.  Analyses of sublittoral macrobenthic community change in a marine nature reserve using similarity profiles (SIMPROF).

Authors:  P J Somerfield; M Burton; W G Sanderson
Journal:  Mar Environ Res       Date:  2014-06-14       Impact factor: 3.130

4.  Performance comparison of Illumina and ion torrent next-generation sequencing platforms for 16S rRNA-based bacterial community profiling.

Authors:  Stephen J Salipante; Toana Kawashima; Christopher Rosenthal; Daniel R Hoogestraat; Lisa A Cummings; Dhruba J Sengupta; Timothy T Harkins; Brad T Cookson; Noah G Hoffman
Journal:  Appl Environ Microbiol       Date:  2014-09-26       Impact factor: 4.792

5.  Evaluation and optimization of DNA extraction and purification procedures for soil and sediment samples.

Authors:  D N Miller; J E Bryant; E L Madsen; W C Ghiorse
Journal:  Appl Environ Microbiol       Date:  1999-11       Impact factor: 4.792

6.  Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms.

Authors:  J Gregory Caporaso; Christian L Lauber; William A Walters; Donna Berg-Lyons; James Huntley; Noah Fierer; Sarah M Owens; Jason Betley; Louise Fraser; Markus Bauer; Niall Gormley; Jack A Gilbert; Geoff Smith; Rob Knight
Journal:  ISME J       Date:  2012-03-08       Impact factor: 10.302

7.  Metagenomic data of fungal community in Kongsfjorden, Arctic using Illumina next generation sequencing.

Authors:  Farha Arakkaveettil Kabeer; T Jabir; K P Krishnan; Mohamed Hatha Abdulla
Journal:  Data Brief       Date:  2018-12-13

8.  An improved dual-indexing approach for multiplexed 16S rRNA gene sequencing on the Illumina MiSeq platform.

Authors:  Douglas W Fadrosh; Bing Ma; Pawel Gajer; Naomi Sengamalay; Sandra Ott; Rebecca M Brotman; Jacques Ravel
Journal:  Microbiome       Date:  2014-02-24       Impact factor: 14.650

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  16S rRNA gene sequencing of mock microbial populations- impact of DNA extraction method, primer choice and sequencing platform.

Authors:  Fiona Fouhy; Adam G Clooney; Catherine Stanton; Marcus J Claesson; Paul D Cotter
Journal:  BMC Microbiol       Date:  2016-06-24       Impact factor: 3.605

View more
  1 in total

1.  Microbial diversity in intensively farmed lake sediment contaminated by heavy metals and identification of microbial taxa bioindicators of environmental quality.

Authors:  María Custodio; Ciro Espinoza; Richard Peñaloza; Tessy Peralta-Ortiz; Héctor Sánchez-Suárez; Alberto Ordinola-Zapata; Enedia Vieyra-Peña
Journal:  Sci Rep       Date:  2022-01-07       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.