| Literature DB >> 35252502 |
Patricia Gómez-Villegas1, José Luis Guerrero2, Miguel Pérez-Rodríguez3, Juan Pedro Bolivar2, Javier Vigara1, Rosa León1.
Abstract
Around 100 Mt of phosphogypsum (PG) of extreme acidity and with high concentrations of heavy metals and radionuclides have been deposited on the salt marshes of the Tinto River estuary in Huelva (SW Spain) for more than forty years. The microbial community able to thrive in these adverse conditions remains totally unknown, despite the fact that it can highly influence the biogeochemical cycle of the phosphogypsum components and include new species with biotechnological interest. High throughput sequencing of 16S/18S rRNA encoding genes is a potent tool to uncover the microbial diversity of extreme environments. This data article describes for the first time the prokaryotic and eukaryotic diversity of two water samples collected in the Huelva phosphogypsum stacks. The raw amplicons of the 16S/18S rRNA maker genes for the two phosphogypsum samples and two reference samples (seawater and the Tinto River water) obtained after sequencing on MiSeq platform are provided. The operational taxonomic units (OTUs) obtained after the treatment and clustering of the obtained reads with the QIIME2 pipeline and their taxonomic assignation performed by comparison with the SILVA database are also presented to complete the information of the article "Exploring the microbial community inhabiting the phosphogypsum stacks of Huelva (SW, Spain) by a high throughput 16S/18S rDNA Sequencing approach".Entities:
Keywords: 16s rRNA; 18s rRNA; Extreme environment; Metabarcoding; Metataxonomy; Phosphogypsum
Year: 2022 PMID: 35252502 PMCID: PMC8891958 DOI: 10.1016/j.dib.2022.107989
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Aspect of the phosphogypsum stacks of Huelva.
Purity of the Genomic DNA isolated from the two phosphogypsum and the two reference samples.
| Sample description | Sample Name | Concentration (ng/µl) | 260/280 ratio | Volumen (µl) | % G + C |
|---|---|---|---|---|---|
| Tinto River (TR) | D3_16S | 33.74 | 1.81 | 25 | 55.27 |
| Seawater (SW) | D4_16S | 5.84 | 1.58 | 25 | 52.55 |
| Perimeter channel (PC) | D7_16S | 6.27 | 2.04 | 20 | 55.24 |
| Piezometer (PZ) | D8_16S | 4.09 | 1.59 | 20 | 55.35 |
| Tinto River (TR) | D3_18S | 33.74 | 1.81 | 25 | 54.08 |
| Seawater (SW) | D4_18S | 5.84 | 1.58 | 25 | 50.86 |
| Perimeter channel (PC) | D7_18S | 6.27 | 2.04 | 20 | 50.72 |
| Piezometer (PZ) | D8_18S | 4.09 | 1.59 | 20 | 53.80 |
List of FASTQ files included as supplementary material which contain the raw data (forward and reverse) obtained from the sequencing the 16sRNA and the 18sRNA libraries: PC, perimeter channel; PZ, piezometer; TR, Tinto River; SW, seawater.
| 16S rRNA | 18S rRNA | ||||
|---|---|---|---|---|---|
| D7_16S_1.fq | PC, forward | Suppl. Mat S1 | D7_18S_1.fq | PC, forward | Suppl. Mat S2 |
| D7_16S_2.fq | PC, reverse | Suppl. Mat S1 | D7_18S_2.fq | PC, reverse | Suppl. Mat S2 |
| D8_16S_1.fq | PZ, forward | Suppl. Mat S1 | D8_18S_1.fq | PZ, forward | Suppl. Mat S2 |
| D8_16S_2.fq | PZ, reverse | Suppl. Mat S1 | D8_18S_2.fq | PZ, reverse | Suppl. Mat S2 |
| D3_16S_1.fq | TR, forward | Suppl. Mat S1 | D3_18S_1.fq | TR, forward | Suppl. Mat S2 |
| D3_16S_2.fq | TR, reverse | Suppl. Mat S1 | D3_18S_2.fq | TR, reverse | Suppl. Mat S2 |
| D4_16S_1.fq | SW, forward | Suppl. Mat S1 | D4_18S_1.fq | SW, forward | Suppl. Mat S2 |
| D4_16S_2.fq | SW, reverse | Suppl. Mat S1 | D4_18S_2.fq | SW, reverse | Suppl. Mat S2 |
Fig. 2Workflow of data analysis from obtaining of the raw reads to the species annotation.
Number of raw, filtered and merged reads, and the number of clustered Operational Taxonomic Unit, for each sample. Adapted from Gómez-Villegas et al. 2022 [1].
| Mean quality | ||||||||
|---|---|---|---|---|---|---|---|---|
| Code | Sample | Raw Reads | After denoising | Merged inputs | Non chimeric reads | Q30 (%) | Q Score | Observed OTUs |
| D7_16S | PC_16S | 19,129 | 14,995 | 11,338 | 10,555 | 94.60 | ≥ 36 | 680 |
| D7_18S | PC_18S | 10,067 | 8864 | 8716 | 6076 | 93.36 | ≥ 36 | 38 |
| D8_16S | PZ_16S | 17,084 | 13,236 | 10,254 | 9409 | 94.54 | ≥ 36 | 596 |
| D8_18S | PZ_18S | 30,362 | 21,890 | 19,607 | 15,080 | 89.74 | ≥ 36 | 186 |
| D3_16S | TR_16S | 189,164 | 175,352 | 145,395 | 145,395 | 95.12 | ≥ 36 | 348 |
| D3_18S | TR_18S | 160,136 | 126,280 | 123,047 | 103,365 | 90.99 | ≥ 36 | 133 |
| D4_16S | SW_16S | 206,161 | 190,181 | 179,924 | 171,677 | 95.17 | ≥ 36 | 838 |
| D4_18S | SW_18S | 203,677 | 172,627 | 162,688 | 111,906 | 92.24 | ≥ 36 | 399 |
Sample names: PC, perimeter channel; PZ, piezometer; TR, Tinto River; SW, Seawater.
| Subject | Environmental science |
| Specific subject area | Metataxonomy of environmental samples. |
| Type of data | FASTQ and Excel Tables |
| How the data were acquired | Raw data were obtained on Illumine MiSeq platform (Illumina, San Diego, California) with Illumina MiSeq Reagent kit V2 (2 × 250 bp) Processed data were obtained with QIIME 2 (v2020.8) software from the raw data Operational taxonomic units were taxonomically classified by comparison with the SILVA database |
| Data format | Raw and analysed |
| Description of data collection | Two of the samples were collected from two different water bodies of the phosphogypsum stacks deposited on the Tinto salt marshes of Huelva, Spain |
| Data source location | Institution: University of Huelva City/Town/Region: Huelva Country: Spain |
| Data accessibility | As supplementary material with this article and in the Dryad repository (10.5061/dryad.18931zczx) |
| Related research article | EXPLORING THE MICROBIAL COMMUNITY INHABITING THE PHOSPHOGYPSUM STACKS OF HUELVA (SW SPAIN) BY A HIGH THROUGHPUT 16S/18S rDNA SEQUENCING APPROACH. Submitted to Aquatic toxicology. In press |