Literature DB >> 32923549

Metagenomic data on prokaryotic diversity of Kunashir island geothermal spring.

Alexei S Rozanov1,2, Anton V Korzhuk1,2, Valeria N Shlyakhtun1,2, Aleksandra A Shipova1,2, Sergey E Peltek1,2.   

Abstract

This is data on the microbial diversity of a geothermal spring located on the banks of the acidic creek of Kunashir Island. Data was obtained using 16s rRNA amplicon directed metagenomic sequencing on Illumina MiSeq. The raw sequence data used for analysis is available in NCBI under the Sequence Read Archive (SRA) with the BioProject No. PRJNA637298, PRJNA637447 and SRA accession number SRP265942, SRP266050. The data sequences of the 16s rRNA gene are presented at the accession numbers MT604934-MT604967, MT604911-MT604921 in NCBI GenBank database.
© 2020 Published by Elsevier Inc.

Entities:  

Keywords:  16S rRNA; Acidic hot spring; Kunashir Island; Metagenomics

Year:  2020        PMID: 32923549      PMCID: PMC7474399          DOI: 10.1016/j.dib.2020.106244

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Data Description

A description of the sampling points is given in Table 1. Photographs of the sampling sites are shown in Fig. 1.
Table 1

Description of the sampling points

Point No.рНt,°CDescription
82.038The nameless creek originating from a hot spring with a temperature of 73°C and pH 2.0 and flowing into the Kislyi stream. A green bloom and sediment underneath were selected. Coordinates: 43.998029N, 145.767768E
103.018The sour Kislyi stream. Orange-red bottom sediments selected. Coordinates: 43.997836N, 145.767671E
Fig. 1

Sampling locations: A - point No. 8, backwater of a hot spring; B - view of the Kislyi stream at the sampling site.

Description of the sampling points Sampling locations: A - point No. 8, backwater of a hot spring; B - view of the Kislyi stream at the sampling site. The raw sequencing data contain 19,213 paired-end sequences with a length of 301 bp totalling 11.6 M base pairs for a sample No. 10. Whereas for sample No. 8, 40.380 paired-end sequences with a length of 301 bp totalling 24.3 M base pairs were obtained. A total of 52 OTUs including 10631 sequences were obtained for the two communities. For community No. 8 from the geothermal stream, 44 OTU and 11 for the microbial community from the bottom sediments of Kislyi Stream. The distribution of the types and classes of received OTUs is shown in Fig. 2.
Fig. 2

Charts of the taxonomic composition of communities: A - point No. 8; B - point No. 10.

Charts of the taxonomic composition of communities: A - point No. 8; B - point No. 10. In community No. 8, about 38% are occupied by archaea, the rest are bacteria, while in community No. 10 only bacteria are present. The following types (or classes in the case of proteobacteria) occupy more than 1% in community No. 8: Thermi, Euryarchaeota, Actinobacteria, Crenarchaeota, Parvarchaeota, Alphaproteobacteria, Firmicutes, Betaproteobacteria, Nitrospirae. In community No. 8, the following types (or classes in the case of proteobacteria) occupy more than 1%: Betaproteobacteria, Bacteroidetes, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria. The studied communities turned out to be completely dissimilar, which is probably associated with various conditions, primarily with the temperature of the media.

Experimental Design, Materials and Methods

Sample collection and DNA extraction

Bottom sediments were taken from the Kislyi stream and the nameless creek flowing into it, originating from a hot spring with a temperature of 73°C and pH 2.0. The samples was taken in sterile 50 ml Falcon tubes and was stored in alcohol at -70°C. Total DNA was isolated from the samples (0,3 g) using the Genomic DNA from soil NucleoSpin® Soil kit (Macherey-Nagel) according to the manufacturer's protocol.

Library preparation and next-generation DNA sequencing

To obtain the target fragment comprising the V3-V4 region 16s rRNA, degenerate primers U343F (5′-CCTACGGGRSGCAGCAG-3 ') and U806R (5′-GGACTACNVGGGTWTCTAAT-3′) were used, which allow the amplification of the 16s rRNA gene of a wide range of microorganisms. Previously, these primers showed wide coverage in terms of amplification of various types of microorganisms. [1]. The regime of first amplification: 96°C - 2’; 25*(96°C-8”; 54°C-20”; 68°C-30”). Purified PCR product was used for a second PCR reaction which attached Illumina sequencing adapters and dual-index barcodes to the amplicon target. The regime of second amplification: 96°C - 2’; 5*(96°C-8”; 54°C-20”; 68°C-30”); 20*(96°C-8”; 60°C-20”; 68°C-30”). Fusion polymerase Q5 (NewEnglandBiolabs) and low primer annealing temperature were used to obtain gene libraries with the smallest number shift. Sequencing was performed at the Genomics Laboratory of the IMKB SB RAS using an Illumina MiSeq instrument using the TG MiSeq Reagent Kit v3 (600 cycle) kit. Sequenced reads were automatically divided by barcodes.

Taxonomic analysis

Paired reads obtained during sequencing of 16S rRNA libraries were processed using QIIME2 v.2020.2 platform [2]. Noise removal, integration of paired reads, and construction of OTUs were performed using the DADA2 algorithm. For taxonomic classification of OTUs, the scikit-learn classifier was used, trained on fragments of 16S rRNA of the Greengenes v.13_8 database, limited by the primers used.

Ethics Statement

In the work were not involve the use of human subjects, animals, cell lines and endangered species of wild fauna and flora.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectBiochemistry, Genetics and Molecular Biology (General)
Specific subject areaGeothermal Metagenomics
Type of dataFigures and 16S rRNA amplicon sequencing data
How data were acquiredIllumina MiSeq platform, QIIME2 v.2020.2
Data formatRaw and analyzed
Parameters for data collectionMetagenomic RNA isolated from sediment samples were prepared by amplifying the V3-V4 region of the 16S rRNA gene paired-end sequenced on an Illumina MiSeq platform.
Description of data collectionMetagenomic DNA extraction, amplicon sequencing of V3-V4 region of 16S rRNA gene and paired reads processing using QIIME2 v.2020.2 platform
Data source locationKunashir island, Russia: 43.997242N, 145.767607E
Data accessibilityRaw sequencing data:Repository name: NCBI SRAData identification number: SRP266050 (point #10) and SRP265942 (point #8)Direct URL to data: https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP266050 (point #10) and https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP265942 (point #8)
  2 in total

1.  Molecular analysis of the benthos microbial community in Zavarzin thermal spring (Uzon Caldera, Kamchatka, Russia).

Authors:  Alexey S Rozanov; Alla V Bryanskaya; Tatiana K Malup; Irina A Meshcheryakova; Elena V Lazareva; Oksana P Taran; Timofey V Ivanisenko; Vladimir A Ivanisenko; Sergey M Zhmodik; Nikolay A Kolchanov; Sergey E Peltek
Journal:  BMC Genomics       Date:  2014-12-19       Impact factor: 3.969

2.  Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.

Authors:  Evan Bolyen; Jai Ram Rideout; Matthew R Dillon; Nicholas A Bokulich; Christian C Abnet; Gabriel A Al-Ghalith; Harriet Alexander; Eric J Alm; Manimozhiyan Arumugam; Francesco Asnicar; Yang Bai; Jordan E Bisanz; Kyle Bittinger; Asker Brejnrod; Colin J Brislawn; C Titus Brown; Benjamin J Callahan; Andrés Mauricio Caraballo-Rodríguez; John Chase; Emily K Cope; Ricardo Da Silva; Christian Diener; Pieter C Dorrestein; Gavin M Douglas; Daniel M Durall; Claire Duvallet; Christian F Edwardson; Madeleine Ernst; Mehrbod Estaki; Jennifer Fouquier; Julia M Gauglitz; Sean M Gibbons; Deanna L Gibson; Antonio Gonzalez; Kestrel Gorlick; Jiarong Guo; Benjamin Hillmann; Susan Holmes; Hannes Holste; Curtis Huttenhower; Gavin A Huttley; Stefan Janssen; Alan K Jarmusch; Lingjing Jiang; Benjamin D Kaehler; Kyo Bin Kang; Christopher R Keefe; Paul Keim; Scott T Kelley; Dan Knights; Irina Koester; Tomasz Kosciolek; Jorden Kreps; Morgan G I Langille; Joslynn Lee; Ruth Ley; Yong-Xin Liu; Erikka Loftfield; Catherine Lozupone; Massoud Maher; Clarisse Marotz; Bryan D Martin; Daniel McDonald; Lauren J McIver; Alexey V Melnik; Jessica L Metcalf; Sydney C Morgan; Jamie T Morton; Ahmad Turan Naimey; Jose A Navas-Molina; Louis Felix Nothias; Stephanie B Orchanian; Talima Pearson; Samuel L Peoples; Daniel Petras; Mary Lai Preuss; Elmar Pruesse; Lasse Buur Rasmussen; Adam Rivers; Michael S Robeson; Patrick Rosenthal; Nicola Segata; Michael Shaffer; Arron Shiffer; Rashmi Sinha; Se Jin Song; John R Spear; Austin D Swafford; Luke R Thompson; Pedro J Torres; Pauline Trinh; Anupriya Tripathi; Peter J Turnbaugh; Sabah Ul-Hasan; Justin J J van der Hooft; Fernando Vargas; Yoshiki Vázquez-Baeza; Emily Vogtmann; Max von Hippel; William Walters; Yunhu Wan; Mingxun Wang; Jonathan Warren; Kyle C Weber; Charles H D Williamson; Amy D Willis; Zhenjiang Zech Xu; Jesse R Zaneveld; Yilong Zhang; Qiyun Zhu; Rob Knight; J Gregory Caporaso
Journal:  Nat Biotechnol       Date:  2019-08       Impact factor: 54.908

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.