Literature DB >> 26793757

Digital data for Quick Response (QR) codes of thermophiles to identify and compare the bacterial species isolated from Unkeshwar hot springs (India).

Bhagwan N Rekadwad1, Chandrahasya N Khobragade1.   

Abstract

16S rRNA sequences of morphologically and biochemically identified 21 thermophilic bacteria isolated from Unkeshwar hot springs (19°85'N and 78°25'E), Dist. Nanded (India) has been deposited in NCBI repository. The 16S rRNA gene sequences were used to generate QR codes for sequences (FASTA format and full Gene Bank information). Diversity among the isolates is compared with known isolates and evaluated using CGR, FCGR and PCA i.e. visual comparison and evaluation respectively. Considerable biodiversity was observed among the identified bacteria isolated from Unkeshwar hot springs. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/.

Entities:  

Keywords:  DNA bank; DNA signatures; Microbial diversity informatics; Thermal springs

Year:  2015        PMID: 26793757      PMCID: PMC4688402          DOI: 10.1016/j.dib.2015.11.035

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Raw data is available through NCBI׳s BioSample database (www.ncbi.nlm.nih.gov/nuccore). BioSample IDs include JN392966-JN392971, KC120909-KC120919, KM KM998072-KM998074 and KP053645. Data is with this article made available to users Each isolates have two hyperlinked QR codes, CGR, FCGR. Names of isolates, Accession Numbers, QR codes, CG and PCR, FCGR and PCA of isolates made available on internet on website created by us https://sites.google.com/site/bhagwanrekadwad/ Value of the data Microbial community isolated from Unkeshwar hot spring has enormous biotechnological applications. Generated digital information is a limelight for identification and comparison of newly isolated microorganisms. This digitization of 16S RNA sequences of thermophiles were carried out first time by us from Unkeshwar hot spring and made available to users. This generated digital information provides a baseline to any researchers by reducing time and cost on identification and comparison of bacterial diversity in hot springs. The DNA sequence data digitization is a standard, fast and reliable tool for identification of microorganisms up to species level using short DNA sequences.

Experimental design, materials and methods

The Sanger׳s dideoxy method was adopted for DNA sequencing. 16S rRNA gene sequence analysis was carried out to confirm the identity of bacteria using morphological and biochemical tests. The bacterial cultures were enriched in a nutrient agar medium and the DNA was extracted using a phenolchloroform method with slight modification. The method was modified as follow. About 2 mL of cell pellet from each enrichment culture of isolate was suspended in extraction buffer containing (100 mM Tris–HCl, pH 8.0, 100 mM Na2EDTA (pH 8.0) and Proteinase K (Nitrogen, USA) at the final concentration of 100 mg/mL. The resulting mixture was incubated at 55 °C for 2 h with continuous shaking. To this 0.5 M NaCl was added and incubated at 72 °C for 30 min. Subsequently, DNA was extracted by phenol:chloroform:isoamyl alcohol (1:1:1). It was washed twice with 70% ethanol and dissolved in Tris-EDTA buffer. The DNA was analyzed by electrophoresis in a 0.8% agarose gel stained with ethidium bromide and visualized under UV trans-illuminator. The 16S rDNA of the enriched strains were amplified with two different pair of eubacteria specific primers (forward primer 530 F: 5′ GTGCCAGCAGCCGCGG 3′ and reverse primer 1392 R: 5′ACGGGCGGTGTGTAC 3′ and forward primer Bac 8F: 5′ AGAGTTTGATCCTGGCTCAG 3′ and reverse primer 1492 R: 5′ GGTTACCTTGTTACGACTT 3′). The PCR conditions used were an initial denaturation at 94 °C for two minutes, followed by 35 cycles of denaturation at 95 °C for one minute, annealing at 55 °C for one minute and extension at 72 °C for one minute. Finally, extension was given at 72 °C for 10 min. The PCR products were electrophoresed in 1% (w/v) agarose gel containing ethidium bromide (1 µg mL−1) so as to get fragments of DNA. The resulting products were purified and directly sequenced on the Amplified Biosystem Model 3730 XI (96 capillaries) DNA sequencer (Amplified Biosystems, Inc., Foster City, Calif, USA). The sequences of bacterial isolates were determined through a BLAST search. Nucleotide sequences were aligned using the software MEGA 6. The phylogenetic tree was constructed by the neighbor-joining method using a distance Matrix from the alignment. Tree files were generated by PHYLIP and viewed by TREEVIEW program. Bootstrap analysis was also carried out to know the evolutionary history of bacteria [1], [2], [3], [4].

Data

The DNA QR codes of identified bacterial species were generated using DNA BarID downloaded from NEERI-CSIR, Nagpur website. The generated QR codes for the species (Table 1) of bacteria have unique QR codes (Table 2) which do not resembles with any other species or strains in any database. Using these QR codes any smart user can scan QR code and read more information on bacterial species. This information is useful to identify and compare the QR-coded isolates or sequences isolated from hot spring environment/extremes.
Table 1

Names and accession numbers of QR coded isolates isolated and identified Unkeshwar hot springs.

SpeciesAccession numbers
Naxibacter sp. AF_NAK1-3JN392966
Bacillus licheniformisJN392967
Brevibacillus borstelensisJN392968
Actinobacterium EF_NAK1-7JN392969
Brevibacillus sp. EF_TYK1-4JN392970
Bacillus sp. EF_TYK1-5JN392971
Bacillus pumilusKC120909
Brevibacillus brevisKC120910
Bacillus sp. W7KC120911
Burkholderia sp. W11KC120912
Pseudomonas pseudoalcaligenesKC120913
Brevundimonas diminutaKC120914
Acinetobacter baumanniiKC120915
Bacillus megateriumKC120916
Bacillus sp. W3KC120917
Alcaligenes sp. U1(2013)KC120918
Brevibacillus sp. NAK1-14KC120919
Escherichia coli strain NW1KM998072
Escherichia coli strain NW2KM998073
Escherichia coli strain NW3KM998074
Geobacillus thermoleovorans strain rekadwadsisKP053645
Table 2

QR code generated for FASTA format sequences and Gene Bank (full) information using DNA BarID software.

The generated data were compared with other visual techniques such as CGR and FCGR. The phylogenetic tree was constructed using MEGA6 and PCA for comparative analysis (Fig. 1, Fig. 2, Fig. 3).
Fig. 1

Diagram shows constructive flow chart to assess microbial diversity and its digitization.

Fig. 2

Chaos Game representation (CGR) codes of isolates showing difference in composition of DNA base sequences.

Fig. 3

Chaos Game Representation of frequencies (FCGR) of isolates.

Digitization and microbial diversity informatics

QR codes for 16S rRNA gene sequences in FASTA format and for full Gene Bank information was generated using DNA BarID software developed by Purohit et al. [5]. The diversity of microorganisms isolated from various hot springs including Unkeshwar, District Nanded, India (19°85′N and 78°25′E) were observed and compared using phylogenetic tree and PCA (Fig. 4, Fig. 5).
Fig. 4

Evolutionary relationships of taxa (JN392966-JN392971, KC120909-KC120919, KM998072-KM998074 and KP053645 with other species isolated from hot springs). The evolutionary history was inferred using the Neighbor-Joining method [6]. The bootstrap consensus tree inferred from 1000 replicates is taken to represent the evolutionary history of the taxa analyzed [7]. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The evolutionary distances were computed using the Maximum Composite Likelihood method [8] and are in the units of the number of base substitutions per site. The analysis involved 65 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 591 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [9].

Fig. 5

Principal component analysis (PCA) of isolates.

QR codes hyper links

The QR codes were hyperlinked using Microsoft word processor software. The QR codes of 21 identified bacteria available to any user on a portal https://sites.google.com/site/bhagwanrekadwad/.

Bacterial sequences

The FASTA format sequences and Gene Bank (full) information of 16S rRNA sequences of 21 isolated bacteria identified by us are taken for digitization. 16S rRNA sequences of identified strains submitted to NCBI repository with accession numbers JN392966-JN392971, KC120909-KC120919, KM998072-KM998074 and KP053645. Using 16S rRNA sequences, the generated QR codes, CGR, FCGR and PCA were made available to any user on website https://sites.google.com/site/bhagwanrekadwad/.
Subject areaBiology
More specific subject areaMicrobial diversity Informatics
Type of dataText file, sequences, table, Quick Response Codes (QR Codes), Chaos Game representation (CGR) and Chaos Game Representation of Frequencies (FCGR), neighbor joining(NJ) plot and Principal Component Analysis (PCA) images
How data was acquiredAmplified Biosystems Model 3730 XI (96 capillary) DNA sequencer
Data formatRaw and analyzed
Experimental factorsDNA fragments were obtained using slightly modified Phenol-Chloroform method.
Experimental featuresGenomic DNA fragmented and then sequenced using Sangers dideoxy DNA sequencing method using Amplified Biosystems DNA sequencer. 16SrRNA gene sequences were used to create QR codes using DNA BarID software.
Data source locationUnkeshwar (19°85′N and 78°25′E), School of Life Sciences, Swami Ramanand Teerth Marathwada University, Nanded, India (19°6′N and 78°17′E).
Data accessibility

Raw data is available through NCBI׳s BioSample database (www.ncbi.nlm.nih.gov/nuccore). BioSample IDs include JN392966-JN392971, KC120909-KC120919, KM KM998072-KM998074 and KP053645.

Data is with this article made available to users

Each isolates have two hyperlinked QR codes, CGR, FCGR.

Names of isolates, Accession Numbers, QR codes, CG and PCR, FCGR and PCA of isolates made available on internet on website created by us https://sites.google.com/site/bhagwanrekadwad/

  4 in total

1.  Prospects for inferring very large phylogenies by using the neighbor-joining method.

Authors:  Koichiro Tamura; Masatoshi Nei; Sudhir Kumar
Journal:  Proc Natl Acad Sci U S A       Date:  2004-07-16       Impact factor: 11.205

2.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

3.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

4.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

  4 in total
  11 in total

1.  Genomics dataset of unidentified disclosed isolates.

Authors:  Bhagwan N Rekadwad
Journal:  Data Brief       Date:  2016-06-15

2.  Genomics dataset on unclassified published organism (patent US 7547531).

Authors:  Mohammad Mahfuz Ali Khan Shawan; Md Ashraful Hasan; Md Mozammel Hossain; Md Mahmudul Hasan; Afroza Parvin; Salina Akter; Kazi Rasel Uddin; Subrata Banik; Mahbubul Morshed; Md Nazibur Rahman; S M Badier Rahman
Journal:  Data Brief       Date:  2016-10-05

3.  Genomic Analysis of a Marine Bacterium: Bioinformatics for Comparison, Evaluation, and Interpretation of DNA Sequences.

Authors:  Bhagwan N Rekadwad; Juan M Gonzalez; Chandrahasya N Khobragade
Journal:  Biomed Res Int       Date:  2016-11-01       Impact factor: 3.411

4.  Data on graphical representation (CGR and FCGR) of bacterial and archaeal species from two Soda Lakes.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2017-03-16

5.  Bioinformatics delimitation of the psychrophilic and psychrotolerant actinobacteria isolated from the Polar Frontal waters of the Southern Ocean.

Authors:  Palaniappan Sivasankar; Bhagwan Rekadwad; Subramaniam Poongodi; Kannan Sivakumar; Bhaskar Venkateswaran Parli; N Anil Kumar
Journal:  Data Brief       Date:  2018-03-08

6.  Bioinformatics data supporting revelatory diversity of cultivable thermophiles isolated and identified from two terrestrial hot springs, Unkeshwar, India.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-23

7.  Digital data for quick response (QR) codes of alkalophilic Bacillus pumilus to identify and to compare bacilli isolated from Lonar Crator Lake, India.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-09

8.  Digital data of quality control strains under general deposit at Microbial Culture Collection (MCC), NCCS, Pune, India: A bioinformatics approach.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-26

9.  Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-05-27

10.  Correcting names of bacteria deposited in National Microbial Repositories: an analysed sequence data necessary for taxonomic re-categorization of misclassified bacteria-ONE example, genus Lysinibacillus.

Authors:  Bhagwan N Rekadwad; Juan M Gonzalez
Journal:  Data Brief       Date:  2017-07-05
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.