Literature DB >> 27141529

Digital data for quick response (QR) codes of alkalophilic Bacillus pumilus to identify and to compare bacilli isolated from Lonar Crator Lake, India.

Bhagwan N Rekadwad1, Chandrahasya N Khobragade1.   

Abstract

Microbiologists are routinely engaged isolation, identification and comparison of isolated bacteria for their novelty. 16S rRNA sequences of Bacillus pumilus were retrieved from NCBI repository and generated QR codes for sequences (FASTA format and full Gene Bank information). 16SrRNA were used to generate quick response (QR) codes of Bacillus pumilus isolated from Lonar Crator Lake (19° 58' N; 76° 31' E), India. Bacillus pumilus 16S rRNA gene sequences were used to generate CGR, FCGR and PCA. These can be used for visual comparison and evaluation respectively. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. This generated digital data helps to evaluate and compare any Bacillus pumilus strain, minimizes laboratory efforts and avoid misinterpretation of the species.

Entities:  

Keywords:  Alkaline environment; Alkalophiles; Bacillus signatures; Lonar Crator Lake; Soda Lake

Year:  2016        PMID: 27141529      PMCID: PMC4838933          DOI: 10.1016/j.dib.2016.03.103

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the data Digital information on Bacillus pumilus isolated from Lonar Crator Lake has enormous biotechnological applications. Generated digital information appears to be white snow for quick identification and comparison of Bacillus pumilus. This digitization of 16S RNA sequences of Bacillus pumilus from Lonar Crator Lake were carried out first time by us and made available to users. This generated digital reduces time and cost on identification and comparison of Bacillus pumilus.

Data

Raw data was obtained through NCBI׳s BioSample database. Digital data of each isolate i.e two quick response (QR) codes, Chaose Game Representation (CGR), Choase Game Representation Frequencies (FCGR) and Principal Component Analysis (PCA) of isolates made available on internet on a portal created by us https://sites.google.com/site/bhagwanrekadwad/ in downloadable format (Table 1, Fig. 1, Fig. 2, Fig. 3, Fig. 4, Fig. 5).
Table 1

QR codes of Bacillus pumilus isolated from Lonar Crator Lake, India

Fig. 1

GBIF map for distribution of Bacillus pumilis: yellow dots indicate places where researcher who have identified Bacillus pumilus strains. Source: GBIF (1).

Fig. 2

Bacillus pumilus: Chaos Game representation (CGR) showing difference in base composition of DNA sequence.

Fig. 3

Bacillus pumilus: Chaos Game Representation of frequencies (FCGR).

Fig. 4

Evolutionary relationships among strains of Bacillus pumilus isolated from Lonar Crator Lake, India. The evolutionary history was inferred using the Neighbor-Joining method [9]. The bootstrap consensus tree inferred from 1000 replicates is taken to represent the evolutionary history of the taxa analyzed [10]. Branches corresponding to partitions reproduced in less than 50% bootstrap replicates are collapsed. The evolutionary distances were computed using the Maximum Composite Likelihood method [11] and are in the units of the number of base substitutions per site. The analysis involved 65 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 591 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [12].

Fig. 5

Principal component analysis (PCA) of isolates.

Experimental design, materials and methods

16S rRNA sequences (full gene bank and FASTA) were retrieved from NCBI BioSample database [1], [2]. DNA BarID downloaded from NEERI-CSIR, Nagpur website [3]. The QR codes for Bacillus pumilus from full NCBI Genebank sequences and FASTA sequences were generated which do not resembles with any other species or strains in any database. The generated data were compared with other visual techniques such as CGR and FCGR. The phylogenetic tree was constructed using MEGA6. Phylogenetic tree was constructed by the neighbor-joining method using a distance matrix from the alignment. Tree files were generated by PHYLIP and viewed using TREEVIEW program. Bootstrap analysis was also carried out to know the evolutionary history of bacteria. PCA were performed for comparative analysis [4], [5], [6], [7], [8].

Portal for QR codes

The QR codes of Bacillus pumilus were available to any user on a portal https://sites.google.com/site/bhagwanrekadwad/.
Subject areaMicrobiology
More specific subject areaMicrobial diversity Informatics
Type of dataText file, sequences, table, figures QR Codes, CGR, FCGR, NJ plot and PCA.
How data was acquiredThrough NCBI repository
Data formatRaw and analyzed
Experimental factors16S rRNA sequences were used for creation of digital information
Experimental features16SrRNA gene sequences were used to create QR codes using DNA BarID software.
Data source locationLonar soda lake (19° 58’ N; 76° 31’ E), Buldhana District, India.
Data accessibilityData is within this article.
  5 in total

1.  Prospects for inferring very large phylogenies by using the neighbor-joining method.

Authors:  Koichiro Tamura; Masatoshi Nei; Sudhir Kumar
Journal:  Proc Natl Acad Sci U S A       Date:  2004-07-16       Impact factor: 11.205

2.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

3.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

4.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

5.  Digital data for Quick Response (QR) codes of thermophiles to identify and compare the bacterial species isolated from Unkeshwar hot springs (India).

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2015-11-24
  5 in total
  8 in total

1.  Genomics dataset of unidentified disclosed isolates.

Authors:  Bhagwan N Rekadwad
Journal:  Data Brief       Date:  2016-06-15

2.  Genomics dataset on unclassified published organism (patent US 7547531).

Authors:  Mohammad Mahfuz Ali Khan Shawan; Md Ashraful Hasan; Md Mozammel Hossain; Md Mahmudul Hasan; Afroza Parvin; Salina Akter; Kazi Rasel Uddin; Subrata Banik; Mahbubul Morshed; Md Nazibur Rahman; S M Badier Rahman
Journal:  Data Brief       Date:  2016-10-05

3.  Genomic Analysis of a Marine Bacterium: Bioinformatics for Comparison, Evaluation, and Interpretation of DNA Sequences.

Authors:  Bhagwan N Rekadwad; Juan M Gonzalez; Chandrahasya N Khobragade
Journal:  Biomed Res Int       Date:  2016-11-01       Impact factor: 3.411

4.  Bioinformatics delimitation of the psychrophilic and psychrotolerant actinobacteria isolated from the Polar Frontal waters of the Southern Ocean.

Authors:  Palaniappan Sivasankar; Bhagwan Rekadwad; Subramaniam Poongodi; Kannan Sivakumar; Bhaskar Venkateswaran Parli; N Anil Kumar
Journal:  Data Brief       Date:  2018-03-08

5.  tRNA diversification among uncultured archeon clones.

Authors:  Mohammad Mahfuz Ali Khan Shawan; Md Ashraful Hasan; Raihana Yesmin; Tareq Hossan; Md Mozammel Hossain; Md Mahmudul Hasan; Afroza Parvin; Mahbubul Morshed; Nahiyan Mohammad Salauddin; Satya Ranjan Sarker; Md Nazibur Rahman; S M Badier Rahman
Journal:  Bioinformation       Date:  2018-07-31

6.  Bioinformatics data supporting revelatory diversity of cultivable thermophiles isolated and identified from two terrestrial hot springs, Unkeshwar, India.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-23

7.  Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-05-27

8.  Correcting names of bacteria deposited in National Microbial Repositories: an analysed sequence data necessary for taxonomic re-categorization of misclassified bacteria-ONE example, genus Lysinibacillus.

Authors:  Bhagwan N Rekadwad; Juan M Gonzalez
Journal:  Data Brief       Date:  2017-07-05
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.