Literature DB >> 27331105

Determination of GC content of Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains: A GC dataset for higher level hierarchical classification.

Bhagwan N Rekadwad1, Chandrahasya N Khobragade1.   

Abstract

A total of 16 strains of hyperthermophilic Thermotoga complete genome sequences viz. Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916, & NC_011978) and Thermotoga thermarum (CP002351 & NC_015707) complete genome sequences were retrieved from NCBI BioSample database. ENDMEMO GC used for creation of data on GC content in Thermotoga sp. DNA sequences. Maximum GC content was observed in Thermotoga strains AE000512 & NC_000853 (69 %GC), followed by NZ_CP011108, CP011108, NZ_CP011107, NC_023151, NC_021214, CP011107 & CP004077 (68.5 %GC), followed by NZ_CP010967 & CP010967 (68.3 %GC), followed by CP000916, CP007013 & NC_011978 (68 %GC), followed by CP002351 & NC_015707 (67 %GC) strains. The use of GC dataset ratios helps in higher level hierarchical classification in Bacterial Systematics in addition to phenotypic and other genotypic characters.

Entities:  

Keywords:  ENDMEMO; GC content; Hyperthermophiles; New digital data; Whole genome

Year:  2016        PMID: 27331105      PMCID: PMC4906128          DOI: 10.1016/j.dib.2016.05.045

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data Data provides information of the GC content of hyperthermophilic Thermotoga species. This data would be valuable for quantitative analysis of newly isolated Thermotoga maritima, Thermotoga neapolitana and Thermotoga thermarum strains. This data would be valuable for higher level hierarchical classification of Thermotoga strains in Bacterial Systematics.

Data

This paper contains data on GC percentage of 16 strains of hyperthermophilic Thermotoga. Complete genome sequences of Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916, & NC_011978) and Thermotoga thermarum (CP002351 & NC_015707) were downloaded from NCBI nuccore. Maximum, minimum and average GC percent of 16 Thermotoga strains were determined and digitised using ENDMEMO GC calculating and GC plotting tool. See also NCBI repository http://www.ncbi.nlm.nih.gov/nuccore.

Experimental design, materials and methods

Complete genome sequences of Thermotoga maritima (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967 & NZ_CP010967), Thermotoga neapolitana (CP000916, & NC_011978) and Thermotoga thermarum (CP002351 & NC_015707) strains were downloaded from NCBI nuccore in FASTA format. Using ENDMEMO GC calculating and GC plotting tool was used to determine the exact minimum, maximum and average GC in percent in the complete genome sequence of 16 Thermotoga strains (Table 1). ENDMEMO GC plotter showed a pattern of GC distribution in complete DNA sequence showed through graphical representations in Figs. 1–16 (See supplementary Figs. 1–16). Upper and lower red line indicate maximum and minimum percentage of GC content distribution in complete DNA sequence, while middle blue line indicates average GC percentage [1], [2], [3], [4], [5].
Table 1

GC percentage in complete DNA sequences of hyper-thermophile Thermotoga strains.

SNSpeciesAccession numbersMaximum GC %Minimum GC %Average GC %Supplementary Figs.
s1Thermotoga maritima MSB8AE0005126921.546.2Fig. 1
2Thermotoga maritima MSB8CP00407768.52146.2Fig. 2
3Thermotoga maritima MSB8CP007013682346.2Fig. 3
4Thermotoga maritima MSB8CP01110768.52146.3Fig. 4
5Thermotoga maritima MSB8NC_000853692146.2Fig. 5
6Thermotoga maritima MSB8NC_02121468.52146.2Fig. 6
7Thermotoga maritima MSB8NC_02315168.52146.2Fig. 7
8Thermotoga maritima MSB8NZ_CP01110768.52146.3Fig. 8
9Thermotoga maritima strain Tma100CP01110868.52146.2Fig. 9
10Thermotoga maritima strain Tma100NZ_CP01110868.52146.2Fig. 10
11Thermotoga maritima strain Tma200CP01096768.32146.3Fig. 11
12Thermotoga maritima strain Tma200NZ_CP01096768.32146.3Fig. 12
13Thermotoga neapolitana DSM 4359CP000916682546.5Fig. 13
14Thermotoga neapolitana DSM 4359NC_011978682546.9Fig. 14
15Thermotoga thermarum DSM 5069CP0023516723.540.3Fig. 15
16Thermotoga thermarum DSM 5069NC_0157076723.540.3Fig. 16

Legends to Figs. 1–16

Figs. 1–16 GC percentage in complete DNA sequences of hyperthermophile Thermotoga strains (AE000512, CP004077, CP007013, CP011107, NC_000853, NC_021214, NC_023151, NZ_CP011107, CP011108, NZ_CP011108, CP010967, NZ_CP010967, CP000916, NC_011978, CP002351 and NC_015707).
Subject areaLife Sciences
More specific subject areaExtremophiles, Microbiology, Bacterial Systematics, Bioinformatics
Type of dataTable, graphical representations
How data was acquiredThrough NCBI BioSample database
Data formatRaw and Analyzed
Experimental factorsGC content were determined using bioinformatics tool
Experimental featuresComplete genome sequences were used
Data source locationBioinformatics Research Laboratory, School of Life Sciences, S. R. T. M. University, Nanded, India
Data accessibilityData available within article and via the NCBI repositoryhttp://www.ncbi.nlm.nih.gov/nuccore.
  5 in total

1.  Digital data for Quick Response (QR) codes of thermophiles to identify and compare the bacterial species isolated from Unkeshwar hot springs (India).

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2015-11-24

2.  Bioinformatics data supporting revelatory diversity of cultivable thermophiles isolated and identified from two terrestrial hot springs, Unkeshwar, India.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-23

3.  Data on true tRNA diversity among uncultured and bacterial strains.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-26

4.  Digital data for quick response (QR) codes of alkalophilic Bacillus pumilus to identify and to compare bacilli isolated from Lonar Crator Lake, India.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-09

5.  Digital data of quality control strains under general deposit at Microbial Culture Collection (MCC), NCCS, Pune, India: A bioinformatics approach.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-26
  5 in total
  5 in total

1.  Genomics dataset of unidentified disclosed isolates.

Authors:  Bhagwan N Rekadwad
Journal:  Data Brief       Date:  2016-06-15

2.  Genomics dataset on unclassified published organism (patent US 7547531).

Authors:  Mohammad Mahfuz Ali Khan Shawan; Md Ashraful Hasan; Md Mozammel Hossain; Md Mahmudul Hasan; Afroza Parvin; Salina Akter; Kazi Rasel Uddin; Subrata Banik; Mahbubul Morshed; Md Nazibur Rahman; S M Badier Rahman
Journal:  Data Brief       Date:  2016-10-05

3.  Bioinformatics delimitation of the psychrophilic and psychrotolerant actinobacteria isolated from the Polar Frontal waters of the Southern Ocean.

Authors:  Palaniappan Sivasankar; Bhagwan Rekadwad; Subramaniam Poongodi; Kannan Sivakumar; Bhaskar Venkateswaran Parli; N Anil Kumar
Journal:  Data Brief       Date:  2018-03-08

4.  Digital data of quality control strains under general deposit at Microbial Culture Collection (MCC), NCCS, Pune, India: A bioinformatics approach.

Authors:  Bhagwan N Rekadwad; Chandrahasya N Khobragade
Journal:  Data Brief       Date:  2016-04-26

5.  Correcting names of bacteria deposited in National Microbial Repositories: an analysed sequence data necessary for taxonomic re-categorization of misclassified bacteria-ONE example, genus Lysinibacillus.

Authors:  Bhagwan N Rekadwad; Juan M Gonzalez
Journal:  Data Brief       Date:  2017-07-05
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.