| Literature DB >> 26442136 |
Phillip J Brumm1, Miriam L Land2, David A Mead1.
Abstract
Geobacillus thermoglucosidasius C56-YS93 was one of several thermophilic organisms isolated from Obsidian Hot Spring, Yellowstone National Park, Montana, USA under permit from the National Park Service. Comparison of 16 S rRNA sequences confirmed the classification of the strain as a G. thermoglucosidasius species. The genome was sequenced, assembled, and annotated by the DOE Joint Genome Institute and deposited at the NCBI in December 2011 (CP002835). The genome of G. thermoglucosidasius C56-YS93 consists of one circular chromosome of 3,893,306 bp and two circular plasmids of 80,849 and 19,638 bp and an average G + C content of 43.93 %. G. thermoglucosidasius C56-YS93 possesses a xylan degradation cluster not found in the other G. thermoglucosidasius sequenced strains. This cluster appears to be related to the xylan degradation cluster found in G. stearothermophilus. G. thermoglucosidasius C56-YS93 possesses two plasmids not found in the other two strains. One plasmid contains a novel gene cluster coding for proteins involved in proline degradation and metabolism, the other contains a collection of mostly hypothetical proteins.Entities:
Keywords: Biomass; Geobacillus thermoglucosidasius C56-YS93; Hot springs; Prophage; Xylan
Year: 2015 PMID: 26442136 PMCID: PMC4593210 DOI: 10.1186/s40793-015-0031-z
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Geobacillus thermoglucosidasius C56-YS93 [48]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain C56-YS93 | |||
| Gram stain | Positive | IDA | |
| Cell shape | Rods and chains of rods | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Subterminal spores | IDA | |
| Temperature range | 55 to 75 °C | IDA | |
| Optimum temperature | 65 °C | IDA | |
| pH range; Optimum | 5.8–8.0; 7.5 | IDA | |
| Carbon source | Carbohydrate or protein | IDA | |
| GS-6 | Habitat | Hot spring | |
| MIGS-6.3 | Salinity | Not reported | IDA |
| MIGS-22 | Oxygen requirement | Facultative anaerobe | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Non-pathogen | IDA |
| MIGS-4 | Geographic location | Obsidian Spring, Yellowstone National Park | IDA |
| MIGS-5 | Sample collection | September 2003 | IDA |
| MIGS-4.1 | Latitude | 44.6603028 | TAS [ |
| MIGS-4.2 | Longitude | −110.865194 | TAS [ |
| MIGS-4.4 | Altitude | 2416 m | TAS [ |
aEvidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from http://www.geneontology.org/GO.evidence.shtml of the Gene Ontology project [51]
Fig. 1Micrograph of Geobacillus thermoglucosidasius C56-YS93 cells showing individual cells and clumps of cells. Cells were grown in TSB plus 0.4 % glucose for 18 h. at 70 °C. A 1.0 ml aliquot was removed, centrifuged, re-suspended in 0.2 ml of sterile water, and stained using a 50 μM solution of SYTO® 9 fluorescent stain in sterile water (Molecular Probes). Dark field fluorescence microscopy was performed using a Nikon Eclipse TE2000-S epifluorescence microscope at 2000× magnification using a high-pressure Hg light source and a 500 nm emission filter
Fig. 2Molecular phylogenetic analysis by Maximum Likelihood method as detailed in the Material and Methods section. The tree with the highest log likelihood (−3014.19) is shown. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The novel sequenced Geobacillus strains are indicated in bold. The type strains of all validly described species are included (NCBI accession numbers): G. caldoxylolyticus ATCC700356T (AF067651), G. galactosidasius CF1BT (AM408559), G. jurassicus DS1T (FN428697), G. kaustophilus NCIMB8547T (X60618), G. lituanicus N-3T (AY044055), G. stearothermophilus R-35646T (FN428694), G. subterraneus 34 T (AF276306), G. thermantarcticus DSM9572T(FR749957), G. thermocatenulatus BGSC93A1T (AY608935), G. thermodenitrificans R-35647T (FN538993), G. thermoglucosidasius BGSC95A1T (FN428685), G. thermoleovoransDSM5366T (Z26923), G. toebii BK-1T (FN428690), G. uzenensis UT (AF276304) and G. vulcani 3S-1T (AJ293805). The 16S rRNA sequence of Paenibacillus lautusJCM9073T (AB073188) was used to root the tree
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS 28 | Libraries used | 6 and 24 kb |
| MIGS 29 | Sequencing platforms | 454 Titanium, Illumina GAii |
| MIGS 31.2 | Fold coverage | 5.8 |
| MIGS 30 | Assemblers | Phred/Phrap/Consed |
| MIGS 32 | Gene calling method | Prodigal, GenePRIMP |
| Locus tag | GEOTH | |
| Genbank ID | CP002835.1 | |
| Genbank date of release | Dec. 1, 2011 | |
| GOLD ID | Gc01858 | |
| BIOPROJECT | PRJNA40781 | |
| MIGS 13 | Project relevance | Biotechnological |
| Source material identifier | Genome |
Summary of genome: one chromosome and two plasmids
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 3.65 | Circular | CP002050.1 | NC_14206.1 |
| Plasmid 1 | 0.081 | Circular | CP002836.1 | NC_015665 |
| Plasmid 2 | 0.020 | Circular | CP002837.1 | NC_015661 |
Fig. 3Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories) Genes on reverse strand (color by COG categories) RNA genes (tRNAs green, rRNAs red, other RNAs black) GC content, GC skew
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 3,993,793 | 100 |
| DNA coding (bp) | 3,437,131 | 86 |
| DNA G+C (bp) | 1,754,637 | 44 |
| DNA Scaffolds | 3 | 100 |
| Total genes | 4,135 | 100 |
| Protein-coding genes | 4,014 | 97 |
| RNA genes | 121 | 3 |
| Pseudo genes | 255 | 6 |
| Genes in internal clusters | 1,984 | 48 |
| Genes with function prediction | 1,257 | 30 |
| Genes assigned to COGs | 2,607 | 63 |
| Genes with Pfam domains | 3,278 | 79 |
| Genes with signal peptides | 161 | 4 |
| Genes with transmembrane helices | 948 | 23 |
| CRISPR repeats | 6 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with general COG functional categories
| Code | Value | Percent | Description |
|---|---|---|---|
| J | 156 | 5.4 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 195 | 6.8 | Transcription |
| L | 208 | 7.2 | Replication, recombination and repair |
| B | 1 | 0.03 | Chromatin structure and dynamics |
| D | 30 | 1.0 | Cell cycle control, cell division, chromosome partitioning |
| V | 44 | 1.5 | Defense mechanisms |
| T | 116 | 4.0 | Signal transduction mechanisms |
| M | 103 | 3.6 | Cell wall/membrane/envelope biogenesis |
| N | 61 | 2.1 | Cell motility |
| U | 48 | 1.7 | Intracellular trafficking, secretion, and vesicular transport |
| O | 100 | 3.5 | Posttranslational modification, protein turnover, chaperones |
| C | 207 | 7.2 | Energy production and conversion |
| G | 194 | 6.7 | Carbohydrate transport and metabolism |
| E | 285 | 9.9 | Amino acid transport and metabolism |
| F | 72 | 2.5 | Nucleotide transport and metabolism |
| H | 132 | 4.6 | Coenzyme transport and metabolism |
| I | 105 | 3.6 | Lipid transport and metabolism |
| P | 160 | 5.6 | Inorganic ion transport and metabolism |
| Q | 71 | 2.5 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 330 | 11.5 | General function prediction only |
| S | 261 | 9.1 | Function unknown |
| 1528 | 37.0 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome
G. thermoglucosidasius strains used in comparisons
|
| C56-YS93 | M10EXG | TNO-09.020 |
|---|---|---|---|
| Chromosome size | 3,993,793 | 3,773,252 | 3,740,238 |
| Plasmids | 2 | 0 | 0 |
| Protein coding genes | 4326 | 4228 | 4164 |
| Isolation source | Hot Spring, YNP, United States | Sydney, New South Wales, Australia | Dairy production, Netherlands |
Xylan degradation cluster of Geobacillus thermoglucosidasius C56-YS93
| Annotated protein function | C56-YS93 | Bst T-6 | |
|---|---|---|---|
| 1 | Integral membrane sensor signal transduction histidine kinase | Geoth_2272 | xynD |
| 2 | AraC family transcriptional regulator | Geoth_2271 | xynC |
| 3 | Family 1 extracellular solute-binding protein | Geoth_2270 | xynE |
| 4 | Binding-protein-dependent transporters inner membrane component | Geoth_2269 | xynF |
| 5 | Binding-protein-dependent transporters inner membrane component | Geoth_2268 | xynG |
| 6 | Aldose 1-epimerase | Geoth_2267 | araK |
| 7 | Polysaccharide deacetylase | Geoth_2266 | axe1 |
| 8 | Xylan 1,4-beta-xylosidase | Geoth_2265 | xynB2 |
| 9 | Endo-1,4-beta-xylanase | Geoth_2264 | xynA2 |
| 10 | Family 1 extracellular solute-binding protein | Geoth_2262 | aguE |
| 11 | Binding-protein-dependent transporters inner membrane component | Geoth_2261 | aguF |
| 12 | Binding-protein-dependent transporters inner membrane component | Geoth_2260 | aguG |
| 13 | Alpha-glucuronidase | Geoth_2259 | aguA |
| 14 | Xylan 1,4-beta-xylosidase | Geoth_2258 | xynB |
| 15 | PfkB domain-containing protein | Geoth_2257 | kdgK |
| 16 | 2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase | Geoth_2256 | kgdA |
| 17 | GntR family transcriptional regulator | Geoth_2255 | uxuR |
| 18 | Uronate isomerase | Geoth_2254 | uxaC |
| 19 | Mannonate dehydratase | Geoth_2253 | uxuA |
| 20 | Short-chain dehydrogenase/reductase SDR | Geoth_2252 | uxuB |
| 21 | Hypothetical protein | Geoth_2251 | orfA |
| 22 | Endo-1,4-beta-xylanase | Geoth_2250 | xynA1 |
| 23 | Hypothetical protein | Geoth_2247 | xynX |
| 24 | G-D-S-L family lipolytic protein | Geoth_2246 | axe2 |
| 25 | Xylose isomerase | Geoth_2243 | xylA |
| 26 | Xylulokinase | Geoth_2242 | xylB |
Fig. 4Prophage genes identified in G. thermoglucosidasius C56-YS93 using PHAST genome search software