| Literature DB >> 27471502 |
Kian Mau Goh1, Kok-Gan Chan2, Soon Wee Lim1, Kok Jun Liew1, Chia Sing Chan1, Mohd Shahir Shamsir1, Robson Ee2, Tan-Guan-Sheng Adrian2.
Abstract
A bacterial strain, designated RA, was isolated from water sample of a hot spring on Langkawi Island of Malaysia using marine agar. Strain RA is an aerophilic and thermophilic microorganism that grows optimally at 50-60°C and is capable of growing in marine broth containing 1-10% (w/v) NaCl. 16S rRNA gene sequence analysis demonstrated that this strain is most closely related (<90% sequence identity) to Rhodothermaceae, which currently comprises of six genera: Rhodothermus (two species), Salinibacter (three species), Salisaeta (one species), Rubricoccus (one species), Rubrivirga (one species), and Longimonas (one species). Notably, analysis of average nucleotide identity (ANI) values indicated that strain RA may represent the first member of a novel genus of Rhodothermaceae. The draft genome of strain RA is 4,616,094 bp with 3630 protein-coding gene sequences. Its GC content is 68.3%, which is higher than that of most other genomes of Rhodothermaceae. Strain RA has genes for sulfate permease and arylsulfatase to withstand the high sulfur and sulfate contents of the hot spring. Putative genes encoding proteins involved in adaptation to osmotic stress were identified which encode proteins namely Na(+)/H(+) antiporters, a sodium/solute symporter, a sodium/glutamate symporter, trehalose synthase, malto-oligosyltrehalose synthase, choline-sulfatase, potassium uptake proteins (TrkA and TrkH), osmotically inducible protein C, and the K(+) channel histidine kinase KdpD. Furthermore, genome description of strain RA and comparative genome studies in relation to other related genera provide an overview of the uniqueness of this bacterium.Entities:
Keywords: Longimonas; Rhodothermaceae; Rhodothermus; Rubricoccus; Rubrivirga; Salinibacter; Salisaeta; strain RA
Year: 2016 PMID: 27471502 PMCID: PMC4943939 DOI: 10.3389/fmicb.2016.01109
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
General features and information regarding the sequencing of the strain RA genome, according to the MIGS (Minimum information about a genome sequence) recommendations.
| Gram stain | Negative |
| Cell shape | Rod |
| Pigmentation | Cream |
| Motility | Motile |
| Temperature range; optimum | 37−60°C; 50−60°C |
| Salinity range; optimum | 1−10.0% NaCl (w/v); 1−5% NaCl (w/v) |
| pH range; optimum | 5.0−9.0; 7.6 |
| Carbon source | Varied |
| Submitted_to_insdc | GenBank |
| investigation_type | bacteria_archaea |
| project_name | |
| experimental_factor | NA |
| lat_lon | NA |
| Depth | NA |
| geo_loc_name | Langkawi, Malaysia |
| collection_date | 2014-10-10 |
| env_biome | Hot spring |
| env_feature | Water |
| env_material | Water with high NaCl content |
| env_package | Missing |
| num_replicons | NA |
| ref_biomaterial | NA |
| source_mat_id | NA |
| Pathogenicity | NA |
| biotic_relationship | Free-living |
| trophic_level | Chemoheterotroph |
| rel_to_oxygen | Aerobic |
| isol_growth_condt | Marine agar, pH 7.6, 50°C |
| assembly_name | CLC bio CLC Genomics Workbench version 7.5 |
| finishing_strategy | |
| sequencing Technology | HiSeq 2500 |
General data and comparison of strain RA and other .
| Taxonomy ID | 1,779,382 | 518,766 | 309,807 | 1,089,550 | 716,817 | 1,196,024 | n.a |
| Colony color | Cream | Red | Red | Red | Red | Pale-red | Red |
| Cell shape | Rod | Rod | Rod | Rod | Cocci | Rod | Rod |
| NaCl range (%; w/v) | 1−10 | 2−6 | 11−30 | 5−20 | 1−5 | 1−5 | 4−25 |
| Temperature range | 37−60 | 5−77 | 37−47 | 37−46 | 5−37 | 10−37 | 20−50 |
| pH range | 5−9 | 7.0 | 6.5−8.0 | 6.5−8.5 | 5.0−9.0 | 6.0−9.0 | 6.5−8.5 |
| DNA GC% | 68.3 (genome) | 64.3 (genome) | 66.5 (HPLC); 66.2 (genome) | 62.9 (HPLC); 63.5 (genome) | 68.9 (HPLC) | 64.8−65.8 (HPLC) | 61.5 (HPLC) |
| 16S rRNA similarity to strain RA (%) | 100 | 89.35 | 86.50 | 88.83 | 88.6 | 88.78 | 88.14 |
| Genome size (bp) | 4,616,094 | 3,386,737 | 3,587,328 | 3,192,765 | – | – | – |
| Predicted coding DNA sequences | 3680 | 2914 | 3086 | 2833 | – | – | – |
| Genome contigs no. | 91 | 1 | 1 | 3 | – | – | – |
| ANI against strain RA (%) | 100 | 73.28 | 71.85 | 71.47 | – | – | – |
| 16S rRNA accession number | |||||||
| Genome DDBJ/EMBL/GenBank accession number | – | – | – | ||||
| GOLD ID | Gp0127843 | Gp0001173 | Gp0000409 | Gp0015133 | – | – | – |
| PATRIC ID | 1100069 | 518766 | 309807 | 1089550 | – | – | – |
| KEGG complete genome T number | – | T01119 | T00312 | – | – | – | – |
| References | This study | Alfredsson et al., | Antón et al., | Vaisman and Oren, | Park et al., | Park et al., | Xia et al., |
Figure 1Phylogenetic tree of strain RA and other . Pontibacter xinjiangensis NNRL B-51335T was selected as the out group (https://figshare.com/s/4f470ab9ce3727bf4392).
Genome statistics of strain RA.
| DNA, total number of bases | 4,616,094 | 100.00 |
| DNA coding number of bases | 4,160,472 | 90.13 |
| DNA G+C number of bases | 3,152,009 | 68.28 |
| DNA scaffolds | 61 | 100.00 |
| CRISPR count | 5 | |
| Genes total number | 3680 | 100.00 |
| Protein coding genes | 3630 | 98.64 |
| RNA genes | 50 | 1.36 |
| rRNA genes | 3 | 0.08 |
| 5S rRNA | 1 | 0.03 |
| 16S rRNA | 1 | 0.03 |
| 23S rRNA | 1 | 0.03 |
| tRNA genes | 44 | 1.20 |
| Other RNA genes | 3 | 0.08 |
| Protein coding genes with function prediction | 2815 | 76.49 |
| Without function prediction | 815 | 22.15 |
| Protein coding genes with enzymes | 897 | 24.38 |
| w/o enzymes but with candidate KO based enzymes | 50 | 1.36 |
| Protein coding genes connected to Transporter Classification | 365 | 9.92 |
| Protein coding genes connected to KEGG pathways3 | 912 | 24.78 |
| Not connected to KEGG pathways | 2718 | 73.86 |
| Protein coding genes connected to KEGG Orthology (KO) | 1587 | 43.12 |
| Not connected to KEGG Orthology (KO) | 2043 | 55.52 |
| Protein coding genes connected to MetaCyc pathways | 777 | 21.11 |
| Not connected to MetaCyc pathways | 2853 | 77.53 |
| Protein coding genes with COGs | 2222 | 60.38 |
| With KOGs | 713 | 19.38 |
| With Pfam | 2905 | 78.94 |
| With TIGRfam | 1144 | 31.09 |
| With InterPro | 1900 | 51.63 |
| With IMG Terms | 611 | 16.60 |
| With IMG Pathways | 231 | 6.28 |
| With IMG Parts List | 246 | 6.68 |
| In internal clusters | 734 | 19.95 |
| In chromosomal cassette | 3633 | 98.72 |
| Chromosomal cassettes | 309 | - |
| Biosynthetic clusters | 14 | - |
| Genes in biosynthetic clusters | 236 | 6.41 |
| Fused protein coding genes | 93 | 2.53 |
| Protein coding genes coding signal peptides | 527 | 14.32 |
| Protein coding genes coding transmembrane proteins | 881 | 23.94 |
| COG clusters | 1399 | 62.96 |
| KOG clusters | 485 | 21.83 |
| Pfam clusters | 1958 | 67.40 |
| TIGRfam clusters | 916 | 80.07 |
Figure 2Percentage of CDS involved in COG classifications for strain RA, . B, Chromatin structure and dynamics; L, replication, recombination, and repair; A, RNA processing and modification; K, transcription; J, translation, ribosomal structure, and biogenesis; D, cell cycle control, cell division, chromosome partitioning; N, cell motility; M, cell wall/membrane/envelope biogenesis; Z, cytoskeleton; V, defense mechanisms; W, extracellular structures; U, intracellular trafficking, secretion, and vesicular transport; X, mobilome, prophages, transposons; O, post-translational modification, protein turnover, chaperones; T, Signal transduction mechanisms; E, amino acid transport and metabolism; G, carbohydrate transport and metabolism; H, coenzyme transport and metabolism; C, energy production and conversion; P, inorganic ion transport and metabolism; I, lipid transport and metabolism; F, nucleotide transport and metabolism; Q, secondary metabolites biosynthesis, transport and catabolism; S, function unknown; and R, general function prediction only (https://figshare.com/s/8a2ccd9f2b7aa439f040).