| Literature DB >> 26664700 |
Stepan V Toshchakov1, Aleksei A Korzhenkov1, Nazar I Samarov1, Ilia O Mazunin1, Oleg I Mozhey1, Ilya S Shmyr2, Ksenia S Derbikova3, Evgeny A Taranov3, Irina N Dominova1, Elizaveta A Bonch-Osmolovskaya3, Maxim V Patrushev1, Olga A Podosokorskaya3, Ilya V Kublanov3.
Abstract
A strain of a hyperthermophilic filamentous archaeon was isolated from a sample of Kamchatka hot spring sediment. Isolate 1807-2 grew optimally at 85 °C, pH 6.0-6.5, the parameters being close to those at the sampling site. 16S rRNA gene sequence analysis placed the novel isolate in the crenarchaeal genus Thermofilum; Thermofilum pendens was its closest valid relative (95.7 % of sequence identity). Strain 1807-2 grew organothrophically using polysaccharides (starch and glucomannan), yeast extract or peptone as substrates. The addition of other crenarchaea culture broth filtrates was obligatory required for growth and could not be replaced by the addition of these organisms' cell wall fractions, as it was described for T. pendens. The genome of strain 1807-2 was sequenced using Illumina and PGM technologies. The average nucleotide identities between genome of strain 1807-2 and T. pendens strain HRK 5(T) and "T. adornatus" strain 1910b were 85 and 82 %, respectively. On the basis of 16S rRNA gene sequence phylogeny, ANI calculations and phenotypic differences we propose a novel species Thermofilum uzonense with the type strain 1807-2(T) (= DSM 28062(T) = JCM 19810(T)). Project information and genome sequence was deposited in Genbank under IDs PRJNA262459 and CP009961, respectively.Entities:
Keywords: Crenarchaeota; Kamchatka; Phylogeny; Polysaccharides hydrolysis; Thermofilum; hyperthermophile
Year: 2015 PMID: 26664700 PMCID: PMC4673724 DOI: 10.1186/s40793-015-0105-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Electron micrograph of Thermofilum uzonense strain 1807-2T. Bar, 0.5 μm. Electron micrograph of negatively stained cell of Thermofilum uzonense strain 1807-2T. Cells were grown on glucose containing medium at 85 °C and incubated for two days (late exponential phase of growth). Bar, 0.5 μm
Classification and general features of Thermofilum uzonense strain1807-2T [31]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Type strain 1807-2 | IDA | ||
| Gram stain | Not reported | ||
| Cell shape | Thin straight or curved rods | IDA | |
| Motility | Non motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | 70–90 °C | IDA | |
| Optimum temperature | 85 °C | IDA | |
| pH range; optimum | 5.5–7.0; 6.0–6.5 | IDA | |
| Carbon source | Yeast extract, peptone, starch, glucomannan | IDA | |
| Energy source | Yeast extract, peptone, starch, glucomannan | ||
| MIGS-6 | Habitat | Hot spring | |
| MIGS-6.3 | Salinity | 0–0.5 % NaCl (w/v). Optimum 0 %. | IDA |
| MIGS-22 | Oxygen | Anaerobe | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| Biosafety level | 1 | NAS | |
| Isolation | Water/sediment of hot spring, Uzon Caldera, Kamchatka | IDA | |
| MIGS-4 | Geographic location | Uzon Caldera, Kamchatka, Far-East Russia | IDA |
| MIGS-5 | Sample collection time | 2008 | IDA |
| MIGS-4.1 MIGS-4.2 | Latitude | 54 30.382 | IDA |
| Longitude | 160 00.103 | IDA | |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 663 m | IDA |
a Evidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [38]
Fig. 216S rRNA gene Maximal-likelihood phylogenetic tree of representatives of Thermofilaceae family. The tree was constructed using the Maximum Likelihood method based on the Tamura-Nei model [39]. The tree with the highest log likelihood (−5739.3765) is shown. The percentage of trees in which the associated taxa clustered together (bootstrap test of 1000 replications) is shown next to the nodes. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach. A discrete Gamma distribution was used to model evolutionary rate differences among sites (4 categories (+G, parameter = 0.2411)). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 38 nucleotide sequences; all were longer than 1300 nucleotides. All positions containing gaps and missing data were eliminated. There was a total of 1.227 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [40]. Thermoproteaceae branch includes 13 sequences (Additional file 5). Cultivated strains are underlined. Thermofilum uzonense strain 1807-2T is in bold. Complete 16S rRNA gene of Desulfurococcus kamchatkensis (NC_011766.1, Desulfurococcales order) was chosen as an out-group. Bar, 2 substitutions per 100 nucleotides
Project information and its association with MIGS version 2.0 compliance [31]
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina Nextera fragment library (insert mean length of 175 bp) |
| PGM mate-paired library (insert mean length of 2400 bp) | ||
| MIGS 29 | Sequencing platforms | Ion Torrent, Illumina, Sanger |
| MIGS 31.2 | Fold coverage | Ion Torrent 13.6 × |
| Illumina 180.4 × | ||
| MIGS 30 | Assemblers | CLC Bio [ |
| MIGS 32 | Gene calling method | GeneMarKS+ |
| Locus Tag | MA03 | |
| Genbank ID | CP009961 | |
| GenBank Date of Release | April 1, 2015 | |
| GOLD ID | Gp0108853 | |
| BIOPROJECT | PRJNA262459 | |
| MIGS 13 | Source Material Identifier | DSM 28062 |
| Project relevance | Evolution, Diversity |
Genome statistics
| Attribute | Number of genes | % of Total genesa |
|---|---|---|
| Genome size (bp) | 1,611,988 | 100.00 |
| DNA codingb (bp) | 1,270,203 | 78.8 |
| DNA G+C (bp) | 772,789 | 47.9 |
| DNA scaffolds | 1 | 100.00 |
| Total genes | 1697 | 100.00 |
| Protein coding genes | 1455 | 85.7 |
| RNA genes | 50 | 3.0 |
| Pseudo genes | 192 | 11.3 |
| Genes in internal clusters | NDc | NDc |
| Genes with function prediction | 1334 | 78.6 |
| Genes assigned to COGs | 1389 | 81.9 |
| Genes with Pfam domains | 1066 | 62.8 |
| Genes with signal peptides | 343 | 20.2 |
| Genes with transmembrane helices | 129 | 7.6 |
| CRISPR repeats | 2 | – |
a) The total is based on either the size of the genome in base pairs or the total number of genes in the annotated genome
b) Corresponding to functional CDS. Pseudogenes are not included in this calculation
c) ND not determined
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 146 | 10.0 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 81 | 5.6 | Transcription |
| L | 81 | 5.6 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 4 | 0.3 | Cell cycle control, Cell division, chromosome partitioning |
| V | 79 | 5.4 | Defense mechanisms |
| T | 19 | 1.3 | Signal transduction mechanisms |
| M | 35 | 2.4 | Cell wall/membrane biogenesis |
| N | 12 | 0.8 | Cell motility |
| U | 13 | 0.9 | Intracellular trafficking and secretion |
| O | 63 | 4.3 | Posttranslational modification, protein turnover, chaperones |
| C | 116 | 8.0 | Energy production and conversion |
| G | 121 | 8.3 | Carbohydrate transport and metabolism |
| E | 122 | 8.4 | Amino acid transport and metabolism |
| F | 48 | 3.3 | Nucleotide transport and metabolism |
| H | 69 | 4.7 | Coenzyme transport and metabolism |
| I | 24 | 1.6 | Lipid transport and metabolism |
| P | 79 | 5.4 | Inorganic ion transport and metabolism |
| Q | 6 | 0.4 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 216 | 14.8 | General function prediction only |
| S | 55 | 3.8 | Function unknown |
| – | 436 | 30.0 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Circular map of Thermofilum uzonense strain1807-2T generated with CIRCOS [41]. From outside to inside: positive strand CDSs colored by COG FCs (Clusters of Orthologous Genes Functional Categories), negative strand CDSs colored by COG FCs, RNA-genes and CRISPRs (tRNA – purple; rRNA – blue; riboswitches – green, CRISPR repeats - yellow), G+C-content, GC-skew. On the right – COG functional categories color codes