| Literature DB >> 26925197 |
En-Min Zhou1, Senthil K Murugapiran2, Chrisabelle C Mefferd2, Lan Liu3, Wen-Dong Xian4, Yi-Rui Yin4, Hong Ming4, Tian-Tian Yu4, Marcel Huntemann5, Alicia Clum5, Manoj Pillay5, Krishnaveni Palaniappan5, Neha Varghese5, Natalia Mikhailova5, Dimitrios Stamatis5, T B K Reddy5, Chew Yee Ngan5, Chris Daum5, Nicole Shapiro5, Victor Markowitz5, Natalia Ivanova5, Alexander Spunde5, Nikos Kyrpides5, Tanja Woyke5, Wen-Jun Li6, Brian P Hedlund7.
Abstract
Thermus amyloliquefaciens type strain YIM 77409(T) is a thermophilic, Gram-negative, non-motile and rod-shaped bacterium isolated from Niujie Hot Spring in Eryuan County, Yunnan Province, southwest China. In the present study we describe the features of strain YIM 77409(T) together with its genome sequence and annotation. The genome is 2,160,855 bp long and consists of 6 scaffolds with 67.4 % average GC content. A total of 2,313 genes were predicted, comprising 2,257 protein-coding and 56 RNA genes. The genome is predicted to encode a complete glycolysis, pentose phosphate pathway, and tricarboxylic acid cycle. Additionally, a large number of transporters and enzymes for heterotrophy highlight the broad heterotrophic lifestyle of this organism. A denitrification gene cluster included genes predicted to encode enzymes for the sequential reduction of nitrate to nitrous oxide, consistent with the incomplete denitrification phenotype of this strain.Entities:
Keywords: Denitrification; Hot springs; Thermophiles; Thermus; Thermus amyloliquefaciens
Year: 2016 PMID: 26925197 PMCID: PMC4769583 DOI: 10.1186/s40793-016-0140-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Maximum-likelihood phylogenetic tree of the genus Thermus to highlight the position of Thermus amyloliquefaciens strain YIM 77409T. The tree was reconstructed based on 1374 aligned positions that remained after the application of the Lane mask to the 16S rRNA gene sequences using MEGA 5.0 [54]. Complete deletion of gaps and missing data and Kimura’s two-parameter model was applied. Bootstrap analysis was based on 1000 resamplings. Nodes supported in >75 % (black circles) or >50 % (grey circles) of bootstrap pseudoreplicates (1000 resamplings) for both maximum-likelihood and neighbor-joining methods are indicated. Bar, 0.02 changes per nucleotide. The number of genomes available for each species is included in parentheses (see Table 5) and the asterisk indicates that the genome of the type strain is available. The 16S rRNA gene sequences from Marinithermus hydrothermalis T1T/AB079382 and Rhabdothermus arcticus 2 M70-1T/HM856631 were used as outgroups
Comparison of basic genome features of Thermus strains
| Genome Name | Status | Genome Size (Mb) | GC Content (%) | Gene Count | No. of protein coding genes with function prediction | Percentage (%) | No. of protein coding genes with COGs | Percentage (%) | IMG Genome ID |
|---|---|---|---|---|---|---|---|---|---|
|
| Draft | 2.16 | 67.4 | 2313 | 1839 | 79.5 | 1558 | 67.4 | 2579778517 |
|
| Finished | 2.36 | 64.9 | 2514 | 1878 | 74.7 | 1704 | 67.8 | 649633105 |
|
| Draft | 2.48 | 65.5 | 2643 | 2159 | 81.7 | 1808 | 68.4 | 2574179778 |
|
| Draft | 2.07 | 64.8 | 2305 | 1816 | 78.8 | 1484 | 64.4 | 2518645614 |
|
| Draft | 2.17 | 64.8 | 2321 | 1939 | 83.5 | 1654 | 71.3 | 2522572193 |
|
| Draft | 2.22 | 67.2 | 2327 | 1901 | 81.7 | 1646 | 70.7 | 2582581225 |
|
| Draft | 2.56 | 66.4 | 2750 | 2158 | 78.5 | 1818 | 66.1 | 2574179781 |
|
| Draft | 2.44 | 68.7 | 2672 | 2052 | 76.8 | 1704 | 63.8 | 2617270932 |
|
| Finished | 2.12 | 69.5 | 2302 | 1498 | 65.1 | 1550 | 67.3 | 637000323 |
|
| Finished | 2.31 | 69.0 | 2508 | 1984 | 79.1 | 1717 | 68.5 | 2508501108 |
|
| Finished | 2.30 | 68.7 | 2488 | 2024 | 81.4 | 1700 | 68.3 | 2505679077 |
|
| Finished | 2.13 | 69.4 | 2273 | 1517 | 66.7 | 1562 | 68.7 | 637000322 |
|
| Draft | 2.15 | 69.4 | 2366 | 1928 | 81.5 | 1603 | 67.8 | 2554235155 |
|
| Draft | 2.26 | 68.4 | 2470 | 1965 | 79.6 | 1654 | 67.0 | 2524614852 |
|
| Finished | 2.40 | 68.6 | 2548 | 2018 | 79.2 | 1735 | 68.1 | 2508501045 |
|
| Draft | 2.26 | 68.7 | 2409 | 1960 | 81.4 | 1700 | 70.6 | 2515154080 |
|
| Draft | 2.23 | 68.8 | 2379 | 1962 | 82.5 | 1661 | 69.8 | 2515154172 |
|
| Draft | 2.34 | 68.1 | 2595 | 1740 | 67.1 | 1530 | 59.0 | 645058872 |
|
| Draft | 2.48 | 66.8 | 2789 | 2004 | 71.9 | 1709 | 61.3 | 2502171156 |
|
| Finished | 2.26 | 68.6 | 2333 | 1935 | 82.9 | 1655 | 70.9 | 2511231187 |
|
| Draft | 2.04 | 68.3 | 2043 | 1636 | 80.1 | 1326 | 64.9 | 2513237279 |
|
| Draft | 2.29 | 65.3 | 2522 | 1954 | 77.5 | 1666 | 66.1 | 2514885041 |
Fig. 2Scanning electron microscopy image of Thermus amyloliquefaciens strain YIM 77409T grown in Thermus medium broth at 65 °C for 24 h
Classification and general features of Thermus amyloliquefaciens strain YIM 77409T [23]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain: YIM 77409T | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Nonsporulating | TAS [ | |
| Temperature range | 50–70 °C | TAS [ | |
| Optimum temperature | 60–65 °C | TAS [ | |
| pH range; Optimum | 6.0–8.0; 7.0 | TAS [ | |
| Carbon source | Glucose, sucrose, glycerol, maltose, raffinose, trehalose, rhamnose, inositol, xylitol, mannitol, sodium malate, mannose and | TAS [ | |
| MIGS-6 | Habitat | Terrestrial hot springs | TAS [ |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | Facultatively anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Niujie hot spring in Eryuan County, Yunnan Province, southwest China | TAS [ |
| MIGS-5 | Sample collection | 2010 | NAS |
| MIGS-4.1 | Latitude | N 26°15'01. 4" | NAS |
| MIGS-4.2 | Longitude | E 99°59'22. 3" | NAS |
| MIGS-4.4 | Altitude | 2060 m | NAS |
IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [53]
a Evidence codes
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Permanent Draft |
| MIGS-28 | Libraries used | PacBio 10 kb |
| MIGS 29 | Sequencing platforms | PacBio RS |
| MIGS 31.2 | Fold coverage | 384.9X PacBio |
| MIGS 30 | Assemblers | HGAP version 2.1.1 |
| MIGS 32 | Gene calling method | Prodigal 2.5; GenePRIMP |
| Locus Tag | BS74 | |
| Genbank ID | JQMV00000000 | |
| GenBank Date of Release | August 28, 2014 | |
| Database: IMG | 2579778517 | |
| GOLD ID | Gp0050852 | |
| BIOPROJECT | PRJNA234787 | |
| MIGS 13 | Source Material Identifier | YIM 77409T |
| Project relevance | Biotechnological |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 2,160,855 | 100.0 |
| DNA coding (bp) | 2,031,100 | 94.0 |
| DNA G + C (bp) | 1,457,281 | 67.4 |
| DNA scaffolds | 6 | 100.0 |
| Total genes | 2,313 | 100.0 |
| Protein coding genes | 2,257 | 97.6 |
| RNA genes | 56 | 2.4 |
| Pseudo genesb | 74 | 3.2 |
| Genes in internal clusters | 1,932 | 83.5 |
| Genes with function prediction | 1,839 | 79.5 |
| Genes assigned to COGs | 1,558 | 67.4 |
| Genes with Pfam domains | 1,842 | 79.6 |
| Genes with signal peptides | 110 | 4.8 |
| Genes with transmembrane helices | 439 | 19.0 |
| CRISPR repeats | 5 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
bPseudogenes may also be counted as protein coding or RNA genes, so is not additive under total gene count
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 179 | 10.4 | Translation, ribosomal structure and biogenesis |
| A | 4 | 0.2 | RNA processing and modification |
| K | 76 | 4.4 | Transcription |
| L | 63 | 3.7 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 22 | 1.3 | Cell cycle control, Cell division, chromosome partitioning |
| V | 35 | 2.0 | Defense mechanisms |
| T | 66 | 3.8 | Signal transduction mechanisms |
| M | 77 | 4.5 | Cell wall/membrane biogenesis |
| N | 16 | 0.9 | Cell motility |
| U | 14 | 0.8 | Intracellular trafficking and secretion |
| O | 90 | 5.2 | Posttranslational modification, protein turnover, chaperones |
| C | 131 | 7.6 | Energy production and conversion |
| G | 105 | 6.1 | Carbohydrate transport and metabolism |
| E | 183 | 10.6 | Amino acid transport and metabolism |
| F | 75 | 4.4 | Nucleotide transport and metabolism |
| H | 121 | 7.0 | Coenzyme transport and metabolism |
| I | 91 | 5.3 | Lipid transport and metabolism |
| P | 82 | 4.8 | Inorganic ion transport and metabolism |
| Q | 40 | 2.3 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 170 | 9.9 | General function prediction only |
| S | 68 | 4.0 | Function unknown |
| - | 755 | 32.6 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Molecular organization of identified nar operon and neighboring genes involved in denitrification located on the chromosome of T. amyloliquefaciens YIM 77409T, T. tengchongensis YIM 77401, and T. scotoductus SA-01. Fe: heme protein-containing nitrite reductase, Cu: copper-containing nitrite reductase. Numbers below the genes indicate the provisional ORF numbers in T. amyloliquefaciens YIM 77409T and T. tengchongensis YIM 77401, the locations in the chromosome are indicated below. nar: nitrate reductase gene; nir: nitrite reductase gene; nor: nitric oxide reductase gene; dnr: denitrification regulator gene [43, 55, 56]. This figure is modified from Murugapiran et al. [17]