| Literature DB >> 27274784 |
Ulrike Kappler1, Karen Davenport2, Scott Beatson1, Alla Lapidus3, Chongle Pan4, Cliff Han5, Maria Del Carmen Montero-Calasanz6, Miriam Land4, Loren Hauser4, Manfred Rohde7, Markus Göker8, Natalia Ivanova9, Tanja Woyke9, Hans-Peter Klenk6, Nikos C Kyrpides10.
Abstract
Thioalkalimicrobium cyclicum Sorokin et al. 2002 is a member of the family Piscirickettsiaceae in the order Thiotrichales. The γ-proteobacterium belongs to the colourless sulfur-oxidizing bacteria isolated from saline soda lakes with stable alkaline pH, such as Lake Mono (California) and Soap Lake (Washington State). Strain ALM 1(T) is characterized by its adaptation to life in the oxic/anoxic interface towards the less saline aerobic waters (mixolimnion) of the stable stratified alkaline salt lakes. Strain ALM 1(T) is the first representative of the genus Thioalkalimicrobium whose genome sequence has been deciphered and the fourth genome sequence of a type strain of the Piscirickettsiaceae to be published. The 1,932,455 bp long chromosome with its 1,684 protein-coding and 50 RNA genes was sequenced as part of the DOE Joint Genome Institute Community Sequencing Program (CSP) 2008.Entities:
Keywords: Aerobic; CSP 2008; Gram-negative; Mono Lake; Obligate chemolithoautotroph; Piscirickettsiaceae; Sulfur oxidizer
Year: 2016 PMID: 27274784 PMCID: PMC4891895 DOI: 10.1186/s40793-016-0162-x
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree highlighting the position of T. cyclicum within the family Piscirickettsiaceae . T. cyclicum is shown relative to the type strains of Thioalkalimicrobium and all species from the three closest related genera (Thiomicrospira, Hydrogenovibrio and Galenea) within the family. The tree was inferred from 1,530 aligned characters [58, 59] of the 16S rRNA gene sequence under the ML criterion [60]. The rooting shown was inferred by the midpoint-rooting method [36]. The branches are scaled in terms of the expected number of substitutions per site (bar). Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [61] (left) and from 1,000 maximum-parsimony bootstrap replicates [62] (right) if larger than 60 %. Lineages with type strain genome sequencing projects registered in GOLD [7] are labeled with one asterisk, those also listed as ‘Complete and Published’ with two asterisks (see [37] and AGFA00000000 for T. aerophilum)
Classification and general features of T. cyclicum ALM1T in accordance with the MIGS recommendations [48] (published by the Genome Standards Consortium [49]) and the NamesforLife database [50]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain: ATMT (AF329082) | |||
| Gram stain | negative | TAS [ | |
| Cell shape | open ring-shaped | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | not reported | ||
| Temperature range | mesophile, about 28 °C | NAS | |
| Optimum temperature | not reported | ||
| pH range; Optimum | not reported | ||
| Carbon source | CO2 | NAS | |
| MIGS-6 | Habitat | water | TAS [ |
| MIGS-6.3 | Salinity | Moderate, 0.6 to 1.5 M NaCl | TAS [ |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | none | NAS |
| MIGS-4 | Geographic location | Lake Mono (California) | TAS [ |
| MIGS-5 | Sample collection | 1999 | TAS [ |
| MIGS-4.1 | Latitude | 38.012 | TAS [ |
| MIGS-4.2 | Longitude | −118.976 | TAS [ |
| MIGS-4.4 | Altitude | 1926 m | TAS [ |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [57]
Fig. 2Electron micrograph of T. cyclicum ALM1 DSM 14477T. Micrographs of bacterial cells grown from a culture of DSM 14477T in DSMZ medium 925 at 28 °C were taken with a field-emission scanning electron microscope (FE-Merlin, Zeiss). Bacteria were fixed with formaldehyde and glutaraldehyde, washed with TRIS-EDTA buffer, dehydrated with a graded series of acetone, critical-point dried with liquid CO2, sputter coated with gold-palladium and imaged in a Zeiss Merlin with the HE-SE2 detector and inlens-detector in a 25:75 ratio with an acceleration voltage of 5 kV
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | One 454 pyrosequence standard library, one 454 PE library (9 kb insert size), one Illumina library |
| MIGS 29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS 31.2 | Fold coverage | 1,521.2 × Illumina, 38.2 × pyrosequence |
| MIGS 30 | Assemblers | Newbler version 2.3, Velvet version 0.7.63, phrap version SPS – 4.24 |
| MIGS 32 | Gene calling method | Prodigal |
| Locus Tag | Thicy | |
| Genbank ID | CP002776 | |
| GenBank Date of Release | July 6, 2012 | |
| GOLD ID | Gc01777 | |
| BIOPROJECT | PRJNA52629 | |
| MIGS 13 | Source Material Identifier | DSM 14477 |
| Project relevance | Biotechnological, Bioremediation |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 1,932,455 | 100.0 |
| DNA coding (bp) | 1,818,441 | 94.1 |
| DNA G + C (bp) | 907,872 | 47.0 |
| DNA scaffolds | 1 | |
| Total genes | 1,734 | 100.0 |
| Protein coding genes | 1,684 | 97.1 |
| RNA genes | 50 | 2.9 |
| Pseudo genes | 19 | 1.1 |
| Genes in internal clusters | 543 | 31.3 |
| Genes with function prediction | 1,361 | 78.5 |
| Genes assigned to COGs | 1,459 | 84.1 |
| Genes with Pfam domains | 1,502 | 86.6 |
| Genes with signal peptides | 124 | 7.2 |
| Genes with transmembrane helices | 357 | 20.6 |
| CRISPR repeats | 0 |
Fig. 3Graphical map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black), GC skew (purple/olive)
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 153 | 9.7 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.1 | RNA processing and modification |
| K | 71 | 4.5 | Transcription |
| L | 86 | 5.4 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 27 | 1.7 | Cell cycle control, Cell division, chromosome partitioning |
| V | 28 | 1.8 | Defense mechanisms |
| T | 48 | 3.0 | Signal transduction mechanisms |
| M | 117 | 7.4 | Cell wall/membrane biogenesis |
| N | 12 | 0.8 | Cell motility |
| U | 56 | 3.5 | Intracellular trafficking and secretion |
| O | 88 | 5.6 | Posttranslational modification, protein turnover, chaperones |
| C | 95 | 6.0 | Energy production and conversion |
| G | 59 | 3.7 | Carbohydrate transport and metabolism |
| E | 123 | 7.8 | Amino acid transport and metabolism |
| F | 49 | 3.1 | Nucleotide transport and metabolism |
| H | 109 | 6.9 | Coenzyme transport and metabolism |
| I | 40 | 2.5 | Lipid transport and metabolism |
| P | 89 | 5.6 | Inorganic ion transport and metabolism |
| Q | 26 | 1.6 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 170 | 10.8 | General function prediction only |
| S | 135 | 8.5 | Function unknown |
| - | 275 | 15.9 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 4Schematic representation of the gene regions encoding enzymatic components of the Sox (Sulfur Oxidation) pathway; sox genes are shown in grey, all genes encoding proteins not related to the Sox complex are showing in white. Gene numbers and genes encoded are shown in the figure
Fig. 5Phylogenetic analysis of SoxA cytochromes. The phylogenetic tree was generated using the Neighbor-joining algorithm as integrated into the Mega program suite [63], bootstrap values are based on 500 replicates