| Literature DB >> 25709556 |
Reema K Gudhka1, Brett A Neilan2, Brendan P Burns2.
Abstract
Halococcus hamelinensis was the first archaeon isolated from stromatolites. These geomicrobial ecosystems are thought to be some of the earliest known on Earth, yet, despite their evolutionary significance, the role of Archaea in these systems is still not well understood. Detailed here is the genome sequencing and analysis of an archaeon isolated from stromatolites. The genome of H. hamelinensis consisted of 3,133,046 base pairs with an average G+C content of 60.08% and contained 3,150 predicted coding sequences or ORFs, 2,196 (68.67%) of which were protein-coding genes with functional assignments and 954 (29.83%) of which were of unknown function. Codon usage of the H. hamelinensis genome was consistent with a highly acidic proteome, a major adaptive mechanism towards high salinity. Amino acid transport and metabolism, inorganic ion transport and metabolism, energy production and conversion, ribosomal structure, and unknown function COG genes were overrepresented. The genome of H. hamelinensis also revealed characteristics reflecting its survival in its extreme environment, including putative genes/pathways involved in osmoprotection, oxidative stress response, and UV damage repair. Finally, genome analyses indicated the presence of putative transposases as well as positive matches of genes of H. hamelinensis against various genomes of Bacteria, Archaea, and viruses, suggesting the potential for horizontal gene transfer.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25709556 PMCID: PMC4325475 DOI: 10.1155/2015/241608
Source DB: PubMed Journal: Archaea Impact factor: 3.273
General features of the H. hamelinensis genome.
| General features | Number |
|---|---|
| Size (bp) |
|
| GC%1 |
|
| RNA genes |
|
| rRNA genes | 3 |
| 5 S rRNA | 1 |
| 16 S rRNA | 1 |
| 23 S rRNA | 1 |
| tRNA genes | 45 |
| DNA scaffolds | 223 |
| DNA coding bases | 2529448 |
|
|
|
| PCG with known and putative function (%) | 68.67 |
| PCG with enzymes (%) | 22.39 |
| PCG with COGs (%) | 65.13 |
| PCG with Pfam (%) | 65.98 |
| PCG with TIGRfam (%) | 19.48 |
| PCG with InterPro (%) | 87.12 |
| PCG coding signal peptides (%) | 10.07 |
| PCG coding transmembrane proteins (%) | 22.48 |
| PCG in paralog clusters (%) | 20.04 |
| Fused Protein coding genes | 219 |
| COG clusters | 1177 |
| KOG clusters | 635 |
| Pfam clusters | 1207 |
| TIGRfam clusters | 539 |
1GC percentage shown as count of G's and C's divided by a total number of G's, C's, A's, and T's. COGs—Clusters of Orthologous Groups of proteins; Pfam—protein family database; TIGRFAMs—resource consisting of curated multiple sequence alignments, Hidden Markov Models for protein sequence classification, and associated information designed to support automated annotation of proteins; KOG—euKaryotic Orthologous Groups.
Figure 1Radial graph displaying the three-letter codon frequency prediction of H. hamelinensis. Expected codon frequencies are indicated in the inner circle. Squares in the outer circle are indications are overrepresented codons.
Putative osmoadaptive genes identified in H. hamelinensis genome.
| Gene name | Gene ID |
|---|---|
| Probable glycine betaine transporter protein (Bet III) | 2502299791 |
|
| |
| Glycine betaine transporters OpuD | 2502299823 |
|
| |
| Glycine betaine transporter | 2502299930 |
|
| |
| ABC-type proline/glycine betaine permease component | 2502300305 |
|
| |
| Glycine betaine ABC transporter, ATP-binding protein | 2502300306 |
|
| |
| ABC-type proline/glycine betaine permease component | 2502300307 |
|
| |
| Probable glycine betaine transport protein | 2502301633 |
|
| |
| Potassium uptake protein TrkH | fig∣332168.3.peg.1799 |
|
| |
| TrkA system potassium uptake protein | ffig∣332168.3.peg.105 |
|
| |
| Potassiumchannel protein | 2502299500 |
|
| |
| Kef-type K+ transport systems | fig∣332168.3.peg.334 |
Putative genes coding for heavy metal resistance in H. hamelinensis.
| Gene name | Gene ID |
|---|---|
| Arsenical pump-driving ATPase | fig∣332168.3.peg.1469 |
|
| |
| Arsenate reductase (EC 1.20.4.1) | fig∣332168.3.peg.2242 |
|
| |
| Arsenate reductase (EC 1.20.4.1) | fig∣332168.3.peg.3119 |
|
| |
| Arsenical-resistance protein ACR3 | fig∣332168.3.peg.3118 |
|
| |
| DNA gyrase subunit A (EC 5.99.1.3) | fig∣332168.3.peg.722 |
|
| |
| DNA gyrase subunit B (EC 5.99.1.3) | fig∣332168.3.peg.721 |
|
| |
| Aminoglycoside N6′-acetyltransferase (EC 2.3.1.82) | fig∣332168.3.peg.687 |
|
| |
| Copper-translocating P-type ATPase (EC 3.6.3.4) | fig∣332168.3.peg.2377 |
|
| |
| Copper-translocating P-type ATPase (EC 3.6.3.4) | fig∣332168.3.peg.2190 |
|
| |
| Cobalt-zinc-cadmium resistance protein CzcD | fig∣332168.3.peg.2193 |
|
| |
| Cobalt-zinc-cadmium resistance protein | fig∣332168.3.peg.471 |
|
| |
| Permease of the drug/metabolite transporter (DMT) superfamily | fig∣332168.3.peg.1282 |
|
| |
| Permease of the drug/metabolite transporter (DMT) superfamily | fig∣332168.3.peg.2762 |
|
| |
| Permease of the drug/metabolite transporter (DMT) superfamily | fig∣332168.3.peg.2832 |
|
| |
| Permease of the drug/metabolite transporter (DMT) superfamily | fig∣332168.3.peg.2837 |
Figure 2Radial phylogenetic tree comparison between H. hamelinensis and the bacterial and archaeal lineage within IMG/ER database. Genomes of Archaea, Bacteria, and Eukaryota are displayed clockwise on the circular plot. The bar charts in between the circular plot and phylogenetic dendrograms are percentage representations of positive hits of genes between H. hamelinensis and respective genomes.
Figure 3Schematic representation of some of the putative adaptive traits present in H. hamelinensis. The traits identified pregenome is indicated on the left, and those elucidated postgenome in the present study are indicated on the right.