| Literature DB >> 26457129 |
Michael P Manzella1, Dawn E Holmes2, Jessica M Rocheleau2, Amanda Chung2, Gemma Reguera1, Kazem Kashefi1.
Abstract
"Geoglobus ahangari" strain 234(T) is an obligate Fe(III)-reducing member of the Archaeoglobales, within the archaeal phylum Euryarchaeota, isolated from the Guaymas Basin hydrothermal system. It grows optimally at 88 °C by coupling the reduction of Fe(III) oxides to the oxidation of a wide range of compounds, including long-chain fatty acids, and also grows autotrophically with hydrogen and Fe(III). It is the first archaeon reported to use a direct contact mechanism for Fe(III) oxide reduction, relying on a single archaellum for locomotion, numerous curled extracellular appendages for attachment, and outer-surface heme-containing proteins for electron transfer to the insoluble Fe(III) oxides. Here we describe the annotation of the genome of "G. ahangari" strain 234(T) and identify components critical to its versatility in electron donor utilization and obligate Fe(III) respiratory metabolism at high temperatures. The genome comprises a single, circular chromosome of 1,770,093 base pairs containing 2034 protein-coding genes and 52 RNA genes. In addition, emended descriptions of the genus "Geoglobus" and species "G. ahangari" are described.Entities:
Keywords: Archaeoglobales; Autotroph; Euryarchaeota; Extracellular electron transfer; Fe(III) respiration; Guaymas basin; Hydrothermal vent
Year: 2015 PMID: 26457129 PMCID: PMC4600277 DOI: 10.1186/s40793-015-0035-8
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic tree. The phylogenetic tree was constructed with the maximum likelihood algorithm comparing the16S rRNA gene sequence from “G. ahangari” to other hyperthermophilic archaea. Bootstrap values were determined from 100 replicates and “Aquifex aeolicus” and Pseudothermotoga thermarum were used as outgroups
Fig. 2Scanning electron micrograph of cells of “G. ahangari” strain 234T growing on insoluble Fe(III) oxides. Bar, 100 nm
Classification and general features of “G. ahangari” 234T according to the MIGS recommendations [106]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain 234T | TAS [ | ||
| Gram stain | Variable | NAS | |
| Cell shape | Irregular coccus | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | 65−90 °C | TAS [ | |
| Optimal temperature | 88 °C | TAS [ | |
| pH range; Optimum | 5.0−7.6 (optimum 7.0) | TAS [ | |
| Carbon source | CO2 | TAS [ | |
| Energy metabolism | Chemolithoautotrophic, chemolithotrophic, chemoorganotrophic | TAS [ | |
| MIGS-6 | Habitat | Marine geothermally heated areas | TAS [ |
| MIGS-6.3 | Salinity | 9.0−38 g/L NaCl | TAS [ |
| MIGS-22 | Oxygen requirement | Anaerobe | TAS [ |
| MIGS-15 | Biotopic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| Isolation | Hydrothermal vent chimney | TAS [ | |
| MIGS-4 | Geographic location | Guaymas Basin hydrothermal system | TAS [ |
| MIGS-5 | Sample collection time | Unknown | NAS |
| MIGS-4.1 | Latitude | 27° N | TAS [ |
| MIGS-4.2 | Longitude | 111° W | TAS [ |
| MIGS-4.3 | Depth | 2000 m | TAS [ |
| MIGS-4.4 | Altitude | Not applicable |
aEvidence codes – IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [112]
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | 5 independent 100 bp paired-end Illumina shotgun libraries, 150 bp paired-end Illumina shotgun library |
| MIGS-29 | Sequencing platforms | Illumina MiSeq |
| MIGS-31.2 | Sequencing coverage | 1,977 × coverage (100 bp libraries) |
| 100 × (150 bp library) | ||
| MIGS-30 | Assemblers | SeqMan NGen, Velvet, SeqMan Pro |
| MIGS-32 | Gene calling method | JGI-ER, GLIMMER |
| INSDC ID | CP011267 | |
| Genbank Date of Release | May 11, 2015 | |
| GOLD ID | Gp0101274 | |
| NCBI project ID | 258102 | |
| MIGS-13 | Source material identifier | ATCC |
| Project relevance | Phylogenetic diversity, biotechnology, evolution of metal respiration in hyperthermophiles, and anaerobic degradation of hydrocarbons |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Size (bp) | 1,770,093 | 100.0 % |
| Coding region (bp) | 1,662,832 | 93.9 % |
| G + C content (bp) | 940,071 | 53.1 % |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2072 | 100.0 % |
| RNA genes | 52 | 2.5 % |
| rRNA operons | 2 | |
| Protein-coding genes | 2034 | 100.0 % |
| Pseudogenes | 47 | 2.3 % |
| Genes with function prediction | 1677 | 82.4 % |
| Genes in paralog clusters | 1406 | 69.1 % |
| Genes assigned to COGs | 1470 | 72.2 % |
| Genes assigned Pfam domains | 1667 | 82.0 % |
| Genes with signal peptides | 55 | 2.7 % |
| Genes with transmembrane helices | 409 | 20.1 % |
| CRISPR repeats | 7 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Fig. 3Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (colored by COG categories), genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, and GC skew
Number of genes associated with the 25 general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 155 | 7.6 % | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.1 % | RNA processing and modification |
| K | 68 | 3.3 % | Transcription |
| L | 58 | 2.9 % | Replication, recombination and repair |
| B | 6 | 0.3 % | Chromatin structure and dynamics |
| D | 20 | 1.0 % | Cell cycle control, Cell division, chromosome partitioning |
| Y | 0 | 0.0 % | Nuclear structure |
| V | 7 | 0.3 % | Defense mechanisms |
| T | 24 | 1.2 % | Signal transduction mechanisms |
| M | 35 | 1.7 % | Cell wall/membrane biogenesis |
| N | 15 | 0.7 % | Cell motility |
| Z | 0 | 0.0 % | Cytoskeleton |
| W | 0 | 0.0 % | Extracellular structures |
| U | 23 | 1.1 % | Intracellular trafficking and secretion |
| O | 57 | 2.8 % | Posttranslational modification, protein turnover, chaperones |
| C | 160 | 7.9 % | Energy production and conversion |
| G | 37 | 1.8 % | Carbohydrate transport and metabolism |
| E | 139 | 6.8 % | Amino acid transport and metabolism |
| F | 49 | 2.4 % | Nucleotide transport and metabolism |
| H | 103 | 5.1 % | Coenzyme transport and metabolism |
| I | 55 | 2.7 % | Lipid transport and metabolism |
| P | 86 | 4.2 % | Inorganic ion transport and metabolism |
| Q | 16 | 0.8 % | Secondary metabolites biosynthesis, transport and catabolism |
| R | 244 | 12.0 % | General function prediction only |
| S | 198 | 9.7 % | Function unknown |
| - | 477 | 23.5 % | Not in COGs |
aThe total is based on the total number of protein coding genes in the genome
Fig. 4Central metabolism in “Geoglobus ahangari” strain 234T
Terminal electron acceptors in the Archaeoglobales
| Electron acceptors | |||||
|---|---|---|---|---|---|
| Organism | Sulfate | Sulfite | Thiosulfate | Nitrate | Fe(III) |
|
| - | - | - | - | + |
|
| - | - | + | + | + |
|
| +/− | + | + | - | - |
Putative c-type cytochromes
| Gene ID: | Annotation: | # of heme binding motifs: | Calculated molecular weight: | TM domains: |
|---|---|---|---|---|
| GAH_00015 | Hypothetical protein | 4 | 58.4 | 0 |
| GAH_00283 | Cytochrome c7 | 4 | 21.2a | 1 |
| GAH_00286 | Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit | 12 | 39.1 | 0 |
| GAH_00301 | Putative redox-active protein (C_GCAxxG_C_C) | 2 | 31.5 | 3 |
| GAH_00504 | Hypothetical protein | 10 | 54.5 | 1 |
| GAH_00505 | Hypothetical protein | 4 | 26.8 | 2 |
| GAH_00506 | Cytochrome c3 | 9 | 48.6 | 0 |
| GAH_00507 | Cytochrome c7 | 4 | 27.4a | 1 |
| GAH_00508 | Hypothetical protein | 5 | 28.5 | 1 |
| GAH_00510 | Hypothetical protein | 4 | 27.3 | 1 |
| GAH_00817 | Seven times multi-haem cytochrome CxxCH | 8 | 53.7 | 1 |
| GAH_01091 | Hypothetical protein | 1 | 11.7 | 1 |
| GAH_01235 | Hypothetical protein | 5 | 21.5 | 0 |
| GAH_01236 | Hypothetical protein | 5 | 22.3b | 0 |
| GAH_01253 | Hypothetical protein | 4 | 16.9 | 0 |
| GAH_01256 | NapC/NirT cytochrome c family, N-terminal region | 10 | 43.6a | 1 |
| GAH_01296 | Cytochrome c family protein | 4 | 17.2 | 1 |
| GAH_01297 | Seven times multi-haem cytochrome CxxCH | 8 | 61.0 | 1 |
| GAH_01306 | Class III cytochrome C family | 8 | 46.3 | 0 |
| GAH_01534 | Hypothetical protein | 1 | 18.5 | 1 |
| GAH_01700 | Hypothetical protein | 3 | 9.9 | 0 |
aNo signal peptide detected
bSignal peptide detected by PRED-SIGNAL