| Literature DB >> 21304717 |
Mathias von Jan, Alla Lapidus, Tijana Glavina Del Rio, Alex Copeland, Hope Tice, Jan-Fang Cheng, Susan Lucas, Feng Chen, Matt Nolan, Lynne Goodwin, Cliff Han, Sam Pitluck, Konstantinos Liolios, Natalia Ivanova, Konstantinos Mavromatis, Galina Ovchinnikova, Olga Chertkov, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Elizabeth Saunders, Thomas Brettin, John C Detter, Patrick Chain, Konrad Eichinger, Harald Huber, Stefan Spring, Manfred Rohde, Markus Göker, Reinhard Wirth, Tanja Woyke, Jim Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk.
Abstract
Archaeoglobus profundus (Burggraf et al. 1990) is a hyperthermophilic archaeon in the euryarchaeal class Archaeoglobi, which is currently represented by the single family Archaeoglobaceae, containing six validly named species and two strains ascribed to the genus 'Geoglobus' which is taxonomically challenged as the corresponding type species has no validly published name. All members were isolated from marine hydrothermal habitats and are obligate anaerobes. Here we describe the features of the organism, together with the complete genome sequence and annotation. This is the second completed genome sequence of a member of the class Archaeoglobi. The 1,563,423 bp genome with its 1,858 protein-coding and 52 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Archaeoglobaceae; GEBA; hydrogen utilization; hydrothermal systems; hyperthermophilic; marine; strictly anaerobic; sulfate respiration
Year: 2010 PMID: 21304717 PMCID: PMC3035285 DOI: 10.4056/sigs.942153
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of A. profundus AV18T relative to the other type strains within the family. The tree was inferred from 1,334 aligned characters [28,29] of the 16S rRNA gene sequence under the maximum likelihood criterion [30] and rooted in accordance with a current taxonomy [31]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [32] are shown in blue, published genomes in bold: Methanococcus aeolicus (CP000743), Methanocaldococcus fervens (CP001696), Methanocaldococcus jannaschii [33] and A. fulgidus [34], two of the very first organisms whose genome sequences have been revealed.
Figure 2Scanning electron micrograph of cells of A. profundus strain AV18T
Classification and general features of A. profundus strain AV18T according to the MIGS recommendations [36]
| | | | |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain AV18 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | coccoid, highly irregular | TAS [ | |
| Motility | not motile | TAS [ | |
| Sporulation | nonsporulating | NAS | |
| Temperature range | 65-90°C | TAS [ | |
| Optimum temperature | 82°C | TAS [ | |
| Salinity | >9-36 g/l (optimum 18 g/l) | TAS [ | |
| MIGS-22 | Oxygen requirement | obligate anaerobic | TAS [ |
| Carbon source | acetate, pyruvate, lactate, yeast extract, | TAS [ | |
| Energy source | H2 | TAS [ | |
| MIGS-6 | Habitat | deep sea hydrothermal system | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | none | TAS [ |
| Biosafety level | 1 | TAS [ | |
| Isolation | cores of hot sediment | TAS [ | |
| MIGS-4 | Geographic location | Guaymas, Mexico | TAS [ |
| MIGS-5 | Sample collection time | before or around 1989 | NAS |
| MIGS-4.1 | Latitude | 14.84 | NAS |
| MIGS-4.3 | Depth | -2,000 m | TAS [ |
| MIGS-4.4 | Altitude | -2,000 m | TAS [ |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i. e. a direct report exists in the literature); NAS: Non-traceable Author Statement (i. e. not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three 454 pyrosequence libraries, |
| MIGS-29 | Sequencing platforms | 454 Titanium, Illumina |
| MIGS-31.2 | Sequencing coverage | 136× 454 Titanium, 30× Illumina GAii |
| MIGS-30 | Assemblers | Newbler, phrap |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| INSDC ID | CP001857 (chromosome) | |
| GenBank Date of Release | January 20, 2010 | |
| GOLD ID | Gc01188 | |
| NCBI project ID | 32583 | |
| Database: IMG-GEBA | 2501939633 | |
| MIGS-13 | Source material identifier | DSM 5631 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 1,563,423 | 100.00% |
| DNA coding region (bp) | 1,474,996 | 94.34% |
| DNA G+C content (bp) | 656,709 | 42.00% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 1 | |
| Total genes | 1,909 | 100.00% |
| RNA genes | 52 | 2.67% |
| rRNA operons | 1 | |
| Protein-coding genes | 1,858 | 97.33% |
| Pseudo genes | 35 | 1.83% |
| Genes with function prediction | 1,145 | 59.98% |
| Genes in paralog clusters | 167 | 8.75% |
| Genes assigned to COGs | 1,267 | 66.37% |
| Genes assigned Pfam domains | 1,301 | 68.15% |
| Genes with signal peptides | 141 | 7.39% |
| Genes with transmembrane helices | 328 | 17.18% |
| CRISPR repeats | 0 |
Figure 3Graphical circular map of the genome (without the 2.8 kbp plasmid). From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 150 | 8.1 | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.1 | RNA processing and modification |
| K | 67 | 3.6 | Transcription |
| L | 79 | 4.3 | Replication, recombination and repair |
| B | 4 | 0.2 | Chromatin structure and dynamics |
| D | 13 | 0.7 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 4 | 0.2 | Defense mechanisms |
| T | 46 | 2.5 | Signal transduction mechanisms |
| M | 40 | 2.2 | Cell wall/membrane biogenesis |
| N | 18 | 1.0 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 21 | 1.1 | Intracellular trafficking and secretion |
| O | 53 | 2.9 | Posttranslational modification, protein turnover, chaperones |
| C | 109 | 5.9 | Energy production and conversion |
| G | 41 | 2.2 | Carbohydrate transport and metabolism |
| E | 110 | 5.9 | Amino acid transport and metabolism |
| F | 47 | 2.5 | Nucleotide transport and metabolism |
| H | 85 | 4.6 | Coenzyme transport and metabolism |
| I | 24 | 1.3 | Lipid transport and metabolism |
| P | 57 | 3.1 | Inorganic ion transport and metabolism |
| Q | 7 | 0.4 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 201 | 10.8 | General function prediction only |
| S | 169 | 9.1 | Function unknown |
| - | 642 | 34.6 | Not in COGs |
Reverse complement of the 16S rRNA 3'-end
| | | |
|---|---|---|
| AGGAGGT | 555 | 29.9% |
| GGAGGTG | 569 | 30.6% |
| GAGGTGA | 565 | 30.4% |
| AGGTGAT | 549 | 30.0% |
| GGTGATC | 283 | 15.2% |
| GTGATCC | 88 | 4.7% |
| TGATCCA | 89 | 4.8% |
| GATCCAG | 53 | 2.9% |
| ATCCAGC | 58 | 3.1% |
| TCCAGCC | 45 | 2.4% |
Assessment of the frequencies of heptanucleotide sections with one acceptable mismatch in 50 bp regions upstram of all genes in AV18T.
Figure 4Venn-diagram depicting the intersections of protein sets (total numbers in parentheses) of the three completely sequenced Archaeoglobi genomes. All intersections concerning A. profundus are gene counts of AV18T, the remaining intersection between A. fulgidus and F. placidus only, are gene counts in A. fulgidus. Due to variable copy numbers of several genes in the three species, the fragments do not add up to the total numbers of genes for A. fulgidus and F. placidus.
Figure 5Phylogenetic tree of archaeal type strains with fully sequenced genomes, inferred using the maximum likelihood criterion [30], based on an alignment of the RNA polymerase B subunit sequence and rooted with the node which separates Cren- and Euryarchaeota. The alignment was inferred by Muscle [81] software, using the PROTCATLGF substitution model. Bootstrapping was performed using RAxML [30] and values above 60% mark the corresponding nodes. Species containing a conserved split in the RNA polymerase B subunit gene are displayed in bold.