| Literature DB >> 27182430 |
Enzo Messina1, Dimitry Y Sorokin2, Ilya V Kublanov3, Stepan Toshchakov4, Anna Lopatina5, Erika Arcadi1, Francesco Smedile1, Gina La Spada1, Violetta La Cono1, Michail M Yakimov1.
Abstract
Strain M27-SA2 was isolated from the deep-sea salt-saturated anoxic lake Medee, which represents one of the most hostile extreme environments on our planet. On the basis of physiological studies and phylogenetic positioning this extremely halophilic euryarchaeon belongs to a novel genus 'Halanaeroarchaeum' within the family Halobacteriaceae. All members of this genus cultivated so far are strict anaerobes using acetate as the sole carbon and energy source and elemental sulfur as electron acceptor. Here we report the complete genome sequence of the strain M27-SA2 which is composed of a 2,129,244-bp chromosome and a 124,256-bp plasmid. This is the second complete genome sequence within the genus Halanaeroarchaeum. We demonstrate that genome of 'Halanaeroarchaeum sulfurireducens' M27-SA2 harbors complete metabolic pathways for acetate and sulfur catabolism and for de novo biosynthesis of 19 amino acids. The genomic analysis also reveals that 'Halanaeroarchaeum sulfurireducens' M27-SA2 harbors two prophage loci and one CRISPR locus, highly similar to that of Kulunda Steppe (Altai, Russia) isolate 'H. sulfurireducens' HSR2(T). The discovery of sulfur-respiring acetate-utilizing haloarchaeon in deep-sea hypersaline anoxic lakes has certain significance for understanding the biogeochemical functioning of these harsh ecosystems, which are incompatible with life for common organisms. Moreover, isolations of Halanaeroarchaeum members from geographically distant salt-saturated sites of different origin suggest a high degree of evolutionary success in their adaptation to this type of extreme biotopes around the world.Entities:
Keywords: Anoxic habitats; Extremely halophilic archaea; Hypersaline lake; Strictly anaerobic; Sulfur reduction
Year: 2016 PMID: 27182430 PMCID: PMC4866403 DOI: 10.1186/s40793-016-0155-9
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Morphology of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2 cells grown on acetate (a) and pyruvate (b) as electron donors and elemental sulphur as electron acceptor. The scale bars represent 5 μm
Fig. 2Phylogenetic tree of 16S rRNA gene sequences showing the position of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2. Tree was inferred from a 16S rRNA gene sequence alignment with PAUP*4.b10 [59] using a LogDet/paralinear distance method. Support for nodes in the tree corresponds to bootstrap values for 1000 pseudo-replicates. Only bootstrap values greater than 75 % are displayed as solid circles. The tree has been arbitrarily rooted on sequence of Natronomonas pharaonis (D87971) and Halomarina oriensis (AB519798). The 16S rRNA gene sequence of Methanohalophilus halophilus (FN870068) was used as the outgroup. The scale bars represent a 5 % nucleotide sequence divergence
Classification and general features of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2T [48]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain M27-SA2T (CP011564, CP011565) | TAS [ | ||
| Cell shape | Pleomorphic | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | 15–50 °C | TAS [ | |
| Optimum temperature | 40 °C | TAS [ | |
| pH range; Optimum | 6.7–8.0; 7.2–7.5 | TAS [ | |
| Carbon source | Acetate, pyruvate | TAS [ | |
| MIGS-6 | Habitat | Hypersaline anoxic lake sediments (brine) | TAS [ |
| MIGS-6.3 | Salinity | 3.0–5.0 M NaCl | TAS [ |
| MIGS-22 | Oxygen requirement | Strictly anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Lake Medee, Ionian Sea, Eastern Mediterranean | TAS [ |
| MIGS-5 | Sample collection | 24 September 2012 | TAS [ |
| MIGS-4.1 | Latitude | 34°26.250N | TAS [ |
| MIGS-4.2 | Longitude | 22°19.783E | TAS [ |
| MIGS-4.3 | Depth | 3105 m | TAS [ |
aEvidence codes – IDA Inferred from Direct Assay (first time in publication), TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [58]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements
Project information for ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Illumina standard library, Miseq Reagent kit v2. |
| MIGS-29 | Sequencing platforms | Illumina MiSeq System |
| MIGS-31.2 | Fold coverage | 634x chromosome, 691x plasmid |
| MIGS-30 | Assemblers | Velvet 1.2.10, Geneious 7.1 |
| MIGS-32 | Gene calling method | Geneious 7.1, Glimmer 3.02, tRNAScan-SE |
| Locus Tag | HLASA | |
| GenBank ID | CP011564 (chromosome) | |
| CP011565 (plasmid) | ||
| GenBank date of release | 30/09/2015 | |
| BIOPROJECT | PRJNA284332 | |
| MIGS-13 | Source material identifier | Isolated from the deep-sea hypersaline lake Medee, Ionian Sea, Eastern Mediterranean, water depth 3105 m. Salinity: 230 g/l; pH 6.8. Coordinates 34°26.250N, 22°19.783E. |
| Project relevance | Extremophile hypersaline environments |
Fig. 3Graphical circular map of the genome of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2. From outside to center: genes on forward strand (COG color-coded), genes on reverse strand (COG color-coded), RNA genes and other (only for chromosome: tRNAs green, rRNAs red, CRISPR azure, Prophages dark blue), GC content, GC skew [18, 60]
Genome composition for ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 2.129 | circular | CP011564.1 | NZ_CP011564.1 |
| Plasmid | 0.124 | circular | CP011565.1 | NZ_CP011565.1 |
Chromosome statistics for ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| Attribute | Value | % of total |
|---|---|---|
| Chromosome size (bp) | 2,129,244 | |
| DNA coding (bp) | 1,860,079 | 87.36 % |
| DNA G + C (bp) | 1,345,472 | 63.19 % |
| Total genes | 2,200 | |
| Protein-coding genes | 2,151 | 97.77 % |
| tRNA genes | 46 | 2.09 % |
| rRNA genes (5S-16S-23S) | 3 | 0.14 % |
| Genes assigned to COGs | 1,306 | 60.72 % |
| CRISPR repeats | 1 | |
| Average length (bp) | 861 | |
| Max length (bp) | 5,619 | |
| ATG initiation codon proteins | 1,814 | 84.33 % |
| GTG initiation codon proteins | 278 | 12.93 % |
| TTG initiation codon proteins | 59 | 2.74 % |
Number of genes associated with the general COG functional categories for chromosome
| Code | Value | % age | COG category |
|---|---|---|---|
| J | 129 | 5.99 % | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.05 % | RNA processing and modification |
| K | 79 | 3.67 % | Transcription |
| L | 91 | 4.23 % | Replication, recombination and repair |
| B | 3 | 0.14 % | Chromatin structure and dynamics |
| D | 8 | 0.37 % | Cell cycle control, cell division, chromosome partitioning |
| V | 10 | 0.46 % | Defense mechanisms |
| T | 46 | 2.14 % | Signal transduction mechanisms |
| M | 46 | 2.14 % | Cell wall/membrane/envelope biogenesis |
| N | 28 | 1.30 % | Cell motility |
| U | 9 | 0.42 % | Intracellular trafficking, secretion, and vesicular transport |
| O | 58 | 2.70 % | Posttranslational modification, protein turnover, chaperones |
| C | 93 | 4.32 % | Energy production and conversion |
| G | 27 | 1.26 % | Carbohydrate transport and metabolism |
| E | 133 | 6.18 % | Amino acid transport and metabolism |
| F | 52 | 2.42 % | Nucleotide transport and metabolism |
| H | 80 | 3.72 % | Coenzyme transport and metabolism |
| I | 29 | 1.35 % | Lipid transport and metabolism |
| P | 57 | 2.65 % | Inorganic ion transport and metabolism |
| Q | 7 | 0.33 % | Secondary metabolites biosynthesis, transport and catabolism |
| R | 177 | 8.23 % | General function prediction only |
| S | 143 | 6.65 % | Function unknown |
| - | 845 | 39.28 % | Not in COGs |
Plasmid statistics for ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| Attribute | Value | % of total |
|---|---|---|
| Plasmid size (bp) | 124,256 | |
| DNA coding (bp) | 103,887 | 83.61 % |
| DNA G + C (bp) | 68,831 | 55.39 % |
| Total genes | 119 | |
| Protein-coding genes | 119 | |
| Genes assigned to COGs | 24 | 20.16 % |
| Average length (bp) | 873 | |
| Max length (bp) | 4,326 | |
| ATG initiation codon proteins | 78 | 65.55 % |
| GTG initiation codon proteins | 28 | 23.53 % |
| TTG initiation codon proteins | 13 | 10.92 % |
Number of genes associated with the general COG functional categories for plasmid
| Code | Value | % age | COG category |
|---|---|---|---|
| K | 4 | 3.36 % | Transcription |
| L | 7 | 5.88 % | Replication, recombination and repair |
| D | 2 | 1.68 % | Cell cycle control, cell division, chromosome partitioning |
| T | 4 | 3.36 % | Signal transduction mechanisms |
| H | 2 | 1.68 % | Coenzyme transport and metabolism |
| P | 1 | 0.84 % | Inorganic ion transport and metabolism |
| R | 2 | 1.68 % | General function prediction only |
| S | 2 | 1.68 % | Function unknown |
| - | 95 | 79.84 % | Not in COGs |
Fig. 4Comparison of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2 vs. ‘Halanaeroarchaeum sulfurireducens’ HSR2T: chromosomes in (a) and plasmids in (b) ACT [20] comparisons at left (98 % ID, 40 bp minimum bitscore cutoff), LAST [21] comparisons at right (default parameters), and Mauve [22] alignments in the middle (default parameters). Genes COG color-coded, comparisons forward ID scale in red tones and reverse ID scale in blue tones
Fig. 5Overview of amino acid biosynthesis pathways in the genome of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2. The green colour indicates the presence of a homolog coding an enzyme that may catalyse this reaction. Red colour indicates the absence of the corresponding gene in M27-SA2 genome. EC numbers are shown in parentheses, while M27-SA2 gene locus_tags are in brackets
Fig. 6Structure of CRISPR system identified in ‘Halanaeroarchaeum sulfurireducens’ M27-SA2, with description of associated protein and repeat region found
CRISPR spacers analysis in the chromosome of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| Spacer # | Match to metagenomic library | # of mismatches | 5′ PAM sequence | Match to GenBank nt database |
|---|---|---|---|---|
| 7 | Lake | 4 | TTT | gb|JQ807236.1|, environmental Halophage eHP-15 |
| 24 | Lake | 3 | CTC/TGC | no |
| 34 | Lake | 2 | TTC/TTT | no |
| 37 | Lake | 0 | gb|AY596293.1|, | |
| 52 | Lake | 5 | GTG | no |
| 54 | Lake | 5 | TTC | no |
Fig. 7Comparative genome map with genome alignment results. Relative lytic pleomorphic viruses HRPV1, HRPV2, HRPV3, HHPV1 have seven core genes (colored in shades of magenta) that are shared between them and a prophage 1 from ‘Halanaeroarchaeum sulfurireducens’ M27-SA2. Relative prophages of Haloferax Hela, Hmuc, Hvol (coloured in shades of green) have tRNA and XerC/D integrase/recombinase flanking their genomes. The same flanks has the genome of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2 prophage Hsul. CRISPR array and cas genes are colored in red. XerC/D integrase/recombinase is colored in blue. The same colors of the genes in genomes represent homologous genes. Short designations used: HRPV1, Halorubrum pleomorphic virus 1; HRPV2, Halorubrum pleomorphic virus 2; HRPV3, Halorubrum pleomorphic virus 3; HHPV1, Haloarcula pleomorphic virus 1; Hsul, ‘Halanaeroarchaeum sulfurireducens’ M27-SA2 prophage 1; Hela; Haloferax elongans contig AOLK01000011 [38]; Hmuc, Haloferax mucosum contig AOLN01000011 [42]; Hvol, Haloferax volcanii chrom1 [42]
NCBI blastp results for prophage 1 region of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| locus_tag | Length (bp) | NCBI blastp best hit | e-value | gb accession |
|---|---|---|---|---|
| HLASA_0843 | 1239 | integrase [ | 6.00e-176 | WP_015789480.1 |
| HLASA_0844 | 213 | hypothetical protein [ | 5.00e-05 | WP_018258875.1 |
| HLASA_0845 | 417 | DNA-binding protein [ | 1.00e-79 | AHG02386.1 |
| HLASA_0846 | 231 | hypothetical protein [ | 5.00e-37 | WP_020446487.1 |
| HLASA_0847 | 267 | hypothetical protein [uncultured archaeon A07HR60] | 6.00e-07 | WP_023506388.1 |
| HLASA_0848 | 489 | hypothetical protein [ | 4.00e-94 | WP_020221057.1 |
| HLASA_0849 | 552 | hypothetical protein [ | 9.00e-119 | WP_044957257.1 |
| HLASA_0850 | 141 | MULTISPECIES: hypothetical protein [ | 2.00e-21 | WP_004594507.1 |
| HLASA_0851 | 189 | hypothetical protein [ | 4.00e-33 | WP_021056874.1 |
| HLASA_0852 | 381 | hypothetical protein [ | 2.00e-82 | WP_026046123.1 |
| HLASA_0853 | 423 | hypothetical protein [ | 3.00e-44 | WP_004048594.1 |
| HLASA_0854 | 954 | hypothetical protein [ | 6.00e-27 | WP_015761810.1 |
| HLASA_0855 | 798 | hypothetical protein HRPV-1_gp7 [ | 2.00e-26 | YP_002791892.1 |
| HLASA_0856 | 474 | unknown [ | 6.00e-19 | YP_003411999.1 |
| HLASA_0857 | 1581 | MULTISPECIES: hypothetical protein [ | 8.00e-52 | WP_008576942.1 |
| HLASA_0858 | 399 | gp3 [ | 6.00e-47 | YP_009008689.1 |
| HLASA_0859 | 351 | hypothetical protein [ | 5.00e-38 | WP_005538312.1 |
| HLASA_0860 | 813 | CRISPR-associated protein Cas6 [ | 7.00e-151 | WP_008524857.1 |
| HLASA_0861 | 2112 | CRISPR-associated protein Csh1 [ | 0 | WP_008524855.1 |
| HLASA_0862 | 1068 | CRISPR-associated protein, Csh2 family [ | 0 | WP_008164963.1 |
| HLASA_0863 | 801 | CRISPR-associated protein Cas5 [ | 2.00e-124 | WP_008524852.1 |
| HLASA_0864 | 2589 | CRISPR-associated helicase, Cas3 [ | 0 | WP_020936219.1 |
| HLASA_0865 | 555 | CRISPR-associated protein Cas4 [ | 4.00e-99 | WP_015789200.1 |
| HLASA_0866 | 993 | CRISPR-associated protein Cas1 [ | 0 | WP_015789199.1 |
| HLASA_0867 | 264 | CRISPR-associated endonuclease Cas2 [ | 1.00e-37 | WP_015789198.1 |
| repeat_region | 3798 | CRISPR repeat region | ||
| HLASA_0868 | 1338 | replication-related protein [ | 5.00e-145 | WP_006432249.1 |
| HLASA_0869 | 411 | PREDICTED: multidrug and toxin extrusion protein 1 [ | 0.1 | XP_003465357.1 |
NCBI blastp results for prophage 2 region of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2
| locus_tag | Length (bp) | NCBI blastp best hit | e-value | gb accession |
|---|---|---|---|---|
| HLASA_2001 | 1347 | integrase [ | 3.00e-178 | WP_005577816.1 |
| HLASA_2002 | 186 | no | ||
| HLASA_2003 | 1125 | hypothetical protein [ | 5.00e-11 | WP_042662540.1 |
| HLASA_2004 | 996 | zinc finger SWIM domain protein [ | 5.00e-14 | WP_004517574.1 |
| HLASA_2005 | 219 | hypothetical protein [ | 4.00e-15 | WP_006885565.1 |
| HLASA_2006 | 1071 | ORC / cell division control protein 6 [ | 2.00e-98 | WP_008307569.1 |
| HLASA_2007 | 660 | hypothetical protein [ | 9.00e-59 | WP_008312971.1 |
| HLASA_2008 | 978 | hypothetical protein [ | 3.00e-144 | WP_008312969.1 |
| HLASA_2009 | 639 | PHP domain-containing protein [ | 3.00e-106 | WP_008312967.1 |
| HLASA_2010 | 912 | decaprenyl-phosphate phosphoribosyltransferase [ | 0 | WP_008312966.1 |
| HLASA_2011 | 423 | hypothetical protein [ | 2.00e-29 | WP_004518340.1 |
| HLASA_2012 | 1092 | NAD-dependent epimerase/dehydratase [ | 0 | WP_008312962.1 |
| HLASA_2013 | 1299 | hypothetical protein [ | 2.00e-35 | WP_013559525.1 |
| HLASA_2014 | 810 | concanavalin A-like lectin/glucanases family protein [ | 2.00e-18 | CDK38289.1 |
| HLASA_2015 | 597 | hypothetical protein OSG_eHP34_00135 [Environmental halophage eHP-34] | 1.00e-28 | AFH22760.1 |
| HLASA_2016 | 576 | hypothetical protein HGTV1_28 [halovirus HGTV-1] | 2.00e-11 | YP_008059236.1 |
| HLASA_2017 | 912 | hypothetical protein PhiCh1p32 [ | 2.00e-29 | NP_665949.1 |
| HLASA_2018 | 1263 | baseplate J protein [haloarchaeon 3A1_DGR] | 2.00e-178 | WP_039401004.1 |
| HLASA_2019 | 363 | hypothetical protein [haloarchaeon 3A1_DGR] | 7.00e-15 | WP_021074727.1 |
| HLASA_2020 | 681 | hypothetical protein [ | 3.00e-34 | WP_004268274.1 |
| HLASA_2021 | 879 | hypothetical protein [haloarchaeon 3A1_DGR] | 4.00e-97 | WP_021074730.1 |
| HLASA_2022 | 345 | hypothetical protein [haloarchaeon 3A1_DGR] | 3.00e-43 | WP_039401001.1 |
| HLASA_2023 | 564 | hypothetical protein [haloarchaeon 3A1_DGR] | 1.00e-35 | WP_021075289.1 |
| HLASA_2024 | 3123 | prophage pi3 protein 14 [ | 1.00e-55 | WP_008414607.1 |
| HLASA_2025 | 477 | hypothetical protein [haloarchaeon 3A1_DGR] | 1.00e-13 | WP_021074542.1 |
| HLASA_2026 | 1296 | hypothetical protein [haloarchaeon 3A1_DGR] | 2.00e-155 | WP_039400994.1 |
| HLASA_2027 | 588 | hypothetical protein [ | 1.00e-37 | WP_004268261.1 |
| HLASA_2028 | 447 | hypothetical protein [haloarchaeon 3A1_DGR] | 1.00e-45 | WP_039400992.1 |
| HLASA_2029 | 285 | hypothetical protein [haloarchaeon 3A1_DGR] | 6.00e-29 | WP_021074547.1 |
| HLASA_2030 | 387 | hypothetical protein EL22_16975 [ | 2.00e-05 | KDE59819.1 |
| HLASA_2031 | 366 | hypothetical protein HHTV1_22 [halovirus HHTV-1] | 2.00e-05 | YP_008058712.1 |
| HLASA_2032 | 1140 | major capsid protein go21 [halovirus HHTV-1] | 3.00e-91 | YP_008058711.1 |
| HLASA_2033 | 453 | acyl dehydratase [ | 0.032 | WP_014732690.1 |
| HLASA_2034 | 1221 | PREDICTED: myosin-9-like [ | 2.00e-04 | XP_010274858.1 |
| HLASA_2035 | 294 | no | ||
| HLASA_2036 | 216 | no | ||
| HLASA_2037 | 417 | no | ||
| HLASA_2038 | 339 | hypothetical protein [ | 5.00e-09 | WP_005577927.1 |
| HLASA_2039 | 669 | hypothetical protein [ | 9.00e-23 | WP_006649646.1 |
| HLASA_2040 | 381 | hypothetical protein [ | 2.00e-08 | WP_015300135.1 |
| HLASA_2041 | 1605 | uncharacterized protein BN903_58 [ | 1.00e-48 | CDK39659.1 |
| HLASA_2042 | 1710 | hypothetical protein HALG_00007 [ | 1.00e-131 | YP_008126542.1 |
| HLASA_2043 | 444 | hypothetical protein HCTV2_15 [halovirus HCTV-2] | 4.00e-29 | YP_008058377.1 |
| HLASA_2044 | 1320 | DNA methylase [ | 4.00e-120 | KDE56926.1 |
| HLASA_2045 | 219 | hypothetical protein OSG_eHP14_00030 [Environmental halophage eHP-14] | 4.00e-08 | AFH21986.1 |
| HLASA_2046 | 171 | hypothetical protein [ | 8.00e-05 | WP_005538080.1 |
| HLASA_2047 | 303 | no | ||
| HLASA_2048 | 3660 | hypothetical protein [ | 0 | WP_004217537.1 |
Fig. 8Localization of GIs on the chromosome of ‘Halanaeroarchaeum sulfurireducens’ M27-SA2, as predicted by SWGIS [46] a (grey arrows on circular map) and GOTHAM [47] b (red dots). Common predicted regions of both methods are highlighted in b) (blue arrows)