| Literature DB >> 29435649 |
Saber Khelaifia1, Aurelia Caputo2, Claudia Andrieu2, Frederique Cadoret2, Nicholas Armstrong2, Caroline Michelle2, Jean-Christophe Lagier2, Felix Djossou3, Pierre-Edouard Fournier2, Didier Raoult2,4.
Abstract
By applying the culturomics concept and using culture conditions containing a high salt concentration, we herein isolated the first known halophilic archaeon colonizing the human gut. Here we described its phenotypic and biochemical characterization as well as its genome annotation. Strain Arc-HrT (= CSUR P0974 = CECT 9307) was mesophile and grew optimally at 37 °C and pH 7. Strain Arc-HrT was also extremely halophilic with an optimal growth observed at 15% NaCl. It showed gram-negative cocci, was strictly aerobic, non-motile and non-spore-forming, and exhibited catalase and oxidase activities. The 4,015,175 bp long genome exhibits a G + C% content of 65.36% and contains 3911 protein-coding and 64 predicted RNA genes. PCR-amplified 16S rRNA gene of strain Arc-HrT yielded a 99.2% sequence similarity with Haloferax prahovense, the phylogenetically closest validly published species in the Haloferax genus. The DDH was of 50.70 ± 5.2% with H. prahovense, 53.70 ± 2.69% with H. volcanii, 50.90 ± 2.64% with H. alexandrinus, 52.90 ± 2.67% with H. gibbonsii and 54.30 ± 2.70% with H. lucentense. The data herein represented confirm strain Arc-HrT as a unique species and consequently we propose its classification as representative of a novel species belonging to the genus Haloferax, as Haloferax massiliense sp. nov.Entities:
Keywords: Culturomics; Haloferax massiliense; Halophilic archaea; Taxono-genomics
Mesh:
Year: 2018 PMID: 29435649 PMCID: PMC5862939 DOI: 10.1007/s00792-018-1011-1
Source DB: PubMed Journal: Extremophiles ISSN: 1431-0651 Impact factor: 2.395
Fig. 1Reference mass spectrum from Haloferax massiliense strain Arc-HrT. Spectra from 12 individual colonies were compared and a reference spectrum was generated
Fig. 2Phylogenetic tree highlighting the position of Haloferax massiliense strain Arc-HrT relative to other type strains within Haloferax, Halogeometricum, Halobellus and Halobaculum genus. The respective GenBank accession numbers for 16S rRNA genes are indicated in parenthesis. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using the maximum-likelihood method within the MEGA software. The scale bar represents 0.005% nucleotide sequence divergence
Fig. 3Transmission electron microscopy of Haloferax massiliense strain Arc-HrT, using a Morgani 268D (Philips) at an operating voltage of 80 keV. The scale bar represents 500 nm
Fig. 4Gel view comparing Haloferax massiliense strain Arc-HrT to other species within the genus Haloferax. The gel view displays the raw spectra of loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the relation between the color of a peak and the peak intensity, in arbitrary units. Displayed species are indicated on the left
Classification and general features of Haloferax massiliense strain Arc-HrT according to the MIGS recommendations (Field et al. 2008)
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain: | TAS (Woese et al. | |
| Phylum: | TAS (Cavalier-Smith | ||
| Class: | TAS (Grant et al. | ||
| Order: | TAS (Grant and Larsen | ||
| Family: | TAS (Grant and Larsen | ||
| Genus: | TSA (Torreblanca et al. | ||
| Species: | IDA | ||
| Type strain: Arc-HrT | IDA | ||
| Gram stain | negative | IDA | |
| Cell shape | Cocci | IDA | |
| Motility | Non motile | IDA | |
| Sporulation | Non spore-forming | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 37 °C | IDA | |
| pH | pH 6.5–8 | ||
| Optimum pH | 7 | ||
| MIGS-6.3 | Salinity | 10–25% | IDA |
| Optimum salinity | 15% NaCl | IDA | |
| MIGS-22 | Oxygen requirement | Strictly aerobic | IDA |
| Carbon source | Unknown | IDA | |
| Energy source | Unknown | IDA | |
| MIGS-6 | Habitat | Human gut | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| Pathogenicity | Unknown | NAS | |
| Biosafety level | 2 | IDA | |
| MIGS-14 | Isolation | Human feces | IDA |
| MIGS-4 | Geographic location | France | IDA |
| MIGS-5 | Sample collection time | December 2013 | IDA |
| MIGS-4.3 | Depth | surface | IDA |
| MIGS-4.4 | Altitude | 0 m above sea level | IDA |
aEvidence codes—IDA, Inferred from Direct Assay; TAS, Traceable Author Statement (i.e., a direct report exists in the literature); NAS, non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from http://www.geneontology.org/GO.evidence.shtml of the Gene Ontology project (Ashburner et al. 2000). If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements
Differential characteristics of Haloferax massiliense strain Arc-HrT, Haloferax prahovense (Enache et al. 2007), Haloferax volcanii (Torreblanca et al. 1986), Haloferax denitrificans (Tindall et al. 1989); 4, Haloferax mediterranei (Torreblanca et al. 1986), Haloferax gibbonsii (Juez et al. 1986), Haloferax alexandrinus (Asker and Ohta 2002) and Haloferax lucentense (Gutierrez et al. 2002). na: No available data
| Properties |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| Oxygen requirement | + | + | + | + | + | + | + | + |
| Gram stain | – | – | – | – | – | – | – | – |
| Salt requirement | + | + | + | + | + | + | + | + |
| Motility | – | – | – | – | – | + | – | + |
| Endospore formation | – | – | – | – | – | – | – | – |
| Indole | – | – | + | + | – | + | + | + |
| Tween 80 hydrolysis | + | + | + | – | – | + | + | + |
| Production of | ||||||||
| Alkaline phosphatase | + | na | na | na | na | na | + | na |
| Catalase | + | + | + | + | + | + | + | + |
| Oxidase | + | + | + | + | + | + | + | + |
| Nitrate reductase | + | – | + | + | – | + | + | – |
| Urease | – | na | na | – | na | na | – | na |
| β-galactosidase | – | na | + | + | + | na | + | – |
| – | Na | na | na | na | na | + | na | |
| Acid from | ||||||||
| – | – | + | na | + | + | + | + | |
| Ribose | – | + | – | – | na | na | + | na |
| Mannose | – | – | – | – | + | + | – | na |
| Mannitol | – | – | na | na | na | + | na | – |
| Sucrose | – | – | + | + | + | + | + | – |
| – | – | + | + | + | + | + | + | |
| + | + | + | + | + | + | + | + | |
| – | + | + | + | + | + | + | + | |
| – | + | – | – | – | + | – | – | |
| Gelatin hydrolysis | – | – | – | + | + | + | + | – |
| Starch hydrolysis | – | + | – | – | – | + | – | – |
| Casein hydrolysis | + | – | – | – | + | + | – | – |
| Habitat | Human Gut | Salt lake | Bottom sedimen | solar saltern | Solar salterns | Solar salt pond | solar saltern | Water of a saltern |
Fig. 5Circular representation of the Haloferax massiliense Arc-HrT genome. Circles from the center to the outside: GC screw (green/purple), GC content (green/purple) and contigs (orange/brown)
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | 1 mate-paired |
| MIGS-29 | Sequencing platforms | MiSeq Illumina |
| MIGS-31.2 | Sequencing coverage | 620 |
| MIGS-30 | Assemblers | Spades |
| MIGS-32 | Gene calling method | Prodigal |
| Genbank ID | CSTE01000001–CSTE01000008 | |
| Genbank date of release | Apr, 2014 | |
| MIGS-13 | Source material identifier | Arc-HrT |
| Project relevance | Mar, 2014 |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 4,015,175 | 100 |
| DNA coding region (bp) | 3,414,159 | 78.50 |
| DNA G + C content (bp) | 2,624,318 | 65.36 |
| Total protein-coding genes | 3911 | 100 |
| rRNA | 3 | 0.08 |
| tRNA | 57 | 1.46 |
| tmRNA | 0 | 0 |
| miscRNA | 4 | 0.11 |
| Genes with function prediction | 2825 | 72.23 |
| Genes assigned to COGs | 3116 | 79.68 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the 25 general COG functional categories
| Code | Description | Value | % of total |
|---|---|---|---|
| J | Translation, ribosomal structure and biogenesis | 165 | 4.22 |
| A | RNA processing and modification | 1 | 0.03 |
| K | Transcription | 172 | 4.40 |
| L | Replication, recombination and repair | 133 | 3.41 |
| B | Chromatin structure and dynamics | 6 | 0.16 |
| D | Cell cycle control, cell division, chromosome partitioning | 27 | 0.69 |
| Y | Nuclear structure | 0 | 0.0 |
| V | Defense mechanisms | 37 | 0.95 |
| T | Signal transduction mechanisms | 157 | 4.02 |
| M | Cell wall/membrane biogenesis | 119 | 3.05 |
| N | Cell motility | 38 | 0.98 |
| Z | Cytoskeleton | 0 | 0.0 |
| W | Extracellular structures | 0 | 0.0 |
| U | Intracellular trafficking and secretion, and vesicular transport | 35 | 0.90 |
| O | Posttranslational modification, protein turnover, chaperones | 113 | 2.89 |
| C | Energy production and conversion | 208 | 5.32 |
| G | Carbohydrate transport and metabolism | 219 | 5.6 |
| E | Amino acid transport and metabolism | 341 | 8.72 |
| F | Nucleotide transport and metabolism | 75 | 1.92 |
| H | Coenzyme transport and metabolism | 154 | 3.94 |
| I | Lipid transport and metabolism | 79 | 2.02 |
| P | Inorganic ion transport and metabolism | 202 | 5.17 |
| Q | Secondary metabolites biosynthesis, transport and catabolism | 54 | 1.39 |
| R | General function prediction only | 490 | 12.53 |
| S | Function unknown | 291 | 7.45 |
Fig. 6Distribution of functional classes of predicted genes according to cluster of orthologous groups of proteins from Haloferax massiliense strain Arc-HrT
Number of orthologous proteins shared between genomes (upper right), average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and number of proteins per genome (bold)
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 4086 | 2348 | 2754 | 2761 | 2690 | 2355 |
|
| 97.07 | 2995 | 2409 | 2400 | 2353 | 2412 |
|
| 92.1 | 93.21 | 4180 | 3238 | 2958 | 2696 |
|
| 92.08 | 92.84 | 98.83 | 4109 | 2973 | 2685 |
|
| 93.07 | 93.29 | 92.33 | 92.24 | 4259 | 2459 |
|
| 92.44 | 93.67 | 96.68 | 96.3 | 92,82 | 3053 |
Pairwise comparison of Haloferax massiliense with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| 100 ± 00% | 50.90 ± 2.64% | 52.90 ± 2.67% | 54.30 ± 2.70% | 53.70 ± 2.69% | 50.70 ± 5.2% |
|
| 100 ± 00% | 73.50 ± 2.90% | 50. 80 ± 2.63% | 53.50 ± 2.68% | 96 ± 2.4% | |
|
| 100 ± 00% | 51.90 ± 2.65% | 53.80 ± 2.69% | 72 ± 5.8% | ||
|
| 100 ± 00% | 82.20 ± 2.69% | 50.70 ± 5.3% | |||
|
| 100 ± 00% | 52.80 ± 5.3% | ||||
|
| 100 ± 00% |
The confidence intervals indicate the inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with the 16S rRNA (Fig. 4) and phylogenomic analyses as well as the GGDC results