| Literature DB >> 28878861 |
Lizbeth Sayavedra1, Manuel Kleiner2, Ruby Ponnudurai3, Stefan E Heiden3, Andrea Thürmer4, Horst Felbeck5, Rabea Schlüter6, Stefan M Sievert7, Rolf Daniel4, Thomas Schweder3,8, Stephanie Markert3,8.
Abstract
Bathymodiolus thermophilus, a mytilid mussel inhabiting the deep-sea hydrothermal vents of the East Pacific Rise, lives in symbiosis with chemosynthetic Gammaproteobacteria within its gills. The intracellular symbiont population synthesizes nutrients for the bivalve host using the reduced sulfur compounds emanating from the vents as energy source. As the symbiont is uncultured, comprehensive and detailed insights into its metabolism and its interactions with the host can only be obtained from culture-independent approaches such as genomics and proteomics. In this study, we report the first draft genome sequence of the sulfur-oxidizing symbiont of B. thermophilus, here tentatively named Candidatus Thioglobus thermophilus. The draft genome (3.1 Mb) harbors 3045 protein-coding genes. It revealed pathways for the use of sulfide and thiosulfate as energy sources and encodes the Calvin-Benson-Bassham cycle for CO2 fixation. Enzymes required for the synthesis of the tricarboxylic acid cycle intermediates oxaloacetate and succinate were absent, suggesting that these intermediates may be substituted by metabolites from external sources. We also detected a repertoire of genes associated with cell surface adhesion, bacteriotoxicity and phage immunity, which may perform symbiosis-specific roles in the B. thermophilus symbiosis.Entities:
Keywords: Autotrophy; Hydrothermal vents; Marine invertebrate symbiosis; Thiotrophy; Uncultured endosymbiont
Year: 2017 PMID: 28878861 PMCID: PMC5581435 DOI: 10.1186/s40793-017-0266-y
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Transmission electron micrographs of Candidatus Thioglobus thermophilus. B. thermophilus gill tissue was homogenized in a glass tissue grinder and subjected to crude density gradient centrifugation using Histodenz™ gradient medium. Subsamples were taken from two visible bands and fixed for electron microscopy (a and b). Both subsamples contained numerous free symbiont cells (S) as well as some intact host vacuoles (V) containing several symbiont cells, besides various other cellular components and host tissue debris. L: Lipid drop or mucus. Scale bar: 5 μm. Electron microscopy method details: samples were fixed in a) 1% glutaraldehyde, 2% paraformaldehyde in IBS (imidazole-buffered saline; 0.49 M NaCl, 30 mM MgSO4*7H2O, 11 mM CaCl2*2H2O, 3 mM KCl, 50 mM imidazole) and b) in 2.5% glutaraldehyde, 1.25% paraformaldehyde in IBS. After embedding in low-gelling agarose and postfixation in 1% osmium tetroxide in cacodylate buffer (0.1 M cacodylate; pH 7.0), samples were dehydrated in a graded ethanol series (30 to 100%) and embedded in a mixture of Epon and Spurr (1:2). Sections were cut on an ultramicrotome (Reichert Ultracut, Leica UK Ltd., Milton Keynes, UK), stained with 4% aqueous uranyl acetate for 5 min followed by lead citrate for 1 min and analyzed with a transmission electron microscope LEO 906 (Zeiss, Oberkochen, Germany)
Fig. 2Phylogenetic tree of Candidatus Thioglobus thermophilus and related free-living and host-associated sulfur oxidizers. Ca. T. thermophilus, the thiotrophic symbiont of Bathymodiolus thermophilus, is displayed in bold. The tree was inferred from closely related 16S rRNA gene sequences obtained from the SILVA database using the SILVA Incremental Aligner (SINA) [51] and was estimated with the 16S rRNA sequence of 46 bacteria. The final alignment covered 1138 nucleotides. Sequence alignment and phylogenetic analysis were performed using the MEGA7 software tool [52]. The phylogenetic tree was constructed using the Maximum Likelihood method based on the Tamura-Nei model implemented in MEGA7 [53]. Branch bootstrap support values were calculated using 1000 replicates and are displayed as circles (black: ≥ 90%, white: ≥ 60%). For the sake of clarity some organisms were merged into groups (wedges): auncultured clones (KC682721, KC682765, JQ678401, AB193934); bwhale fall symbionts (HE814589, HE814588, HE814591 HE814585); cuncultured clones (FM246509, FM246513); duncultured clones (JQ678344, JQ678392); eMytilidae symbionts (AM503921, AM503923); fVesicomyidae symbionts (EU403432, EU403431, CP000488* 1081274–1,082,807, AP009247* 948400–949,934); gLucinidae symbionts (X84979, M99448, M90415); htube worm symbionts (NZ_AFOC01000137* 503–2033, DQ660821, NZ_AFZB01000059* 4132–5662). The lucinid clam symbionts, the vestimentiferan tube worm symbionts, and the free-living Thiomicrospira crunogena XCL-2 were included as outgroup. Branches that are not highlighted by colors represent free-living relatives. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. *these NCBI accession numbers refer to whole genome submissions and not to individually submitted 16S rRNA gene sequences (start and stop positions of the 16S rRNA gene are given after the asterisk). JdFR: Juan de Fuca Ridge, EPR: East Pacific Rise, MAR: Mid-Atlantic Ridge, OMZ: oxygen minimum zone, MFZ: Mendocino Fracture Zone, SBB: Santa Barbara Basin, WH: Woods Hole
Classification and general features of Candidatus Thioglobus thermophilus, the Bathymodiolus thermophilus gill endosymbiont, according to the MIGS recommendations [42]
| MIGS ID | Property | Term | Evidence code a |
|---|---|---|---|
| Classification | Domain Bacteria | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order Unclassified | NAS | ||
| Family Unclassified, S-oxidizing symbionts | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain NA | NAS | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Coccoid or short rods | TAS [ | |
| Motility | Non-motile inside the host; | NAS | |
| Sporulation | Unknown | ||
| Temperature range | 4 °C - 14 °C (shows psychrophilic growth characteristics) | TAS [ | |
| Optimum temperature | Unknown | ||
| pH range; Optimum | 6.9–8.0; unknown | TAS [ | |
| Carbon source | CO2 (autotroph) | TAS [ | |
| Energy source | H2S and S2O3 2− (chemotroph) | TAS [ | |
| Terminal electron acceptor | O2, NO3 2− | TAS [ | |
| MIGS-6 | Habitat | Intracellular endosymbiont of marine bivalve inhabiting hydrothermal vents | IDA |
| MIGS-6.3 | Salinity | 35.42 PSU | IDA |
| MIGS − 22 | Oxygen | Aerobic (facultative) | TAS [ |
| MIGS-15 | Biotic relationship | Symbiont | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| MIGS-4 | Geographic location | East Pacific Rise (EPR) 9°N | IDA |
| MIGS-5 | Sample collection | January 2014 | IDA |
| MIGS-4.1 | Latitude | 9° 50.39′ N | IDA |
| MIGS-4.2 | Longitude | 104° 17.49′ W | IDA |
| MIGS-4.3 | Altitude | -2511 m | IDA |
aEvidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [50]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Draft genome |
| MIGS-28 | Libraries used | Illumina 112 bp paired-end library (Nextera) |
| MIGS-29 | Sequencing platforms | Genome analyzer II x |
| MIGS-31.2 | Fold coverage | 86× |
| MIGS-30 | Assemblers | SPAdes v. 3.1.1 |
| MIGS-32 | Gene calling method | GeneMarkS+ (NCBI PGAP) |
| Locus Tag | BGC33 | |
| GenBank ID | o MIQH00000000 | |
| GenBank date of release | 11/16/2016 | |
| GOLD ID | - | |
| BioProject | PRJNA339702 | |
| MIGS-13 | Source material identifier | - |
| Project relevance | Vent ecosystems, Chemosynthetic symbioses, Environmental microbiology |
Genome statistics
| Attribute | Value | % |
|---|---|---|
| Genome size (bp) a | 3,088,407 | 100 |
| DNA coding (bp) | 2,621,999 | 84.9 |
| DNA G + C (bp) | 1,164,329 | 37.7 |
| DNA scaffolds | 1281 | 100 |
| Total genes | 3097 | 100 |
| Protein-coding genes | 3045 | 98.3 |
| RNA genes | 46 | 1.5 |
| Pseudo genes | 6 | 0.2 |
| Genes in internal clusters | - | - |
| Genes with function prediction b | 2051 | 67.4 |
| Genes assigned to COGs | 1659 | 54.5 |
| Genes with Pfam domains | 1984 | 65.2 |
| Genes with signal peptides c | 337 | 11.1 |
| Genes with transmembrane helices | 626 | 20.6 |
| CRISPR repeats | 10 |
aAll 1281 scaffolds >200 bp. 478 of these (37.3%) are scaffolds >1000 bp, comprising 2,726,561 bp (88.3% of all base pairs)
bGenes with function prediction are all 3045 protein-coding genes minus those 994 genes annotated as “hypothetical proteins” that have no COG category or fall into the COG categories “unknown function” or “general function prediction only” and that have no Pfam domain or a Pfam “domain of unknown function”
cIncludes genes for which a signal peptide was predicted with at least two of the three tools used. Percentages of genes with function prediction, COGs, Pfam domains, signal peptides and transmembrane helices were calculated against a total of 3045 protein-coding genes
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 179 | 5.88 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 50 | 1.64 | Transcription |
| L | 126 | 4.14 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 20 | 0.66 | Cell cycle control, cell division, chromosome partitioning |
| V | 81 | 2.66 | Defense mechanisms |
| T | 40 | 1.31 | Signal transduction mechanisms |
| M | 105 | 3.45 | Cell wall/membrane biogenesis |
| N | 6 | 0.20 | Cell motility |
| U | 47 | 1.54 | Intracellular trafficking and secretion |
| O | 99 | 3.25 | Posttranslational modification, protein turnover, chaperones |
| C | 115 | 3.78 | Energy production and conversion |
| G | 34 | 1.12 | Carbohydrate transport and metabolism |
| E | 117 | 3.84 | Amino acid transport and metabolism |
| F | 46 | 1.51 | Nucleotide transport and metabolism |
| H | 104 | 3.42 | Coenzyme transport and metabolism |
| I | 46 | 1.51 | Lipid transport and metabolism |
| P | 61 | 2.00 | Inorganic ion transport and metabolism |
| Q | 98 | 3.22 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 175 | 5.75 | General function prediction only |
| X | 32 | 1.05 | Mobilome: prophages, transposons |
| W | 9 | 0.30 | Extracellular structures |
| S | 68 | 2.23 | Function unknown |
| - | 1386 | 45.52 | Not in COGs |
The percentage is based on a total of 3045 protein-coding genes