| Literature DB >> 28094347 |
Matthew J Neave1,2, Craig T Michell1, Amy Apprill2, Christian R Voolstra1.
Abstract
Endozoicomonas bacteria are globally distributed and often abundantly associated with diverse marine hosts including reef-building corals, yet their function remains unknown. In this study we generated novel Endozoicomonas genomes from single cells and metagenomes obtained directly from the corals Stylophora pistillata, Pocillopora verrucosa, and Acropora humilis. We then compared these culture-independent genomes to existing genomes of bacterial isolates acquired from a sponge, sea slug, and coral to examine the functional landscape of this enigmatic genus. Sequencing and analysis of single cells and metagenomes resulted in four novel genomes with 60-76% and 81-90% genome completeness, respectively. These data also confirmed that Endozoicomonas genomes are large and are not streamlined for an obligate endosymbiotic lifestyle, implying that they have free-living stages. All genomes show an enrichment of genes associated with carbon sugar transport and utilization and protein secretion, potentially indicating that Endozoicomonas contribute to the cycling of carbohydrates and the provision of proteins to their respective hosts. Importantly, besides these commonalities, the genomes showed evidence for differential functional specificity and diversification, including genes for the production of amino acids. Given this metabolic diversity of Endozoicomonas we propose that different genotypes play disparate roles and have diversified in concert with their hosts.Entities:
Mesh:
Year: 2017 PMID: 28094347 PMCID: PMC5240137 DOI: 10.1038/srep40579
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Assembly quality and RAST (Rapid Annotation using Subsystem Technology) annotation results for the Endozoicomonas genomes.
| Genome | RAST ID | Assembly size (bp) | Contigs | Scaffolds | Scaffold N50 (bp) | Max scaffold size (bp) | CDS | RNAs | GC% |
|---|---|---|---|---|---|---|---|---|---|
| 1121862.6 | 5,569,560 | 21 | 2 | 5,569,560 | 5,569560 | 5,021 | 104 | 46.8 | |
| 1027273.4 | 5,602,297 | 83 | 20 | 1,015,541 | 1,412,099 | 5,350 | 114 | 48.5 | |
| 1137799.4 | 6,342,227 | 131 | 31 | 917,146 | 1,695,894 | 5,995 | 95 | 47.1 | |
| 6666666.127878 | 3,624,544 | 1,553 | 1,548 | 10,138 | 63,630 | 3,463 | 55 | 49.6 | |
| 6666666.127879 | 3,413,810 | 1,135 | 1,132 | 18,779 | 107,991 | 3,383 | 54 | 50.6 | |
| 305899.13 | 2,304,083 | 1,698 | 1,698 | 1,686 | 8,572 | 2,142 | 19 | 49.4 | |
| 305899.6 | 5,277,023 | 3,342 | 3,342 | 2,052 | 9,019 | 4,420 | 42 | 53.9 |
1E. elysicola, E. montiporae, and E. numazuensis from Neave et al.35.
Figure 1Endozoicomonas pan-genome showing (A) core and accessory genes, and (B) phylogenetic relationship of the Endozoicomonas genomes based on core protein sequences. In (A), genes shared between genomes are indicated by overlapping segments and the outermost track indicates genome size (million base pairs). In (B), the scale bar indicates the mean number of substitutions per site and confidence from 1000 bootstrap replicates are shown on the branches.
Genomes used for comparative Gene Ontology (GO) analysis.
| Genome | GenBank ID# | Genome size (bps) | Habitat |
|---|---|---|---|
| | PRJNA16064 | 7,215,267 | Sediment |
| | PRJNA182405 | 6,564,965 | Sediment |
| | PRJNA221029 | 3,464,094 | Brine |
| | PRJNA235145 | 3,342,098 | Seawater |
| | PRJNA185608 | 4,505,834 | Seawater |
| | PRJNA185609 | 3,709,807 | Seawater |
| | PRJNA191058 | 4,505,254 | Deep sea |
| | PRJEB1425 | 3,920,328 | Sediment |
| | PRJNA182446 | 5,798,664 | Sediment |
| SAR86A | PRJNA76773 | 1,250,389 | Seawater |
| SAR86B | PRJNA76775 | 1,749,017 | Seawater |
| SAR86E | PRJNA170317 | 1,396,800 | Seawater |
| | PRJNA272 | 1,267,782 | Fruit fly |
| | PRJNA176303 | 1,295,804 | Fruit fly |
| | PRJNA204100 | 4,575,622 | Sediment |
| | PRJNA13391 | 4,845,257 | Seawater |
| | PRJNA13392 | 4,840,251 | Seawater |
| | PRJNA12986 | 4,273,718 | Seawater, |
| | PRJNA19393 | 4,503,336 | Seawater, |
Enriched gene ontology (GO) terms in the biological process category for the Endozoicomonas genomes.
| Term | Function | Annotated | Expected | Fisher’s p-value | |
|---|---|---|---|---|---|
| GO:0006259 | DNA metabolic process | 2124 | 803 | 607.32 | 1.5e-21 |
| GO:0006313 | transposition, DNA-mediated | 450 | 216 | 128.67 | 1.6e-18 |
| GO:0032196 | transposition | 450 | 216 | 128.67 | 1.6e-18 |
| GO:0006310 | DNA recombination | 830 | 342 | 237.32 | 2.2e-15 |
| GO:0046903 | secretion | 424 | 194 | 121.24 | 3.4e-14 |
| GO:0009306 | protein secretion | 417 | 190 | 119.23 | 1.0e-13 |
| GO:0032940 | secretion by cell | 417 | 190 | 119.23 | 1.0e-13 |
| GO:0008643 | carbohydrate transport | 262 | 130 | 74.91 | 4.9e-13 |
| GO:0006024 | glycosaminoglycan biosynthetic process | 21 | 21 | 6 | 3.8e-12 |
| GO:0033036 | macromolecule localization | 808 | 320 | 231.03 | 6.4e-12 |
| GO:0071702 | organic substance transport | 1220 | 648 | 441.67 | <1e-30 |
| GO:0006259 | DNA metabolic process | 1687 | 803 | 610.74 | 2.1e-23 |
| GO:0008643 | carbohydrate transport | 183 | 130 | 66.25 | 7.8e-22 |
| GO:0006310 | DNA recombination | 629 | 342 | 227.72 | 4.5e-21 |
| GO:0006313 | transposition, DNA-mediated | 357 | 216 | 129.24 | 5.4e-21 |
| GO:0009401 | phosphoenolpyruvate-dependent sugar phosphotransferase | 107 | 86 | 38.74 | 8.5e-21 |
| GO:0044765 | single-organism transport | 2661 | 1180 | 963.36 | 4.1e-20 |
| GO:0098656 | anion transmembrane transport | 78 | 64 | 28.24 | 1.1e-16 |
| GO:0006835 | dicarboxylic acid transport | 65 | 55 | 23.53 | 1.1e-15 |
| GO:1903825 | organic acid transmembrane transport | 70 | 56 | 25.34 | 7.5e-14 |
Figure 2Percentage of Endozoicomonas genes annotated into high level functions within the RAST (Rapid Annotation using Subsystem Technology) subsystem classifications.
Figure 3Percentage of Endozoicomonas genes in the RAST (Rapid Annotation using Subsystem Technology) amino acids and derivatives classification.