| Literature DB >> 27871231 |
Juliane Wippler1,1, Manuel Kleiner2,3, Christian Lott4,5, Alexander Gruhl4, Paul E Abraham6, Richard J Giannone6, Jacque C Young6,7, Robert L Hettich6, Nicole Dubilier4.
Abstract
BACKGROUND: The gutless marine worm Olavius algarvensis has a completely reduced digestive and excretory system, and lives in an obligate nutritional symbiosis with bacterial symbionts. While considerable knowledge has been gained of the symbionts, the host has remained largely unstudied. Here, we generated transcriptomes and proteomes of O. algarvensis to better understand how this annelid worm gains nutrition from its symbionts, how it adapted physiologically to a symbiotic lifestyle, and how its innate immune system recognizes and responds to its symbiotic microbiota.Entities:
Keywords: Annelida; Carbon monoxide; Chemosynthetic symbiosis; FREP; Immunology; Oligochaeta; PGRP; Phallodrilinae; RNA-Seq; Respiratory pigment; SRCR
Mesh:
Substances:
Year: 2016 PMID: 27871231 PMCID: PMC5117596 DOI: 10.1186/s12864-016-3293-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic overview of proposed molecular host-symbiont interactions. a) Light micrograph of an Olavius algarvensis worm. Scale bar 5 mm. b) Light micrograph of a longitudinal section through O. algarvensis, tissue stained with toluidine blue. Scale bar 50 μm. C) Transmission electron micrograph of the symbiotic region, longitudinal section. Red asterisks, symbiont cells; black arrow, cuticle; white arrow, epidermal cell extensions. Scale bar 5 μm. The boxes in a), b) and c) frame regions corresponding to the image to their right (in a) and b)) or below (in c). Images a, b, and c do not show the same worm specimen. d) Schematic overview of the main groups of expressed pattern recognition molecules, components of the Toll immune signaling pathway and proposed interactions between the host and its symbionts. Ig, immunoglobulin domain proteins; PGRP, peptidoglycan recognition proteins; SRCR scavenger receptor-like cysteine rich proteins; TLR, Toll-like receptors; FREP, fibrinogen-related proteins; AMPs, antimicrobial proteins
Overview of proposed host-symbiont interaction proteins
| Functional category | Protein family | Transcripts a) | Proteins b) |
|---|---|---|---|
|
| |||
| peptidoglycan recognition proteins (PGRPs) | 6 (16) | 1 | |
| C-type lectins | 33 (119) | 5 | |
| R-type lectins | 6 (18) | 4 | |
| SUEL/rhamnose-binding lectins | 7 (22) | 1 | |
| galectins | 3 (3) | 1 | |
| fucolectin | 1 (23) | 0 | |
| fibrinogen-related proteins (FREPs) | 27 (161) | 1 | |
| toll-like/variable lymphocyte receptor-like (TLR/VLR) | 13 (52) | 1 | |
| scavenger receptor cysteine-rich (SRCR) domain proteins | 25 (164) | 1 | |
| beta-1,3-glucan binding protein | 1 (1) | 1 | |
| novel immunoglobulin I-set proteins | 16 (17) | 1 | |
| novel immunoglobulin V-set proteins | 9 (22) | 1 | |
|
| |||
| lumbricin | 1 (1) | 1 | |
| invertebrate-type lysozyme | 1 (1) | 0 | |
| bactericidal permeability increasing protein BPI | 1 (3) | 1 | |
| insect defensin/reeler-like proteins | 4 (8) | 1 | |
| cysteine-rich secretory proteins (CRSPs) | 6 (28) | 2 | |
| membrane attack complex/perforin | 2 (23) | 0 | |
|
| |||
| ROS modulator 1 | 2 (3) | 1 | |
| alpha-2-macroglobulin | 10 (24) | 1 | |
| kazal-type serpin | 2 (8) | 0 | |
| kunitz-type serpin | 1 (2) | 0 | |
| leukocyte elastase inhibitors | 5 (25) | 1 | |
| phosphatidylethanolamine-binding protein PEBP | 3 (5) | 1 | |
|
| |||
| Toll/interleukin-1 receptor (TIR) domain proteins | 5 (9) | 0 | |
| NF-kappa-B inhibitor Cactus | 2 (6) | 0 | |
| dorsal protein | 2 (3) | 1 | |
| evolutionarily conserved signaling intermediate in Toll (ECSIT) | 1 (1) | 0 | |
| Pelle protein | 1 (1) | 0 | |
| Relish protein | 1 (6) | 0 | |
| mitogen-activated protein kinase kinase kinase 7 (TAK1) | 1 (1) | 0 | |
| I-kappa-B-kinase alpha (IKK α) | 1 (1) | 0 | |
| I-kappa-B-kinase beta (IKK β) | 1 (5) | 0 | |
| interleukin-1 receptor-associated kinase 1 (IRAK1) | 1 (2) | 0 | |
| mitogen-activated protein kinase kinase kinase 4 (MEKK4) | 1 (1) | 0 | |
| sterile alpha and TIR motif-containing protein (SARM) | 1 (3) | 0 | |
| Toll-interacting protein Tollip | 1 (2) | 1 | |
| (LPS-induced) tumor necrosis factor (TNF) | 3 (7) | 0 | |
| Tumor necrosis factor (TNF) | 4 (6) | 0 | |
| tumor necrosis factor alpha-induced protein 3 (TNFAIP3) | 1 (3) | 0 | |
| tumor necrosis factor receptor associated proteins (TRAF) | 9 (16) | 0 | |
| IFN regulatory factor | 8 (10) | 0 | |
| IFN regulatory factor-binding protein | 1 (1) | 0 | |
| IFN-induced GTPase | 7 (23) | 1 | |
| macrophage migration inhibitory factor (MIF) | 3 (24) | 1 | |
| ILN enhancer binding factor 2 | 2 (4) | 1 | |
| ILN-16 | 1 (1) | 0 | |
|
| |||
| carboxypeptidases | 11 (22) | 0 | |
| cathepsins total | 15 (28) | 4 | |
| cathepsin B | 3 (12) | 1 | |
| cathepsin C | 2 (4) | 0 | |
| cathepsin F | 3 (3) | 1 | |
| cathepsin L | 5 (6) | 2 | |
| cathepsin O | 1 (2) | 0 | |
| cathepsin Z | 1 (1) | 0 | |
| chymotrypsins | 3 (17) | 1 | |
| pancreatic elastase | 1 (1) | 1 | |
| alpha amylase | 2 (3) | 1 | |
| lysosomal alpha glucosidase | 1 (3) | 1 | |
| acid trehalase | 2 (4) | 1 | |
| sucrase-isomaltase | 1 (1) | 0 | |
| lysosomal acid lipase | 1 (1) | 0 | |
|
| |||
| hemerythrin | 2 (2) | 2 | |
| giant extracellular hemoglobin, globin chains | 12 (16) | 8 | |
| giant extracellular hemoglobin, linker chains | 6 (18) | 5 | |
| sum | 316 (1032) | 60 | |
For more extensive details see Additional file 2: Table S8
a)Number of transcripts defined as Trinity components, which approximately correspond to genes; see [13]; in parentheses: number of contigs (isoforms or fragments)
b)Number of unique proteins
Difference in symbiont protein content in fresh worms compared to starved worms
| Symbionts fresh worms | Symbionts starved worms | |
|---|---|---|
| average nSpC | 3050.30 | 1869.77 |
| standard deviation | 421.88 | 122.47 |
| # replicates | 3 | 3 |
| p-value | 0.00963 | |
Significant differences between fresh and starved samples were determined with the Student's t-Test; nSpC, normalized spectral counts. For more extensive details see Additional file 1: Table S3
Digestive enzymes expressed in Olavius algarvensis
| Transcript ID a) | Protein ID | Annotation | Consensus localization evidenceb) | Substrate or function |
|---|---|---|---|---|
|
| ||||
| comp310626_c3 | n.d. | pancreatic carboxypeptidase A1 | secreted | release of C-terminal amino acids |
| comp330196_c1 | n.d. | pancreatic carboxypeptidase A2 | secreted | release of C-terminal amino acids |
| comp209868_c0 | n.d. | pancreatic carboxypeptidase A2 | secreted | release of C-terminal amino acids |
| comp329532_c3 | n.d. | pancreatic carboxypeptidase A2 | secreted or membrane | release of C-terminal amino acids |
| comp328734_c12 | n.d. | carboxypeptidase | secreted | peptides |
| comp328734_c1 | n.d. | carboxypeptidase | secreted | peptides |
| comp328734_c4 | n.d. | carboxypeptidase | secreted | peptides |
| comp326419_c0 | n.d. | uncharacterized carboxypeptidase | secreted | peptides |
| comp330196_c2 | n.d. | uncharacterized carboxypeptidase | secreted | peptides |
| comp319717_c3 | n.d. | lysosomal Pro-X carboxypeptidase | lysosomal | proline - amino acid bonds |
| comp320275_c1 | n.d. | chymotrypsin A | secreted | Tyr-/Trp-/Phe-/Leu-|-Xaa bonds |
| comp306409_c1 | n.d. | chymotrypsin B | secreted | Tyr-/Trp-/Phe-/Leu-|-Xaa bonds |
| comp334148_c2 | BF11_334148_c2_seq1_11 BF11_334148_c2_seq2_10 | chymotrypsin-like protease ctrl-1 | secreted | proteins |
| comp331491_c1 | BF11_331491_c1_seq1_7 | pancreatic elastase | secreted | proteins |
| comp334775_c2 | n.d. | cathepsin B | secreted | Arg – Arg/–Xaa bonds |
| comp335560_c0 | BF11_335560_c0_seq5_5 BF11_335560_c0_seq9_5 | cathepsin B | lysosomal | Arg – Arg/–Xaa bonds |
| comp306522_c2 | n.d. | cathepsin F | lysosomal | peptides, cleaves Phe/Leu |
| comp306522_c3 | n.d. | cathepsin F | lysosomal | peptides, cleaves Phe/Leu |
| comp308536_c0 | BF11_308536_c0_seq1_12 | cathepsin F | lysosomal | peptides, cleaves Phe/Leu |
| comp283346_c1 | n.d. | cathepsin L | lysosomal | proteins |
| comp306922_c1 | BF11_306922_c1_seq1_34 | cathepsin L | lysosomal | proteins |
| comp315575_c0 | n.d. | cathepsin L | lysosomal | proteins |
| comp315575_c2 | FD_315575_c2_seq1:64:270:1:+ | cathepsin L | lysosomal | proteins |
| comp328653_c0 | n.d. | cathepsin L | lysosomal | proteins |
| comp329800_c6 | n.d. | cathepsin O | lysosomal | peptides (endopeptidase) |
| comp314408_c1 | n.d. | cathepsin Z | lysosomal | C-terminal amino acids (not Pro) |
| comp308700_c0 | n.d. | cathepsin C (dipeptidyl peptidase I) | lysosomal | release of an N-terminal dipeptide |
| comp329456_c1 | n.d. | cathepsin C (dipeptidyl peptidase I) | lysosomal | release of an N-terminal dipeptide |
| comp328746_c4 | n.d. | alpha amylase | secreted | α1- > 4 glycosidic bonds |
| comp324906_c1 | BF11_324906_c1_seq1_9 BF11_324906_c1_seq2_12 | alpha amylase | secreted or membrane | α1- > 4 glycosidic bonds |
| comp335205_c1 | BF11_335205_c1_seq1_20 | lysosomal alpha glucosidase | secreted or membrane | α1- > 4 glycosidic bonds |
| comp320084_c0 | n.d. | lysosomal beta-mannosidase | lysosomal | cleaves terminal β-D-mannose |
| comp334411_c3 | n.d. | sucrase-isomaltase | secreted or membrane | α1- > 6 glycosidic bonds |
| comp329957_c8 | BF11_329957_c8_seq2_17 | acid trehalase | secreted or membrane | trehalose - > glucose |
| comp335402_c7 | n.d. | acid trehalase | secreted or membrane | trehalose - > glucose |
|
| ||||
| comp22535_c0 | n.d. | lysosomal acid lipase | secreted | hydrolyzes steryl esters |
| comp249291_c0 | n.d. | lysozyme | secreted | peptidoglycan (glycosidic bonds) |
| comp250229_c0 | BF11_250229_c0_seq1_15 BF11_250229_c0_seq2_17 | peptidoglycan recognition protein | secreted | peptidoglycan (peptide bonds) |
| comp335695_c10 | n.d. | peptidoglycan recognition protein | secreted or membrane | peptidoglycan (peptide bonds) |
| comp330541_c4 | n.d. | peptidoglycan recognition protein | secreted | peptidoglycan (peptide bonds) |
| comp314994_c0 | n.d. | peptidoglycan recognition protein | secreted or membrane | peptidoglycan (peptide bonds) |
| comp332570_c2 | n.d. | peptidoglycan recognition protein | secreted or membrane | peptidoglycan (peptide bonds) |
a)Defined as Trinity components; see [13]
b)Probable subcellular localization of proteins based on the results of TMHMM [117, 118], SignalP-4 [119], Phobius [120], TargetP [121], DISTILL [122], LocTree3 [123], BaCelLo [124], and iLoc-Animal [125]. Secreted – enters secretory pathway and is either excreted to the extracellular space or confined to non-cytoplasmic insides of intracellular compartments; membrane – predicted to be membrane integral; lysosomal – predicted to be targeted towards the lysosome. See Additional file 3: Table S14 for details on localization evidence
Fig. 2Domain structures of peptidoglycan recognition proteins. Structure of conserved functional domains in Olavius algarvensis peptidoglycan recognition proteins; OalgPGRP1: comp330541_c4; OalgPGRP2: comp250229_c0; OalgPGRP3: comp335695_c10; OalgPGRP4: comp314994_c0; OalgPGRP5: comp332570_c2; OalgPGRP6: comp1100768_c0
Fig. 3Protein alignment of hemoglobin A2 chains. Protein alignment of hemoglobin A2 chains from marine and terrestrial annelids: Riftia pachyptila (GenBank accession number: CAD29155), Tevnia jerichonana (GenBank accession number: AAP04530), Lamellibrachia satsuma (GenBank accession number: BAN58231), Lamellibrachia sp. XB-2003 (GenBank accession number: AAP04528), Oasisia alvinae (GenBank accession number: AAP04531), Oligobrachia mashikoi (GenBank accession number: Q7M413), Arenicola marina (GenBank accession numbers: A2a, CAI56308; A2b, CAJ32740; A2c, CAJ32741), Lumbricus rubellus (GenBank accession number: BF422675.2), Lumbricus terrestris (GenBank accession number: P02218), Tylorrhynchus heterochaetus (GenBank accession number: P09966) and Olavius algarvensis (comp287449_c0_seq1)
Fig. 4Protein alignment of peptidoglycan recognition proteins. Protein alignment of PGRP domain sequences from different model organisms and Olavius algarvensis; Dmel Drosophila melanogaster (GenBank accession numbers: PGRP-SA, Q9VYX7; PGRP-LA, Q95T64; PGRP-LB, Q8INK6; PGRP-LC, Q9GNK5), Mmus Mus musculus (GenBank accession numbers: PGRP1, O88593; PGRP2, Q8VCS0; PGRP3, A1A547; PGRP4, Q0VB07), Hsap Homo sapiens (GenBank accession numbers: PGRP-S, O75594; PGRP-L, Q96PD5), Oalg Olavius algarvensis (OalgPGRP1, comp330541_c4; OalgPGRP2, comp250229_c0; OalgPGRP3, comp335695_c10; OalgPGRP4, comp314994_c0; OalgPGRP5, comp332570_c2; OalgPGRP6, comp1100768_c0). Conserved active-site residues that confer amidase activity are shown in red; mutation of at least one active-site residue (pink) removes amidase activity
Lectins expressed in Olavius algarvensis
| Lectin group | Transcripts a) | Proteins b) | Domain architectures | Potential functions |
|---|---|---|---|---|
|
| 1 | 0 |
| glycan recognition and host defense [ |
|
| 33 | 5 |
| glycan recognition [ |
|
| 6 | 4 |
| glycan recognition [ |
|
| 3 | 1 |
| [ |
|
| 7 | 1 |
| MAMP recognition [ |
|
| 27 | 1 |
| [ |
a) Number of transcripts, defined as Trinity components; see [13]
b) Number of identified proteins in the proteomes