| Literature DB >> 27389690 |
Jason Macrander1, Michael Broe2, Marymegan Daly2.
Abstract
Cnidarians represent one of the few groups of venomous animals that lack a centralized venom transmission system. Instead, they are equipped with stinging capsules collectively known as nematocysts. Nematocysts vary in abundance and type across different tissues; however, the venom composition in most species remains unknown. Depending on the tissue type, the venom composition in sea anemones may be vital for predation, defense, or digestion. Using a tissue-specific RNA-seq approach, we characterize the venom assemblage in the tentacles, mesenterial filaments, and column for three species of sea anemone (Anemonia sulcata, Heteractis crispa, and Megalactis griffithsi). These taxa vary with regard to inferred venom potency, symbiont abundance, and nematocyst diversity. We show that there is significant variation in abundance of toxin-like genes across tissues and species. Although the cumulative toxin abundance for the column was consistently the lowest, contributions to the overall toxin assemblage varied considerably among tissues for different toxin types. Our gene ontology (GO) analyses also show sharp contrasts between conserved GO groups emerging from whole transcriptome analysis and tissue-specific expression among GO groups in our differential expression analysis. This study provides a framework for future characterization of tissue-specific venom and other functionally important genes in this lineage of simple bodied animals.Entities:
Keywords: column; gene ontology; mesenterial filaments; tentacles; toxins
Mesh:
Substances:
Year: 2016 PMID: 27389690 PMCID: PMC5010892 DOI: 10.1093/gbe/evw155
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Tissues and focal taxa. Comparison of the general morphology of the three focal species (A) A. sulcata, (B) H. crispa, and (C) M. griffithsi. Tentacle morphology varies considerably across these species with long fusiform tentacles (A. sulcata), short filiform tentacles (H. crispa), and broad and highly branching tentacles with acrospheres (M. griffithsi). Unlike the many species of sea anemone that have their column exposed, our focal taxa have their column protected to some degree by either tentacles (A. sulcata), a broad oral disk (H. crispa), or buried in the sediment (M. griffithsi).
Summary of Sequencing, Cleanup, and Assembly of Tissue-Specific and Combined Transcriptomes
| Species | Sequencing | Trimmomatic | Trinity | CEGMA | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Tissue | Yield (Mbases) | # PE Reads (2 × 100 bp) | F and R Recovered | F Only | R Only | Dropped | Isoforms | Genes | N50 | Complete (%) |
| Tentacle | 8,793 | 43,966,062 | 42,158,521 | 1,440,580 | 242,354 | 124,607 | 116,940 | 90,709 | 1,167 | 75 |
| EC | 40,114,944 | 32,725,211 | 3,534,322 | 2,282,523 | 1,572,888 | 54,365 | 39,503 | 1,262 | 77.02 | |
| Column | 10,480 | 52,402,099 | 49,231,820 | 2,561,688 | 385,801 | 222,790 | 134,535 | 105,984 | 1,052 | 90.73 |
| EC | 47,615,852 | 36,988,265 | 4,813,831 | 3,185,361 | 2,628,395 | 55,868 | 41,679 | 1,242 | 81.42 | |
| Filament | 14,657 | 73,285,750 | 69,599,682 | 3,027,316 | 411,698 | 247,054 | 167,752 | 140,339 | 789 | 88.31 |
| EC | 63,044,097 | 51,425,852 | 5,771,906 | 3,334,536 | 2,511,803 | 45,421 | 35,879 | 953 | 59.27 | |
| Combined | 33,930 | 169,653,911 | 160,990,023 | 7,029,584 | 1,039,853 | 594,451 | 303,773 | 245,737 | 1,186 | 94.76 |
| Tentacle | 8,054 | 40,269,872 | 38,950,621 | 1,071,964 | 170,250 | 77,037 | 186,284 | 142,194 | 1,227 | 93.55 |
| EC | 34,831,282 | 28,422,326 | 2,916,966 | 2,165,659 | 1,326,331 | 112,900 | 85,627 | 1,162 | 85.05 | |
| Column | 11,629 | 58,142,549 | 55,456,896 | 2,220,420 | 318,314 | 146,919 | — | — | — | — |
| EC | 15,862,609 | 12,892,450 | 1,480,601 | 919,042 | 570,516 | 103,035 | 97,215 | 289 | 15.73 | |
| Filament | 6,975 | 34,877,218 | 33,252,993 | 1,315,400 | 207,390 | 101,435 | 307,498 | 275,492 | 421 | 55.25 |
| EC | 23,826,103 | 19,561,405 | 1,915,171 | 1,502,125 | 847,402 | 67,733 | 56,039 | 711 | 43.95 | |
| Combined | 26,658 | 133,289,639 | 127,660,510 | 4,607,784 | 695,954 | 325,391 | 655,116 | 581,957 | 545 | 94.34 |
| T. Lobe | 7,201 | 33,993,938 | 31,743,659 | 2,008,599 | 147,255 | 94,425 | 208,940 | 146,020 | 1,119 | 88.71 |
| EC | 27,928,980 | 22,883,168 | 2,916,204 | 1,288,046 | 841,562 | 141,408 | 103,182 | 1,123 | 83.87 | |
| Column | 8,303 | 36,006,836 | 33,419,868 | 2,330,576 | 162,784 | 93,608 | 572,734 | 547,297 | 268 | 19.76 |
| EC | 21,270,705 | 17,482,032 | 2,312,185 | 849,156 | 627,332 | 85,311 | 93,406 | 590 | 17.75 | |
| Filament | 6,799 | 41,516,836 | 39,947,259 | 1,336,863 | 151,386 | 81,328 | 197,441 | 157,660 | 1,144 | 88.71 |
| EC | 35,093,477 | 29,602,445 | 2,759,483 | 1,633,100 | 1,098,449 | 93,566 | 76,840 | 1,157 | 87.50 | |
| Combined | 22,303 | 111,517,610 | 105,110,786 | 5,676,038 | 461,425 | 269,361 | 1,200,511 | 1,073,369 | 392 | 97.18 |
Note.—F, Forward; R, Reverse.
Values represent the number of paired end reads (2 × 100 bp) at each step, EC indicates that the raw reads were subjected to the errorcorrect.pl script from the ALLPATHS-LG pipeline (Gnerre et al. 2011).
Sea Anemone Toxin Tissue-Specific Diversity and Highest TPM Values
| N | C | F | T | N | C | F | T | N | C | F | T | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ASIC toxins | 0 | — | — | — | 2 | 7 | 70 | 0 | — | — | — | |
| Acrorhagins | 4 | 26 | 21 | 1 | 0.31 | 5 | 2 | 0 | 7.44 | |||
| AETX-like | 2 | 21 | 48 | 0 | — | — | — | 0 | — | — | — | |
| Metalloproteases | 90+ | 265 | 157 | 40+ | 3 | 43 | 30+ | 2 | 15 | |||
| Class 9a (KTx) | 1 | 344 | 637 | 0 | — | — | — | 0 | — | — | — | |
| 3 | 2 | 0.19 | 6 | 3 | 19 | 1 | 0 | 0.03 | ||||
| MACPF | 6 | 0.2 | 2 | 7 | 0.38 | 0 | — | — | — | |||
| 23 | 109 | 130 | 1 | 0 | 0 | 2 | 6 | 32 | ||||
| EGF-like | 5 | 7 | 34 | 2 | 1 | 9 | 0 | — | — | — | ||
| 2 | 36 | 89 | 5 | 0.13 | 12 | 3 | 0 | |||||
| PI/Type II KTxs | 39 | 333 | 426 | 37 | 6 | 402 | 54 | 3 | 185 | |||
| 34 | 95 | 814 | 14 | 29 | 55 | 42 | 0.17 | 329 | ||||
| Type V KTxs | 2 | 9 | 47 | 3 | 0 | 3 | 0 | — | — | — | ||
| PLA2 | 17 | 33 | 74 | 19 | 0.28 | 53 | 9 | 0.09 | 70 | |||
Note.—N, number of candidate toxin genes; maximum TPM value for a single transcript in each of the tissue-specific transcriptomes: C, column; F, filament; T, tentacle/tentacular lobe. The highest TPM values are highlighted in bold. The better studied toxin genes are underlined. —indicates no candidate toxin identified. For a full list of candidate toxins, visit the associated bitbucket site (https://bitbucket.org/JasonMacrander/anemone_tissue/).
Only partial sequence recovered from transcripts.
Most Highly Expressed Transcripts with All the ToxProt Data BLAST Hits
| Transcript_id | TPM | ToxPROT | Group | Gene Name | |
|---|---|---|---|---|---|
| Column | |||||
| | |||||
| comp196092_c0_seq1 | 777 | Q8AY75 | Snake | Calglandulin | 2.00E-34 |
| comp203492_c1_seq3 | 484 | Q9TXD8 | Spider | Venom peptide isomerase heavy chain | 6.00E-32 |
| comp203768_c0_seq4 | 357 | A6MFK8 | Snake | Venom prothrombin activator vestarin-D2 | 1.00E-42 |
| | |||||
| | |||||
| comp213984_c0_seq4 | 332 | P0DKM7 | Snail | Turripeptide Lol9.1 | 2.00E-11 |
| comp177042_c0_seq1 | 310 | Q9TXD8 | Spider | Venom peptide isomerase heavy chain | 3.00E-29 |
| comp146633_c0_seq1 | 281 | M5B4R7 | Spider | Translationally controlled tumor protein | 4.00E-12 |
| | |||||
| Filament | |||||
| comp213984_c0_seq4 | 1433 | P0DKM7 | Snail | Turripeptide Lol9.1 | 2.00E-11 |
| | |||||
| comp177042_c0_seq1 | 751 | Q9TXD8 | Spider | Venom peptide isomerase heavy chain | 3.00E-29 |
| | |||||
| comp196092_c0_seq1 | 656 | Q8AY75 | Snake | Calglandulin | 2.00E-34 |
| comp203768_c0_seq4 | 532 | A6MFK8 | Snake | Venom prothrombin activator vestarin-D2 | 1.00E-42 |
| | |||||
| comp203492_c1_seq1 | 432 | Q9TXD8 | Spider | Venom peptide isomerase heavy chain | 2.00E-31 |
| comp146633_c0_seq1 | 423 | M5B4R7 | Spider | Translationally controlled tumor protein | 4.00E-12 |
| | |||||
| Tentacle | |||||
| | |||||
| | |||||
| comp196092_c0_seq1 | 1250 | Q8AY75 | Snake | Calglandulin | 2.00E-34 |
| comp213984_c0_seq4 | 550 | P0DKM7 | Snail | Turripeptide Lol9.1 | 2.00E-11 |
| comp204748_c0_seq1 | 484 | B2DCR8 | Cephalopod | SE-cephalotoxin | 4.00E-04 |
| | |||||
| | |||||
| | |||||
| comp177042_c0_seq1 | 346 | Q9TXD8 | Spider | Venom peptide isomerase heavy chain | 3.00E-29 |
| comp203768_c0_seq4 | 323 | A6MFK8 | Snake | Venom prothrombin activator vestarin-D2 | 1.00E-42 |
| Column | |||||
| comp241739_c0_seq1 | 2563 | P0CI21 | Snail | Augerpeptide hhe53 | 8.00E-05 |
| comp312716_c1_seq1 | 40 | J3SFJ3 | Snake | Translationally controlled tumor protein | 4.00E-13 |
| | |||||
| | |||||
| comp299535_c0_seq1 | 14 | B2DCR8 | Cephalopod | SE-cephalotoxin | 3.00E-07 |
| comp186523_c0_seq1 | 13 | Q2XXL4 | Lizard | Vespryn | 6.00E-05 |
| comp275702_c0_seq1 | 12 | B2DCR8 | Cephalopod | SE-cephalotoxin | 3.00E-06 |
| comp301568_c0_seq1 | 11 | P81236 | Snake | Basic phospholipase A2 acanthin-1 | 2.00E-06 |
| comp312689_c0_seq1 | 11 | B6EWW8 | Snake | Snake venom 5'-nucleotidase | 2.00E-25 |
| comp312679_c0_seq1 | 11 | B6EWW8 | Snake | Snake venom 5'-nucleotidase | 9.00E-10 |
| Filament | |||||
| comp312716_c1_seq1 | 5695 | J3SFJ3 | Snake | Translationally controlled tumor protein | 4.00E-13 |
| comp286957_c0_seq1 | 1413 | Q8AY75 | Snake | Calglandulin | 8.00E-09 |
| comp297877_c0_seq1 | 547 | Q8AY75 | Snake | Calglandulin | 5.00E-12 |
| comp241739_c0_seq1 | 460 | P0CI21 | Snail | Augerpeptide hhe53 | 8.00E-05 |
| comp301796_c0_seq1 | 402 | P24541 | Snake | Kunitz-type serine protease inhibitor | 5.00E-13 |
| comp174862_c0_seq1 | 346 | P0DKM8 | Snail | Turripeptide Ici9.1 | 3.00E-10 |
| comp297655_c0_seq1 | 299 | Q8AY75 | Snake | Calglandulin | 4.00E-11 |
| comp175426_c0_seq1 | 219 | A9YME1 | Wasp | Venom allergen 5 | 2.00E-16 |
| comp298440_c1_seq3 | 110 | A6MFK8 | Snake | Venom prothrombin activator vestarin-D2 | 1.00E-42 |
| comp302180_c0_seq1 | 101 | Q8AY75 | Snake | Calglandulin | 1.00E-31 |
| Tentacle | |||||
| comp312716_c1_seq1 | 7514 | J3SFJ3 | Snake | Translationally controlled tumor protein | 4.00E-13 |
| comp301796_c0_seq1 | 1786 | P24541 | Snake | Kunitz-type serine protease inhibitor | 5.00E-13 |
| | |||||
| comp308440_c1_seq6 | 416 | Q8AY75 | Snake | Calglandulin | 4.00E-33 |
| comp312783_c0_seq1 | 287 | Q7M4I3 | Bee | Venom protease | 5.00E-42 |
| | |||||
| | |||||
| comp295958_c0_seq2 | 212 | P0CV91 | Snake | Peroxiredoxin-4 | 2.00E-14 |
| comp286957_c0_seq1 | 209 | Q8AY75 | Snake | Calglandulin | 8.00E-09 |
| comp298467_c1_seq1 | 193 | Q58L93 | Snake | Venom prothrombin activator porpharin-D | 1.00E-30 |
| Column | |||||
| comp1412551_c0_seq1 | 253 | P0CI21 | Snail | Augerpeptide hhe53 | 1.00E-04 |
| comp1220254_c0_seq1 | 19 | M5B4R7 | Spider | Translationally controlled tumor protein | 3.00E-12 |
| comp1418434_c0_seq1 | 16 | B2DCR8 | Cephalopod | SE-cephalotoxin | 1.00E-04 |
| comp1436556_c1_seq1 | 15 | W4VS99 | Spider | Neprilysin-1 | 2.00E-59 |
| comp1436580_c0_seq2 | 15 | P81383 | Snake | L-amino-acid oxidase | 3.00E-05 |
| comp1436525_c0_seq1 | 14 | B6EWW8 | Snake | Snake venom 5'-nucleotidase | 1.00E-09 |
| comp1436582_c0_seq1 | 14 | A8QL52 | Snake | 2.00E-16 | |
| comp1436570_c0_seq2 | 8 | B6EWW8 | Snake | Snake venom 5'-nucleotidase | 2.00E-25 |
| comp1436570_c0_seq3 | 8 | B6EWW8 | Snake | Snake venom 5'-nucleotidase | 2.00E-25 |
| comp1220254_c0_seq1 | 8 | A0A024BTN9 | Snake | 8.00E-04 | |
| Filament | |||||
| comp1220254_c0_seq1 | 10171 | M5B4R7 | Spider | Translationally controlled tumor protein | 3.00E-12 |
| comp1418434_c0_seq1 | 3712 | B2DCR8 | Cephalopod | SE-cephalotoxin | 1.00E-04 |
| | |||||
| comp1220065_c0_seq1 | 1284 | Q8AY75 | Snake | Calglandulin | 1.00E-34 |
| comp1426766_c0_seq1 | 978 | B2DCR8 | Cephalopod | SE-cephalotoxin | 2.00E-05 |
| comp1430871_c1_seq3 | 395 | J3S9D9 | Snake | Reticulocalbin-2 | 9.00E-35 |
| | |||||
| | |||||
| comp1398541_c0_seq2 | 185 | Q8T3S7 | Spider | Kunitz-type serine protease inhibitor | 3.00E-18 |
| | |||||
| Tentacle | |||||
| comp1220254_c0_seq1 | 4030 | M5B4R7 | Spider | Translationally controlled tumor protein | 3.00E-12 |
| comp1220065_c0_seq1 | 882 | Q8AY75 | Snake | Calglandulin | 1.00E-34 |
| | |||||
| comp1420730_c0_seq1 | 579 | Q6T6S5 | Snake | Kunitz-type serine protease inhibitor bitisilin-2 | 8.00E-12 |
| comp1410836_c0_seq1 | 397 | Q8AY75 | Snake | Calglandulin | 1.00E-34 |
| | |||||
| | |||||
| | |||||
| comp1223905_c1_seq1 | 226 | Q8AY75 | Snake | Calglandulin | 2.00E-16 |
| comp1223905_c0_seq1 | 219 | Q8AY75 | Snake | Calglandulin | 3.00E-10 |
Note.—Transcripts are expressed as TPM; bold BLAST hits have an identified signaling region. Sequences with an E-value >0.001 were not included in our analysis. Repeat hits are highlighted in red and those from anemones are in bold. For a full list of candidate toxins, visit the associated bitbucket site (https://bitbucket.org/JasonMacrander/anemone_tissue/).
. 2.—Relative expression of toxin genes across tissues. CIRCOS plots with line thickness depicting both cumulative TPM values (top row) and transcript-specific TPM values (bottom row) for each toxin gene family across focal species and tissues. The tissue of origin is noted in capital letters, with the different toxin genes color coded across all plots. The transcript-specific plots (bottom row) show only transcripts with a TPM value >10 in at least one tissue type.
. 3.—DE analysis across tissue types. Heat maps depicting differentially expressed genes for A. sulcata, H. crispa, and M. griffithsi, with adjacent trees depicting the transcript clustering analysis. Plots depict upregulated transcripts and identified GO groups based on BLAST searches, with bubble size representing the proportion of each GO group (corrected for repeated GO groups).
Tissue-Specific BLAST2GO Analyses
| Species | Tissue | GO-Slim | BP | MF | CC | ||
|---|---|---|---|---|---|---|---|
| Column | 16,774 | 1,121 | 5,071 | 33,738 | 7,994 | 17,681 | |
| Filament | 13,252 | 933 | 4,890 | 33,136 | 7,644 | 16,374 | |
| Tentacle | 11,693 | 1,146 | 4,194 | 27,757 | 6,619 | 14,502 | |
| Column | 10,303 | 511 | 981 | 5,520 | 1,704 | 2,860 | |
| Filament | 30,737 | 833 | 6,885 | 42,633 | 10,269 | 22,578 | |
| Tentacle | 18,620 | 1,083 | 4,157 | 26,878 | 6,386 | 13,837 | |
| Column | 57,057 | 362 | 5,533 | 22,252 | 9,261 | 10,848 | |
| Filament | 22,247 | 1,223 | 4,736 | 28,831 | 7,496 | 15,538 | |
| Tentacle | 20,884 | 892 | 5,298 | 30,364 | 6,922 | 16,477 |
Note.—For the gene ontology analysis, all values are the occurrence of gene ontology groupings in each of the GO domains. BP, biological process; MF, molecular function; CC, cellular component; N, number of sequences which were in the top 10% of the more highly expressed transcripts; L, average length of transcripts; GO-Slim, number of sequences processed.
. 4.—BLAST2GO GO analysis. Species-level comparison of level 2 GO classifications of transcripts from each tissue-specific transcriptome. Counts are from the most highly expressed transcripts (top 10%, based on TPM values).
. 5.—Trinotate GO analysis. Semantic similarity plots of average TPM per GO group for the Biological Process and Molecular Function GO domains. Labels correspond to the highest average TPM GO groups are shown in A. sulcata.
Hidden Gene Copies within Each Transcriptome
| G | I | G | I | G | I | ||||
|---|---|---|---|---|---|---|---|---|---|
| Type I KTx | 1 | 1 | 2 | 4 | 4 | 5 | 2 | 2 | 3 |
| Type III KTx | 18 | 29 | 34 | 11 | 13 | 14 | 5 | 10 | 42 |
| Cytolysins | 2 | 2 | 3 | 3 | 5 | 6 | 1 | 1 | 1 |
| NaTx | 7 | 9 | 23 | 1 | 1 | 1 | 1 | 1 | 2 |
Note.—G , Trinity “genes”; I, Trinity transcripts; N, Total toxin candidates, including “hidden” gene copies.