| Literature DB >> 27216622 |
Leslie S Babonis1, Mark Q Martindale2,3, Joseph F Ryan2,3.
Abstract
BACKGROUND: The evolution of novel genes is thought to be a critical component of morphological innovation but few studies have explicitly examined the contribution of novel genes to the evolution of novel tissues. Nematosomes, the free-floating cellular masses that circulate through the body cavity of the sea anemone Nematostella vectensis, are the defining apomorphy of the genus Nematostella and are a useful model for understanding the evolution of novel tissues. Although many hypotheses have been proposed, the function of nematosomes is unknown. To gain insight into their putative function and to test hypotheses about the role of lineage-specific genes in the evolution of novel structures, we have re-examined the cellular and molecular biology of nematosomes.Entities:
Keywords: Cnidocyte; Immune; Nematosome; Nematostella vectensis; Novelty; Phagocytosis; RNA-Seq; Taxonomically-restricted genes
Mesh:
Year: 2016 PMID: 27216622 PMCID: PMC4877951 DOI: 10.1186/s12862-016-0683-3
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Nematosomes in N. vectensis. a A live image of a young adult polyp (10-tentacle stage); several nematosomes (N, arrows) are visible at rest along the internal surface of the body wall near the insertions of the mesenteries (M, arrowheads). The pharynx (P) and tentacles (T) are also visible. b A DIC optical section of an isolated nematosome
Fig. 2Nematosome budding. a Clusters of cnidocytes (white arrows) are visible in the ectodermal mesentery before budding (the black arrow indicates a cluster in a different focal plane). b A later stage in the budding process showing the abundance of mature cnidocytes (green; 143 μM DAPI) as the nematosome begins to protrude from the mesentery epithelium. c Proliferative nuclei (red, 100 μM EdU) are visible in the basal region of the epithelium, between and below the cnidocytes (white arrows). Proliferation also occurs outside of the budding zone (black arrows). d A developing cnidocyte from tentacle ectoderm; white arrows indicate the cnidocyst tubule which develops outside of and around the cnidocyst capsule (black arrow). e-g Immunohistochemistry performed in early planula stage embryos reveals abundant developing cnidocytes labeled with anti-mcol1, anti-mcol3, or anti-mcol4 antibodies (yellow). Nematosomes incubated at the same time in the same aliquot of each antibody lack staining (insets); the number of tissues (embryos or nematosomes) observed to have mcol+ cells is indicated. The blastopore of each embryo is indicated by * for orientation. All scale bars represent 10 μm unless otherwise specified
Fig. 5Ciliary structures on the surface of a nematosome. a Double labeling of F-actin (phalloidin; red) and acetylated-tubulin (green) indicates the presence of dense actin-rich apical cones (black arrowhead) and two types of cilia: type I, short (white arrowhead) and type II, long (white arrow). b Cilium length differs significantly between type I and type II cilia (ANOVA, p < < 0.001). c Numerous ciliary cones of different morphologies (arrows) are visible among the abundant microvilli covering the surface of a nematosome. d Type I ciliary cones (white arrows in A) are wide, assembled from multiple layers of stereocilia (white arrow), and surround a short central kinocilium (K; *indicates the distal tip of the cilium). e Type II ciliary cones are narrow and are composed of a single layer of stereocilia (black arrow) that surround a long kinocilium. f A thin section reveals the relationship of the ciliary cones and their underlying cell types. Higher magnification (inset) shows a type I ciliary cone (white arrow) at the apex of a cnidocyte (Cn) and a type II ciliary cone (black arrow) at the apex of a cell type with sub-apical vesicles of various size and electron density immediately adjacent to the cnidocyte. All scale bars represent 10 μm unless otherwise specified
Fig. 6Phagocytes in nematosomes. a A thin section of a nematosome showing four cnidocytes (white arrowheads) and at least five cells with abundant apical vacuoles (black arrows). b At higher magnification, several organelles are visible: (mvb) multivesicular body, (Ly) lysosome-like vesicle, and (En) endosome-like vesicle. c An optical section through a single nematosome showing sub-surface localization of fluorescently-labeled E. coli (white arrows) and latex beads (inset)
Fig. 3Live nematosomes are capable of subduing brine shrimp (Artemia salina). (See also Additional file 4.) a Aggregates of nematosomes (arrowheads) are attached to subdued brine shrimp. b Higher magnification reveals the presence of two spent cnidocysts (arrows) protruding from the exoskeleton of the shrimp. c A high-magnification image of an isolated nematosome reveals two spent cnidocysts (white arrows). Black arrow indicates a spent cnidocyst in another focal plane. All scale bars represent 10 μm unless otherwise specified
Fig. 4Nematosomes comprise multiple cell types. a A single DIC optical section through an isolated nematosome in which two types of cnidocyte are abundant: basitrichous isorhizas (b) characterized by a thin shaft extending nearly the full length of the capsule, and microbasic-p-mastigophores (m) which have a thick shaft that extends only half the length of the capsule and has a distinctive V-shaped notch [18]. Ciliary cones are visible (arrows) at the apex of two mastigophores and several long cilia (arrowheads) can be seen emerging from the perimeter of the tissue. b A 3D rendering of a confocal z-stack through a nematosome indicating nuclei (labeled with 1 μM DAPI; green). Nuclei in cnidocytes often appear semi-lunar in shape (arrows) as they conform to the shape of the cnidocyst capsule. c A 3D rendering of a confocal z-stack through another single nematosome showing the presence of abundant mature cnidocytes (labeled with 143 μM DAPI; blue). d The numbers of nuclei (an indicator of total cell number; closed circles) and cnidocytes (open circles) in individual nematosomes increase as a function of nematosome diameter but these relationships differ across the range of nematosomes studied (nuclei slope: 2.72, cnidocyte slope: 1.34; p < 0.001 ANCOVA). All scale bars represent 10 μm
Comparison of transcriptome assembly statistics
| Trimmed only, | Trimmed only, | Trim + error correction, | |
|---|---|---|---|
| Transcriptome name | TR333942 | TR118377 | NvecRef32743 |
| Assembled “transcripts”a | 333,942 | 118,377 | 32,743 |
| Transcripts with predicted ORFsb | 29,690 (9 %) | 19,461 (16 %) | 17,346 (53 %) |
| Predicted ORFsc | 135,213 (4.6X) | 77,877 (4X) | 27,511 (1.6X) |
| ORFs with no expressiond | 74,503 (55 %) | 28,975 (37 %) | 10,198 (37 %) |
| CEGMA score (complete) e | 242/248 (98 %) | 241/248 (97 %) | 207/248 (83 %) |
| CEGMA score (partial) | 245/248 (99 %) | 245/248 (99 %) | 231/248 (93 %) |
aTotal number of contigs assembled using Trinity [49, 50]. bOpen reading frames (ORFs) ≥ 100 amino acids (aa) in length were predicted using Transdecoder (http://transdecoder.github.io). cThe average number of ORFs predicted from each transcript is listed in parentheses. dTotal number of transcripts with at least one ORF and abundance estimated at < 2 counts in at least one of the three sampled tissues; counts were assayed using Bedtools [55]. eCEGMA scores are listed as % of the 248 conserved eukaryotic genes that form the CEGMA database [56] that were present in the indicated transcriptome (scores for complete and partial sequences are indicated)
Error correction increases the mapping concordance for transcriptome NvecRef32743
| Samplea | Raw reads (paired) | Reads remaining after trimmingb | Reads remaining after trimming & Error Correctionc | Reads alignedd – Trimmed only | Reads aligned – Trimmed & Error Corrected |
|---|---|---|---|---|---|
| M3 | 49,390,203 | 49,377,414 | 46,888,223 | 39,768,569 (81 %) | 43,249,696 (92 %) |
| M4 | 47,477,373 | 47,466,721 | 44,704,490 | 36,255,081 (76 %) | 39,339,951 (88 %) |
| Mes | 26,632,325 | 26,615,206 | 25,785,788 | 24,435,421 (92 %) | 24,380,462 (95 %) |
| N1 | 47,788,007 | 47,770,601 | 44,558,275 | 36,611,388 (77 %) | 39,656,864 (89 %) |
| N2 | 53,500,454 | 53,483,271 | 50,693,980 | 42,513,852 (79 %) | 45,609,374 (90 %) |
| Nem | 20,135,090 | 20,073,479 | 19,174,413 | 17,995,873 (90 %) | 17,834,121 (93 %) |
| T3 | 47,683,996 | 47,670,541 | 44,585,027 | 27,963,673 (77 %) | 39,854,556 (89 %) |
| T4 | 50,314,893 | 50,300,293 | 47,474,608 | 40,154,724 (80 %) | 43,557,953 (92 %) |
| Ten | 21,803,901 | 21,788,366 | 20,983,324 | 19,474,441 (89 %) | 19,384,395 (92 %) |
aSamples M3, M4, Mes are from mesenteries; N1, N2, Nem are from nematosomes; T3, T4, Ten are from tentacles. bTrimming was performed using Trimmomatic [52]. cError correction was performed using the ErrorCorrectReads utility from AllPaths-LG [53]. dAlignments were performed using Bowtie2 [54]
Reference transcriptome NvecRef32743 statistics
| Total number of raw reads pooled for | 137,142,632 |
| Assembled “transcripts” | 32,743 |
| Transcripts remaining after alien index analysis | 32,706 |
| Transcripts encoding an ORF > 100aa in length | 27,511 |
| Transcripts with non-0 expression and ORF >100aa | 17,313 |
| New transcripts (not predicted from the genome) | 2,150 |
| Mean transcript length in bp ( | 1,058/1,646 |
| N50 in bp ( | 1,919/2,282 |
| GC content ( | 44.56 %/45.5 % |
Fig. 7Expression analyses in three N. vectensis tissues. a Principal components plot indicating variation within and among tissue types. PC1 describes 57 % of the variation among tissues and PC2 describes 34 % in the top 1000 expressed transcripts. b Heatmap indicating gene expression (variance stabilized counts) of the top 1000 expressed transcripts. Transcripts are clustered using the Ward’s D method (see Additional file 9); warmer colors indicate higher expression. Dotted lines indicate clusters with ubiquitous expression across tissues, the arrow indicates a cluster with high expression in nematosomes only, and arrowheads indicate clusters with high expression in only mesenteries or tentacles. c Nineteen out of twenty transcripts identified by RNA-Seq as being significantly upregulated in nematosomes relative to both other tissues were also found to have significantly lower expression in the mesenteries and tentacles, relative to nematosomes, using qPCR. Expression of Nv2643 in the tentacles (indicated by *) is not significantly different from 0. Dotted lines indicate 2(log2)-fold change in expression between nematosomes and the other tissues. Mean ± SD
Fig. 8Analysis of taxon-restricted gene expression. a Of 17,313 expressed transcripts identified in this study, approximately 16 % (N = 2792) are Nematostella-specific, 2 % (353) are Edwardsiidae-specific, 14 % (N = 2507) are anthozoan-specific, and 7 % (N = 1248) are cnidarian-specific. The largest class of transcripts sequenced in this study (60 %) were found in at least one other group of metazoans (N = 10,413). As compared with the complete set of expressed transcripts, (b-d) differentially expressed transcripts were characterized by a higher proportion of Nematostella-specific sequences: 26 % in nematosomes and mesenteries (N = 135/528 and N = 154/585, respectively) and 31 % in tentacles (N = 378/1228). Shared/metazoan sequences comprised less than half of the differentially expressed transcripts in each of the three tissues. e-g Nematostella-specific sequences make up an even larger proportion of the uniquely expressed transcripts in each tissue: 65 % (N = 34/52) in the nematosomes, 50 % (N = 29/58) in the mesenteries, and 47 % (N = 140/296) in the tentacles. h Among the differentially expressed transcripts (illustrated in b-d), Nematostella-specific sequences were more likely to be expressed in only a single tissue (35 %, N = 203/667) than were pan-cnidarian sequences (illustrated in grey in panel a; 21 %, N = 344/1632) or shared/metazoan sequences (9 %, N = 62/709)
Pfam/Conserved domain analysis of the top expressed transcripts from each tissue
| Tissue | TrID | GrID | JGI ID | Pfam/CD(s) a | Description |
|---|---|---|---|---|---|
| nematosomes | Nv270 | M | 200843 | Myosin_tail_1, Cast, TPR_MLP1_2, Mplasa_alph_rch | golgi autoantigen B1 |
| Nv46 | N | 239786 | cadherin | uromodulin | |
| Nv1832 | C | N/A | Calponin homology (CH) domain, Spectrin repeat, SMC_prok_A, PRK02224 | utrophin | |
| Nv495 | C | 246793 | reprolysin, pep_M12B_propeptide, ADAM_spacer1 super family, TSP_1, FN3, ZnMc super family, ADAM_CR super family | ADAMTS | |
| Nv791 | C | 27059 | ZnMc_adamalysin_II_like, reprolysin, pep_M12B_propeptide, F5_F8 type C, disintegrin, ADAM_CysRich, TSP_1, FReD super family, EB | hemicentin/thrombospondin | |
| Nv126 | C | 242847 | N/A | uncharacterized | |
| Nv354 | M | 119733 | Homeobox KN domain, PRK12323 Sox_C_TAD super family | Meis2 transcription factor | |
| Nv111 | N | 200034 | N/A | uromodulin | |
| Nv1971 | A | 96874 | FN3, TLD, MAM, F5_F8_typeC, Ion_trans_2 | sidekick-2 (immunoglobulin) | |
| Nv1874 | M | 21107 | PKD channel, REJ, PLAT_polycystin, PKD, GPS, F5_F8_typeC, WSC, DUF4271 super family | cation channel | |
| mesenteries | Nv2 | M | 39271 | CstA, PLA2c | Phospholipase A2 |
| Nv151 | M | 205229 | PHA03307, DUF1943, VWD, Vitellogenin_N | vitellogenin precursor | |
| Nv169 | M | 180912 | CBM_14 | chitin binding protein | |
| Nv224 | M | 168629 | GH18_chitolectin_chitotriosidase, Glyco_18, Retinal, PHA03307, DUF2237 super family | chitinase | |
| Nv274 | M | 109239 | ND2 super family, Tryp_SPc, SR | chymotrypsin | |
| Nv174 | M | 105779 | Atrophin-1, Tryp_SPc, ShKT | chymotrypsin | |
| Nv158 | N | 246653 | LDLa, UPF0104 super family, Tymo_45kd_70kd | uncharacterized membrane protein | |
| Nv215 | M | 15302 | ZnMc_MMP, PG_binding_1, ShKT, Peptidase_M10 | matrix metalloprotease | |
| Nv9181 | M | 246069 | Mito_carr, PTZ00169 | ADP,ATP carrier protein | |
| Nv896 | C | 98917 | ZnMc_astascin like, ShK toxin domain, astacin (Peptidase family M12A), prolyl 4-hydroxylase | matrix metalloprotease | |
| tentacles | Nv302 | C | N/A | DnaJ_zf, GPS, TIG, Glyco_hydro_17 super family, REJ | egg jelly receptor |
| Nv520 | M | 120496 | PTKc, FN3, SEA, Ig_2, Pkinase_Tyr, IGc2 | FGF receptor c | |
| Nv146 | M | 90289 | KBL_like, Preseq_ALAS super family, BioF | aminotransferase | |
| Nv886 | M | 80526 | Ig, Ig super family, Ig2_FGFR_like, Ig super family, I-set, IG_like, IG, IGc2 | Immunoglobulin like activity | |
| Nv94 | C | 114661 | SOUL | heme-binding protein | |
| Nv139 | N | 122916 | Gal_lectin, FYDLN_acid super family | rhamnose binding lectin | |
| Nv325 | M | 202708 | YtcJ_like | aminohydrolase | |
| Nv219 | M | 243257 | Gln-synt_C, Gln-synt_N, PLN02284 | glutamine synthetase | |
| Nv326 | N | 208719 | TSP1, WSC super family | hemicentin | |
| Nv162 | N | 202441 | DUF2457 | uncharacterized |
a Protein family (Pfam) domains were identified from the EMBL-EBI database (http://pfam.xfam.org/) and conserved domains (CD) were identified from NCBI’s conserved domain database (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). TrID internal transcript ID from NvecRef32743 de novo assembly, GrID orthology group, N Nematostella-specific, A anthozoan-specific, C cnidarian-specific, M shared/metazoan, JGI ID protein ID from the JGI database (http://genome.jgi-psf.org/Nemve1/Nemve1.home.html)
Fig. 9Gene ontology (GO) analyses of RNA-Seq data. a The top 10 GO terms from nematosomes for each GO category and the number of transcripts that map to them from other tissues. White arrowheads indicate GO terms unique (or nearly so) to nematosomes and black arrowheads indicate GO terms shared only by nematosomes and tentacles. b Venn diagram showing the number of GO terms shared among tissues
Abundance of candidate stress genes across tissues
| Nematosomes | Mesenteries | Tentacles | |
|---|---|---|---|
| Transcripts assayeda | 528 | 585 | 1228 |
| Transcripts with reciprocal best hits in stress databaseb | 116 (22 %) | 101 (17 %) | 161 (13 %) |
| Wound ( | 38 (33 %)c | 46 (46 %) | 59 (37 %) |
| Pathogen ( | 87 (75 %) | 57 (56 %) | 109 (68 %) |
| Chemical ( | 6 (2 %) | 9 (9 %) | 10 (6 %) |
aThe complete set of upregulated transcripts for each tissue. bDatabase of putative stress-related transcripts identified by Reitzel et al [63] for the categories: Wound, Pathogen, and Chemical. For a list of genes in each tissue/class see Additional file 8. cValues represent the number (percent) of the transcripts with reciprocal best hits in the stress database that were associated with the indicated stressor. The sum of these values exceeds the number of transcripts with reciprocal best hits because some transcripts had equally high BLAST hits in two different stress categories