| Literature DB >> 28701691 |
Weihong Liu1, Enchao Yu1, Siyu Chen1, Xiaopeng Ma1,2, Yiwen Lu3, Xiao Liu4.
Abstract
To better understand the biological function of long noncoding RNAs, it is critical to determine their spatiotemporal expression patterns. We generated transgenic reporter strains for 149 out of the 170 annotated C. elegans long intervening noncoding RNAs (lincRNAs) and profiled their temporal activity. For the 68 lincRNAs with integrated reporter lines, we profiled their expression at the resolution of single cells in L1 larvae, and revealed that the expression of lincRNAs is more specific, heterogeneous and at lower level than transcription factors (TFs). These expression patterns can be largely attributed to transcriptional regulation because they were observed in assays using reporters of promoter activity. The spatial expression patterns of the 68 lincRNAs were further examined in 18 tissue categories throughout eight developmental stages. We compared the expression dynamics of lincRNAs, miRNAs and TFs during development. lincRNA and miRNA promoters are less active at embryo stage than those of TFs, but become comparable to TFs after embryogenesis. Finally, the lincRNA gene set shows a similar tissue distribution to that of miRNAs and TFs. We also generated a database, CELE, for the storage and retrieval of lincRNA reporter expression patterns and other relevant information. The data and strains described here will provide a valuable guide and resource for future functional exploration of C. elegans lincRNAs.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28701691 PMCID: PMC5507858 DOI: 10.1038/s41598-017-05427-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The single-cell expression profiles of lincRNA reporters. (a) lincRNA reporter expression profiles in newly hatched L1 larvae. The expression level of 64 PlincRNA::reporters in 361 somatic nuclei is shown. The gene expression level is adjusted by calculating (gene expression level + 500)/500, and is plotted as the log2 (adjusted gene expression level). Genes (rows) were clustered according to their expression pattern. 361 somatic cells (columns) were manually arranged according to their cell types and cell position along the anterior-posterior axis of the worm. Grey entries in the heatmap represent missing cells. The data used to generate this figure can be found in Supplementary Table S3. Gene names of eight lincRNAs that are significantly expressed in muscle according to previous SRT-based RNA-seq are in red[27]. (b) 361 somatic cells clustered by lincRNA promoter reporter expression. A terrain map of nuclei was generated by Genesis according to correlation of gene expression between cells[35]. Each dot represents a nucleus. Colors indicate different cell types. The distance between two nuclei in the x-y plane indicates similarity in gene expression. Cells with similar gene expression patterns are close to each other, while cells with different patterns are far apart. b.c., other body cells; b.w.m., body wall muscle; blast, blast cells; glia, glia cells; hyp., hypodermal cells; int., intestine cells; neur., neurons; ph.c., other pharyngeal cells; ph.m., pharyngeal muscle; ph.neur., pharyngeal neurons; s.c., seam cells.
Figure 2Comparison of expression features of lincRNA and TF reporters. (a,b) Expression level (a) and the coefficient of variation in expression level (b) of lincRNA and TF reporters in each cell type (CV is calculated as the the ratio of the standard deviation σ to the mean μ). Cell type abbreviation is the same as in Fig. 1. Error bars equal SEM. (c,d) Distribution of the number of cells (c) and cell types (d) in which lincRNA or TF reporters are expressed.
Figure 3Comparison of spatiotemporal expression patterns of lincRNA, miRNA and TF reporters. (a) Temporal expression patterns of lincRNA, miRNA and TF reporters. Genes in each of the three categories whose promoter reporters show detectable activity are included in the comparison (lincRNA, 142; miRNA, 67; TF, 335). EE, early embryo (pre-comma stage); ME, middle embryo (comma to 1.5-fold stage); LE, late embryo (2-fold to 3-fold stage). *P < 0.05; **P < 0.01; ***P < 0.001, Fisher’s exact test. Detailed information can be found in Supplementary Table S2. (b) Spatial expression patterns of lincRNA, miRNA and TF reporters. The percentage of each gene set expressed in a tissue is shown as an odds ratio. The upper right half of the triangle table shows the correlation of the probabilities of expressed genes in each somatic tissue between the three gene sets, in which Pearson correlation coefficients were calculated based on the log2(odds ratio); The lower left half of the triangle table shows the non-significant P-values resulting from a test of whether the variance of the probabilities that a gene is expressed in each of the somatic tissues is significantly different between each of the three gene sets, P-values were calculated based on the log2(odds ratio) using an F-test. I, intestine cells and intestinal muscle; P, pharynx; HYP, hypodermal cells; SC, seam cells; X, cells of excretory system; SG, somatic gonad; V, vulva and vulva muscle; BN, ventral nerve cord and body neurons; HN, head neurons; TN, tail neurons; HM, head muscle; BM, body muscle; R, anal depressor muscle, anal sphincter, rectal epithelium and rectal gland cell. Corresponding pictures can be found in the C.elegans lincRNA expression (CELE) database (wano.bioinfo.org). Detailed information can be found in Supplementary Table S4. (c) Number of lincRNA and TF reporters expressed in the Z1/Z4-somatic gonad lineage and the P neuroblast-vulva lineage. Black outer circle, the number of whole assayed lincRNA reporters and TF reporters. There is a significant overlap between genes expressed in P neuroblasts and in the vulva, P < 0.05, Fisher’s exact test. Detailed information can be found in Supplementary Table S4, S8. (d,e) Representative lincRNA reporters expressed in somatic gonad (d) and in vulva (e). Scale bar, 10 μm.
Figure 4C. elegans lincRNA expression (CELE) database. (a) CELE database homepage. Hyperlinks at left are for users to navigate to the section of interest. (b) “Browse integrated transgene” section. The tissues and stages in which lincRNA reporter expression was observed are listed, and corresponding pictures can be viewed by clicking the tissue names. Definition of the abbreviation for each tissue and stage can be found at the top of the page (red circle). This database can be searched by “gene”, “tissue” or “stage”. (c) Strain summary page. (d) “Non-integrated transgene” section.