Yan Sun, Qichao Yu, Lei Li, Zhanlong Mei, Biaofeng Zhou, Shang Liu, Taotao Pan, Liang Wu, Ying Lei, Longqi Liu, Radoje Drmanac, Kun Ma, Shiping Liu.
Abstract
Recent studies show that non-coding RNAs (ncRNAs) can regulate the expression of protein-coding genes and play important roles in mammalian development. Previous studies have revealed that during Caenorhabditis elegans (C. elegans) embryo development, numerous genes in each cell are spatiotemporally regulated, causing the cells to differentiate into distinct cell types and tissues. We ask whether ncRNAs participate in the spatiotemporal regulation of genes in different cell types and tissues during the embryogenesis of C. elegans. Here, using a marker-free, full-length, high-depth single-cell RNA sequencing (scRNA-seq) technique, we sequence the whole transcriptomes of 1031 embryonic cells of C. elegans and detect 20,431 protein-coding genes, including 22 cell-type-specific protein-coding markers, and 9843 ncRNAs, including 11 cell-type-specific ncRNA markers. We introduce an ncRNA-based clustering strategy as a complement to the protein-coding gene-based clustering strategy for single-cell classification. We identify 94 ncRNAs that have never been reported to regulate gene expression and that are co-expressed with 1208 protein-coding genes in a cell-type-specific and/or embryo-time-specific manner. Our findings suggest that these ncRNAs could potentially influence the spatiotemporal expression of the corresponding genes during the embryogenesis of C. elegans.
Year: 2020 PMID: 33139759 PMCID: PMC7606524 DOI: 10.1038/s41598-020-75801-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of detected genes.
| Gene types | Genes detected per cell | Total genes detected | Genes annotated in Ensembl release 80 | Detect ratio (%) |
|---|---|---|---|---|
| Protein coding | 8746 (1216–13,862) | 20,431 | 20,447 | 99.92 |
| Antisense | 18 (0–45) | 99 | 99 | 100.00 |
| lincRNA | 33 (4–94) | 169 | 169 | 100.00 |
| rRNA | 6 (6–21) | 22 | 22 | 100.00 |
| snoRNA | 28 (4–72) | 338 | 345 | 97.97 |
| pseudogene | 132 (24–644) | 1546 | 1590 | 97.23 |
| snRNA | 6 (0–59) | 126 | 130 | 96.92 |
| tRNA | 5 (0–112) | 571 | 637 | 89.64 |
| Unknown^a | 223 (37–1022) | 6972 | 7687 | 90.70 |
^a Unknown: ncRNAs of unknown types (median length: 140 bp, range 17–2525 bp).
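The detect ratio column is consistent with dividing the total genes detected by the genes annotated in Ensembl release 80. The following minimal Python sketch recomputes that column from the table values, which are copied in by hand. It also checks that the non-protein-coding rows sum to the 9843 ncRNAs quoted in the abstract; treating every non-protein-coding row (including pseudogenes) as an ncRNA is an assumption that happens to match the arithmetic. The names `detected` and `detect_ratio` are illustrative and not from the paper.

```python
# Values copied from the summary table:
# gene type -> (total genes detected, genes annotated in Ensembl release 80)
detected = {
    "Protein coding": (20431, 20447),
    "Antisense": (99, 99),
    "lincRNA": (169, 169),
    "rRNA": (22, 22),
    "snoRNA": (338, 345),
    "pseudogene": (1546, 1590),
    "snRNA": (126, 130),
    "tRNA": (571, 637),
    "Unknown": (6972, 7687),
}


def detect_ratio(found, annotated):
    """Percentage of annotated genes that were detected, rounded to 2 decimals."""
    return round(100.0 * found / annotated, 2)


ratios = {gene_type: detect_ratio(f, a) for gene_type, (f, a) in detected.items()}

# Assumption: every row other than "Protein coding" counts towards the ncRNA total.
ncrna_total = sum(f for gene_type, (f, _) in detected.items()
                  if gene_type != "Protein coding")

for gene_type, ratio in ratios.items():
    print(f"{gene_type:15s} {ratio:6.2f} %")
print("ncRNAs detected in total:", ncrna_total)
```

Recomputing the column this way is a quick consistency check on a transcribed table; it says nothing about how detection itself was defined per cell.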
Figure 1. An outline of the 1031 embryonic cells and the genes detected per cell by time interval. (a) Number of cells within each time interval. (b) Pearson correlations between detected protein-coding genes and ncRNAs per cell in each time interval.
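Panel (b) of Figure 1 reports Pearson correlations between the numbers of protein-coding genes and ncRNAs detected per cell. As a minimal sketch of that statistic, the Python snippet below computes Pearson's r for one time interval. The per-cell counts are hypothetical placeholders, not data from the study, and the function name `pearson` is chosen here for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-cell counts for a single time interval (NOT from the study).
protein_coding_per_cell = [5200, 6100, 7400, 8300, 9100]
ncrna_per_cell = [310, 365, 450, 470, 540]

r = pearson(protein_coding_per_cell, ncrna_per_cell)
print(f"Pearson r = {r:.3f}")
```

In practice the same calculation would be repeated once per time interval, using the cells that fall inside that interval.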
Comparison of protein-coding genes and ncRNAs detected per cell in different embryo time intervals.
| Time interval (min) | Protein-coding genes per cell (this study) | Protein-coding genes per cell (Packer et al.) | ncRNAs per cell |
|---|---|---|---|
| < 150 | 4863.8 (2478–5812) | 1958.3 (333–5087) | 327.5 (131–457) |
| 150–270 | 6975.9 (4449–10,437) | 1054.6 (287–4182) | 432.2 (293–956) |
| 270–330 | 7658.6 (3648–13,113) | 937 (250–4983) | 500.7 (223–1631) |
| 330–390 | 7342.7 (3697–13,862) | 847.5 (239–4410) | 475.9 (211–1846) |
| 390–450 | 7849.5 (2543–10,943) | 758.3 (219–4271) | 480.5 (159–977) |
| 450–510 | 6843.5 (4609–9953) | 733 (213–4494) | 416 (227–759) |
| 510–580 | 8949.6 (5528–11,622) | 673.6 (178–3883) | 532.3 (253–778) |
| 580–690 | 9724.1 (1816–11,934) | 639.2 (136–3819) | 513.9 (298–810) |
| 690–760 | 9781 (1549–11,893) | 912.1 (136–3695) | 490 (111–861) |
| > 760 | 7654.7 (1216–13,384) | 1409.2 (307–4160) | 433.2 (102–1769) |
Figure 2. Clustering of the embryonic cells. (a, c, d) Clustering of the 1031 embryonic cells using combined protein-coding genes and ncRNAs (a), protein-coding genes alone (c), and ncRNAs alone (d). (b) Clustering of cells using combined protein-coding genes and ncRNAs, with cells labelled by embryo time. (e) Feature plots of newly identified ncRNA markers: T09E11.11 (early and middle embryonic intestinal cells), tts-1 (late embryonic posterior intestinal cells), Y7A9A.79 (late embryonic anterior intestinal cells), linc-22 (pharyngeal cells), C44H4.10 (hypodermal cells), T02G5.4 (early embryonic cells).
Figure 3. ncRNAs and protein-coding genes involved in embryo and organ development. (a, b) Smoothed expression (scaled log2-TPM, loess regression, span = 0.5) of 145 protein-coding genes (a) and 6 ncRNAs (b) along embryo time. The dashed line marks 270 min, before which there are only 17 cells and after which there are 1014 cells. (c) Feature plots showing expression levels of the pharynx-expressed ncRNAs C14B9.11, F29F11.19, anr-10 and C27A2.11. (d) Smoothed expression (scaled log2-TPM, loess regression, span = 0.5) of the pharynx-expressed ncRNAs along embryo time. C14B9.11 and F29F11.19 are expressed earlier in the pharynx, and anr-10 and C27A2.11 later.
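The Figure 3 caption gives the smoothing as loess regression with span = 0.5 but does not name the implementation. To illustrate what such smoothing does, the Python sketch below fits a simplified loess: local linear fits with tricube weights and no robustness iterations. This is an assumption-laden stand-in, not the authors' procedure; R's `loess`, for example, defaults to local quadratic fits. The embryo times and expression values are hypothetical, and the input is assumed to be already scaled log2-TPM.

```python
import math


def loess(x, y, span=0.5):
    """Simplified loess: local linear regression with tricube weights.

    For each point, the ceil(span * n) nearest points define the bandwidth,
    and the weighted least-squares line is evaluated at that point.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length >= 2")
    n = len(x)
    k = min(n, max(2, math.ceil(span * n)))  # points spanned by each local fit
    fitted = []
    for x0 in x:
        dist = [abs(xi - x0) for xi in x]
        h = sorted(dist)[k - 1]  # bandwidth: distance to the k-th nearest point
        if h == 0:
            weights = [1.0 if d == 0 else 0.0 for d in dist]
        else:
            weights = [(1 - (d / h) ** 3) ** 3 if d < h else 0.0 for d in dist]
        # Weighted sums with x centred on x0, so the intercept is the fit at x0.
        sw = sum(weights)
        swu = sum(w * (xi - x0) for w, xi in zip(weights, x))
        swuu = sum(w * (xi - x0) ** 2 for w, xi in zip(weights, x))
        swy = sum(w * yi for w, yi in zip(weights, y))
        swuy = sum(w * (xi - x0) * yi for w, xi, yi in zip(weights, x, y))
        denom = sw * swuu - swu ** 2
        if abs(denom) < 1e-12:
            fitted.append(swy / sw)  # degenerate fit: fall back to weighted mean
        else:
            fitted.append((swuu * swy - swu * swuy) / denom)
    return fitted


# Hypothetical embryo times (min) and scaled expression values (NOT from the study).
times = [150, 210, 270, 330, 390, 450, 510, 580, 690, 760]
expression = [-1.2, -0.9, -0.4, 0.1, 0.6, 0.3, 0.8, 1.1, 0.9, 1.3]

smoothed = loess(times, expression, span=0.5)
for t, s in zip(times, smoothed):
    print(f"{t:4d} min  {s:6.3f}")
```

A span of 0.5 means each local fit draws on roughly half of the points, so short-lived fluctuations are averaged out while the overall trend along embryo time is kept.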