| Literature DB >> 34911763 |
Alina Isakova1, Norma Neff2, Stephen R Quake3,2,4.
Abstract
The ability to interrogate total RNA content of single cells would enable better mapping of the transcriptional logic behind emerging cell types and states. However, current single-cell RNA-sequencing (RNA-seq) methods are unable to simultaneously monitor all forms of RNA transcripts at the single-cell level, and thus deliver only a partial snapshot of the cellular RNAome. Here we describe Smart-seq-total, a method capable of assaying a broad spectrum of coding and noncoding RNA from a single cell. Smart-seq-total does not require splitting the RNA content of a cell and allows the incorporation of unique molecular identifiers into short and long RNA molecules for absolute quantification. It outperforms current poly(A)-independent total RNA-seq protocols by capturing transcripts of a broad size range, thus enabling simultaneous analysis of protein-coding, long-noncoding, microRNA, and other noncoding RNA transcripts from single cells. We used Smart-seq-total to analyze the total RNAome of human primary fibroblasts, HEK293T, and MCF7 cells, as well as that of induced murine embryonic stem cells differentiated into embryoid bodies. By analyzing the coexpression patterns of both noncoding RNA and mRNA from the same cell, we were able to discover new roles of noncoding RNA throughout essential processes, such as cell cycle and lineage commitment during embryonic development. Moreover, we show that independent classes of short-noncoding RNA can be used to determine cell-type identity.Entities:
Keywords: cell cycle; differentiation; noncoding RNA; single-cell RNA-seq
Mesh:
Substances:
Year: 2021 PMID: 34911763 PMCID: PMC8713755 DOI: 10.1073/pnas.2113568118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Smart-seq-total performance. (A) Schematic comparison of Smart-seq2 and Smart-seq-total pipelines. Following cell lysis, total cellular RNA is polyadenylated, primed with anchored oligodT, and reverse transcribed in a presence of the custom degradable TSO. After reverse transcription, TSO is enzymatically cleaved, single-stranded cDNA is amplified and cleaned up. Amplified cDNA is then either tagmented or directly indexed, pooled, and depleted from ribosomal sequences using DASH (23). The resulting indexed libraries are then pooled and sequenced on Illumina platform. (B) Distribution of mapped reads across RNA types in human primary fibroblasts, HEK293T, and MCF7 cells. Percentage of total reads mapped to each RNA type. miscRNA class is additionally split into RN7SK, RN7SL, and other miscRNA categories. (C) Examples of coding and noncoding marker genes for each cell type. Top exemplary markers per biotype computed among cell types using Wilcoxon rank sum test. RNY1 belongs to miscRNA, SCARNA23 and SCARNA20 to scaRNA, MT-TD to mitochondrial tRNA class. (D) t-SNE plots of three profiled human cell types generated using indicated subset of genes. From top to bottom: protein coding, lncRNA, miRNA, and other small ncRNA (include snoRNA, snRNA, scaRNA, scRNA, and miscRNA). We have excluded histone coding genes from the protein coding (polyA+) set, since a large fraction of these RNAs are known to lack polyA tails (60).
Fig. 2.Dynamics of cellular noncoding transcripts throughout the cell cycle. (A) Cell-cycle–dependent expression of noncoding genes. Examples of lncRNA, miRNA, and snoRNA differentially expressed throughout the cell cycle in human primary dermal fibroblasts. Circular charts depict average expression of a given gene across all cells identified to be in a certain phase of the cell cycle. (B) Cell-cycle–specific gene clusters comprised of coding and noncoding RNA. Clusters were identified through hierarchical clustering of top 750 mRNA differentially expressed during the cell cycle and all noncoding genes expressed in at least one phase. (C) Expression of known cell-cycle and histone genes across G1, S, and G2M phases. A curated list of histone RNA detected in all three cell types is shown. (D) Examples of histone mRNA differentially expressed between three profiled cell types. Top three marker histone genes per cell type are shown. (E) Size-selection of small RNA library fraction. Bioanalyzer traces corresponding to UMI-tagged and indexed Smart-seq-total v2 library (Upper) and a size-selected library containing small RNA (Lower). (F) Number of detected miRNA. Comparison of miRNA detection by Smart-seq-total v2 and a dedicated single-cell small RNA-seq method (11). (G) Cellular levels of mature miRNA members of miR-17/92, miR-106a/363 and miR106b/25 clusters. Asterisks indicate the significance level estimated from Wicoxon rank sum test: ***<= 0.001,**<= 0.01,*<= 0.05 P value respectively. (H) Pair-wise correlation scores computed for miR-92a-3p and miR-25-3p and their respective mRNA targets (both predicted and validated) across ∼300 HEK293T cells. P values of correlations were computed using t test and adjusted using Benjamini–Hochberg correction).
Fig. 3.Coding and noncoding signature of differentiated single mESCs. (A) Microscope images and corresponding schematic representations of EB formation at four sampled time points. Pie charts represent distribution of mapped reads across RNA types. Genes were assigned to a specific biotype based on GENCODE M23 annotation for the reference chromosomes. tRNA was quantified by mapping the reads, nonmapping to any other RNA type, to the high-confidence gene set obtained from GtRNAdb. (B) Exemplary coding and noncoding genes that are up- or down-regulated during EB formation. Subpanels are grouped according to RNA type. (C) UMAP plot of collected cells colored by timepoint. Cells were clustered using a k nearest-neighbor algorithm and cell lineages were annotated based on the expression of marker genes within the identified clusters. (D) Lineage tree of EB differentiation. Each dot represents a cell colored according to the assigned lineage. Cells are arranged according to the computed pseudotime. (E) UMAP plot of collected cells colored by pseudotime. (F) Heatmap showing the variability in coding and noncoding gene expression across identified clusters. (G) Temporal and lineage-specific expression of selected protein-coding, lncRNA, and miRNA genes. Each column from left to right shows genes specific to: pluripotency state, ectoderm, mesoderm, or endoderm lineages.