| Literature DB >> 35101061 |
Shijie C Zheng1, Genevieve Stein-O'Brien2,3,4,5, Jonathan J Augustin2, Jared Slosberg2, Giovanni A Carosso2, Briana Winer2, Gloria Shin2, Hans T Bjornsson2,6,7,8, Loyal A Goff9,10,11, Kasper D Hansen12,13.
Abstract
BACKGROUND: The cell cycle is a highly conserved, continuous process which controls faithful replication and division of cells. Single-cell technologies have enabled increasingly precise measurements of the cell cycle both as a biological process of interest and as a possible confounding factor. Despite its importance and conservation, there is no universally applicable approach to infer position in the cell cycle with high-resolution from single-cell RNA-seq data.Entities:
Keywords: Cell cycle; Single-cell RNA-sequencing; Transfer learning
Mesh:
Year: 2022 PMID: 35101061 PMCID: PMC8802487 DOI: 10.1186/s13059-021-02581-y
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 17.906
Fig. 1Principal component analysis recovers time ordering in simulations. Simulations are based on cosine functions with Gaussian noise (see the “Methods” section). (a) Expression vs. time for 2 genes with different peak locations and amplitudes. Each of the two gene peaks are replicated 50 times for a total of 500 genes and 1000 time points (cells). (b) Expression vs. permuted time, representing the unknown time order of observed data which obscures the periodicity of the functions. (c) Principal component analysis of the data from (b) and (a); the two datasets have equivalent principal components. We infer cell-cycle position () by the angle of the ellipsoid. The red dot indicates θ=0. (d) Expression vs. inferred cell-cycle position
Fig. 2The cell-cycle ellipsoid and cell-cycle position. a Top 2 principal components of GO cell-cycle genes from E14.5 primary mouse cortical neurospheres, in which the variation is primarily driven by cell cycle. Each point represents a single cell, which is colored by 5-stage cell-cycle representation, inferred using the SchwabeCC method [15]. The cell-cycle position θ (with values in [0,2π); sometimes called cell-cycle pseudotime) is the polar angle. b As in (a), but for a dataset of primary mouse hippocampal progenitor cells from both a mouse model of Kabuki syndrome and a wildtype. c A comparison of the weights on principal component 1 between the cortical neurosphere and hippocampal progenitor datasets. Genes with high weights (|score|>0.1 for either vector) are highlighted in red. PCC: Pearson Correlation Coefficient. d, e The expression dynamics of dTop2A and eSmc4 using the inferred cell-cycle position, with a periodic loess line (see the “Methods” section). f The dynamics of total UMI using the inferred cell-cycle position, with a periodic loess line, illustrating the high agreement of the dynamics between datasets
Fig. 3When principal component analysis fails to describe the cell cycle. Data is from the developing mouse pancreas. a UMAP embedding using all variable genes. Cells are colored by cell type. b PCA plot of the cell-cycle genes; this reflects the differentiation path in (a). c PCA plot of the cell-cycle genes for ductal cells only; this plot reflects cell cycle
Datasets
| Dataset | Species | Platform | # cells | Note | Reference |
|---|---|---|---|---|---|
| mNeurosphere | Mouse | 10x | 12805 | here | |
| mHippNPC | Mouse | 10x | 9188 | here | |
| mPancreas | Mouse | 10x | 3559 | [ | |
| mHSC | Mouse | SMARTer | 1343 | [ | |
| mRetina | Mouse | 10x | 99260 | [ | |
| HeLa 1 | Human | Drop-seq | 1398 | [ | |
| HeLa 2 | Human | Drop-seq | 2463 | [ | |
| mESC | Mouse | Fluidigm C1 | 279 | FACS | [ |
| hESC | Human | Fluidigm C1 | 226 | FACS | [ |
| hU2OS | Human | SMART-seq2 | 1114 | FUCCI | [ |
| hiPSCs | Human | Fluidigm C1 | 888 | FUCCI | [ |
| Fetal tissue atlas | Human | sci-RNA-seq3 | Varies | [ |
Fig. 4A pre-learned weights matrix learned from proliferating cortical neurospheres enables cell-cycle position estimation in other proliferating datasets. a Different datasets (hippocampal NPCs, mouse pancreas, mouse retina, and HeLa set 2) projected into the cell-cycle embedding defined by the cortical neurosphere dataset. Cell-cycle position θ is estimated as the polar angle. b Inferred expression dynamics of Top2A (TOP2A for human), with a periodic loess line (Methods). c UMAP embeddings of top variable genes. All the cells are colored by cell-cycle position using a circular color scale. We put the discrete stage labels in approximated position on the circular legend to help relate the continuous θ to the discrete stages
Fig. 5Evaluation of tricycle on FUCCI datasets. a–c Data from Mahdessian et al. [20]. a FUCCI scores colored by tricycle cell-cycle position. b Comparison between FUCCI pseudotime and tricycle cell-cycle position with a periodic loess line. We are displaying the data on [0.9π,0.9π+2π] (compared to [0,2π] elsewhere) because FUCCI pseudotime of 0 roughly corresponds to a tricycle position of 0.9π. Cells in the dotted rectangle were moved for display purposes in this panel by adding 2π (one period) to tricycle θ to reflect the higher temporal resolution around the anaphase-metaphase transition for FUCCI pseudotime (see the “Results” section). c R2 values of periodic loess line of all projection genes when using tricycle θ and FUCCI pseudotime as the predictor. The dashed line represents y=x. d–g Data from Hsiao et al. [14]. d FUCCI scores colored by tricycle cell-cycle position. e, f Expression dynamics of Top2A with a periodic loess line using either e tricycle cell-cycle position or f FUCCI pseudotime inferred by Hsiao et al. [14]. Cells are colored by corresponding x-axis values. g Similar to (c), but for the data from Hsiao et al. [14]
Fig. 6Application of tricycle on a human fetal tissue atlas. Data is from Cao et al. [35]. a UMAP embedding of human fetal tissue atlas data colored by cell-cycle position θ estimated using mNeurosphere reference. b The percentage of actively proliferating cells in human fetal tissue atlas. Tissues are ordered decreasingly with the percentage. Tissue and cell type annotations are available in Additional file 1: Fig. S25