| Literature DB >> 26288249 |
Yu Hasegawa1, Deanne Taylor2, Dmitry A Ovchinnikov3, Ernst J Wolvetang3, Laurence de Torrenté4, Jessica C Mar5.
Abstract
An analysis of gene expression variability can provide an insightful window into how regulatory control is distributed across the transcriptome. In a single cell analysis, the inter-cellular variability of gene expression measures the consistency of transcript copy numbers observed between cells in the same population. Application of these ideas to the study of early human embryonic development may reveal important insights into the transcriptional programs controlling this process, based on which components are most tightly regulated. Using a published single cell RNA-seq data set of human embryos collected at four-cell, eight-cell, morula and blastocyst stages, we identified genes with the most stable, invariant expression across all four developmental stages. Stably-expressed genes were found to be enriched for those sharing indispensable features, including essentiality, haploinsufficiency, and ubiquitous expression. The stable genes were less likely to be associated with loss-of-function variant genes or human recessive disease genes affected by a DNA copy number variant deletion, suggesting that stable genes have a functional impact on the regulation of some of the basic cellular processes. Genes with low expression variability at early stages of development are involved in regulation of DNA methylation, responses to hypoxia and telomerase activity, whereas by the blastocyst stage, low-variability genes are enriched for metabolic processes as well as telomerase signaling. Based on changes in expression variability, we identified a putative set of gene expression markers of morulae and blastocyst stages. Experimental validation of a blastocyst-expressed variability marker demonstrated that HDDC2 plays a role in the maintenance of pluripotency in human ES and iPS cells. Collectively our analyses identified new regulators involved in human embryonic development that would have otherwise been missed using methods that focus on assessment of the average expression levels; in doing so, we highlight the value of studying expression variability for single cell RNA-seq data.Entities:
Mesh:
Year: 2015 PMID: 26288249 PMCID: PMC4546122 DOI: 10.1371/journal.pgen.1005428
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Investigating inter-cellular gene expression variability.
(A) Interpreting measures of inter-cellular expression variability in a cell population and implications for understanding regulatory control. (B) Experimental design of the Yan data set.
Fig 2Overview of the analysis performed.
The questions that our study seeks to address, and the main results obtained are highlighted.
Fig 3Distribution of gene expression variability during embryonic development.
(A) The distribution reflects the inter-cellular expression variability for all genes in the transcriptome. An overall widening of the distribution is observed as embryos transition from the 4-cell to the blastocyst stage. As a result, there is an increase in the number of genes with higher levels of expression variability in the blastocyst stage than the 4-cell stage. (B) How the inter-cellular expression variability varies amongst the embryos is captured by the SDE. From the density distribution, we see that the embryos have stable profiles during all four stages of development.
Fig 4Stable genes can be classified into three distinct modes of expression activity.
For each developmental stage, inspection of the absolute expression of the stable genes we identified, reveals that these genes fall into three expression modes spanning low, medium and high levels of expression. (A) The black lines denote the smoothed density of gene expression levels that have been averaged across all cells from a specific developmental stage. The yellow lines indicate the boundaries defining the three expression modes, these boundaries were calculated using a mixture model that clustered the genes into these three groups. (B) Single cell gene expression profiles for the stable genes for each developmental stage. These profiles have been plotted based on which expression mode they have been clustered into using the mixture model. The black dots represent the expression level of the stable genes in a single cell. The colored dot represents the average expression per expression mode.
Fig 5Stage-specific variability markers are based on changes in inter-cellular variability.
(A) A schematic illustrating how a narrower distribution corresponds to greater homogeneity in the expression of a variability marker in a cell population (the shading indicates level of expression of a gene). (B-D) The distribution of expression for the variability markers identified for each developmental stage.
Putative variability markers identified for each stage based on changes in gene expression variability.
Gene names in bold and underlined indicate that these genes that were also identified by a standard ANOVA model.
| Stage | Number of Markers | Genes |
|---|---|---|
| 8-cell | 55 |
|
| Morulae | 8 |
|
| Blastocyst | 11 |
|
Fig 6Effect of HDDC2 shRNA-mediated knock-down.
(A) Decrease in HDDC2 mRNA levels after 2 days of constitutive expression of shRNAs (see S11–S13 Tables). (B) Corresponding drop in the expression of pluripotency markers NANOG and (C) DNMT3B suggests a role for HDDC2 in the maintenance of pluripotency.
Fig 7Activation of the endogenous HDDC2 locus using an inducible Cas9-VP64 system attenuates neural differentiation of human pluripotent stem cells.
(A) Upregulation of the HDDC2 mRNA levels after 2 days of induction of activation in pluripotent hES cells. (B-D) Effect of Cas9-VP64-driven HDDC2 up-regulation on gene expression during the early stages (day 3) of neural differentiation. We observed that artificially-maintained levels of HDDC2 expression (B) resulted in more sustained NANOG expression (C) and lower induction of PAX6 (D), a definitive neuroectodermal marker.
Fig 8Identifying regulatory control states.
Genes were assigned to clusters using a mixture model algorithm for each developmental stage based on levels of inter-cellular variability. The goal was to use this algorithm to quantify how many different clusters, or regulatory control states were present for each stage. The mixture models identified (A) four clusters for the 4-cell stage, (B) three clusters for the 8-cell and (C) morulae stages, and (D) two clusters for the blastocyst stage. In (B-D) the distribution of the inter-cellular expression variability are represented for each cluster or control state per stage.
Fig 9A snapshot of cellular heterogeneity across embryonic development.
(A) The variability profile for a group of cells identifies both the regions and proportions of the transcriptome that have different levels of expression variability. (B) A schematic diagram showing how different densities can arise depending on how heterogeneous the underlying cell population is. (C) Observed densities for the 8-cell, morula and blastocyst stages where each line corresponds to a 4-cell combination selected from the total cells available in each developmental stage.