| Literature DB >> 27048384 |
Idan Efroni1,2, Kenneth D Birnbaum3.
Abstract
Single-cell transcriptomics has been employed in a growing number of animal studies, but the technique has yet to be widely used in plants. Nonetheless, early studies indicate that single-cell RNA-seq protocols developed for animal cells produce informative datasets in plants. We argue that single-cell transcriptomics has the potential to provide a new perspective on plant problems, such as the nature of the stem cells or initials, the plasticity of plant cells, and the extent of localized cellular responses to environmental inputs. Single-cell experimental outputs require different analytical approaches compared with pooled cell profiles and new tools tailored to single-cell assays are being developed. Here, we highlight promising new single-cell profiling approaches, their limitations as applied to plants, and their potential to address fundamental questions in plant biology.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27048384 PMCID: PMC4820866 DOI: 10.1186/s13059-016-0931-2
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Questions in plant biology to which single cell profiling could be applied: analytical problems and algorithmic solutions
| Biological problem or plant-specific question | Analytical problems for single-cell data | Potential approaches |
|---|---|---|
| Distinguish genes that show true biological variation | Significant technical noise is present | Hypothesis testing based on identification of variation that exceeds empirical estimations of technical noise [ |
| What genes vary among physiologically distinct cells of seemingly homogenous tissues? | Profiles have no replicates and exhibit zero-biased expression distribution, so traditional statistical methods are inappropriate | |
| Model-driven deconvolution of biological variation using estimations of technical noise [ | ||
| Identify transcriptional signature of rare cell types | Linear dimensionality reduction can obscure close relationships and produce misleading clusters | Non-linear t-SNE to minimize joint probability distribution distance and draw similar cells together [ |
| What is the transcriptional profile of root initials? | Clustering methods might miss small sets of cells | backSPIN to impose an order and partition data [ |
| Find subsets of cells with a unique environmental response | Separation of a continuous cell expression space into types is subjective | RaceID to identify new cell types by detecting a significant number of biological gene outliers [ |
| What is the early response of pathogen-susceptible vs. pathogen-resistant cells of the leaf epidermis? | ||
| Assemble dissociated cells into a developmental sequence | Missing data-points exist owing to false negatives and misleading false positives | De novo trajectory reconstruction to order cells using Monocle [ |
| What is the ordered profile of specific cell types from initial to differentiated cells? | Variation in individual plants can create artificial groupings | Seurat to map cells using a priori data and imputation of missing data-points [ |
| ICI to map cells to known reference types using many markers [ | ||
| Follow identity transitions during wound repair or in vitro regeneration | Detecting transitional and multiple identities must be robust in single-cell data with many false positives and false negatives | ICI to classify cells using a priori knowledge of identity markers for detecting mixed or diminished cell identity [ |
| Do plant cells follow a course of de- or trans- differentiation during regeneration? |
ICI index of cell identity, t-SNE t-distributed stochastic neighbor embedding
Fig. 1Single-cell transcriptomic profiles in plants. a The technical noise profile between two single cells of the same cell type, showing high dispersion for transcripts expressed at a low level. The axes are read-counts representing gene expression levels on a log2 scale. As most genes are expected to be expressed at similar levels, the two axes evaluate replication and show that, at these scales, genes expressed at higher levels show the potential to distinguish biological from technical noise. b (upper) The expression distribution of a gene among pooled samples typically shows a peak frequency on a positive expression value. (lower) Gene expression among single-cell samples typically shows a peak frequency at zero, with a subset of cells showing a second peak of positive read counts in a subset of samples. Density represents the frequency of cells showing a given expression level (read count). c Several gold-standard markers in single-cell profiles of cells with known tissue origins. These functional markers are expressed at higher levels (e.g., more replicable expression in a and non-zero expression in b (lower). In these real samples collected from plant cells, markers for the quiescent center (QC), stele, and epidermis all show detectable expression in target cells and are largely absent in non-target cells, with some false-positive and false-negative expression
Fig. 2Hypothetical example showing the pseudo-time ordering of cells collected from the root meristem. (upper) The green-colored cells represent a reporter marking the endodermis and quiescent center (QC). The color gradient represents a continuum of cellular maturation from birth (at bottom) to differentiation (towards the top). Cells are dissociated and isolated using fluorescence-activated cell sorting (FACS), whereupon ordering information is lost. At the right, single-cell expression profiles are used to infer a pseudo-ordering as cells in an approximate sequence. (lower) Two general methods of pseudo-time ordering are shown. Method 1 is unsupervised, using dimensionality reduction to position cells in a hypothetical space and then imposing an optimal path that infers the developmental progression of cells (e.g., Monocle). Method 2 uses markers to place cells in a specific location or developmental zone, with specific approaches differing in the way they adapt to false negatives and false positives. Seurat infers the expression of missing “gold-standard” markers based on coexpressed genes. Index of cell identity (ICI) employs many markers that “vote” on cell localization, where misleading diagnostic markers from false positives and false negatives are overcome by a majority of true positives. (Schematic by Ramin Rahni)