| Literature DB >> 30094294 |
Aisha A AlJanahi1,2, Mark Danielsen2, Cynthia E Dunbar1.
Abstract
The recent development of single-cell RNA sequencing has deepened our understanding of the cell as a functional unit, providing new insights based on gene expression profiles of hundreds to hundreds of thousands of individual cells, and revealing new populations of cells with distinct gene expression profiles previously hidden within analyses of gene expression performed on bulk cell populations. However, appropriate analysis and utilization of the massive amounts of data generated from single-cell RNA sequencing experiments are challenging and require an understanding of the experimental and computational pathways taken between preparation of input cells and output of interpretable data. In this review, we will discuss the basic principles of these new technologies, focusing on concepts important in the analysis of single-cell RNA-sequencing data. Specifically, we summarize approaches to quality-control measures for determination of which single cells to include for further examination, methods of data normalization and scaling to overcome the relatively inefficient capture rate of mRNA from each cell, and clustering and visualization algorithms used for dimensional reduction of the data to a two-dimensional plot.Entities:
Keywords: RNA sequencing; computational pipeline; drop-seq; microfluidics; principle component analysis; sci-seq; single-cell gene expression; t-distributed stochastic neighbor embedding
Year: 2018 PMID: 30094294 PMCID: PMC6072887 DOI: 10.1016/j.omtm.2018.07.003
Source DB: PubMed Journal: Mol Ther Methods Clin Dev ISSN: 2329-0501 Impact factor: 6.698
Widely Used Single-Cell Sequencing Methods
| Starting Cell No. | Cell Separation | Notes | Cell Capture | Transcript Capture | Representative Library Prep Cost per Cell | |
|---|---|---|---|---|---|---|
| ∼1,000 cells | cells capture in size-specific chambers | must know the size of cells of interest; allows for staining and imaging prior to cell rupture | 96- or 800-chamber units are available | an average of 6,606 genes/cell (no data on percentage) | $1.70 | |
| ∼150,000 cells/run | droplet-based separation | remains the most cost-effective and most customizable | ∼5% of cells per run (approximately 7,000 cells) | ∼10.7% of the cell’s transcripts | $0.06 | |
| ∼1,700 cells/run | droplet-based separation | the most commercially successful method; almost fully automated | ∼65% of cells per run (approximately 1,000 cells) | ∼14% of the cell’s transcripts | $0.10 | |
| ∼500,000 cells (depends on experimental design) | FACS sorter; cells are never singly isolated | combinatorial indexing of individual methanol-fixed permeable cells | 5%–10% of cells | ∼10%–15% of the cell’s transcripts | $0.05–$0.14 |
All of the methods require the establishment of a cell dissociation technique. The price is highly dependent on the number of cells sequenced, the desired depth of sequencing, and the sequencing platform used. For this table, the prices are at the lower end of the price range for single-cell library prep.
As of July 2018.
Based on the 800-chamber medium-size isolation unit.
Dependent on how many cells are prepped for sequencing and how many doublets are tolerated.
Figure 1The Structure of Drop-Seq Bead and Resulting Sequence Libraries
(A) The structure of a DropSeq single-cell sequencing bead. The oligos extending from the bead have a PCR primer, a cell barcode that is unique to the bead to label each cell, a UMI that is unique to each individual oligo arm to allow unique labeling of each captured molecule, and a poly(T) tail to capture poly(A)-tailed mRNAs. (B) Structure of the sequencing ready library. Red: PCR primers which are also used as sequencing primers. Green and blue: the cellular and molecular barcodes from the bead. Orange: the captured transcript with the poly(A/T) tail.
Figure 2Analysis Workflow
Figure 3Elbow Plot Analysis of Principle Components Variance
A plot of the SD of each principal component, representing the amount of variance it contributes to the data. Here, the plot shows an elbow at around PC5. PC4 and PC6 are also valid choices for the PC cutoff.