| Literature DB >> 29644800 |
Laura Mincarelli1, Ashleigh Lister1, James Lipscombe1, Iain C Macaulay1.
Abstract
Cells are a fundamental unit of life, and the ability to study the phenotypes and behaviors of individual cells is crucial to understanding the workings of complex biological systems. Cell phenotypes (epigenomic, transcriptomic, proteomic, and metabolomic) exhibit dramatic heterogeneity between and within the different cell types and states underlying cellular functional diversity. Cell genotypes can also display heterogeneity throughout an organism, in the form of somatic genetic variation-most notably in the emergence and evolution of tumors. Recent technical advances in single-cell isolation and the development of omics approaches sensitive enough to reveal these aspects of cell identity have enabled a revolution in the study of multicellular systems. In this review, we discuss the technologies available to resolve the genomes, epigenomes, transcriptomes, proteomes, and metabolomes of single cells from a wide variety of living systems.Entities:
Keywords: epigenomics; genomics; proteomics; single-cell; technology; transcriptomics
Mesh:
Substances:
Year: 2018 PMID: 29644800 PMCID: PMC6175476 DOI: 10.1002/pmic.201700312
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 1Definitions of cell type and state. A) In the hematopoietic system, cell types have been typically defined by a combination of cell surface marker expression and functional output in in vitro and in vivo assays. B) Within cell types, multiple cell states are possible, including quiescence, active cycling, senescence, and in some cases, resting and activated states. C) Population‐level characterization enables molecular definition of the differences between cell types, in this hypothetical example using principal components analysis (PCA) of RNA‐seq data. This, however, does not reveal heterogeneity within these phenotypically defined populations. Through (D) single‐cell analysis, it is possible to explore this heterogeneity, even in rare cell populations such as HSCs, revealing novel cell phenotypes—cell types and states—within a “homogeneous” population of cells. Abbreviations: LT‐HSC, long‐term reconstituting HSC; FSR‐HSC, finite self‐renewal HSC; LMPP, lymphoid‐primed multipotential progenitors; CMP, common myeloid progenitor; MK, megakaryocyte; E, eryrthroid; My, Myeloid; T, T‐cell; B, B‐cell; PC, principal component.
Figure 2Methods for isolation and sequencing of single cells. A) FACS‐based cell isolation enables selective deposition of single cells into multiwell plates for downstream molecular processing. Index sorting allows some information about each cell's phenotype to be recorded as it is deposited into the well. Once the cells have been deposited, a number of molecular processes are possible. B) Droplet‐based cell isolation involves the partitioning of single cells into individual droplets with uniquely barcoded oligonucleotides. In the case of single‐cell mRNA‐seq these barcoded oligos prime first strand synthesis of cDNA from the poly‐A tail. Reverse transcription is then performed in a droplet emulsion, resulting in each cDNA molecule being uniquely tagged based on its cell of origin. Unique molecular identifiers (UMIs) are also incorporated to enable unequivocal counting of the number of detected molecules. C) Nanowell‐based approaches use a similar approach, but rather than partitioning cells into droplets, cells are captured in minute wells with uniquely barcoded beads. D) Combinatorial indexing strategies have used a two‐step barcoding strategy for DNA or cDNA molecules to increase throughput without the need for microfluidics. First, a primary barcode is added to small pools of FACS isolated cells/nuclei (in the case of cDNA, this is added during reverse transcription, in the case of DNA this is added through tagmentation with barcoded adaptors) which are then re‐pooled with other distinctly barcoded cells and again sorted into small pools, where they received a second barcode. Thus, each cell receives a unique pairing of barcoded molecules, enabling each sequencing read to be assigned to an individual cell.
Figure 3Methods for the analysis of single‐cell identity. An overview of the methods currently available to study the genome, epigenome, transcriptome and proteomes of single cells, some of which have been combined into multi‐omic single‐cell assays.
Methods for whole‐genome amplification and sequencing of DNA from single cells
| Method | Platform | Number of Cells (typical) | Description | Advantages | Disadvantages | Applications |
|---|---|---|---|---|---|---|
|
| Microwell plate/tubes | 10s–100s | Randomly primed isothermal amplification of DNA by Phi29 polymerase |
Broad genome coverage High fidelity amplification (Error rate 1.2 × 10–5) |
Potential for inaccurate identification of CNVs due to allelic drop out, GC bias and uneven amplification Generation of chimeric molecules |
Genome‐wide or targeted SNV and structural variant analysis WGA‐X uses a thermostable version of phi 29 called Equiphi 2 for the amplification of DNA from microbial cells or cells from environmental samples eMDA (emulsion multiple displacement amplification) can be used to call SNVs |
|
| Microwell plate/tubes | 10s–100s | Two‐stage (linear and exponential) amplification of DNA | Accurate representations of CNVs | Higher error rate than MDA, potential for inaccurate SNV calling | Genome‐wide CNV analysis |
|
| Microwell plate/tubes | 10s–100s | Two‐stage (linear and exponential) amplification of DNA |
Broad genome coverage (85–93%) Accurate representations of CNVs Lower allelic dropout than MDA or PicoPLEX | Higher error rate than MDA |
Genome‐wide CNV analysis Pre‐implantation genetic screening of genomic or mitochondrial DNA (Shang et al., 2017) |
|
| Microwell plate/tubes | 10s–100s | DNA digestion and adaptor ligation, PCR amplification |
Broad genome coverage Lower allelic dropout than MDA, PicoPLEX and MALBAC | Less efficient enrichment in targeted genome sequencing than other methods | Genome‐wide CNV or SNV detection |
|
| Microwell plate/tubes | 10s–100s |
Tagging of individual DNA strands during replication using BrdU followed by selective depletion of tagged strands Direct library preparation from BrdU‐ strands |
Generates sequences from homologous chromosomes Direct library construction without WGA, reduced sequence bias and allelic drop‐out | Requires that the sample can be treated with BrdU for one round of cell division—difficult to apply in vivo |
Detection of copy neutral genomic rearrangements Studying meiotic recombination Studying inheritance‐ by haplotyping/ phasing |
|
| Combinatorial indexing | 1000s–10 000s |
Dual indexing of individual nucleosome depleted genomes Index 1 is added by tagmentation of 96 pools of 2000 nuclei each Tagged cells are pooled and re‐sorted, with Index 2 added by PCR amplification of pools of 22 cells |
High throughput No custom equipment required, uses FACS for isolation and pooling of nuclei | Shallow sequencing of individual cells therefore only large CNVs can be analyzed | CNV analysis in large numbers of single cells |
|
| Custom microfluidics | 1000s–10,000s | Single‐cell encapsulation in droplets with uniquely barcoded adaptors | High throughput |
Shallow sequencing of individual cells, only small (microbial) genomes have been analyzed to date Custom microfluidics required |
Analysis of microbial populations at single‐cell resolution In theory can be applied to eukaryotic cells for CNV analysis |
|
| Custom microfluidics | 10s | Separate the Watson and Crick DNA strands; randomly partition megabase‐size fragments into multiple nanoliter compartments for amplification and construction of barcoded libraries for sequencing |
Low error rate (1 × 10–8) Ability to assemble large fragments of single‐cell genomes (N50 > 7 Mb, largest contig 9 Mb) | Low throughput—only tested on three human cells, though has potential for high‐throughput modifications |
SNV calling in single cells Single‐cell genome assembly and long read, haplotyping information such as HLA haplotyping for donor‐patient matching |
|
| Custom nanowell array | 100s–1000s | Cells are captured in nanowells and MDA performed in ∼12 nL volumes Successful reactions are then picked for sequencing |
Less amplification bias compared to conventional MDA More even coverage than MDA Lower reaction volumes reduce cost Demonstrated to work with microbial and human cells |
No commercial availability of the microwell arrays Low cell loading numbers Micromanipulation used to aspirate amplified DNA from microwells | CNV analysis in single cells |
Methods for epigenomic analysis of single cells
| Modification | Method | Platform | Number of Cells (typical) | Description |
|---|---|---|---|---|
|
| ScBS‐seq | Microwell plate/tubes | 10–100s | Bisulfite conversion of unmodified C to T, 5mC remains unconverted |
| scRRBS | Microwell plate/tubes | 10–100s | Reduced representation bisulfite sequencing, enables single‐base resolution DNA methylation | |
| scAba‐seq | Microwell plate/tubes | 10–100s | Hydroxymethylation (5hmC) profiling. Glucosylation of 5hmc position generates recognition sites for restriction endonuclease AbaSI | |
| CLEVER‐seq | Microwell plate/tubes | 10–100s | Formylcytosine (5fC) detection by direct chemical labeling with malonitrile. Subsequent conversion C‐T at 5fC labeled sites during amplification and sequencing | |
|
| NOME‐seq | Microwell plate/tubes | 10–100s | Methyltransferase (methylase) enzyme is used to label accessible (or nucleosome depleted) DNA prior to bisulfite sequencing which distinguishes between methylated and unmethylated chromatin states |
| scDNAse‐seq | Microwell plate/tubes | <1000 | Method of detecting genome‐wide DHSs, DNase I hypersensitive sites. This technique enables genome‐wide mapping of hypersensitive site, therefore of active regulatory elements of transcription | |
| scATAC‐seq | Fluidigm C1 platform | 10–100s | Individual cells are captured and assayed on Fluidigm platform. Tn5 transposase tags regulatory regions by inserting sequencing adapters into accessible regions of the genome, allowing measurement of open chromatin sites | |
| scATAC‐seq | Combinatorial indexing | 10 000s | Integration of combinatorial cellular indexing and ATAC‐seq to measure chromatin accessibility in large numbers of single cells | |
|
| scChiP‐seq (drop‐Chip) | Custom microfluidics platform | 1000s–10 000s | Droplet‐based microfluidic system process single cell to indexed chromatin fragments. Indexed chromatin from multiple cells can then be combined and subsequently immunoprecipitation can be performed |
|
| scDam‐ID | Microwell plate/tubes | 100 | Enables mapping of genome‐wide nuclear lamina interactions domains in single human cell. Dam adenosine transferase methyltransferase is fused with lamin B1 (constituent of nuclear lamina) and expressed in cells so that sites interactions are mapped from sequence tags after DpnI digestion |
| scHi‐C | Microwell plate/tubes | 10–100s | Identifies DNA sequences in close spatial proximity in the nucleus after restriction enzyme digestion and DNA ligation | |
| scHi‐C | Microwell plate/tubes | 10–100s | Improved Hi‐C protocol able to determine whole‐genome structures of single G1‐phase haploid cells and define 3D models of chromosome organization |
Methods for transcriptomic analysis of single cells
| Method | Platform | Number of Cells (typical) | Description | UMI | Applications | Typical number of sequencing reads per cell |
|---|---|---|---|---|---|---|
| Smart‐seq/Smart‐seq2 | Microwell plate/tubes/Fluidigm C1 platform | 100s–1000s | Template‐switching PCR‐based full‐length transcript amplification. Can be applied to cells or nuclei (scNuc‐seq) | No |
Transcript enumeration Analysis of alternative splicing allelic expression | 500 000–4 000 000 |
|
| Microwell plate/tubes | 100s–1000s | In vitro transcription‐based 3′ transcript amplification | Yes | Transcript quantification | 100 000–1 000 000 |
|
| Microwell plate/tubes (also modified for ICell8 Nanogrid | 100s–1000s | Template‐switching PCR‐based full‐length transcript amplification followed by 5′ selection | Yes | Transcript quantification | 100 000–1 000 000 |
|
| Combinatorial indexing | 1000s–10 000s | Combinatorial indexing approach in which transcripts are first indexed during first strand synthesis and subsequently during PCR of 3′ sequencing tags | Yes | Transcript quantification | 20 000–200 000 |
| Droplet‐based approaches |
Drop‐seq InDrops
10X Genomics Chromium Dolomite Nadia | 1000s–10 000s | Cells are partitioned into individual droplets and cDNA molecules are uniquely barcoded during reverse transcription | Yes | Transcript quantification | 20 000–200 000 |
| Nanowell approaches |
SeqWell
Nanogrid (ICell8) BD Rhapsody | 1000s–10 000s | Cells are partitioned into individual wells of a custom built nanowell chip and cDNA molecules are uniquely barcoded during reverse transcription | Yes | Transcript quantification | 20 000–200 000 |