| Literature DB >> 32929225 |
Jeongwoo Lee1, Do Young Hyeon1, Daehee Hwang2.
Abstract
Advances in single-cell isolation and barcoding technologies offer unprecedented opportunities to profile DNA, mRNA, and proteins at a single-cell resolution. Recently, bulk multiomics analyses, such as multidimensional genomic and proteogenomic analyses, have proven beneficial for obtaining a comprehensive understanding of cellular events. This benefit has facilitated the development of single-cell multiomics analysis, which enables cell type-specific gene regulation to be examined. The cardinal features of single-cell multiomics analysis include (1) technologies for single-cell isolation, barcoding, and sequencing to measure multiple types of molecules from individual cells and (2) the integrative analysis of molecules to characterize cell types and their functions regarding pathophysiological processes based on molecular signatures. Here, we summarize the technologies for single-cell multiomics analyses (mRNA-genome, mRNA-DNA methylation, mRNA-chromatin accessibility, and mRNA-protein) as well as the methods for the integrative analysis of single-cell multiomics data.Entities:
Year: 2020 PMID: 32929225 PMCID: PMC8080692 DOI: 10.1038/s12276-020-0420-2
Source DB: PubMed Journal: Exp Mol Med ISSN: 1226-3613 Impact factor: 8.718
Fig. 1An overview of single-cell multiomics sequencing technologies.
Single-cell multiomics sequencing technologies and the expected outcomes are illustrated. Technologies that measure more than two types of data are included in multiple categories (e.g., scTrio-seq in transcriptome-genome and transcriptome-DNA methylation categories).
Single-cell multiomics technologies.
| Category of multiomics | Methods | Cell isolation and DNA/RNA/protein separation | Molecules measured | Cell throughput | Automation | Notes | Refs |
|---|---|---|---|---|---|---|---|
| Genome-transcriptome | G&T-seq | Flow cytometry (cell isolation); and bead-based separation (DNA and polyadenylated mRNA) | gDNA and polyadenylated mRNA (whole cell) | Medium | Yes | [ | |
| DR-seq | Cell picking by pipette (cell isolation); and preamplification and tagging of DNA and RNA followed by splitting | gDNA and polyadenylated mRNA (whole cell) | Low | No | [ | ||
| SIDR | Microplate-based cell isolation; and separation of nucleus and cytoplasm using hypotonic lysis | gDNA and polyadenylated cytosolic RNA | Low | [ | |||
| TARGET-seq | FACS (cell isolation); and reverse transcription and amplification followed by library preparation | Targeted gDNA, polyadenylated mRNA, and targeted mRNA (whole cell) | High | [ | |||
| Genome-transcriptome- DNA methylome | scTrio-seq | Cell picking by pipette; and centrifugation-based separation of nucleus and cytoplasm followed by bisulfite treatment | CNVs (computational inference), gDNA methylation, and polyadenylated cytosolic RNA | Low | No | Bisulfite treatment | [ |
| Transcriptome-DNA methylome | scM&T-seq | Flow cytometry (cell isolation); and bead-based separation of DNA and polyadenylated mRNA followed by bisulfite treatment | gDNA methylation and polyadenylated RNA (whole cell) | Medium | Yes | Bisulfite treatment | [ |
| scMT-seq | Micropipetting for isolation of single nuclei. | gDNA methylation and polyadenylated cytosolic RNA | Low | Partial | Bisulfite treatment | [ | |
| Transcriptome-chromatin accessibility | sci-CAR | Combinatorial indexing; and lysate splitting followed by library preparation | Chromatin and polyadenylated nuclear RNA | High | [ | ||
| SNARE-seq | Microfluidic channels (cell isolation); and open chromatin tagmentation followed by dual-omics capture | Chromatin and polyadenylated nuclear RNA | High | [ | |||
| Paired-seq | Combinatorial indexing; and preamplification followed by splitting and enzymatic digestion | Chromatin and polyadenylated nuclear RNA | High | [ | |||
| Transcriptome-DNA methylome-chromatin accessibility | scNMT-seq | FACS (cell isolation); GpC labeling; and bead-based separation of nucleus and cytoplasm followed by bisulfite treatment | gDNA methylation, chromatin, and polyadenylated cytosolic RNA, | Medium | Partial | Bisulfite treatment | [ |
| Transcriptome-proteome | PEA/STA | Microfluidic channels (cell isolation); and reverse transcription of PEA probe and RNA followed by targeted amplification | Targeted RNA and targeted proteins | Medium | Yes | [ | |
| PLAYR | Flow or mass cytometry (cell isolation); and detection of amplified product of PLAYR probe pair and antibody staining | Targeted cytosolic RNA and targeted proteins | High | No | Cell fixation and permeabilization | [ | |
| CITE-seq | Drop-seq and 10X Genomics platform (cell isolation); and reverse transcription of mRNA and antibody-derived oligonucleotides followed by separation of libraries | Polyadenylated RNA and targeted cell surface proteins | High | No | [ | ||
| REAP-seq | 10x Genomics Chromium (cell isolation); and reverse transcription of mRNA and antibody-derived oligonucleotides followed by separation of libraries | Polyadenylated RNA and targeted cell surface proteins | High | No | [ | ||
| RAID | Plate-based cell isolation; and cell crosslinking and immunostaining with RNA-barcoded antibodies | Polyadenylated RNA and targeted intracellular proteins | High | Cell fixation and permeabilization | [ | ||
| Transcriptome-proteome-clonotypes-CRISPR perturbations | ECCITE-seq | 10X Genomics platform (cell isolation); and capture of mRNA, sgRNA, and antibody-derived oligonucleotides followed by separation of libraries | Polyadenylated RNA, sgRNA, and targeted cell surface proteins | High | Compatible with existing CRISPR guide libraries | [ |
Cell throughput is categorized as low, medium, or high based on the number of cells applied in each reference. Low, fewer than 50 cells; Medium, from 50 to 200 cells; High, more than 200 cells.
G&T-seq genome and transcriptome sequencing, DR-seq gDNA-mRNA sequencing, SIDR simultaneous isolation of genomic DNA and total RNA, scTrio-seq single-cell triple omics sequencing, scM&T-seq single-cell methylome and transcriptome sequencing, scMT-seq single-cell methylome and transcriptome sequencing, sci-CAR single-cell combinatorial indexing chromatin accessibility and mRNA, SNARE-seq single-nucleus chromatin accessibility and mRNA expression sequencing, Paired-seq parallel analysis of individual cells for RNA expression and DNA accessibility by sequencing, scNMT-seq single-cell nucleosome, methylation and transcription sequencing, PEA/STA proximity extension assay/specific (RNA) target amplification, PLAYR proximity ligation assay for RNA, CITE-seq cellular indexing of transcriptomes and epitopes by sequencing, REAP-seq RNA expression and protein sequencing assay, RAID single-cell RNA and immuno-detection, CRISPR clustered regularly interspaced short palindromic repeats, ECCITE-seq expanded CRISPR-compatible cellular indexing of transcriptomes and epitopes by sequencing, FACS fluorescence-activated cell sorting, Drop-seq droplet-sequencing, sgRNA single-guide RNA, CNV copy number variation.
Fig. 2Single-cell multiomics sequencing protocols for the integrative analyses of the genome and transcriptome.
Protocols for the isolation of single cells or nuclei and the barcoding of gDNA and mRNAs are shown for five types of multiomics analyses of the genome and transcriptome: scTrio-seq (a), DR-seq and G&T-seq (b), SIDR (c), and TARGET-seq (d). Blue solid circles, nucleus; blue dotted line, permeabilized membrane; red and green lines, mRNA and gDNA, respectively; yellow solid circles, beads; Y shapes, antibodies; magenta and green fragments, barcodes or primers; and U shapes, magnets. See text for the definitions of the abbreviations.
Fig. 3Single-cell multiomics sequencing protocols for the integrative analysis of the transcriptome and epigenome.
Protocols for the isolation of single cells or nuclei and the barcoding of gDNA and mRNAs are shown for five types of multiomics analyses of the transcriptome and epigenome: scM&T-seq (a) and scMT-seq (b) for DNA methylation and sci-CAR (c), SNARE-seq (d), and scNMT-seq (e) for chromatin accessibility. Gray cross, nucleosome; Me, CH3. In c, the colors of the border line, inside, and tip of the circles distinguish the barcodes of mRNAs, accessible DNA fragments, and indexed PCR, respectively. See the legend in Fig. 2 for the other symbols and the text for the definitions of the abbreviations.
Fig. 4Single-cell multiomics sequencing protocols for integrative analyses of the transcriptome and proteome.
Protocols for the isolation of single cells and the barcoding of mRNAs and proteins are shown for four types of multiomics analyses of the transcriptome and proteome: PEA/STA (a), PLAYR (b), CITE-seq (c), and RAID (d). Green-blue line, single-stranded DNA (ssDNA) oligos conjugated to antibodies; rotated U shapes, PLAYR probes; green-orange circle, backbone-insert oligos; and DNA fragments containing stars, isotope-labeled probes. See the legend of Fig. 2 for the other symbols and the text for the definitions of the abbreviations.
Fig. 5Strategies for the integrative analysis of single-cell multiomics data.
Blue and green heat maps represent the data matrixes for the transcriptome and DNA methylome, respectively. The symbols n, m, and m denote the numbers of cells (n) and genes with the levels of mRNA (m) and DNA methylation (m). Colors in the heat maps represent the levels of mRNA and DNA methylation (see color bars; Max, the maximum level). a Correlation analysis between mRNA and DNA methylation levels. Scatter plots show mRNA and DNA methylation levels for genes 1 (top) and 2 (bottom). Line, regression line; r, Pearson correlation. Negative and positive correlations are shown for genes 1 and 2, respectively. b Analysis of scRNA-seq data followed by the integration of scBS-seq data. Principal component analysis (PCA) is first applied to scRNA-seq data to obtain score values for k PCs, the pairwise Euclidean distances of cells are computed using the score values for k PCs to generate a distance matrix, t-stochastic neighbor embedding (t-SNE) clustering is applied to the distance matrix to identify cell populations, and scBS-seq data are then integrated into these cell populations as described in the text. C1-3, cell populations 1-3, respectively. c Integrative analysis of scRNA-seq and scBS-seq data to generate the overall single-cell map. The analytical scheme of MOFA is shown. Two-way matrix decomposition is performed for scRNA-seq and scBS-seq data using k factors, resulting in weight matrixes (m × k for scRNA-seq data and m × k for scBS-seq data) and a factor loading matrix (k × n for n cells). Factor loading values are used to compute a distance matrix that is then used for t-SNE clustering. The t-SNE plot shows cell populations 1-4 (C1-4) identified collectively by scRNA-seq and scBS-seq data.