| Literature DB >> 32699019 |
Qiao Rui Xing1,2, Chadi A El Farran1,3, Ying Ying Zeng1,2, Yao Yi1,3, Tushar Warrier1,3, Pradeep Gautam1,3, James J Collins4,5,6, Jian Xu3,7, Peter Dröge2, Cheng-Gee Koh2, Hu Li8, Li-Feng Zhang2, Yuin-Han Loh1,3,9,10.
Abstract
Joint profiling of transcriptome and chromatin accessibility within single cells allows for the deconstruction of the complex relationship between transcriptional states and upstream regulatory programs determining different cell fates. Here, we developed an automated method with high sensitivity, assay for single-cell transcriptome and accessibility regions (ASTAR-seq), for simultaneous measurement of whole-cell transcriptome and chromatin accessibility within the same single cell. To show the utility of ASTAR-seq, we profiled 384 mESCs under naive and primed pluripotent states as well as a two-cell like state, 424 human cells of various lineage origins (BJ, K562, JK1, and Jurkat), and 480 primary cord blood cells undergoing erythroblast differentiation. With the joint profiles, we configured the transcriptional and chromatin accessibility landscapes of discrete cell states, uncovered linked sets of cis-regulatory elements and target genes unique to each state, and constructed interactome and transcription factor (TF)-centered upstream regulatory networks for various cell states.Entities:
Year: 2020 PMID: 32699019 PMCID: PMC7397874 DOI: 10.1101/gr.257840.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Assay for single-cell transcriptome and accessibility regions (ASTAR-seq). (A) Overview of ASTAR-seq protocol. (B) Dotplot revealing the percentage of fragments in peaks (y-axis) against the number of fragments (x-axis) of each K562 ASTAR ATAC-seq library. Red dotted lines represent the threshold values set for each criterion. (C) Histogram showing the frequency (y-axis) of fragments with the indicated insert size (x-axis). (D) Dotplot showing Pearson's correlation between K562 ASTAR ATAC-seq and the published K562 scATAC-seq libraries. (E) Dotplot revealing detected gene rate (y-axis) of each K562 ASTAR RNA-seq library plotted against its exon mapping rate (x-axis). Blue dots represent the libraries that pass QC, whereas gray dots represent the libraries of low quality. (F) Line plot representing the coverage ratio (y-axis) of K562 ASTAR RNA-seq reads over the gene bodies of housekeeping genes (x-axis). (G) Dotplot showing Pearson's correlation between K562 ASTAR RNA-seq and the published K562 scRNA-seq libraries. (H) Boxplots showing the number of fragments (left) and percentage of fragments in peaks (right) of scATAC-seq libraries prepared by the scCAT-seq and ASTAR-seq protocol. Two-tailed Student's t-test is used to calculate P-values. (I) Boxplots showing the number of de-duplicated reads (left) and genes (right) detected in scRNA-seq libraries prepared by the scCAT-seq and ASTAR-seq protocol. Two-tailed Student's t-test is used to calculate P-values.
Figure 2.Transcriptomic and epigenetic heterogeneity within primed and naive mESCs. (A) Dotplot revealing the percentage of fragments in peaks (y-axis) and number of fragments (x-axis) of each mouse ASTAR ATAC-seq library. Red dotted lines represent the thresholds set for each criterion. (B) Heatmap showing the correlation among mESCs and 2i cells based on their ASTAR ATAC-seq libraries. (C) Dotplot revealing the detected gene rate (y-axis) of each mouse ASTAR RNA-seq library plotted against its exon mapping rate (x-axis). Blue dots represent the libraries that pass QC, whereas gray dots indicate the low-quality libraries. (D) Heatmap revealing the correlation of each mESC ASTAR RNA-seq library (x-axis) to various lineages of MCA panel (y-axis). Color indicates the correlation level, ranging from gray (low) to red (high). Two-cell-like (2C-like) mESCs are boxed with a dotted line. (E) NMF clustering of mESCs and 2i cells based on the correlative signals of their ASTAR ATAC-seq and ASTAR RNA-seq libraries. (F) Heatmaps revealing pairs of accessible regulatory regions (left) and the corresponding target genes (right) that are differentially enriched between the NMF clusters. Each column represents a library, whereas each row represents a chromatin region (left) or a gene (right). Color indicates the accessibility (left) and expression (right) levels, ranging from blue (low) to red (high). Representative genes are indicated on the right. (G) Line plots showing the differential coaccessibility links between the highlighted regions and its surrounding regions, identified using Cicero. Top plots are constructed from ASTAR ATAC-seq libraries of cluster 1 cells, whereas bottom plots are constructed from that of cluster 2 cells. Peak heights (y-axis) denote the coaccessibility scores. (H) Interactome analysis revealing the top pathways enriched by cluster 2–specific genes. (I,J) Heatmaps showing the enrichment (left columns) of TF motifs on cluster 1–specific (I) and cluster 2–specific (J) accessible regions and their relative expressions (right columns).
Figure 3.Application of ASTAR-seq on human cell lines. (A) Clustering of BJ, JK1, K562, and Jurkat ASTAR ATAC-seq libraries based on the human JASPAR motif deviation scores calculated over the HARs. Color indicates the correlation level among the libraries, ranging from blue (no) to red (high). Side color bar (y-axis) indicates the identity of each cell. (B) Variability plot indicating the variable TF motifs across the ASTAR ATAC-seq libraries of four human cell lines. The y-axis represents the variability score assigned to each JASPAR motif, whereas the x-axis represents the motif rank. Top variable motifs are classified based on their enrichment scores across the cell lines and colored accordingly. (C) Multi-Venn diagram showing the shared and unique TFs across the cell lines. (D, top left) t-SNE clustering of BJ, JK1, K562, and Jurkat ASTAR ATAC-seq libraries based on the deviation scores of human JASPAR motifs. Colors represent the cell lines. (Top right and bottom) Superimposition of motif enrichment scores for FOSL1, GATA1, ZBTB33, and TEAD3 on the t-SNE plot. Colors represent the motif enrichment levels, ranging from blue (no) to red (high). (E, left) PCA clustering of ASTAR RNA-seq libraries based on their correlation to the RCA panel. (Right) Heatmap showing the lineages that each cell correlates to. (F) UCSC screenshots indicating the chromatin accessibility (top) and expression (bottom) levels of GATA1 (left) and SP1 (right) across the cell lines.
Figure 4.Interactive analysis of transcriptome and chromatin accessibility during erythroblast differentiation. (A) Schematic of erythroblast differentiation time points harvested for ASTAR-seq library preparation. (B, left) Trajectory of erythroblast differentiation identified from ASTAR RNA-seq libraries using DDRTree dimension reduction. Colors represent time points. (Right) Superimposition of HBA2 expression on the trajectory. Colors represent expression levels. (C) Trajectory plot revealing the pseudotemporal states. Colors represent pseudotemporal states. (D) RCA heatmap showing the clustering of cells undergoing erythroblast differentiation, based on their correlation to the cells of different lineage origins in RCA panel. Color indicates correlation value, ranging from blue (low) to red (high). Each row indicates one lineage, whereas each column represents an ASTAR RNA-seq library. Pseudotemporal state of each cell is indicated on top. Cellular differentiation status is determined based on their correlation to the cells of RCA panel and indicated below. (E) Superimposition of NMF clusters on the trajectory plot. (F) RCA clustering of JK1, K562, and Jurkat cells with the cells of NMF cluster 1 (top), cluster 2 (middle), and cluster 3 (below), respectively. Color indicates correlation value, ranging from blue (low) to red (high). Each row indicates one lineage, whereas each column represents an ASTAR RNA-seq library. Cell identities are indicated on top. (G) Heatmap showing the expression of NMF cluster–specific genes across NMF clusters and the hematopoietic cell lines. Color indicates expression level, ranging from blue (no) to red (high). (H) Line plots indicating the Cicero coaccessibility links between the regions highlighted in red and the distal sites in the surrounding regions. Height indicates the coaccessibility score of the connected peaks. The links are constructed from ASTAR ATAC-seq libraries of cluster 1 (top), cluster 2 (middle), and cluster 3 (bottom), respectively.