| Literature DB >> 36161934 |
Honggui Wu1,2,3, Xiang Li1,2,4, Fanchong Jian1,2,5, Ayijiang Yisimayi1,2,3, Yinghui Zheng1,2, Longzhi Tan6, Dong Xing1,2, X Sunney Xie1,2,4.
Abstract
Recent advances in single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) and its coassays have transformed the field of single-cell epigenomics and transcriptomics. However, the low detection efficiency of current methods has limited our understanding of the true complexity of chromatin accessibility and its relationship with gene expression in single cells. Here, we report a high-sensitivity scATAC-seq method, termed multiplexed end-tagging amplification of transposase accessible chromatin (METATAC), which detects a large number of accessible sites per cell and is compatible with automation. Our high detectability and statistical framework allowed precise linking of enhancers to promoters without merging single cells. We systematically investigated allele-specific accessibility in the mouse cerebral cortex, revealing allele-specific accessibility of promotors of certain imprinted genes but biallelic accessibility of their enhancers. Finally, we combined METATAC with our high-sensitivity single-cell RNA sequencing (scRNA-seq) method, multiple annealing and looping based amplification cycles for digital transcriptomics (MALBAC-DT), to develop a joint ATAC-RNA assay, termed METATAC and MALBAC-DT coassay by sequencing (M2C-seq). M2C-seq achieved significant improvements for both ATAC and RNA compared with previous methods, with consistent performance across cell lines and early mouse embryos.Entities:
Keywords: coaccessibility; imprinted gene; joint ATAC–RNA; scATAC-seq
Mesh:
Substances:
Year: 2022 PMID: 36161934 PMCID: PMC9546615 DOI: 10.1073/pnas.2206450119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.METATAC shows high detection efficiency in single cells. (A) Schematic workflow of METATAC. Cells are permeabilized with Omni-ATAC lysis; then nuclei are transposed with META transposome, single nuclei are sorted to 96-well plates via flow cytometry, Tn5 is removed with SDS from bound DNA, and META amplification and cell barcoding are conducted in two steps via an acoustic liquid transfer system. (B) Spearman correlation of chromatin accessibility peaks across bulk and single-cell datasets for K562 and GM12878. (C) Comparison of library size between META transposome (n = 1,099) and conventional Nextera transposome (n = 87) using GM12878 cells. Two samples are sequenced at the same depth, and library size was estimated with the Lander–Waterman equation. QC metrics of scATAC-seq technologies: (D) Library size of all technologies. The median library size for METATAC is 587,707 reads (n = 1,099, GM12878), as compared with s3-ATAC (26) (97,142 reads, n = 2,174, human prefrontal cortex), 10× scATAC-seq (13) (45,515 reads, n = 5,297, GM12878), dscATAC (12) (66,618 reads, n = 2,295, GM12878), Chen plate-based scATAC-seq (14) (26,700 reads, n = 384, K562), HyDrop-ATAC (27) (2,114 reads, n = 1,141, MCF-7), Fluidigm C1 (8) (33,529 reads, n = 382, GM12878), and sciATAC-seq methods (9) (5,300 reads, n = 533, GM12878). (E) Fraction of reads mapped in peaks. For this comparison, peaks are called for each technology individually. The median FRiP of METATAC is 75.412%. (F) Proportion of reads mapped to nuclear genome. The median for METATAC is 95.552%. (G) Comparison of detected DHSs versus cell number across different techniques of GM12878. Each cell number is randomly performed three times.
Fig. 2.High-detectability METATAC revealed the coactuation of functionally related regulatory elements. (A) Definition of coactuation in METATAC with hypergeometric test. (B) METATAC coaccessibility recovers independently validated long-range interactions. The coaccessibility score was calculated with K562 cells (n = 747). A CRISPRi screen for MYC locus on K562 identified seven distal enhancer regions (29), four of which are shown here. Model-based Analysis of PLAC-seq and HiChIP (MAPS) for K562 H3K27ac HiChIP data, Cicero calculated coaccessibility (all and threshold >0.25 are shown), and hypergeometric test calculated coaccessibility (score >2 are shown) only show MYC anchored links. CRISPRi identified enhancers are highlighted with arrowheads. (C) Coaccessibility score of neighboring peaks as a function of the distance between the accessible peaks, which was calculated with GM12878 cells (n = 1,099). (D) Coaccessibility score of E–P loop anchors, grouped by loop size, which was calculated with GM12878 cells (n = 1,099). E–P loops are called from in situ Hi-C data at 5-kb resolution (32). True represents E–P loop anchors, and false represents regions neighboring loop anchors. (E) Hierarchical clustering of RELB ChIP-seq peaks based on coaccessibility scores. Two groups were identified, which we named the high-coaccessibility group and the low-coaccessibility group according to coaccessibility scores within each group, which are calculated with GM12878 cells (n = 1,099). The high-coaccessibility group is highlighted. For visualization, only 2,000 peaks were subsampled. (F) TF motif enrichment for the high-coaccessibility group and the low-coaccessibility group in E.
Fig. 3.Allele-specific accessibility landscape of mouse brain. (A) Uniform Manifold Approximation and Projection (UMAP) visualization of cells derived from mouse cerebral cortex (n = 1,370). (B) Allele-specific accessibility of cell types in A. Red indicates maternal-specific, and blue indicates paternal-specific. (C) TFs motif enrichment for allele-specifically accessible peaks. (D) Allele-specific accessibility of imprinted genes, grouped as maternally imprinted genes and paternally imprinted genes. Positive value indicates maternal-specific accessibility, and negative value indicates paternal-specific accessibility. (E) Scatterplot between expression specificity and accessibility specificity of imprinted genes. (F) The accessibility specificity of enhancers linked to imprinted genes, grouped by maternal/paternal imprinting. (G) A paternal-specifically accessible region (chr7, 59.5 to 62.5 Mb). (H) A maternal-specifically accessible region (chr12, 109.54 to 109.74 Mb). Two microRNA clusters are labeled.
Fig. 4.Joint single-cell transcriptome and chromatin accessibility sequencing. (A) Workflow of our simultaneous ATAC-RNA method. (B) Number of ATAC fragments in peaks. The median number of M2C-seq (GM12878, 31,790, n = 511; K562, 18,540, n = 359) is compared with METATAC only (GM12878, 108,792, n = 1,186), SHARE-seq (17) (GM12878, 5,363, n = 1,204), 10× multiome (GM12878, 10,924, n = 2,714), SNuBar-ARC (42) (K562, 17,369, n = 5,131), ASTAR-seq (21) (K562, 24,231, n = 136), sci-CAR (18) (HEK293T, 558, n = 711; A549, 545, n = 3,427), SNARE-seq (19) (GM12878, 507, n = 140; K562, 507, n = 200), and Paired-seq (20) (HEK293T, 885, n = 1,833; HepG2, 843, n = 1,186). (C) Number of RNA UMIs. The median number of M2C-seq (GM12878, 12,068, n = 560; K562, 12,494, n = 359) is compared with MALBAC-DT only (GM12878, 87,728, n = 948), SHARE-seq (17) (GM12878, 6,173, n = 1,159), 10× multiome (GM12878, 3,716, n = 2,714), SNuBar-ARC (K562, 12,642, n = 6,136), ASTAR-seq (K562, n = 192), sci-CAR (18) (HEK293T, 2,752, n = 812; A549, 2,419, n = 4,277), SNARE-seq (19) (GM12878, 346, n = 140; K562, 482, n = 200), and Paired-seq (20) (HEK293T, 628, n = 1,174; HepG2, 620, n = 1,141). Scatterplots show the correlation of read counts from two technical replicates of (D) RNA profiles and (E) ATAC fragment profiles. (F) Unique ATAC fragments and (G) RNA UMIs mapped to human or mouse genome. The species mixing experiment is performed on a mix of human (K562) and mouse (mESC V6.5) cell lines.