| Literature DB >> 34880271 |
Winnie W I Hui1, Angela Simeone1, Katherine G Zyner1, David Tannahill1, Shankar Balasubramanian2,3,4.
Abstract
G-quadruplexes (G4s) are four-stranded DNA secondary structures that form in guanine-rich regions of the genome. G4s have important roles in transcription and replication and have been implicated in genome instability and cancer. Thus far most work has profiled the G4 landscape in an ensemble of cell populations, therefore it is critical to explore the structure-function relationship of G4s in individual cells to enable detailed mechanistic insights into G4 function. With standard ChIP-seq methods it has not been possible to determine if G4 formation at a given genomic locus is variable between individual cells across a population. For the first time, we demonstrate the mapping of a DNA secondary structure at single-cell resolution. We have adapted single-nuclei (sn) CUT&Tag to allow the detection of G4s in single cells of human cancer cell lines. With snG4-CUT&Tag, we can distinguish cellular identity from a mixed cell-type population solely based on G4 features within individual cells. Our methodology now enables genomic investigations on cell-to-cell variation of a DNA secondary structure that were previously not possible.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34880271 PMCID: PMC8654944 DOI: 10.1038/s41598-021-02943-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Characterisation of bulk G4-CUT&Tag. (a) Schematic diagram of G4-CUT&Tag workflow. Fixed permeabilised cells are incubated with the G4 structure-specific BG4 single-chain variable fragment (scFv), followed the secondary antibodies, and then adapter-loaded proteinA-Tn5 to enable the integration of adapters at G4 target loci for next-generation sequencing library preparation. (b) Example genome browser view of G4-CUT&Tag signals and G4 peaks called obtained from 100,000 K562 (red) and 100,000 U2OS (blue) cells with comparisons to published G4-ChIP-seq data[21] (pink, light blue respectively), and sites that fold into G4 structures in vitro (called observed quadruplex sequences OQs, orange)[20] Light grey shading highlights G4 peaks in the MYC locus. Gene annotations are shown in black below. (c) Hierarchical clustering of the Spearman correlation matrix for G4-CUT&Tag replicates. G4-CUT&Tag was performed on three biological replicates (b1, b2, b3) with two technical replicates (t1, t2) for 100,000 K562 and 100,000 U2OS cells. Spearman correlations between samples were computed at G4 peaks on read coverage normalised to library size. (d) Venn diagram showing the number of G4 peaks and their overlap from 100,000 (100 k), 50,000 (50 k) and 10,000 (10 k) K562 cells.
Figure 2Characterisation of single-nuclei G4-CUT&Tag. (a) Schematic diagram of single-nuclei G4 CUT&Tag (snG4-CUT&Tag) workflow. In fixed nuclei, G4-CUT&Tag is used to integrate adapters at G4 sites. Single intact nuclei are then partitioned with barcoded gel beads to index tagmented fragments from individual nuclei using a 10X Genomics Chromium platform with Next GEM Single Cell ATAC Reagents Kits (see “Methods”). (b) Two example G4 landscapes in the human genome generated by G4-CUT&Tag. Blue genomic tracks: normalised read coverages of bulk G4-CUT&Tag libraries (50,000 MCF7 cells and 100,000 U2OS cells); red genomic tracks: normalised signals from aggregated snG4-CUT&Tag libraries; dark red tiles: top 50 single nuclei based on the total number of fragments in the genomic region; pink: Cell Ranger peaks called; black: gene annotations. (c) Clustering of snG4-CUT&Tag data reveals two groups in mixed U2OS and MCF7 sample. t-SNE plot showing dimensionality reduced snG4-CUT&Tag data from mixed cell lines with 671 imputed U2OS (blue) and 467 imputed MCF7 (red) cells. (d) Graphical representation of G4 distribution across promoter G4 peaks in single cells. As in (c) for each t-SNE plot, the top cluster represents individual MCF7 cells while the bottom cluster represents individual U2OS cells. Each dot shows example data covering single U2OS or MCF7 cells for the indicated genes. Grey colouring indicates that no G4 is detected in that cell for Cell Ranger-called promoter peak(s) (1000 bases upstream or 100 bases downstream from the TSS) of the specified genes while orange to red shading quantifies the number of G4s (i.e. pA-Tn5 cut sites) detected in a single cell at the promoter peak(s) of the specified gene. Two example genes (CNGA4 and ALDH6A1) are shown where the G4 promoter peaks are observed in common in the two cell lines, and two example genes (CAPN5 and HMGN1) are shown in which the with G4 promoter peaks specific to only one cell line.