| Literature DB >> 31597106 |
Chi-Yeh Chung1, Zhibo Ma1, Christopher Dravis1, Sebastian Preissl2, Olivier Poirion2, Gidsela Luna1, Xiaomeng Hou2, Rajshekhar R Giraddi1, Bing Ren3, Geoffrey M Wahl4.
Abstract
Technological improvements enable single-cell epigenetic analyses of organ development. We reasoned that high-resolution single-cell chromatin accessibility mapping would provide needed insight into the epigenetic reprogramming and transcriptional regulators involved in normal mammary gland development. Here, we provide a single-cell resource of chromatin accessibility for murine mammary development from the peak of fetal mammary stem cell (fMaSC) functional activity in late embryogenesis to the differentiation of adult basal and luminal cells. We find that the chromatin landscape within individual cells predicts both gene accessibility and transcription factor activity. The ability of single-cell chromatin profiling to separate E18 fetal mammary cells into clusters exhibiting basal-like and luminal-like chromatin features is noteworthy. Such distinctions were not evident in analyses of droplet-based single-cell transcriptomic data. We present a web application as a scientific resource for facilitating future analyses of the gene regulatory networks involved in mammary development.Entities:
Keywords: ATAC-seq; chromatin profiling; differentiation trajectory; lineage relationships; mammary gland; mammary gland development; pseudotime ordering; single cell; snATAC-seq; transcription factor dynamics
Mesh:
Substances:
Year: 2019 PMID: 31597106 PMCID: PMC6887110 DOI: 10.1016/j.celrep.2019.08.089
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1.snATAC-Seq in Fetal and Adult Mammary Epithelial Cells
(A) Overview of snATAC-seq experimental strategy.
(B) Aggregate ATAC-seq profile (top) and single-nucleus ATAC-seq profile (bottom) of mammary cells. Reads from 500 randomly selected cells are plotted to represent the single-nucleus profile.
(C) t-SNE representation of the snATAC-seq profile generated from either distal regions (>±1.5 kb transcription start site [TSS]) or promoter regions (<±1.5 kb TSS). The plots shown were derived by combining two biological replicates.
(D) Single-cell ID score of major mammary cell types overlaid on the snATAC-seq profile. Cell cluster annotations are shown.
(E) Cellular composition of regions adult cell population derived from snATAC-seq data.
(F) Cellular composition of regions adult-like cell types in the fetal cell population.
See also Figures S1–S3 and Table S1.
Figure 2.Fetal Mammary Cells at E18 Show Epigenetic Features of Partial Lineage Specification
(A) Single-cell basal-to-luminal score overlaid on the snATAC t-SNE plot. Cell cluster identities are shown.
(B) Approach to generating an aggregate snATAC profile based on single-cell clustering.
(C) Signal tracks of the aggregate snATAC profile. Signal ranges are shown in the parentheses.
(D) PCA comparing the aggregate snATAC profile with bulk ATAC-seq of sorted mammary populations. Arrows indicate putative mammary differentiation paths.
See also Figure S4.
Figure 3.Single-Cell Transcription Factor Dynamics during Mammary Development
(A) TF Z score calculated from chromVar overlaid on the snATAC t-SNE plot.
(B) RNA-seq expression of TFs from (A). Mean ± SEM (n = 2).
(C) Computational framework to identify cell-state-predictive TFs.
(D) TF Z score profile in cell types for the 8 TF clusters, grouped into luminal progenitor, mature luminal, mixed basal/fetal, and other TFs. Individual (gray) and median (red) TF Z scores in cell type are shown. Examples of a TF family in a cluster are shown on the top.
(E and F) TF Z score (E) and RNA-seq expression (F) of identified mammary cell-state factors that are less known. Mean ± SEM (n = 2).
See also Figures S5 and S6 and Table S2.
Figure 4.Pseudotime Ordering of Single-Cell TF Profile Infers the Mammary Differentiation Trajectory
(A) Fetal (green) and adult (yellow) cells along the DDRTree pseudotime trajectory. The cell state associated with each branch is indicated.
(B) Representative TF Z score profile overlaid on the pseudotime trajectory plot.
See also Figure S7.
Figure 5.Putative cis-Regulatory Interactions and Gene Accessibility Derived from snATAC-Seq Data
(A and B) Linkage plots of Cicero-predicted cis-regulatory interactions at Sox10 (A) and Krt8 and Krt18 (B) loci (orange boxes indicate Sox10 promoter region or Krt8 and Krt18 putative enhancer region). Previously characterized Sox10 enhancers are shown in green. Signal tracks from aggregate snATAC-seq and H3K27ac ChIP-seq of fetal cells are shown. The height and opaqueness of the loop corresponds to the co-accessibility score between two linked elements. Dotted lines indicate the co-accessibility cutoff.
(C) Single-cell gene accessibility score of cell markers overlaid on the snATAC t-SNE plot. Cell cluster identities are annotated.
(D) RNA-seq expression of genes from (C) in mammary cell types. Mean ± SEM (n = 2).
See also Figure S8.
Figure 6.Identification and Analysis of Mammary Cell-Type-Specific Accessible Genes
(A) Computational framework to identify cell-type-specific open or closed genes.
(B) Boxplot of the gene accessibility score and RNA-seq expression of the top 300 accessible genes in fetal, basal, LP, and ML cells. Each dot is one gene, the thick horizontal middle line is the median, the height of the box is the interquartile range (IQR), and the dotted vertical line is 1.5 × IQR.
(C and D) Top 20 genes (C) and their representative accessibility profile (D) from (B).
(E) Gene Ontology (GO) analysis of the top 300 accessible genes from each cell type. The p values are Bonferroni corrected for multiple testing.
See also Figures S9–S12 and Table S3.
Figure 7.Integration of snATAC-Seq and scRNA-Seq Data
(A) UMAP representation of the co-embedded snATAC and single-cell RNA (scRNA) dataset. Cells from two assays were labeled with two colors as indicated in the plot.
(B) Split view of the UMAP representation of the co-embedded snATAC and scRNA dataset by assay techniques. Cell-type annotations obtained from previous independent snATAC-seq and scRNA-seq analysis were superimposed onto each cell. Each major cell type was represented in a different color.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| Alexa Fluor® 647 anti-mouse CD326 (Ep-CAM) | Biolegend | RRID:AB_1134101 |
| Biotin Rat Anti-Mouse TER-119/Erythroid Cells | BD Biosciences | Cat #: 553672; RRID:AB_394985 |
| Biotin Rat Anti-Mouse CD31 | BD Biosciences | Cat #: 553371; RRID:AB_394817 |
| Biotin Rat Anti-Mouse CD45 | BD Biosciences | Cat #: 553078; RRID:AB_394608 |
| Purified Rat Anti-Mouse CD16/CD32 (Mouse BD Fc Block) | BD Biosciences | Cat #: 553142; RRID:AB_394657 |
| APC Cy7 Streptavidin | Biolegend | Cat #: 405208 |
| Chemicals, Peptides, and Recombinant Proteins | ||
| Collagenase/Hyaluronidase | Stem Cell Technologies | Cat #: 07912 |
| Dispase | Stem Cell Technologies | Cat #: 07913 |
| DAPI | Thermo Fisher Scientific | Cat #: 62248 |
| IGEPAL-630 | Sigma | Cat #: I8896 |
| Digitonin | Promega | Cat #: G9441 |
| Tn5 | N/A | |
| DRAQ7 | Cell Signaling Technologies | Cat #: 7406 |
| Ammonium Chloride | Stem Cell Technologies | Cat #:07800 |
| Critical Commercial Assays | ||
| EpiCult-B Mouse Medium Kit | Stem Cell Technologies | Cat #: 05610 |
| Fetal Bovine Serum | Biowest | Cat #: S1620 |
| NEB Next High Fidelity 2× PCR Master Mix | New England BioLabs | Cat #: M0541 |
| MinElute PCR Purification Kit | QIAGEN | Cat #: 28004 |
| SPRI Beads | Beckman Coulter | Cat #: B23317 |
| Deposited Data | ||
| snATAC-seq FastQ files | This paper | GEO: GSE125523 |
| 10× scRNA-seq FastQ files | GEO:GSE111113 | |
| Bulk RNA-seq, ATAC-seq, and ChlP-seq sequencing files (bed, bigwig, and FastQ files) | GEO: GSE116386 | |
| Experimental Models: Organisms/Strains | ||
| CD1 mice | Charles River | Strain code: 022 |
| Software and Algorithms | ||
| snATAC bioinformatics pipeline | ||
| Sickle 1.33 | ||
| Bowtie2 | ||
| MACS2 | ||
| Samtools | ||
| Bedtools | ||
| UCSC genome browser | ||
| Sushi (R) | ||
| Deeptools | ||
| Rtsne v0.13 (R) | ||
| umap v0.2.0.0 (R) | McInnes et al., 2018 | |
| densityClust (R) | ||
| Rgl (R) | ||
| Barcode collisions identification | ||
| chromVAR | ||
| Caret (R) | ||
| ConsensusClusterPlus (R) | ||
| Monocle 2 (R) | ||
| igraph (R) | ||
| Cellranger v3.0.1 | 10x Genomics | |
| Seurat v2.3 (R) | ||
| Seurat v3.0.2 (R) | ||
| AUCell | ||
| Cicero (R) | ||
| Gviz (R) | ||
| ClueGO | ||
| Cytoscape v3.7.1 | ||
| shiny v1.3.2 (R) | ||
| shinydashboard v.0.7.1 (R) | ||
| R v.3.5 (Mac OS X and windows 10) | The R Project for Statistical Computing | |
| Basic snATAC-seq analysis scripts | This paper | |
| Other | ||
| Resource website for the snATAC-seq data | This paper | |