| Literature DB >> 31509757 |
Vincent P Schulz1, Hongxia Yan2, Kimberly Lezon-Geyda1, Xiuli An3, John Hale2, Christopher D Hillyer2, Narla Mohandas2, Patrick G Gallagher4.
Abstract
Mammalian erythropoiesis yields a highly specialized cell type, the mature erythrocyte, evolved to meet the organismal needs of increased oxygen-carrying capacity. To better understand the regulation of erythropoiesis, we performed genome-wide studies of chromatin accessibility, DNA methylation, and transcriptomics using a recently developed strategy to obtain highly purified populations of primary human erythroid cells. The integration of gene expression, DNA methylation, and chromatin state dynamics reveals that stage-specific gene regulation during erythropoiesis is a stepwise and hierarchical process involving many cis-regulatory elements. Erythroid-specific, nonpromoter sites of chromatin accessibility are linked to erythroid cell phenotypic variation and inherited disease. Comparative analyses of stage-specific chromatin accessibility indicate that there is limited early chromatin priming of erythroid genes during hematopoiesis. The epigenome of terminally differentiating erythroid cells defines a distinct subset of highly specialized cells that are vastly dissimilar from other hematopoietic and nonhematopoietic cell types. These epigenomic and transcriptome data are powerful tools to study human erythropoiesis.Entities:
Keywords: anemia; chromatin; epigenomic; erythrocyte; erythropoiesis; methylation; trait
Mesh:
Substances:
Year: 2019 PMID: 31509757 PMCID: PMC6863094 DOI: 10.1016/j.celrep.2019.08.020
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1.Regions of Open Chromatin Identified during Erythropoiesis
(A) ATAC peaks at the β-globin-like gene locus.
(B)ATAC peaks at the BCL11A gene locus.
(C)Bar chart representation of differential changes in open chromatin as identified by ATAC peaks by stages of erythropoiesis, with sites of acquired ATAC peaks shown in red and sites of lost ATAC peaks shown in blue.
(D)Distribution of ATAC peaks in human erythroid cells at differing stages of erythroid development and differentiation. The human genome was portioned into seven bins relative to RefSeq genes. The percentage of the human genome represented by each bin was color coded, and the distribution of ATAC peaks placed in each bin was graphed on the color-coded bar. The Genome Bar is a graphical representation of the relative proportions of the different location categories in the entire human genome.
TES, transcriptional end site; TSS, transcriptional start site.
Figure 2.Erythroid Cells Exhibit Unique Stage- and Cell Type-Specific Patterns of Nonpromoter Chromatin Accessibility
(A)Chromatin accessibility at the HK1 locus demonstrating erythroid-specific (red) ATAC peaks.
(B)Read count correlation of ATAC peaks in erythroid and nonerythroid hematopoietic cells.
(C)Heatmap of differential ATAC regions between erythroid cells and other hematopoietic cell types. The regions at the top far right, representing terminal erythroid differentiation, show high accessibility in erythroid cells, and the regions at bottom have low accessibility in erythroid cells compared to other hematopoietic cell types.
(D)GREAT (genomic regions enrichment of annotations tool) analysis of erythroid-specific regions of chromatin accessibility revealed significantly enriched terms for erythroid-related traits. The log10 binomial p value of each category is shown at right.
(E)Principal-component analysis of patterns of chromatin accessibility in erythroid, nonerythroid hematopoietic, and nonhematopoietic cells.
(F)Hierarchical cluster analysis shows terminally differentiating erythroid cells clustering separately from all of the other hematopoietic and nonhematopoietic cells (orange box, lower left corner).
Figure 3.Chromatin Accessibility and Gene Expression
Levels of gene expression determined by RNA-seq were correlated with the location of ATAC peaks in linked gene promoters, linked nonpromoter regions, both, or neither.
(A)Patterns of gene expression associated with ATAC peak presence or absence as described are shown in CFU-E, early basophilic erythroblasts, and orthochromatic erythroblasts.
(B)Levels of gene expression and gain or loss of ATAC peaks. In general, as gene loci acquired ATAC peaks, levels of gene expression increased. Shown here in the transitions from HSPC to BFU-E, BFU-E to CFU-E, and CFU-E to proerythroblast (ProE), the gene expression of linked genes that lost ATAC peaks (blue) is significantly lower than the expression of linked genes that gained ATAC peaks (red).
(C)SLC2A1 gene locus. Patterns of chromatin accessibility denoted by ATAC peaks (blue) are apparent in erythroid differentiation before the onset of detectable gene expression identified by RNA-seq (red).
Figure 4.Transcription Factor Activity Changes across Erythropoiesis
(A)The top regulatory protein-binding sites identified by the HOMER algorithm searching ±50 bp from summits of differential increasing stage to stage ATAC peaks. The top four motifs ranked by p value are shown for erythroid cell type transitions.
(B)Gene expression of critical erythroid transcription factors are shown for each stage of erythropoiesis. Transcriptome data are from An et al. (2014) and the present article.
(C)Protein expression of critical erythroid transcription factors are shown for each stage of erythropoiesis. Protein data are from Gautier et al. (2016).
Figure 5.Chromatin Accessibility Bookmarking and Gene Expression
Sites of chromatin accessibility identified in HSPC, BFU-E, CFU-E, or ProE cells, respectively, and all of the other subsequent erythroid stages were defined as exhibiting bookmarking.
(A and B) Examples of bookmarked regions of open chromatin in HSPCs in the HBS1L/MYB intergenic region (A) and the SOX6 locus (B) are shown. Both of these bookmarked sites contain SNPs (associated dbSNP identifiers) linked to genome-wide association study (GWAS)-associated erythroid cell phenotypic traits.
(C and D) Comparison of gene expression of bookmarked-linked genes (red) compared to control, non-bookmarked genes (blue) is shown in BFU-E (C) and in CFU-E (D) cells.
Figure 6.Dynamic Changes in Methylation across Erythropoiesis
(A)Boxplots of the percentage of methylation of 5-kb genomic windows show a gradual decrease in methylation.
(B)Dynamic changes in methylation in the regions of the RBM38 gene locus. Changes from DNA methylation (red, height indicates number of methylated reads) to demethylation (blue, height indicates number of unmethylated reads) are observed.
(C)Numbers of differentially methylated regions are shown in 5-kb tiled regions across the genome. Red indicates regions of gained methylation and blue indicates regions of lost methylation.
(D)Numbers of differentially methylated regions located in open chromatin.
(E)Heatmap display of methylation levels (%) in open chromatin regions with dynamic methylation.
(F)Dynamic changes in DNA methylation, alterations in chromatin configuration, and pattern of gene expression at the ZFPM1 gene locus.
Figure 7.Sites of Chromatin Accessibility Containing Genome-wide Association Study-Linked SNPs Linked to Erythroid Cell Traits
ATAC peaks were mapped onto the GWAS catalog of the National Human Genome Research Institute associated with erythroid cell terms.
(A and B) Examples of erythroid-specific, erythroid trait-associated SNPs in ATAC peaks bookmarked in CFU-E cells are shown at the RCL1 gene locus (A, two SNPs) and the PIEZO1 gene locus (B, one SNP).
(C) The x axis represents the 497 SNPs associated with erythroid cell traits ordered from right to left. The top of the heatmap (red) shows each ATAC peak that contains a GWAS erythroid-associated SNP by erythroid stage. The bottom half of the heatmap (blue) shows the trait associated with each ATAC peak-containing GWAS SNP. Many SNPs were bookmarked, and most were associated with more than one erythroid cell trait.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| PE-Cy7-conjugated mouse anti-human IL3-R | Thermofisher Scientific | Cat# 25-1239-42; RRID: AB_1257136 |
| APC-conjugated mouse anti-human Glycophorin A | BD Biosciences | Cat# 551336; RRID: AB_398499 |
| PE-conjugated mouse anti-human CD34 | BD Biosciences | Cat# 555822; RRID: AB_396151 |
| FITC-conjugated mouse anti-human CD36 | BD Biosciences | Cat# 555454; RRID: AB_2291112 |
| PE-conjugated mouse anti-human GPA | BD Biosciences | Cat# 555570; RRID: AB_395949 |
| APC-conjugated mouse anti-human α4-integrin | Miltenyi Biotec | Cat# 130-093-281; RRID: AB_1036218 |
| FITC-conjugated mouse anti-human band 3 | NY Blood Center | |
| 7-AAD | BD Biosciences | Cat# 559925 |
| Biological Samples | ||
| Human cord blood | New York Blood Center | |
| Chemicals, Peptides, and Recombinant Proteins | ||
| SFEM | StemCell Technologies | Cat# 09650 |
| FBS | GE Healthcare | Cat# S40110H |
| IL-3 | StemCell Technologies | Cat# 78040 |
| SCF | StemCell Technologies | Cat# 78062 |
| EPO | ThermoFisher Scientific | Cat# PHC2054 |
| α-thioglycerol | Sigma-Aldrich | Cat# M6145-25ML |
| human peripheral blood plasma | StemCell Technologies | Cat# 70039.5 |
| human type AB serum | Atlanta Biologicals | Cat# S40110H |
| heparin | StemCell technologies | Cat# 07980 |
| insulin | Sigma-Aldrich | Cat# I9278 |
| Holo-human transferrin | Sigma-Aldrich | Cat# T4132-1G |
| BSA | Sigma-Aldrich | Cat# A7030-100G |
| May-Grunwald stain | Sigma-Aldrich | Cat# MG128-4L |
| Giemsa stain | Sigma-Aldrich | Cat# GS1L-1L |
| MethoCult H4434 Classic | StemCell Technologies | Cat# 04434 |
| MethoCult H4330 | StemCell Technologies | Cat# 04330 |
| MspI restriction enzyme | New England Biolabs | Cat# R0106T |
| T4 DNA polymerase | New England Biolabs | Cat# M0203S |
| T4 DNA ligase reaction buffer | New England Biolabs | Cat# B0202S |
| DNA polymerase I, Large (Klenow) fragment | New England Biolabs | Cat# M0210S |
| T4 polynucleotide kinase | New England Biolabs | Cat# M0201S |
| Deoxynucleotide (dNTP) solution set | New England Biolabs | Cat# N0446S |
| dATP solution | New England Biolabs | Cat# N0440S |
| NE Buffer 2 | New England Biolabs | Cat# B7002S |
| T4 DNA ligase | New England Biolabs | Cat# M0202T |
| Critical Commercial Assays | ||
| Gentra Puregene Blood Kit | QIAGEN | Cat# 158445 |
| QIAquick PCR Purification Kit | QIAGEN | Cat# 28104 |
| MinElute PCR Purficiation Kit | QIAGEN | Cat# 28004 |
| QIAQuick gel extraction set | QIAGEN | Cat# 28115 |
| EZ DNA Methylation Kit | Zymo Research | Cat# D5001 |
| FastStart High Fidelity PCR System | Roche | Cat# 3553400001 |
| AMPure XP for PCR Purification | Beckman Coulter | Cat# A63881 |
| Quant-It dsDNA Assay Kit, High Sensitivity | Invitrogen | Cat# Q33120 |
| Transposase (TDE1) Nextera DNA Sample Prep Kit | Illumina | Cat# FC-131-1024 |
| KAPA HiFi HotStart ReadyMix (2x) | Kapa Biosystems | Cat#KK2602 |
| AMPureXP | Beckman Coulter | Cat#A63880 |
| Omega EZNA size select kit | Omega | Cat#D6488-01 |
| Deposited Data | ||
| Raw and analyzed ATAC data | This paper | GEO: GSE128266 |
| Raw and analyzed methylation data | This paper | GEO: GSE128267 |
| Raw and analyzed RNA-seq data | This paper | GEO: GSE128268 |
| Human dendritic cell: ATAC-seq | NCBI GEO | GEO: GSM1565735 |
| Human corneal epithelial cell: ATAC-seq | NCBI GEO | GEO: GSM1662255 |
| Human fibroblast cell: ATAC-seq | NCBI GEO | GEO: GSM1662258 |
| Human cardiac smooth muscle cell: ATAC-seq | NCBI GEO | GEO: GSM1876021 |
| Human LMPP cell: ATAC-seq | NCBI GEO | GEO: GSM1937378 |
| Human bone marrow CD34+ cell: ATAC-seq | NCBI GEO | GEO: GSM1937399 |
| Human umbilical cord CD34+ cell” ATAC-seq | NCBI GEO | GEO: GSM1937401 |
| Human CD4 T lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM1937406 |
| Human CD8 T lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM1937407 |
| Human CLP: ATAC-seq | NCBI GEO | GEO: GSM1937408 |
| Human CMP: ATAC-seq | NCBI GEO | GEO: GSM1937410 |
| Human GMP: ATAC-seq | NCBI GEO | GEO: GSM1937415 |
| Human HSC: ATAC-seq | NCBI GEO | GEO: GSM1937416 |
| Human MEP: ATAC-seq | NCBI GEO | GEO: GSM1937418 |
| Human MPP: ATAC-seq | NCBI GEO | GEO: GSM19374119 |
| Human MPP: ATAC-seq | NCBI GEO | GEO: GSM1937419 |
| Human NK cell: ATAC-seq | NCBI GEO | GEO: GSM1937421 |
| Human B lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM1937423 |
| Human pancreatic alpha cell: ATAC-seq | NCBI GEO | GEO: GSM1978243 |
| Human pancreatic beta cell: ATAC-seq | NCBI GEO | GEO: GSM1978246 |
| Human pancreatic Ac: ATAC-seq | NCBI GEO | GEO: GSM1978249 |
| Human human keratinocyte: ATAC-seq | NCBI GEO | GEO: GSM2035768 |
| Human neuron: ATAC-seq | NCBI GEO | GEO: GSM2199916 |
| Human non-neuron: ATAC-seq | NCBI GEO | GEO: GSM2199924 |
| Human HUVEC: ATAC-seq | NCBI GEO | GEO: GSM2202931 |
| Human cardiac mesoderm: ATAC-seq | NCBI GEO | GEO: GSM2257299 |
| Human CD8 naive lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM2365846 |
| Human CD8 effector lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM2365849 |
| Human CD8 memory lymphocyte: ATAC-seq | NCBI GEO | GEO: GSM2365852 |
| Human: aortic endothelial cell: ATAC-seq | NCBI GEO | GEO: GSM2394391 |
| Human HSPC H3K27me3: ChIP-seq | NCBI GEO | GEO: GSM1816077 |
| Human mixed proerythroblast H3K27me3: ChIP-seq | NCBI GEO | GEO: GSM1816078 |
| Human HSPC H3K4me1: ChIP-seq | NCBI GEO | GEO: GSM1816066 |
| Human mixed proerythroblast H3K4me1: ChIP-seq | NCBI GEO | GEO: GSM1816067 |
| Human HSPC H3K27Ac: ChIP-seq | NCBI GEO | GEO: GSM1816073 |
| Human mixed proerythroblast H3K27Ac: ChIP-seq | NCBI GEO | GEO: GSM1816074 |
| Human mixed proerythroblast TAL1: ChIP-seq | NCBI GEO | GEO: GSM1816083 |
| Human mixed proerythroblast GATA1: ChIP-seq | NCBI GEO | GEO: GSM970257 |
| Human mixed proerythroblast NFE2: ChIP-seq | NCBI GEO | GEO: GSM1816086 |
| Human HSPC PU1: ChIP-seq | NCBI GEO | GEO: GSM1816089 |
| Human HSPC RUNX1: ChIP-seq | NCBI GEO | GEO: GSM1816091 |
| Human HSPC CTFC 1: ChIP-seq | NCBI GEO | GEO: GSM1655735 |
| Human HSPC SA1 1: ChIP-seq | NCBI GEO | GEO: GSM1655736 |
| Human Erythroid CTFC 1: ChIP-seq | NCBI GEO | GEO: GSM1655739 |
| Human Erythroid SA1 1: ChIP-seq | NCBI GEO | GEO: GSM1655740 |
| Human Proerythroblast 1: RNA-seq | NCBI GEO | GEO: GSM1304777 |
| Human Proerythroblast 2: RNA-seq | NCBI GEO | GEO: GSM1304778 |
| Human Proerythroblast 3: RNA-seq | NCBI GEO | GEO: GSM1304779 |
| Human Early Basophilic 1: RNA-seq | NCBI GEO | GEO: GSM1304780 |
| Human Early Basophilic 2: RNA-seq | NCBI GEO | GEO: GSM1304781 |
| Human Early Basophilic 3: RNA-seq | NCBI GEO | GEO: GSM1304782 |
| Human Late Basophilic 1: RNA-seq | NCBI GEO | GEO: GSM1304783 |
| Human Late Basophilic 2: RNA-seq | NCBI GEO | GEO: GSM1304784 |
| Human Late Basophilic 3: RNA-seq | NCBI GEO | GEO: GSM1304785 |
| Human Polychromatic 1: RNA-seq | NCBI GEO | GEO: GSM1304786 |
| Human Polychromatic 2: RNA-seq | NCBI GEO | GEO: GSM1304787 |
| Human Polychromatic 3: RNA-seq | NCBI GEO | GEO: GSM1304788 |
| Human Orthochromatic 1: RNA-seq | NCBI GEO | GEO: GSM1304789 |
| Human Orthochromatic 2: RNA-seq | NCBI GEO | GEO: GSM1304790 |
| Human Orthochromatic 3: RNA-seq | NCBI GEO | GEO: GSM1304791 |
| Oligonucleotides | ||
| pGATcGGAAGAGcGGTTcAGcAGGAATGccGAG | PE1 | |
| AcAcTcTTTcccTAcAcGAcGcTcTTccGATcxT | PE2 | |
| AATGATACGGCGACCACCGAGATCTACACTTTCCCTACACGACGCTCTTCCGATCxT | PE1.0 | |
| CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCxT | PE 2.0 | |
| Software and Algorithms | ||
| EdgeR | ||
| Trimmomatic version 0.32 | ||
| Goseq | ||
| Burrows-Wheeler Aligner version 0.7.9a | ||
| MACS2 program version 2.1.1.20160309 | ||
| DiffBind version 2.4.8 | ||
| GenomicRanges | ||
| Chipseeker version 1.12.1 | ||
| GREAT | ||
| HOMER | ||
| PhastCons | ||
| Trim Galore v0.4.0 | Babraham Bioinformatics | |
| Bismark | ||
| MethylKit | ||
| PLINK |