| Literature DB >> 30650367 |
Nathan Lawlor1, Eladio J Márquez1, Peter Orchard2, Narisu Narisu3, Muhammad Saad Shamim4, Asa Thibodeau1, Arushi Varshney5, Romy Kursawe1, Michael R Erdos3, Matt Kanke6, Huiya Gu7, Evgenia Pak3, Amalia Dutra3, Sheikh Russell8, Xingwang Li1, Emaly Piecuch9, Oscar Luo1, Peter S Chines3, Christian Fuchbserger10, Praveen Sethupathy6, Aviva Presser Aiden11, Yijun Ruan1, Erez Lieberman Aiden12, Francis S Collins3, Duygu Ucar13, Stephen C J Parker14, Michael L Stitzel15.
Abstract
EndoC-βH1 is emerging as a critical human β cell model to study the genetic and environmental etiologies of β cell (dys)function and diabetes. Comprehensive knowledge of its molecular landscape is lacking, yet required, for effective use of this model. Here, we report chromosomal (spectral karyotyping), genetic (genotyping), epigenomic (ChIP-seq and ATAC-seq), chromatin interaction (Hi-C and Pol2 ChIA-PET), and transcriptomic (RNA-seq and miRNA-seq) maps of EndoC-βH1. Analyses of these maps define known (e.g., PDX1 and ISL1) and putative (e.g., PCSK1 and mir-375) β cell-specific transcriptional cis-regulatory networks and identify allelic effects on cis-regulatory element use. Importantly, comparison with maps generated in primary human islets and/or β cells indicates preservation of chromatin looping but also highlights chromosomal aberrations and fetal genomic signatures in EndoC-βH1. Together, these maps, and a web application we created for their exploration, provide important tools for the design of experiments to probe and manipulate the genetic programs governing β cell identity and (dys)function in diabetes.Entities:
Keywords: (epi)genome; EndoC-βH1; Hi-C; Pol2 ChIA-PET; genetics; human pancreatic islets; karyotype; transcriptome; type 2 diabetes; β cell
Mesh:
Year: 2019 PMID: 30650367 PMCID: PMC6389269 DOI: 10.1016/j.celrep.2018.12.083
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1.Extensive Karyotyping and Genotyping of EndoC-βH1
(A) SKY of EndoC-βH1 for a representative metaphase.
(B) Summary of the frequency of chromosomal abnormalities across 14 metaphases. Black boxes indicate the presence of an event, while white boxes indicate an absence.
(C) Bar plots highlighting the risk allele burden of NHGRI-EBI GWAS Catalog diabetes-associated GWAS loci in EndoC-βH1. T1D, type 1 diabetes; T2D, type 2 diabetes. Glucose traits include fasting plasma-glucose- and fasting glucose-related traits interacting with BMI from the NHGRI-EBI GWAS catalog (MacArthur et al., 2017). Insulin traits include proinsulin and fasting insulin traits interacting with BMI.
(D) Chromosome cartoons illustrating EndoC-βH1 genotypes and the reported locus at glucose trait GWAS SNPs. Cases in which independent association signals mapped to the same locus are indicated by the locus name followed by parentheses containing numbers of SNPs with each risk genotype. Chromosomes 10 and 20 are marked with asterisks to indicate that the previously observed copy-number alterations (illustrated in Figure 1B) may obfuscate interpretation of variant genotypes on these chromosomes.
See also Figure S1 and Table S1.
Figure 2.Multiomic Comparative Analysis of EndoC-βH1 and Human Pancreatic Islets
(A) Integrated view of the EndoC-βH1 and human islet (epi)genomic and transcriptomic features surrounding the PCSK1 locus on chromosome 5. Histone modification ChIP-seq data from EndoC-βH1, human islets, and five Epigenome Roadmap cell types and/or tissues (Roadmap Epigenomics Consortium et al., 2015) were jointly analyzed to determine ChromHMM-based chromatin states in a uniform manner.
(B) Spearman correlation between EndoC-βH1 ATAC-seq profiles and their corresponding profiles from islets, sorted α or β cells, and other cell types and tissues (STAR Methods). α, primary islet α cells; β, primary islet β cells; CD4T, CD4+ T immune cell; GM12878, B-lymphoblast cell line; skeletal, skeletal muscle; PBMC, peripheral blood mononuclear cells. EndoC-βH1 exhibits greatest similarity to islets and their cellular constituents.
(C) Heatmap illustrating Z scores of HOMER enrichment p values for TF motifs in cell-type-specific OCRs.
(D) Comparison of chromatin states between EndoC-βH1 and human islets. Blue box highlights putative enhancer cis-REs in both EndoC-βH1 and human islets; orange box indicates putative EndoC-βH1 enhancers that are repressed in islets.
(E) TF motifs enriched in genomic regions containing putative enhancer cis-REs in both EndoC-βH1 and islets (blue) or EndoC-βH1 only (orange). Points in gray denote TFs that are not enriched in either category.
See also Figure S2 and Table S3.
Top 25 Expressed miRNAs in EndoC-βH1 Cells
| miRNA | EndoC-βH1 Rank | EndoC-βH1 RPMMM | β Cell Enrichment / Depletion |
|---|---|---|---|
| hsa-miR-375 | 1 | 162076.925 | 2.84 |
| hsa-miR-127–3p | 2 | 90512.35 | - |
| hsa-miR-27b-3p | 3 | 85043.195 | 3.09 |
| hsa-miR-192–5p | 4 | 70126.5525 | 2.01 |
| hsa-miR-192–5p_+_1 | 5 | 37621.0275 | - |
| hsa-miR-182–5p | 6 | 36662.2475 | 2.15 |
| hsa-miR-22–3p | 7 | 29623.8 | 2.87 |
| hsa-miR-191–5p | 8 | 25424.1575 | 3.02 |
| hsa-miR-26a-2–5p | 9 | 15578.9175 | 2.94 |
| hsa-miR-26a-1–5p | 10 | 15496.7075 | 2.94 |
| hsa-miR-141–3p | 11 | 13656.6825 | 2.84 |
| hsa-miR-92a-1–3p | 12 | 13615.6425 | 2.30 |
| hsa-miR-30d-5p | 13 | 11108.9275 | 0.75 |
| hsa-miR-654–3p | 14 | 11107.385 | - |
| hsa-miR-148a-3p | 15 | 10651.885 | 2.22 |
| hsa-miR-200b-3p | 16 | 10266.2575 | 1.48 |
| hsa-miR-92a-2–3p | 17 | 10161.165 | 2.30 |
| hsa-miR-381 | 18 | 9342.8875 | 1.08 |
| hsa-miR-25–3p | 19 | 9230.2225 | 1.98 |
| hsa-miR-181a-1–5p | 20 | 8702.47 | 1.81 |
| hsa-miR-181a-2–5p | 21 | 8702.3125 | 1.81 |
| hsa-miR-21–5p | 22 | 8439.755 | 0.21 |
| hsa-miR-183–5p | 23 | 8338.08 | 1.58 |
| hsa-miR-92b-3p | 24 | 7531.295 | 4.63 |
| hsa-miR-125a-5p | 25 | 6743.8375 | 1.24 |
Table consisting of the top 25 expressed miRNAs in EndoC-βH1 cells. Each miRNA rank was determined by its corresponding expression level (reads per million mapped miRNA [RPMMM]). β cell enrichment / depletion was determined by dividing the counts of each miRNA in β cells (from van de Bunt et al., 2013) by the counts in islets.
Figure 3.Generating a Genome-wide Map of Looping in EndoC-βH1 and Human Pancreatic Islets (Hi-C)
(A) Aggregate peak analysis (APA) plots showing the total signal across all loops (top three panels) and EndoC-βH1-specific loops (bottom three panels) in EndoC-βH1 (left), human islet (center), and GM12878 (right) cells. Of note, islets exhibit similar contact point enrichments at EndoC-βH1-specific peaks compared to GM12878.
(B) Cartoon illustrating the different classes of Hi-C loops between example common (gray peaks) or cell-specific (black peaks) ATAC-seq OCRs for two different theoretical cell types.
(C) TF motifs enriched in GM12878 (blue) or EndoC-βH1 (red) Hi-C looping anchors that overlap cell-specific ATAC-seq peaks loop classes B and C in panel B above).
(D) Hi-C contact maps highlighting a specific loop at the SLC30A8 locus (denoted by dotted black circle) observed in both EndoC-βH1 (left) and primary human islets (center) but absent in GM12878 (right).
(E) Multiomics view of Hi-C, ChIA-PET (Pol2), chromatin states, ATAC-seq, RNA-seq, and gene tracks at the SLC30A8 neighborhood containing the Hi-C contact point highlighted in (D). Tracks corresponding to EndoC-βH1, human islet, and GM12878 are colored red, black, and blue, respectively. Dark blue boxes below each gene name represent the reference transcript annotations derived from Gencode v19. The red arrow at the bottom of the image indicates the putative EndoC-βH1- and islet-specific promoter for SLC30A8. The black arrow indicates the putative embryonic stem cell and K562 cell-specific promoter for SLC30A8 (Roadmap Epigenomics Consortium et al., 2015).
See also Figure S3 and Table S4.
Figure 4.RNA Polymerase 2 ChIA-PET Identifies Chromatin Interactions in EndoC-βH1
(A) Heatmap showing the chromatin states of EndoC-βH1 ChIA-PET interaction nodes.
(B) Example of a Pol2 ChIA-PET interaction between active enhancer (blue box) and active promoter (green box) cis-REs in the ISL1 locus on chromosome 5. Asterisks under EndoC-βH1 ChIA-PET interactions (red) indicate interacting sites in the ISL1 locus detected in human islet 4C-seq analyses (Pasquali et al., 2014).
(C) ChIA-PET network connectivity of gene promoters in EndoC-βH1 containing at least three interactions with other regulatory elements. For each gene, the number of connections between other regulatory elements (e.g., active enhancer and weak enhancer) and the proportion of total links in which the chromatin states are EndoC-βH1-specific (blue) or identical in both human islet and EndoC-βH1 (green) are shown in bar plots on the right. The remaining proportions that are neither EndoC-βH1 specific nor common to islet and EndoC-βH1 are not shown. Red font denotes loci containing genes crucial for β cell identity and development.
(D) Top: Bar plot illustrating the proportions of chromatin states at the Pol2 ChIA-PET interacting sites (nodes) shared between EndoC-βH1, islets, and additional Epigenomics Roadmap tissues and cell lines. Bottom: Heatmap demonstrating the chromatin states of EndoC-βH1 Pol2 ChIA-PET interacting sites (nodes) in islets (left) or stomach smooth muscle (right).
See also Figure S4.
Figure 5.Allelic Effects on EndoC-βH1 Transcriptional Regulatory Features
(A) EndoC-βH1 genotype information was integrated with ATAC-seq, H3K27ac, and RNA-seq data to identify sequence variants altering cis-RE accessibility and/or activity (ATAC-seq and H3K27ac) or mRNA levels (RNA-seq) in EndoC-βH1. Pie charts summarize the proportions of variants exhibit significant AIs (blue; FDR < 10%) in each of the corresponding sequencing profiles.
(B) Cartoon representation of approach to identify systematic allelic effects on EndoC-βH1 cis-regulatory networks.
(C) Multiomic view highlighting allelic effects on the SAMD5 locus cis-regulatory network in EndoC-βH1. A variant site exhibiting significant AI in H3K27 acetylation (denoted by blue arrow) is linked (red ChIA-PET interaction) to the transcription start site (TSS) of SAMD5. Within the SAMD5 locus, five transcribed SNPs exhibited significant allelic bias in gene expression (RNA-seq) in a direction consistent with the H3K27ac allelic bias.
(D) Left: Bar plots summarizing the proportions of variants with ATAC-seq/H3K27ac imbalance (blue bars; FDR < 10%) that overlap ChIA-PET interacting loci. Right: Pie charts specifying the chromatin state (ChromHMM) annotations of the overlapping variants.
See also Figure S5 and Table S5.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| H3K27ac-human antibody | Abeam | Abeam Cat# ab4729; RRID:AB_2118291 |
| H3K4me3-mouse antibody | Abeam | Abeam Cat# ab12209; RRID:AB_442957 |
| ENCODE Project Antibody validation H3K4me1 | Abeam | Abeam Cat# ab8895; RRID:AB_306847 |
| ENCODE Project Antibody validation H3K36me3 | Abeam | Abeam Cat# ab9050; RRID:AB_306966 |
| H3K27me3-celegans antibody | Abeam | Abeam Cat# ab6002; RRID:AB_305237 |
| H3K79me2-human antibody | Abeam | Abeam Cat# ab3594; RRID:AB_303937 |
| H3K9me3-human,H3K9me3-mouse antibody | Abeam | Abeam Cat# ab8898; RRID:AB_306848 |
| Rabbit Anti-CTCF Polyclonal Antibody, Unconjugated | Abeam | Abeam Cat# ab70303; RRID:AB_1209546 |
| RNA Polymerase II 8WG16 Monoclonal Antibody, anti-RNA Polymerase II | Covance Research Products | Covance Research Products Cat# MMS-126R; RRID:AB_10013665 |
| Biological Samples | ||
| Human pancreatic islet | NDRI | UNOS ID: ADAC418 |
| Critical Commercial Assays | ||
| SKY Probe | Applied Spectral Imaging INC | Cat# FPRPR0028 |
| HumanOmni2.5–4v1_H BeadChip Array | Illumina | Cat# WG-311–2514 |
| TruSeq ChIP Sample Prep Kit | Illumina | Cat# IP-202–1024 |
| TruSeq Stranded mRNA Kit | Illumina | Cat# 20020595 |
| TruSeq Small RNA Library Kit | Illumina | Cat# RS-200–0036 |
| Deposited Data | ||
| Raw and analyzed data | This paper | NCBI BioProject: PRJNA480287; European Variation Archive: ERZ674947; NCBI GEO:GSE118588 |
| Islet ATAC-seq and RNA-seq | NCBI SRA:SRP117935 | |
| Purified beta and alpha cell ATAC-seq and RNA-seq | NCBI GEO: GSE76268 | |
| Peripheral mononuclear blood cells (PBMC) ATAC-seq | European Genome-phenome Archive: EGAS00001002605 | |
| PBMC RNA-seq | NCBI GEO: GSE90275 | |
| Skeletal muscle ATAC-seq | dbGap: phs001068.v1.p1 | |
| Skeletal muscle RNA-seq | NCBI GEO: GSE78611 | |
| GM12878 and CD4+ T ATAC-seq | NCBI GEO: GSE47753 | |
| GM12878 RNA-seq | NCBI GEO: GSE30400 | |
| CD4+ T RNA-seq | NCBI GEO: GSE18927 | |
| Adipocyte RNA-seq | NCBI GEO: GSE93486 | |
| GM12878Hi-C | NCBI GEO: GSE63525 | |
| K562 RNA Pol2 ChlA-PET | NCBI GEO: GSE39495 | |
| GM12878 RNA Pol2 ChlA-PET | NCBI GEO: GSE72816 | |
| Experimental Models: Cell Lines | ||
| EndoC-betaH1: Cell line | EndoCells/INSERM | RRID: CVCL_L909 |
| Software and Algorithms | ||
| Michigan Imputation Server | ||
| Eagle v2.3 | ||
| SHAPEIT v2.r790 | ||
| minimac3 | ||
| PLINK version 1.9 | ||
| Trimmomatic version 0.33 | ||
| BWA version 0.7.12 | ||
| Samtools version 1.3.1 | ||
| Picard-tools version 1.95 | ||
| Qualimap version 2.2.1 | ||
| MACS version 2.1.0 | ||
| BEDTools version 2.26.0 | ||
| deepTools version 2.4.2 | ||
| ChromHMM version 1.17 | ||
| HOMER version 4.6 | ||
| STAR version 2.53 | ||
| QoRTs version 1.2.42 | ||
| miRquant 2.0 | ||
| Juicer Tools version 1.75 | ||
| Juicebox version 1.6.11 | ||
| GREAT version 3.0 | ||
| ChlA-PET2 version 0.9.2 | ||
| WASP version 0.2.2 | ||
| Other | ||
| Haplotype Reference Consortium (HRC version, hrc.r1.1.2016) panel | ||
| NHGRI-EBI Catalog of SNPs | N/A | |
| R Shiny application for browsing human islet and EndoC-βH1 genomic data | This paper | |