| Literature DB >> 31399598 |
Helen E Speedy1, Renée Beekman2,3, Vicente Chapaprieta4, Giulia Orlando1, Philip J Law1, David Martín-García2,3, Jesús Gutiérrez-Abril5, Daniel Catovsky1, Sílvia Beà2,3, Guillem Clot2,3, Montserrat Puiggròs6, David Torrents6,7, Xose S Puente3,5, James M Allan8, Carlos López-Otín3,5, Elias Campo2,3,4,9, Richard S Houlston10, José I Martín-Subero11,12,13,14.
Abstract
Genome-wide association studies have provided evidence for inherited genetic predisposition to chronic lymphocytic leukemia (CLL). To gain insight into the mechanisms underlying CLL risk we analyze chromatin accessibility, active regulatory elements marked by H3K27ac, and DNA methylation at 42 risk loci in up to 486 primary CLLs. We identify that risk loci are significantly enriched for active chromatin in CLL with evidence of being CLL-specific or differentially regulated in normal B-cell development. We then use in situ promoter capture Hi-C, in conjunction with gene expression data to reveal likely target genes of the risk loci. Candidate target genes are enriched for pathways related to B-cell development such as MYC and BCL2 signalling. At 14 loci the analysis highlights 63 variants as the probable functional basis of CLL risk. By integrating genetic and epigenetic information our analysis reveals novel insights into the relationship between inherited predisposition and the regulatory chromatin landscape of CLL.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31399598 PMCID: PMC6689100 DOI: 10.1038/s41467-019-11582-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Linking chronic lymphocytic leukemia (CLL) risk loci to chromatin states. a Top panel: graphical representation of a risk locus with a sentinel single-nucleotide polymorphism (SNP; representing the SNP with the strongest association in relation to CLL development) and its linkage disequilibrium (LD) region (shaded blue), containing all the SNPs that are within LD0.2 (LD with r2 ≥ 0.2) with the sentinel SNP. Lower panel: frequency of chromatin states in CLL (n = 7 biologically independent samples) at the exact position of the sentinel SNPs of the 42 CLL risk loci and in their LD regions (LD0.2 regions), as well as the median log2(fold change) of the different chromatin states in CLL in the LD0.2 regions. On the left, the sentinel SNPs are indicated, and on the right the different chromatin state subgroups (groups 1–5) of the risk loci, as referred to in the main text, as well as the size of the LD0.2 regions in kilobases. b Boxplots of fraction of CLL (n = 42 independent regions), colorectal (CRC, n = 75 independent regions), and breast cancer (BC, n = 165 regions independent regions) risk loci enriched for active regulatory elements in CLL cases (n = 7 biologically independent samples). Mean, median, minimum and maximum fraction of regions, and number of data points for CLL: 0.81, 0.81, 0.74, 0.86, 7; for CRC: 0.52, 0.53, 0.45, 0.59, 7; and for BC: 0.53, 0.52, 0.50, 0.56, 7. Corrected P value, test statistics (W), median difference, and 95% confidence interval for CLL vs. CRC: 6.4 × 10−3, 49, 0.28, 0.23–0.34, for CLL vs. BC: 6.4 × 10−3, 49, 0.29, 0.23–0.33, and for CRC vs. BC: 1.0, 24, 0.01, −0.03 to 0.05. P values were calculated using Wilcoxon rank-sum test (two-sided) and corrected using the Bonferroni multiple testing correction. **P-value < 1.0 × 10−2, n.s. non significant. c Upper panel: Distribution of non-individual H3K27ac peaks with specific activation patterns in CLL as compared to normal B cells. Lower panel: Mean H3K27ac signal in CLL and normal B cells in all non-individual H3K27ac peaks of the selected risk loci. On the right, the different H3K27ac state subgroups are indicated, using the same color codes as the upper panel. Sample sizes were for CLL: n = 7 biologically independent samples and for NBC-PB, NBC-T, GCBC, MBC, and PC-T: n = 3 biologically independent samples. d Representation of chromatin states in seven CLL cases and normal B cells (one representative samples for each subpopulation) in the LD0.2 regions of the sentinel SNPs rs11637565 (left panel) and rs35923643 (right panel). The black arrows represent the sentinel SNPs, the red arrows indicate SNPs that are in LD0.2 with the sentinel SNP that are located in active regulatory elements (ActProm, StrEnh1, StrEn2) in at least one CLL case. Red arrows may point to clusters of SNPs in close proximity to each other. Genomic regions represented (GRCh38) are chr15:69,671,700–69,756,772 (left panel) and chr11:123,470,881–123,525,966 (right panel). ActProm active promoter, WkProm weak promoter, PoisProm poised promoter, StrEnh1 strong enhancer 1, StrEnh2 strong enhancer 2, WkEnh weak enhancer, Txn_Trans transcription transition, Txn_Elong transcription elongation, Wk_Txn weak transcription, H3K9me3_Repr H3K9me3 repressed, H3K27me3_Repr H3K27me3 repressed, Het;LowSign heterochromatin;low signal, NBC-PB naive B cell from peripheral blood, NBC-T naive B cell from tonsil, GCBC germinal center B cell, MBC memory B cell, PC-T plasma cell from tonsil
Fig. 2Quantitative trait locus (QTL) analysis and results overview. a Workflow describing the generation of genotypes for 42 non-HLA risk single-nucleotide polymorphisms (SNPs) in 487 chronic lymphocytic leukemia (CLL) patients for use in QTL analyses. The genotypes were assigned using a combination of directly genotyped (n = 4 independent SNPs) or imputed (n = 38 independent SNPs) SNP6.0 array data (502 independent CLL patients) and whole-genome sequencing (WGS) data (n = 146 independent CLL patients). Fourteen of the 502 CLL cases were excluded owing to non-European ancestry, excessive heterozygosity, or sample relatedness. Five of the 38 imputed SNPs were subject to poor imputation with an information measure <0.85. The imputation fidelity check using the WGS data resulted into the exclusion of one CLL case with >10% discordance between the imputed and WGS data. In the lower part of the panel, summary results for each type of analysis are shown. b Per locus overview of QTL analysis results. Loci are denoted by their respective sentinel SNPs (with their location in GRCh38) and shaded boxes indicate the presence of ≥1 significant QTL at false discovery rate <0.05 for each of the four layers analyzed: assay for transposase-accessible chromatin using sequencing (QTLs marked by dark red shading); H3K27ac (orange); DNA methylation (blue); gene expression (green). In addition, on the right, the candidate target genes identified by expression QTL analysis are indicated
Fig. 3Transcription factor (TF)-binding motif analysis. Examples of altered TF-binding motifs at potential functional risk loci. The affected nucleotides in the different motifs are marked by an asterisk. The sequence surrounding the single-nucleotide polymorphism (SNP) is located under the motif graphs, with the different possible alleles of the potential functional SNP indicated below each other
Fig. 4Characterization of quantitative trait loci (QTLs) and looping interactions at the 15q23 and 8q22.3 chronic lymphocytic leukemia (CLL) risk loci. (Top panels) Boxplots illustrating associations between a rs11637565 and b rs2511713 genotypes and chromatin accessibility (assay for transposase-accessible chromatin(ATAC) QTL), histone H3 lysine 27 acetylation (H3K27ac QTL), DNA methylation (methylation QTL), and candidate target gene transcript levels (expression QTL) in CLL patients (number of independent CLL samples for ATAC: n = 99, H3K27ac: n = 97, DNA methylation: n = 486, and transcript levels: n = 452). (Lower panels) Chromosomal positions of sentinel single-nucleotide polymorphisms (SNPs; rs11637565/rs2511713, gray dots), QTL peaks/probes, and coding genes (black rectangles). The linkage disequilibrium (LD) regions, encompassing SNPs in LD r2 ≥ 0.2 with the sentinel SNPs are indicated by light blue rectangles. Also shown are looping interactions (pink arcs) from the LD regions to the promoters of TLE3 and UBR5. Pink rectangles represent interacting HindIII fragments. Topologically associating domains (dark blue rectangles, TAD) containing all features are truncated for clarity; their extension beyond the figure boundaries is indicated by arrowheads