| Literature DB >> 34220922 |
Junke Wang1, Alyssa I Clay-Gilmour2,3, Ezgi Karaesmen1, Abbas Rizvi1, Qianqian Zhu4, Li Yan4, Leah Preus1, Song Liu4, Yiwen Wang1, Elizabeth Griffiths5, Daniel O Stram6, Loreall Pooler6, Xin Sheng6, Christopher Haiman6, David Van Den Berg6, Amy Webb7, Guy Brock7, Stephen Spellman8, Marcelo Pasquini9, Philip McCarthy5, James Allan10, Friedrich Stölzel11, Kenan Onel12, Theresa Hahn5, Lara E Sucheston-Campbell1,13.
Abstract
The role of common genetic variation in susceptibility to acute myeloid leukemia (AML), and myelodysplastic syndrome (MDS), a group of rare clonal hematologic disorders characterized by dysplastic hematopoiesis and high mortality, remains unclear. We performed AML and MDS genome-wide association studies (GWAS) in the DISCOVeRY-BMT cohorts (2,309 cases and 2,814 controls). Association analysis based on subsets (ASSET) was used to conduct a summary statistics SNP-based analysis of MDS and AML subtypes. For each AML and MDS case and control we used PrediXcan to estimate the component of gene expression determined by their genetic profile and correlate this imputed gene expression level with risk of developing disease in a transcriptome-wide association study (TWAS). ASSET identified an increased risk for de novo AML and MDS (OR = 1.38, 95% CI, 1.26-1.51, Pmeta = 2.8 × 10-12) in patients carrying the T allele at s12203592 in Interferon Regulatory Factor 4 (IRF4), a transcription factor which regulates myeloid and lymphoid hematopoietic differentiation. Our TWAS analyses showed increased IRF4 gene expression is associated with increased risk of de novo AML and MDS (OR = 3.90, 95% CI, 2.36-6.44, Pmeta = 1.0 × 10-7). The identification of IRF4 by both GWAS and TWAS contributes valuable insight on the role of genetic variation in AML and MDS susceptibility.Entities:
Keywords: acute myeloid leukemia; blood and marrow transplantation; genome-wide association study; myelodysplastic syndrome; pleiotropy
Year: 2021 PMID: 34220922 PMCID: PMC8248805 DOI: 10.3389/fgene.2021.554948
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1ASSET analysis and associations by AML and MDS subgroup. Forest plot of the odds ratios (OR) for the association between rs12203592 in IRF4 and MDS and AML subtypes. The variant resides in the Chromosome 6 outside the major histocompatibility complex region. Studies were weighted by inverse of the variance of the log (OR). The solid gray vertical line is positioned at the null value (OR = 1); values to the right represent risk increasing odds ratios. Horizontal lines show the 95% CI and the box is the OR point estimate for each case- control subset with its area proportional to the weight of the patient group. The diamond is the overall effect estimated by ASSET, with the 95% CI given by its width.
FIGURE 2IRF4 region with AML and MDS associated SNP p-values annotated with previous GWAS and Roadmap Epigenome Chromatin States. (A) ASSET analysis AML and MDS SNP associations in the IRF4 region. The x-axis is the chromosome position in kilobase pairs and y-axis shows the –log10 (p-values) for de novo AML and MDS susceptibility. The associated SNPs in the IRF4 region, rs12203592 and rs62389423, are highlighted with sky blue lines drawn through the point to show the relationship of the variant to GWAS hits and Roadmap Epigenome data (2C). rs12203592 and rs62389423 show moderate linkage disequilibrium (r2 = 0.7); rs62389423 and rs62389424 are almost perfectly correlated (r2 = 0.95). (B) Previously reported GWAS SNPs in the IRF4 region. Phenotypes are color coded and all variants are associated at P < 5 × 10–8. (C) Genes in the region annotated with the chromatin-state segmentation track (ChromHMM) from Roadmap Epigenome data for all blood, T-cell, HSC and B-cells. The cell line numbers shown down the left side correspond to specific epigenome road map cell lines. E029:Primary monocytes from peripheral blood; E030:Primary neutrophils from peripheral blood; E031:Primary B cells from cord blood; E032:Primary B Cells from peripheral blood; E033:Primary T Cells from cord blood; E034:Primary T Cells from blood; E035:Primary hematopoietic stem cells; E036:Primary hematopoietic stem cells short term culture; E037:Primary T helper memory cells from peripheral blood 2; E038:Primary T help naïve cells from peripheral blood; E039:Primary T helper naïve cells from peripheral blood; E040:Primary T helper memory cells from peripheral blood 1; E041:Primary T helper cells PMA-Ionomycin stimulated; E042:Primary T helper 17 cells PMA-Ionomycin stimulated; E043:Primary T helper cells from peripheral blood; E044:Primary T regulatory cells from peripheral blood; E045:Primary T cells effector/memory enriched from peripheral blood; E046:Primary Natural Killer cells from peripheral blood; E047:Primary T CD8 naïve cells from peripheral blood; E048:Primary T CD8 memory cells from peripheral blood; E-50:Primary hematopoietic stem cells G-CSF mobilized Female; E-51:Primary hematopoietic stem cells G-CSF mobilized Male; E062:Primary Mononuclear Cells from Peripheral Blood; E0116 Lymphoblastic Cell Line. The colors indicate chromatin states imputed by ChromHMM and shown in the key titled “Roadmap Chromatin State.”
FIGURE 3Manhattan plot of the de novo AML and MDS GWAS and TWAS. The plot represents the TWAS p-values (top) of each gene and de novo AML and MDS GWAS P-values (bottom) of each SNP included in the case-control association study. Significant and suggestive genes are highlighted in orange and labeled by their gene symbols. The orange horizontal line on the top represents the transcriptome-wide significance threshold of P = 4.5 × 10–6. The orange horizontal line on the bottom represents the genome-wide threshold of P = 5.0 × 10–8.
FIGURE 4Regional plots of PrediXcan-TWAS and SNP associations with AML and MDS. Each box represents PrediXcan-TWAS significant genes AKT1, IRF4 and RASGRP2 ± 0.5 megabases. The grey shaded bars represent the gene, where height is gene expression association and width is gene region in base pairs and the purple dots represent SNP associations with AML and MDS -log10 (P-values) are shown on the y-axis. Green and red lines denote the transcriptome-wide and genome wide significant P-values, respectively. Results are filtered for imputation quality (rsq > 0.8) and heterogeneity of effect between cohorts.