| Literature DB >> 29348612 |
Joseph L Wiemels1,2,3,4, Kyle M Walsh5,6, Adam J de Smith5, Catherine Metayer7, Semira Gonseth5,8, Helen M Hansen6, Stephen S Francis5,9, Juhi Ojha5, Ivan Smirnov6, Lisa Barcellos7, Xiaorong Xiao7, Libby Morimoto7, Roberta McKean-Cowdin8, Rong Wang10, Herbert Yu11, Josephine Hoh10, Andrew T DeWan10, Xiaomei Ma12.
Abstract
Childhood acute lymphoblastic leukemia (ALL) (age 0-14 years) is 20% more common in Latino Americans than non-Latino whites. We conduct a genome-wide association study in a large sample of 3263 Californian children with ALL (including 1949 of Latino heritage) and 3506 controls matched on month and year of birth, sex, and ethnicity, and an additional 12,471 controls from the Kaiser Resource for Genetic Epidemiology Research on Aging Cohort. Replication of the strongest genetic associations is performed in two independent datasets from the Children's Oncology Group and the California Childhood Leukemia Study. Here we identify new risk loci on 17q12 near IKZF3/ZPBP2/GSDMB/ORMDL3, a locus encompassing a transcription factor important for lymphocyte development (IKZF3), and at an 8q24 region known for structural contacts with the MYC oncogene. These new risk loci may impact gene expression via local (four 17q12 genes) or long-range (8q24) interactions, affecting function of well-characterized hematopoietic and growth-regulation pathways.Entities:
Mesh:
Year: 2018 PMID: 29348612 PMCID: PMC5773513 DOI: 10.1038/s41467-017-02596-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Manhattan plot of P values resulting from meta-analysis of three separate analyses in the California population (Latinos, non-Latino whites, and non-Latino blacks) including study-specific controls and GERA controls (total N = 3263 cases, 15,977 controls). The locations of prior identified GWAS hits are indicated by their gene acronyms. New genetic associations identified in this study are highlighted in bold. These association peaks exceed the Bonferroni genome-wide P value cutoff of 5 × 10−8 indicated by a horizontal line
Lead SNPs near SP4, 8q24, and IKZF3 reaching genome-wide significance in discovery analyses of CCRLP acute lymphoblastic leukemia patients, including replication analyses in the Children’s Oncology Group and the California Childhood Leukemia Study Latino Participants
| SP4—rs2390536 risk allele Aa | 8q24.1—rs4617118 risk allele Gb | IKZF3—rs2290400 risk allele Tc | |||||||
|---|---|---|---|---|---|---|---|---|---|
| RAFd | OR (95% CI) | RAFa | OR (95% CI) | RAFa | OR (95% CI) | ||||
| CCRLP meta-analysise |
| 1.20 (1.13–1.29) |
| 1.27 (1.17–1.38) |
| 1.18 (1.11–1.25) | |||
| CCRLP Latinos | 0.26 | 1.42 × 10−4 | 1.19 (1.09–1.31) | 0.15 | 2.90 × 10−5 | 1.25 (1.12–1.38) | 0.57 | 4.33 × 10−6 | 1.20 (1.11–1.30) |
| CCRLP Whites | 0.35 | 5.69 × 10−5 | 1.23 (1.11–1.35) | 0.18 | 1.74 × 10−3 | 1.26 (1.09–1.45) | 0.50 | 2.72 × 10−3 | 1.15 (1.05–1.26) |
| CCRLP African Americans | 0.22 | 0.73 | 1.07 (0.73–1.58) | 0.23 | 1.55 × 10−3 | 1.54 (1.18–2.01) | 0.52 | 0.18 | 1.19 (0.92–1.53) |
| Replication meta-analysis | 0.075 | 1.10 (0.99–1.23) |
| 1.30 (1.14–1.49) |
| 1.14 (1.03–1.26) | |||
| COG Replication (European) | 0.36 | 0.025 | 1.15 (1.018–1.31) | 0.17 | 0.011 | 1.22 (1.05–1.43) | 0.45 | 0.13 | 1.10 (0.97–1.25) |
| CCLS Replication (Hispanic) | 0.2133 | 0.74 | 0.96 (0.77–1.20) | 0.1069 | 1.03 × 10−3 | 1.58 (1.20–2.07) | 0.6135 | 0.030 | 1.22 (1.02–1.46) |
| Combined datasets |
| 1.18 (1.11–1.24) |
| 1.28 (1.19–1.37) |
| 1.17 (1.11–1.23) | |||
RAF risk allele frequency, CCLS California Childhood Leukemia Study, CCRLP California Cancer Records Linkage Project, COG Children’s Oncology Group.
Bold values indicate P values lower than genome-wide significance (P<5 × 10−8) for CCRLP GWAS and combined datasets, and nominal significance (P<0.05) for replication meta-analysis.
a Rs2390536 was genotyped “on array” for all sample sets apart from COG Replication (Imputed INFO score = 0.99)
b rs4617118 was genotyped “on array for CCRLP datasets, and imputed for COG Replication (INFO = 0.96) and CCLS Replication (INFO = 0.88)
c rs2290400 was genotyped “on array” for all sample sets apart from COG replication (INFO = 0.88)
d RAF calculated among ethnicity-matched controls
e Sample sizes for all analyses: CCRLP Latinos: 1949 cases, 8584 controls; CCRLP whites: 1184 cases, 3551 controls; CCRLP African Americans: 130 cases, 3842 controls. For the replication: COG: 959 cases, 2624 controls; CCLS: 530 cases and 511 controls
Fig. 2Genetic association peaks at chromosomes a 7, b 8, and c 17. Genotypes include both “on array” SNPs and additional SNPs imputed as described in the Methods. The top associated “on array” SNP from the CCRLP/Kaiser discovery analysis is indicated by its “rs” identity (and purple diamond shape), and other SNPs are displayed by color showing their extent of genetic linkage with this top SNP. Recombination rate, genetic position, and the locations of nearby genes are indicated. The P values for the association at chromosome 7p15.3 (rs2390536, near SP4, OR = 1.20, 95% CI: 1.13–1.29, P = 3.6 × 10−8), chromosome 8q24.21 (rs4617118, OR = 1.27, 95% CI: 1.17–1.38, P = 3.1 × 10−9), and chromosome 17q12 (rs2290400, OR = 1.18, 95% CI: 1.11–1.25, P = 2.1 × 10−8)
Fig. 3Chromatin contacts for the 8q24 top hit SNP. Chromosome contacts for lymphoblastoid cell line GM12878 are shown in the region surrounding the SNP. Two large domains to the left and right of the MYC gene are shown (analysis as described by Rao and Huntley[19]), alongside confirmed GWAS catalog alleles for any disease (diagonal line with green hatches indicating previously identified GWAS SNPs for any indication). Despite the large number of disease associations in this region, there are no known diseases with a GWAS hit at the same location as rs4617118. This SNP is located within the genome domain at the right side of the MYC locus. On the figure, distant chromatin contacts are indicated in red color off the diagonal, which is the co-occurrence of sequence using the Hi-C method. The annotations below the diagonal line indicate contact points (yellow lines), the top peaks of contact (cyan, or blue color), and chromatin loop calls (green)
Fig. 4GTEX expression quantitative trait analysis of rs2290400 (chromosome 17, reference allele T, alternative allele C). Data from whole blood samples are shown, the significance of the eQTL was lower than a P value of 10−25 for both genes. Box plots depict the interquartile range, and error bars indicate 1.5 times the interquartile range