Xiao-Xia Zhang1, Yue-Feng Du2, Ya-Jing Zhai1, Fan Gao3, Yu-Juan Yang4, Xian-Cang Ma3, Jun Lu3, Jie Zheng3. 1. Department of Pharmacy, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, People's Republic of China. 2. Department of Urology, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, People's Republic of China. 3. Clinical Research Center, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, People's Republic of China. 4. The Third Department of Cardiology, Shaanxi Provincial People's Hospital, Xi'an, Shaanxi, People's Republic of China.
Abstract
Acute lymphoblastic leukemia (ALL) has been studied intensively for decades, but the details of its etiology and underlying mechanisms have yet to be fully elucidated. It is now generally acknowledged that genetic factors contribute greatly to the development of this disease. The gene encoding CCAAT/enhancer-binding protein ε (CEBPE) is involved in the development of leukemia, and in particular the rs2239633 single nucleotide polymorphism (SNP) of CEBPE. The association between rs2239633 and risk of ALL has been well studied, but remains unclear. Therefore, a meta-analysis was performed in this study to establish a more precise estimation of that relationship. A comprehensive literature search of the PubMed electronic database was conducted, and relevant studies published up to February 20, 2015 were selected for analysis. The references of the retrieved articles were also screened. The extracted data were analyzed statistically, and pooled odds ratios with 95% confidence intervals were calculated using Review Manager (version 5.2) to estimate the association strength. Finally, eleven studies were included in the meta-analysis. The pooled analyses revealed that rs2239633 was associated with an increased risk of childhood ALL in Caucasians under any contrast models (P<0.01). However, this SNP did not affect the risk of ALL in adulthood among Caucasians, or in childhood among East Asians. In conclusion, these findings confirm that the CEBPE rs2239633 SNP could be considered a good marker of pediatric ALL risk in Caucasians, but not in East Asians; it is not a good marker of adult ALL risk in Caucasians.
Acute lymphoblastic leukemia (ALL) has been studied intensively for decades, but the details of its etiology and underlying mechanisms have yet to be fully elucidated. It is now generally acknowledged that genetic factors contribute greatly to the development of this disease. The gene encoding CCAAT/enhancer-binding protein ε (CEBPE) is involved in the development of leukemia, and in particular the rs2239633 single nucleotide polymorphism (SNP) of CEBPE. The association between rs2239633 and risk of ALL has been well studied, but remains unclear. Therefore, a meta-analysis was performed in this study to establish a more precise estimation of that relationship. A comprehensive literature search of the PubMed electronic database was conducted, and relevant studies published up to February 20, 2015 were selected for analysis. The references of the retrieved articles were also screened. The extracted data were analyzed statistically, and pooled odds ratios with 95% confidence intervals were calculated using Review Manager (version 5.2) to estimate the association strength. Finally, eleven studies were included in the meta-analysis. The pooled analyses revealed that rs2239633 was associated with an increased risk of childhood ALL in Caucasians under any contrast models (P<0.01). However, this SNP did not affect the risk of ALL in adulthood among Caucasians, or in childhood among East Asians. In conclusion, these findings confirm that the CEBPErs2239633 SNP could be considered a good marker of pediatric ALL risk in Caucasians, but not in East Asians; it is not a good marker of adult ALL risk in Caucasians.
Acute lymphoblastic leukemia (ALL) is the most common malignancy in childhood, accounting for 30% of all childhood cancers.1 The etiology of childhood ALL is mostly unknown, although infections in the first years of life and some environmental factors, such as ionizing radiation and parental alcohol and tobacco use, are thought to play a causative role.2ALL is known to result from an accumulation of mutations in tumor suppressor genes and oncogenes, and genetic alterations affecting several chromosomes.2 Thus, the genetic variations may be important factors in its development. This finding has recently led to the performance of several genome-wide association studies to identify common single nucleotide polymorphisms (SNPs) affecting the susceptibility to childhood ALL.2–6 One of the most widely investigated SNPs identified in these studies is the common rs2239633 SNP of CEBPE, the gene encoding CCAAT/enhancer-binding protein ε (CEBPE), which plays an important role in the regulation of myelopoiesis.7 Papaemmanuil et al4 also found that CEBPE is a suppressor of myeloid leukemogenesis, and may be involved in B-cell precursor ALL development. The populations studied in the aforementioned genome-wide association studies were limited, and the many subsequent association studies that have been conducted with a view to verifying the described effects of rs2239633 on susceptibility to ALL in childhood in different populations have yielded varying results.8–13A meta-analysis was performed in the present study to review and compare all relevant studies, enabling more accurate conclusions to be drawn. In addition, since the genetic background of ALL differs between the adult and childhood forms, some studies were also performed to make clear if this SNP was a risk factor for adult ALL.14,15 In this study, those two relevant studies were also compared to examine the differential roles of rs2239633 in children and adults.
Materials and methods
Literature search strategy
Two of the authors (Zhang and Du) each performed the literature search of the PubMed database independently. The process was supervised by the third author (Zhai), and any disagreement was resolved by consensus. The search strategy was to identify all possible studies that involved the use of the following test words: (14q11.2 “or” CEBPE “or” rs2239633) “and” (leukemia). The language of publication was restricted to English. The reference lists of retrieved articles were curated manually. The literature search was last updated on February 20, 2015. This study was judged exempt regarding ethics.
Selection criteria
All studies with the aim of investigating the association between presence of the rs2239633 SNP and ALL risk were potentially included. The following inclusion criteria were applied:Published articles on human genetics (full texts or abstracts) without racial restriction.If articles contained more than one geographic or other clinical characteristic subgroup, each subgroup was considered separately.If multiple studies were derived from the same population, only the study with the largest sample was analyzed to avoid overlapping data.Presentation of sufficient data on the distribution of rs2239633 in cases and in control groups or the data necessary to calculate those distributions. If the data were deficient or presented in an inappropriate form, the original authors were contacted and asked for the raw data.
Extracted information
A standard data-collection procedure was used, in line with the aforementioned inclusion criteria. Two of the authors (Zhang and Du) independently extracted the variables from the individual eligible studies in duplicate and made the characteristics compatible in a pooled database. Any encountered disagreements were resolved by discussion in order to reach a consensus. The following information was extracted from all eligible studies: name of the first author, year of publication, ethnicities of the study population, age and number of cases and controls, and main finding. The distribution of the genotypes among the controls was also tested for conformity to Hardy–Weinberg equilibrium (HWE).
Statistical analysis
The meta-analysis examined the overall association for allele contrasts, homozygote contrasts, and recessive and dominant models. The presence of heterogeneity was assessed using Cochran’s Q-statistic (with a significance cutoff of P<0.1) and quantified using the I2 statistic, which is proportional to the degree of heterogeneity; the value of I2 statistic lies in the range 0%–100%, with a value of 0% indicating homogeneity, and a value above 50% indicating the presence of a very high degree of heterogeneity.16The raw data for genotype distribution were used to calculate the study-specific estimates of odds ratio (OR) and 95% confidence interval (CI). The overall pooled OR and 95% CI were estimated using the Mantel–Haenszel method, with a fixed-effects model when no significant clinical or statistical heterogeneity was present; when substantial heterogeneity was present, the random-effects model was used. The significance of the polled OR was determined by a test. The cutoff for statistical significance was set at P<0.05.Potential publication bias was estimated by constructing funnel plots. If most of the data appear at the top of a funnel plot and are distributed roughly symmetrically, this suggests the absence of obvious publication bias, and vice versa.13 There was no need to construct funnel plots when there were too few analyzed studies (ie, n<5). All of these statistical analyses were performed using Review Manager (version 5.2) (Cochrane, London, UK).The conformity of the rs2239633 SNP to HWE was tested among the controls in each individual study via the chi-square test or Fisher’s exact test using SPSS software (SPSS Inc., Chicago, IL, USA). Sensitivity analyses were performed by excluding those studies for which the data deviated from HWE (P<0.05), omitting the highest weight or changing the model.
Results
Literature search and characteristics of the retrieved studies
Figure 1 presents a flow diagram summarizing the process of study selection. The final search generated 65 potentially relevant publications. After reading the titles and abstracts, 44 articles were found to be irrelevant to this study in that the research did not cover the association between rs2239633 and susceptibility to ALL, and were thus excluded from the analysis. Of the remaining 21 publications, 3 were case reports, 2 were reviews, and 1 was a conference proceedings; these were therefore also excluded. Finally, 15 publications were identified, in which the effect of rs2239633 on the risk of both adult and childhood ALL was evaluated. Of these, four did not supply enough data, two were about adults,14,15 and nine were about children.4,8–13,17,18 These studies, providing enough data, were conducted in various populations of different ethnicities: eight involved Caucasians,4,10,11,13–15,17,18 two involved East Asians,9,12 and one involved multiple races.8 The basic data for every eligible study were extracted and are listed in Table 1. Although the authors of some of the publications were contacted to obtain the raw genotype data, some of the data could not be obtained.
Figure 1
Flow chart of study selection process.
Abbreviation: ALL, acute lymphoblastic leukemia.
Table 1
Study characteristics
Study
Ethnicity
Cases
Controls
Main results
Median age (year)
Number of males
Number of females
Median age (year)
Number of males
Number of females
HWE, P
Emerenciano et al8
Brazilian
262
243
0.135
N
Lautner-Csorba et al17
Caucasian
6.4±4.2
308
235
16.1±12.4
305
224
0.508
N
Orsi et al18
European
<15
0.546
N
Papaemmanuil et al4
UK
5.5±3.3
322
255
725
713
0.779
Y
Papaemmanuil et al4,a
UK
5.8±3.8
252
176
0.517
Y
Pastorczak et al11
European
4.9
228
170
22
332
399
0.454
N
Prasad et al13
German
6
663
530
58
762
754
0.233
Y
Prasad et al13,a
UK
6
101
90
59
117
244
0.310
N
Ross et al10
Caucasian
0.091
N
Vijayakrishnan et al12
Thai
6
109
81
18–25
82
100
0.162
N
Wang et al9
Chinese
1–18
349
219
1–18
441
231
0.147
N
Peyrouze et al15
European
18–60
0.355
N
Burmeister et al14
German
16–76
188
134
0.233
N
Enciso-Mora et al24
UK
5.4±3.6
464
360
Y
Migliorini et al25
UK/German
Y
Chokkalingam et al22
Hispanic
5.6±3.6
156
144
5.5±3.6
214
192
Y
Walsh et al23
Hispanic
5.2±3.3
156
141
5.3±3.4
240
214
Y
Note:
Indicated another population studied in the same article.
Abbreviations: HWE, Hardy–Weinberg equilibrium; N, no; Y, yes.
Meta-analysis
rs2239633 SNP and ALL risk in childhood
As stated, nine papers studied the association between rs2239633 and risk of ALL in childhood and provided data about genotype distribution. The data from these studies were pooled and subjected to a meta-analysis. Overall, this SNP was found to be significantly associated with risk of ALL in childhood under the dominant model (Figure 2A, P<0.00001, OR =1.27, 95% CI =1.15–1.39), recessive model (Figure 2B, P<0.00001, OR =1.23, 95% CI =1.14–1.33), and allele contrast (Figure 2C, P<0.00001, OR =1.18, 95% CI =1.12–1.24). However, the presence of heterogeneity was found in the recessive model (Figure 2B, I2=56%) and allele contrast (Figure 2C, I2=57%), indicating the requirement for a subanalysis. Therefore, a submeta-analysis based on races was conducted. Among the included studies, the subjects in six were Caucasians,4,10,11,13,17,18 those in two were East Asians,9,12 and those in the remaining study were a mixed ethnic population.8 The submeta-analysis revealed that under the three contrast models, rs2239633 was significantly associated with childhood ALL in Caucasians (Figure 2A, P<0.00001, OR =1.31, 95% CI =1.18–1.46 for dominant model; Figure 2B, P<0.00001, OR =1.32, 95% CI =1.21–1.44 for recessive model; Figure 2C, P<0.00001, OR =1.23, 95% CI =1.16–1.30 for allele contrast), but not in East Asians (Figure 2). The degree of heterogeneity in both the Caucasian and East Asian subgroups was relatively small (Figure 2, I2=28%, 29%, 36% for Caucasian under the dominant, recessive, and allele contrast models, respectively; I2=0 for East Asian under the dominant, recessive, and allele contrast models).
Figure 2
Meta-analysis of the association between rs2239633 and childhood ALL risk.
Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model. aIndicated another population studied in the same article.
Two of the studies investigated the relationship between the rs2239633 SNP and risk of ALL in adults, and provided the genotype data.14,15 The subjects in both of those studies were Caucasians. After pooling these data, further analysis revealed that this SNP was not associated with risk of ALL in adults in any model of comparison (Figure 3A, P=0.18, OR =1.21, 95% CI =0.91–1.61 for dominant model; Figure 3B, P=0.79, OR =1.03, 95% CI =0.81–1.32 for recessive model; Figure 3C, P=0.35, OR =1.08, 95% CI =0.92–1.26 for allele contrast). There was no heterogeneity in any of the three contrast models (Figure 3, I2=0).
Figure 3
Meta-analysis of the association between rs2239633 and adult ALL risk.
Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model.
The publication bias for the research with pediatric subjects was assessed. Overall, no obvious publication bias was found under the recessive model, and the funnel plot exhibited good symmetry under the dominant model if the study of Vijayakrishnan et al12 was excluded. The result did not change after excluding this article. The symmetry of the funnel plot was poor under the allele contrast model, which suggested obvious publication bias (Figure 4).
Figure 4
The publication bias for researches of childhood.
Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model.
Abbreviation: SE, standard error.
The stability of the results involving Caucasians was evaluated by performing a sensitivity analysis. Neither omitting Prasad et al’s13 study (highest weight) nor changing the model (random-effects model) made a significant difference to the pooled effects of the three contrast models, suggesting that the meta-analysis findings were highly stable.
Discussion
CEBPE is involved in terminal differentiation and functional maturation of neutrophils and macrophages, and is critical for transcription of most granule proteins localized to specific and gelatinase granules as well as for azurophil granule proteins expressed in the late promyelocyte stage.19,20 It has also been reported that CEBPE is targeted by recurrent immunoglobulin heavy-chain (locating on chromosome band 14q32) chromosomal translocations in B-cell precursor ALL, and mutations in CEBPE lead to neutrophil-specific granule deficiency.9,19,20 All of these findings suggested that CEBPE plays a role in the development of ALL. The studies conducted thus far on the relationship between rs2239633 and risk of ALL have yielded a relatively small amount of data and have produced conflicting results. Those data were evaluated in the present meta-analysis. The primary findings of this investigation are the increased OR of childhood ALL susceptibility associated with presence of the rs2239633 SNP in a fixed-effects model, and the lack of an association between this SNP and risk of ALL in adults.Subgroup analysis for childhood ALL according to race revealed that this SNP impacted Caucasians, and not Asians. This finding was extremely robust, remaining significant irrespective of the different model assumptions (fixed-effects versus random-effects models), genetic models, and additional sensitivity analysis. This disparity between races may reflect the existence of a true population-specific disease variant, but may also be attributable to differences in genomic structure at these loci between populations. Wang et al9 determined the linkage disequilibrium (LD) patterns at the locus of rs2239633 in European and Chinese populations from the HapMap database, and found that rs2239633 is located in different LD blocks in the two populations. Therefore, we speculate that the pathogenic locus might not be rs2239633, but rather other SNPs in the LD block. Other factors contributing to the differential results for the two populations include differences in allele frequency and differences in genetic and environmental backgrounds that interact with the variants.12It has been proposed that genetics plays a role in adult cancer susceptibility, although this has been difficult to evaluate because of confounders, such as environmental factors, the immune system, and exposure to pathogens. To the best of our knowledge, no predisposing genetic marker has been proven for adult ALL. No association between rs2239633 and risk of adult ALL was found in this study. Although the failure to detect any weak association may be affected by the relatively small sample, these data suggest that the genetic impact of polymorphisms at these loci is at least weaker for adult ALL than it is for pediatric ALL and highlights genetic differences between ALL occurring in adults versus children.15Based on a meta-analysis, Wang et al21 concluded that rs2239633 confers an increased risk of childhood ALL, especially among Caucasians and Hispanics. The main findings of the present meta-analysis are consistent with that conclusion. While, the adults’ ALL risk related with rs2239633 was also evaluated, three different genetic models were used in our study, which showed more comprehensive evaluation of the impact of rs2239633 on ALL. In addition, we believe that the article of Wang et al21 contained some data errors. The research data – and in particular the OR and 95% CI values – of Chokkalingam et al22 and Walsh et al23 that were used in the meta-analysis performed by Wang et al21 were calculated using logistic regression. We speculate that since the OR and 95% CI values generally differed before and after data adjustment, Wang et al21 did not use the raw data from the studies of Chokkalingam et al22 and Walsh et al23 thus potentially causing bias. It is not possible to derive raw data using the adjusted data. Before completing our study, we attempted to contact the authors of those two articles to obtain the raw genotype data,22,23 but were unsuccessful. Therefore, these two articles were excluded from the analysis, which we believe was the most appropriate action.It is worth noting that there were no differences in the results among the three contrast models, although the funnel figure obtained under the allele contrast model exhibited poor symmetry, and a clear outlier appeared in that obtained under the dominant model, indicating that the impact of genetic models on the results is not large.Our study demonstrated that genotype testing is very important, although just a statistical association was found. Based on genotype testing, the susceptible population will be able to take the necessary protective measures, which is particularly important for those with long-term exposure to environmental pollutants.This study had some limitations. First, the effect of gene–gene and gene–environmental interactions was not addressed in the meta-analysis due to lack of data. Second, the meta-analysis was based on unadjusted estimates, and the confounding factors could not be controlled for because most studies did not provide these data. Third, for publication bias, the best symmetry was observed for the funnel figure produced under the recessive genetic model, but not under the dominant model and allele contrast. Finally, the conclusions regarding risk of ALL among adults and Asian children are based on relatively few cases, and so the statistical power thereof is not strong.
Conclusion
In summary, the findings of the present meta-analysis support a positive association between CEBPErs2239633 and childhood risk of ALL in Caucasians, but there was no such association for risk of ALL in Caucasian adults or East Asian children. Further large-scale studies assessing gene–gene and gene–environment interactions are required.
Authors: V Enciso-Mora; F J Hosking; E Sheridan; S E Kinsey; T Lightfoot; E Roman; J A E Irving; I P M Tomlinson; J M Allan; M Taylor; M Greaves; R S Houlston Journal: Leukemia Date: 2012-03-29 Impact factor: 11.528
Authors: L Orsi; J Rudant; A Bonaventure; S Goujon-Bellec; E Corda; T-J Evans; A Petit; Y Bertrand; B Nelken; A Robert; G Michel; N Sirvent; P Chastagner; S Ducassou; X Rialland; D Hémon; E Milne; R J Scott; A Baruchel; J Clavel Journal: Leukemia Date: 2012-06-04 Impact factor: 11.528
Authors: Gabriele Migliorini; Bettina Fiege; Fay J Hosking; Yussanne Ma; Rajiv Kumar; Amy L Sherborne; Miguel Inacio da Silva Filho; Jayaram Vijayakrishnan; Rolf Koehler; Hauke Thomsen; Julie A Irving; James M Allan; Tracy Lightfoot; Eve Roman; Sally E Kinsey; Eamonn Sheridan; Pamela Thompson; Per Hoffmann; Markus M Nöthen; Thomas W Mühleisen; Lewin Eisele; Martin Zimmermann; Claus R Bartram; Martin Schrappe; Mel Greaves; Martin Stanulla; Kari Hemminki; Richard S Houlston Journal: Blood Date: 2013-08-30 Impact factor: 22.113
Authors: Jun J Yang; Cheng Cheng; Wenjian Yang; Deqing Pei; Xueyuan Cao; Yiping Fan; Stanley B Pounds; Geoffrey Neale; Lisa R Treviño; Deborah French; Dario Campana; James R Downing; William E Evans; Ching-Hon Pui; Meenakshi Devidas; W P Bowman; Bruce M Camitta; Cheryl L Willman; Stella M Davies; Michael J Borowitz; William L Carroll; Stephen P Hunger; Mary V Relling Journal: JAMA Date: 2009-01-28 Impact factor: 56.272
Authors: Orsolya Lautner-Csorba; András Gézsi; Agnes F Semsei; Péter Antal; Dániel J Erdélyi; Géza Schermann; Nóra Kutszegi; Katalin Csordás; Márta Hegyi; Gábor Kovács; András Falus; Csaba Szalai Journal: BMC Med Genomics Date: 2012-09-28 Impact factor: 3.063
Authors: Elli Papaemmanuil; Fay J Hosking; Jayaram Vijayakrishnan; Amy Price; Bianca Olver; Eammon Sheridan; Sally E Kinsey; Tracy Lightfoot; Eve Roman; Julie A E Irving; James M Allan; Ian P Tomlinson; Malcolm Taylor; Mel Greaves; Richard S Houlston Journal: Nat Genet Date: 2009-08-16 Impact factor: 38.330
Authors: Darlen Cardoso Carvalho; Alayde Vieira Wanderley; Fernando Augusto Rodrigues Mello Junior; André Mauricio Ribeiro Dos Santos; Luciana Pereira Colares Leitão; Tatiane Piedade de Souza; Amanda de Nazaré Cohen Lima de Castro; Leandro Lopes de Magalhães; Marianne Rodrigues Fernandes; João Augusto Nunes de Carvalho Junior; André Salim Khayat; Sidney Santos; Paulo Pimentel de Assumpção; Ney Pereira Carneiro Dos Santos Journal: Leuk Res Rep Date: 2019-11-27