Literature DB >> 26517675

An mRNA expression signature for prognostication in de novo acute myeloid leukemia patients with normal karyotype.

Ming-Kai Chuang1, Yu-Chiao Chiu2,3, Wen-Chien Chou1,4, Hsin-An Hou4, Mei-Hsuan Tseng4, Yi-Yi Kuo1,4, Yidong Chen3,5, Eric Y Chuang2,6, Hwei-Fang Tien4.   

Abstract

Although clinical features, cytogenetics, and mutations are widely used to predict prognosis in patients with acute myeloid leukemia (AML), further refinement of risk stratification is necessary for optimal treatment, especially in cytogenetically normal (CN) patients. We sought to generate a simple gene expression signature as a predictor of clinical outcome through analyzing the mRNA arrays of 158 de novo CN AML patients. We compared the gene expression profiles of patients with poor response to induction chemotherapy with those who responded well. Forty-six genes expressed differentially between the two groups. Among them, expression of 11 genes was significantly associated with overall survival (OS) in univariate Cox regression analysis in 104 patients who received standard intensive chemotherapy. We integrated the z-transformed expression levels of these 11 genes to generate a risk scoring system. Higher risk scores were significantly associated with shorter OS (median 17.0 months vs. not reached, P < 0.001) in ours and another 3 validation cohorts. In addition, it was an independent unfavorable prognostic factor by multivariate analysis (HR 1.116, 95% CI 1.035~1.204, P = 0.004). In conclusion, we developed a simple mRNA expression signature for prognostication in CN-AML patients. This prognostic biomarker will help refine the treatment strategies for this group of patients.

Entities:  

Keywords:  acute myeloid leukemia; mRNA signature; normal cytogenetics; prognosis

Mesh:

Substances:

Year:  2015        PMID: 26517675      PMCID: PMC4770759          DOI: 10.18632/oncotarget.5390

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

“Precision medicine” has become a state-of-the-art principle in clinical care. As a highly heterogeneous disease, acute myeloid leukemia (AML) requires precise risk stratification to achieve optimal treatment outcomes for the patients. Although several clinical and genetic factors have been widely incorporated into clinical consideration for choosing treatment regimens, more prognostic factors would be welcome as there are still factors not yet being sorted out completely. Cytogenetics has long been considered the most important prognostic factor for AML, however, about one-half of the patients are cytogenetically normal (CN); this group of patients need further prognostic factors for risk stratification [1]. Recently, several genetic mutations with prognostic significance, such as internal tandem duplication of FLT3 (FLT3-ITD) [2-4], NPM1, and CEBPA mutations [5, 6], have partially compensated for the problem. However, about 24% CN-AML patients have no detectable mutations in these genes [2]. Although the expression levels of genes such as BAALC [7], MN1 [8], and ERG [9] provide further reference for prognostication in this group of patients, the significance of single gene expression remains restrictive in the context of a complicated cellular milieu. DNA microarray technology makes it possible to evaluate the global gene expression profiling of cells. Studies have shown distinct genetic expression profiles in AML with different cytogenetics and gene mutations [10-12]. While gene expression signature-derived scoring systems bear prognostic values in AML [11, 13–20], it is rarely used in clinical practice, mainly because of the large gene numbers in those scoring systems, usually dozens to hundreds of probes. For example, Shaughnessy et al. developed a 70-gene expression scoring system to identify patients with shorter progression-free survival (PFS) and overall survival (OS) in multiple myeloma [21]. Subsequently they simplified the system to 5 genes which carried the most discriminatory power of the 70-gene risk model with similar predictive values [22]. We realize that a considerable portion of CN-AML patients still need reliable parameters for choosing optimal treatment strategies. In this study, we developed a simple gene expression signature with prognostic significance by incorporating limited number of probes through comprehensive analysis of the gene expression profiles from our CN-AML patients. Using ours as a discovery set, we validated our results with three other independent CN-AML cohorts, which are available from public domains. Furthermore, we explored the possible molecular pathways underlying this signature.

RESULTS

Identification of genes with prognostic significance

We recruited a total of 351 adult patients (≥15 years of age) with newly diagnosed de novo AML from 1995 to 2011 at the National Taiwan University Hospital (NTUH), who had adequate cryopreserved bone marrow cells for mRNA array studies. Patients with antecedent hematological malignancies or therapy-related AML were excluded. We focused on the 158 patients (45.0%) with CN-AML. Among these patients, 104 (65.8%) received standard intensive chemotherapy. We analyzed the array data of the 158 CN-AML patients for global gene expression profiles. The expression data were processed and normalized to eliminate systematic biases and facilitate further statistical analyses. Since this study is a retrospective analysis with a group of patients spanning for almost 20 years, we aimed to eliminate biases as much as possible by using the response to induction chemotherapy as a criterion for dividing the patients into two groups, one with good response (GR group, 56 patients) who achieved continuous complete remission without relapse and the other with poor response (PR group, 19 patients) who were refractory to the induction chemotherapy. We compared gene expression profiles between the two groups and identified 46 differentially expressed probes (Student's t-test P value < 0.05 and > 2-fold change). These probes corresponded to 43 unique genes. Interestingly, all of the 46 probes were up-regulated in the PR group. Heatmap visualization of these probes were performed using the Genesis software (Fig. 1A) [23].
Figure 1

A. The heatmap of the 46 differential expressed probes between the 19 patients with poor response (PR group) to the first induction chemotherapy and the 56 achieving continuous complete remission (GR group)

The 11 genes which were significantly associated with OS were highlighted in bold text. GSEA enrichment plots on genes associated with functions of B. acute myeloid leukemia and C. proliferation of myeloid cells are shown. The GSEA plots were used to confirm and visualize the significant terms reported by IPA. GSEA first ranked all genes probed on the microarray based on their significance in differential expression between PR and GR groups (denoted by an arrow in the figure). For a significant IPA term (component genes of which are denoted by black line segments), GSEA adopted a walking scoring method (green curve) to measure the degree to which the genes within an IPA term is overrepresented (i.e., enriched) to the left of all genes. Significance of the enrichment score was assessed by a permutation test. As a result, genes related to the two functions were significantly differentially expressed between the PR and GR groups, suggesting significant correlations between these two pathways and the treatment response.

A. The heatmap of the 46 differential expressed probes between the 19 patients with poor response (PR group) to the first induction chemotherapy and the 56 achieving continuous complete remission (GR group)

The 11 genes which were significantly associated with OS were highlighted in bold text. GSEA enrichment plots on genes associated with functions of B. acute myeloid leukemia and C. proliferation of myeloid cells are shown. The GSEA plots were used to confirm and visualize the significant terms reported by IPA. GSEA first ranked all genes probed on the microarray based on their significance in differential expression between PR and GR groups (denoted by an arrow in the figure). For a significant IPA term (component genes of which are denoted by black line segments), GSEA adopted a walking scoring method (green curve) to measure the degree to which the genes within an IPA term is overrepresented (i.e., enriched) to the left of all genes. Significance of the enrichment score was assessed by a permutation test. As a result, genes related to the two functions were significantly differentially expressed between the PR and GR groups, suggesting significant correlations between these two pathways and the treatment response.

Analysis of functional annotations of 43 genes

In order to dissect the biological functions underlying the 43 genes that likely affect chemosensitivity, we analyzed their functional annotations using the Ingenuity Pathway Analysis (IPA) [24] software. The genes were associated with abundant biological functions related to leukemia (data available upon request). Eight genes (BAALC, CD14, CD34, CD74, DNTT, HLA-DRA, IRF8, and MN1) were all associated with “leukemia” (P = 1.15 × 10−4), “acute myeloid leukemia” (P = 9.37 × 10−3), and “proliferation of myeloid cells” (P = 0.044). We further utilized Gene Set Enrichment Analysis (GSEA) [25] to verify the results derived from IPA. GSEA is an enrichment analysis algorithm that features threshold-free input (i.e., global gene profiles). It analyzes whether genes sharing a common function exhibit a global trend toward up-regulation (or down-regulation) in a given condition, measured by enrichment scores and permutation-based empirical P-values. Notably, genes related to the three associated terms (diseases and biological functions) obtained from IPA showed significant enrichment in the PR/GR differential gene expression profiles: the empirical P-values were <0.001 for functions related to acute myeloid leukemia and leukemia, and P = 0.001 for the proliferation of myeloid cells (enrichment plots in Fig. 1B–1C and Fig. S1). These 3 functional categories related genes contributed to a major fraction of the enrichment score, namely the leading-edge components. Appearing in leading-edge components of all three functions were ABL proto-oncogene 1 non-receptor tyrosine kinase (ABL1), B-cell CLL/lymphoma 2 (BCL2), and CD33 molecule (CD33).

Construction of a risk scoring system

In order to construct a risk scoring system, we analyzed the prognostic significance of expression of the 43 genes in survival. The survival analysis was conducted from the 104 patients (out of the 158 patients) who received standard intensive chemotherapy. Among the 46 probes associated with treatment response, 11 were significantly associated with OS (univariate Cox P < 0.01). These probes represented 11 unique genes (full gene list and results in Table 1; highlighted by boldface in Fig. 1A), and higher expression of each of these genes was associated with unfavorable prognosis (Kaplan Meier curves in Fig. S2). Based on the results we built a scoring system by incorporating the z-values (normalized gene expression levels, as defined in the Materials and Methods section) of the 11 genes with equal unity of weight to calculate a risk score for each patient. The risk score was significantly predictive of OS (univariate Cox P = 1.37 × 10−6 and log-rank P = 1.07 × 10−5; Fig. 2A), and disease free survival (DFS) (univariate Cox P = 1.16 × 10−7 and log-rank P = 9.71 × 10−7, Fig. S3). We further applied random permutation to evaluate the performance of our proposed scoring system against a random baseline (detailed in the Materials and Methods section). Remarkably, the scoring system outperformed all of ten-thousand random systems iteratively constructed by random selections of 11 genes from the dataset (empirical P-value <1.00 × 10−4, see the Materials and Methods section), suggesting the non-randomness of performance achieved by the proposed risk score.
Table 1

The list of 11 genes whose expression were significantly associated with overall survival among the 46 probes differential expressed between the patients with good and poor treatment response

ProbeGeneDescriptionUnivariate Cox PHazard ratio95% confidence interval
1410021AIF1Lallograft inflammatory factor 1-like2.01E-031.6571.203–2.284
450424CXCR7atypical chemokine receptor 31.00E-031.6631.228–2.251
6280243DNTTDNA nucleotidylexotransferase9.10E-031.4711.101–1.966
5490768GPR56G protein-coupled receptor 566.86E-031.6171.141–2.291
630278H1F0H1 histone family, member 07.03E-031.6191.140–2.298
6650242IFITM3interferon induced transmembrane protein 31.96E-042.1521.438–3.222
3780647KIAA0125KIAA01255.06E-031.5581.143–2.125
1690066MX1MX dynamin-like GTPase 15.81E-031.6181.150–2.279
6040053STAB1stabilin 17.39E-031.5661.128–2.174
2680110TM4SF1transmembrane 4 L six family member 15.93E-051.8691.377–2.537
5560561TNS3tensin 37.13E-042.1331.376–3.308
Figure 2

The Kaplan Meier curves for OS according to the scores

A. In NTUH discovery set, patients with higher scores have significant shorter OS than those with lower scores (median 17.0 months vs. not reached, P < 0.001); B–D. In the three validation cohorts, the higher scores are all associated with poorer OS (median 12.2 vs 21.3 months, log rank P = 0.01 in TCGA; median 8.4 vs 24.7 months, log rank P = 0.004 in GSE12417-GPL96; median 10.1 vs 42.6 months, log rank P = 0.001 in GSE12417-GPL570).

The Kaplan Meier curves for OS according to the scores

A. In NTUH discovery set, patients with higher scores have significant shorter OS than those with lower scores (median 17.0 months vs. not reached, P < 0.001); B–D. In the three validation cohorts, the higher scores are all associated with poorer OS (median 12.2 vs 21.3 months, log rank P = 0.01 in TCGA; median 8.4 vs 24.7 months, log rank P = 0.004 in GSE12417-GPL96; median 10.1 vs 42.6 months, log rank P = 0.001 in GSE12417-GPL570). For validation analysis we used three independent gene expression datasets from two studies, one from The Cancer Genome Atlas (TCGA) [26] and two (GSE12417-GPL96 and GSE12417-GPL570) from the study of Metzeler et al. [17]. The prognostic significance of higher risk score for unfavorable OS was validated by these independent cohorts of CN-AML (log rank P = 0.01, 0.004, and 0.001 in TCGA (N = 97), GSE12417-GPL96 (N = 163), and GSE12417-GPL570 (N = 79), respectively) (Fig. 2B, 2C and 2D). We compared the performance of this 11-gene to the 7-gene risk scoring system proposed from another study [20]. Although the two scoring systems do not share any genes, they had equivalent prediction performance as shown by the similar P values in the three validation cohorts (Table 2). This was further confirmed by a multivariate analysis that incorporates these two scoring systems as co-variables (data not shown).
Table 2

Comparison between ours and the published 7-gene scoring system by univariate analysis

Dataset11-gene risk score7-gene unweighted score (Marcucci et al. 2014)
Hazard ratio95% confidence intervalP value*Hazard ratio95% confidence intervalP value*
NTUH (N = 104)1.121.07~1.181.4 × 10−61.291.06~1.560.012
TCGA (N = 97)1.051.00~1.090.0421.161.01~1.330.035
GSE12417-GPL96 (N = 163)1.081.04~1.128.7 × 10−51.281.12~1.473.7 × 10−4
GSE12417-GPL570 (N = 79)1.091.02~1.179.7 × 10−31.351.12~1.621.3 × 10−3

Cox regression univariate analysis.

Cox regression univariate analysis.

Association of the scoring system with clinical and molecular characteristics

A higher risk score was positively associated with older age, lower count of white blood cells, and higher count of platelets (Table 3). FAB M1 leukemia occurred less frequently in the higher score group. The profiles of genetic mutations were significantly different between higher and lower score groups: patients with higher scores more often had FLT3-ITD, RUNX1, MLL-PTD, ASXL1, and DNMT3A mutations, but less likely had NPM1 and CEBPA mutations (Table 4). In particular, nearly all CEBPA-mutated patients were in the lower score group, whereas all MLL-PTD mutated patients were in the higher score group.
Table 3

Correlation between mRNA score and clinical and laboratory features in CN-AML patients (n = 158)

VariantTotalmRNA ScoreP
Low (n = 79)High (n = 79)
Age* (years)58 (16–90)55 (18–87)62 (16–90)0.027
Age, in groups
>6076 (48.1%)32 (40.5%)44 (55.7%)0.056
>5097 (61.4%)43 (54.4%)54 (68.4%)0.072
Gender
 male90 (57.0%)44 (55.7%)46 (58.2%)0.748
Lab data*
 WBC (×103/μL)28.88 (0.65–423.0)41.38 (0.98–423.0)24.72 (0.65–341.4)0.042
 Blasts (×103/μL)13.77 (0–342.1)18.69 (0–342.1)10.78 (0–310.7)0.056
 Hemoglobin, g/dL8.1 (3.7–14.0)8.3 (4.2–14.0)7.9 (3.7–13.2)0.173
 Platelets (×103/μL)53.0 (6–331)42.0 (6–214)60.0 (9–331)0.006
 LDH (U/L)878.0 (274–13130)960.0 (354–13130)804.0 (274–7177)0.053
FAB0.156
 M02 (1.3%)02 (2.5%)0.155
 M137 (23.4%)24 (30.4%)13 (16.5%)0.039
 M253 (33.5%)28 (35.4%)25 (31.6%)0.613
 M452 (32.9%)22 (27.8%)30 (38.0%)0.176
 M512 (7.6%)4 (5.1%)8 (10.1%)0.230
 M62 (1.3%)1 (1.3%)1 (1.3%)>0.999

median (range)

Table 4

Correlation of mRNA score with other gene alterations

MutationTotal (n = 158)mRNA ScoreP
Lower (n = 79)Higher (n = 79)
NPM178 (49.4%)47 (59.5%)31 (39.2%)0.011
FLT3-ITD52 (32.9%)20 (25.3%)32 (40.5%)0.042
NPM1+/FLT3-ITD45 (28.5%)32 (40.5%)13 (16.5%)0.001
CEBPAdouble21 (13.5%)20 (25.6%)1 (1.3%)<0.001
CEBPA30 (19.2%)29 (37.2%)1 (1.3%)<0.001
WT113 (8.3%)4 (5.1%)9 (11.5%)0.148
RUNX126 (16.6%)2 (2.6%)24 (30.4%)<0.001
IDH112 (7.7%)6 (7.7%)6 (7.7%)>0.999
IDH228 (17.9%)18 (23.1%)10 (12.8%)0.095
FLT3-TKD14 (9.0%)7 (9.0%)7 (9.0%)>0.999
MLL-PTD10 (6.5%)010 (12.8%)0.001
KIT2 (1.3%)2 (2.6%)00.155
KRAS3 (1.9%)1 (1.3%)2 (2.6%)0.560
NRAS24 (15.4%)14 (17.9%)10 (12.8%)0.375
ASXL120 (12.7%)3 (3.8%)17 (21.5%)0.001
TET231 (19.9%)15 (19.2%)16 (20.5%)0.841
DNMT3A40 (25.6%)13 (16.7%)27 (34.6%)0.010
median (range)

Survival analysis

The univariate analysis of the clinical parameters and molecular alterations on OS in our CN-AML patients was shown in Table S1. Since higher risk scores seemed to be highly associated with other poor prognostic variables, we sought to investigate whether our scoring system functioned as an independent factor. We included variables that were significantly associated with clinical outcome from univariate analysis, including age, ELN (European LeukemiaNet) genetic group, MLL, RUNX1, TET2 mutations, and mRNA score, for multivariate analysis. We found higher scores appeared to be a strong independent risk factor (Table 5).
Table 5

Multivariate analysis (Cox regression) for the OS in CN-AML cohort

VariablesHazard ratio95% confidence intervalP value
Age1.0301.006~1.0540.012
ELN genetic group1.1910.472~3.0040.711
MLL#1.0580.289~3.8680.932
RUNX1#0.6530.261~1.6350.363
TET2#2.8761.234~6.7010.014
mRNA score1.1461.061~1.2380.001

ELN favorable risk vs. Intermediate-1 risk

mutated vs wild

ELN favorable risk vs. Intermediate-1 risk mutated vs wild

Biological functions associated with the scoring system

To gain biological insights into the risk scores, we further analyzed genes that were differentially expressed in patients with higher or lower risk scores. Patients with risk scores above and below the average by one standard deviation in the NTUH dataset were defined as the high-risk and low-risk groups, respectively. We identified 578 differentially expressed probes (Student's t-test P < 0.05 and >2-fold change) that corresponded to 509 unique genes. In the list, we identified some homeobox genes up-regulated in high-risk patients, including HOXA3 (t-test P = 4.56 × 10−5), HOXA5 (P = 2.73 × 10−7), HOXA6 (P = 3.34 × 10−7), HOXA9 (P = 4.72 × 10−8), HOXA10 (P = 5.83 × 10−8), HOXB2 (P = 5.95 × 10−8), HOXB3 (P = 5.61 × 10−5), HOXB4 (P = 1.37 × 10−6), HOXB5 (P = 3.90 × 10−6), HOXB6 (P = 2.65 × 10−3), HOXB7 (P = 2.21 × 10−3), HOXB8 (P = 1.78 × 10−2), MEIS1 (P = 6.18 × 10−7), and PBX3 (P = 5.02 × 10−4). The homeobox genes are well known for their crucial functions in stemness maintenance, and adverse prognosis in AML when their expression levels are elevated [15, 27, 28]. Furthermore, IPA revealed significant associations between the 509 differentially expressed genes with abundant important biological functions in AML (data available upon request), including proliferation of blood cells (P = 2.25 × 10−10), cell death of leukemia cell lines (P = 2.41 × 10−9), differentiation of hematopoietic progenitor cells (P = 9.85 × 10−7), and quantity of hematopoietic progenitor cells (P = 4.06 × 10−5). All of these functions were validated with significant enrichment by GSEA (empirical P-values all < 0.001; enrichment plots in Fig. 3 and Fig. S4). Taken together, our data indicate that the 11-gene scoring system modulates the treatment response of CN-AML patients through regulation of several crucial cellular functions.
Figure 3

GSEA enrichment plots on genes associated with

A. differentiation of hematopoietic progenitor cells and B. cell death of leukemic cell lines. Genes related to these two functions were significantly differentially expressed between the patients with higher and lower mRNA scores, suggesting significant correlations between these two pathways and the scoring.

GSEA enrichment plots on genes associated with

A. differentiation of hematopoietic progenitor cells and B. cell death of leukemic cell lines. Genes related to these two functions were significantly differentially expressed between the patients with higher and lower mRNA scores, suggesting significant correlations between these two pathways and the scoring.

DISCUSSION

In this study, we grouped the patients by their responses to induction chemotherapy in order to identify genes related to drug sensitivity. By this approach, we aimed to eliminate potential biases raised from our retrospective cohort. IPA analysis showed that 43 differentially expressed genes were closely related to biological functions associated with “leukemia”, “acute myeloid leukemia”, and “proliferation of myeloid cells”. Moreover, using GSEA, we found ABL1, BCL2, and CD33 were among leading-edge components of all three functional categories. Expression of BCL2, both at transcriptional and translational levels, is known to correlate with poor treatment responses to chemotherapy and low complete remission rates in AML [29-31]. While ABL1 has been relatively unexplored in AML, it has been implied for association with the resistance of chemotherapy in chronic myeloid leukemia [32]. Our data indicated these genes may be highly involved in crucial biological functions that determined treatment response. A high risk score was associated with unfavorable mutations (FLT3-ITD, MLL-PTD, ASXL1, RUNX1, and DNMT3A mutations) but inversely associated with favorable ones (NPM1 and CEBPA mutations). Radmacher et al. showed a strong correlation between the prognostic classifier and the status of FLT3-ITD, but not that of MLL-PTD, NPM1, or CEBPA mutations [16]. In addition, Metzeler et al. demonstrated that a high risk signature was associated with FLT3-ITD, WT1, RUNX1, and TET2 mutations, but inversely associated with CEBPA mutations [18]. Nevertheless, we demonstrated the independence of our signature from other important prognostic factors. These observations suggest that our scoring system bears similar biological implications to other parameters on clinical outcome yet stands alone as a new tool for prediction of treatment response. We constructed a simple mRNA signature as a prognostication tool based on the expression levels of 11 genes for CN-AML patients, a subgroup in which the requirement of prognostic parameters is yet unmet. Several study groups have developed gene expression signature for predicting prognosis in AML patients. In six studies [15–20, 33], the results were validated by independent datasets, as in ours. The 11 gene signature of our mRNA scoring system included GPR56, KIAA0125, TM4SF1, AIF1L, CXCR7, DNTT, H1F0, IFITM3, MX1, STAB1, and TNS3; only the first three were seen in another mRNA signature [17], reflecting the variations in study populations or treatment protocols. Although there was no overlap of the genes between our proposed scoring system and the 7-gene signature proposed from an epigenetic study [20] or the 24-gene score improving the ELN classification [33], our system-derived scores were moderately correlated with the scores from the two studies (correlation coefficients, 0.68 and 0.45, respectively, data not shown). Such concordance implies these prognostic scores, though derived from different analysis schemes, might represent common underlying biological mechanisms. Future study may further address this in a larger AML cohort. Current knowledge about the association between the 11 genes and malignancies is summarized in Table 6. In some studies aberrant expression of CXCR7 [34-36], DNTT [37, 38], GPR56 [39], H1F0 [40], and MX1 [41] is seen in leukemia. Little is known about the role of AIF1L and KIAA0125 [42] in pathogenesis of cancers. IFITM3 [43], MX1 [44-46], STAB1 [47], TM4SF1 [48, 49], and TNS3 [50-52] may have roles in various solid cancers, but they are not yet explored in AML. STAB1 [47], TM4SF1 [49], and TNS3 [52] are involved in cell adhesion and motility, which are relevant to cancer metastasis and invasion, however, their roles in interaction between leukemic stem cells and bone marrow niche deserve further investigation.
Table 6

Summary of the association between 11 genes and malignancy

ProbeGeneAssociation with leukemia or solid cancers
1410021AIF1LNo data
450424CXCR7Essential for the survival and growth of tumor cells [34]; Highly expressed in several human myeloid malignant cell lines [35]; Overexpressed in CN-AML patients with adverse clinical outcomes [36]
6280243DNTTLymphoid regulator, up-regulated in RUNX1-mutated CN-AML [37, 38]
5490768GPR56*Influencing adhesion, migration, homing and mobilization of AML stem cells through the RhoA signaling pathway, especially in EVI1 over-expressed leukemia [39]
630278H1F0Important for murine erythroleukemia cell differentiation [40]
6650242IFITM3Overexpression in gastric cancer [43]
3780647KIAA0125*Not reported yet in cancers, but involved in neurogenesis and the pathogenesis of Alzheimer's disease [42]
1690066MX1Diminished expression in AML [41]; Up-regulated in lymph node-positive colorectal cancer [44]; Down-regulated in renal cell carcinoma [45] and head and neck cancers [46]
6040053STAB1Cell adhesion and motility [47]
2680110TM4SF1*Prostate cancer [48]; Pancreatic cancer, cell adhesion and motility [49]
5560561TNS3Renal cell carcinoma [50, 51]; Breast cancer, cell adhesion and motility [52]

genes also seen in the classifier of Metzeler et al. [17]

genes also seen in the classifier of Metzeler et al. [17] In conclusion, we present a simple mRNA expression scoring system for prognostication of CN-AML. The scoring system was validated by three independent cohorts and has comparable performance as the system proposed by Marcucci et al. [20]. Our scoring system is composed of only 11 genes, making it highly potential in clinical use. Its positive association with multiple clinically relevant gene mutations suggests that it has incorporated the prognostic implications of multiple conventional risk factors. Our scoring system may provide another prognostic reference other than genetic mutations currently used for CN-AML. However, a large prospective cohort in which a q-PCR-based measurement of the expression of the 11 genes is necessary for clinical application of this scoring system.

MATERIALS AND METHODS

Patients

We recruited a total of 351 adult patients (≥15 years of age) with newly diagnosed de novo AML from 1995 to 2011 at the National Taiwan University Hospital (NTUH), who had adequate cryopreserved bone marrow cells for mRNA array studies. Patients with antecedent hematological malignancies or therapy-related AML were excluded. The bone marrow specimens were harvested by bone marrow aspiration. The mononuclear cells were isolated by treating with Ficoll-Paque. This study was performed in accordance with the Declaration of Helsinki and was approved by the Research Ethics Committee of the NTUH. We focused on the 158 patients (45.0%) with CN-AML. Among these patients, 104 (65.8%) received standard intensive chemotherapy as described previously [53]. Briefly, they received induction chemotherapy (idarubicin 12 mg/m2 per day on days 1 to 3 and cytarabine 100 mg/m2 per day on days 1 to 7) and then consolidation chemotherapy with 2 to 4 courses of high-dose cytarabine (2000 mg/m2 every 12 hours; total 8 doses), with or without an anthracycline (idarubicin or mitoxantrone), after achieving complete remission (CR). The remaining 54 patients were treated with palliative care or low-dose chemotherapy due to patients' preference or poor performance status. All 158 patients were included for analyses of correlation between the risk score and clinical and other biological parameters, but only the 104 patients who received standard intensive chemotherapy were included for survival analysis. Forty of the 104 patients received allogeneic hematopoietic stem cell transplantation (HSCT); they were censored on the day of stem cell infusion to avoid confounding factors brought by the HSCT therapy. In the survival analysis, >90% statistical power can be achieved based on a sample of at least 93 patients (at a 0.01 significance level to detect a hazard ratio of 2; calculated by PASS software (NCSS, Kaysville, Utah)).

Cytogenetic and mutation analysis

Chromosomal abnormalities and gene mutations were analyzed as described previously [3, 53–56].

mRNA microarray analysis and data processing

We profiled whole-genome gene expression of 158 patients using Illumina HumanHT-12 v4 Expression BeadChip (Illumina, San Diego, CA), following the manufacturer's instructions. Briefly, we verified RNA concentration and integrity with ND-1000 spectrophotometer (NanoDrop Technologies, Wilmington, DE) and 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA). 1.5 μg cRNA of each sample was hybridized to Illumina HumanHT-12 v4 Expression BeadChip. Intensities of bead fluorescence were detected with Illumina BeadArray Reader (Illumina, San Diego, CA), followed by transformation to numeric values using GenomeStudio v2010.1 software (Illumina, San Diego, CA). We performed Log-2 transformation and quantile normalization for the data to achieve normalized probe-level expression values. The microarray data have been deposited in the Gene Expression Omnibus (accession number: GSE71014).

External dataset processing

For validation analysis we included three independent gene expression datasets from two studies, one from The Cancer Genome Atlas (TCGA) [26] and two (GSE12417-GPL96 and GSE12417-GPL570) from the study of Metzeler et al. [17]. The TCGA dataset was composed of gene expression profiles from 197 AML samples (97 CN) achieved by Affymetrix Human Genome U133 Plus 2.0 Array. Level-2 data, which were probe-level pre-normalized signals processed by TCGA, were downloaded and transformed into Log-2 scale. The two datasets from the study by Metzeler et al., including 163 and 79 CN-AML patients, respectively, were profiled with Affymetrix Human Genome U133 Plus 2.0 Array. We used the authors' pre-processed datasets deposited in the Gene Expression Omnibus (GEO, accession ID GSE12417) [57]. In the three datasets, each of the genes with multiple probes was represented by the most “informative” probe that carried the largest coefficient of variation, defined as the ratio of per-probe standard deviation to per-probe average.

Statistical analysis

Statistical significance of differential expression of genes between two groups of samples was assessed using Student's t-test. For survival analysis, expression values of each dataset were first z-transformed (i.e., subtraction of sample mean followed by division by sample standard deviation for each probe) to approximately follow the normal distribution (zero mean and unity standard deviation). We then utilized the univariate Cox proportional hazards model to determine association between expression of individual genes and patient survival. We employed a ten-thousand-time random permutation test to evaluate the performance and randomness of constructed risk scoring system with the process as described previously [58]. Briefly, in each of the ten thousand iterations a random system was constructed by substituting the genes in the proposed scoring system with randomly selected ones from the microarray dataset. Each random system was tested for survival significance. After all iterations, significance of the proposed system was measured by the empirical P-value, which was simply the fraction of random risk systems that achieved higher Cox significance than the proposed system. We adopted Kaplan-Meier estimation to plot survival curves and used log-rank tests to examine the difference between groups. The patients who received allogeneic HSCT were censored on the day of cell infusion. Hazard ratio and 95% confidence interval were estimated by Cox proportional hazards regression models to determine independent risk factors associated with survival in multivariate analyses. For analysis of differential expression, two-sided P values from Student's t-test less than 0.05 were considered statistically significant. The whole patient population was included for analyses of correlation between the risk score and clinical characteristics and molecular alternations; however, only those receiving conventional standard chemotherapy, as mentioned above, were included in analyses of survivals.

Functional annotation analysis

In order to gain biological insights into identified groups of genes, we further incorporated two functional annotation tools, Ingenuity Pathway Analysis (IPA; Qiagen, Redwood City, CA) [24] and Gene Set Enrichment Analysis (GSEA, Java program downloadable athttp://www.broadinstitute.org/gsea/index.jsp) [25]. IPA is a knowledge-based database that features manual curation of a huge volume of published literatures. It employs Fisher's exact test to assess the significance of association between biological functions and the set of genes of interest (e.g. differentially expressed genes). We used GSEA to further verify the results of IPA. Instead of analyzing sets of genes of interest, GSEA is designed to detect whether a biological function is enriched in the whole-genome expression pattern; i.e., significant P-value from GSEA indicates significant overall enrichment of genes sharing a common function in genes with differential expressions. Here the gene sets (biological functions) were downloaded from the IPA database. Significance of enrichment was assessed based on the two-thousand-time random permutation test among genes.
  58 in total

1.  Prognostic relevance of FLT3-TKD mutations in AML: the combination matters--an analysis of 3082 patients.

Authors:  Ulrike Bacher; Claudia Haferlach; Wolfgang Kern; Torsten Haferlach; Susanne Schnittger
Journal:  Blood       Date:  2007-10-26       Impact factor: 22.113

2.  Prediction of chemosensitivity for patients with acute myeloid leukemia, according to expression levels of 28 genes selected by genome-wide complementary DNA microarray analysis.

Authors:  Jun-ichi Okutsu; Tatsuhiko Tsunoda; Yasuyuki Kaneta; Toyomasa Katagiri; Osamu Kitahara; Hitoshi Zembutsu; Rempei Yanagawa; Shuichi Miyawaki; Kazutaka Kuriyama; Nobuyuki Kubota; Yukihiro Kimura; Kohmei Kubo; Fumiharu Yagasaki; Toshio Higa; Hirokuni Taguchi; Tadasu Tobita; Hideki Akiyama; Akihiro Takeshita; Yan-Hua Wang; Toshiko Motoji; Ryuzo Ohno; Yusuke Nakamura
Journal:  Mol Cancer Ther       Date:  2002-10       Impact factor: 6.261

3.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

4.  A 3-microRNA scoring system for prognostication in de novo acute myeloid leukemia patients.

Authors:  M-K Chuang; Y-C Chiu; W-C Chou; H-A Hou; E Y Chuang; H-F Tien
Journal:  Leukemia       Date:  2014-11-27       Impact factor: 11.528

5.  Mutant nucleophosmin (NPM1) predicts favorable prognosis in younger adults with acute myeloid leukemia and normal cytogenetics: interaction with other gene mutations.

Authors:  Konstanze Döhner; Richard F Schlenk; Marianne Habdank; Claudia Scholl; Frank G Rücker; Andrea Corbacioglu; Lars Bullinger; Stefan Fröhling; Hartmut Döhner
Journal:  Blood       Date:  2005-07-28       Impact factor: 22.113

6.  Defining roles for HOX and MEIS1 genes in induction of acute myeloid leukemia.

Authors:  U Thorsteinsdottir; E Kroon; L Jerome; F Blasi; G Sauvageau
Journal:  Mol Cell Biol       Date:  2001-01       Impact factor: 4.272

7.  RUNX1 mutations are associated with poor outcome in younger and older patients with cytogenetically normal acute myeloid leukemia and with distinct gene and MicroRNA expression signatures.

Authors:  Jason H Mendler; Kati Maharry; Michael D Radmacher; Krzysztof Mrózek; Heiko Becker; Klaus H Metzeler; Sebastian Schwind; Susan P Whitman; Jihane Khalife; Jessica Kohlschmidt; Deedra Nicolet; Bayard L Powell; Thomas H Carter; Meir Wetzler; Joseph O Moore; Jonathan E Kolitz; Maria R Baer; Andrew J Carroll; Richard A Larson; Michael A Caligiuri; Guido Marcucci; Clara D Bloomfield
Journal:  J Clin Oncol       Date:  2012-07-02       Impact factor: 44.544

8.  Mutation of CEBPA in familial acute myeloid leukemia.

Authors:  Matthew L Smith; Jamie D Cavenagh; T Andrew Lister; Jude Fitzgibbon
Journal:  N Engl J Med       Date:  2004-12-02       Impact factor: 91.245

9.  Epigenetic silencing of CRABP2 and MX1 in head and neck tumors.

Authors:  Marilia F Calmon; Rodrigo V Rodrigues; Carla M Kaneto; Ricardo P Moura; Sabrina D Silva; Louise Danielle C Mota; Daniel G Pinheiro; Cesar Torres; Alex F de Carvalho; Patrícia M Cury; Fabio D Nunes; Ines Nobuko Nishimoto; Fernando A Soares; Adriana M A da Silva; Luis P Kowalski; Helena Brentani; Cleslei F Zanelli; Wilson A Silva; Paula Rahal; Eloiza H Tajara; Dirce M Carraro; Anamaria A Camargo; Sandro R Valentini
Journal:  Neoplasia       Date:  2009-12       Impact factor: 5.715

10.  CXCR7 is highly expressed in acute lymphoblastic leukemia and potentiates CXCR4 response to CXCL12.

Authors:  Rita de Cássia Carvalho Melo; Ana Leda Longhini; Carolina Louzão Bigarella; Mariana Ozello Baratti; Fabiola Traina; Patrícia Favaro; Paula de Melo Campos; Sara Teresinha Olalla Saad
Journal:  PLoS One       Date:  2014-01-31       Impact factor: 3.240

View more
  20 in total

1.  Construction of a Pyroptosis-Related Signature for Prognostic Prediction and Characterization of Immune Microenvironment in Acute Myelogenous Leukemia.

Authors:  Songyang Liu; Dongmei Luo; Jie Luo; Hanyin Liang; Yunfei Zhi; Dong Wang; Na Xu
Journal:  Int J Gen Med       Date:  2022-03-12

2.  A three-gene expression-based risk score can refine the European LeukemiaNet AML classification.

Authors:  Stefan Wilop; Wen-Chien Chou; Edgar Jost; Martina Crysandt; Jens Panse; Ming-Kai Chuang; Tim H Brümmendorf; Wolfgang Wagner; Hwei-Fang Tien; Behzad Kharabi Masouleh
Journal:  J Hematol Oncol       Date:  2016-09-01       Impact factor: 17.388

3.  High expression of dedicator of cytokinesis 1 (DOCK1) confers poor prognosis in acute myeloid leukemia.

Authors:  Sze-Hwei Lee; Yu-Chiao Chiu; Yi-Hung Li; Chien-Chin Lin; Hsin-An Hou; Wen-Chien Chou; Hwei-Fang Tien
Journal:  Oncotarget       Date:  2017-07-31

4.  The distinct biological implications of Asxl1 mutation and its roles in leukemogenesis revealed by a knock-in mouse model.

Authors:  Yueh-Chwen Hsu; Yu-Chiao Chiu; Chien-Chin Lin; Yuan-Yeh Kuo; Hsin-An Hou; Yi-Shiuan Tzeng; Chein-Jun Kao; Po-Han Chuang; Mei-Hsuan Tseng; Tzu-Hung Hsiao; Wen-Chien Chou; Hwei-Fang Tien
Journal:  J Hematol Oncol       Date:  2017-07-11       Impact factor: 17.388

5.  Higher HOPX expression is associated with distinct clinical and biological features and predicts poor prognosis in de novo acute myeloid leukemia.

Authors:  Chien-Chin Lin; Yueh-Chwen Hsu; Yi-Hung Li; Yuan-Yeh Kuo; Hsin-An Hou; Keng-Hsueh Lan; Tsung-Chih Chen; Yi-Shiuan Tzeng; Yi-Yi Kuo; Chein-Jun Kao; Po-Han Chuang; Mei-Hsuan Tseng; Yu-Chiao Chiu; Wen-Chien Chou; Hwei-Fang Tien
Journal:  Haematologica       Date:  2017-03-24       Impact factor: 9.941

6.  Association of a cytarabine chemosensitivity related gene expression signature with survival in cytogenetically normal acute myeloid leukemia.

Authors:  Han Yan; Lu Wen; Dan Tan; Pan Xie; Feng-Mei Pang; Hong-Hao Zhou; Wei Zhang; Zhao-Qian Liu; Jie Tang; Xi Li; Xiao-Ping Chen
Journal:  Oncotarget       Date:  2017-01-03

7.  High expression of HMGA2 independently predicts poor clinical outcomes in acute myeloid leukemia.

Authors:  Miriam Marquis; Cyrielle Beaubois; Vincent-Philippe Lavallée; Michal Abrahamowicz; Coraline Danieli; Sébastien Lemieux; Imran Ahmad; Andrew Wei; Stephen B Ting; Shaun Fleming; Anthony Schwarer; David Grimwade; William Grey; Robert K Hills; Paresh Vyas; Nigel Russell; Guy Sauvageau; Josée Hébert
Journal:  Blood Cancer J       Date:  2018-07-19       Impact factor: 11.037

8.  SOCS2 is part of a highly prognostic 4-gene signature in AML and promotes disease aggressiveness.

Authors:  Chi Huu Nguyen; Tobias Glüxam; Angela Schlerka; Katharina Bauer; Alexander M Grandits; Hubert Hackl; Oliver Dovey; Sabine Zöchbauer-Müller; Jonathan L Cooper; George S Vassiliou; Dagmar Stoiber; Rotraud Wieser; Gerwin Heller
Journal:  Sci Rep       Date:  2019-06-24       Impact factor: 4.379

9.  High NCALD expression predicts poor prognosis of cytogenetic normal acute myeloid leukemia.

Authors:  Ying Song; Weilong Zhang; Xue He; Xiaoni Liu; Ping Yang; Jing Wang; Kai Hu; Weiyou Liu; Xiuru Zhang; Hongmei Jing; Xiaoliang Yuan
Journal:  J Transl Med       Date:  2019-05-20       Impact factor: 5.531

10.  Genome-wide identification of FHL1 as a powerful prognostic candidate and potential therapeutic target in acute myeloid leukaemia.

Authors:  Yue Fu; Man Xu; Zelong Cui; Zongcheng Yang; Zhiyong Zhang; Xiaolin Yin; Xiangnan Huang; Minran Zhou; Xiaoming Wang; Chunyan Chen
Journal:  EBioMedicine       Date:  2020-02-12       Impact factor: 8.143

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.