Literature DB >> 24990611

Genomic profiling of thousands of candidate polymorphisms predicts risk of relapse in 778 Danish and German childhood acute lymphoblastic leukemia patients.

A Wesołowska-Andersen1, L Borst2, M D Dalgaard1, R Yadav1, K K Rasmussen2, P S Wehner3, M Rasmussen4, T F Ørntoft5, I Nordentoft5, R Koehler6, C R Bartram6, M Schrappe7, T Sicheritz-Ponten1, L Gautier1, H Marquart8, H O Madsen8, S Brunak1, M Stanulla9, R Gupta1, K Schmiegelow10.   

Abstract

Childhood acute lymphoblastic leukemia survival approaches 90%. New strategies are needed to identify the 10-15% who evade cure. We applied targeted, sequencing-based genotyping of 25 000 to 34 000 preselected potentially clinically relevant single-nucleotide polymorphisms (SNPs) to identify host genome profiles associated with relapse risk in 352 patients from the Nordic ALL92/2000 protocols and 426 patients from the German Berlin-Frankfurt-Munster (BFM) ALL2000 protocol. Patients were enrolled between 1992 and 2008 (median follow-up: 7.6 years). Eleven cross-validated SNPs were significantly associated with risk of relapse across protocols. SNP and biologic pathway level analyses associated relapse risk with leukemia aggressiveness, glucocorticosteroid pharmacology/response and drug transport/metabolism pathways. Classification and regression tree analysis identified three distinct risk groups defined by end of induction residual leukemia, white blood cell count and variants in myeloperoxidase (MPO), estrogen receptor 1 (ESR1), lamin B1 (LMNB1) and matrix metalloproteinase-7 (MMP7) genes, ATP-binding cassette transporters and glucocorticosteroid transcription regulation pathways. Relapse rates ranged from 4% (95% confidence interval (CI): 1.6-6.3%) for the best group (72% of patients) to 76% (95% CI: 41-90%) for the worst group (5% of patients, P<0.001). Validation of these findings and similar approaches to identify SNPs associated with toxicities may allow future individualized relapse and toxicity risk-based treatments adaptation.

Entities:  

Mesh:

Year:  2014        PMID: 24990611      PMCID: PMC4320289          DOI: 10.1038/leu.2014.205

Source DB:  PubMed          Journal:  Leukemia        ISSN: 0887-6924            Impact factor:   11.528


Introduction

Host genome variants affect the complex biology involved in pharmacokinetics/pharmacodynamics and need to be addressed to identify the critical factors determining treatment outcomes. Childhood acute lymphoblastic leukemia (ALL) has for several reasons been a model disease for such research owing to its frequency, well-described epidemiology, clinical characteristics and biologic profiles within cytogenetically defined subsets. In general, childhood ALL is very chemosensitive, and patients are almost uniformly treated within collaborative groups that stratify treatment according to known molecular aberrations in the leukemic clone, clinical characteristics and treatment response.[1,2] Most contemporary treatment protocols achieve 5-year event-free survival rates above 80%.[3] However, even with risk group-adapted treatment there is a wide diversity in cure rates, partly explainable by both host and cancer genomes.[3,4] Several candidate gene studies have associated inherited polymorphisms with treatment response and cure rates in childhood ALL.[1,2,5, 6, 7] Still, the biologic relevance and interpretation of multiple single-nucleotide polymorphisms (SNPs) identified in recent genome-wide association studies (GWAS) remain unclear, as the functions of several of the most significant genes are unknown,[8] and effect sizes of single SNP associations are in any case extremely limited. To address some of these limitations and complement the GWAS approach, we applied a multiplexed targeted sequencing method allowing screening of 25 000–34 000 preselected SNPs within biologic domains potentially relevant to childhood ALL, allowing both single variants and multiple SNPs acting in the same pathways to be explored for association with relapse, and finally combining it with known clinical risk factors in a predictive profile.[9]

Materials and methods

Danish patients were 1–15 years of age and diagnosed 1992–2008 with B-cell precursor ALL (BCP-ALL) or T-lineage ALL and treated according to the Nordic Society for Pediatric Hematology and Oncology (NOPHO) ALL92 (N=227) or NOPHO ALL2000 protocols (N=268) (Figure 1, Supplementary Online Material and Supplementary Tables 1). These two treatment protocols are very similar and included a 4-week, 3-drug induction phase (prednisolone, vincristine and doxorubicin (with intrathecal methotrexate)) with no glucocorticosteroid prephase, a risk group-adapted consolidation phase and methotrexate/6-mercaptopurine maintenance therapy up until 2 to 2.5 years from diagnosis[10] (detailed in Supplementary Online Material). End of induction minimal residual disease (MRD) measurements were available for 73% of NOPHO ALL2000 patients, but not included in risk stratification.[10] For cross-protocol validation of relapse-predictive host genomic variants, 500 German childhood ALL patients 1–18 years at diagnosis were included, all treated according to the BFM ALL2000 protocol. The BFM ALL2000 protocol included a 1-week prednisolone prephase, a 4-week, 4-drug induction phase (prednisolone or dexamethasone, vincristine, doxorubicin and L-asparaginase (with intrathecal methotrexate)), a risk group-adapted consolidation phase and methotrexate/6-mercaptopurine maintenance therapy until 2 years from diagnosis[1,11,12] (Supplementary Online Material). A total of 352 Danish and 426 German patients were eligible for the final relapse-risk analysis (Figure 1 and Supplementary Tables S1). Patients were enrolled between 1992 and 2008 with a median follow-up of 7.6 years (50% range: 5.3–9.2 years) for patients in first remission. In this study, both bone marrow and extramedullary relapses were events of interest. Owing to the complexity of the bioinformatic analysis and the application of nonlinear machine learning, and as it is uncertain how to weight competing events (induction failures, deaths in remission and second cancers), we chose, in this exploratory, hypothesis-generating study, to exclude patients with such events already at the time of diagnosis of ALL (Figure 1). The study was conducted in accordance with the Declaration of Helsinki Principles of 1975, and approved by the Danish Data Protection Agency, the Committee on Biomedical Research Ethics and by the Ethics Committee of the Hannover Medical School, Hannover, Germany.
Figure 1

Patient flow. Overview of the patients included in the study. CEU, Utah residents with ancestry from northern and western Europe. *BFM patients were selected upon availability of germline DNA and consecutively enrolled up to the predefined number of 500.

SNP selection and bait design have been previously described in detail (Wesolowska et al.[9] and Supplementary Online Material). SNPs were selected to cover all known and putative clinically relevant genetic variation with regard to childhood ALL treatment (13 drugs), drug pharmacokinetics and pharmacodynamics, relapse risk and several toxicities (Supplementary Figure S1). In short, clinically important genes and SNPs for the 13 most commonly administered antileukemic drugs were evaluated with regard to genes encoding proteins involved in metabolism, transport, target proteins, regulation of drug-target response and to some extent drug-related toxicity (e.g. coagulation, immune function) (Supplementary Online Material). Initially, this was carried out by literature curation, and then expanded to cover additional aspects of response to chemotherapy such as genes encoding proteins involved in apoptosis pathways and DNA repair. This also included consultations with experts within such areas and through various online resources (Supplementary Online Material). Finally, known drug–protein associations and first-order protein–protein interactions were evaluated and added (Supplementary Online Material and Supplementary Figure S1).[9] Approximately half of the selected genomic variants were in noncoding regions. The selected genes were then screened for known polymorphisms with putative functional consequence on their transcript (Supplementary Table S3). Baits for the SureSelect Target Enrichment System (protocol version 1.2 April 2009; Agilent Technologies, Santa Clara, CA, USA) were designed for all identified SNPs. The first design included baits targeting 25 086 clinically relevant SNPs in 1540 genes (Supplementary Figure S2). As baits covered more genomic material than just the targeted SNPs, readouts on the targeted regions covered 116 646 known human variants (Supplementary Table S4). This panel was subsequently updated after sequencing the first group of patients (from NOPHO ALL2000) (Supplementary Online Material and Supplementary Tables S3 and S4). Thus, the NOPHO ALL92 and BFM ALL2000 cohorts were screened using an expanded bait design, covering 33 683 SNPs in 2254 genes. The genomic targets of the two panels are shown in Supplementary Figure S2. The patients were genotyped for either of the two SNP panels by multiplex targeted sequencing as described previously.[9] Briefly, the Agilent SureSelect Target Enrichment System protocol (protocol version 1.2 April 2009; Agilent Technologies) was modified, allowing nucleotide barcoding, and then pooling of samples before target enrichment and sequencing. Genomic DNA from leukemia remission samples was sheared, purified, end-repaired and 3′ adenylated. Custom-made adaptors containing unique four-base barcodes were ligated to the DNA fragments before size selection and amplification. The DNA libraries were subsequently mixed in groups of up to eight samples, and the pooled libraries were hybridized to the custom-designed baits. Posthybridization amplification PCR was performed, and the pooled libraries were sequenced (Supplementary Online Material). The sequencing reads were mapped to the reference human genome (GRCh37), and SNP calling was performed (Supplementary Figure S3). The threshold set for SNP calling was minimum 10 × sequencing depth. Data quality control and the individual–SNP association analyses were performed in PLINK (version 1.07).[13] Patient samples not clustering together with the HapMap central European ancestry samples in the principal component analysis (Supplementary Figures S4 and S5) were removed.[14] To make sure that the data did not contain any bias arising either from sequencing or data processing, the observed minor allele frequencies (MAFs) of the genotyped SNPs were plotted against the HapMap CEU-reported MAFs for the 5962 SNPs, where data were available (Supplementary Figure S7). Furthermore, SNPs for which <50% of the patients could be genotyped at the minimum depth of 10 × and SNPs with MAF <1% were excluded from the analyses. To validate readouts, genotyping from this experiment was compared with genotype calls obtained from the Illumina Human 1M-Duo SNP chip performed on a subset of patients (N=275, overlap of 2394 SNPs on both platforms). The mean genotyping concordance of those two methods was 98.2% (Supplementary Online Material).

SNP associations

Single SNPs were explored for associations with risk of relapse using a Fisher's exact test separately for the NOPHO and BFM cohorts. P-values obtained were corrected for multiple testing by adaptive permutation, and subsequently only SNPs with adjusted P-values below 0.05 in both cohorts were considered significant.[15] Besides investigating such contribution of individual variations, the combined effects of multiple SNPs acting in the same biologic pathways were investigated using nonlinear machine learning. All pathways from the Reactome database[16] and 12-drug metabolism pathways from the PharmGKB database (ref. 27 in Supplementary Online Material), with up to 193 SNPs in a pathway, were interrogated. Briefly, ANN models including all combinations of up to three SNPs together with white blood cell counts (WBC) and age at diagnosis were tested within each pathway and stepwise expanded with up to 15 SNPs in a pathway, if threefold cross-validated performance improved (Supplementary Online Material) (Figure 2). The best model for each pathway was then chosen, and the most informative pathway models were combined with clinical information in classification and regression tree (CART) analysis assigning for each pathway a score from 1 to 10 for every patient based on the relapse prediction from the neural network. As induction therapy differed between the Danish and German cohorts, each of these was dichotomously subdivided based on the approximate median end of induction MRD levels in each cohort being 10−3 and 10−4 in NOPHO and BFM cohorts, respectively (Supplementary Online Material).
Figure 2

Flow diagram of the ANN models and CART (see also Supplementary Online Material). For pathway analysis, we included all nonsynonymous coding, frameshift coding, stop codon and splice site SNPs genotyped in this study with MAF above 0.005 residing in the pathway genes for pathways in Reactome database and for the 12-drug metabolism pathways from the PharmGKB database. Each pathway had between 1 and 193 SNPs, and each SNP was encoded by three values between 0 and 1 corresponding to likelihood of each genotype calculated from VCF file produced by SAMtools (see Supplementary Online Material). Associations with relapse risk were performed by training feedforward ANNs with backpropagation on subsets of SNPs from each pathway with threefold cross-validation. For each pathway, all combinations of up to three SNPs were assessed by means of MCC. The combinations were then further iteratively increased up to 15 SNPs by adding another SNP to the top 20 previous combinations of SNPs, if the MCC increased by at least 0.01. Pathways were then ranked by MCC of the best combination of SNPs for each pathway, and the most predictive pathways for relapse were then included in the CART analysis. This included the 426 patients with complete information on sex, age and WBC at diagnosis, immunophenotype, karyotype, end of induction MRD and risk group. For the large group of patients with low MRD, the SNP profiles of the top Reactome/PharmGKB pathways were included to explore their relapse prediction for this patient subset.

Results

Of the genotyped patients (N=869), 808 fulfilled the quality control and the European ancestry criteria (Figure 1), of which 778 patients could be included in the final relapse-risk analysis. The majority of genotyped variants had very low heterogeneity (MAF <1%) between patients and thus did not contribute with sufficient power to the single SNP analysis. Thus, a total of 4260 and 3865 SNPs in the NOPHO and the BFM cohorts were included in the relapse-risk analysis, respectively. The QQ plots showed good agreement with the null distribution and absence of genomic inflation (Supplementary Figure S8), and several loci were significantly associated with risk of relapse (Supplementary Figure S9). A total of 188 and 152 SNPs were associated with relapse risk in the NOPHO and BFM cohorts, respectively (Supplementary Tables S5 and S6). As some of these will reflect chance findings, we included in the subsequent analyses only the 11 SNPs that were associated with relapse risk in both cohorts. Importantly, these 11 SNPs were related to risk of relapse independent of other known risk factors (Table 1, Supplementary Table S7 and Supplementary Online Material). As further support of their biologic significance, Kaplan–Meier analyses for these SNPs showed a general tendency of gene dose effects (0 versus 1 versus 2 alleles associated with increased risk of relapse) with log-rank trend P-values ranging from 4.8 × 10−6 for rs3216144 (matrix metalloproteinase 7, MMP7) to 0.03 for rs35721373 (dysferlin, DYSF), respectively (Supplementary Figure S9).
Table 1

SNPs associated with risk of relapse discovered in both Danish and German cohorts

SNP
NOPHO ALL92 and 2000
BFM ALL2000
Combined cohorts
Rs IDGeneConsP-valueORMAF relapseMAF CRP-valueORMAF relapseMAF CRMAF relapseMAF CRP-valueOR
Rs3216144MMP7Regulatory4.0E−040.140.050.262.0E−030.320.090.240.070.246.0E−060.26
Rs10502001MMP7NSC4.0E−040.140.050.252.0E−030.320.090.240.080.246.0E−060.26
Rs10795242AKR1C3Intronic8.6E−032.130.310.173.7E−021.850.240.150.270.165.0E−042.01
Rs28730837MPONSC4.7E−024.260.060.021.5E−023.190.060.020.060.021.0E−033.60
Rs6139873CHGBNSC4.0E−0319.560.110.013.9E−023.500.040.010.060.011.1E−035.75
Rs1293945ESR1Regulatory2.0E−021.860.590.432.6E−021.950.590.420.590.431.3E−031.91
Rs3763156LMNB1Intronic2.0E−023.060.190.074.0E−022.150.150.070.160.071.6E−032.43
Rs55684978HTR3DSC1.6E−028.080.070.013.2E−023.850.040.010.050.011.8E−034.88
Rs1058047TMED7Splice site1.3E−028.640.080.013.9E−023.200.050.020.060.023.1E−033.98
Rs35721373DYSFSC1.5E−024.890.130.033.1E−022.520.090.040.100.043.3E−032.96
Rs6601899AKR1C3Intronic3.9E−021.770.310.203.6E−021.800.240.150.270.173.3E−031.84

Abbreviations: cons, consequence of the SNP on its transcript from Ensembl Variant Effect Predictor; MAF CR, minor allele frequency in complete remission patients; MAF relapse, minor allele frequency in relapse patients; NSC, nonsynonymous coding; OR, odds ratio; P-value, adaptive permutation P-values; SC, synonymous coding.

Next, functional SNPs were grouped by biologic pathways, and the relevance of the pathways to risk of relapse was assessed by training ANNs on different combinations of SNPs from each pathway, allowing nonlinear correlations between SNPs. The top-associated Reactome pathway[16] ranked by Matthew's correlation coefficient[17] (MCC, ranging from 0 to 1) was ‘ATP-binding cassette (ABC) family protein-mediated transport' (MCC=0.33, area under the receiver-operator curve (AUC)=0.69) (Supplementary Table S8). Similarly, 12-drug metabolism pathways from the PharmGKB database relevant for the administered drugs were investigated,[18] and the top pathways were ‘Vinka Alkaloid Pathway, Pharmacokinetics' (MCC=0.321, AUC=0.72), ‘Glucocorticoid Pathway (Peripheral Tissue), Pharmacodynamics' (MCC=0.320, AUC=0.75) and ‘Methotrexate Pathway (Brain Cell), Pharmacokinetics' (MCC=0.320, AUC=0.70) (Supplementary Table S9). To explore and illustrate the combined significance of clinical data and host genomic findings, CART analysis was applied to subclassify sequentially patients in a multivariate model, including the 426 patients from both cohorts with complete information on sex, age, immunophenotype, WBC, leukemia karyotype, end of induction MRD, risk group and genotypes of the 11 cross-cohort-associated SNPs (Figure 3a). A total of 426 patients were included. End of induction MRD levels divided the patients into two major groups: one group with high MRD levels (i.e. above median for that cohort) and a high risk of relapse, which could be further stratified by SNPs in the myeloperoxidase (MPO), estrogen receptor 1 (ESR1), lamin B1 (LMNB1) and MMP7 genes, and another large group with low MRD levels and low cumulative relapse risk. As the latter group, because of its size, accounts for 25% of all relapses, a subsequent CART analysis with pathway profiles for the top 10 Reactome and the PharmGKB pathways was added (all having AUCs of approximately 0.70), which indicated a role of ABC transporters and glucocorticosteroid transcription regulation pathways (Figure 3b). Finally, based on MRD, WBC, the aforementioned host genomic data and the resulting observed incidence of relapse (Figures 3a and b), we could define three large subsets of patients with significant differences in the risk of relapse (P<0.001). With 92% of projected relapses by Kaplan–Meier analysis having been observed in the total cohort, the two extreme subgroups of patients had 6-year cumulative risks of relapse of 4% (95% confidence interval: 1.6–6.3%) for the best outcome group (71.5% of all patients) and 76% (95% confidence interval: 41–90%) for the worst outcome group (5% of all patients) (Figure 3c and Table 2), leaving age, immunophenotype (BCP versus T-ALL), cytogenetics (Table 2; high risk, that is, t(9;22), hypodiploid, t(4;11); low risk, that is, high hyperdiploid, t(12;21); other) and risk group nonsignificant with this approach. As MRD was not available for all patients, an additional CART analysis was performed not including MRD (Supplementary Figure S10). All four SNPs from the original CART diagram remained significant for relapse prediction. Survival analysis taking into account the time to event was also performed (Supplementary Figure S11), with results largely agreeing with the model in Figure 3. Multivariate regression analysis to predict risk of relapse found all the features selected by CART analysis to be statistically significant, with MRD being the most significant clinical factor (P=7.9 × 10−6), MPO rs28730837 being the most significant SNP region (P=0.002) and glucocorticosteroid pathway (transcription regulation, pharmacodynamics) (P=2.0 × 10−13) being the most significant pathway.
Figure 3

CART analysis of sequentially subclassified patients by clinical data including WBC, end of induction MRD and genotypes of cross-cohort relapse-associated SNPs for the 426 patients from both cohorts for whom these data were available. The most discriminatory WBC value (74.5 × 109/l) is selected by the CART algorithm. Black and white color in the pie charts represents the percentage of patients who experienced a relapse (black) or stayed in complete remission (white). One group with above median (for that cohort) MRD levels and a high risk of relapse could be further stratified by SNPs in the myeloperoxidase (MPO), estrogen receptor 1 (ESR1), lamin B1 (LMNB1) and MMP7 genes (a). Another group with low MRD and low cumulative relapse risk could be further stratified by pathway profiles of ABC transporters and glucocorticosteroid transcription regulation pathways (b). (c) Kaplan–Meier plots of relapse risk for three subsets of patients identified by the CART analysis. The groups were defined by the observed incidence of relapse within each node of the graph in panels a and b as marked with [A], [B] or [C] for the best, intermediate and worst outcome group, respectively (P<0.001). Vertical lines depict patients with relapse or lack of further follow-up. F, favorable; GC, glucocorticosteroids; PD, pharmacodynamics; U, unfavorable.

Table 2

Clinical characteristics of the patients in the three CART groups

CharacteristicsGroup [A] patientsGroup [B] patientsGroup [C] patients
Total patients (%)305 (71.5)100 (23.5)21 (5)
    
Gender
 Male174 (57)63 (63)12 (57)
 Female131 (43)37 (37)9 (43)
    
Age (years), mean (50% range)5.32 (2.89–6.71)7.38 (3.54–10.97)7.09 (3.75–10.88)
WBC × 109/l, mean (50% range)22.88 (4.6–26.5)100.36 (7.5–108)90.61 (4.51–121)
    
Immunophenotype
 BCP-ALL279 (91.5)77 (77)17 (81)
 T-ALL25 (8.2)23 (23)4 (19)
 Other1 (0.3)0 (0)0 (0)
    
MRD
 Low237 (77.7)67 (67)12 (57)
 High68 (22.3)33 (33)9 (43)
    
Cytogenetics
 t(9;22)4 (1.3)1 (1)1 (4.8)
 t(1;19)3 (1)1 (1)0 (0)
 t(12;21)79 (25.9)13 (13)2 (9.5)
 t(4;11)2 (0.65)1 (1)0 (0)
 Hypodiploid2 (0.65)3 (3)0 (0)
 Hyperdiploid81 (26.5)16 (16)1 (4.8)
 Other17 (5.6)5 (5)2 (9.5)
 Normal/no data120 (39.3)61 (61)15 (71.4)
    
Relapse distribution
 Total112314
 Bone marrow8 (72.7)19 (82.6)12 (85.7)
 CNS1 (9.1)3 (13.1)4 (28.6)
 Other2 (18.2)2 (8.7)0 (0)

Abbreviations: ALL, acute lymphoblastic leukemia; BCP-ALL, B-cell precursor ALL; CART, classification and regression tree; CNS, central nervous system relapses (isolated and combined CNS relapses); MRD, minimal residual disease; T-ALL, T-lineage acute lymphoblastic leukemia; WBC, white blood cell count.

Discussion

Today, many high-risk patients do not respond to intensified treatment, but most relapses occur among non-high-risk patients. The present study, using genomic candidate gene genotyping and front-line bioinformatics analyses, provides a novel biology/pharmacology-driven approach for outcome prediction and goes beyond conventional genome-wide association studies (GWAS) approaches, while still being more cost-effective than whole-genome sequencing (for cost details see Supplementary Online Material). The rapidly growing understanding of the complex human genome and its derived functional biology allows selection of candidate SNPs based on the current knowledge of pharmacogenomics, disease mechanisms, signaling pathways and protein interactions. Large-scale, genomic candidate gene setup facilitates not only single SNP investigations but also associations of multiple variations grouped by their putative function. Further associations of combinations of SNPs grouped by biologic pathways tested with neural network models enable detection of meaningful nonlinear SNP interactions. The results obtained through these strategies can provide conclusions at new levels of genomic complexity, collectively emphasizing the importance of specific biologic mechanisms for the phenotype. The 11 cross-cohort relapse-associated SNPs resided in genes previously suggested as markers for leukemia aggressiveness, involved in steroid response, implicated in resistance mechanisms or toxicities of certain drugs (Supplementary Online Material). The pathway analysis strongly indicated an importance of the ABC family protein-mediated transport, activation of matrix metalloproteinases, toll-like receptor cascade signaling and various signal-transduction pathways, as well as involvement of the cell cycle. The top-associated pathway ‘ABC family protein-mediated transport' confirms the role of pharmacogenomics in drug response by pointing to the importance of drug transport. The ATP-dependent drug efflux pumps have broad substrate specificities; they influence drug accumulation and are associated with the development of resistance to anticancer drugs.[19] Matrix metalloproteinases are involved in tumor progression,[20] whereas the toll-like receptor cascades and nucleotide-binding oligomerization domain-containing protein 1 and 2 signaling pathways support a role for the innate immune system function for proper drug response or ALL biology.[21,22] It is noteworthy that the three top drug metabolism pathways for relapse risk corresponded to pharmacodynamics of glucocorticosteroids and pharmacokinetics of vinka alkaloids and methotrexate. This emphasizes the clinical significance of glucocorticosteroid therapy and cell cycle-arresting agents such as vincristine and methotrexate, and also highlights the potential profound consequence of adverse drug disposition on the effect of these agents. As risk factors linked to host genomics, leukemia biology, treatment response markers and drug pharmacokinetics accumulate, CART analysis facilitates integration of the different layers of molecular complexity with patients' clinical characteristics to identify groups of patients with distinct treatment outcomes, and accordingly candidates for such treatment adaption. Thus, combining patients' genotypes with their clinical features is likely to explain treatment outcome better than single SNPs. Even though the binary recursive partitioning of the CART methodology offers an approach to patient classifications that is easier to perceive from a biologic point of view and to apply in the clinical setting, CART does not necessarily outperform conventional, stepwise, multivariate regression analyses. Interactions between variables (and combinations hereof) may be missed, and the downstream decision branching may be unstable if one variable is removed. Although the CART analysis presented here classified patients into highly different prognostic subsets, and conventional regression analysis confirmed the most significant clinical feature, SNP and pathway, larger independent data sets will be needed to determine which of these two biostatistical approaches is superior for relapse prediction. In future trials, the small group of patients who have the highest risk of relapse can be offered intensified treatments, or be candidates for phase 2 trials, whereas treatment reduction to avoid specific toxicities may be relevant for the large subset of patients who have a projected risk of relapse of <5%. In this context, the least relapse-predictive drug metabolism pathways may point to which drug doses potentially could be safely reduced in complex combination chemotherapy, and/or to which patients a specific drug is important owing to the patient's genomic variants. Although further verification is needed to strengthen the findings, the present approach has, when compared with other large-scale studies, the clear advantage of specifically targeting potential causative variants, which reduces the need to investigate genomic patterns of linkage disequilibrium, or conducting follow-up fine-mapping studies. Functional prioritization of the potential SNPs to be genotyped eliminates the difficulties linked to GWAS when mapping significant variants to genes and genotypes, and can be more directly combined in functional analyses. As an example, only one of the 134 SNPs associated to relapse risk in the recent childhood ALL GWAS study of Yang et al.[8] was also genotyped in the present study, as the remaining SNPs resided in noncoding or functionally unannotated regions, and it was not found significant. Moreover, as the associated SNPs in most cases were difficult to map to the gene on which they exert their effects, it is difficult to evaluate the involved biologic mechanisms. The present study combines large-scale genetic investigations of a GWAS approach with the targeted focus of a candidate gene approach. Thus, the major strength of our approach is that a wide panel of functional SNPs was selected with various prior assumptions of their potential relevance for childhood ALL treatment outcome. Owing to annotation to specific pathways, the results obtained are easier to interpret, and eventually to implement in the clinic based on existing knowledge of ALL disease mechanisms and pharmacokinetics and pharmacodynamics of administered drugs. Thus, the SNP/pathway-based approach can also be applied for prediction of specific side effects of antileukemic therapy. Host genome profiles may then indicate drugs with little importance for cure but associated with a high risk for side effects for specific patients, and this information could be applied for tailored therapy. Accordingly, the strategies and findings of this paper will be validated in the ongoing Nordic/Baltic NOPHO ALL2008 protocol addressing risk grouping,[23] occurrence of toxicities[24] (including infections[25]) and risk of relapse before they are included into future treatment allocation. We acknowledge that not covering the whole genome will certainly miss unknown, important genetic components of treatment response, which may benefit from alternative genome-wide screening approaches such as GWAS or whole-exome/transcriptome/genome sequencing, although the latter is still burdened by high costs owing to required patient numbers for statistical significance, and requires extensive follow-up for unannotated variants to be credibly accepted. In conclusion, this large-scale integration of the knowledge of disease mechanisms and drug pharmacokinetics/pharmacodynamics in host genome studies in childhood ALL can offer both significant improvements to the current relapse prediction algorithms, and, importantly, indicate specific directions for leukemia response and toxicity risk-based treatments adaptation for the individual patient.
  25 in total

1.  Minimal residual disease-directed risk stratification using real-time quantitative PCR analysis of immunoglobulin and T-cell receptor gene rearrangements in the international multicenter trial AIEOP-BFM ALL 2000 for childhood acute lymphoblastic leukemia.

Authors:  T Flohr; A Schrauder; G Cazzaniga; R Panzer-Grümayer; V van der Velden; S Fischer; M Stanulla; G Basso; F K Niggli; B W Schäfer; R Sutton; R Koehler; M Zimmermann; M G Valsecchi; H Gadner; G Masera; M Schrappe; J J M van Dongen; A Biondi; C R Bartram
Journal:  Leukemia       Date:  2008-01-31       Impact factor: 11.528

2.  Cost-effective multiplexing before capture allows screening of 25 000 clinically relevant SNPs in childhood acute lymphoblastic leukemia.

Authors:  A Wesolowska; M D Dalgaard; L Borst; L Gautier; M Bak; N Weinhold; B F Nielsen; L R Helt; K Audouze; J Nersting; N Tommerup; S Brunak; T Sicheritz-Ponten; H Leffers; K Schmiegelow; R Gupta
Journal:  Leukemia       Date:  2011-03-18       Impact factor: 11.528

3.  PharmGKB: the Pharmacogenomics Knowledge Base.

Authors:  Caroline F Thorn; Teri E Klein; Russ B Altman
Journal:  Methods Mol Biol       Date:  2013

4.  Thiopurine methyltransferase (TPMT) genotype and early treatment response to mercaptopurine in childhood acute lymphoblastic leukemia.

Authors:  Martin Stanulla; Elke Schaeffeler; Thomas Flohr; Gunnar Cario; André Schrauder; Martin Zimmermann; Karl Welte; Wolf-Dieter Ludwig; Claus R Bartram; Ulrich M Zanger; Michel Eichelbaum; Martin Schrappe; Matthias Schwab
Journal:  JAMA       Date:  2005-03-23       Impact factor: 56.272

5.  Long-term results of NOPHO ALL-92 and ALL-2000 studies of childhood acute lymphoblastic leukemia.

Authors:  K Schmiegelow; E Forestier; M Hellebostad; M Heyman; J Kristinsson; S Söderhäll; M Taskinen
Journal:  Leukemia       Date:  2009-12-10       Impact factor: 11.528

Review 6.  Pharmacogenetics influence treatment efficacy in childhood acute lymphoblastic leukemia.

Authors:  Marie Louise Davidsen; Kim Dalhoff; Kjeld Schmiegelow
Journal:  J Pediatr Hematol Oncol       Date:  2008-11       Impact factor: 1.289

7.  Risk group assignment differs for children and adults 1-45 yr with acute lymphoblastic leukemia treated by the NOPHO ALL-2008 protocol.

Authors:  Nina Toft; Henrik Birgens; Jonas Abrahamsson; Per Bernell; Laimonas Griškevičius; Helene Hallböök; Mats Heyman; Mette Skov Holm; Erik Hulegårdh; Tobias Wirenfeldt Klausen; Hanne V Marquart; Olafur Gísli Jónsson; Ove Juul Nielsen; Petter Quist-Paulsen; Mervi Taskinen; Goda Vaitkeviciene; Kim Vettenranta; Ann Åsberg; Kjeld Schmiegelow
Journal:  Eur J Haematol       Date:  2013-04-02       Impact factor: 2.997

8.  Pharmacogenetics of minimal residual disease response in children with B-precursor acute lymphoblastic leukemia: a report from the Children's Oncology Group.

Authors:  Stella M Davies; Michael J Borowitz; Gary L Rosner; Kristin Ritz; Meenakshi Devidas; Naomi Winick; Paul L Martin; Paul Bowman; James Elliott; Cheryl Willman; Soma Das; Edwin H Cook; Mary V Relling
Journal:  Blood       Date:  2008-01-08       Impact factor: 22.113

Review 9.  Roles of NOD1 (NLRC1) and NOD2 (NLRC2) in innate immunity and inflammatory diseases.

Authors:  Ricardo G Correa; Snezana Milutinovic; John C Reed
Journal:  Biosci Rep       Date:  2012-12       Impact factor: 3.840

10.  The Reactome pathway knowledgebase.

Authors:  David Croft; Antonio Fabregat Mundo; Robin Haw; Marija Milacic; Joel Weiser; Guanming Wu; Michael Caudy; Phani Garapati; Marc Gillespie; Maulik R Kamdar; Bijay Jassal; Steven Jupe; Lisa Matthews; Bruce May; Stanislav Palatnik; Karen Rothfels; Veronica Shamovsky; Heeyeon Song; Mark Williams; Ewan Birney; Henning Hermjakob; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2013-11-15       Impact factor: 16.971

View more
  10 in total

Review 1.  Inherited genetic variation in childhood acute lymphoblastic leukemia.

Authors:  Takaya Moriyama; Mary V Relling; Jun J Yang
Journal:  Blood       Date:  2015-05-21       Impact factor: 22.113

Review 2.  Childhood Acute Lymphoblastic Leukemia: Progress Through Collaboration.

Authors:  Ching-Hon Pui; Jun J Yang; Stephen P Hunger; Rob Pieters; Martin Schrappe; Andrea Biondi; Ajay Vora; André Baruchel; Lewis B Silverman; Kjeld Schmiegelow; Gabriele Escherich; Keizo Horibe; Yves C M Benoit; Shai Izraeli; Allen Eng Juh Yeoh; Der-Cherng Liang; James R Downing; William E Evans; Mary V Relling; Charles G Mullighan
Journal:  J Clin Oncol       Date:  2015-08-24       Impact factor: 44.544

3.  Transcriptomic and proteomic analysis of mouse radiation-induced acute myeloid leukaemia (AML).

Authors:  Christophe Badie; Agnieszka Blachowicz; Zarko Barjaktarovic; Rosemary Finnon; Arlette Michaux; Hakan Sarioglu; Natalie Brown; Grainne Manning; M Abderrafi Benotmane; Soile Tapio; Joanna Polanska; Simon D Bouffler
Journal:  Oncotarget       Date:  2016-06-28

4.  The Contribution of MMP-8 Promoter Genotypes to Childhood Leukemia.

Authors:  Jen-Sheng Pei; Wen-Shin Chang; Pei-Chen Hsu; Yi-Wen Hung; Shun-Ping Cheng; Chia-Wen Tsai; DA-Tian Bau; Chi-Li Gong
Journal:  In Vivo       Date:  2017 Nov-Dec       Impact factor: 2.155

5.  Cerebellar mutism syndrome in children with brain tumours of the posterior fossa.

Authors:  Morten Wibroe; Johan Cappelen; Charlotte Castor; Niels Clausen; Pernilla Grillner; Thora Gudrunardottir; Ramneek Gupta; Bengt Gustavsson; Mats Heyman; Stefan Holm; Atte Karppinen; Camilla Klausen; Tuula Lönnqvist; René Mathiasen; Pelle Nilsson; Karsten Nysom; Karin Persson; Olof Rask; Kjeld Schmiegelow; Astrid Sehested; Harald Thomassen; Ingrid Tonning-Olsson; Barbara Zetterqvist; Marianne Juhler
Journal:  BMC Cancer       Date:  2017-06-21       Impact factor: 4.430

6.  Analysis of a gene panel for targeted sequencing of colorectal cancer samples.

Authors:  Klaus Højgaard Jensen; Jose M G Izarzugaza; Agnieszka Sierakowska Juncker; Anders Jakobsen; Søren Brunak; Flemming Brandt Sørensen; Rasmus Borup Hansen; Torben Frøstrup Hansen; Pascal Timshel; Thorarinn Blondal; Thomas Skøt Jensen; Eske Rygaard-Hjalsted; Peter Mouritzen; Michael Thorsen; Rasmus Wernersson; Henrik Bjørn Nielsen
Journal:  Oncotarget       Date:  2018-01-10

7.  Lamin B1 Overexpresses in Lung Adenocarcinoma and Promotes Proliferation in Lung Cancer Cells via AKT Pathway.

Authors:  Wei Li; Xiaoqing Li; Xiaoping Li; Mingjiang Li; Pan Yang; Xuhui Wang; Lei Li; Bo Yang
Journal:  Onco Targets Ther       Date:  2020-04-15       Impact factor: 4.147

8.  Can Machine Learning Models Predict Asparaginase-associated Pancreatitis in Childhood Acute Lymphoblastic Leukemia.

Authors:  Rikke L Nielsen; Benjamin O Wolthers; Marianne Helenius; Birgitte K Albertsen; Line Clemmensen; Kasper Nielsen; Jukka Kanerva; Riitta Niinimäki; Thomas L Frandsen; Andishe Attarbaschi; Shlomit Barzilai; Antonella Colombini; Gabriele Escherich; Derya Aytan-Aktug; Hsi-Che Liu; Anja Möricke; Sujith Samarasinghe; Inge M van der Sluis; Martin Stanulla; Morten Tulstrup; Rachita Yadav; Ester Zapotocka; Kjeld Schmiegelow; Ramneek Gupta
Journal:  J Pediatr Hematol Oncol       Date:  2022-04-01       Impact factor: 1.289

9.  Weighted Gene Co-Expression Network Analysis and Support Vector Machine Learning in the Proteomic Profiling of Cerebrospinal Fluid from Extraventricular Drainage in Child Medulloblastoma.

Authors:  Maurizio Bruschi; Xhuliana Kajana; Andrea Petretto; Martina Bartolucci; Marco Pavanello; Gian Marco Ghiggeri; Isabella Panfoli; Giovanni Candiano
Journal:  Metabolites       Date:  2022-08-05

10.  Data on affected cancer-related genes in pediatric t(12;21)-positive acute lymphoblastic leukemia patients harboring unbalanced der(6)t(X;6) translocations.

Authors:  Eigil Kjeldsen
Journal:  Data Brief       Date:  2016-07-05
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.