Literature DB >> 32378698

Epigenome-wide association study reveals CpG sites related to COG of neuroblastoma.

Hao Zhao1,2, Xiaojun Zhou3, Hu Sun2, Dongyun Zhao3, Hongfei Liu2, Bin Huang1,4, Xingang Li1,4, Yinghao Gu2.   

Abstract

BACKGROUND: Neuroblastoma (NB) is the most common extracranial solid tumor in infants and children. Its variable location and complex pathogenesis make NB hard for early diagnosis and risk classification.
METHODOLOGY: We analyzed the methylation data of 236 samples from patients with NB in Therapeutically Applicable Research to Generate Effective Treatments (TARGET) database. Kaplan-Meier survival analysis was used for comparing overall survival of NB patients in different groups. Epigenome-wide association study (EWAS) was conducted to screen CpGs significantly associated with NB patients' Children's Oncology Group (COG). Logistic regression method was used for constructing a model to predict NB patients' COG.
RESULTS: NB patients in low COG showed significantly superior prognosis than those in high COG. A total of seven CpG sites were found closely related to COG. Logistic regression model based on those CpGs showed superior performance in separating NB patients in different COGs.
CONCLUSIONS: The present study highlights the important role of DNA methylation in NB development, which might provide evidence for treatment decisions for children NB.
© 2020 The Author(s).

Entities:  

Keywords:  CpG; EWAS; Logistic regression model; Neuroblastoma; Risk classification

Mesh:

Substances:

Year:  2020        PMID: 32378698      PMCID: PMC7256671          DOI: 10.1042/BSR20200826

Source DB:  PubMed          Journal:  Biosci Rep        ISSN: 0144-8463            Impact factor:   3.840


Introduction

Neuroblastoma (NB) is one of the most common malignant tumors in infants and children. The incidence of NB in extracranial solid tumor in childhood. Approximately 7% of the malignant tumors occur in children born to 14 years old, with a fatality rate of up to 15% [1]. NB usually originates from the sympathetic nervous system in abdomen or chest, most commonly from adrenal gland. Patients with different origins and spread of tumor cells exhibit different symptoms, such as abdominal distension, constipation, dyspnea, skin mass, bone pain and anemia [2]. NB is a self-limiting disease, which is possible for spontaneously cured and recovery when the age of diagnosis is less than 18 months [3]. Children with NB are usually treated with surgical resection, chemotherapy, radiotherapy, and autologous hematopoietic stem cell transplantation. Appropriate treatments based on risk classification could reduce the treatment-related toxicities and improve the prognosis. Patients with low or intermediate risk have satisfactory prognosis, while the recurrence rate of patients with high risk is more than 50%, with 5-year survival rate of 40–50% [4]. However, the occult primary site and diverse phenotypes make NB hard for early diagnosis and most of the patients were at high risk when diagnosed. Advanced NB is highly invasive and rapidly progressing, leading to difficulty in treatment and poor prognosis [5]. Staging and risk classification are two important prognostic factors of NB. The International Neuroblastoma staging system (INSS), which is the most widely used staging standard, classifies NB tumors into stage I, II, III, IV and IV-S according to the tumor resectability [6]. The biological and clinical characteristics of NB are complex, which would be difficult to evaluate the development and risk before treatment only by relying on INSS. Therefore, the International Neuroblastoma Risk Group (INRG) proposed a new stage and risk classification system for pretreatment. According to INSS stage, age of diagnosis, MYCN amplification status, DNA ploidy, and histopathology, patients with NB are classified into low-risk group, intermediate-risk group, and high-risk group [7]. Children’s Oncology Group (COG) suggests that the therapy strategy should be made according to the risk classification of NB [8,9] Assessment of NB COG risk classification requires a lot of comprehensive information, and subjective factors of medical staff are involved. To reduce the uncertain subjectivity and improve the accuracy of risk classification, we performed epigenome-wide association study (EWAS) on methylation data of patients with NB to identify methylation variations that are related to NB COG risk classification. In addition, a logistic regression model was constructed using these CpG sites for accurate prediction of patientsCOG.

Materials and methods

Methylation data of NB

The methylation data of NB were obtained from Therapeutically Applicable Research to Generate Effective Treatments (TARGET) database (https://ocg.cancer.gov/programs/target), which contains information of common tumors in children. We obtained the methylation data and corresponding clinical data of 236 NB patients from TARGET database. All 236 samples were tumor samples, of which 169 samples contained complete survival information. Detailed epidemiological features of those 169 samples are provided in Table 1. The methylation data were detected by Illumina Human Methylation 450 (HM450) arrays platform.
Table 1

Clinicopathological characteristics of NBL patients from TARGET database

CharacteristicsNBL patients (n=169)
No.%
SexFemale6639.05%
Male10360.95%
RaceWhite12372.78%
Black or African American2816.57%
Asian10.59%
Native Hawaiian or other Pacific Islander31.78%
Unknown158.88%
Pathologic stageI74.14%
II10.59%
III52.96%
IV15692.31%
Survival timeLong (>5 years)8751.48%
Short (<5 years)8248.52%
Survival statusDead7946.75%
Alive9053.25%

EWAS

Similar to genome-wide association study (GWAS), which finds genetic mutations in the whole-genome by comparing case group and control group, EWAS is to identify methylation variations that are related to disease by comparing different sample groups at epigenetic level. We used CpGassoc package (version 2.60, https://CRAN.R-project.org/package=CpGassoc) of R language to study the relationship between methylation level and phenotype. In this package, the matrix of methylation β value and the sample phenotype information, here refers to COG risk group, were used as input, and the P-value that represented the relationship between each CpG site and phenotype was calculated by chi-square test. The smaller the P-value is, the closer the relationship between the CpG site and phenotype is. P-value < 1e-8 was used as the threshold for the significant association between CpG site and COG risk group.

Construction of logistic regression model

As a common approach in classification, logistic regression is based on a set of variables to predict the classification results. In the present study, the β value of methylation site is used to predict samples’ COG. The 236 samples were randomly divided into training set and testing set with the same sample size. Based on the β value of CpG site that screened in the previous analysis in the training set, the logistic regression model was constructed with R language. Testing set was used to evaluate the accuracy of the regression model.

Statistical analysis

For the 169 samples with survival information, we used the survival package (version 2.42, https://www.rdocumentation.org/packages/survival) of R language (version 3.5, https://www.r-project.org/) to perform Kaplan–Meier survival analysis. Cox proportional hazards model, which introduced mitosis-karyorrhexis index (MKI), risk groups (COG_risk_group), and tumor percentage (percent_tumor) as variables, was used to calculate the hazard rate (HR) between the sample groups based on each variable. Fisher’s exact test was used to determine the significance of the overall survival between different sample groups.

Results

MKI and COG risk group significantly affect the prognosis and survival

Hazard ratio was calculated based on the Cox proportional hazards model. All 169 samples were divided into groups according to MKI (high, intermediate, low, and unknown), COG risk group (high risk, intermediate risk, and low risk), and tumor percentage (H: > 50%, L: < 50%), respectively. HR > 1 indicated that samples with the factor had a higher risk of death than reference (high MKI, low COG risk group, or low tumor percentage), and HR < 1 indicated samples with the factor had a lower risk of death than reference. Lower.95 and upper.95 are 95% confidence intervals, and P-value<0.05 was considered as threshold. Figure 1A showed that NB patients with low and intermediate MKI had lower risk of death compared with high MKI. Compared with high COG, low and intermediate COG samples had lower risk of death. Tumor percentage showed no significant correlation with prognosis. Kaplan–Meier survival analysis results also supported that low MKI (Figure 1B) and low COG (Figure 1C) correlated with superior NB prognosis.
Figure 1

Correlation between different variables and prognosis

(A) Hazard ratio of sample groups based on MKI, Cog_risk_group, and percent_tumor, respectively. (n, number of samples; *P-value <0.05). (B) Kaplan–Meier survival curves of the correlation between MKI and survival probability. (C) Kaplan–Meier survival curves of the correlation between COG_risk_group and survival probability.

Correlation between different variables and prognosis

(A) Hazard ratio of sample groups based on MKI, Cog_risk_group, and percent_tumor, respectively. (n, number of samples; *P-value <0.05). (B) Kaplan–Meier survival curves of the correlation between MKI and survival probability. (C) Kaplan–Meier survival curves of the correlation between COG_risk_group and survival probability.

EWAS analysis

Samples within different groups based on COG showed significantly different survival time according to previous analysis. To further study the relationship between DNA methylation and COG, we used CpGassoc package to perform EWAS. P-value was used to measure the relationship between the CpG sites methylation levels and tumor COG. The smaller the P-value, the closer the relationship between them. We screened 81 CpG sites with P<1e-8 as the threshold, and 13 CpG sites with P<1e-10 as the threshold. As shown in Figure 2A, the 13 CpG sites were distributed on chromosomes 1, 2, 5, 10, 15, and 22. For details, Figure 2B–G were enlarged Manhattan plots of CpG positions on chromosomes 1, 2, 5, 10, 15, and 22, respectively.
Figure 2

Manhattan plots of CpG positons on chromosomes

(A) Manhattan plot of CpG sites on all chromosomes. The horizontal axis is positions of chromosome, and the vertical axis is −log10 of P-value. The blue line and the red line are thresholds of 1e-8 and 1e-10, respectively. (B–G) Manhattan plots of CpG sites on chromosome 1, 2, 5, 10, 15, and 22, respectively.

Manhattan plots of CpG positons on chromosomes

(A) Manhattan plot of CpG sites on all chromosomes. The horizontal axis is positions of chromosome, and the vertical axis is −log10 of P-value. The blue line and the red line are thresholds of 1e-8 and 1e-10, respectively. (B–G) Manhattan plots of CpG sites on chromosome 1, 2, 5, 10, 15, and 22, respectively.

Construction and evaluation of the logistic regression model in predicting COG risk group

Six of the 13 CpG sites were not annotated to any gene, and seven of them were found with corresponding genes, which were cg25241559 (SNED1), cg20989926 (RLBP1), cg22124648 (FGR), cg23109891 (SOX10), cg23049458 (L1TD1), cg04044188 (PDSS1), and cg16157016 (PICK1). As shown in Figure 3A, we analyzed the methylation levels of these seven sites between high COG and low COG sample group. P-values calculated by Wilcox method were all less than 0.05, indicating that the methylation levels of these seven CpG sites were significantly different between the high and low COG groups, and the methylation level in the high COG group was significantly higher than that in the low COG group. A logistic regression model based on the methylation levels of these seven CpG sites in a random divided training set with half of all samples was constructed to determine the COG risk groups. To verify the accuracy of this model, we used the other half of the samples for prediction. As shown in Figure 3B, the inflection points of the curve were all close to the upper left and the area under curve (AUC) was greater than 0.8 (AUC = 0.8663), indicating the high accuracy of the model. Using information-gain-based approach to sort the reliability of the seven CpG sites in predicting COG risk group from high to low, we found that the prediction accuracy of cg25241559 + cg20989926 reached 0.84, while the accuracy of the next points did not significantly increase, indicating that the model constructed by these two CpG sites was accurate enough for COG group prediction (Figure 3C).
Figure 3

CpG-based model could well predict COG risk level in NB

(A) Methylation levels of the seven CpG sites in the high-risk group and low-risk group. Blue: high-risk group; Yellow: low-risk group; Vertical axis: methylation level. (B) Prediction of the half of the 236 samples using the logistic regression model. (C) Evaluation of the logistic regression model in predicting COG risk level. The horizontal axis represents the number of included points, and the vertical axis represents the prediction accuracy.

CpG-based model could well predict COG risk level in NB

(A) Methylation levels of the seven CpG sites in the high-risk group and low-risk group. Blue: high-risk group; Yellow: low-risk group; Vertical axis: methylation level. (B) Prediction of the half of the 236 samples using the logistic regression model. (C) Evaluation of the logistic regression model in predicting COG risk level. The horizontal axis represents the number of included points, and the vertical axis represents the prediction accuracy.

Discussion

The risk stratification based on clinical and biologic factors of patients with tumor has been used to determine the appropriate treatment and is criteria for predicting prognosis for decades. Since 2006, the COG has been collecting INRG data and revising the COG risk classification system to achieve better predictive performance [10]. However, patients with same risk stratification may have different clinical courses after receiving same therapy [11]. On the basis of the assumption that the occurrence and development of tumor are driven by genetic and biological characteristics, researchers have identified a large number of potential prognostic markers and established classification for improving treatment strategies [12-14]. Although several well-characterized genetic abnormalities, including DNA content [15], gain of chromosome arm 17q, and deletion of chromosome arm 1p and 11q have been discovered to be associated with outcome prediction in NB [16], features of these molecules are inadequate to explain the clinical heterogeneity [17]. There is increasing evidence that abnormal DNA methylation is highly related to the development and progression of many cancers including NB. Hypermethylation was found in aggressive tumors compared with low-grade ones [18], and the tumorigenic properties of NB was inhibited by reversing epigenetic changes in CpG island with demethylating agent [19]. Capper et al. reported that DNA methylation patterns might be useful for the accurate classification of olfactory NB [20]. In this study, we analyzed the methylation data of NB in TARGET database, and found that the DNA methylation in the whole-genome that was associated with MKI, an indicator of NB histological classification [7], was significantly related to prognosis and survival. Meanwhile, COG risk group significantly affected the prognosis and survival, also suggesting that the methylation characteristics showed good potential in NB prognosis prediction. Furthermore, seven CpG sites with significantly different methylation levels in the high- and low-COG risk groups were identified, including cg25241559, cg20989926, cg22124648, cg23109891, cg23049458, cg04044188 and cg16157016, and logistic regression model with these CpG sites was constructed to accurately determine the COG risk of the samples. In addition, by analyzing the prediction reliability of these CpG sites, we found that the model constructed using cg25241559 and cg20989926 was accurate enough for COG risk group prediction. cg25241559 is a probe representing Sushi Nidogen and EGF-like domains 1 (SNED1), which encodes a secreted protein composed of characteristic domains commonly found in ECM proteins [21]. The crucial role of SNED1 in the development of several tumors has been discovered. Longati et al. identified SNED1 as a stromal marker that induces cisplatin resistance in head and neck squamous carcinoma [22]. Naba et al. reported SNED1 as characteristic protein of highly metastatic in mammary carcinoma, and the overexpression of SNED1 was associated with poor outcome for patients with ER−/PR− breast cancer [23]. The homozygous deletions of SNED1 might be involved in tumor metastasis of NB [24], while DNA methylation change of SNED1 was associated with the clinical course of NB [25]. However, the mechanisms of SNED1 involved in invasive phenotype remains unclear. Cg20989926 is a probe representing RLBP1, a retinaldehyde-binding protein gene that encodes human cellular retinaldehyde-binding protein [26]. Mutation of RLBP1 is associated with several inherited retinal disorders, such as Bothnia dystrophy [27], newfoundland rod-cone dystrophy [28], retinitis pigmentosa [29], and retinitis punctata albescens [30]. In oral cancer, the over expression of RLBP1 is associated with increased glucose uptake and aerobic glycolysis-mediated ATP synthesis [31]. The hypermethylation of RLBP1 is also recognized as one of the features of high-risk NB [25]. On the other hand, 5-hydroxymethylcytosine (5-hmC), a key component of DNA methylation, was proved to be related to hypoxic response of NB cell lines [32]. In another study on prognostic factors of NB using COG cohorts, 5-hmC profiles could be used as a biomarker for NB in children [33]. Perhaps the transcriptional network essential for promoting different NB phenotypes is regulated by distinct pattern of DNA modification, which requires further research in the future.

Conclusion

With the discovery of key somatic and germline genomic alterations, new tumor classifications relying on molecular profiles of tumor and host are emerging to provide evidences for treatment strategy of NB. In the present study, the methylation data of NB from TARGET database were analyzed to investigate the relationship between DNA methylation and COG risk groups. Seven CpG sites that significantly associated with risk classification were identified, and the combination of SNED1 and RLBP1 were accurate enough for risk stratification. The present study provided a basis for the refinements in risk classification, although additional research will be needed to clarify the biological functions of these methylation variations to achieve higher accuracy.
  33 in total

1.  Statistical Framework in Support of a Revised Children's Oncology Group Neuroblastoma Risk Classification System.

Authors:  Arlene Naranjo; Meredith S Irwin; Michael D Hogarty; Susan L Cohn; Julie R Park; Wendy B London
Journal:  JCO Clin Cancer Inform       Date:  2018-12

2.  Bothnia dystrophy caused by mutations in the cellular retinaldehyde-binding protein gene (RLBP1) on chromosome 15q26.

Authors:  M S Burstedt; O Sandgren; G Holmgren; K Forsman-Semb
Journal:  Invest Ophthalmol Vis Sci       Date:  1999-04       Impact factor: 4.799

3.  PARP inhibitors enhance replication stress and cause mitotic catastrophe in MYCN-dependent neuroblastoma.

Authors:  V Colicchia; M Petroni; G Guarguaglini; F Sardina; M Sahún-Roncero; M Carbonari; B Ricci; C Heil; C Capalbo; F Belardinilli; A Coppa; G Peruzzi; I Screpanti; P Lavia; A Gulino; G Giannini
Journal:  Oncogene       Date:  2017-04-10       Impact factor: 9.867

4.  Investigation of major genetic alterations in neuroblastoma.

Authors:  Régis Afonso Costa; Héctor N Seuánez
Journal:  Mol Biol Rep       Date:  2018-02-17       Impact factor: 2.316

5.  Cloning and expression analysis of the mouse stroma marker Snep encoding a novel nidogen domain protein.

Authors:  Cornelia Leimeister; Nina Schumacher; Holger Diez; Manfred Gessler
Journal:  Dev Dyn       Date:  2004-06       Impact factor: 3.780

6.  Integration of genetic and epigenetic markers for risk stratification: opportunities and challenges.

Authors:  Nora Pashayan; Daniel Reisel; Martin Widschwendter
Journal:  Per Med       Date:  2016-03-01       Impact factor: 2.512

7.  DNA methylation-based reclassification of olfactory neuroblastoma.

Authors:  David Capper; Nils W Engel; Damian Stichel; Matt Lechner; Stefanie Glöss; Simone Schmid; Christian Koelsche; Daniel Schrimpf; Judith Niesen; Annika K Wefers; David T W Jones; Martin Sill; Oliver Weigert; Keith L Ligon; Adriana Olar; Arend Koch; Martin Forster; Sebastian Moran; Oscar M Tirado; Miguel Sáinz-Jaspeado; Jaume Mora; Manel Esteller; Javier Alonso; Xavier Garcia Del Muro; Werner Paulus; Jörg Felsberg; Guido Reifenberger; Markus Glatzel; Stephan Frank; Camelia M Monoranu; Valerie J Lund; Andreas von Deimling; Stefan Pfister; Rolf Buslei; Julika Ribbat-Idel; Sven Perner; Volker Gudziol; Matthias Meinhardt; Ulrich Schüller
Journal:  Acta Neuropathol       Date:  2018-05-05       Impact factor: 15.887

8.  Extracellular matrix signatures of human mammary carcinoma identify novel metastasis promoters.

Authors:  Alexandra Naba; Karl R Clauser; John M Lamar; Steven A Carr; Richard O Hynes
Journal:  Elife       Date:  2014-03-11       Impact factor: 8.140

9.  Genome-wide methylation profiling identifies novel methylated genes in neuroblastoma tumors.

Authors:  Maja Olsson; Stephan Beck; Per Kogner; Tommy Martinsson; Helena Carén
Journal:  Epigenetics       Date:  2016-01-19       Impact factor: 4.528

10.  5-Hydroxymethylcytosine Profiles Are Prognostic of Outcome in Neuroblastoma and Reveal Transcriptional Networks That Correlate With Tumor Phenotype.

Authors:  Mark A Applebaum; Erin K Barr; Jason Karpus; Ji Nie; Zhou Zhang; Amy E Armstrong; Sakshi Uppal; Madina Sukhanova; Wei Zhang; Alexandre Chlenski; Helen R Salwen; Emma Wilkinson; Marija Dobratic; Robert Grossman; Lucy A Godley; Barbara E Stranger; Chuan He; Susan L Cohn
Journal:  JCO Precis Oncol       Date:  2019-05-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.