BACKGROUND: Uterine corpus endometrial carcinoma (UCEC) is a clinically heterogeneous disease, and this heterogeneity is associated with tumor development, clinical characteristics, and prognostic outcomes. Mutant-allele tumor heterogeneity (MATH) is a novel, non-biased, quantitative measure to assess intra-tumor heterogeneity based on next-generation sequencing data. We aimed to explore the use of MATH as a measure for tumor heterogeneity and its prognostic role in UCEC patients. METHODS: We calculated MATH scores from the available data of 560 UCEC patients from The Cancer Genome Atlas (TCGA) and investigated their correlations with clinical characteristics, genetic alterations, and overall survival. Predictive accuracy was quantified using the area under the receiver operating characteristic curve (AUC) and the index of concordance (C-index). RESULTS: In total, 242 MATH scores were obtained from the UCEC cohort. MATH scores were significantly related to age, race, cancer type, clinical stage, histological grade, molecular type, targeted molecular therapy, and hormonal therapy. Furthermore, the genomic pattern on the basis of MATH scores showed that mutation rates of TP53 (tumor protein p53) and ARID1A (AT-rich interaction domain 1A) were independently associated with MATH scores. Correlation analysis revealed a significantly positive association of MATH scores with the fraction of somatic copy number alteration (SCNA). Importantly, a high MATH score was significantly associated with shorter overall survival [hazard ratio (HR), 2.342; 95% confidence interval (CI), 1.110-4.942]. Multivariate Cox regression combined with stratified analysis revealed that the MATH score is an independent prognostic factor in UCEC patients under 60 years old, and predictive quantification showed the MATH score had an AUC of 0.756 and a C-index of 0.845. CONCLUSIONS: Our results suggest that MATH, a practical and useful way to measure intra-tumor heterogeneity, may serve as a significant biomarker for the prognosis of patients with UCEC, enabling more accurate prediction of clinical outcomes. 2020 Annals of Translational Medicine. All rights reserved.
BACKGROUND: Uterine corpus endometrial carcinoma (UCEC) is a clinically heterogeneous disease, and this heterogeneity is associated with tumor development, clinical characteristics, and prognostic outcomes. Mutant-allele tumor heterogeneity (MATH) is a novel, non-biased, quantitative measure to assess intra-tumor heterogeneity based on next-generation sequencing data. We aimed to explore the use of MATH as a measure for tumor heterogeneity and its prognostic role in UCEC patients. METHODS: We calculated MATH scores from the available data of 560 UCEC patients from The Cancer Genome Atlas (TCGA) and investigated their correlations with clinical characteristics, genetic alterations, and overall survival. Predictive accuracy was quantified using the area under the receiver operating characteristic curve (AUC) and the index of concordance (C-index). RESULTS: In total, 242 MATH scores were obtained from the UCEC cohort. MATH scores were significantly related to age, race, cancer type, clinical stage, histological grade, molecular type, targeted molecular therapy, and hormonal therapy. Furthermore, the genomic pattern on the basis of MATH scores showed that mutation rates of TP53 (tumor protein p53) and ARID1A (AT-rich interaction domain 1A) were independently associated with MATH scores. Correlation analysis revealed a significantly positive association of MATH scores with the fraction of somatic copy number alteration (SCNA). Importantly, a high MATH score was significantly associated with shorter overall survival [hazard ratio (HR), 2.342; 95% confidence interval (CI), 1.110-4.942]. Multivariate Cox regression combined with stratified analysis revealed that the MATH score is an independent prognostic factor in UCEC patients under 60 years old, and predictive quantification showed the MATH score had an AUC of 0.756 and a C-index of 0.845. CONCLUSIONS: Our results suggest that MATH, a practical and useful way to measure intra-tumor heterogeneity, may serve as a significant biomarker for the prognosis of patients with UCEC, enabling more accurate prediction of clinical outcomes. 2020 Annals of Translational Medicine. All rights reserved.
Intra-tumor heterogeneity is a hallmark of cancer, which is characterized by the presence of different subpopulations of cancer cells with distinct genetic, phenotypic, or behavioral characteristics within the same tumor, as well as between a given primary tumor and its metastases (1,2). Currently, the extent of intra-tumor heterogeneity may serve as a clinically useful biomarker in the development of personalized therapies and clinical outcomes (3-5). Genetic heterogeneity is a pivotal type of intra-tumor heterogeneity and has been considered as the product of genomic instability, which results in the presence of different subclonal populations within the tumor, and likely influences the patient’s clinical course and response to therapy (6-9). Most previous investigations have been small-scale studies or consisted of only single-cell analyses. In addition, identified markers of tumor subpopulations may not generalize well across tumor types and could be difficult to standardize and quantify. Moreover, guidelines for scoring and reporting have not yet been established. Furthermore, the analyses of genetic heterogeneity have been limited with respect to sequencing depth and the number of detected somatic mutations (10). While this concept is now well established, the quantification of genetic heterogeneity to assess intra-tumor heterogeneity is still a great challenge in clinical trials and clinical practice.Mutant-allele tumor heterogeneity (MATH) is a quantitative assessment of genetic intra-tumor heterogeneity, which is based on whole-exome sequencing (WES) of tumors and of their matched normal DNA (11). The MATH score, which is the normalized width of this distribution, is derived directly from the mixed population of mutant-allele frequencies within a tumor (11). The MATH score has been reported to be a reliable biomarker of outcomes in patients with head and neck squamous cell carcinoma (HNSCC) (11-13), breast cancer (14,15), colorectal cancer (16), and lung adenocarcinoma (17). Although studies have successfully demonstrated that intra-tumor heterogeneity assessed by MATH is a prognostic biomarker in some cancers, the prognostic significance of the MATH score in various types of tumors has not been thoroughly explored at the genomic level.Uterine corpus endometrial carcinoma (UCEC), which has a mortality rate of approximately 20%, is the most common gynecological malignancy in the western world (18). UCEC has traditionally been classified into two types (types I and II), based on epidemiological, clinical, and endocrine characteristics (19). A number of studies have already demonstrated that UCEC is comprised of a biologically, clinically, morphologically, and genetically heterogeneous group of tumors (20-22). In 2013, The Cancer Genome Atlas (TCGA) Research Network performed a comprehensive genomic and transcriptomic analysis of UCEC, which was based on array and sequencing technologies using common histological types (23). Despite the fact that MATH is a quantitative biomarker to assess intra-tumor heterogeneity that has been applied to some tumors, the prognostic value of MATH in UCEC patients remains to be determined.In this study, we investigated the associations of MATH with clinical, pathological, and overall survival data in UCEC patients from the TCGA database, and explored the potential clinical implications of intra-tumor heterogeneity in UCEC. Our findings suggest that the MATH score is a quantitative measure of intra-tumor heterogeneity, and could serve as a novel biomarker for improved prognosis and personalized therapy for UCEC patients.
Methods
Eligible patients and clinical variables
The publicly available clinical data used in this study were released by TCGA and the Broad Institute of MIT and Harvard. Female patients who were diagnosed with clinical stage I–IV UCEC with acquirable level-III WES data were eligible. For each patient, the following data were acquired based on the clinical file downloaded from http://gdac.broadinstitute.org/: age at initial pathological diagnosis, vital status, days to death, days to last follow-up, clinical stage, histological grade, cancer type, hormonal therapy, pregnancies, radiation therapy, targeted molecular therapy, surgical approach, and presence/absence of diabetes. Additionally, race and body mass index (BMI) were determined based on the clinical file downloaded from https://portal.gdc.cancer.gov/. The molecular type of each sample was downloaded from attachment files of the article “Integrated Genomic Characterization of Endometrial Carcinoma” (23). Since the datasets analyzed during the current study were available in TCGA databases, ethical approval or informed consent was not required.
Genomic variables and generation of MATH
Tumor-specific mutation data from WES were downloaded from the Broad Institute of MIT and Harvard (24), where WES data had been obtained. Mutation data were available for 248 of the 560 UCEC patients. The tumor MATH score for each patient was calculated following the method described by Mroz EA and Rocco JW (11). Tumor-specific mutations had already been identified at the Broad Institute of MIT and Harvard for TCGA. The steps to determine the MATH score are summarized as follows: (I) calculating the mutant-allele fraction (MAF) for each locus as the ratio of mutant reads to total reads; (II) obtaining the absolute difference of each MAF from the median MAF value, and then multiplying the median of these absolute differences by a factor of 1.4826, thus generating the median absolute deviation (MAD); and (III) calculating the MATH score as the percentage ratio of the MAD to the median of the MAFs among the tumor’s mutated genomic loci as follows: MATH = MAD/median ×100%. The patients were separated into high- and low-MATH groups by the median of the MATH scores. Somatic mutations for each patient were downloaded from the Broad Institute for the aforementioned TCGA UCEC subjects and were compared between the low-MATH group and the high-MATH group. The fraction of somatic copy number alteration (SCNA) in TCGA UCEC subjects was obtained from the cBioPortal for Cancer Genomics (http://cbioportal.org).
Statistical and bioinformatic analysis
The association between the MATH score and clinical characteristics was evaluated by one-way analysis of variance (ANOVA), with pairwise comparisons that were adjusted using the Bonferroni test. The receiver operating characteristic (ROC) curve showed the tradeoff between sensitivity and specificity at different choices of the MATH classification cutoff for the whole cohort. To adapt the data to the following analyses, we used the median MATH score of 21.425 as the cutoff point to divide the patients into two groups of equal size; this cutoff was also adopted in subgroup analyses. The correlation coefficient between MATH scores and SCNA fraction was assessed by the Spearman method. Kaplan-Meier survival curves were used to analyze the overall survival of UCEC patients, and log-rank (Mantel-Cox) tests were applied to compare the differences between subgroups. Univariate and multivariate Cox regression was employed to evaluate the prognostic value of the MATH score, where the MATH score was used both as a categorical variable and as a continuous variable, respectively. In the whole cohort, all significant parameters identified by univariate analysis have been included in multivariate analysis. Stratified multivariate analyses were performed using the variables showing P values <0.20 in the univariate analysis, in addition to other variables considered as known prognostic factors. The predictive accuracy of different Cox models was described by the area under the ROC curve (AUC) and the index of concordance (C-index). All analyses were performed using the International Business Machines Corporation statistical software program Statistical Product and Service Solutions (SPSS), version 22.0.0 (SPSS Incorporated, Chicago, IL, USA). MATH scores were calculated by R version 3.3.3 (R Foundation for Statistical Computing, Vienna, Austria). A two-sided P value <0.05 was considered to indicate statistical significance, unless otherwise indicated.
Results
MATH and baseline characteristics in UCEC
Somatic-mutation data were available for 248 of the 560 UCEC patients, and ultimately 242 MATH scores were acquired. For all 242 eligible patients, initial pathological diagnoses were made between 1995 and 2013 (median, 2009); the median follow-up was 36.05 months (range, 0.57–188.37 months), and 33 patients (13.6%) had died. Their ages ranged from 34 to 90 years and their races were defined as the following: White; Black or African American; Asian; Native Hawaiian or other Pacific Islander; and American Indian or Alaskan Native. Tumor samples and corresponding germline DNA were composed of 194 endometrioid (type-I UCEC), 43 serous (type-II UCEC), and five mixed histology cases. Using the MATH scoring method, the scores of MAFs among loci tend to be higher for heterogeneous tumors than for homogeneous tumors. As shown in , tumor MATH scores ranged from 0 to 75.023, with a mean ± standard deviation (SD) of 27.114±14.724 (interquartile range, 16.820–36.203). According to the median (21.425) MATH score, patients were divided into two groups: the low-MATH group and the high-MATH group, with mean ± SD of 16.485±3.456 and 37.742±13.987, respectively. The distribution of MATH scores showed a wide spectrum of allele frequencies and strong clustering of the variants at a high MATH score (). The box plots of MATH distribution according to different clinicopathological characteristics of tumors are displayed in . The distribution of MATH scores yielded considerable variation in MATH scores and indicated that UCEC has high heterogeneity.
Figure 1
Distribution of MATH scores among 242 UCEC patients. (A) MATH scores are displayed along the horizontal axis, and the number of patients within a specific range of MATH scores is displayed on the vertical axis; (B) box plots of MATH scores in different groups of UCEC patients. o, outlier; *, extreme data. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma.
Distribution of MATH scores among 242 UCEC patients. (A) MATH scores are displayed along the horizontal axis, and the number of patients within a specific range of MATH scores is displayed on the vertical axis; (B) box plots of MATH scores in different groups of UCEC patients. o, outlier; *, extreme data. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma.
Relationships of clinical characteristics with MATH scores
The clinical characteristics, including age, histological grade, clinical stage, and BMI, were downloaded. They are listed in . As shown in and , the tumor MATH scores were significantly related to age, race, cancer type, clinical stage, histological grade, molecular type, targeted molecular therapy, and hormonal therapy. In contrast, MATH scores were not significantly correlated with radiation therapy, pregnancies, surgical approach, diabetes, or BMI (). UCEC patients younger than 60 years old had low MATH scores; MATH scores of those over 60 years old were significantly higher (,
). Further analysis revealed that MATH scores were higher in type-II UCEC patients compared to those in type-I UCEC patients (P<0.01; ,
). Additionally, with respect to the clinical stage, UCEC patients with tumor classification T1 and T2 had much lower MATH scores than patients with T3 and T4 (P<0.01; ,
). In terms of histological grade, UCEC patients with tumor histological grade G3 and G4 tended to have significantly higher MATH scores than patients of the G1 and G2 subtypes (P<0.01; ,
). Moreover, as shown in and , targeted molecular therapy was associated with higher MATH scores in UCEC patients (P<0.05), whereas lower MATH scores were significantly associated with menopausal hormonal therapy (P<0.05). Based on the four molecular subtypes of endometrial carcinoma, our analysis revealed that MATH scores were significantly higher in the copy number (CN) high subgroup than in other molecular subtypes (P<0.01; ).
Table 1
Relationships of clinical variables with MATH values
Variables
N
Percent
MATH ± SD
ANOVA P value
Age, years
0.010
≤60
96
39.669
23.585±12.358
61–70
86
35.537
29.362±14.897
>70
60
24.793
29.536±16.921
Race
0.001
White
189
78.099
25.851±13.912
Black or African American
24
9.917
36.082±17.201
Asian
13
5.372
20.839±7.929
Cancer type
1.259E−20
I
194
80.165
23.029±11.553
II
43
17.769
44.051±14.649
Histological grade
4.210E−13
1
76
31.405
21.094±9.711
2
74
30.579
22.593±11.217
3+4
92
38.017
35.723±16.543
Clinical stage
3.000E−5
1+2
179
73.967
24.806±13.242
3+4
63
26.033
33.671±16.739
Molecular type
5.260E−9
CN high
59
24.380
40.891±14.271
CN low
90
37.190
19.619±7.215
MSI
63
26.033
26.197±14.631
POLE
14
5.785
20.306±7.557
Not assigned
16
6.612
28.033±17.925
Targeted molecular therapy
3.000E−6
No
159
65.702
24.058±12.780
Yes
74
30.579
33.663±16.856
Hormonal therapy
0.018
No
141
58.264
27.742±15.457
Yes
23
9.504
19.961±4.748
Radiation therapy
0.505
No
155
64.050
26.640±14.471
Yes
87
35.950
27.958±15.212
Pregnancies
0.351
0
43
17.769
27.984±15.643
1
29
11.983
24.333±11.011
2
67
27.686
26.607±15.311
3
37
15.289
26.091±13.644
≥4
32
13.223
31.732±17.290
Surgical approach
0.958
Open
164
67.769
27.149±15.057
Minimally invasive
78
32.231
27.040±14.094
Diabetes
0.317
No
154
63.636
27.477±15.044
Yes
55
22.727
25.199±12.635
BMI, kg/m2
0.447
≤25
55
22.727
29.068±15.921
26–35
98
40.496
26.769±14.585
≥36
76
31.405
25.799±13.947
One-way ANOVA was used to evaluate the relationships of clinical variables with MATH values. MATH, mutant-allele tumor heterogeneity; ANOVA, analysis of variance; SD, standard deviation; CN, copy number; MSI, microsatellite instability; POLE: polymerase ε.
Figure 2
Relationships of clinical characteristics to the MATH score. Each panel represents the relationships of the clinical characteristic to the MATH score. Within each panel, the horizontal axis indicates the range of MATH scores and the vertical axis represents the percentage of cases with the indicated characteristics. T, tumor classification; G, tumor histological grade; MATH, mutant-allele tumor heterogeneity.
One-way ANOVA was used to evaluate the relationships of clinical variables with MATH values. MATH, mutant-allele tumor heterogeneity; ANOVA, analysis of variance; SD, standard deviation; CN, copy number; MSI, microsatellite instability; POLE: polymerase ε.Relationships of clinical characteristics to the MATH score. Each panel represents the relationships of the clinical characteristic to the MATH score. Within each panel, the horizontal axis indicates the range of MATH scores and the vertical axis represents the percentage of cases with the indicated characteristics. T, tumor classification; G, tumor histological grade; MATH, mutant-allele tumor heterogeneity.
Correlation of MATH with somatic mutations and SCNA
To explore the differences in somatic-mutation frequencies between the high- and low-MATH groups, the most prevalent somatic mutations in each MATH group were analyzed. As shown in , mutations in phosphatase and tensin homolog (PTEN), phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA), and phosphoinositide-3-kinase regulatory subunit 1 (PIK3R1) were among the top five most prevalent somatic mutations in both the groups. Mutations in titin (TTN) and tumor protein p53 (TP53) were present in the top-five list only in the high-MATH group, and mutations in AT-rich interaction domain 1A (ARID1A) and catenin-beta 1 (CTNNB1) were present only in the top-five list in the low-MATH group. Therefore, TP53 and ARID1A were mutated at significantly different frequencies in the low- and high-MATH groups, as confirmed by the Pearson chi-square test and logistic regression analysis (P<0.05 in both tests). TP53 mutation rates in the low-MATH group and the high-MATH group were 15.7% and 48.8%, respectively. After adjusting variables in the logistic regression model, the difference in TP53 mutation rate was still significant between the high- and low-MATH groups (P=0.020). In the Pearson correlation analysis, TP53 mutation displayed a significantly positive correlation with type-II UCEC (P<0.001) and clinical stage (P<0.001) in UCEC patients. Further subgroup analysis was performed next. As shown in , a significant difference was observed in type-I UCEC patients (P=0.018) but not in type-II UCEC patients (P=0.503). With respect to the clinical stage, a similar result was found in the mutation rates of TP53 between different MATH levels within clinical stages 1–2 and clinical stages 3–4 (P=0.002 and P<0.001, respectively; ). Another gene showing a different mutation rate in both Fisher’s exact test and logistic regression was ARID1A (P=0.001 and P=0.031), which had a significantly higher mutation rate in the low-MATH group than in the high-MATH group (43.8% and 23.1%, respectively). Additionally, we found a negative correlation between ARID1A mutation and type-I UCEC (P<0.001) or clinical stage (P<0.001). Moreover, the difference in the mutation rate of ARID1A was only significant in the clinical stage 1–2 subgroup (P=0.007), but not in the type-I subgroup (P=0.064), type-II group (P=0.256), or clinical stage 3–4 subgroup (P=0.145; ). Next, we explored the relationship between MATH scores and SCNA. As shown in , UCEC patients with high MATH scores had higher SCNA fractions than those with low MATH scores. The median fractions of SCNA in the high- and low-MATH groups were 21.41% and 3.41%, respectively, and the SCNA fraction was significantly different between the high- and low-MATH groups (). Furthermore, Spearman’s correlation analysis revealed a significantly positive association between the MATH score and SCNA fraction, validating that the fraction of SCNA was significantly higher in the high-MATH group ().
Figure 3
Somatic mutations in different MATH groups. (A) Genetic-mutation rates for the most prevalent mutations in UCEC patients from the TCGA database, which were divided into the high-MATH and the low-MATH group; (B) subgroup analyses of the mutation frequency in TP53 and ARID1A between the high- and low-MATH groups according to clinical subtype and clinical stage. Mut, mutation; WT, wild-type; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; TCGA, The Cancer Genome Atlas.
Figure 4
SCNA in different MATH groups. (A) Scatter diagram of the SCNA fraction in high- and low-MATH groups in UCEC patients from the TCGA database; (B) lower quartile, median, and upper quartile fraction of SCNA in high- and low-MATH groups; (C) correlation between the fraction of SCNA and the MATH score in UCEC patients from the TCGA database. Each circle is an individual sample. SCNA, somatic copy number alteration; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; TCGA, The Cancer Genome Atlas.
Somatic mutations in different MATH groups. (A) Genetic-mutation rates for the most prevalent mutations in UCEC patients from the TCGA database, which were divided into the high-MATH and the low-MATH group; (B) subgroup analyses of the mutation frequency in TP53 and ARID1A between the high- and low-MATH groups according to clinical subtype and clinical stage. Mut, mutation; WT, wild-type; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; TCGA, The Cancer Genome Atlas.SCNA in different MATH groups. (A) Scatter diagram of the SCNA fraction in high- and low-MATH groups in UCEC patients from the TCGA database; (B) lower quartile, median, and upper quartile fraction of SCNA in high- and low-MATH groups; (C) correlation between the fraction of SCNA and the MATH score in UCEC patients from the TCGA database. Each circle is an individual sample. SCNA, somatic copy number alteration; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; TCGA, The Cancer Genome Atlas.
Prognostic value of the MATH score for survival in UCEC patients
With regard to the potential prognostic value of the commonly mutated genes in UCEC, the Kaplan-Meier curves showed that patients with genetic mutations in PTEN, PIK3CA, and ARID1A exhibited longer overall survival rates than those with the wild-type genes (P<0.05; ). However, no significant difference in prognosis was observed between patients with genetic mutations in CTNNB1 and PIK3R1 and patients with their wild-type counterparts (). In contrast, patients with TP53 mutations displayed a poorer prognosis relative to those with the wild-type TP53 gene (P<0.05; ). Survival analysis demonstrated that patients in the high-MATH group had a significantly shorter overall survival time than those in the low-MATH group based either on the median MATH score or on other MATH cutoff values (,
). To investigate whether the MATH score predicts somatic mutations, the relationships between the clinical outcomes of UCEC patients and the prevalent somatic mutations in the high-MATH or low-MATH groups were further analyzed. As shown in , ARID1A or TP53 mutations did not significantly affect prognosis in patients with high or low MATH scores. Interestingly, among patients with wild-type PTEN, patients in the low-MATH group exhibited longer overall survival rates than those in the high-MATH group (P=0.043), whereas among patients with PIK3CA mutation, low MATH scores were strongly correlated with better disease outcomes than high MATH scores (P=0.030; ).
Figure 5
Association between overall survival in UCEC patients and somatic mutations in PTEN, PIK3CA, ARID1A, CTNNB1, PIK3R1, and TP53, as determined by Kaplan-Meier curves. Mut, mutation; WT, wild-type; UCEC, uterine corpus endometrial carcinoma.
Figure 6
Overall survival in different MATH groups. Kaplan-Meier survival curves for all patients (A), somatic mutations and their wild-type counterparts (B), and UCEC patients ≤60 years old (C), >60 and ≤70 years old (D), and >70 years old (E) in the high- and low-MATH groups. Mut, mutation; WT, wild type; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma.
Figure S1
Kaplan-Meier survival curves for all patients from TCGA according to MATH groups, with MATH cutoff of 40.343 (the more statistically significant calculated cutoff value) (A) or MATH cutoff of 33.560 (the valley level of MATH distribution) (B). TCGA, The Cancer Genome Atlas; MATH, mutant-allele tumor heterogeneity.
Association between overall survival in UCEC patients and somatic mutations in PTEN, PIK3CA, ARID1A, CTNNB1, PIK3R1, and TP53, as determined by Kaplan-Meier curves. Mut, mutation; WT, wild-type; UCEC, uterine corpus endometrial carcinoma.Overall survival in different MATH groups. Kaplan-Meier survival curves for all patients (A), somatic mutations and their wild-type counterparts (B), and UCEC patients ≤60 years old (C), >60 and ≤70 years old (D), and >70 years old (E) in the high- and low-MATH groups. Mut, mutation; WT, wild type; MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma.Next, we explored the prognostic value of the MATH score and other variables in all patients and subgroups. The MATH score, taken as a continuous variable, was strongly related to overall survival by univariate Cox regression analysis (P=0.021), although this correlation was not statistically significant in the multivariate analysis (). When the MATH score was taken as a categorical variable, UCEC patients in the high-MATH group had a hazard of death that was twice as high as that of those in the low-MATH group [hazard ratio (HR), 2.342; 95% CI, 1.110–4.942; P=0.025; ]. We also found that age, cancer type, clinical stage, histological grade, molecular type, and targeted molecular therapy were significantly associated with overall survival by univariate Cox regression (). To further investigate whether the MATH score was an independent prognostic factor, multivariate analysis was conducted with variables statistically significant in univariate analysis as covariates. The results showed that age (P=0.020) and clinical stage (P<0.001) were independent prognostic factors (). However, multivariate Cox regression showed that the MATH score, as a categorical variable, was not an independent prognostic factor in all patients (P=0.715; ). Stratified analysis was then performed for age and clinical stage. Interestingly, the stratified analysis demonstrated that the MATH score was an independent prognostic factor that predicted the outcome of UCEC patients aged ≤60 years, when the MATH score was treated either as a categorical variable (HR, 11.015; 95% CI, 1.352–89.732; P=0.025; ) or as a continuous variable (HR, 1.057; 95% CI, 1.014–1.102; P=0.009; ). However, the MATH score was not a significant predictor for overall survival for patients in other subgroups (). According to the Kaplan-Meier curve, UCEC patients ≤60 years old in the high-MATH group had significantly poorer overall survival than those in the low-MATH group (). Among other age strata, no significant differences were found in overall survival between the high- and low-MATH groups (), indicating that the prognostic power of the MATH score was age dependent.
Table S1
Relationships of clinical variables with overall survival by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Age, years
0.047
0.02
61–70 vs. <60
1.341 (0.529, 3.402)
0.536
0.837 (0.303, 2.314)
0.732
>70 vs. <60
2.742 (1.153, 6.519)
0.022
2.622 (1.087, 6.322)
0.032
Cancer type (II vs. I)
3.447 (1.646, 7.215)
0.001
1.042 (0.327, 3.315)
0.945
Histological grade
0.003
0.113
2 vs. 1
5.141 (1.091, 24.219)
0.038
4.842 (1.022, 22.930)
0.047
3+4 vs. 1
10.199 (2.403, 43.290)
0.002
4.651 (1.026, 21.094)
0.046
Clinical stage (3+4 vs. 1+2)
2.816 (1.390, 5.703)
0.004
3.939 (1.837, 8.445)
<0.001
Targeted molecular therapy (yes vs. no)
2.423 (1.202, 4.887)
0.013
1.357 (0.573, 3.214)
0.487
Molecular type
0.027
0.911
CN low vs. CN high
0.210 (0.077, 0.569)
0.002
0.702 (0.155, 3.172)
0.645
MSI vs. CN high
0.423 (0.181, 0.985)
0.046
1.295 (0.364, 4.605)
0.689
POLE vs. CN high
<0.001 (<0.001, –)
0.98
<0.001 (<0.001, –)
0.979
Not assigned vs. CN high
0.513 (0.149, 1.761)
0.289
0.909 (0.232, 3.561)
0.891
MATH
1.023 (1.003, 1.043)
0.021
0.999 (0.975, 1.024)
0.963
Race
0.536
–
–
Black vs. White
1.375 (0.526, 3.598)
0.516
Asian vs. White
0.418 (0.056, 3.110)
0.394
Hormonal therapy (yes vs. no)
0.192 (0.026, 1.425)
0.107
–
–
Radiation therapy (yes vs. no)
1.257 (0.616, 2.567)
0.53
–
–
Surgical approach (minimally invasive vs. open)
1.309 (0.620, 2.764)
0.48
–
–
Diabetes (yes vs. no)
1.134 (0.536, 2.400)
0.742
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival. Age, cancer type, histologic grade, clinical stage, targeted molecular therapy, molecular type and MATH were included in multivariate analysis. MATH was treated as a continuous variable. MATH, mutant-allele tumor heterogeneity; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table 2
Relationships of clinical variables with overall survival by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Age, years
0.047
0.020
61–70 vs. <60
1.341 (0.529, 3.402)
0.536
0.837 (0.303, 2.314)
0.732
>70 vs. <60
2.742 (1.153, 6.519)
0.022
2.622 (1.087, 6.322)
0.032
Cancer type (II vs. I)
3.447 (1.646, 7.215)
0.001
0.963 (0.330, 2.808)
0.945
Histological grade
0.003
0.113
2 vs. 1
5.141 (1.091, 24.219)
0.038
4.842 (1.022, 22.930)
0.047
3+4 vs. 1
10.199 (2.403, 43.290)
0.002
4.651 (1.026, 21.094)
0.046
Clinical stage (3+4 vs. 1+2)
2.816 (1.390, 5.703)
0.004
3.939 (1.837, 8.445)
<0.001
Targeted molecular therapy (yes vs. no)
2.423 (1.202, 4.887)
0.013
1.357 (0.573, 3.214)
0.487
Molecular type
0.027
0.903
CN low vs. CN high
0.210 (0.077, 0.569)
0.002
0.732 (0.161, 3.319)
0.686
MSI vs. CN high
0.423 (0.181, 0.985)
0.046
1.680 (0.376, 4.980)
0.634
POLE vs. CN high
<0.001 (<0.001, –)
0.98
<0.001 (<0.001, –)
0.979
Not assigned vs. CN high
0.513 (0.149, 1.761)
0.289
1.931 (0.665, 5.606)
0.917
MATH (high vs. low)
2.342 (1.110, 4.942)
0.025
1.172 (0.500, 2.747)
0.715
Race
0.536
–
–
Black vs. White
1.375 (0.526, 3.598)
0.516
Asian vs. White
0.418 (0.056, 3.110)
0.394
Hormonal therapy (yes vs. no)
0.192 (0.026, 1.425)
0.107
–
–
Radiation therapy (yes vs. no)
1.257 (0.616, 2.567)
0.53
–
–
Surgical approach (minimally invasive vs. open)
1.309 (0.620, 2.764)
0.48
–
–
Diabetes (yes vs. no)
1.134 (0.536, 2.400)
0.742
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival. HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table 3
Relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years) by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Cancer type (II vs. I)
6.539 (0.727, 58.806)
0.094
2.150 (0.217, 21.297)
0.513
Histological grade
0.304
–
–
2 vs. 1
3.816 (0.397, 36.696)
0.246
3+4 vs. 1
5.624 (0.627, 50.465)
0.123
Clinical stage (3+4 vs. 1+2)
3.013 (0.753, 12.061)
0.119
1.460 (0.240, 8.873)
0.681
Targeted molecular therapy (yes vs. no)
3.667 (0.875, 15.362)
0.075
3.588 (0.839, 15.334)
0.085
Molecular type
0.849
–
–
CN low vs. CN high
0.449 (0.046, 4.347)
0.489
MSI vs. CN high
1.072 (0.120, 9.605)
0.95
POLE vs. CN high
<0.001 (<0.001, –)
0.987
Not assigned vs. CN high
<0.001 (<0.001, –)
0.989
MATH (high vs. low)
11.688 (1.437, 95.081)
0.022
11.015 (1.352, 89.732)
0.025
Race
0.575
–
–
Black vs. White
3.019 (0.335, 27.177)
0.324
Asian vs. White
1.874 (0.207, 16.950)
0.576
Hormonal therapy (yes vs. no)
0.039 (<0.001, 544.563)
0.505
–
–
Radiation therapy (yes vs. no)
0.304 (0.037, 2.482)
0.266
–
–
Surgical approach (minimally invasive vs. open)
2.473 (0.618, 9.902)
0.201
–
–
Diabetes (yes vs. no)
1.246 (0.297, 5.225)
0.763
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years). UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table S2
Relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years) by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Cancer type (II vs. I)
6.539 (0.727, 58.806)
0.094
2.594 (0.230, 29.295)
0.441
Histological grade
0.304
–
–
2 vs. 1
3.816 (0.397, 36.696)
0.246
3+4 vs. 1
5.624 (0.627, 50.465)
0.123
Clinical stage (3+4 vs. 1+2)
3.013 (0.753, 12.061)
0.119
1.167 (0.162, 8.425)
0.878
Targeted molecular therapy (yes vs. no)
3.667 (0.875, 15.362)
0.075
3.835 (0.900, 16.341)
0.069
Molecular type
0.849
–
–
CN low vs. CN high
0.449 (0.046, 4.347)
0.489
MSI vs. CN high
1.072 (0.120, 9.605)
0.95
POLE vs. CN high
<0.001 (<0.001, –)
0.987
Not assigned vs. CN high
<0.001 (<0.001, –)
0.989
MATH
1.055 (1.014, 1.098)
0.009
1.057 (1.014, 1.102)
0.009
Race
0.575
–
–
Black vs. White
3.019 (0.335, 27.177)
0.324
Asian vs. White
1.874 (0.207, 16.950)
0.576
Hormonal therapy (yes vs. no)
0.039 (<0.001, 544.563)
0.505
–
–
Radiation therapy (yes vs. no)
0.304 (0.037, 2.482)
0.266
–
–
Surgical approach (minimally invasive vs. open)
2.473 (0.618, 9.902)
0.201
–
–
Diabetes (yes vs. no)
1.246 (0.297, 5.225)
0.763
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years). Cancer type, clinical stage, targeted molecular therapy, and MATH were included in multivariate analysis. MATH was treated as a continuous variable. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table S3
Relationships of clinical variables with overall survival in middle patients with UCEC (60< age ≤70 years) by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Cancer type (II vs. I)
2.107 (0.503, 8.820)
0.307
–
–
Histological grade
0.405
–
–
2 vs. 1
6.4E+5 (<0.001, 5E+148)
0.948
3+4 vs. 1
1.8E+6 (<0.001, 1E+149)
0.943
Clinical stage (3+4 vs. 1+2)
1.064 (0.272, 4.159)
0.929
–
–
Targeted molecular therapy (yes vs. no)
6.967 (1.380, 35.166)
0.019
6.967 (1.380, 35.166)
0.019
Molecular type
0.941
–
–
CN low vs. CN high
<0.001 (<0.001, 1.28E+243)
0.964
MSI vs. CN high
0.556 (0.137, 2.261)
0.412
POLE vs. CN high
<0.001 (<0.001, –)
0.983
Not assigned vs. CN high
0.578 (0.069, 4.836)
0.613
MATH (high vs. low)
2.959 (0.628, 13.944)
0.17
1.544 (0.303, 7.880)
0.601
Race
0.881
–
–
Black vs. White
1.488 (0.315, 7.023)
0.615
Asian vs. White
<0.001 (<0.001, –)
0.985
Hormonal therapy (yes vs. no)
0.504 (0.062, 4.103)
0.522
–
–
Radiation therapy (yes vs. no)
1.643 (0.471, 5.729)
0.436
–
–
Surgical approach (minimally invasive vs. open)
1.220 (0.312, 4.774)
0.775
–
–
Diabetes (yes vs. no)
0.565 (0.115, 2.772)
0.482
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in middle patients with UCEC (60< age ≤70 years). Targeted molecular therapy and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table S4
Relationships of clinical variables with overall survival in elder patients with UCEC (age >70 years) by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Cancer type (II vs. I)
2.794 (0.960, 8.136)
0.06
1.508 (0.403, 5.651)
0.542
Histological grade
0.227
–
–
2 vs. 1
4.742 (0.491, 45.761)
0.178
3+4 vs. 1
6.020 (0.775, 46.741)
0.086
Clinical stage (3+4 vs. 1+2)
6.315 (2.057, 19.388)
0.001
8.390 (2.413, 29.167)
0.001
Targeted molecular therapy (yes vs. no)
1.599 (0.480, 5.333)
0.445
–
–
Molecular type
0.474
–
–
CN low vs. CN high
0.389 (0.084, 1.796)
0.226
MSI vs. CN high
0.192 (0.024, 1.513)
0.117
POLE vs. CN high
<0.001 (<0.001, –)
0.991
Not assigned vs. CN high
0.706 (0.148, 3.376)
0.663
MATH (high vs. low)
0.760 (0.267, 2.166)
0.608
–
–
Race
0.929
–
–
Black vs. White
0.745 (0.166, 3.345)
0.701
Asian vs. White
<0.001 (<0.001, –)
0.99
Hormonal therapy (yes vs. no)
0.034 (<0.001, 53.505)
0.369
–
–
Radiation therapy (yes vs. no)
2.557 (0.871, 7.510)
0.088
1.980 (0.628, 6.248)
0.244
Surgical approach (minimally invasive vs. open)
1.283 (0.332, 4.966)
0.718
–
–
Diabetes (yes vs. no)
2.525 (0.843, 7.564)
0.098
3.902 (1.196, 12.738)
0.024
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in elder patients with UCEC (age >70 years). Cancer type, clinical stage, radiation therapy, diabetes and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table S5
Relationships of clinical variables with overall survival in clinical-stage 1–2 patients with UCEC by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Age, years
0.432
–
–
61–70 vs. <60
1.899 (0.554, 6.506)
0.308
>70 vs. <60
2.235 (0.643, 7.775)
0.206
Cancer type (II vs. I)
2.129 (0.594, 7.630)
0.246
–
–
Histological grade
0.082
0.117
2 vs. 1
5.606 (1.188, 26.453)
0.029
5.110 (1.069, 24.431)
0.041
3+4 vs. 1
5.039 (1.067, 23.785)
0.041
4.322 (0.894, 20.895)
0.069
Targeted molecular therapy (yes vs. no)
0.436 (0.153, 1.246)
0.121
1.369 (0.403, 4.648)
0.615
Molecular type
0.147
0.252
CN low vs. CN high
0.275 (0.082, 0.919)
0.036
0.260 (0.052, 1.290)
0.099
MSI vs. CN high
0.261 (0.069, 0.989)
0.048
0.439 (0.104, 1.858)
0.264
POLE vs. CN high
<0.001 (<0.001, –)
0.979
<0.001 (<0.001, –)
0.985
Not assigned vs. CN high
0.801 (0.211, 3.041)
0.744
1.438 (0.287, 7.212)
0.658
MATH (high vs. low)
1.910 (0.736, 4.959)
0.183
1.042 (0.304, 3.570)
0.948
Race
0.894
–
–
Black vs. White
0.614 (0.081, 4.653)
0.637
Asian vs. White
<0.001 (<0.001, –)
0.98
Hormonal therapy (yes vs. no)
0.301 (0.039, 2.303)
0.247
–
–
Radiation therapy (yes vs. no)
1.346 (0.502, 3.608)
0.554
–
–
Surgical approach (minimally invasive vs. open)
2.416 (0.941, 6.200)
0.067
3.279 (1.241, 8.662)
0.017
Diabetes (yes vs. no)
1.887 (0.740, 4.812)
0.183
1.869 (0.699, 4.998)
0.212
Univariate and multivariate Cox regression model were used to evaluate the relationships of variables with overall survival in clinical-stage 1–2 patients with UCEC. Histologic grade, targeted molecular therapy, molecular type, surgical approach, diabetes and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Table S6
Relationships of clinical variables with overall survival in clinical-stage 3–4 patients with UCEC by Cox proportional hazards analysis
Variables
Univariate analysis
Multivariate analysis
HR (95% CI)
P value
HR (95% CI)
P value
Age, years
0.014
0.016
61–70 vs. <60
0.727 (0.163, 3.252)
0.677
0.755 (0.169, 3.375)
0.713
>70 vs. <60
3.897 (1.169,12.993)
0.027
3.895 (1.168, 12.986)
0.027
Cancer type (II vs. I)
2.551 (0.902, 7.219)
0.078
1.385 (0.336, 5.711)
0.653
Histological grade
0.252
–
–
2 vs. 1
0.997 (<0.001, 6.5E+4)
0.999
3+4 vs. 1
41.577 (0.062, 2.8E+5)
0.262
Targeted molecular therapy (yes vs. no)
1.079 (0.340, 3.429)
0.897
–
–
Molecular type
0.625
–
–
CN low vs. CN high
0.184 (0.023, 1.454)
0.108
MSI vs. CN high
0.930 (0.311, 2.781)
0.896
POLE vs. CN high
<0.001 (<0.001, –)
0.989
Not assigned vs. CN high
<0.001 (<0.001, –)
0.991
MATH (high vs. low)
2.452 (0.691, 8.705)
0.165
1.825 (0.469, 7.093)
0.385
Race
1.000
–
–
Black vs. White
1.000 (0.287, 3.482)
1.000
Asian vs. White
1.000 (<0.001, 2.7E+7)
1.000
Hormonal therapy (yes vs. no)
0.041 (<0.001, 167.003)
0.451
–
–
Radiation therapy (yes vs. no)
0.843 (0.299, 2.379)
0.748
–
–
Surgical approach (minimally invasive vs. open)
0.558 (0.126, 2.477)
0.443
–
–
Diabetes (yes vs. no)
0.521 (0.116, 2.334)
0.394
–
–
Univariate and multivariate Cox regression model were used to evaluate the relationships of variables with overall survival in clinic-stage 3–4 patients with UCEC. Age, cancer type and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival. HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years). UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Evaluation of MATH for predictive ability in UCEC patients
The diagnostic accuracy of the MATH score for predicting prognosis was assessed using the AUC. The diagnostic accuracy of the MATH score was higher in UCEC patients ≤60 years old (AUC, 0.756) than in all patients (AUC, 0.635) (). The predictive accuracy of different Cox models was described by the C-index. All variables in the predictive model, including age, cancer type, histological grade, clinical stage, molecular type, and targeted molecular therapy, were considered as known prognostic factors. For younger UCEC patients (age ≤60 years), the C-index of the predictive model was 0.699 (95% CI, 0.482–0.916), and the addition of MATH scores improved the C-index from 0.699 to 0.845 (95% CI, 0.749–0.941). However, there was no difference in C-index of the predictive model with or without MATH scores for all UCEC patients. These results indicate the MATH score is an additional prognostic variable for UCEC patients aged ≤60 years.
Figure 7
Receiver operating characteristic (ROC) curves for the MATH score in all patients (A) and patients ≤60 years old (B). The MATH cutoff value was selected as 21.425 (the median MATH score), 33.560 (the valley level of the MATH distribution), and 40.343 (the most statistically significant calculated cutoff value). The curves show how different choices of MATH cutoff value affect the specificity and sensitivity of outcome prediction by the MATH score. AUC, the area under the ROC curve; MATH, mutant-allele tumor heterogeneity.
Receiver operating characteristic (ROC) curves for the MATH score in all patients (A) and patients ≤60 years old (B). The MATH cutoff value was selected as 21.425 (the median MATH score), 33.560 (the valley level of the MATH distribution), and 40.343 (the most statistically significant calculated cutoff value). The curves show how different choices of MATH cutoff value affect the specificity and sensitivity of outcome prediction by the MATH score. AUC, the area under the ROC curve; MATH, mutant-allele tumor heterogeneity.
Discussion
Recent advances have demonstrated that UCEC is a complex disease characterized by a biologically, clinically, morphologically, and genetically heterogeneous group of tumors. Traditional classifications have not entirely taken this heterogeneity into account and have been limited in predicting responses to therapy, since heterogeneity is prognostic in nature. Despite numerous efforts to improve our understanding of the molecular mechanisms underlying UCEC, its clinical outcome in patients is not satisfactory and varies considerably due to the molecular heterogeneity of UCEC. In this study, we calculated the MATH scores to measure intra-tumor heterogeneity based on publicly available data obtained from TCGA, investigated their correlation with clinical characteristics, somatic mutations, or SCNA, and further explored the potential prognostic value of the MATH score in UCEC. We found that high MATH scores were significantly associated with poor clinical outcomes in UCEC patients, providing direct evidence—based on integrative genomic analysis—that high genetic heterogeneity, as indicated by high MATH scores, is related to short overall survival in UCEC patients.A high MATH score indicates a high percentage of subclonal mutations, which are indicative of high intra-tumor heterogeneity, which also likely make the tumor more aggressive. Studies based on MATH scores have also been reported for other malignant human tumors. In HNSCC, high MATH scores were associated with advanced-stage tumors and shorter overall survival (12,13). Another study suggested that higher MATH scores in breast cancer were associated with tumor stage and triple-negative or basal-like subtypes (14,15). Moreover, high-MATH groups showed a tendency for worse overall survival in breast cancer patients (15). MATH was reported to be an independent risk factor in male colorectal cancer patients (16). Additionally, high MATH scores were correlated with higher risks of metastasis in stage-II and -III colon cancer (25). Recently, the MATH score has been shown to be a new parameter for describing the internal heterogeneity of lung tumors (17). Although the mechanisms underlying the links between high genetic heterogeneity and short overall survival cannot be determined from these data, our present results are consistent with the previous findings, confirming that MATH is a useful measure of intra-tumor heterogeneity and that it may serve as a novel biomarker for therapy selection and survival prediction for some human tumors.A genomic classification of UCEC based on somatic mutations can entirely take into account this heterogeneity and identify potential targets for treatment in different subgroups of the disease (21). The current study found that TP53 was mutated significantly more often in patients with higher MATH scores at different clinical stages, mainly at advanced stages. Survival analysis indicated TP53 mutation was correlated with poor prognosis in UCEC patients. These results confirmed the findings of a previous study, which showed that TP53 was involved in the maintenance of genetic stability, and correlated with poor prognosis in breast cancer (26), lung cancer (27), and colorectal cancer (28). However, in this study, in both the subgroup with TP53 mutations and the TP53 wild-type subgroup, no significant difference in overall survival was found between the high- and low-MATH groups. Interestingly, in UCEC patients with PIK3CA mutations, overall survival was correlated with the MATH score. A similar tendency was found in UCEC patients with wild-type PTEN. These results suggest that the major influence of mutated PIK3CA or wild-type PTEN on the outcomes of UCEC patients may be a result of their tendency to regulate genetic heterogeneity, and even prognosis. Studies have demonstrated that quantification of whole-genome alteration, including the fraction of SCNA and somatic mutation count burden, can now be correlated with clinicopathologic characteristics and the therapeutic response, and may be useful in predicting patient outcomes (29,30). For example, a higher SCNA fraction was associated with advanced pathologic features, including tumor grade and stage, and was independently associated with poor survival in renal cell carcinoma cases from the TCGA database (31). Additionally, a high SCNA fraction was correlated with worse survival and platinum resistance in epithelial ovarian cancer (32). Notably, more SCNA events were independently related to the MATH score in male colorectal cancer patients, which might explain the gender-related prognostic value of the MATH score (16). Our present results demonstrated that the fraction of SCNA, which could contribute to the genetic heterogeneity measured by MATH, was indeed positively correlated with the MATH score in the TCGA UCEC cohort. Interestingly, Noorbakhsh et al. reported that MATH-based clinical predictions were driven by CN aberrations that altered mutant allele frequencies (33). Specifically, we found that the predictive power of the MATH score was robust in UCEC. This further supports the notion that measurement of heterogeneity by MATH was predictive of survival through an association with CN variation. It is yet to be determined whether SCNA can drive intra-tumor heterogeneity.Our results showed that the UCEC patients ≤60 years old had significantly lower MATH scores than older patients, implying that the level of intra-tumor heterogeneity may be dependent on age. Previous reports have shown that the MATH score is similarly associated with adverse outcomes in clinically high-risk patients at advanced stages or with high histological grades, while the other groups have not been found to be associated with age (15,16,25). However, our present study revealed the MATH score is an independent prognostic factor for patients with UCEC who are ≤60 years old. Furthermore, the MATH score had better ROC characteristics, with an AUC above 0.7 in UCEC patients ≤60 years old, and was considered to provide higher predictive accuracy than the current prognostic factors, demonstrating it is a potential prognostic parameter to predict the outcome of UCEC in patients ≤60 years old. This could be explained in several ways. (I) Advanced age was considered to be a risk factor in endometrial carcinoma. However, this was not attributable to the fact that advanced-stage tumors are more likely to be found in older women, who tend to delay seeking medical care in the face of clinical symptoms. (II) Compared with older women, patients of ≤60 years old had an excellent survival and virtually no excess mortality during 5 years of follow-up. It was more helpful to objectively evaluate the prognostic value of different MATH scores. (III) The small number of patients in the cohort likely limited the multivariate analysis in this study.Tumor heterogeneity is a potential cause of low efficacy or failure of therapies and may contribute to the development of drug resistance. As shown in , hormonal therapy resulted in a significant decrease in MATH scores, but the opposite results were found with respect to targeted molecular therapy; no significant differences in MATH score were found for radiation therapy. The findings of the present study indicate that individual treatment strategies may be adapted to tumor heterogeneity based on the MATH score, thereby improving the clinical outcomes of UCEC patients. The MATH score, in combination with clinical and pathological variables, can be used as a novel biomarker to improve our understanding of tumor heterogeneity, contribute to the identification of low-risk or high-risk tumors, and guide personalized treatment decisions.
Conclusions
Our findings suggest that high MATH scores were significantly associated with poor clinical outcomes among UCEC patients; the MATH score was an independent prognostic factor in UCEC patients ≤60 years old, and might therefore help to predict the survival of patients with UCEC. Our study had several limitations. Although much information about clinicopathologic parameters and outcomes was available from the TCGA database, these data were not collected prospectively, and many cases lacked long-term follow-up. The prognostic value of the MATH score needs to be further validated in additional independent cohorts with complete clinical information. Despite these limitations, our results improve our understanding of the quantification of intra-tumor heterogeneity, provide a potential prognostic biomarker for predicting UCEC outcomes, and contribute to the development of personalized therapeutic strategies in UCEC patients.Kaplan-Meier survival curves for all patients from TCGA according to MATH groups, with MATH cutoff of 40.343 (the more statistically significant calculated cutoff value) (A) or MATH cutoff of 33.560 (the valley level of MATH distribution) (B). TCGA, The Cancer Genome Atlas; MATH, mutant-allele tumor heterogeneity.Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival. Age, cancer type, histologic grade, clinical stage, targeted molecular therapy, molecular type and MATH were included in multivariate analysis. MATH was treated as a continuous variable. MATH, mutant-allele tumor heterogeneity; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in younger patients with UCEC (age ≤60 years). Cancer type, clinical stage, targeted molecular therapy, and MATH were included in multivariate analysis. MATH was treated as a continuous variable. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in middle patients with UCEC (60< age ≤70 years). Targeted molecular therapy and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of clinical variables with overall survival in elder patients with UCEC (age >70 years). Cancer type, clinical stage, radiation therapy, diabetes and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of variables with overall survival in clinical-stage 1–2 patients with UCEC. Histologic grade, targeted molecular therapy, molecular type, surgical approach, diabetes and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.Univariate and multivariate Cox regression model were used to evaluate the relationships of variables with overall survival in clinic-stage 3–4 patients with UCEC. Age, cancer type and MATH were included in multivariate analysis. MATH, mutant-allele tumor heterogeneity; UCEC, uterine corpus endometrial carcinoma; HR, hazard ratio; CI, confidence interval; CN, copy number; MSI, microsatellite instability; POLE, polymerase ε.
Authors: Alba Mota; Eva Colás; Pablo García-Sanz; Irene Campoy; Alejandro Rojo-Sebastián; Sonia Gatius; Ángel García; Luis Chiva; Sonsoles Alonso; Antonio Gil-Moreno; Xavier González-Tallada; Berta Díaz-Feijoo; August Vidal; Patrycja Ziober-Malinowska; Marcin Bobiński; Rafael López-López; Miguel Abal; Jaume Reventós; Xavier Matias-Guiu; Gema Moreno-Bueno Journal: Mod Pathol Date: 2016-09-02 Impact factor: 7.842
Authors: Tanya Keenan; Beverly Moy; Edmund A Mroz; Kenneth Ross; Andrzej Niemierko; James W Rocco; Steven Isakoff; Leif W Ellisen; Aditya Bardia Journal: J Clin Oncol Date: 2015-09-14 Impact factor: 44.544
Authors: Dylan E O'Sullivan; Kevin C Johnson; Lucy Skinner; Devin C Koestler; Brock C Christensen Journal: Epigenetics Date: 2016-04-12 Impact factor: 4.528
Authors: Stephanie J Yaung; Christine Ju; Sandeep Gattam; Alan Nicholas; Nicolas Sommer; Johanna C Bendell; Herbert I Hurwitz; John J Lee; Fergal Casey; Richard Price; John F Palma Journal: Cancers (Basel) Date: 2022-04-29 Impact factor: 6.575