| Literature DB >> 29399148 |
Sipeng Shen1,2, Yongyue Wei1, Ruyang Zhang1, Mulong Du1, Weiwei Duan1, Sheng Yang1, Yang Zhao1, David C Christiani2, Feng Chen1,3.
Abstract
Genetic intratumor heterogeneity is associated with tumor occurrence, development and overall outcome. The present study aims to explore the association between mutant-allele fraction (MAF) heterogeneity and patient overall survival in lung cancer. Somatic mutation data of 939 non-small cell lung cancer (NSCLC) cases were obtained from The Cancer Genome Atlas. Entropy-based mutation allele fraction (EMAF) score was used to describe the uncertainty of individual somatic mutation patterns and to further analyze the association with patient overall survival. Results indicated that association between EMAF and overall survival was significant in the discovery set [hazard ratio (H)R=1.62; 95% confidence interval (CI): 1.08-2.41; P=0.018] and replication set (HR=1.63; 95% CI: 1.11-2.37; P=0.011). In addition, EMAF was also significantly different in lung adenocarcinoma and squamous cell carcinoma. Furthermore, a significant difference was indicated in early-stage patients. Results from c-index analysis indicated that EMAF improved the model predictive performance on the 3-year survival beyond that of traditional clinical staging, particularly in early-stage patients. In conclusion, EMAF successfully reflected MAF heterogeneity among patients with NSCLC. Additionally, EMAF improved the predictive performance in early-stage patient prognosis beyond that of traditional clinical staging. In clinical application, EMAF appears to identify a subset of early-stage patients with a poor prognosis and therefore may help inform clinical decisions regarding the application of chemotherapy after surgery.Entities:
Keywords: information entropy; mutant-allele fraction heterogeneity; non-small cell lung cancer; overall survival
Year: 2017 PMID: 29399148 PMCID: PMC5772758 DOI: 10.3892/ol.2017.7428
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Demographic and clinical characteristics of lung cancer patients in The Cancer Genome Atlas.
| Characteristic | Discovery set (n=469) | Replication set (n=470) | P-value |
|---|---|---|---|
| Median survival time (months) | 45.30 | 41.33 | 0.326 |
| Censored rate (%) | 76.33 | 72.97 | 0.260 |
| Age (year) | 65.82±9.72 | 66.29±9.06 | 0.443 |
| Gender | |||
| Male | 268 | 295 | 0.083 |
| Female | 201 | 175 | |
| Race | 0.851 | ||
| White | 370 | 348 | |
| American Indian/Alaska native | 1 | 0 | |
| Asian | 8 | 8 | |
| Black/African American | 28 | 31 | |
| Missing | 62 | 83 | |
| Tobacco history | 0.776 | ||
| Never smoke/quit >15 y | 142 | 146 | |
| Current smoker/quit <15 y | 310 | 318 | |
| Missing | 17 | 16 | |
| Histology type | 0.896 | ||
| Adenocarcinoma | 231 | 229 | |
| Squamous cell carcinoma | 238 | 241 | |
| T classification | 0.657 | ||
| T1 | 140 | 123 | |
| T2 | 251 | 273 | |
| T3 | 54 | 53 | |
| T4 | 22 | 20 | |
| Missing or not available | 2 | 1 | |
| N classification | 0.148 | ||
| N0 | 309 | 290 | |
| N1 | 108 | 102 | |
| N2 | 44 | 61 | |
| N3 | 2 | 5 | |
| Missing or not available | 6 | 12 | |
| M classification | 0.903 | ||
| M0 | 350 | 349 | |
| M1 | 16 | 14 | |
| Missing or not available | 103 | 107 | |
| Clinical stage | 0.826 | ||
| I | 243 | 236 | |
| II | 134 | 130 | |
| III | 74 | 84 | |
| IV | 18 | 20 |
Figure 1.(A) Flow chart of our study design. (B and C) MAF distribution of the discovery set (A) and replication set (B) cases smoothed by kernel density function with a bandwidth of 0.3. Patients were divided into low-EMAF (red) and high-EMAF (blue) groups by the median value. Distribution of MAF values within each patient was represented as a smooth line in the figures.
Figure 2.EMAF and NSCLC patients' overall survival. Survival curves for the discovery set (A) and replication set (B) cases, which were divided into low-EMAF (red) and high-EMAF (blue) groups by the median value. Hazard ratios (HRadjust) and P-values were estimated by Cox regression with adjustment for age, gender, smoking status, clinical stage, T classification, N classification and histology type l.
Cox regression analysis of clinical characteristics and EMAF.
| Discovery set (n=469) | Replication set (n=470) | |||||||
|---|---|---|---|---|---|---|---|---|
| Univariable | Multivariable | Univariable | Multivariable | |||||
| Characteristics | HR (95% CI) | P-value | HR (95% CI) | P-value | HR (95% CI) | P-value | HR (95% CI) | P-value |
| High EMAF | 1.50 (1.03–2.18) | 0.035 | 1.62 (1.08–2.41) | 0.018 | 1.47 (1.04–2.09) | 0.031 | 1.63 (1.11–2.37) | 0.011 |
| Age (per year) | 0.99 (0.97–1.01) | 0.708 | 1.00 (0.98–1.03) | 0.660 | 1.01 (0.99–1.04) | 0.065 | 1.02 (1.00–1.04) | 0.034 |
| Gender (Female) | 1.01 (0.69–1.47) | 0.951 | 0.93 (0.62–1.39) | 0.726 | 0.92 (0.64–1.33) | 0.681 | 0.96 (0.64–1.46) | 0.878 |
| Clinical Stage (per stage) | 1.55 (1.28–1.89) | <0.001 | 1.34 (0.98–1.84) | 0.064 | 1.36 (1.15–1.62) | <0.001 | 0.98 (0.71–1.37) | 0.951 |
| Smoking status (Current smoker/quit <15 y) | 0.79 (0.65–0.96) | 0.021 | 0.82 (0.67–1.00) | 0.058 | 1.04 (0.87–1.24) | 0.615 | 0.96 (0.79–1.16) | 0.681 |
| T classification (per stage) | 1.34 (1.07–1.68) | 0.009 | 1.05 (0.80–1.39) | 0.702 | 1.56 (1.24–1.95) | <0.001 | 1.54 (1.15–2.06) | 0.003 |
| N classification (per stage) | 1.49 (1.18–1.87) | <0.001 | 1.16 (0.84–1.59) | 0.368 | 1.43 (1.16–1.77) | <0.001 | 1.35 (0.97–1.90) | 0.074 |
| Histology type (LUSC) | 0.75 (0.52–1.09) | 0.143 | 0.75 (0.50–1.13) | 0.179 | 1.14 (0.80–1.63) | 0.452 | 1.10 (0.72–1.69) | 0.637 |
EMAF, entropy-based mutation allele fraction; LUSC, lung squamous cell carcinoma; HR, hazard ratio; CI, confidence interval.
Figure 3.Subgroup analysis with histology type and clinical stage. Subgroup survival curves for 460 LUAD (A) and 479 LUSC (B) cases were analyzed. In addition, we also analyzed the early-stage (stage<3) patients in (C) LUAD and (D) LUSC. Further, survival curves were plotted depicting relation of EMAF and clinical stage to overall survival in (E) LUAD and (F) LUSC cases.
Performances of prognostic prediction on 3-year overall survival.
| Death rate, %(n/N) | Prognostic prediction performance, c-index (95%CI) | ||||
|---|---|---|---|---|---|
| Study population | Low-EMAF group | High-EMAF group | EMAF only | Clinical characteristics only[ | Both |
| LUAD | 20.6 (46/223) | 25.3 (60/237) | 0.60 (0.53,0.66) | 0.70 (0.63,0.78) | 0.76 (0.68,0.83) |
| Early-stage | 17.0 (33/193) | 22.2 (36/162) | 0.60 (0.51,0.68) | 0.65 (0.55,0.75) | 0.73 (0.63,0.83) |
| LUSC | 22.2 (57/257) | 33.8 (75/222) | 0.58 (0.53,0.63) | 0.58 (0.53,0.64) | 0.63 (0.57,0.68) |
| Early-stage | 20.7 (45/217) | 30.4 (52/171) | 0.60 (0.54,0.66) | 0.58 (0.51,0.65) | 0.64 (0.57,0.71) |
Clinical variables including age, gender, smoking status, clinical stage, T classification, and N classification. LUSC, lung squamous cell carcinoma; EMAF, entropy-based mutation allele fraction; LUAD, lung adenocarcinoma; CI, confidence interval.