| Literature DB >> 29510676 |
Bao-Feng Jin1, Fan Yang2, Xiao-Min Ying3, Lin Gong1, Shuo-Feng Hu3, Qing Zhao1, Yi-Da Liao2, Ke-Zhong Chen2, Teng Li1, Yan-Hong Tai4,5, Yuan Cao4, Xiao Li2, Yan Huang1, Xiao-Yan Zhan1, Xuan-He Qin1, Jin Wu1, Shuai Chen1, Sai-Sai Guo1, Yu-Cheng Zhang1, Jing Chen1, Dan-Hua Shen6, Kun-Kun Sun6, Lu Chen7, Wei-Hua Li1, Ai-Ling Li1, Na Wang1, Qing Xia1, Jun Wang8, Tao Zhou9.
Abstract
BACKGROUND: Non-small-cell lung cancer (NSCLC) is characterized by abnormalities of numerous signaling proteins that play pivotal roles in cancer development and progression. Many of these proteins have been reported to be correlated with clinical outcomes of NSCLC. However, none of them could provide adequate accuracy of prognosis prediction in clinical application.Entities:
Keywords: Adenocarcinoma; Non-small-cell lung cancer; Prognosis; Protein signature; Squamous cell carcinoma
Mesh:
Substances:
Year: 2018 PMID: 29510676 PMCID: PMC5840771 DOI: 10.1186/s12885-018-4104-4
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Clinical-pathological characteristics of non-small-cell lung cancer patients
| Variable | All patients | BJ cohort | CQ cohort |
|---|---|---|---|
| Age | |||
| ≤60 | 151 (39.32%) | 73 (34.60%) | 78 (45.09%) |
| > 60 | 233 (60.68%) | 138 (65.40%) | 95 (54.91%) |
| Gender | |||
| Male | 270 (70.31%) | 135 (63.98%) | 135 (78.03%) |
| Female | 114 (29.69%) | 76 (36.02%) | 38(21.97%) |
| Smoking index (pack years) | |||
| Never smokers | 142 (36.98%) | 94 (44.55%) | 48 (27.75%) |
| Light smokers (< 20) | 41 (10.68%) | 26 (12.32%) | 15 (8.67%) |
| Heavy smokers (≥20) | 183 (47.65%) | 88 (41.71%) | 95 (54.91%) |
| Unknown | 18 (4.69%) | 3 (1.42%) | 15 (8.67%) |
| Histology | |||
| Adenocarcinoma | 206 (53.65%) | 122 (57.82%) | 84 (48.55%) |
| Squamous cell carcinoma | 178 (46.35%) | 89 (42.18%) | 89 (51.45%) |
| Follow-up (months; median; IQR) | 58 (22–75) | 59 (19–74) | 56 (29–76) |
| Deaths | 175 (45.57%) | 93 (44.07%) | 82 (47.40%) |
| Differentiation grade | |||
| Well | 75 (19.53%) | 45 (21.33%) | 30 (17.34%) |
| Moderate | 140 (36.46%) | 67 (31.75%) | 73 (42.20%) |
| Poor | 138 (35.94%) | 98 (46.45%) | 40 (23.12%) |
| Unknown | 31 (8.07%) | 1 (0.47%) | 30 (17.34%) |
| Pathologic stage | |||
| I | 188 (48.96%) | 95 (45.02%) | 93 (53.76%) |
| II | 101 (26.30%) | 47 (22.28%) | 54 (31.21%) |
| III | 95 (24.74%) | 69 (32.70%) | 26 (15.03%) |
| Adjuvant chemotherapy | |||
| No | 141 (36.72%) | 100 (47.39%) | 41 (23.70%) |
| Yes | 243 (63.28%) | 111 (52.61%) | 132 (76.30%) |
Data are number (%), unless otherwise stated. IQR = interquantile range
Fig. 1Flow chart of signature identification and validation strategy. Abbreviations: BJ cohort, patients from Peking University People’s Hospital; CQ cohort, patients from Southwest Hospital of Chongqing; ADC, adenocarcinoma; SCC, squamous cell carcinoma; IHC, immunohistochemistry; EGFR-TKI, epidermal growth factor receptor tyrosine kinase inhibitor
Fig. 2Identification and validation of the 6-protein signature for adeno-carcinoma (ADC). a The ROC curve of the ADC training set. The cutoff point of prognosis scores is shown. b-c Patients of the training set (b) and testing set (c) were classified into poor- and good-prognosis groups using the 6-protein ADC signature. The Kaplan-Meier estimates of overall survival for the two predicted prognosis groups are shown. (d) The prognosis score distribution, prognosis prediction using the ADC signature, the three-year survival status and the expression profile of the 6-protein signature proteins were summarized on ADC patients of BJ cohort. Each column represents an individual patient. e ADC patients of the independent CQ validation cohort were classified into poor- and good-prognosis groups using the 6-protein ADC signature. The Kaplan-Meier survival curves for the two prognosis groups are shown. f The prognosis score distribution, prognosis prediction using the signature, the three-year survival status and the expression profile of the 6-protein signature proteins of CQ ADC patients are shown
Cox regression analysis of overall survival in the independent CQ validation cohort
| Histology | Variable | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|---|
| HR(95% CI) | HR(95% CI) | ||||
| ADC | Six-protein signaturea | 2.89 (1.52–5.48) | 0.001 | 3.07 (1.29–7.32) | 0.011 |
| Stageb | 2.43 (1.64–3.60) | < 0.001 | 2.43 (1.46–4.03) | 0.001 | |
| Age > 60 years | 1.33 (0.70–2.53) | 0.390 | 1.37 (0.65–2.87) | 0.413 | |
| Tumor size | 1.09 (0.91–1.30) | 0.341 | 1.02 (0.84–1.24) | 0.880 | |
| Smoking index | 1.01 (0.99–1.03) | 0.425 | 0.99 (0.97–1.01) | 0.426 | |
| Chemotherapy | 1.09 (0.52–2.30) | 0.816 | 1.50 (0.61–3.67) | 0.379 | |
| SCC | Five-protein signaturea | 10.84 (4.15–28.31) | < 0.001 | 7.84 (2.88–21.31) | < 0.001 |
| Stageb | 2.85 (1.78–4.57) | < 0.001 | 2.24 (1.26–3.99) | 0.006 | |
| Age > 60 years | 1.73 (0.86–3.50) | 0.127 | 1.48 (0.70–3.14) | 0.307 | |
| Tumor size | 1.15 (1.00–1.32) | 0.058 | 1.02 (0.85–1.23) | 0.838 | |
| Smoking index | 1.00 (0.99–1.01) | 0.994 | 1.00 (0.98–1.02) | 0.659 | |
| Chemotherapy | 1.23 (0.56–2.71) | 0.615 | 1.45 (0.55–3.83) | 0.452 | |
ADC, adenocarcinoma; SCC, squamous cell carcinoma; HR, hazard ratio
aCompared with low-risk group. bModeled as a continuous variable
Fig. 3Identification and validation of the 5-protein signature for squamous cell carcinoma (SCC). a The ROC curve of the SCC training set. The cutoff point of prognosis scores is shown. b-c Patients of the training set (b) and testing set (c) were classified into poor- and good-prognosis groups using the 5-protein SCC signature. The Kaplan-Meier estimates of overall survival for the two predicted prognosis groups are shown. d The prognosis score distribution, prognosis prediction using the signature, the three-year survival status and the expression profile of the 5-protein signature proteins were summarized on SCC patients of BJ cohort. Each column represents an individual patient. e SCC patients of the independent CQ validation cohort were classified into poor- and good-prognosis groups using the 5-protein SCC signature. The Kaplan-Meier survival curves for the two prognosis groups are shown. f Prognosis score distribution, prognosis prediction using the SCC protein signature, patient survival status and the expression profile of the 5-protein signature proteins of CQ SCC patients are shown
Fig. 4Analysis of adjuvant chemotherapy benefit based on the 6-protein adenocarcinoma (ADC) signature. Kaplan-Meier estimates of overall survival for the patients at stages IB, II and IIIA with or without adjuvant chemotherapy in the good-prognosis (a) and poor-prognosis (b) group was analyzed
Fig. 5Permutation validation and enrichment analysis. In permutation validation, ten thousand protein combinations were generated randomly. The model with each combination was trained on the training set using SVM. The ability of each combination to separate the testing set into good- and poor-prognosis groups was evaluated using the log-rank test. Histograms of the χ2 value from the log-rank test on ADC testing set (a) and SCC testing set (b) were illustrated. The x axis indicates the χ2 value. A larger χ2 value indicates a lower P value and a more statistically significant ability to separate the testing set. The y axis shows the frequency and higher values indicate a larger fraction of the population. The performance of the ADC/SCC signature is marked with a red arrow. In enrichment analysis, ten thousand signatures were identified on 10,000 randomly partitioned training sets using random forest algorithm. For each protein, the fraction of the signatures containing the protein (i.e. percentage of subsets) in ADC (c) and SCC (d) patients from BJ cohort was calculated. A zoom-in on the 15 most enriched proteins is also shown. Each column corresponds to a protein, the signature proteins are denoted in red