Literature DB >> 36248337

Development and validation of a polygenic hazard score to predict prognosis and adjuvant chemotherapy benefit in early-stage non-small cell lung cancer.

Dan-Hua Li1, Yong-Qiao He1, Tong-Min Wang1, Wen-Qiong Xue1, Chang-Mi Deng1, Da-Wei Yang2, Wen-Li Zhang1, Zi-Yi Wu1, Lian-Jing Cao1, Si-Qi Dong1, Yi-Jing Jia2, Lei-Lei Yuan2, Lu-Ting Luo2, Yan-Xia Wu1, Xia-Ting Tong2, Jiang-Bo Zhang1, Mei-Qi Zheng1, Ting Zhou1,3, Xiao-Hui Zheng1,3, Xi-Zhao Li1,3, Pei-Fen Zhang1,3, Shao-Dan Zhang1,3, Ye-Zhu Hu1,3, Xun Cao1, Xin Wang1, Wei-Hua Jia1,2,3.   

Abstract

Background: It remains controversial who would benefit from adjuvant chemotherapy (ACT) in patients with early-stage non-small cell lung cancer (NSCLC). We aim to construct a polygenic hazard score (PHS) to predict prognosis and ACT benefit among NSCLC patients.
Methods: We conducted a retrospective study including 1,395 stage I-II NSCLC patients. We performed a genome-wide association study (GWAS) on overall survival (OS) in patients treated with ACT (SYSUCC ACT set, n=404), and then developed a PHS using LASSO Cox regression in a random subset (training, n=202) and tested it in the remaining set (test, n=202). The PHS was further validated in two independent datasets (SYSUCC surgery set, n=624; PLCO cohort, n=367).
Results: The GWAS-derived PHS consisting of 37 single-nucleotide polymorphisms (SNPs) was constructed to classify patients into high and low PHS groups. For patients treated with ACT, those with low PHS had better clinical outcomes than high PHS (test set: HR =0.21, P<0.001; PLCO ACT set: HR =0.33, P=0.260). Similar results were found in the extended validation cohorts including patients with or without ACT (SYSUCC: HR =0.48, P<0.001; PLCO: HR =0.60, P=0.033). Within subgroup analysis by treatment or clinical factors, we further observed consistent results for the prognostic value of the PHS. Notably, ACT significantly improved OS in stage II patients with low PHS (HR =0.26, P<0.001), while there was no ACT survival benefit among patients with high PHS (HR =0.97, P=0.860). Conclusions: The PHS improved prognostic stratification and could help identify patients who were most likely to benefit from ACT in early-stage NSCLC. 2022 Translational Lung Cancer Research. All rights reserved.

Entities:  

Keywords:  Non-small cell lung cancer (NSCLC); adjuvant chemotherapy (ACT); benefit; polygenic hazard score (PHS); predict

Year:  2022        PMID: 36248337      PMCID: PMC9554676          DOI: 10.21037/tlcr-22-139

Source DB:  PubMed          Journal:  Transl Lung Cancer Res        ISSN: 2218-6751


Introduction

Lung cancer is the leading cause of cancer-related deaths in China and around the world (1), and non-small cell lung cancer (NSCLC) accounts for about 85% of lung cancer (2). Surgical resection is considered the preferred treatment for stage I-II NSCLC patients but tumor recurrence and metastasis remain the main cause of treatment failure. Currently, adjuvant chemotherapy (ACT) has not been routinely recommended for stage I patients especially in stage IB, since its benefit remains undetermined (3). The National Comprehensive Cancer Network (NCCN) guidelines recommend that ACT is an appropriate option for stage IB patients with high-risk factors such as a poorly differentiated tumor, vascular invasion, wedge resection, tumor size greater than 4 cm, and pleural vascular invasion (4). However, the American Society of Clinical Oncology (ASCO) guidelines do not recommend ACT for stage IB patients (5). Moreover, although ACT is recommended universally without any risk stratification for stage II patients, the overall benefit of ACT is limited (6,7) and not all patients derive survival benefit from it (8-10). Therefore, the current tumor-node-metastasis (TNM) stage has limitations in predicting treatment response and guiding ACT application, and there is an urgent need to develop additional predictors to predict who would benefit from ACT. Previous studies have found that genetic variation was associated with lung cancer predisposition, treatment response, and disease progression (11-13). Genetic variations involved in therapeutic drugs pharmacokinetics and pharmacodynamics, which may partially account for inter-individual differences in chemotherapy benefits (14). It is well known that the DNA repair pathway is an important signaling pathway in chemotherapy response (15-17). P53 and PI3K/PTEN/AKT pathways have also been reported to be associated with treatment response, survival, and toxicity in NSCLC patients treated with chemotherapy (18,19). In addition to candidate gene strategies, genome-wide association studies (GWAS) had also identified several single-nucleotide polymorphisms (SNPs) associated with the survival of NSCLC patients treated with chemotherapy (20-22). These researches indicated that the genetic variations may play an important role in the prognosis and suggested individual genetics as a potential biomarker for individualized therapeutic decision-making. However, the difficulty in applying genetic data to the clinical utility is that the effect of a single variant is relatively small, and is not informative enough for predicting prognosis. The polygenic risk score, combining the genotype dosage of multiple SNPs by their respective weight, has emerged as the main approach for predicting the genetic component of a specific outcome (23-27), and recent studies have developed GWAS-derived polygenic scores using SNPs to predict clinical survivals and treatment response (28-30). To the best of our knowledge, the association between polygenic score and prognosis of early-stage NSCLC patients has not been studied and their applications remain to be explored. In this study, we developed and validated a polygenic hazard score (PHS) to estimate the clinical outcomes for patients with early-stage NSCLC. Further, we investigated whether the PHS could identify patients who would benefit most from ACT. We present the following article in accordance with the TRIPOD reporting Checklist (available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-22-139/rc).

Methods

Patients and study design

We conducted a retrospective study of patients with stage I–II NSCLC. Participants were recruited from two populations described before: the Guangzhou GSA GWAS (11) and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening (PLCO) trial (31). The patients in Guangzhou GSA GWAS were collected from Sun Yat-sen University Cancer Center (Guangzhou, China) between January 2006 and December 2017. The PLCO Trial was a randomized control study that included ~155,000 volunteers aged 55–74 at enrollment between 1993 and 2011 from 10 medical centers in the United States. The flow chart of the patients is represented in . Eligible patients were aged 18 years or older with histologically confirmed stage I or II NSCLC, radical surgery, and complete clinical and follow-up data. The exclusion criteria included previous or concurrent malignant disease and neoadjuvant treatments for NSCLC. Ultimately, 404 patients who treated with surgery plus ACT (SYSUCC ACT set), with 388/404 (96%) of these patients receiving platinum-based ACT, and 624 patients who treated with surgery alone (SYSUCC surgery set) were included in this study from Guangzhou GSA GWAS. The SYSUCC ACT set was randomly divided into a training set (n=202) and a test set (n=202). The training set was used to construct the PHS, and the test set and SYSUCC surgery set were used as validation cohort (SYSUCC validation cohort, n=826). Eligible patients from PLCO Trail were used as an external validation cohort (PLCO validation cohort, n=367). The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The Institutional Review Board of Sun Yat-sen University Cancer Center approved this study (Approval No. B2022-131-01). Because of the retrospective nature of the study, the requirement for informed consent was waived.
Figure 1

Schematic illustrating the use of the study datasets in this study. GSA, Global Screening Array; GWAS, genome-wide association study; PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; ACT, adjuvant chemotherapy; SYSUCC, Sun Yat-sen University Cancer Center.

Schematic illustrating the use of the study datasets in this study. GSA, Global Screening Array; GWAS, genome-wide association study; PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; ACT, adjuvant chemotherapy; SYSUCC, Sun Yat-sen University Cancer Center.

Clinical and follow-up information

For participants from the Guangzhou GSA GWAS, patients’ clinical information, including sex, age at diagnosis, smoking status, family history of cancer, histologic grade, EGFR mutation status, TNM stage, and treatments were reviewed from the medical records. These patients were followed up by outpatient visits or telephone contacts as the clinicians recommended (last follow-up in September 2020). The survival status of the patients was obtained from the follow-up department of the hospital and the death registration at the public security bureau. The PLCO Trial gathered clinical information from the first screening visit, such as sex, age, family history, and smoking history. The patients were followed up for more than 13 years after clinical enrollment (32). The endpoint of the study was overall survival (OS), which was defined as the date of surgery to the date of death or the last day of follow-up.

Genotyping, quality control, and imputation

For patients from the Guangzhou GSA GWAS, the methods of DNA extraction, genotyping, and imputation process have been detailed described (11). Briefly, DNA was extracted from peripheral blood and SNP genotyping was detected by Illumina Infinium® Global Screening Array (GSA) v1.0. Quality control at sample level and SNP level have been done according to the criteria described in the previous study (11). Qualified genotypes were imputed by a two-stage imputation approach, using SHAPEIT2 (33) for phasing and IMPUTE2 for imputation (34) and the Phase III 1000 Genomes Project was set as the reference. For patients from PLCO Trial, genomic DNA extracted from the blood samples was genotyped with Illumina HumanHap240Sv1.0, HumanHap300v1.1, HumanHap550v3.0 and Human610-Quadv1_B (dbGaP accession: phs000093.v2.p2 and phs000336.v1.p1) (35,36). We conducted similar quality control and imputation methods as we did in the Guangzhou GSA GWAS. In brief, we excluded samples with genotype completion rates <95%, gender discrepancy, familial relationships, extreme heterozygosity rates (>6 SD), or population outliers defined by principal component analysis (PCA) via EIGENSTRAT. Then, we excluded SNPs with call rates <95%, minor allele frequencies (MAFs) <0.01, or P<10-12 in testing of Hardy-Weinberg equilibrium (HWE). We performed quality control filtering by using PLINK. Qualified genotypes were imputed by a two-stage imputation approach, using SHAPEIT2 for phasing and IMPUTE2 for imputation and using the Phase III 1000 Genomes Project (Europeans) as the reference.

PHS construction

Analysis was restricted to the 2,943,474 SNPs shared between Guangzhou GSA GWAS and PLCO datasets. The PHS was created in three steps. Firstly, we conducted a GWAS to identify variants associated with OS in the SYSUCC ACT set. The GWAS was performed using the multivariate cox regression model with adjustment for sex (male or female), age (≤59 or >59 years, median age as the cutoff), smoking status (never smoking, quitting smoking, or current smoking), histology (lung adenocarcinoma, lung squamous cell carcinoma or other types), grade (G1 or G2 or G3), TNM stage (stage I or II), and the top three principal components. We performed LD pruning (r2>0.3) to keep the independent SNPs and selected candidate SNPs with P<1.0×10−3 from the GWAS results. Secondly, to avoid overfitting, we used the least absolute shrinkage and selection operator (LASSO) model to select the most important variants in the training set and the 1 standard error of the minimum criteria (the 1-SE) were used as criteria. This analysis was performed ten times using ten different random number seeds and only SNPs selected by all ten models were retained to calculate the PHS. Thirdly, the PHS was calculated by LASSO Cox model using the sum of values weighted by the coefficients of the most discriminative SNPs. We calculated PHS for each participant included in this study and the median PHS in the training set was used as the cut-off value to split patients into low PHS or high PHS groups.

Statistical analysis

The GWAS was performed with the “gwasurvivr” package of R software (37). The LASSO method was used to construct the PHS using the “glmnet” package (38). Survival analysis was conducted using the R package “survival” and “survminer”. Log-rank test was used to compare the survival time and survival curves were estimated using the Kaplan–Meier method. The association of PHS and other clinical factors with OS was assessed using univariate and multivariable Cox regression models. Hazard ratios (HR) and corresponding 95% confidence interval (CI) were evaluated according to Cox proportional hazard models. In propensity score matching analyses (39), patients in the surgery alone group were matched to the ACT plus surgery group with a 1:1 matching based on the sex, age, smoking status, histology, grade, and TNM stage. All statistical tests were two-sided and considered significant when the P was less than 0.05. All analyses were performed in R software, version 3.6.3.

Results

Patient clinical characteristics

One thousand and three hundred ninty-five eligible NSCLC patients were included in this study, including 202 patients in the training set, 826 patients in SYSUCC validation cohort, and 367 patients in PLCO validation cohort. The median follow-up time was 96.8 months [interquartile range (IQR), 78.5–113.1 months] for the training set, 96.4 months (IQR, 81.5–113.4 months) for SYSUCC validation cohort, and 171.0 months (IQR, 128.5–207.5 months) for PLCO validation cohort. For the above three populations, five-year survival (5-year OS) rates were 71.1%, 74.5%, and 74.1%, respectively. The baseline characteristics of the participants included in this study were summarized in . Most of the clinical characteristics were significantly different (P<0.05) between the study sets, which may partly be due to the different study designs of the original study cohorts ().
Table 1

Baseline characteristics of patients in this study

CharacteristicsAll patients, n=1,395Training set (n=202)SYSUCC validation cohort (n=826)PLCO validation cohort (n=367)P
Sex0.083
   Male809 (58.0%)123 (60.9%)491 59.4%)195 (53.1%)
   Female586 (42.0%)79 (39.1%)335 (40.6%)172 (46.9%)
Age (year)<0.001*
   ≤59503 (36.1%)112 (55.4%)364 (44.1%)27 (7.4%)
   >59892 (63.9%)90 (44.6%)462 (55.9%)340 (92.6%)
Family cancer history<0.001*
   No954 (68.4%)160 (79.2%)650 (78.7%)144 (39.3%)
   Yes440 (31.6%)42 (20.8%)176 (21.3%)222 (60.7%)
Smoking<0.001*
   Never571 (40.9%)102 (50.5%)428 (51.8%)41 (11.2%)
   Ever301 (21.6%)26 (12.9%)145 (17.6%)130 (35.4%)
   Current523 (37.5%)74 (36.6%)253 (30.6%)196 (53.4%)
Histology<0.001*
   AD891 (63.9%)138 (68.3%)546 (66.1%)207 (56.4%)
   SCC378 (27.1%)41 (20.3%)228 (27.6%)109 (29.7%)
   Other126 (9.0%)23 (11.4%)52 (6.3%)51 (13.9%)
EGFR<0.001*
   Wild type397 (28.5%)82 (40.6%)315 (38.1%)0 (0.00%)
   Mutation262 (18.8%)57 (28.2%)205 (24.8%)0 (0.00%)
   Unknown736 (52.8%)63 (31.2%)306 (37.0%)367 (100%)
Grade<0.001*
   G1634 (45.4%)111 (55.0%)380 (46.0%)143 (39.0%)
   G2524 (37.6%)81 (40.1%)312 (37.8%)131 (35.7%)
   G3187 (13.4%)10 (5.0%)114 (13.8%)63 (17.2%)
   Unknown50 (3.6%)0 (0.0%)20 (2.4%)30 (8.17%)
TNM stage<0.001*
   I980 (70.3%)90 (44.6%)582 (70.5%)308 (83.9%)
   II415 (29.7%)112 (55.4%)244 (29.5%)59 (16.1%)
ACT<0.001*
   Without935 (67.0%)0 (0.0%)624 (75.5%)311 (84.7%)
   With460 (33.0%)202 (100%)202 (24.5%)56 (15.3%)
Radiotherapy<0.001*
   Without1,350 (96.8%)189 (93.6%)815 (98.7%)346 (94.3%)
   With45 (3.23%)13 (6.44%)11 (1.33%)21 (5.72%)
Survival status<0.001*
   Alive784 (56.2%)115 (56.9%)527 (63.8%)142 (38.7%)
   Dead611 (43.8%)87 (43.1%)299 (36.2%)225 (61.3%)

†, other subtypes include large cell, adenosquamous, sarcomatoid, basaloid, and unclassifiable NSCLC. *, P<0.05. AD, lung adenocarcinoma; SCC, lung squamous cell carcinoma; EGFR, epidermal growth factor receptor; ACT, adjuvant chemotherapy.

†, other subtypes include large cell, adenosquamous, sarcomatoid, basaloid, and unclassifiable NSCLC. *, P<0.05. AD, lung adenocarcinoma; SCC, lung squamous cell carcinoma; EGFR, epidermal growth factor receptor; ACT, adjuvant chemotherapy. We investigated which clinical factors were associated with OS using a Cox regression model. In univariate and multivariate analyses (Table S1), the independent prognostic factors were sex (adjusted HR =0.64, 95% CI: 0.51–0.79, P<0.001), age (adjusted HR =1.68, 95% CI: 1.35–2.08, P<0.001), tumor grade (adjusted HR =0.90, 95% CI: 0.74–1.10, P=0.299 for grade G2; adjusted HR =0.54, 95% CI: 0.40–0.74, P<0.001 for grade G3), and TNM stage (adjusted HR =1.76, 95% CI: 1.43–2.19, P<0.001), which were adjusted as covariates in following analyses.

PHS construction and validation

To build a PHS for predicting ACT treatment outcomes, we performed a GWAS among patients who received surgery plus ACT (SYSUCC ACT set, n=404) to identify variants associated with clinical outcomes following ACT (). GWAS identified 125 SNPs with P<1.0×10−3 and r2 threshold <0.3. The 37 most discriminative variants selected by LASSO analyses were used to derive a PHS in the training set (; Table S2). The median PHS of 2.237 in the training set was used as the cutoff to divide each participant into high- and low-PHS subgroups. We then analyzed the correlation between PHS groups and clinical characteristics and found most of them were not associated with PHS, except for family cancer history (Table S3).
Figure 2

The construction of PHS. (A) Manhattan plot of P values derived from GWAS. The red horizontal line indicates suggestive level (P=1.0×10−3). (B) Partial likelihood deviance for LASSO coefficient profiles, the two vertical dotted lines are shown at the optimal values by minimum criteria (right) and 1-SE criteria (left). (C) LASSO coefficient profiles of selected SNPs. Thirty-seven SNPs remained with their nonzero LASSO coefficients by 1-SE criteria (left). Kaplan-Meier plot for (D) Testing set (n=202); (E) PLCO ACT set (n=56). P values comparing PHS groups were calculated with the log-rank test. Hazard ratios (HRs) and 95% CIs were for low vs. high PHS in univariate COX regression analyses. PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; PHS, Polygenic Hazard Score; ACT, adjuvant chemotherapy.

The construction of PHS. (A) Manhattan plot of P values derived from GWAS. The red horizontal line indicates suggestive level (P=1.0×10−3). (B) Partial likelihood deviance for LASSO coefficient profiles, the two vertical dotted lines are shown at the optimal values by minimum criteria (right) and 1-SE criteria (left). (C) LASSO coefficient profiles of selected SNPs. Thirty-seven SNPs remained with their nonzero LASSO coefficients by 1-SE criteria (left). Kaplan-Meier plot for (D) Testing set (n=202); (E) PLCO ACT set (n=56). P values comparing PHS groups were calculated with the log-rank test. Hazard ratios (HRs) and 95% CIs were for low vs. high PHS in univariate COX regression analyses. PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; PHS, Polygenic Hazard Score; ACT, adjuvant chemotherapy. To test whether PHS can predict clinical outcomes following ACT, we investigated the impact of PHS groups on clinical outcomes among those who received ACT. Low-PHS was significantly associated with prolonged OS in test set (HR =0.21, 95% CI: 0.12–0.36, P<0.001; ). The 5-year OS was 90.5% (95% CI: 84.7–96.6) for the low-PHS group and 60.3% (95% CI: 51.3–70.8) for the high-PHS group. We further validated the PHS in an independent population (PLCO ACT set, n=56) and a similar trend was observed (HR =0.33, 95% CI: 0.04–2.48, P=0.260; ), although the results did not reach statistical significance likely due to the limited sample size. When including sex, age, grade, and TNM stage in a multivariate Cox regression analysis, the PHS still had an independent association with ACT outcomes (Table S4). These results suggested that PHS was associated with ACT outcomes and could affect patients’ survival possibly by modulating the ACT response.

Prognostic value for PHS

In addition to the significant associations between PHS and treatment outcomes following ACT as described above, we further assessed whether PHS was a stable prognostic biomarker independent of treatment strategies and other clinical factors. We firstly assessed whether PHS has a reliable prognostic value regardless of the treatment strategy (such as for patients treated with surgery alone). We observed similar results among the patients with surgery alone that the low PHS patients would have better OS than high PHS patients (SYSUCC surgery alone: HR =0.63, 95% CI: 0.48–0.84, P<0.001; PLCO surgery alone: HR =0.63, 95% CI: 0.38–1.03, P=0.063; Combined: HR =0.63, 95% CI: 0.50–0.80, P<0.001; ). We also observed consistent results in the total population for patients with or without ACT (SYSUCC: HR =0.48, 95% CI: 0.38–0.62, P<0.001; PLCO: HR =0.60, 95% CI: 0.37–0.96, P=0.033; Combined: HR =0.52, 95% CI: 0.42–0.65, P<0.001; ).
Figure 3

Kaplan-Meier and forest plot of subgroup analysis for PHS in validation cohorts. (A) Kaplan-Meier plot for patients treated with surgery alone in SYSUCC (n=624, left), PLCO (n=311, middle), and the combined (n=935, right) surgery alone sets. (B) Kaplan-Meier plot for total patients treated with ACT or surgery alone in SYSUCC (n=826, left), PLCO (n=367, middle), and the combined (n=1,193, right) validation cohorts. P values comparing PHS groups were calculated with the log-rank test. Hazard ratios (HRs) and 95% CIs were for low vs. high PHS in univariate COX regression analyses. (C) Forest plot of HRs for PHS in different subgroups stratified by clinical parameters in the above three cohorts. HRs and 95% CIs were tested in multivariate Cox regression analyses adjusting for sex, age, grade, TNM stage, and ACT. SYSUCC, Sun Yat-sen University Cancer Center; PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; PHS, Polygenic Hazard Score; ACT, adjuvant chemotherapy; AD, lung adenocarcinoma; SCC, lung squamous cell carcinoma; EGFR, epidermal growth factor receptor.

Kaplan-Meier and forest plot of subgroup analysis for PHS in validation cohorts. (A) Kaplan-Meier plot for patients treated with surgery alone in SYSUCC (n=624, left), PLCO (n=311, middle), and the combined (n=935, right) surgery alone sets. (B) Kaplan-Meier plot for total patients treated with ACT or surgery alone in SYSUCC (n=826, left), PLCO (n=367, middle), and the combined (n=1,193, right) validation cohorts. P values comparing PHS groups were calculated with the log-rank test. Hazard ratios (HRs) and 95% CIs were for low vs. high PHS in univariate COX regression analyses. (C) Forest plot of HRs for PHS in different subgroups stratified by clinical parameters in the above three cohorts. HRs and 95% CIs were tested in multivariate Cox regression analyses adjusting for sex, age, grade, TNM stage, and ACT. SYSUCC, Sun Yat-sen University Cancer Center; PLCO, the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial; PHS, Polygenic Hazard Score; ACT, adjuvant chemotherapy; AD, lung adenocarcinoma; SCC, lung squamous cell carcinoma; EGFR, epidermal growth factor receptor. We further assessed the prognostic value of the PHS within each clinical subgroup of patients stratified by TNM stage, sex, age, smoking status, histology, EGFR mutation status, and grade in the validation cohorts. Kaplan-Meier survival analyses showed significant differences in OS between patients with different PHS subgroups in all subgroups (Figure S1). We next performed multivariate Cox regression analyses and still observed consistent results (). Compared with patients with high PHS, patients with low PHS had longer OS in all subgroup analyses (). These results suggest that PHS shows potential as a stable prognostic biomarker independent of treatment strategies and clinical factors.

Stage II patients with low-PHS benefit most from ACT

Consistent with previous clinical studies, compared with patients treated with surgery alone, stage II patients could achieve additional benefit from ACT, while stage I patients did not (stage I: HR =1.06, 95% CI: 0.78–1.44, P=0.720; stage II: HR =0.66, 95% CI: 0.48–0.92, P=0.022; ). Of note, the survival benefit from ACT is moderate in stage II patients, with 12.3% 5-year OS rates improved (ACT: 68.3% vs. surgery alone: 56.0%).
Figure 4

Kaplan-Meier and multivariate analyses of overall survival by treatment in the combined validation cohort. (A) Kaplan-Meier plots for stage I patients for all patients (n=890, left), low-PHS group (n=280, middle), and high-PHS group (n=610, right). (B) Kaplan-Meier plots for stage II patients for all patients (n=303, left), low-PHS group (n=121, middle), and high-PHS group (n=182, right). P values were calculated with the log-rank test. HRs and 95% CIs for surgery plus ACT vs. surgery alone were tested in univariate COX regression analyses. (C) Forest plots of HRs of ACT for stage I (n=890, left), and for stage II (n=303, right). HRs and 95% CIs were tested in multivariate Cox regression analyses adjusting for sex, age, grade, and TNM stage. PHS, Polygenic Hazard Score; HR, hazard ratio; ACT, adjuvant chemotherapy.

Kaplan-Meier and multivariate analyses of overall survival by treatment in the combined validation cohort. (A) Kaplan-Meier plots for stage I patients for all patients (n=890, left), low-PHS group (n=280, middle), and high-PHS group (n=610, right). (B) Kaplan-Meier plots for stage II patients for all patients (n=303, left), low-PHS group (n=121, middle), and high-PHS group (n=182, right). P values were calculated with the log-rank test. HRs and 95% CIs for surgery plus ACT vs. surgery alone were tested in univariate COX regression analyses. (C) Forest plots of HRs of ACT for stage I (n=890, left), and for stage II (n=303, right). HRs and 95% CIs were tested in multivariate Cox regression analyses adjusting for sex, age, grade, and TNM stage. PHS, Polygenic Hazard Score; HR, hazard ratio; ACT, adjuvant chemotherapy. Since the low PHS was significantly associated with longer OS in patients treated with ACT, we hypothesized that the low PHS was associated with better ACT benefits. To verify this conjecture, we next used the PHS to evaluate the survival benefit of ACT. When stratified by PHS, for stage I NSCLC patients, ACT did not exhibit additional OS benefit, regardless of PHS subgroups (). Interestingly, for stage II patients with low PHS, those who received ACT experienced significantly longer OS (HR =0.26, 95% CI: 0.12–0.58, P<0.001; ) than those who received surgery alone, with 24.7% 5-year OS rate improved (ACT: 90.4% vs. surgery: 65.7%). However, among patients with high PHS, patients who received ACT showed similar OS to those who underwent surgery alone (HR =0.97, 95% CI: 0.68–1.38, P=0.860; ). Multivariate analyses showed similar results (). The HRs of ACT were lower (more protective) in all the low-PHS groups than in the corresponding high-PHS groups. These results indicated that the addition of ACT to surgery was more likely to provide a survival benefit in stage II patients with low PHS. We used propensity score matching to match the patients with and without ACT. After matching, 516 patients were included (258 in each group) and no statistically significant difference was found in any characteristics (Table S5). Similar results were obtained in the matched cohort. Notably, ACT significantly improved OS in stage II patients with low PHS (HR =0.26, 95% CI: 0.11–0.58, P<0.001), while there was no ACT survival benefit among patients with high PHS (HR =0.84, 95% CI: 0.57–1.22, P=0.360; Figure S2). Therefore, the ACT decision should be carefully made based on the TNM stage and PHS as well.

Discussion

Accurate prediction of clinical outcome and treatment response is critical for prognostic stratification and treatment decision-making for NSCLC patients. In this study, we developed and validated a GWAS-based PHS to predict clinical outcomes and ACT benefit in early-stage NSCLC. The PHS was significantly associated with clinical outcomes and could be used to identify candidates for ACT. The TNM stage has been widely used to predict prognosis and guide ACT in NSCLC patients (4). For patients with stage I NSCLC, the prognosis is generally favorable and they are likely to be cured by surgery alone (4). For stage II NSCLC, they were thought at a high risk of recurrence, and ACT was recommended by NCCN guidelines (4). However, the anatomically based TNM stage judge the risk of disease progression mainly according to tumor invasion without considering the other factors associated with individuals’ heterogeneity, such as the genetic background. Our study demonstrated that the PHS could complement the TNM stage by providing additional predictive information. The PHS could classify patients with different long-term clinical outcomes independently of the TNM stage and other clinical factors. More importantly, the PHS showed predictive value for ACT benefit and the addition of ACT to surgery was more likely to provide a survival benefit in stage II patients with low PHS. Hence, A key potential utility of the work is to use the PHS to guide individualized treatment plans. For instance, for stage II patients with low PHS, the ACT is strongly recommended because they can derive considerable survival benefits from the ACT. However, for stage II patients with high PHS, the ACT might not be the optimal adjuvant therapy, and alternative treatment should be received as soon as possible to catch the optimal treatment timing. Several other predictive signatures for ACT benefit prediction have been developed in previous research, such as tumor tissue-based transcriptional signatures (40-42), blood-based circulating tumor DNA signatures (43), and radiomic features (44,45). However, few studies have been made on polygenic scores for prediction prognosis and treatment response in NSCLC. Investigation into the genetic features of the individuals could give additional insight into biological rationales for prognosis and drug response. Interestingly, several SNPs included in the PHS have been reported associated with disease progression, implying the potential role of the susceptibility genes involved in NSCLC development and survival. For example, rs17080884, located at chromosome 5q35.3 within the intergenic of TRIM7 and MIR463, is associated with the survival of NSCLC patients (for T allele: OR =2.18, P=8.75×10−6). The expression quantitative trait locus (eQTL) from GTEX.v8 data suggested that rs17080884-T allele is associated with higher expression of TRIM7 expression in several tissues, indicating the potential regulatory effect of rs17080884 on TRIM7. TRIM7 is a member of the tripartite motif family and its overexpression was associated with poor prognosis in osteosarcoma (46). Another study found that the high expression of TRIM7 increased lung tumor burden via increasing tumor growth in a Ras-driven cancer model (47). Another variant rs3857953 located at intronic region of MTSS1, was associated with OS (for C allele: OR =0.57, P=2.29×10−4). The eQTL analysis suggested that rs3857953-C allele is associated with higher expression of MTSS1 expression in the artery aorta. MTSS1 is a latent metastasis suppressor gene and has been reported to inhibit prostate cancer cell migration and proliferation (48), and also plays a role in the invasion and metastasis of lung cancer cells (49). The collective evidence suggested that these variants may be associated with the prognosis of NSCLC via the regulation effect on gene expression. We think this work provides more evidence to verify the concept that treatment recommendation strategy can be further optimized with genetic data. However, the biological mechanism of PHS affecting the treatment outcome has not been elucidated and further investigations into the SNPs functions might provide possible treatment targets. The potential benefits of polygenic scores include predicting disease risk (11), stratifying patients (50), and delivering personalized treatment (29,30). Despite these potential benefits, there are potential risks that should be acknowledged. First of all, the construction of a standard and robust genetic score is an important premise of polygenic scores applications. In our study, The GWAS population had strict inclusion and exclusion criteria, as a subgroup from a larger study cohort (Guangzhou GSA GWAS). Patients in the GWAS cohort received the same treatment region (surgery plus ACT) and the homogenous population allowed us to focus on genetic variants with fewer biases. Second, it is very important that test the polygenic score performance in an independent dataset so that its reproducibility can be validated. In the present study, the performance of the PHS was firstly validated in an independent dataset of SYSUCC surgery set and subsequently in PLCO cohort, which proceeds from a population-based cohort better representing a real-world study cohort (31). Finally, genetics is merely one contributing factor to prognosis and additional predictive biomarkers (such as those mentioned above), as well as clinical factors, would further improve the ability of clinicians to optimize ACT strategies. The limitations of our study are worthy of discussion. First, the SNPs used to construct the PHS did not exceed the genome-wide threshold for significance of 5×10−8 given the limited sample size of this GWAS. Nevertheless, it can be noted that despite the small sample sizes, our PHS was successfully validated associated with clinical outcomes in independent datasets. In the future, multi-center studies with larger sample sizes are still needed to optimize and validate the PHS. Second, the PHS showed a relatively higher prognostic value for Asian population (Guangzhou GSA GWAS cohort) than that for the European populations (PLCO cohort). Previous studies have shown that the prediction accuracy for polygenic scores would be decreased across different racial populations, especially when the discovery and target samples are from different races, with the population-specific allele frequencies and linkage disequilibrium (LD) patterns being two main reasons for this phenomenon (25,51-53). Therefore, in further research, it is needed to develop population-specific PHSs for other ancestral groups (e.g., European or African ancestry). Third, although the PLCO cohort was a subset of a prospective cohort, the other datasets were retrospective cohorts which could have introduced selection bias. While we have attempted to minimize the bias by adjusting for baseline clinical factors, other factors, that we failed to include in this study, could play a role, such as lymphovascular invasion (LVI), the spread of tumor through the airspaces (STAS), PD-L1 expression status and other driver gene’s expression. Considering the limitation mentioned, future studies with more rigorous design and comprehensive clinical data are needed to confirm the study findings. In conclusion, we constructed a GWAS-derived PHS that effectively predicted clinical outcomes and ACT benefits in NSCLC patients. The PHS might help clinicians identify which patients are expected to benefit from ACT. Further research is also needed to optimize the PHS and ascertain how PHS can be effectively applied to clinical practice. The article’s supplementary files as
  52 in total

1.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

2.  The NCBI dbGaP database of genotypes and phenotypes.

Authors:  Matthew D Mailman; Michael Feolo; Yumi Jin; Masato Kimura; Kimberly Tryka; Rinat Bagoutdinov; Luning Hao; Anne Kiang; Justin Paschall; Lon Phan; Natalia Popova; Stephanie Pretel; Lora Ziyabari; Moira Lee; Yu Shao; Zhen Y Wang; Karl Sirotkin; Minghong Ward; Michael Kholodov; Kerry Zbicz; Jeffrey Beck; Michael Kimelman; Sergey Shevelev; Don Preuss; Eugene Yaschenko; Alan Graeff; James Ostell; Stephen T Sherry
Journal:  Nat Genet       Date:  2007-10       Impact factor: 38.330

3.  Baseline chest radiograph for lung cancer detection in the randomized Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial.

Authors:  Martin M Oken; Pamela M Marcus; Ping Hu; Thomas M Beck; William Hocking; Paul A Kvale; Jill Cordes; Thomas L Riley; Stephen D Winslow; Steven Peace; David L Levin; Philip C Prorok; John K Gohagan
Journal:  J Natl Cancer Inst       Date:  2005-12-21       Impact factor: 13.506

4.  L1 penalized estimation in the Cox proportional hazards model.

Authors:  Jelle J Goeman
Journal:  Biom J       Date:  2010-02       Impact factor: 2.207

5.  What Are Polygenic Scores and Why Are They Important?

Authors:  Leo P Sugrue; Rahul S Desikan
Journal:  JAMA       Date:  2019-05-14       Impact factor: 56.272

6.  Predictive value of single-nucleotide polymorphism signature for recurrence in localised renal cell carcinoma: a retrospective analysis and multicentre validation study.

Authors:  Jin-Huan Wei; Zi-Hao Feng; Yun Cao; Hong-Wei Zhao; Zhen-Hua Chen; Bing Liao; Qing Wang; Hui Han; Jin Zhang; Yun-Ze Xu; Bo Li; Ji-Tao Wu; Gui-Mei Qu; Guo-Ping Wang; Cong Liu; Wei Xue; Qiang Liu; Jun Lu; Cai-Xia Li; Pei-Xing Li; Zhi-Ling Zhang; Hao-Hua Yao; Yi-Hui Pan; Wen-Fang Chen; Dan Xie; Lei Shi; Zhen-Li Gao; Yi-Ran Huang; Fang-Jian Zhou; Shao-Gang Wang; Zhi-Ping Liu; Wei Chen; Jun-Hang Luo
Journal:  Lancet Oncol       Date:  2019-03-14       Impact factor: 41.316

Review 7.  Clinical use of current polygenic risk scores may exacerbate health disparities.

Authors:  Alicia R Martin; Masahiro Kanai; Yoichiro Kamatani; Yukinori Okada; Benjamin M Neale; Mark J Daly
Journal:  Nat Genet       Date:  2019-03-29       Impact factor: 38.330

8.  CT derived radiomic score for predicting the added benefit of adjuvant chemotherapy following surgery in Stage I, II resectable Non-Small Cell Lung Cancer: a retrospective multi-cohort study for outcome prediction.

Authors:  Pranjal Vaidya; Kaustav Bera; Amit Gupta; Xiangxue Wang; Germán Corredor; Pingfu Fu; Niha Beig; Prateek Prasanna; Pradnya Patil; Priya Velu; Prabhakar Rajiah; Robert Gilkeson; Michael Feldman; Humberto Choi; Vamsidhar Velcheti; Anant Madabhushi
Journal:  Lancet Digit Health       Date:  2020-02-13

9.  Identification of risk loci and a polygenic risk score for lung cancer: a large-scale prospective cohort study in Chinese populations.

Authors:  Juncheng Dai; Jun Lv; Meng Zhu; Yuzhuo Wang; Na Qin; Hongxia Ma; Yong-Qiao He; Ruoxin Zhang; Wen Tan; Jingyi Fan; Tianpei Wang; Hong Zheng; Qi Sun; Lijuan Wang; Mingtao Huang; Zijun Ge; Canqing Yu; Yu Guo; Tong-Min Wang; Jie Wang; Lin Xu; Weibing Wu; Liang Chen; Zheng Bian; Robin Walters; Iona Y Millwood; Xi-Zhao Li; Xin Wang; Rayjean J Hung; David C Christiani; Haiquan Chen; Mengyun Wang; Cheng Wang; Yue Jiang; Kexin Chen; Zhengming Chen; Guangfu Jin; Tangchun Wu; Dongxin Lin; Zhibin Hu; Christopher I Amos; Chen Wu; Qingyi Wei; Wei-Hua Jia; Liming Li; Hongbing Shen
Journal:  Lancet Respir Med       Date:  2019-07-17       Impact factor: 30.700

10.  The E3 ubiquitin ligase Trim7 mediates c-Jun/AP-1 activation by Ras signalling.

Authors:  Atanu Chakraborty; Markus E Diefenbacher; Anastasia Mylona; Olivier Kassel; Axel Behrens
Journal:  Nat Commun       Date:  2015-04-08       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.