| Literature DB >> 26056042 |
Shin Yup Lee1,2, Mi Jeong Hong3,4, Hyo-Sung Jeon3, Yi Young Choi3, Jin Eun Choi3,5, Hyo-Gyoung Kang3,5, Deuk Kju Jung3,4, Chengcheng Jin3,4, Sook Kyung Do3,4, Seung Soo Yoo1,2, Yangki Seok6, Eung Bae Lee6, Kyung Min Shin7, Ji Yun Jeong8, Won Kee Lee9, Jaehee Lee1, Seung Ick Cha1, Chang Ho Kim1, Young Tae Kim10, Sanghoon Jheon10, Jae Yong Park1,2,3,5,4.
Abstract
We searched for potential regulatory single nucleotide polymorphisms (SNPs) in excision repair cross-complementing group 1 (ERCC1) using RegulomeDB, a database integrating information from the Encyclopedia of DNA Elements (ENCODE) project, and investigated their association with survival after surgery in non-small cell lung cancer (NSCLC). Among 364 SNPs found within ERCC1 region using RegulomeDB, four top priority SNPs (rs2298881C>A, rs1049739A>G, rs10415949A>G and rs6509214G>T) were selected for this study. The four SNPs were investigated in 316 patients. A replication study was performed (n = 579). Of the four SNPs analyzed in the discovery set, rs2298881C>A and rs6509214G>T were significantly associated with survival outcomes. The association was consistently observed only for rs2298881C>A in the validation cohort. In combined analysis, rs2298881C>A was significantly associated with worse overall survival and disease-free survival (P = 0.0002 and 0.02, respectively). A decreased reporter gene expression for rs2298881 A allele was observed compared with C allele by luciferase assay (P = 0.02). ERCC1 rs2298881C>A, an intronic SNP, is the first genetic polymorphism with functional evidence of regulating its expression, and the SNP is associated with prognosis of NSCLC. Our result supports the role of RegulomeDB as a comprehensive source of prioritized candidate SNPs for genetic association studies.Entities:
Keywords: ERCC1; lung cancer; polymorphisms; regulomeDB
Mesh:
Substances:
Year: 2015 PMID: 26056042 PMCID: PMC4695203 DOI: 10.18632/oncotarget.4083
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Univariate analysis for overall survival and disease-free survival by clinicopathological characteristics in the discovery cohorts and validation cohorts
| Variables | No. of cases | Discovery set | No. of cases | Validation set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Overall survival | Disease-free survival | Overall survival | Disease-free survival | |||||||||||
| No. of death (%) | 5Y-OSR (%) | Log-Rank | No. of event (%) | 5Y-DFSR (%) | Log-Rank | No. of death (%) | 5Y-OSR (%) | Log-Rank | No. of event (%) | 5Y-DFSR (%) | Log-Rank | |||
| 316 | 126 (39.9) | 53 | 160 (50.6) | 43 | 579 | 164 (28.3) | 67 | 296 (51.1) | 45 | |||||
| ≤63 | 161 | 60 (37.3) | 57 | 0.1 | 81 (50.3) | 44 | 0.59 | 306 | 70 (22.9) | 74 | 0.0001 | 146 (47.7) | 50 | 0.01 |
| > 63 | 155 | 66 (42.6) | 48 | 79 (51.0) | 41 | 273 | 94 (34.4) | 58 | 150 (55.0) | 38 | ||||
| Female | 74 | 19 (25.7) | 62 | 0.04 | 33 (44.6) | 45 | 0.63 | 164 | 39 (23.8) | 73 | 0.08 | 81 (49.4) | 46 | 0.54 |
| Male | 242 | 107 (44.2) | 50 | 127 (52.5) | 42 | 415 | 125 (30.1) | 65 | 215 (51.8) | 45 | ||||
| Never | 72 | 21 (29.2) | 63 | 0.09 | 34 (47.2) | 43 | 0.73 | 202 | 55 (27.2) | 69 | 0.37 | 109 (54.0) | 40 | 0.61 |
| Ever | 244 | 105 (43.0) | 50 | 126 (51.6) | 43 | 377 | 109 (28.9) | 66 | 187 (49.6) | 48 | ||||
| < 40 | 106 | 43 (40.6) | 51 | 0.37 | 75 (51.4) | 42 | 0.80 | 175 | 47 (26.9) | 68 | 0.18 | 85 (48.6) | 42 | 0.37 |
| ≥ 40 | 138 | 62 (44.9) | 49 | 51 (52.0) | 43 | 202 | 62 (30.7) | 64 | 102 (50.5) | 47 | ||||
| SCC | 172 | 67 (39.0) | 54 | 0.83 | 81 (47.1) | 46 | 0.34 | 241 | 67 (27.8) | 68 | 0.18 | 111 (46.1) | 51 | 0.16 |
| AC | 139 | 56 (40.3) | 51 | 76 (54.7) | 37 | 312 | 86 (27.6) | 68 | 169 (54.2) | 41 | ||||
| LCC | 5 | 3 (60.0) | 60 | 3 (60.0) | 60 | 26 | 11 (42.3) | 52 | 16 (61.5) | 42 | ||||
| I | 194 | 55 (28.4) | 61 | 2.0 × 10−6 | 72 (37.1) | 53 | 4.0 × 10−10 | 348 | 82 (23.6) | 71 | 0.0006 | 152 (43.7) | 51 | 5.0 × 10−7 |
| II+IIIA | 122 | 71 (58.2) | 40 | 88 (72.1) | 27 | 231 | 82 (35.5) | 61 | 144 (62.3) | 36 | ||||
| No | 67 | 37 (55.2) | 39 | 0.79 | 47 (70.1) | 29 | 0.56 | 110 | 40 (36.4) | 65 | 0.87 | 72 (65.5) | 37 | 0.75 |
| Yes | 55 | 34 (61.8) | 41 | 41 (74.6) | 26 | 121 | 42 (34.7) | 56 | 72 (59.5) | 34 | ||||
Abbreviations: SCC, Squamous cell carcinoma; AC, adenocarcinoma; LCC, Large cell carcinoma.
Row percentage.
Five year-overall survival rate (5Y-OSR) and 5 year-disease free survival rate (5Y-DFSR), proportion of survival derived from Kaplan-Meier analysis.
In ever-smokers.
Association of rs2298881C>A and rs6519214G>T and survival outcomes in the discovery and validation sets
| ID No. | Genotypes | Discovery cohort | Validation cohort | Combined cohort | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Overall survival | Disease-free survival | Overall survival | Disease-free survival | Overall survival | Disease-free survival | ||||||||
| HR (95%CI) | HR (95%CI) | HR (95%CI) | HR (95%CI) | HR (95%CI) | HR (95%CI) | ||||||||
| rs2298881 | CC | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | ||||||
| CA | 1.18 (0.75−1.86) | 0.47 | 1.07 (0.73−1.57) | 0.73 | 1.47 (0.97−2.23) | 0.07 | 1.26 (0.95−1.68) | 0.11 | 1.30 (0.96−1.75) | 0.09 | 1.19 (0.95−1.49) | 0.13 | |
| AA | 1.92 (1.15−3.20) | 0.01 | 1.59 (1.01−2.48) | 0.04 | 2.00 (1.26−3.16) | 0.003 | 1.31 (0.93−1.84) | 0.13 | 1.88 (1.34−2.63) | 0.000 | 1.38 (1.05−1.80) | 0.02 | |
| Dominant | 1.37 (0.90−2.09) | 0.14 | 1.20 (0.84−1.72) | 0.31 | 1.63 (1.10−2.42) | 0.02 | 1.28 (0.98−1.67) | 0.08 | 1.46 (1.10−1.94) | 0.01 | 1.24 (1.00−1.54) | 0.05 | |
| Recessive | 1.73 (1.14−2.63) | 0.01 | 1.52 (1.04−2.22) | 0.03 | 1.55 (1.09−2.19) | 0.01 | 1.12 (0.85−1.49) | 0.42 | 1.58 (1.21−2.06) | 0.001 | 1.23 (0.98−1.54) | 0.07 | |
| Additive | 1.39 (1.06−1.81) | 0.02 | 1.25 (0.99−1.58) | 0.06 | 1.41 (1.13−1.77) | 0.003 | 1.15 (0.97−1.36) | 0.11 | 1.37 (1.16−1.63) | 0.000 | 1.17 (1.03−1.34) | 0.02 | |
| rs6509214 | GG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | ||||||
| GT | 1.47 (0.97−2.24) | 0.07 | 1.63 (1.13−2.37) | 0.01 | 0.91 (0.60−1.38) | 0.65 | 0.97 (0.71−1.31) | 0.82 | 1.17 (0.87−1.57) | 0.30 | 1.21 (0.95−1.52) | 0.12 | |
| TT | 1.67 (0.94−2.97) | 0.08 | 1.61 (0.97−2.67) | 0.06 | 0.87 (0.51−1.49) | 0.61 | 0.75 (0.50−1.12) | 0.16 | 1.10 (0.75−1.61) | 0.64 | 0.99 (0.73−1.35) | 0.95 | |
| Dominant | 1.51 (1.01−2.26) | 0.05 | 1.63 (1.14−2.33) | 0.01 | 0.90 (0.60−1.33) | 0.59 | 0.90 (0.67−1.20) | 0.45 | 1.15 (0.87−1.52) | 0.33 | 1.14 (0.91−1.43) | 0.24 | |
| Recessive | 1.31 (0.80−2.16) | 0.29 | 1.18 (0.76−1.83) | 0.45 | 0.93 (0.58−1.48) | 0.75 | 0.77 (0.54−1.09) | 0.14 | 1.00 (0.71−1.40) | 0.99 | 0.89 (0.68−1.16) | 0.38 | |
| Additive | 1.32 (1.01−1.73) | 0.05 | 1.32 (1.04−1.66) | 0.02 | 0.93 (0.72−1.21) | 0.59 | 0.88 (0.73−1.06) | 0.19 | 1.06 (0.89−1.28) | 0.52 | 1.02 (0.88−1.18) | 0.78 | |
Abbreviations: HR, hazard ratio; CI, confidence interval.
Information about polymorphisms and IDs were obtained from NCBI database (http://www.ncbi.nlm.nih.gov/SNP).
HRs, 95% CIs and their corresponding P-values were calculated using multivariate Cox proportional hazard models, adjusted for age, gender, smoking status, tumor histology, pathologic stage, adjuvant therapy.
The numbers of patients in discovery, validation, and combined cohort for rs2298881 were 316, 579, and 895, respectively.
The numbers of patients in discovery, validation, and combined cohort for rs6509214 were 316, 412, and 728, respectively, due to lack of available samples for validation.
Figure 1Kaplan-Meier plot of overall and disease free survival curves according to ERCC1 rs2298881C > A genotype in discovery cohort
A. replication cohort B. and combined cohort C. P values in the multivariate Cox proportional hazard model.
Figure 2Functional analysis of the ERCC1 rs2298881C>A
A. Schematic representation of the constructs that were used for the reporter gene assays. Promoters are marked by white blocks and the fragments including rs2298881C> A site by black blocks, and arrow indicates the direction of transcription. The first base of translation start site is denoted as +1. ERCC1 promoter was amplified from human genomic DNA and cloned into the pGL3 basic vector (pGL3-ERCC1pro). DNA fragments containing the SNP site were cloned into other multi cloning site (pGL3-ERCC1pro_C and pGL3-ERCC1pro_A). B. Luciferase activity according to ERCC1 rs2298881C>A. H1299 cells were transfected with pGL3-ERCC1pro, pGL3-ERCC1pro_C and pGL3-ERCC1pro_A constructs, respectively. Each bars represent mean ± S.E.M. of firefly luciferase activity normalized to Renilla luciferase activity. Experiments were performed in triplicate. P value, a Student's t-test. luc, luciferase.