| Literature DB >> 28903315 |
Yang Cheng1, Tao Jiang1, Meng Zhu1, Zhihua Li1, Jiahui Zhang1, Yuzhuo Wang1, Liguo Geng1, Jia Liu1, Wei Shen1, Cheng Wang1, Zhibin Hu1,2, Guangfu Jin1,2, Hongxia Ma1,2, Hongbing Shen1,2, Juncheng Dai1,2.
Abstract
In the past ten years, great successes have been accumulated by taking advantage of both candidate-gene studies and genome-wide association studies. However, limited studies were available to systematically evaluate the genetic effects for lung cancer risk with large-scale and different ethnic populations. We systematically reviewed relevant literatures and filtered out 241 important genetic variants identified in 124 articles. A two-stage case-control study within specific subgroups was performed to assess the effects [Training set: 2,331 cases vs. 3,077 controls (Chinese population); testing set: 1,937 cases vs. 1,984 controls (European population)]. Variable selection and model development were used LASSO penalized regression and genetic risk score (GRS) system. Further change in area under the receiver operator characteristic curves (AUC) made by the epidemiologic model with and without GRS was used to compare predictions. It kept 38 genetic variants in our study and the ratios of lung cancer risk for subjects in the upper quartile GRS was three times higher compared to that in the low quartile (odds ratio: 4.64, 95% CI: 3.87-5.56). In addition, we found that adding genetic predictors to smoking risk factor-only model improved lung cancer predictive value greatly: AUC, 0.610 versus 0.697 (P < 0.001). Similar performance was derived in European population and the combined two data sets. Our findings suggested that genetic predictors could improve the predictive ability of risk model for lung cancer and highlighted the application among different populations, indicating that the lung cancer risk assessment model will be a promising tool for high risk population screening and prediction.Entities:
Keywords: ethnic populations; genetic risk score; lung cancer; polymorphism; risk prediction
Year: 2016 PMID: 28903315 PMCID: PMC5589554 DOI: 10.18632/oncotarget.10403
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Association of 38SNPs stained by lasso with lung cancer risk in the training data set
| SNP | Position | Allelea | MAFb | OR (95% CI)c | Author | PMID | |||
|---|---|---|---|---|---|---|---|---|---|
| rs17728461 | chr22:30598552 | C/G | 0.17 | 0.70 | 1.37 (1.24–1.51) | 8.50E-10 | 0.0535 | Zhibin Hu | 21725308 |
| rs465498* | chr5:1325803 | A/G | 0.16 | 0.11 | 0.75 (0.67–0.84) | 6.83E-07 | 0.0523 | Zhibin Hu | 21725308 |
| rs753955 | chr13:24293859 | A/G | 0.29 | 0.76 | 1.23 (1.13–1.35) | 1.33E-06 | 0.0482 | Zhibin Hu | 21725308 |
| rs2895680 | chr5:146644115 | T/C | 0.28 | 0.62 | 1.21 (1.11–1.32) | 1.04E-05 | 0.0415 | Dong J | 22797725 |
| rs12296850* | chr12:100820085 | A/G | 0.25 | 0.14 | 0.82 (0.75–0.90) | 3.09E-05 | 0.0402 | Dong J | 23341777 |
| rs4488809 | chr3:189356261 | C/T | 0.47 | 1.00 | 1.21 (1.12–1.31) | 2.39E-06 | 0.0375 | Zhibin Hu | 21725308 |
| rs2736100 | chr5:1286516 | A/C | 0.41 | 0.30 | 1.20 (1.11–1.30) | 8.84E-06 | 0.0374 | Chen XF | 22370939 |
| rs9439519 | chr1:5364634 | T/C | 0.27 | 0.93 | 1.18 (1.08–1.29) | 2.18E-04 | 0.0361 | Dong J | 22797725 |
| rs383362 | chr16:79245820 | G/T | 0.15 | 0.62 | 1.17 (1.05–1.30) | 3.97E-03 | 0.0357 | Huang D | 22693020 |
| rs6573* | chr1:112255389 | C/A | 0.13 | 0.47 | 0.82 (0.73–0.93) | 1.27E-03 | 0.0346 | Zu Y | 23232114 |
| rs247008* | chr5:131447104 | G/A | 0.47 | 0.06 | 0.83 (0.77–0.90) | 6.27E-06 | 0.0343 | Dong J | 22797725 |
| rs4809957 | chr20:52771171 | G/A | 0.35 | 0.20 | 1.18 (1.09–1.28) | 7.11E-05 | 0.0341 | Dong J | 22797725 |
| rs4246215* | chr11:61564299 | G/T | 0.41 | 0.85 | 0.82 (0.76–0.89) | 2.04E-06 | 0.0335 | Ming Yang | 19618370 |
| rs1663689* | chr10:9025195 | T/C | 0.42 | 0.97 | 0.85 (0.79–0.92) | 8.03E-05 | 0.0313 | Dong J | 22797725 |
| rs7086803 | chr10:114498476 | G/A | 0.28 | 0.62 | 1.16 (1.06–1.26) | 1.06E-03 | 0.0297 | Lan Q | 23143601 |
| rs4083914 | chr6:153427706 | G/C | 0.14 | 0.19 | 1.16 (1.04–1.29) | 7.36E-03 | 0.0293 | Li H | 23228068 |
| rs2286455* | chr4:16020162 | C/T | 0.23 | 0.14 | 1.15 (1.05–1.26) | 3.70E-03 | 0.0284 | Mei Cheng | 23715500 |
| rs3764340 | chr16:78466437 | C/G | 0.07 | 1.00 | 1.20 (1.04–1.39) | 0.012 | 0.0283 | Huang D | 22693020 |
| rs36600 | chr22:30337586 | C/T | 0.09 | 0.82 | 1.39 (1.22–1.58) | 8.38E-07 | 0.0281 | Zhibin Hu | 21725308 |
| rs842461 | chr3:195535614 | T/G | 0.27 | 0.27 | 1.18 (1.09–1.29) | 1.19E-04 | 0.0253 | Zili Zhang | 24204934 |
| rs2285053 | chr16:55512377 | C/T | 0.24 | 0.25 | 0.90 (0.82–0.99) | 0.029 | 0.0247 | GA Patricia | 22455335 |
| rs2131877* | chr3:194858374 | A/G | 0.44 | 0.07 | 0.91 (0.84–0.99) | 0.025 | 0.0240 | Kyong-Ah Yoon | 20876614 |
| rs1801133 | chr1:11856378 | G/A | 0.44 | 0.36 | 1.16 (1.07–1.26) | 1.76E-04 | 0.0232 | Lian-Hua Cui | 21342495 |
| rs3866958* | chr17:19281006 | C/A | 0.15 | 0.44 | 0.87 (0.78–0.97) | 0.015 | 0.0225 | Fuman Qiu | 23804708 |
| rs1800625 | chr6:32152442 | A/G | 0.13 | 0.75 | 1.12 (1.00–1.26) | 0.046 | 0.0218 | Wang X | 23071492 |
| rs9387478* | chr6:117786180 | C/A | 0.5 | 0.86 | 0.91 (0.84–0.98) | 0.013 | 0.0216 | Lan Q | 23143601 |
| rs743572 | chr10:104597152 | G/A | 0.4 | 1.00 | 1.09 (1.01–1.18) | 0.026 | 0.0209 | Zhang Y | 22658813 |
| rs4291 | chr17:61554194 | A/T | 0.37 | 0.08 | 1.10 (1.02–1.20) | 0.015 | 0.0208 | Gao Min | 22538550 |
| rs10845498* | chr12:12394574 | A/G | 0.18 | 0.11 | 0.89 (0.80–0.98) | 0.023 | 0.0202 | Dehou Deng | 24843317 |
| rs7326277* | chr13:28876214 | T/C | 0.33 | 0.65 | 0.91 (0.84–0.99) | 0.038 | 0.0189 | Wang H | 24891316 |
| rs931127* | chr11:65405300 | G/A | 0.48 | 0.08 | 0.91 (0.84–0.99) | 0.028 | 0.0189 | Chenli Xie | 23661532 |
| rs2016520 | chr6:35378778 | T/C | 0.27 | 0.47 | 1.10 (1.01–1.20) | 0.037 | 0.0161 | Eric A. Engels | 17596594 |
| rs25406* | chr20:5099636 | G/A | 0.36 | 0.56 | 0.91 (0.84–0.99) | 0.025 | 0.0158 | J.A Doherty | 23565320 |
| rs2240688* | chr4:15970349 | T/G | 0.26 | 0.58 | 0.91 (0.83–1.00) | 0.040 | 0.0134 | Mei Cheng | 23715500 |
| rs34843907 | chr6:32610059 | G/T | 0.33 | 0.39 | 1.09 (1.00–1.18) | 0.041 | 0.0121 | Takashi Kohno | 20061363 |
| rs2070600* | chr6:32151443 | C/T | 0.23 | 0.29 | 0.91 (0.83–1.00) | 0.046 | 0.0109 | Wang X | 23071492 |
| rs189037 | chr11:108093833 | G/A | 0.43 | 0.07 | 1.08 (1.00–1.18) | 0.049 | 0.0080 | Jing Liu | 25541996 |
| rs3817963 | chr6:32368087 | T/C | 0.25 | 0.07 | 1.08 (0.99–1.18) | 0.078 | 0.0075 | Shiraishi K | 22797724 |
aAllele means the change from major allele to minor allele;
bMinor allele frequency among controls; HWE among controls;
cLogistic regression with adjustment for age, sex, pack year and PCA1;
dThe coefficient derived from LASSO by adjusting age, sex, smoking statue and PCA1,* means the β coefficient was transformed into the reverse correspond to the risk allele.
Cumulative effects of associated SNPs and environmental risk factors on the risk of lung cancer
| Case (%) | Control (%) | OR (95% CI)b | ||||
|---|---|---|---|---|---|---|
| 4268 | 5061 | |||||
| GRSa | ||||||
| 0 (< Q25) | 251 (10.77) | 775 (25.19) | 1 | |||
| 1 (Q25–Q50) | 430 (18.45) | 768 (24.96) | 1.80 (1.48–2.19) | 2.85E-09 | ||
| 2 (Q50–Q75) | 590 (25.31) | 761 (24.73) | 2.48 (2.05–2.99) | 3.14E-21 | ||
| 3 (≥ Q75) | 1060 (45.47) | 773 (25.12) | 4.64 (3.87–5.56) | 4.04E-62 | 7.52E-69 | |
| Smoke + GRS | ||||||
| 0 (< Q25) | 204 (8.75) | 773 (25.12) | 1 | |||
| 1 (Q25–Q50) | 337 (14.45) | 767 (24.93) | 1.78 (1.44–2.20) | 1.22E-07 | ||
| 2 (Q50–Q75) | 557 (23.90) | 768 (24.96) | 2.99 (2.43–3.66) | 1.28E-25 | ||
| 3 (≥ Q75) | 1233 (52.90) | 769 (24.99) | 7.01 (5.72–8.58) | 3.54E-79 | 5.41E-94 | |
| GRS | ||||||
| 0 (< Q25) | 363 (18.74) | 496 (25.00) | 1 | |||
| 1 (Q25–Q50) | 442 (22.82) | 494 (24.90) | 1.19 (0.98–1.46) | 7.77E-02 | ||
| 2 (Q50–Q75) | 531 (27.41) | 496 (25.00) | 1.50 (1.23–1.82) | 4.23E-05 | ||
| 3 (≥ Q75) | 601 (31.03) | 498 (25.10) | 1.66 (1.37–2.01) | 2.08E-07 | 1.68E-08 | |
| Smoke + GRS | ||||||
| 0 (< Q25) | 148 (7.64) | 496 (25.00) | 1 | |||
| 1 (Q25–Q50) | 388 (20.03) | 493 (24.85) | 2.67 (2.12–3.36) | 4.31E-17 | ||
| 2 (Q50–Q75) | 625 (32.27) | 497 (25.05) | 4.35 (3.48–5.43) | 1.50E-38 | ||
| 3 (≥ Q75) | 764 (39.44) | 493 (24.85) | 5.36 (4.30–6.68) | 2.36E-50 | 1.81E-53 | |
| GRS | ||||||
| 0 (< Q25) | 536 (12.56) | 1268 (25.05) | 1 | |||
| 1 (Q25–Q50) | 944 (22.12) | 1266 (25.01) | 1.70 (1.48–1.94) | 2.35E-14 | ||
| 2 (Q50–Q75) | 1271 (29.78) | 1264 (24.98) | 2.17 (1.90–2.47) | 4.62E-30 | ||
| 3 (≥ Q75) | 1517 (35.54) | 1263 (24.96) | 2.31 (2.02–2.64) | 1.51E-34 | 1.31E-34 | |
| Smoke + GRS | ||||||
| 0 (< Q25) | 390 (9.14) | 1256 (24.82) | 1 | |||
| 1 (Q25–Q50) | 728 (17.06) | 1272 (25.13) | 1.91 (1.65–2.22) | 5.43E-18 | ||
| 2 (Q50–Q75) | 1223 (28.66) | 1260 (24.90) | 3.39 (2.93–3.92) | 2.31E-61 | ||
| 3 (≥ Q75) | 1915 (44.87) | 1268 (25.05) | 5.38 (4.66–6.21) | 4.40E-116 | 4.27E-134 |
aGRS means the genetic risk score with adjustment for age, sex, smoking statue and PCA1;
bAdjust for age, sex and PCA1;
cFor the testing set (the EAGLE study), the smoking status has five missing data.
Area under curves (AUC) as a measure of predictive strength for risk-prediction models based on different indicators
| AUC | 95% CI | |||
|---|---|---|---|---|
| Epidemiologic model | 0.61 | 0.597–0.623 | 1 | |
| Genetic model | 0.653 | 0.639–0.668 | < 0.001 | |
| The extended model | 0.697 | 0.683–0.711 | < 0.001 | 0.483 |
| Epidemiologic model | 0.625 | 0.613–0.637 | 1 | |
| Genetic model | 0.558 | 0.540–0.576 | < 0.001 | |
| The extended model | 0.647 | 0.630–0.664 | 0.004 | 0.662 |
| Epidemiologic model | 0.625 | 0.615–0.634 | 1 | |
| Genetic model | 0.604 | 0.593–0.616 | < 0.001 | |
| The extended model | 0.658 | 0.647–0.669 | < 0.001 | 0.792 |
aCalculated by Hosmer-Lemeshow test.
Figure 1The area under curves (AUCs) for lung cancer risk predicting models calculated by risk score method in the two data sets (A) For Chinese GWAS; (B) For the EAGLE study; (C) For the combined data set