| Literature DB >> 24591304 |
Shicheng Guo1, Yu-Long Wang, Yi Li, Li Jin, Momiao Xiong, Qing-Hai Ji, Jiucun Wang.
Abstract
Recently, five thyroid cancer significantly associated genetic variants (rs965513, rs944289, rs116909374, rs966423, and rs2439302) have been discovered and validated in two independent GWAS and numerous case-control studies, which were conducted in different populations. We genotyped the above five single nucleotide polymorphisms (SNPs) in Han Chinese populations and performed thyroid cancer-risk predictions with nine machine learning methods. We found that four SNPs were significantly associated with thyroid cancer in Han Chinese population, while no polymorphism was observed for rs116909374. Small familial relative risks (1.02-1.05) and limited power to predict thyroid cancer (AUCs: 0.54-0.60) indicate limited clinical potential. Four significant SNPs have limited prediction ability for thyroid cancer.Entities:
Keywords: Genetic; SNPs; risk prediction; thyroid cancer
Mesh:
Year: 2014 PMID: 24591304 PMCID: PMC4101765 DOI: 10.1002/cam4.211
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Odds ratio for five SNPs from GWAS and case–control association study in previous study
| OR ( | ||||||||
|---|---|---|---|---|---|---|---|---|
| Study | Population | Method | rs965513 | rs944289 | rs116909374 | rs966423 | rs2439302 | Reference |
| 1 | Iceland | GWAS | 1.73 (7.5e-13) | 1.48 (8.6e-7) | – | – | – | |
| Iceland all | Combined | 1.77 (6.8e-20) | 1.44 (2.5e-8) | |||||
| USA | Case–control | 1.81 (1.2e-7) | 1.32 (1.2e-2) | |||||
| Spain | Case–control | 1.54 (6.5e-3) | 1.14 (4.3e-1) | |||||
| USA and Spain | Case–control | 1.72 (3.7e-9) | 1.26 (1.1e-2) | |||||
| All combined | Combined | 1.75 (1.7e-27) | 1.37 (2.0e-9) | |||||
| 2 | Chernobyl | GWAS | 1.76 (4.9e-9) | 1.13 (0.17) | – | – | – | |
| Combined | 1.65 (4.8e-12) | – | ||||||
| 3 | Japan | Case–control | 1.69 (1.27e-4) | 1.21 (0.0121) | – | – | – | |
| 4 | UK | Case–control | 1.98 (6.35e-34) | 1.33 (6.95e-7) | – | – | – | |
| 5 | Iceland | Case–control | 1.70 (3.0e-18) | 1.36 (4.2e-5) | 2.03 (5.4e-7) | 1.26(3.8e-4) | 1.41 (1.3e-6) | |
| Netherland | Case–control | – | 1.39 (0.013) | 1.95 (0.024) | 1.80(4.2e-6) | 1.24 (0.088) | ||
| USA | Case–control | – | 1.51 (0.0067) | 1.98 (0.018) | 1.36 (3.5e-3) | 1.33 (6.1e-3) | ||
| Spain | Case–control | – | 1.17 (0.31) | 3.37 (2.6e-3) | 1.20 (0.24) | 1.34 (0.073) | ||
| All combined | Case–control | – | 1.36 (4.9e-8) | 2.09 (4.6e-11) | 1.34 (1.3e-9) | 1.36 (2.0e-9) | ||
| 6 | USA | Case–control | 2.10 (<2e-16) | 1.28 (1.99e-3) | 1.97 (1.11e-3) | 1.35 (1.75e-4) | 1.51 (4.24e-7) | |
| Poland | Case–control | 1.78 (<2e-16) | 1.21 (3.55e-3) | 1.73 (6.27e-3) | 1.15 (3.13e-2) | 1.27 (2.20e-4) | ||
| 7 | China | Case–control | 1.53 | 1.51 | – | 1.32 | 1.40 | |
| 1.53 | 1.53 | 1.31 | 1.41 | |||||
GWAS, genome-wide association studies; OR, odds ratio.
ORs were calculated based on the multiplicative model. For the combined study populations, the OR value were estimated using the Mantel–Haenszel model.
ORs were calculated for the risk allele with using multiple logistic regression analyses.
Estimation of familial relative risk of thyroid cancer for the five SNPs in population of Han Chinese
| SNPs | Familial relative risk | Proportion (100%) | |
|---|---|---|---|
| rs965513 | 1.0189 (1.0186–1.0192) | 0.843 (0.806–0.880) | <2.2e-16 |
| s944289 | 1.0419 (1.0415–1.0422) | 1.969 (1.922–2.016) | <2.2e-16 |
| rs116909374 | N.A. | N.A. | N.A. |
| rs966423 | 1.0493 (1.0485–1.0500) | 2.191 (2.093–2.289) | <2.2e-16 |
| rs2439302 | 1.0207 (1.0205–1.0210) | 0.977 (0.939–1.015) | <2.2e-16 |
rs116909374 SNP was not detected in the Chinese population.
Model performance with methods based on five significant SNPs
| AUC | Sensitivity | Specificity | Accuracy | Range of 95% CI of AUC | |
|---|---|---|---|---|---|
| K-nearest neighbors | 0.5589 | 0.3861 | 0.6591 | 0.533 | [0.4293, 0.7101] |
| Logistic regression | 0.6044 | 0.4982 | 0.5648 | 0.5346 | [0.4433, 0.7368] |
| Naïve Bayes | 0.5996 | 0.3921 | 0.7206 | 0.5686 | [0.4571, 0.7469] |
| Random forest | 0.5743 | 0.3169 | 0.7558 | 0.5535 | [0.4405, 0.7233] |
| Support vector machine | 0.5494 | 0.2762 | 0.7775 | 0.547 | [0.4187, 0.7086] |
| Bayesian additive regression trees | 0.5906 | 0.4779 | 0.5571 | 0.5211 | [0.4385, 0.7211] |
| Boosting | 0.6024 | 0.4723 | 0.5544 | 0.5157 | [0.4584, 0.7287] |
| Recursive partitioning | 0.5871 | 0.4085 | 0.7218 | 0.5778 | [0.3926, 0.7048] |
| Fuzzy rule-based system | 0.5396 | 0.4931 | 0.5006 | 0.4968 | [0.4115, 0.6710] |
AUC, sensitivity, specificity, and accuracy were its mean value in 10-fold validations. Range of 95% CI of AUC represents the range of the 95% CI of AUC in 10-fold Cross-validation. SVM represents support vector machines and Kernel Methods.