Literature DB >> 24591304

Significant SNPs have limited prediction ability for thyroid cancer.

Shicheng Guo1, Yu-Long Wang, Yi Li, Li Jin, Momiao Xiong, Qing-Hai Ji, Jiucun Wang.   

Abstract

Recently, five thyroid cancer significantly associated genetic variants (rs965513, rs944289, rs116909374, rs966423, and rs2439302) have been discovered and validated in two independent GWAS and numerous case-control studies, which were conducted in different populations. We genotyped the above five single nucleotide polymorphisms (SNPs) in Han Chinese populations and performed thyroid cancer-risk predictions with nine machine learning methods. We found that four SNPs were significantly associated with thyroid cancer in Han Chinese population, while no polymorphism was observed for rs116909374. Small familial relative risks (1.02-1.05) and limited power to predict thyroid cancer (AUCs: 0.54-0.60) indicate limited clinical potential. Four significant SNPs have limited prediction ability for thyroid cancer.
© 2014 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

Entities:  

Keywords:  Genetic; SNPs; risk prediction; thyroid cancer

Mesh:

Year:  2014        PMID: 24591304      PMCID: PMC4101765          DOI: 10.1002/cam4.211

Source DB:  PubMed          Journal:  Cancer Med        ISSN: 2045-7634            Impact factor:   4.452


Introduction

Thyroid cancer is the fifth most common type of female cancer and its incidence is increasing. It has been considered as one of highest familial risk carcinomas among all kinds of cancers 1,2. Most common diseases are caused by multiple genetic rather than few loci. In the last 2 years, two independent genome-wide association studies (GWAS) have been conducted to identify single nucleotide polymorphisms (SNPs) associated with thyroid cancer risk. Five SNPs (rs965513, rs944289, rs116909374, rs966423, and rs2439302) which were highly significantly associated with papillary thyroid carcinoma (PTC) were discovered by genome-wide association study. In addition, these five SNP were validated by continued case–control studies in more than three different populations (Han Chinese, Ohio, Poland, etc. Table 1).
Table 1

Odds ratio for five SNPs from GWAS and case–control association study in previous study

OR (P-value)1,2

StudyPopulationMethodrs965513rs944289rs116909374rs966423rs2439302Reference
11IcelandGWAS1.73 (7.5e-13)1.48 (8.6e-7)12
Iceland allCombined1.77 (6.8e-20)1.44 (2.5e-8)
USACase–control1.81 (1.2e-7)1.32 (1.2e-2)
SpainCase–control1.54 (6.5e-3)1.14 (4.3e-1)
USA and SpainCase–control1.72 (3.7e-9)1.26 (1.1e-2)
All combinedCombined1.75 (1.7e-27)1.37 (2.0e-9)
21ChernobylGWAS1.76 (4.9e-9)1.13 (0.17)13
Combined1.65 (4.8e-12)
32JapanCase–control1.69 (1.27e-4)1.21 (0.0121)14
42UKCase–control1.98 (6.35e-34)1.33 (6.95e-7)15
51IcelandCase–control1.70 (3.0e-18)1.36 (4.2e-5)2.03 (5.4e-7)1.26(3.8e-4)1.41 (1.3e-6)16
NetherlandCase–control1.39 (0.013)1.95 (0.024)1.80(4.2e-6)1.24 (0.088)
USACase–control1.51 (0.0067)1.98 (0.018)1.36 (3.5e-3)1.33 (6.1e-3)
SpainCase–control1.17 (0.31)3.37 (2.6e-3)1.20 (0.24)1.34 (0.073)
All combinedCase–control1.36 (4.9e-8)2.09 (4.6e-11)1.34 (1.3e-9)1.36 (2.0e-9)
61USACase–control2.10 (<2e-16)1.28 (1.99e-3)1.97 (1.11e-3)1.35 (1.75e-4)1.51 (4.24e-7)11
PolandCase–control1.78 (<2e-16)1.21 (3.55e-3)1.73 (6.27e-3)1.15 (3.13e-2)1.27 (2.20e-4)
71,2ChinaCase–control1.531 (7.1e-4)1.511 (2.8e-9)1.321 (0.006)1.401 (2.1e-4)3
1.532 (1.4e-4)1.532 (2.0e-10)1.312 (0.001)1.412 (2.7e-5)

GWAS, genome-wide association studies; OR, odds ratio.

ORs were calculated based on the multiplicative model. For the combined study populations, the OR value were estimated using the Mantel–Haenszel model.

ORs were calculated for the risk allele with using multiple logistic regression analyses.

Odds ratio for five SNPs from GWAS and case–control association study in previous study GWAS, genome-wide association studies; OR, odds ratio. ORs were calculated based on the multiplicative model. For the combined study populations, the OR value were estimated using the Mantel–Haenszel model. ORs were calculated for the risk allele with using multiple logistic regression analyses. To examine the prediction ability based on variants with highly significant associations, we use all five SNPs to predict thyroid cancer by nine classification methods (K-nearest neighbors, logistic regression, naïve Bayes, random forest, support vector machine, Bayesian additive regression trees (BART), recursive partitioning, fuzzy rule-based system, boosting). Contradictory to our intuitiveness, we found that although all these five SNPs were significantly associated with thyroid cancer, the precision of their prediction for thyroid cancer was very low.

Methods

The five SNPs were genotyped in 845 PTC and 1005 controls in Han Chinese population using the SNaPshot multiplex single-nucleotide extension system. PTC patients who were treated in the Department of Head and Neck Surgery, Fudan University Shanghai Cancer Center, Shanghai, China from January to December 2010 were enrolled in this study. All patients were ethnically Chinese Han and came from Eastern China. A total of 1005 cancer-free unrelated individuals were recruited from the Taizhou Longitudinal Study (TZL). The SNPs were genotyped with the SNaPshot multiplex single-nucleotide extension system. Details of SNPs (Table S1) and primers were listed in our previous article 3. The relative risk to daughters of an affected thyroid cancer individual attributable to a given SNP is calculated by the formula: , where p is the frequency of the risk allele, q = 1 − p, r1 and r2 are the relative risks (estimated by odds ratio [ORs]) for heterozygotes relative to common homozygotes and rare homozygotes relative to common homozygotes in the population, respectively 4,5. Assuming a multiplicative interaction, the proportion of the familial risk attributable to the SNP is calculated by log(λ*)/log(λo), where λo is the overall familial relative risk (FRR), estimated to be 8.48 for thyroid cancer 1. Gender- and age-matched cases and controls were constructed by 1000 times resampling technology. Nine machine learning methods were used to make prediction for PTC from health individuals, including K-nearest neighbors 6, logistic regression, naïve Bayes 7, random forest 8, support vector machine 7, BART 9, boosting, recursive partitioning, and fuzzy rule-based system 10. The parameters in the models were optimally selected. Classification accuracy, sensitivity, specificity, and AUC were used to evaluate the performance of the methods. They were calculated by 10-fold cross-validation.

Results

Marginal FRR of the significant SNPs

As the previous studies showed that the five SNPs with large OR were significantly associated with thyroid cancer in various populations (Table 1). Our previous data also showed that SNPs were significantly associated with thyroid cancer in Chinese population (the seventh study of Table 1). In present study, we estimated the FRR for five significantly associated SNPs in Chinese population. We found that the FRRs were low, ranging from 1.02 to 1.05. These five SNPs counted only 5.98% of the overall familial risk (Table 2) which was very closed to that of polish population (about 6%) 11. Our finding suggested that majority of the heritability was undiscovered.
Table 2

Estimation of familial relative risk of thyroid cancer for the five SNPs in population of Han Chinese

SNPsFamilial relative riskProportion (100%)P-value
rs9655131.0189 (1.0186–1.0192)0.843 (0.806–0.880)<2.2e-16
s9442891.0419 (1.0415–1.0422)1.969 (1.922–2.016)<2.2e-16
rs116909374N.A.1N.A.1N.A.1
rs9664231.0493 (1.0485–1.0500)2.191 (2.093–2.289)<2.2e-16
rs24393021.0207 (1.0205–1.0210)0.977 (0.939–1.015)<2.2e-16

rs116909374 SNP was not detected in the Chinese population.

Estimation of familial relative risk of thyroid cancer for the five SNPs in population of Han Chinese rs116909374 SNP was not detected in the Chinese population.

Genetic risk prediction for thyroid cancer based on five SNPs

The five significant SNPs were used to predict risk of thyroid cancer by nine classification methods. The results were summarized in Table 3. The prediction accuracies ranged from 0.52 to 0.57 in the nine prediction methods, while receiver operating characteristics (ROCs) ranged from 0.54 to 0.60. The sensitivity of the prediction (0.28–0.48) was much less than specificity (0.56–0.76), which suggested the clinical application value might be limited (Table 3). In addition, the AUC of classification based on five SNPs and gender, and based on five SNPs, gender, and age ranged from 0.49 to 0.58, and from 0.50 to 0.59, respectively. This indicated that including gender and age information will not improve prediction (Tables S2 and S3, Fig. S1).
Table 3

Model performance with methods based on five significant SNPs

AUCSensitivitySpecificityAccuracyRange of 95% CI of AUC
K-nearest neighbors0.55890.38610.65910.533[0.4293, 0.7101]
Logistic regression0.60440.49820.56480.5346[0.4433, 0.7368]
Naïve Bayes0.59960.39210.72060.5686[0.4571, 0.7469]
Random forest0.57430.31690.75580.5535[0.4405, 0.7233]
Support vector machine0.54940.27620.77750.547[0.4187, 0.7086]
Bayesian additive regression trees0.59060.47790.55710.5211[0.4385, 0.7211]
Boosting0.60240.47230.55440.5157[0.4584, 0.7287]
Recursive partitioning0.58710.40850.72180.5778[0.3926, 0.7048]
Fuzzy rule-based system0.53960.49310.50060.4968[0.4115, 0.6710]

AUC, sensitivity, specificity, and accuracy were its mean value in 10-fold validations. Range of 95% CI of AUC represents the range of the 95% CI of AUC in 10-fold Cross-validation. SVM represents support vector machines and Kernel Methods.

Model performance with methods based on five significant SNPs AUC, sensitivity, specificity, and accuracy were its mean value in 10-fold validations. Range of 95% CI of AUC represents the range of the 95% CI of AUC in 10-fold Cross-validation. SVM represents support vector machines and Kernel Methods.

Conclusion

In the present study, we estimated the FRR and evaluated thyroid cancer prediction accuracy of the five SNPs that showed significant association with thyroid cancer in several association studies. The results showed that although the OR of each SNPs was large, the FRR of each SNPs was very marginal. By 10-fold cross-validation, we found that the prediction accuracy of five SNPs was low across all nine classification methods. Particularly, the sensitivity of five SNPs was very low. It suggested that the clinical application of five SNPs might be limited. Our results strongly demonstrate that complex diseases are caused by a large number of SNPs, environments, and their interactions. GWAS addressing common variants have come to its limit and missing heritability for most complex disorders is very high. Only about 5–10% heritability was found based on common disease common variant (CDCV) model. To improve prediction of genetic variation for complex diseases, we need to incorporate more common and rare SNPs, copy number variations (CNVs), and nongenetic susceptibility factors, such as iodine intake, exposure to radiation in the classification analysis. Novel statistical methods for variable screening should be developed to optimally select SNPs and CNVs across the genome for disease risk prediction.
  11 in total

1.  Modification of cancer risks in offspring by sibling and parental cancers from 2,112,616 nuclear families.

Authors:  C Dong; K Hemminki
Journal:  Int J Cancer       Date:  2001-04-01       Impact factor: 7.396

2.  Familial relative risk of colorectal cancer: a population-based study.

Authors:  N Andrieu; G Launoy; R Guillois; C Ory-Paoletti; M Gignoux
Journal:  Eur J Cancer       Date:  2003-09       Impact factor: 9.162

3.  Discovery of common variants associated with low TSH levels and thyroid cancer risk.

Authors:  Julius Gudmundsson; Patrick Sulem; Daniel F Gudbjartsson; Jon G Jonasson; Gisli Masson; Huiling He; Aslaug Jonasdottir; Asgeir Sigurdsson; Simon N Stacey; Hrefna Johannsdottir; Hafdis Th Helgadottir; Wei Li; Rebecca Nagy; Matthew D Ringel; Richard T Kloos; Marieke C H de Visser; Theo S Plantinga; Martin den Heijer; Esperanza Aguillo; Angeles Panadero; Enrique Prats; Almudena Garcia-Castaño; Ana De Juan; Fernando Rivera; G Bragi Walters; Hjordis Bjarnason; Laufey Tryggvadottir; Gudmundur I Eyjolfsson; Unnur S Bjornsdottir; Hilma Holm; Isleifur Olafsson; Kristleifur Kristjansson; Hoskuldur Kristvinsson; Olafur T Magnusson; Gudmar Thorleifsson; Jeffrey R Gulcher; Augustine Kong; Lambertus A L M Kiemeney; Thorvaldur Jonsson; Hannes Hjartarson; Jose I Mayordomo; Romana T Netea-Maier; Albert de la Chapelle; Jon Hrafnkelsson; Unnur Thorsteinsdottir; Thorunn Rafnar; Kari Stefansson
Journal:  Nat Genet       Date:  2012-01-22       Impact factor: 38.330

4.  The FOXE1 locus is a major genetic determinant for radiation-related thyroid carcinoma in Chernobyl.

Authors:  Meiko Takahashi; Vladimir A Saenko; Tatiana I Rogounovitch; Takahisa Kawaguchi; Valentina M Drozd; Hisako Takigawa-Imamura; Natallia M Akulevich; Chanavee Ratanajaraya; Norisato Mitsutake; Noboru Takamura; Larisa I Danilova; Maxim L Lushchik; Yuri E Demidchik; Simon Heath; Ryo Yamada; Mark Lathrop; Fumihiko Matsuda; Shunichi Yamashita
Journal:  Hum Mol Genet       Date:  2010-03-29       Impact factor: 6.150

5.  The FOXE1 and NKX2-1 loci are associated with susceptibility to papillary thyroid carcinoma in the Japanese population.

Authors:  Michiko Matsuse; Meiko Takahashi; Norisato Mitsutake; Eijun Nishihara; Mitsuyoshi Hirokawa; Takahisa Kawaguchi; Tatiana Rogounovitch; Vladimir Saenko; Andrey Bychkov; Keiji Suzuki; Keitaro Matsuo; Kazuo Tajima; Akira Miyauchi; Ryo Yamada; Fumihiko Matsuda; Shunichi Yamashita
Journal:  J Med Genet       Date:  2011-07-05       Impact factor: 6.318

6.  Cumulative risk impact of five genetic variants associated with papillary thyroid carcinoma.

Authors:  Sandya Liyanarachchi; Anna Wojcicka; Wei Li; Malgorzata Czetwertynska; Elzbieta Stachlewska; Rebecca Nagy; Kevin Hoag; Bernard Wen; Rafal Ploski; Matthew D Ringel; Izabella Kozłowicz-Gudzinska; Wojciech Gierlikowski; Krystian Jazdzewski; Huiling He; Albert de la Chapelle
Journal:  Thyroid       Date:  2013-08-29       Impact factor: 6.568

7.  Systematic population-based assessment of cancer risk in first-degree relatives of cancer probands.

Authors:  D E Goldgar; D F Easton; L A Cannon-Albright; M H Skolnick
Journal:  J Natl Cancer Inst       Date:  1994-11-02       Impact factor: 13.506

8.  Confirmation of papillary thyroid cancer susceptibility loci identified by genome-wide association studies of chromosomes 14q13, 9q22, 2q35 and 8p12 in a Chinese population.

Authors:  Yu-Long Wang; Shou-Hao Feng; Shi-Cheng Guo; Wen-Jun Wei; Duan-Shu Li; Yu Wang; Xiaofeng Wang; Zhuo-Ying Wang; Yan-Yun Ma; Li Jin; Qing-Hai Ji; Jiu-Cun Wang
Journal:  J Med Genet       Date:  2013-07-11       Impact factor: 6.318

9.  Common variants on 9q22.33 and 14q13.3 predispose to thyroid cancer in European populations.

Authors:  Julius Gudmundsson; Patrick Sulem; Daniel F Gudbjartsson; Jon G Jonasson; Asgeir Sigurdsson; Jon T Bergthorsson; Huiling He; Thorarinn Blondal; Frank Geller; Margret Jakobsdottir; Droplaug N Magnusdottir; Sigurborg Matthiasdottir; Simon N Stacey; Oskar B Skarphedinsson; Hafdis Helgadottir; Wei Li; Rebecca Nagy; Esperanza Aguillo; Eduardo Faure; Enrique Prats; Berta Saez; Mariano Martinez; Gudmundur I Eyjolfsson; Unnur S Bjornsdottir; Hilma Holm; Kristleifur Kristjansson; Michael L Frigge; Hoskuldur Kristvinsson; Jeffrey R Gulcher; Thorvaldur Jonsson; Thorunn Rafnar; Hannes Hjartarsson; Jose I Mayordomo; Albert de la Chapelle; Jon Hrafnkelsson; Unnur Thorsteinsdottir; Augustine Kong; Kari Stefansson
Journal:  Nat Genet       Date:  2009-02-06       Impact factor: 38.330

10.  Thyroid cancer susceptibility polymorphisms: confirmation of loci on chromosomes 9q22 and 14q13, validation of a recessive 8q24 locus and failure to replicate a locus on 5q24.

Authors:  Angela M Jones; Kimberley M Howarth; Lynn Martin; Maggie Gorman; Radu Mihai; Laura Moss; Adam Auton; Catherine Lemon; Hisham Mehanna; Hosahalli Mohan; Susan E M Clarke; Jonathan Wadsley; Elena Macias; Andrew Coatesworth; Matthew Beasley; Tom Roques; Craig Martin; Paul Ryan; Georgina Gerrard; Danielle Power; Caroline Bremmer; Ian Tomlinson; Luis G Carvajal-Carmona
Journal:  J Med Genet       Date:  2012-01-25       Impact factor: 6.318

View more
  9 in total

1.  Genetic variants in miR-196a2 and miR-499 are associated with susceptibility to esophageal squamous cell carcinoma in Chinese Han population.

Authors:  Fangyuan Shen; Jiejun Chen; Shicheng Guo; Yinghui Zhou; Yabiao Zheng; Yajun Yang; Junjie Zhang; Xiaofeng Wang; Chenji Wang; Dunmei Zhao; Mengyun Wang; Meiling Zhu; Lixia Fan; Jiaqing Xiang; Yong Xia; Qingyi Wei; Li Jin; Jiucun Wang; Minghua Wang
Journal:  Tumour Biol       Date:  2015-10-30

Review 2.  Genetic predisposition for nonmedullary thyroid cancer.

Authors:  Rebecca Nagy; Matthew D Ringel
Journal:  Horm Cancer       Date:  2014-10-22       Impact factor: 3.869

3.  Identification and validation of the methylation biomarkers of non-small cell lung cancer (NSCLC).

Authors:  Shicheng Guo; Fengyang Yan; Jibin Xu; Yang Bao; Ji Zhu; Xiaotian Wang; Junjie Wu; Yi Li; Weilin Pu; Yan Liu; Zhengwen Jiang; Yanyun Ma; Xiaofeng Chen; Momiao Xiong; Li Jin; Jiucun Wang
Journal:  Clin Epigenetics       Date:  2015-01-22       Impact factor: 6.551

Review 4.  Genetic Polymorphism Predisposing to Differentiated Thyroid Cancer: A Review of Major Findings of the Genome-Wide Association Studies.

Authors:  Vladimir A Saenko; Tatiana I Rogounovitch
Journal:  Endocrinol Metab (Seoul)       Date:  2018-06

5.  Association of rs944289, rs965513, and rs1443434 in TITF1/TITF2 with Risks of Papillary Thyroid Carcinoma and with Nodular Goiter in Northern Chinese Han Populations.

Authors:  Xin Zhang; Yulu Gu; Yong Li; Heran Cui; Xiaoli Liu; Hui Sun; Qiong Yu; Yaqin Yu; Yawen Liu; Siyan Zhan; Yi Cheng
Journal:  Int J Endocrinol       Date:  2020-02-11       Impact factor: 3.257

6.  Investigating the Association Between rs2439302 Polymorphism and Thyroid Cancer: A Systematic Review and Meta-Analysis.

Authors:  Yawen Guo; Wanchen Zhang; Ru He; Chuanming Zheng; Xuefeng Liu; Minghua Ge; Jiajie Xu
Journal:  Front Surg       Date:  2022-04-26

Review 7.  Learning Causal Effects From Observational Data in Healthcare: A Review and Summary.

Authors:  Jingpu Shi; Beau Norgeot
Journal:  Front Med (Lausanne)       Date:  2022-07-07

8.  Replication and Meta-Analysis of Common Gene Mutations in TTF1 and TTF2 with Papillary Thyroid Cancer.

Authors:  Yan Gao; Fei Chen; Shuli Niu; Shiyu Lin; Suping Li
Journal:  Medicine (Baltimore)       Date:  2015-09       Impact factor: 1.817

9.  Abnormal DNA methylation may contribute to the progression of osteosarcoma.

Authors:  Xiao-Gang Chen; Liang Ma; Jia-Xin Xu
Journal:  Mol Med Rep       Date:  2017-10-25       Impact factor: 2.952

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.