Literature DB >> 33193711

Assessing the Relationship Between Leukocyte Telomere Length and Cancer Risk/Mortality in UK Biobank and TCGA Datasets With the Genetic Risk Score and Mendelian Randomization Approaches.

Yixin Gao1, Yongyue Wei2, Xiang Zhou3,4, Shuiping Huang1,5, Huashuo Zhao1,5, Ping Zeng1,5.   

Abstract

BACKGROUND: Telomere length is an important indicator of tumor progression and survival for cancer patients. Previous work investigated the associations between genetically predicted telomere length and cancers; however, the types of cancers investigated in those studies were relatively limited or the telomere length-associated genetic variants employed often came from genome-wide association studies (GWASs) with small sample sizes.
METHODS: We constructed the genetic risk score (GRS) for leukocyte telomere length based on 17 associated genetic variants available from the largest telomere length GWAS up to 78,592 individuals. Then, a comprehensive analysis was undertaken to evaluate the association between the constructed GRS and the risk or mortality of a wide range of cancers [i.e., 37 cancers in the UK Biobank and 33 cancers in The Cancer Genome Atlas (TCGA)]. We further applied the two-sample Mendelian randomization (MR) to estimate the causal effect of leukocyte telomere length on UK Biobank cancers via summary statistics.
RESULTS: In the UK Biobank dataset, we found that the GRS of leukocyte telomere length was associated with a decreased risk of nine types of cancer (i.e., significant association with multiple myeloma, chronic lymphocytic leukemia, kidney/renal cell cancer, bladder cancer, malignant melanoma, basal cell carcinoma, and prostate cancer and suggestive association with sarcoma/fibrosarcoma and Hodgkin's lymphoma/Hodgkin's disease). In addition, we found that the GRS was suggestively associated with an increased risk of leukemia. In the TCGA dataset, we observed suggestive evidence that the GRS was associated with a high death hazard of rectum adenocarcinoma (READ), sarcoma (SARC), and skin cutaneous melanoma (SKCM), while the GRS was associated with a low death hazard of kidney renal papillary cell carcinoma (KIRP). The results of MR further supported the association for leukocyte telomere length on the risk of malignant melanoma, Hodgkin's lymphoma/Hodgkin's disease, chronic lymphocytic leukemia and multiple myeloma.
CONCLUSION: Our study reveals that telomere played diverse roles in different types of cancers. However, further validations in large-scale prospective studies and deeper investigations of the biologic mechanisms are warranted.
Copyright © 2020 Gao, Wei, Zhou, Huang, Zhao and Zeng.

Entities:  

Keywords:  Mendelian randomization; TCGA; UK Biobank; cancer; genetic risk score; leukocyte telomere length

Year:  2020        PMID: 33193711      PMCID: PMC7644901          DOI: 10.3389/fgene.2020.583106

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Telomere is a special structure with a 6-bp TTAGGG repeat sequence and plays an important role in genomic stability by protecting DNA against damage and fusion 0 (de Lange, 2005). Due to the inability of DNA polymerase to fully extend the 3′ end of DNA strand, the telomere becomes progressively shorter during each round of cell division. The length of telomere is thus a biomarker of cellular and overall biological aging. Once a critically short telomere length is reached, the cell would be triggered to enter senescence, which would ultimately lead to cell growth arrest or apoptosis (Shay and Wright, 2019). In stem and progenitor cells, the length of telomere is maintained by enzyme telomerase (Hackett and Greider, 2002; Shawi and Autexier, 2008). It is shown that enzyme telomerase is activated in almost all human tumors; such an activation can result in the continuous division of cancer cells and is the key component of the tumorigenic phenotype of human cancer cells (Stewart and Weinberg, 2006; O’Sullivan and Karlseder, 2010). Prior studies have demonstrated that telomere length is associated with a lot of age-related diseases and disorders (e.g., cancers and neurodegenerative disorders) (Zhu et al., 2011) and that a shorter telomere length in tumor tissues is an important indicator of tumor progression and survival for cancer patients (Ma et al., 2011; Xu et al., 2016). However, not all studies reported consistent findings (Supplementary Table S1), partly reflecting the complicated function of telomere on human cancers. The diversity in cancer types, ethnicities, study designs, measurement methods, and selected tissues for telomere length in previous work further complicates the observed association. Given the severe disease burden of cancers worldwide (Siegel et al., 2019), understanding the association between telomere length and cancers can provide valuable insights into the development of cancers and has the potential to improve the prevention and treatment strategies for cancers. On the other hand, in the past few years, a number of single nucleotide polymorphisms (SNPs) have been identified to be associated with leukocyte telomere length through genome-wide association studies (GWASs) (Levy et al., 2010; Gu et al., 2011; Mangino et al., 2012; Codd et al., 2013; Pooley et al., 2013; Dorajoo et al., 2019). Relying on associated genetic variants, many studies have been undertaken to investigate the association between genetically predicted leukocyte telomere length and cancers. However, the types of cancers investigated in previous studies (Zhang et al., 2015; Li et al., 2020) were relatively limited. In addition, the telomere length-associated SNPs employed in previous studies (Zhang et al., 2015; Rode et al., 2016; Haycock et al., 2017) often came from GWASs with small sample sizes (Levy et al., 2010; Codd et al., 2013). Recently, a large-scale GWAS of leukocyte telomere length was conducted with the largest sample size to date (up to ∼80,000) (Li et al., 2020), which allows us to choose more appropriate SNPs to study the multilocus genetic profile of leukocyte telomere length via the genetic risk score (GRS) approach (Ripatti et al., 2010; Dudbridge et al., 2013; Eusden et al., 2015; Guo et al., 2016; Goldman, 2017; Tosto et al., 2017; Bogdan et al., 2018; De La Vega and Bustamante, 2018; Zeng et al., 2019b). Briefly, GRS is an efficient and powerful genetic method to explore the association between an exposure and complex diseases by integrating multiple genetic variants with weak effects, and it dramatically enhances the predictability of complex diseases through genetic polymorphisms (Belsky et al., 2013; Khera et al., 2018; Duncan et al., 2019; Khera et al., 2019). Moreover, several cancer-relevant cohorts, such as The UK Biobank (Bycroft et al., 2018) and The Cancer Genome Atlas (TCGA) (Hoadley et al., 2018), have collected a variety of cancer-related omics and clinical information, which makes it feasible to systematically investigate a large number of types of cancers. Based on these valuable data resources, in the present work, we evaluated the association between leukocyte telomere length and 37 cancers from the UK Biobank cohort as well as 33 cancers from the TCGA dataset using the genetic risk score method. We further applied the two-sample Mendelian randomization (MR) (Burgess et al., 2017; Hartwig et al., 2017) to assess the association between leukocyte telomere length and multiple cancers, for which the summary statistics can be available from the UK Biobank cohort. Our study revealed that telomere played cancer-specific roles and that a shorter leukocyte telomere length can either increase or decrease the risk/mortality of cancers. However, further validations in large-scale prospective studies and deeper investigations of the biological mechanism of leukocyte telomere length on various types of cancers are warranted.

Materials and Methods

Selection of Instrumental Variables for Leukocyte Telomere Length

We obtained the summary statistics (e.g., effect size and effect allele) of leukocyte telomere length from the ENGAGE consortium as well as the EPIC-CVD and EPIC-InterAct cohorts (Supplementary Table S2; Li et al., 2020), which was the largest GWAS of telomere length (N = 78,592) undertaken in the European population to date. In this study, leukocyte telomere length was measured as a continuous variable and the linear additive regression was implemented to investigate the association for each genetic variant (Li et al., 2020). Particularly, in the association analysis, the age of participants was considered as a covariate to remove the influence of biological age. We selected 17 independent index SNPs that were strongly associated with leukocyte telomere length (p < 5.00E-8; see Table 1) to construct GRS. Note that, given the fact that the length of telomere would shorten progressively with age, to facilitate the explanation of our results, we made a sign transformation for the effect sizes of these used SNPs so that the relationship under investigation corresponded to a shorter leukocyte telomere length.
TABLE 1

Independent index single nucleotide polymorphisms (SNPs) associated with leukocyte telomere length in the European population.

SNPChrPositionGeneA1/A2EAFBetaSEpPVEF
rs32191041226,562,621PARP1C/A0.83–0.0420.0069.60E-116.23E-0449.0
rs557496053101,232,093SENP7A/C0.580.0370.0072.45E-083.55E-0427.9
rs109366003169,514,585TERCT/A0.240.0860.0067.18E-512.61E-03205.4
rs13137667471,774,347MOB1BC/T0.96–0.0770.0142.43E-083.85E-0430.2
rs46918954164,048,199NAF1C/G0.78–0.0580.0061.58E-211.19E-0393.4
rs770552651,285,974TERTA/C0.33–0.0820.0065.34E-452.37E-03186.8
rs34991172625,480,328CARMIL1G/T0.070.0610.0116.19E-093.91E-0430.8
rs2736176631,587,561PRRC2AC/G0.31–0.0350.0063.53E-104.33E-0434.0
rs592946137124,554,267POT1A/C0.290.0410.0061.17E-135.94E-0446.7
rs941995810105,675,946OBFC1C/T0.860.0640.0075.05E-191.06E-0383.6
rs22859511108,105,593ATMA/G0.420.0290.0051.43E-084.28E-0433.6
rs23025881473,404,752DCAF4C/G0.10–0.0480.0081.68E-084.58E-0436.0
rs37850741669,406,986TERF2G/A0.26–0.0350.0064.64E-104.33E-0434.0
rs620535801674,680,074RFWD3G/A0.170.0390.0074.08E-083.95E-0431.0
rs71947341682,199,980MPHOSPH6T/C0.780.0370.0066.94E-104.84E-0438.0
rs81057671922,215,441ZNF208G/A0.30–0.0390.0055.42E-137.74E-0460.8
rs756910802062,269,750STMN3T/C0.090.0670.0095.99E-147.05E-0455.4
Independent index single nucleotide polymorphisms (SNPs) associated with leukocyte telomere length in the European population.

Construction of Genetic Risk Score

The genetic risk score for leukocyte telomere length is calculated in a weighted way (Ripatti et al., 2010; Guo et al., 2016; Zeng et al., 2019b). where is the estimated marginal SNP effect on the shorter leukocyte telomere length for the jth selected index SNP (e.g., Table 1) (Li et al., 2020). G is the individual-level genotype of the same SNP in the UK Biobank (Bycroft et al., 2018) or TCGA dataset (Hoadley et al., 2018) and is coded to be 0, 1, and 2, representing the number of effect allele. Following prior work (Zeng et al., 2019b), we do not directly rescale the GRS as its p-value would not be altered regardless of whether the GRS is scaled or not. We instead standardize the GRS so that its mean is zero and the variance is equal to 1.

Two-Stage Regression Model in the UK Biobank and TCGA Using GRS

To link GRS with the risk of cancers from the UK Biobank (Table 2; Bycroft et al., 2018), we apply an additive logistic regression while adjusting for a set of available covariates (i.e., age, gender, smoke, drink, and BMI).
TABLE 2

Association between the genetic risk score (GRS) of leukocyte telomere length and the risk of 37 UK Biobank cancers.

Types of cancerOR (95%CI)pFDRCaseM/FAge (years)
Leukemia1.20 (1.02–1.41)0.0250.05814779/6867.99 ± 8.17
Rectal cancer1.10 (0.96–1.25)0.1650.193231134/9770.64 ± 6.17
Tongue cancer1.06 (0.88–1.29)0.5260.40710265/3768.89 ± 7.35
Squamous cell carcinoma1.04 (0.93–1.15)0.5140.401332168/16470.89 ± 6.21
Testicular cancer1.02 (0.95–1.11)0.5490.417595595/064.78 ± 8.02
Primary bone cancer1.02 (0.81–1.29)0.8450.5247244/2867.56 ± 8.02
Non-melanoma skin cancer1.02 (0.93–1.12)0.6480.458472280/19270.01 ± 6.97
Large bowel cancer/Colorectal cancer1.02 (0.93–1.12)0.7390.490440260/18071.54 ± 5.80
Rodent ulcer1.01 (0.92–1.11)0.8930.538437203/23470.50 ± 5.77
Esophageal cancer1.01 (0.85–1.19)0.9460.552137110/2771.98 ± 6.30
Cervical cancer1.01 (0.95–1.06)0.8570.52712730/1,27366.16 ± 7.65
Non-Hodgkin’s lymphoma0.99 (0.92–1.08)0.9450.552593355/23869.40 ± 7.34
Pre-cancer cells cervix0.99 (0.94–1.06)0.9220.54611171/1,11663.99 ± 7.98
Breast cancer0.98 (0.96–1.01)0.1640.193733037/7,29370.08 ± 6.48
Colon cancer/sigmoid cancer0.97 (0.92–1.04)0.3990.3421055631/42472.30 ± 5.68
Uterine/endometrial cancer0.95 (0.89–1.02)0.1760.2007520/75271.27 ± 5.89
Ovarian cancer0.95 (0.87–1.03)0.2220.2245120/51269.11 ± 7.33
Brain cancer/primary malignant brain tumor0.95 (0.80–1.13)0.5390.41212862/6664.77 ± 8.81
Prostate cancer0.94 (0.91–0.98)0.0050.02024102,410/073.96 ± 4.08
Skin cancer0.94 (0.88–1.00)0.0650.106943478/46571.21 ± 6.38
Basal cell carcinoma0.94 (0.90–0.97)0.0010.01029161,206/1,71070.02 ± 6.84
Stomach cancer0.94 (0.77–1.14)0.5160.4029656/4071.33 ± 6.22
Malignant melanoma0.91 (0.88–0.95)4.57E-069.56E-0525261,031/1,49568.95 ± 7.41
Larynx/throat cancer0.91 (0.80–1.04)0.1610.191228190/3871.12 ± 6.43
Bladder cancer0.91 (0.84–0.98)0.0100.030725548/17772.30 ± 5.87
Eye and/or adnexal cancer0.90 (0.74–1.11)0.3250.2979544/5168.82 ± 7.48
Thyroid cancer0.90 (0.80–1.01)0.0670.10829352/24167.27 ± 7.63
Small intestine/small bowel cancer0.90 (0.76–1.06)0.2060.21613377/5672.28 ± 5.55
Hodgkin’s lymphoma/Hodgkin’s disease0.89 (0.79–0.99)0.0330.069321184/13764.96 ± 8.13
Chronic myeloid leukemia0.88 (0.71–1.10)0.2730.2628144/3768.35 ± 7.98
Lung cancer0.88 (0.74–1.06)0.1720.19712382/4172.60 ± 5.70
Kidney/renal cell cancer0.86 (0.78–0.95)0.0030.017401261/14070.22 ± 6.36
Cancer of lip/mouth/pharynx/oral/cavity0.86 (0.68–1.09)0.2130.2206943/2670.42 ± 5.98
Sarcoma/fibrosarcoma0.84 (0.72–0.98)0.0280.06316476/8866.73 ± 7.58
Chronic lymphocytic leukemia0.82 (0.71–0.94)0.0050.020206131/7571.28 ± 6.28
Lymphoma0.80 (0.64–1.01)0.0570.0987851/2768.69 ± 8.27
Multiple myeloma0.77 (0.63–0.93)0.0060.02110862/4670.37 ± 7.08
Association between the genetic risk score (GRS) of leukocyte telomere length and the risk of 37 UK Biobank cancers. where μ is the expectation of y, with y = 1 or 0 representing the status of individual i with or without cancer; θ is the effect size of GRS; and is the vector of standardized covariates with effect sizes α. Of note, we assume that all of the entries in the first column of are 1, representing the intercept term. We next evaluate the effect of GRS on the mortality of cancers from TCGA (Table 3; Hoadley et al., 2018) with the Cox proportional hazards model (Cox, 1972) while controlling for available clinical covariates (i.e., age at diagnosis, gender, and stage).
TABLE 3

Association between the genetic risk score (GRS) of leukocyte telomere length and the mortality of 33 TCGA cancers.

CancerHR (95%CI)pFDRNMedian survival time
M/FAge at diagnosis (years)Stage or grade (1/2/3/4/5)
AllEventCensor
DLBC2.24 (0.88–5.67)0.0900.3174231.8519.8332.419/2355.33 ± 14.398/17/5/12
PCPG2.16 (0.95–4.92)0.0680.28317825.2815.0825.678/10047.30 ± 15.12NA
READ1.72 (1.09–2.73)0.0200.13815721.224.3321.0285/7264.34 ± 11.6730/51/51/25
UVM1.47 (0.94–2.30)0.0920.3207925.7719.6827.3744/3561.68 ± 13.940/39/36/4
PRAD1.44 (0.72–2.87)0.3060.61050130.829.1730.87501/060.93 ± 6.81NA
SARC1.29 (1.06–1.58)0.0110.13826031.7721.636.4119/14160.80 ± 14.61NA
ESCA1.28 (0.99–1.66)0.0630.27416213.5713.3813.57137/2562.40 ± 11.7418/79/56/9
TGCT1.23 (0.10–15.59)0.8700.8168137.53116.4837.5381/032.85 ± 10.1855/12/14/0
SKCM1.19 (1.03–1.37)0.0180.13841133.231.9334.5256/15558.82 ± 15.5177/140/171/23
KICH1.17 (0.46–2.99)0.7430.7926574.9328.590.4338/2751.15 ± 13.9920/25/14/6
CESC1.14 (0.90–1.45)0.2740.58329521.2720.2323.120/29547.88 ± 13.47160/69/46/20
THCA1.12 (0.66–1.89)0.6760.77650331.6734.0331.47136/36747.28 ± 15.78284/52/113/54
BRCA1.11 (0.93–1.32)0.2580.56992426.3844.1324.230/92458.84 ± 13.14156/523/219/14/12
LUSC1.06 (0.93–1.21)0.3780.65948721.7718.1324.58359/12867.31 ± 8.58239/157/84/7
MESO1.05 (0.82–1.35)0.7050.7838617.115.2338.9370/1663.08 ± 9.7210/16/44/16
LUAD1.04 (0.90–1.21)0.5840.74950321.8720.4722.33232/27165.16 ± 10.07277/121/80/25
UCEC1.04 (0.84–1.28)0.7420.79154630.4723.6332.20/54663.99 ± 11.13338/52/127/29
KIRC1.04 (0.90–1.20)0.6390.76653239.227.3548.1342/19060.57 ± 12.07267/57/125/83
HNSC1.03 (0.90–1.19)0.6550.77045021.3714.0827.4324/12660.90 ± 12.1327/73/82/268
LIHC1.00 (0.83–1.21)0.9630.83135019.4313.6721.47239/11159.03 ± 13.30174/86/85/5
GBM1.00 (0.91–1.10)0.9490.82959512.2712.78.67364/23157.87 ± 14.41NA
ACC0.99 (0.68–1.44)0.9670.8328837.9318.3848.4529/5947.07 ± 16.439/43/18/18
OV0.98 (0.88–1.08)0.6290.76356933.5735.7728.570/56959.71 ± 11.4616/30/437/86
PAAD0.97 (0.80–1.19)0.7920.80218215.5513.1316.92100/8264.92 ± 11.0621/152/4/5
STAD0.96 (0.83–1.11)0.5760.74640714.5311.618.87260/14765.37 ± 10.7055/128/181/43
BLCA0.96 (0.82–1.12)0.5760.74641117.8713.6821.27303/10868.10 ± 10.583/131/141/136
COAD0.93 (0.76–1.13)0.4480.69645822.3213.4724.33239/21967.03 ± 13.0679/183/131/65
LGG0.89 (0.73–1.09)0.2700.58051222.4727.1320.97284/22842.99 ± 13.340/247/265/0
LAML0.89 (0.73–1.09)0.2520.56318612.179.123.3102/8455.53 ± 16.06NA
UCS0.85 (0.59–1.23)0.3950.6695620.2516.7227.60/5669.38 ± 8.8921/5/20/10
THYM0.85 (0.41–1.76)0.6540.77012141.7728.4342.3362/5958.37 ± 12.9437/61/15/8
KIRP0.66 (0.47–0.93)0.0190.13825724.6720.825.37190/6761.50 ± 12.03171/20/51/15
CHOL0.64 (0.38–1.08)0.0970.3313621.516.6731.4216/2063.03 ± 12.6719/9/1/7
Association between the genetic risk score (GRS) of leukocyte telomere length and the mortality of 33 TCGA cancers. where t is the observed survival time and h0(t) is an arbitrary baseline hazard function. Cancer-specific covariates are considered for some cancers in TCGA [e.g., the status of estrogen and progesterone receptors for breast invasive carcinoma (BRCA)]. In the logistic or Cox model, we are mainly interested in estimating θ and testing for the null hypothesis H0: θ = 0. We further examine the interaction effect between GRS and each of the clinical covariates (e.g., GRS × gender) if GRS is detected to be associated with some cancer.

Two-Sample MR Analysis

Besides the GRS method, we also perform the two-sample MR analysis to estimate the causal effect of leukocyte telomere length on cancers in the UK Biobank using summary statistics (Sudlow et al., 2015). In observational studies, MR is a flexible approach for causal inference to avert confounding and reverse causality (Zeng et al., 2019a; Yu et al., 2020). In brief, we estimate the causal effect of leukocyte telomere length (again, denoted as θ) relying on all the available instrumental variables (Table 1) through the commonly employed inverse-variance weighted (IVW) method (Burgess et al., 2017; Hartwig et al., 2017). where and are the effect size and the variance, respectively, of the instrumental variable j for the exposure X (i.e., leukocyte telomere length; Li et al., 2020), and and are the effect size and the variance, respectively, for the same instrumental variable j on the outcome Y (i.e., cancer in the UK Biobank; Sudlow et al., 2015). To guarantee the validity of our MR analysis, before the formal analysis, we examine the pleiotropic effects of instruments by removing index SNPs that may be potentially related to individual cancers if the Bonferroni-adjusted p-values are less than 0.05. We also conduct a series of sensitivity analyses: (i) weighted median-based (Bowden et al., 2016b) and maximum likelihood methods (Burgess et al., 2013), which are robust when some instrumental variables might be invalid; (ii) MR-Egger regression (Bowden et al., 2016a; Burgess and Thompson, 2017), which guards against horizontal pleiotropic effects; and (iii) leave-one-out (LOO) analysis (Noyce et al., 2017) and Mendelian randomization pleiotropy residual sum and outlier (MR-PRESSO) test (Verbanck et al., 2018) to examine potential instrumental outliers.

UK Biobank and TCGA Cancer Datasets

The UK Biobank dataset consists of approximately 500,000 individuals (Bycroft et al., 2018). We selected age, gender, smoke, drink, and BMI as covariates and originally chose 79 self-reported cancers up to 337,198 independent individuals (28,820 cases and 308,378 controls) of European ancestry, but only included cancers with at least 60 cases (to some extent, this cutoff value was used arbitrarily) and treated cancer-free individuals to be controls. Finally, a total of 37 cancers were left up to 335,036 individuals (27,641 cases for various cancers and 307,395 shared cancer-free controls after removing individuals with missing values). The genotypes were provided by the UK Biobank after the research application was approved. However, we can only obtain 15 SNPs because two were missing (i.e., rs3219104 on PARP1 and rs55749605 on SENP7) in the UK Biobank. In addition, because summary-level statistics are necessary for the two-sample MR analysis, herein we can only consider 28 cancers from the UK Biobank (n = 420,473) (Sudlow et al., 2015; Supplementary Table S6). The summary statistics of these cancers were obtained from https://pan.ukbb.broadinstitute.org/. Then, we obtained the survival and clinical information of 33 cancers from TCGA (Hoadley et al., 2018). We selected the overall survival time and status as the outcome and primarily included age at diagnosis, gender, and pathologic tumor stage as covariates because many other important clinical covariates were missing for most of the patients. When the pathologic tumor stage cannot be available, we instead employed the clinical stage (i.e., for CESC, DLBC, OV, THYM, UCEC, and UCS) or histological grade (i.e., for LGG). It needs to be stated that all three stage variables were missing in five cancers (i.e., GBM, LAML, PCPG, PRAD, and SARC). For each cancer, we only kept samples from the primary cancer tissue and excluded those with missing values in clinical covariates. More details about these TCGA cancers are demonstrated in Table 3 and Supplementary Table S3. For each cancer, we filtered out SNPs that had a missingness rate >0.95 across individuals, genotype calling rate <0.95, minor allele frequency (MAF) > 0.01, or Hardy–Weinberg equilibrium (HWE) p-value < 10–4. We next performed an imputation procedure by first phasing the genotypes with SHAPEIT (Delaneau et al., 2013), then imputed the SNPs based on the Haplotype Reference Consortium panel (McCarthy et al., 2016) on the Michigan Imputation Server using minimac3 (Das et al., 2016). The filtering procedure for the imputed genotypes included an HWE p-value < 10–4, a genotype call rate <95%, a MAF < 0.01, and an imputation score <0.30. After the imputation of genotypes, all of the 17 SNPs were yielded in TCGA.

Power Evaluation

Finally, we performed power calculation to detect a non-zero causal effect for GRS with regards to cancers based on the UK Biobank and TCGA datasets. Firstly, we simulated genotypes for 17 independent SNPs with varying MAFs (Table 1) and then calculated the GRS. Two independent covariates (i.e., one was binary and the other was continuous) were also included, with each having an effect size of 0.5. We generated a case–control variable y with the probability of exp(η)/(1 + exp(η)) and η = GRS × θ + 0.5X1 + 0.5X2. We created 2,000,000 individuals to be the population and then randomly sampled 50 (or 100 and 150) cases and 300,000 controls (as well as their GRS and covariates) to be a subset for the final simulation analysis. Secondly, to simulate survival datasets, we first generated genotypes and calculated the GRS in the same way as described above. Again, two independent covariates were included, with each having an effect size of 0.5. Then, we employed the inverse probability method (Bender et al., 2005) to create survival time which followed a Weibull distribution, with the shape parameter being 1 and the scale parameter being 0.01. The location parameter of this Weibull distribution was determined by the GRS and the two covariates [i.e., μ = exp(η), with η = GRS × θ + 0.5X1 + 0.5X2]. The censored rate was fixed to be 50% in a random manner (the high censored rate corresponded to a similar situation observed in the TCGA cancer dataset). The sample size varied from 100, 300, to 500. In both simulations, the effect size of GRS θ was set to 0.05, 0.10, or 0.20, approximately corresponding to odds ratios (ORs) [or hazard ratio (HR)] of 1.05, 1.10, and 1.20. The simulation was repeated 1,000 times, and the power calculated by the proportion of the p-value of GRS was less than 1.67E-3, approximately equal to the significance level after the Bonferroni correction of 30 types of cancers. Throughout our study, we utilized the R software (version 3.6.1) to implement all the analyses. The association was declared to be statistically significant if the false discovery rate (FDR) is <0.05 (Benjamini and Hochberg, 1995), while the association was deemed to be suggestive if the unadjusted p-value is <0.05.

Results

Association Between GRS and UK Biobank Cancers

The 17 selected index SNPs collectively explain about 1.37% phenotypic variance of leukocyte telomere length, and all the F statistics are above 10 (ranging from 27.9 to 205.4, with an average of 63.3) (Table 1), largely ruling out the possibility of weak instrument bias (Cragg and Donald, 1993; Burgess et al., 2017; Zeng and Zhou, 2019a). Based on the constructed GRS, we first investigate the association between leukocyte telomere length and the risk of UK Biobank cancers (Table 2). We detect that the GRS of leukocyte telomere length is significantly associated with a decreased risk of seven types of cancers (Table 2), including multiple myeloma [OR = 0.77, 95% confidence interval (CI) = 0.63–0.93, FDR = 0.021], chronic lymphocytic leukemia (OR = 0.82, 95%CI = 0.71–0.94, FDR = 0.020), kidney/renal cell cancer (OR = 0.86, 95%CI = 0.78–0.95, FDR = 0.017), bladder cancer (OR = 0.91, 95%CI = 0.84–0.98, FDR = 0.030), malignant melanoma (OR = 0.91, 95%CI = 0.88–0.95, FDR = 9.56E-05), basal cell carcinoma (OR = 0.94, 95%CI = 0.90–0.97, FDR = 0.010), and prostate cancer (OR = 0.94, 95%CI = 0.91–0.98, FDR = 0.020). Suggestive associations are observed for two types of cancers including sarcoma/fibrosarcoma (OR = 0.84, 95%CI = 0.72–0.98, FDR = 0.063) and Hodgkin’s lymphoma/Hodgkin’s disease (OR = 0.89, 95%CI = 0.79–0.99, FDR = 0.069). In addition, we discover that the GRS of leukocyte telomere length is also marginally related to an increased risk of leukemia (OR = 1.20, 95%CI = 1.02–1.41, FDR = 0.058). We further examine the interaction effect of GRS and one of the covariates (e.g., age, gender, smoke, drink, or BMI) for each of the 10 cancers. We observe that the interaction term is statistically significant between smoke and GRS for sarcoma/fibrosarcoma (OR = 0.83, 95%CI = 0.71–0.97) as well as between drink and GRS for leukemia (OR = 0.82, 95%CI = 0.69–0.97) (Supplementary Table S4).

Association Between GRS and TCGA Cancers

We now examine the effect size of GRS on 33 TCGA cancers through the Cox proportional hazards model. We observe suggestive evidence that the GRS of leukocyte telomere length is related to a higher death hazard of READ (HR = 1.72, 95%CI = 1.09–2.73, p = 0.020), SARC (HR = 1.29, 95%CI = 1.06–1.58, p = 0.011), and SKCM (HR = 1.19, 95%CI = 1.03–1.37, p = 0.018) and is associated with a lower death hazard of KIRP (HR = 0.66, 95%CI = 0.47–0.93, p = 0.019), suggesting that a genetically decreased leukocyte telomere length can lead to a worse overall survival of READ, SARC, and SKCM while can result in a better overall survival of KIRP. However, all these associations become non-significant after accounting for multiple comparisons (FDR > 0.05). Neither suggestive nor significant associations are identified between GRS and the remaining cancers (Table 3). We further examine the interaction effect of GRS and each of the covariates (e.g., age at diagnosis, gender, or stage) for each of the four cancers. We do not identify any statistically significant interactions (Supplementary Table S5).

Association Between Leukocyte Telomere Length and UK Biobank Cancers Using the Two-Sample MR

With the selected 17 instrumental variables, we further perform MR analysis to investigate the causal effect of leukocyte telomere length on each of the 28 cancers from the UK Biobank. As no evidence of effect heterogeneity is presented across instruments (all the p-values for the Cochran’s Q test are greater than 0.05), thus, only the results estimated via the fixed-effects IVW method are displayed below. Among the 28 cancers, we identify that leukocyte telomere length is associated with a decreased risk of nine cancers (Supplementary Table S6), including basal cell carcinoma, malignant melanoma, skin cancer, bladder cancer, kidney/renal cell cancer, Hodgkin’s lymphoma/Hodgkin’s disease, thyroid cancer, chronic lymphocytic leukemia, and multiple myeloma. We also observe that leukocyte telomere length is associated with an increased risk of leukemia (Supplementary Table S6). We now validate the observed causal associations shown above through various sensitivity analyses (Supplementary Table S6). Here, we focus on the associations that are significant in all sensitivity analyses (i.e., P and P < 0.05) and have no horizontal pleiotropic effects (i.e., P > 0.05). Then, four types of cancers are left, including malignant melanoma (OR = 0.58, 95%CI = 0.44–0.79, FDR = 0.004), Hodgkin’s lymphoma/Hodgkin’s disease (OR = 0.30, 95%CI = 0.13–0.69, FDR = 0.008), chronic lymphocytic leukemia (OR = 0.20, 95%CI = 0.08–0.54, FDR = 0.004), and multiple myeloma (OR = 0.18, 95%CI = 0.05–0.66, FDR = 0.018). Of note is that both the weighted median method and the maximum likelihood method generate consistent causal effect estimates compared with the IVW method (Supplementary Table S6). In addition, we create scatter plots for the SNP effect sizes of leukocyte telomere length and these four cancers (Figure 1); we find that no instruments may be potential outliers. The finding is also supported by MR-PRESSO, which displays the absence of instrument outliers at the significance level of 0.05.
FIGURE 1

Relationship between the single nucleotide polymorphism (SNP) effect sizes of leukocyte telomere length (LTL) (x-axis) and the corresponding effect sizes of cancer (y-axis). (A) Malignant melanoma. (B) Hodgkin’s lymphoma/Hodgkin’s disease. (C) Chronic lymphocytic leukemia. (D) Multiple myeloma. In the plot, horizontal/vertical lines represent the 95% confidence interval.

Relationship between the single nucleotide polymorphism (SNP) effect sizes of leukocyte telomere length (LTL) (x-axis) and the corresponding effect sizes of cancer (y-axis). (A) Malignant melanoma. (B) Hodgkin’s lymphoma/Hodgkin’s disease. (C) Chronic lymphocytic leukemia. (D) Multiple myeloma. In the plot, horizontal/vertical lines represent the 95% confidence interval. To further examine whether a single instrumental variable may strongly influence the causal effects of leukocyte telomere length on these four cancers, we performed the LOO analysis. Again, the LOO analysis results demonstrate that none of the 17 instruments can substantially impact the estimated casual effect. Therefore, we can conclude that it is likely that a shorter leukocyte telomere length can decrease the risk of malignant melanoma, Hodgkin’s lymphoma/Hodgkin’s disease, chronic lymphocytic leukemia, and multiple myeloma. This finding here is also consistent with the results derived by the GRS regression above.

Power Calculation for the Association Between GRS and Cancers in the UK Biobank/TCGA Datasets

In terms of our simulations, we have sufficient power to detect the association in the UK Biobank as the total sample size is large, although only a few of the cancer cases are included. Specifically, we observe that the estimated power approaches 100% even when the number of cases is only 50 and the OR is only 1.05. In contrast, due to the relatively weak effect size and small sample size in the simulated TCGA cancer dataset, under our simulation settings, we have only low to moderate power to detect the association between GRS and the survival risk of cancer (Figure 2). For example, when the sample size is 300, the statistical power is only 3.0 or 10.7% when the HR was set to be 1.05 or 1.10. As can be expected, the power improves with the increase in the sample sizes and effect sizes.
FIGURE 2

Estimated power in the simulation to evaluate the association between genetic risk score (GRS) and cancers in The Cancer Genome Atlas (TCGA). In the simulation, the effect sizes of GRS were set to 0.05, 0.10, and 0.20 and the sample sizes of cancer were set to 100, 300, and 500.

Estimated power in the simulation to evaluate the association between genetic risk score (GRS) and cancers in The Cancer Genome Atlas (TCGA). In the simulation, the effect sizes of GRS were set to 0.05, 0.10, and 0.20 and the sample sizes of cancer were set to 100, 300, and 500.

Discussion

Summary of the Results of the Present Study

The main objective of our study was to investigate whether there existed associations between genetically predicted leukocyte telomere length and various types of cancers. To achieve this, we first constructed the GRS of leukocyte telomere length based on associated SNPs from a large-scale GWAS and evaluated the effect of GRS on the risk and mortality of cancers. We found statistical evidence supporting the existence of associations between GRS and cancers in the UK Biobank and TCGA. Briefly, based on the GRS, a shorter leukocyte telomere length was identified to be associated with the decreased risk of some cancers (i.e., multiple myeloma, chronic lymphocytic leukemia, kidney/renal cell cancer, bladder cancer, malignant melanoma, basal cell carcinoma, prostate cancer, sarcoma/fibrosarcoma, and Hodgkin’s lymphoma/Hodgkin’s disease) as well as related to the decreased mortality of KIRP. In addition, inverse associations were observed for shorter leukocyte telomere length on the risk of leukemia as well as on the mortality of READ, SARC, and SKCM. The results of the MR analysis also supported the existence of an association between leukocyte telomere length and various cancers, including malignant melanoma, Hodgkin’s lymphoma/Hodgkin’s disease, chronic lymphocytic leukemia, and multiple myeloma. The diverse associations between leukocyte telomere length and cancers may in part reflect the different carcinogenic mechanisms acted by telomere in specific cancer types, further suggesting that telomere length is a valuable indicator of cancer risk and prognosis.

Discoveries Combined With the Previous Study

We found that the observed associations between leukocyte telomere length and cancers in the present study (i.e., multiple myeloma, chronic lymphocytic leukemia, kidney/renal cell cancer, bladder cancer, malignant melanoma, basal cell carcinoma, and prostate cancer) are greatly consistent with prior findings obtained in terms of MR (Supplementary Table S1; Zhang et al., 2015; Ojha et al., 2016; Haycock et al., 2017; Machiela et al., 2017; Li et al., 2020; Went et al., 2020). Particularly, several previous studies demonstrated that a shorter telomere length was associated with a decreased lung cancer risk or mortality and that the association was present in adenocarcinoma while absent in squamous cell carcinoma (Supplementary Table S1; Zhang et al., 2015; Haycock et al., 2017; Kachuri et al., 2018; Yuan et al., 2018), which may be attributed to the discrepancy in the biological characteristics of various subtypes of lung cancer. In the present study, inconsistent correlations were also identified within different subtypes of cancer. For example, we discovered that leukocyte telomere length had an opposite effect on the risk of leukemia and chronic lymphocytic leukemia. However, we observed that leukocyte telomere length displayed similar effects on the risk of malignant melanoma and basal cell carcinoma. These findings suggest that leukocyte telomere may influence the risk or mortality of cancer in a histologic way and also emphasize the unique roles of leukocyte telomere in the development of cancers. Although the molecular mechanism remains unclear, some prior studies implied that both short and long telomere length played an important role in the etiology of cancers (Cui et al., 2012; Cheng et al., 2017; Nelson and Codd, 2020). Cells with longer telomere lengths have greater proliferative potential and more probability of accruing mutations (Hanahan and Weinberg, 2011); therefore, telomere shortening is generally considered to be a protective mechanism against tumorigenesis (Rode et al., 2016; Zhang et al., 2017; Kuo et al., 2019). However, it has been proposed that telomere shortening can generally give rise to end-to-end chromosome fusions and attenuates DNA damage response, thus increasing genomic instability and finally initiating carcinogenesis (Wu et al., 2003). These findings indicate that telomere plays a dual role in cancer development, and such role seems to depend on the types of cancers and the balance of the proliferation and senescence of cells in cancers.

Strengths and Limitations of Our Study

One advantage of our study is that more than 50 diverse types of cancers were investigated; it is thus feasible to undertake a systematic evaluation in the present analysis. In addition, methodologically, the GRS analysis can be viewed to be a two-stage regression model within the framework of instrumental variable-based causal inference (Baum et al., 2003; Hernán and Robins, 2006; Zeng et al., 2019a). Specifically, leukocyte telomere length is the exposure of interest and the associated SNPs are the carefully selected instrumental variables which are supposed to satisfy the necessary assumptions of instruments (Lawlor et al., 2008; Sheehan et al., 2008; Zeng et al., 2019a; Zeng and Zhou, 2019a, b). In the first stage, the effect size of each instrumental variable is estimated with an external large-scale GWAS dataset; in the second stage, the influence of leukocyte telomere length on various cancers is assessed based on the genetically determined leukocyte telomere length which is predicted with the chosen instrumental variables. Therefore, in terms of the principle of instrumental variable inference, the estimated effect of GRS can be interpreted as causal. In this sense, besides the MR method, we are actually investigating the causal association between leukocyte telomere length and cancers by constructing a GRS. Finally, some shortcomings of this study should also be mentioned. Firstly, the majority of the individuals of the UK Biobank and TCGA were of European ancestry, so our results may not be applicable to other populations. Secondly, in our study, telomere length measured in blood leukocytes was employed and not in all cell types in vivo; however, leukocyte telomere length was demonstrated to be highly correlated with that in cells from other tissues (Friedrich et al., 2000; Wilson et al., 2008; Butt et al., 2010). Thirdly, as described before, the effect sizes of leukocyte telomere length on the mortality of TCGA cancers were only suggestive and the sample size of these cancers was not sufficiently large to maintain high power to detect weak associations. Therefore, further investigations with a larger sample size are required to validate our results.

Conclusion

Our study reveals that telomere played diverse roles in different types of cancers; however, further validations in large-scale prospective studies and deeper investigations of the biologic mechanisms are warranted.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

PZ conceived the idea for the study. PZ, YW, XZ, SH, and HZ obtained the data. PZ and YG cleared up the datasets, performed the data analyses, and drafted the manuscript. PZ, YG, and YW interpreted the results of the data analyses. All authors approved the manuscript and provided relevant suggestions.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  73 in total

Review 1.  Genetic determinants of telomere length and cancer risk.

Authors:  Christopher P Nelson; Veryan Codd
Journal:  Curr Opin Genet Dev       Date:  2020-03-11       Impact factor: 5.578

Review 2.  Polygenic Risk Scores in Clinical Psychology: Bridging Genomic Risk to Individual Differences.

Authors:  Ryan Bogdan; David A A Baranger; Arpana Agrawal
Journal:  Annu Rev Clin Psychol       Date:  2018-05-07       Impact factor: 18.561

3.  Genome-wide association identifies OBFC1 as a locus involved in human leukocyte telomere biology.

Authors:  Daniel Levy; Susan L Neuhausen; Steven C Hunt; Masayuki Kimura; Shih-Jen Hwang; Wei Chen; Joshua C Bis; Annette L Fitzpatrick; Erin Smith; Andrew D Johnson; Jeffrey P Gardner; Sathanur R Srinivasan; Nicholas Schork; Jerome I Rotter; Utz Herbig; Bruce M Psaty; Malinee Sastrasinh; Sarah S Murray; Ramachandran S Vasan; Michael A Province; Nicole L Glazer; Xiaobin Lu; Xiaojian Cao; Richard Kronmal; Massimo Mangino; Nicole Soranzo; Tim D Spector; Gerald S Berenson; Abraham Aviv
Journal:  Proc Natl Acad Sci U S A       Date:  2010-04-26       Impact factor: 11.205

4.  Telomere length in different tissues of elderly patients.

Authors:  U Friedrich; E Griese; M Schwab; P Fritz; K Thon; U Klotz
Journal:  Mech Ageing Dev       Date:  2000-11-15       Impact factor: 5.432

Review 5.  Telomerase, senescence and ageing.

Authors:  May Shawi; Chantal Autexier
Journal:  Mech Ageing Dev       Date:  2007-12-14       Impact factor: 5.432

Review 6.  Shelterin: the protein complex that shapes and safeguards human telomeres.

Authors:  Titia de Lange
Journal:  Genes Dev       Date:  2005-09-15       Impact factor: 12.890

7.  Polygenic Prediction of Weight and Obesity Trajectories from Birth to Adulthood.

Authors:  Amit V Khera; Mark Chaffin; Kaitlin H Wade; Sohail Zahid; Joseph Brancale; Rui Xia; Marina Distefano; Ozlem Senol-Cosar; Mary E Haas; Alexander Bick; Krishna G Aragam; Eric S Lander; George Davey Smith; Heather Mason-Suares; Myriam Fornage; Matthew Lebo; Nicholas J Timpson; Lee M Kaplan; Sekar Kathiresan
Journal:  Cell       Date:  2019-04-18       Impact factor: 66.850

8.  Mendelian randomization analysis with multiple genetic variants using summarized data.

Authors:  Stephen Burgess; Adam Butterworth; Simon G Thompson
Journal:  Genet Epidemiol       Date:  2013-09-20       Impact factor: 2.135

9.  PRSice: Polygenic Risk Score software.

Authors:  Jack Euesden; Cathryn M Lewis; Paul F O'Reilly
Journal:  Bioinformatics       Date:  2014-12-29       Impact factor: 6.937

10.  A reference panel of 64,976 haplotypes for genotype imputation.

Authors:  Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal:  Nat Genet       Date:  2016-08-22       Impact factor: 38.330

View more
  6 in total

1.  Assessing the causal role of epigenetic clocks in the development of multiple cancers: a Mendelian randomization study.

Authors:  Fernanda Morales Berstein; Daniel L McCartney; Ake T Lu; Konstantinos K Tsilidis; Emmanouil Bouras; Philip Haycock; Kimberley Burrows; Amanda I Phipps; Daniel D Buchanan; Iona Cheng; Richard M Martin; George Davey Smith; Caroline L Relton; Steve Horvath; Riccardo E Marioni; Tom G Richardson; Rebecca C Richmond
Journal:  Elife       Date:  2022-03-29       Impact factor: 8.713

2.  Predicted leukocyte telomere length and risk of germ cell tumours.

Authors:  Shannon S Cigan; John J Meredith; Ava C Kelley; Tianzhong Yang; Erica K Langer; Anthony J Hooten; John A Lane; Benjamin R Cole; Mark Krailo; A Lindsay Frazier; Nathan Pankratz; Jenny N Poynter
Journal:  Br J Cancer       Date:  2022-04-02       Impact factor: 9.075

Review 3.  Stress and telomere shortening: Insights from cellular mechanisms.

Authors:  Jue Lin; Elissa Epel
Journal:  Ageing Res Rev       Date:  2021-11-01       Impact factor: 10.895

4.  Decoupling blood telomere length from age in recipients of allogeneic hematopoietic cell transplant in the BMT-CTN 1202.

Authors:  Tsung-Po Lai; Simon Verhulst; Casey L Dagnall; Amy Hutchinson; Stephen R Spellman; Alan Howard; Hormuzd A Katki; John E Levine; Wael Saber; Abraham Aviv; Shahinaz M Gadalla
Journal:  Front Immunol       Date:  2022-10-03       Impact factor: 8.786

5.  Investigating the influence of breastfeeding on asthma in children under 12 years old in the UK Biobank.

Authors:  Wenyan Hou; Fengjun Guan; Lei Xia; Yue Xu; Shuiping Huang; Ping Zeng
Journal:  Front Immunol       Date:  2022-09-29       Impact factor: 8.786

6.  Novel Autophagy-Related Gene Signature Investigation for Patients With Oral Squamous Cell Carcinoma.

Authors:  Lihong Huang; Xinghao Yu; Zhou Jiang; Ping Zeng
Journal:  Front Genet       Date:  2021-06-17       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.