Literature DB >> 27515689

Meta-analysis of Genome Wide Association Studies Identifies Genetic Markers of Late Toxicity Following Radiotherapy for Prostate Cancer.

Sarah L Kerns1, Leila Dorling2, Laura Fachal3, Søren Bentzen4, Paul D P Pharoah2, Daniel R Barnes2, Antonio Gómez-Caamaño5, Ana M Carballo5, David P Dearnaley6, Paula Peleteiro5, Sarah L Gulliford6, Emma Hall7, Kyriaki Michailidou2, Ángel Carracedo8, Michael Sia9, Richard Stock10, Nelson N Stone10, Matthew R Sydes11, Jonathan P Tyrer2, Shahana Ahmed12, Matthew Parliament13, Harry Ostrer14, Barry S Rosenstein15, Ana Vega16, Neil G Burnet17, Alison M Dunning12, Gillian C Barnett18, Catharine M L West19.   

Abstract

Nearly 50% of cancer patients undergo radiotherapy. Late radiotherapy toxicity affects quality-of-life in long-term cancer survivors and risk of side-effects in a minority limits doses prescribed to the majority of patients. Development of a test predicting risk of toxicity could benefit many cancer patients. We aimed to meta-analyze individual level data from four genome-wide association studies from prostate cancer radiotherapy cohorts including 1564 men to identify genetic markers of toxicity. Prospectively assessed two-year toxicity endpoints (urinary frequency, decreased urine stream, rectal bleeding, overall toxicity) and single nucleotide polymorphism (SNP) associations were tested using multivariable regression, adjusting for clinical and patient-related risk factors. A fixed-effects meta-analysis identified two SNPs: rs17599026 on 5q31.2 with urinary frequency (odds ratio [OR] 3.12, 95% confidence interval [CI] 2.08-4.69, p-value 4.16×10(-8)) and rs7720298 on 5p15.2 with decreased urine stream (OR 2.71, 95% CI 1.90-3.86, p-value=3.21×10(-8)). These SNPs lie within genes that are expressed in tissues adversely affected by pelvic radiotherapy including bladder, kidney, rectum and small intestine. The results show that heterogeneous radiotherapy cohorts can be combined to identify new moderate-penetrance genetic variants associated with radiotherapy toxicity. The work provides a basis for larger collaborative efforts to identify enough variants for a future test involving polygenic risk profiling.
Copyright © 2016 The Ohio State University Wexner Medical Center. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Cancer survivorship; Genome-wide association study; Prostate cancer; Quality of life; Radiation toxicity; Radiogenomics

Mesh:

Substances:

Year:  2016        PMID: 27515689      PMCID: PMC5036513          DOI: 10.1016/j.ebiom.2016.07.022

Source DB:  PubMed          Journal:  EBioMedicine        ISSN: 2352-3964            Impact factor:   8.143


Introduction

Radiotherapy is used in the treatment of up to 50% of cancer patients and around 40% of long-term cancer survivors underwent radiotherapy at some point in their treatment. For example, approximately half of the 1.1 million men diagnosed with prostate cancer worldwide each year receive radiotherapy, and the 5-year relative survival rates approach 100% for non-metastatic disease (Howlader et al., 2013). Although modern treatments minimize radiation doses to surrounding normal tissues, some men develop long-term toxicity (Bentzen et al., 2010). The risk of severe toxicity limits doses, which aim to keep the prevalence below 5%. Mild and moderate effects are common (10–50% of those treated) (Alemozaffar et al., 2011, Dearnaley et al., 2012, Heemsbergen et al., 2006, Kneebone et al., 2004, Resnick et al., 2013, Syndikus et al., 2010), impact negatively on quality-of-life, and are an important factor when men consider treatment options (Davison et al., 2002). There is a need for a test that reflects a cancer patient's radiosensitivity and predicts the likelihood of toxicity. Many assays have been explored but none proved sufficiently reliable for clinical application. Over the past 15 years interest increased in identifying the genetic variants associated with risk of toxicity. The rationale behind the work is that a future test based on a germline polygenic risk score will not suffer from the poor reproducibility associated with other assays measuring radiosensitivity (Barnett et al. 2015). Mutations associated with well-characterized radiosensitivity syndromes such as ataxia telangiectasia (Taylor et al., 1975) are rare and do not explain the general inter-individual variation in toxicity following radiotherapy (Safwat et al., 2002). Rather, it is hypothesized that common genetic variants, such as single nucleotide polymorphisms (SNPs) account for most of the heritability of radiosensitivity (West and Barnett, 2011). Studies have begun to identify common variants associated with radiotherapy toxicity. Candidate gene studies showed rs2868371 in HSPB1 (MIM 602195) (Lopez Guerra et al., 2011, Pang et al., 2013) and rs1800469 in TGFB1 (MIM 190180) (Guerra et al., 2012) are associated with late effects of lung radiotherapy; and rs1800629 in TNF (MIM 191160) (Talbot et al., 2012) and rs1139793 in TXNRD2 (MIM 606448) (Edvardsen et al., 2013) are risk SNPs for late toxicity following breast radiotherapy. Genome-wide association studies (GWAS) identified a locus on chr11q14.3 associated with rectal bleeding (Kerns et al., 2013b) and a locus on chr2q24.1 within TANC1 (MIM 611397) associated with overall toxicity (Fachal et al., 2014) following radiotherapy for prostate cancer. Another study showed more associations at the p-value < 5 × 10− 7 level than expected by chance, providing the strongest evidence to date that many common genetic variants are associated with risk of toxicity (Barnett et al., 2014). Recently published GWAS have limitations that we aimed to overcome by using a meta-analysis approach. The published studies used a multi-stage approach, where a small first-stage cohort was analyzed for a genome-wide panel of SNPs and only the most significant SNPs were genotyped in validation datasets. Thus, true positive SNPs were likely missed because they were not tested in the full set of individuals. Here, the Radiogenomics Consortium (West and Rosenstein, 2010) undertook a meta-analysis of four GWAS in order to maximize statistical power (Cohn and Becker, 2003) to discover additional risk variants. It is known that risk factors for late toxicity include not only genetics but also dosimetric parameters, co-morbidities, and patient demographics (Barnett et al. 2009). The latter factors can vary between cohorts as can the treatment (e.g. in prostate cancer: external beam or brachytherapy; type of fractionation – large or small doses per fraction; variable use of hormone therapy; variable use of surgery) and scales used to assess toxicity. There were concerns, therefore, whether the heterogeneity across cohorts might limit our ability to identify variants. This study is important because our ability to identify enough SNPs for a risk profile for clinical implementation is dependent on combining multiple heterogeneous cohorts. The aim was to show that multi-center radiotherapy cohorts could be harmonized and analyzed to identify risk SNPs by increasing the number of individuals analyzed in a single stage (Skol et al., 2006). STROGAR guidelines (Kerns et al., 2014) for reporting radiogenomic studies, which build on the STREGA and STROBE guidelines (Little et al., 2009, von Elm et al., 2007), were followed throughout.

Subjects & Methods

Participants

The four cohorts (RAPPER, RADIOGEN, Gene-PARE, and CCI) comprised individuals with adenocarcinoma of the prostate treated with radiotherapy with curative intent. Table 1 shows the number of individuals in each cohort the number with genome-wide SNP data available, and the final number included in the GWAS meta-analysis after excluding samples for quality control or due to missing data. Informed consent was obtained from all study participants and all studies conform to standards indicated by the Declaration of Helsinki.
Table 1

Number of individuals in each cohort.

RAPPERRADIOGENGene-PARECCI
Total in cohort727741895155
Genotyped via genome-wide SNP chip672741381155
 Excluded: > 5% SNPs missing18181
 Excluded: cryptic relatedness1419160
 Excluded: excess heterozygosity19
 Excluded: PCA outlier68
 Excluded: non-European ancestry based on PCA w/ HapMap samples211673
 Excluded: lacking all 2 yr toxicity or essential covariate data735501
Number included in analysis of at least one toxicity endpoint527597a290b150c

Additional follow-up of the RADIOGEN cohort increased the number with late toxicity data available from the 417 reported previously (Fachal et al., 2014). Of the 597 participants in RADIOGEN, one lacked data on rectal toxicity and seven data on rectal volume and were excluded from the analysis of rectal bleeding. 120 participants had no data on baseline urinary frequency and 119 were missing data on decreased urine stream and were excluded from the respective analyses.

Of the 290 participants in Gene-PARE, 55 were lacking data for rectal volume and were excluded from analysis of rectal bleeding. 35 participants did not complete the urinary questionnaire and were excluded from analysis of urinary frequency and decreased urine stream.

Of the 150 participants in CCI, 15 were lacking data for rectal volume or diabetes and were excluded from analysis of rectal bleeding.

RAPPER (UKCRN1471; n = 727) (Burnet et al., 2006) was approved by the Cambridge South Research Ethics Committee (05/Q0108/365). Individuals received neoadjuvant androgen suppression and external beam radiotherapy, (EBRT): 233 from MRC RT01 (ISRCTN47772397) (Sydes et al., 2004) and 494 from CHHiP (ISRCTN97182923) (Dearnaley et al., 2012). RADIOGEN (n = 741) was approved by the Galician Ethical Committee. Individuals received conformal radical or post-prostatectomy EBRT at the Clinical University Hospital of Santiago de Compostela, Spain (Fachal et al., 2012), and 511 individuals had hormone therapy. Gene-PARE (n = 895) (Ho et al., 2006) was approved by the Mount Sinai Medical Center Institutional Review Board. Individuals had brachytherapy with/without EBRT at the Mount Sinai Hospital, New York, and 472 received hormone therapy. 125I (160Gy; TG-43) was used in those undergoing brachytherapy alone and 103Pd (124Gy) in those also receiving EBRT (Stock et al., 1995, Stone et al., 2003). The CCI cohort (n = 155) (Kerns et al., 2013b) was approved by the Health Research Ethics Board of Alberta (Cancer). Individuals were recruited from the Cross Cancer Institute in Edmonton and the Tom Baker Cancer Centre in Calgary, Alberta, Canada. All 155 individuals underwent EBRT and 82 received hormonal therapy. Total biologic effective dose (BED) (Stock et al., 2006) was calculated for individuals in all four studies to compare radiation exposure across studies (α/β = 3).

Assessment of Late Radiotherapy Toxicity

Participants were assessed prospectively for toxicity (see Table S1). Toxicity was measured 1.5–2.5 years following treatment with the latest value used if more than one assessment was recorded during this follow-up window. For rectal bleeding in Gene-PARE, a 1–5 year window was allowed, because the scoring system assigns grades based on whether rectal bleeding occurs as a single incident or intermittent symptoms over time. Urinary endpoints were re-graded to harmonize across the studies (Table S2). As urinary daytime frequency and nocturia were recorded separately (RAPPER, Gene-PARE) or as a single endpoint (RADIOGEN, CCI), they were collapsed into a single endpoint in RAPPER and Gene-PARE by taking the maximum score between the two. Changes in scores from baseline were calculated for each endpoint. Improvement from baseline (i.e. a negative change in score) was coded as 0. Table S3 lists the two-year prevalence of the toxicity endpoints in the four cohorts, including samples that were not genotyped. A Standardized Total Average Toxicity (STAT) score was calculated as described previously (Barnett et al., 2012b) to provide an overall measure of toxicity. All samples with 2-year toxicity data available and the endpoints described in Table S1 were used to calculate STAT. Data simulation determined the most statistically powerful way to analyze toxicity: ordinal logistic regression of exact change in toxicity grade from baseline; binary logistic regression with no change in toxicity grade compared with a ≥ 1 point increase in toxicity grade; or binary logistic regression with change in toxicity grade ≤ 1 point compared with a ≥ 2 point change. Genotype and toxicity phenotype data for 600 individuals were simulated under two models: (1) null (no association), and (2) alternative model of linear association characterized by a log-linear per-allele increase in odds of toxicity. Under both types of model, genotypes were randomly assigned to each subject according to one of five minor allele frequencies (MAF): 0.05, 0.15, 0.25, 0.35 or 0.45. To create the null data set, a toxicity grade was also randomly assigned to each subject. Datasets representing a log-linear association between genotype and toxicity were created by assigning each subject a toxicity risk score that was calculated using their genotype and a pre-specified effect size (log(odds ratio) = log(1.5)): for subjects i = 1 … n; β the pre-specified effect size; (2 × MAF × β) the population average. A random risk was also added to each score, representing non-genetic factors that may influence toxicity, to create a logistic distribution of toxicity risk. This logistic distribution was then used to assign a toxicity grade to each subject such that subjects lying at the high end of the risk distribution had higher toxicity grades than those at the low end of the distribution. Under both types of model, two phenotype distributions were simulated to reflect the distributions of urinary frequency and rectal bleeding observed in RAPPER. Each simulation was repeated 100,000 times. Three regression models were fitted to each set of simulated data, with toxicity grade as the outcome variable and genotype as the independent variable. Each model produced a p-value for comparison. The power of each type of model applied to the log-linear association data was assessed by tabulating the number of p-values (out of 100,000) achieving statistical significance. Applying a simple Bonferroni correction to the observed p-values to account for 100,000 test repeats, a test was considered statistically significant if p-value < 5 × 10− 6. The model that detected the most significant associations was deemed most powerful. p-Values obtained from analyses of the null data sets were assessed for type I error. Results from 100,000 simulations showed that both ordinal logistic and binary (no versus ≥ 1 point change) logistic regression were equally powerful statistical approaches (Table S4), and binary logistic regression (no change in toxicity grade compared with a ≥ 1 point increase in toxicity grade) was used for the primary analysis. No model displayed higher-than-expected type I errors. Data simulation was performed using R (R Core Development Team, 2014) with the Ordinal package (Christensen, 2013).

Genotyping, Quality Control and Imputation

Germline DNA was genotyped using commercial SNP arrays as part of previously completed GWAS (Barnett et al., 2014, Fachal et al., 2014, Kerns et al., 2013a, Kerns et al., 2013b, Kerns et al., 2013c). Quality control filtering removed SNPs that were missing in > 2.5% of samples, had MAF < 1% or displayed frequencies deviating from those expected under Hardy-Weinberg Equilibrium (p-value < 10− 6). Samples with > 5% of SNPs missing, showing cryptic relatedness, showing excess heterozygosity or being principle component analysis (PCA) outliers were removed (Table 1). After filtering, the four datasets had genotyping rates > 99%. To minimize potential confounding by population structure, individuals with non-European ancestry (Table 1) were excluded using PCA performed with samples of known ancestry from the International HapMap Project (International HapMap, 2003). Imputation using IMPUTE2 software (Howie et al., 2009) with the 1000 Genomes phase I, version 3 (release date 3/16/2012) reference panel (1092 individuals from all 14 populations) (Genomes Project et al., 2012) yielded ~ 38 million SNPs, small insertions and deletions, and structural variants in each study. These datasets were filtered as above in addition to removing SNPs with information score, a measure of imputation certainty, < 0.3. The final datasets included in the GWAS meta-analysis were: 6,672,177 SNPs in 527 RAPPER participants; 6,767,156 SNPs in 597 RADIOGEN participants; 6,627,946 SNPs in 290 Gene-PARE participants; and 6,504,337 SNPs in 150 CCI participants. SNP rs17599026 was directly genotyped in the RADIOGEN and RAPPER samples using a TaqMan assay (forward primer sequence TGCTCATGATGAAGGTATGCTTTCT, reverse primer sequence ACAAAACTGTATTCCCAAGACAAAGC and probe sequences CACCATCCTAAAGCAGTG plus ACCATCCTAAAACAGTG; Applied Biosystems, Thermo Fisher Scientific) according to the manufacturer's standard protocol (PCR annealing and extension performed at 60 °C for 1 min × 40 cycles). It was not possible to design a TaqMan assay with high specificity for rs7720298 as it is immediately adjacent to another SNP (rs7720176).

Statistical Analysis

Individual GWAS were analyzed using binary logistic (individual toxicity endpoints) or linear (STAT) regression, adjusting for non-genetic risk factors identified by QUANTEC (Michalski et al., 2010, Viswanathan et al., 2010). These were: age at treatment, diabetes, rectal volume and BED in analysis of rectal bleeding; age at treatment, transurethral resection of the prostate (TURP) prior to radiotherapy, baseline symptom grade and BED in analysis of urinary endpoints; and all factors in analysis of STAT. Hormone therapy was also included in the Gene-PARE and CCI analyses because baseline symptoms were assessed prior to completion of hormone therapy. Multivariable regression was performed using SNPTEST software (Marchini et al., 2007) with the frequentist test and expected method. The expected method uses the genotype dosages produced from imputation, which account for the uncertainty in using imputed genotypes rather than experimentally determined genotypes. Thus, inaccuracy related to using imputed data was accounted for in the statistical analysis. An additive genetic inheritance model was assumed in all analyses. A fixed-effects meta-analysis with inverse variance weighting was performed using the regression beta coefficients and standard errors produced by SNPTEST. Studies have shown fixed-effects to be more statistically powerful compared with random-effects when the primary purpose is SNP discovery rather than refinement of the effects size estimate (Evangelou and Ioannidis, 2013). A chi-squared test of heterogeneity was performed for each SNP. SNPs considered significant had a meta-p-value ≤ 5 × 10− 8 and agreement in effect direction across all cohorts. Significant SNPs were re-analyzed using an ordinal logistic regression model. Regions of linkage disequilibrium were defined based on r2 ≥ 0.5 in SNPs from the 1000 Genomes European population with the meta-analysis top hits using Haploview software (Barrett et al., 2005). The most recent release of the 1000 Genomes data (version 5, release date 5/2/2013) was used to estimate linkage disequilibrium.

Power Calculations

Power calculations were performed using the web-based Genetic Power Calculator (Purcell et al., 2003) assuming a sample size of 1564 individuals, a phenotype prevalence of 20%, and type I error of 5 × 10− 8.

Results

Table 2 summarizes the cohort characteristics and Table 3 the toxicity prevalences of the 1564 men treated for prostate cancer who were included in the analysis. Two years following radiotherapy 17.8% (277 of 1557) of individuals experienced rectal bleeding, 15.0% (212 of 1410) an increase in urinary frequency, and 8.1% (101 of 1245) a decrease in urine stream. The meta-analysis had ≥ 99% power to detect SNPs with MAF ≥ 10% and per-allele effect size ≥ 2 assuming a genome-wide significance threshold of 5 × 10− 8 (Table 4).
Table 2

Individual study patient characteristics.

RAPPER, N = 527RADIOGEN, N = 597Gene-PARE, N = 290CCI, N = 151
Age, mean (sda)67 (5.7)71 (5.8)64 (7.5)67 (7.2)
TURPb, n (%)56 (11%)45 (8%)6 (2%)6 (4%)
Diabetes, n (%)39 (7%)144 (24%)16 (6%)24 (16%)
Gleason score, n (%)
 ≤ 6261 (50%)366 (61%)193 (67%)30 (20%)
 7231 (44%)164 (28%)69 (24%)97 (64%)
 ≥ 825 (5%)60 (10%)28 (10%)21 (14%)
 missing10 (2%)7 (1%)03 (2.0%)
Stage, n (%)
 T1187 (35%)214 (35.8%)150 (52%)38 (25%)
 T2267 (51%)323 (54.1%)128 (44%)89 (59%)
 T361 (10%)44 (7.4%)12 (4%)16 (11%)
 T40001 (1%)
 missing12 (2%)17 (2.8%)07 (5%)
Pre-treatment PSAc, mean (sd)13.2 (7.5)16 (20.5)9.4 (19.4)15.5 (13.4)
Hormone therapy, n (%)527 (100%)415 (69.5)147 (51%)74 (49%)
BEDd, mean (sd)120.5 (6.2)120.5 (5.8)191.9 (22.4)125.5 (6.2)
Rectum Volume (cm3), mean (sd)NAe56.9 (29.9)33.9 (13.9)76.3 (27.7)

sd, standard deviation.

TURP, transurethral resection of the prostate.

PSA, prostate specific antigen.

BED, biologically effective dose.

NA, not available.

Table 3

Prevalence of radiotherapy toxicity endpoints.

Change in toxicity grade from baselinea
0123
Rectal bleeding
RAPPER446 (85%)64 (12%)16 (3%)1 (< 0%)
RADIOGEN516 (88%)52 (9%)16 (3%)5 (1%)
Gene-PARE208 (72%)25 (9%)49 (17%)8 (3%)
CCI110 (73%)29 (19%)6 (4%)6 (4%)



Urinary frequency
RAPPER482 (91%)42 (8%)3 (1%)0
RADIOGEN423 (89%)46 (10%)7 (2%)1(< 1%)
Gene-PARE173 (68%)52 (20%)25 (10%)5 (2%)
CCI120 (80%)24 (16%)7 (5%)0



Decreased urine stream
RAPPER483 (94%)18 (4%)9 (2%)2 (< 1%)
RADIOGEN472 (99%)2 (< 1%)3 (1%)1 (< 1%)
Gene-PARE189 (74%)34 (13%)24 (9%)8 (3%)
CCINANANANA

For the Gene-PARE, CCI and RADIOGEN studies, the post-treatment rectal bleeding grade already accounts for baseline symptoms. For the CCI study, the post-treatment urinary frequency grade already accounts for baseline symptoms.

Table 4

Power to detect significant associations among 1564 radiotherapy patientsa.

Minor allele frequency
0.050.100.150.250.350.50
Per-allele odds ratio1.15000000
1.250001%1%1%
1.51%7%21%47%59%56%
2.064%99%100%100%100%100%

Assuming a type I error rate of 5 × 10− 8 and a disease prevalence of 20%, which was the approximate average prevalence of the toxicities included in the study.

Meta-analysis Q-Q plots (Fig. S1) show no genomic inflation, suggesting that population structure was adequately controlled using principal components analysis to exclude outliers with non-European ancestry. The most statistically significant SNPs associated with late toxicity are listed in Table 5, Table 6, Table 7, Table 8, which show several SNPs had meta-p-values reaching or approaching significance (p-value ≤ 5 × 10− 6) with concordant effect direction across the individual studies. Three SNPs reached genome-wide significance: rs17599026 on 5q31.2 with urinary frequency (3.12, 95% CI 2.08–4.69, p-value 4.16 × 10− 8); rs7720298 on 5p15.2 with decreased urine stream (OR 2.71, 95% CI 1.90–3.86, p-value = 3.21 × 10− 8); and rs11230328 on 11q12.2 with STAT score (Beta 0.31, 95% CI 0.21–0.41, p-value = 6.27 × 10− 10). rs17599026 was directly genotyped in the GenePARE and CCI studies and imputed in the RADIOGEN (information score = 0.90) and RAPPER (information score = 0.87) studies. rs7720298 was imputed in all four studies (RADIOGEN information score = 0.91, RAPPER information score = 0.93, GenePARE information score = 0.94, and CCI information score = 0.95) and rs11230328 was imputed in all four studies (RADIOGEN information score = 0.68, RAPPER information score = 0.61, GenePARE information score = 0.88, and CCI information score = 0.97). To provide a more interpretable effect size for rs11230328, STAT score was dichotomized at the mean and analyzed using logistic regression, which resulted in an OR of 1.59 (95% CI 1.25–2.02). There were no genome-wide significant SNPs associated with rectal bleeding, and though several approached significance, these require investigation in future larger studies. Fig. 1 shows forest plots for the SNPs that reached the genome-wide significance threshold.
Table 5

Rectal bleeding multivariable analysis results (p < 5 × 10− 6).

rsIDChrMinor alleleMAFaRAPPER, N = 527
RADIOGEN, N = 589
Gene-PARE, N = 235
CCI, N = 136
Meta-analysis, N = 1487
ORb(95% CIc)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-Valuephetd
rs124975183p12.3A0.470.66 (0.45,0.95)0.0250.60 (0.41,0.87)5.44 × 10− 30.52 (0.33,0.84)5.67 × 10− 30.58 (0.34,0.97)0.0350.60 (0.48,0.74)1.41 × 10− 60.903
rs141044160e23q23C0.052.62 (1.53,4.48)7.86 × 10− 4NAfNA2.77 (1.48,5.21)1.06 × 10− 3NANA2.68 (1.78,4.04)2.26 × 10− 60.892
rs7432328g3p26.1A0.063.15 (1.78,5.58)1.34 × 10− 4NANANANA4.32 (1.40,13.29)0.0113.36 (2.02,5.59)3.23 × 10− 60.620
rs69998598q21.13G0.281.73 (1.15,2.61)9.00 × 10− 32.10 (1.40,3.16)4.12 × 10− 41.36 (0.84,2.20)0.2191.6 (0.86,2.99)0.1381.73 (1.37,2.17)3.42 × 10− 60.592
rs480413419p13.2G0.310.63 (0.44,0.91)0.0110.48 (0.32,0.71)1.58 × 10− 40.69 (0.42,1.12)0.1310.68 (0.38,1.22)0.1860.60 (0.48,0.74)3.47 × 10− 60.621
rs3600711q42.12T0.491.69 (1.20,2.40)2.80 × 10− 31.77 (1.23,2.54)2.08 × 10− 31.43 (0.92,2.23)0.1101.40 (0.82,2.38)0.2171.61 (1.32,1.97)3.62 × 10− 60.835

MAF, minor allele frequency.

OR, odds ratio.

CI, confidence interval.

phet = p-value from chi-squared test of heterogeneity.

For rs141044160, RADIOGEN MAF = 0.048, imputation info = 0.749, OR (95% CI) = 0.81, (0.36, 1.81), p-value = 0.593 and CCI = Not imputed.

NA, not available.

For rs7432328, RADIOGEN MAF = 0.030, imputation info = 0.946, OR (95%CI) = 0.37 (0.08, 1.65), p-value = 0.134; Gene-PARE MAF = 0.041, imputation info = 0.953, OR (95% CI) = 0.91 (0.33, 2.52), p-value = 0.735.

Table 6

Urinary frequency multivariable analysis results (p < 5 × 10− 6).

rsIDChrMinor alleleMAFaRAPPER, N = 527
RADIOGEN, N = 477
Gene-PARE, N = 255
CCI, N = 150
Meta-analysis, N = 1409
ORb(95% CIc)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-Valuephetd
rs175990265q31.2T0.082.57 (1.28,5.16)0.0123.19 (1.57,6.46)2.36 × 10− 38.16 (3.04,21.9)5.88 × 10− 61.63(0.59,4.54)0.3583.12 (2.08,4.69)4.16 × 10− 80.139
rs1157453212q13.13T0.071.70 (0.70,4.33)0.2823.69 (1.78,7.68)8.29 × 10− 43.57 (1.53,8.35)3.03 × 10− 35.76 (1.66,19.99)5.50 × 10− 33.25 (2.08,5.07)2.11 × 10− 70.420
rs3424424q22.1T0.380.39 (0.22,0.68)3.50 × 10− 40.56 (0.36,0.89)0.0110.51 (0.32,0.8)2.88 × 10− 30.62(0.31,1.25)0.1740.51 (0.39,0.66)3.86 × 10− 70.700
rs453463612p13.31A0.201.27 (0.71,2.27)0.4292.48 (1.54,3.99)2.35 × 10− 42.06 (1.25,3.41)4.31 × 10− 32.03(0.97,4.25)0.04611.96 (1.49,2.59)1.67 × 10− 60.373
rs809870118q21.1T0.072.93 (1.39,6.15)7.63 × 10− 32.75 (1.52,4.97)1.15 × 10− 31.70 (0.86,3.34)0.1332.53(0.77,8.28)0.1362.41 (1.68,3.47)2.11 × 10− 60.684
rs7366282e1q41C0.062.22 (1.02,4.82)0.0564.11 (1.82,9.3)1.27 × 10− 33.86 (1.55,9.63)3.03 × 10− 3NAfNA3.20 (1.98,5.16)2.03 × 10− 60.501
rs102096972q36.1A0.074.09 (1.65,10.1)4.35 × 10− 34.32 (2.19,8.5)5.14 × 10− 51.53 (0.74,3.14)0.2551.36(0.44,4.27)0.5992.66 (1.77,3.99)2.27 × 10− 60.093
rs49978236q14.1G0.350.40 (0.22,0.74)1.90 × 10− 30.37 (0.2,0.67)5.69 × 10− 40.68 (0.42,1.11)0.1180.44(0.20,0.99)0.0400.49 (0.36,0.66)2.35 × 10− 60.372
rs73569456p24.1T0.332.56 (1.58,4.13)1.10 × 10− 41.63 (1.08,2.48)0.0221.21 (0.8,1.85)0.3682.42(1.25,4.69)7.32 × 10− 31.74 (1.38,2.21)3.71 × 10− 60.097
rs600398222q11.23G0.440.67 (0.37,1.19)0.1660.46 (0.27,0.79)3.70 × 10− 30.39 (0.23,0.67)4.33 × 10− 60.60(0.30,1.19)0.1380.51 (0.38,0.68)4.28 × 10− 60.549
rs101011588q24.3A0.441.63 (0.95,2.81)0.0741.56 (0.98,2.48)0.0631.99 (1.3,3.06)1.24 × 10− 32.17(1.14,4.11)0.0151.80 (1.40,2.31)4.39 × 10− 60.792

MAF, minor allele frequency.

OR, odds ratio.

CI, confidence interval.

phet = p-value from chi-squared test of heterogeneity.

rs7366282 was not imputed in CCI.

NA, not available.

Table 7

Decreased stream multivariable analysis results (p < 5 × 10− 6).

rsIDChrMinor alleleMAFaRAPPER, N = 512
RADIOGEN, N = 478
Gene-PARE, N = 255
Meta-analysis, N = 1245
ORb(95% CIc)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-Valuephetd
rs77202985p15.2G0.242.13 (1.21,3.74)0.0103.63 (1.08,12.16)0.0443.09 (1.9,5.04)2.34 × 10− 62.71 (1.90,3.86)3.21 × 10− 80.545
rs17362923e8p23.2G0.183.60 (1.98,6.55)3.12 × 10− 5NAfNA2.18 (1.29,3.66)3.20 × 10− 32.70 (1.83,4.00)6.79 × 10− 70.459
rs762734961q31.3C0.074.02 (1.83,8.81)1.31 × 10− 33.69 (0.66,20.46)0.1693.32 (1.43,7.68)5.13 × 10− 33.68 (2.13,6.33)2.71 × 10− 60.948
rs220320523q21.1C0.291.70 (1.15,2.52)0.0102.35 (0.97,5.74)0.0541.71 (1.26,2.31)5.87 × 10− 41.74 (1.38,2.20)2.86 × 10− 60.790
rs1445969113q28A0.094.41 (2.09,9.28)2.38 × 10− 41.01 (0.09,11.12)0.9953.28 (1.44,7.45)4.78 × 10− 33.60 (2.11,6.17)2.94 × 10− 60.493
rs6209136818p11.32A0.076.78 (2.26,20.37)1.26 × 10− 36.96 (1.71,28.3)0.0152.66 (1.08,6.57)0.0374.36 (2.33,8.14)3.95 × 10− 60.334
rs1413427195q23.3T0.125.39 (2.43,11.94)5.09 × 10− 51.35 (0.23,7.9)0.7482.77 (1.27,6.07)0.0123.50 (2.05,5.95)3.97 × 10− 60.273
rs67378318p11.32G0.312.88 (1.52,5.44)1.06 × 10− 32.28 (0.68,7.6)0.1832.28 (1.33,3.91)3.24 × 10− 32.49 (1.69,3.67)4.33 × 10− 60.853

MAF, minor allele frequency.

OR, odds ratio.

CI, confidence interval.

phet = p-value from chi-squared test of heterogeneity.

For rs17362923, RADIOGEN MAF = 0.182; imputation info = 0.980. This SNP had a very large standard error in the RADIOGEN dataset, and so it was excluded from meta-analysis.

NA, not available.

Table 8

Overall toxicity (STAT) multivariable analysis results (p < 5 × 10− 6).

rsIDChrMinor alleleMAFaRAPPER, N = 527
RADIOGEN, N = 474
Gene-PARE, N = 209
CCI, N = 136
Meta-analysis, N = 1,346
Beta (95% CIb)p-valueBeta (95% CI)p-valueBeta (95% CI)p-valueBeta (95% CI)p-valueBeta (95% CI)p-valuephetc
rs1123032811q12.2G0.270.23 (0.06,0.40)7.96 × 10− 30.45 (0.29,0.61)3.50 × 10− 80.20 (− 0.03,0.43)0.0830.22 (− 0.09,0.53)0.1700.31 (0.21,0.41)6.27 × 10− 100.154
rs147596965d10p14T0.07NAeNA0.75 (0.47,1.03)2.73 × 10− 7NANA0.44 (− 0.04,0.92)0.0720.67 (0.43,0.91)6.19 × 10− 80.150
rs775304482q31.1G0.090.07 (− 0.18,0.33)0.5760.51 (0.31,0.71)4.01 × 10− 70.41 (0.12,0.70)5.60 × 10− 30.29 (− 0.13,0.70)0.1790.36 (0.23,0.49)7.36 × 10− 80.060
rs4906759f15q12T0.060.34 (− 0.02,0.71)0.0630.86 (0.54,1.18)2.28 × 10− 70.32 (− 0.08,0.72)0.110NANA0.55 (0.34,0.75)1.55 × 10− 70.047
rs71610881g4p15.2A0.060.70 (0.42,0.97)1.30 × 10− 6NANANANA0.36 (− 0.06,0.79)0.0980.60 (0.36,0.83)5.41 × 10− 70.198
rs141799618h19q13.2G0.060.33 (0.02,0.65)0.0390.50 (0.26,0.74)5.17 × 10− 5NANA0.45 (− 0.04,0.94)0.0720.44 (0.26,0.62)1.22 × 10− 60.698
rs284216910q26.2C0.100.33 (0.12,0.53)1.73 × 10− 30.24 (0.07,0.41)7.54 × 10− 30.28 (− 0.02,0.58)0.0670.39 (− 0.02,0.81)0.0650.28 (0.17,0.40)1.45 × 10− 60.863
rs1121906811q24.1A0.150.38 (0.17,0.59)3.56 × 10− 40.18 (− 0.01,0.37)0.0620.16 (− 0.11,0.43)0.2520.49 (0.18,0.79)2.25 × 10− 30.28 (0.17,0.40)1.74 × 10− 60.204
rs807556517p13.2T0.130.37 (0.16,0.59)6.43 × 10− 40.33 (0.14,0.52)5.91 × 10− 40.23 (− 0.03,0.49)0.080− 0.01 (− 0.33,0.30)0.9330.28 (0.16,0.39)2.20 × 10− 60.217
rs65350284q28.3T0.100.22 (0.02,0.43)0.0310.33 (0.14,0.52)8.62 × 10− 40.42 (0.11,0.73)8.13 × 10− 30.16 (− 0.19,0.51)0.3740.29 (0.17,0.40)2.70 × 10− 60.615
rs477560215q21.1C0.160.12 (− 0.04,0.28)0.1350.27 (0.12,0.42)5.54 × 10− 40.40 (0.17,0.63)7.39 × 10− 40.11 (− 0.23,0.44)0.5250.23 (0.13,0.32)3.02 × 10− 60.240
rs604817451p13.3A0.100.20 (− 0.07,0.46)0.1490.39 (0.14,0.64)2.69 × 10− 30.65 (0.22,1.08)3.11 × 10− 30.57 (0.08,1.05)0.0220.37 (0.22,0.53)3.53 × 10− 60.260
rs78297598p23.2A0.110.46 (0.23,0.69)1.29 × 10− 40.22 (− 0.02,0.46)0.0780.33 (0.02,0.64)0.0420.26 (− 0.20,0.72)0.2640.33 (0.19,0.47)3.84 × 10− 60.551
rs79604958i15q24.2C0.07NANA0.35 (0.04,0.66)0.0240.44 (0.10,0.78)0.0130.77 (0.34,1.21)6.41 × 10− 40.47 (0.27,0.68)4.33 × 10− 60.289
rs1077085712p13.31A0.370.18 (0.05,0.31)6.02 × 10− 30.14 (0.02,0.26)0.0170.15 (− 0.03,0.33)0.0970.28 (0.05,0.51)0.0180.17 (0.10,0.25)4.98 × 10− 60.774
rs1259143615q25.1G0.340.23 (0.09,0.37)1.56 × 10− 30.16 (0.04,0.28)8.79 × 10− 30.14 (− 0.07,0.35)0.1780.20 (− 0.05,0.46)0.1240.18 (0.10,0.26)5.66 × 10− 60.885

MAF, minor allele frequency.

CI, confidence interval.

phet = p-value from chi-squared test of heterogeneity.

For rs147596965, RAPPER MAF = 0.045, info = 0.619, beta (95%CI) = − 0.14 (− 0.53, 0.25), p-value = 0.478; Gene-PARE MAF = 0.044, info = 0.662, beta (95% CI) = − 0.31 (− 0.83, 0.20), p-value = 0.237.

NA, not available.

For rs4906759, CCI MAF = 0.040, info = 0.604, beta (95% CI) = − 0.71 (− 1.37, − 0.05), p-value = 0.035.

For rs71610881, RADIOGEN MAF = 0.040, info = 0.986, beta (95% CI) = 0.04 (− 0.24, 0.33), p-value = 0.761; Gene-PARE MAF = 0.022, info = 0.995, beta (95% CI) = − 0.07 (− 0.77, 0.63), p-value = 0.841.

For rs141799618, Gene-PARE MAF = 0.037, info = 0.780, beta (95% CI) = − 0.27, (− 0.89, 0.35), p-value = 0.395.

For rs79604958, RAPPER MAF = 0.045, info = 0.499, beta (95% CI) = 0.60 (0.20,1.00), p = 0.0036.

Fig. 1

Forest plots of significant SNPs. SNPs with meta-p-value < 5 × 10− 8 associated with: (a) urinary frequency (rs17599026); (b) decreased urine stream (rs7720298); and (c) overall toxicity (rs11230328). p-Values are from meta-analysis of regression coefficients and standard errors from regression analysis performed in each study. Odds ratios (OR; urinary frequency and decreased urine stream) or regression beta coefficient (STAT score) are shown for each individual study as well as the meta-analysis (Summary). The size of the box marking each odds ratio or beta is proportional to the precision of estimate for the given study. Lines on the boxes denote 95% confidence intervals. ‘Toxicity’ and ‘No toxicity’ (rs17599026 and rs7720298) was defined as ≥ 1 point increase in toxicity grade versus no change in toxicity grade from pre-radiotherapy scores. Overall toxicity was analyzed as a continuous variable (STAT).

In order to confirm the quality of the imputation, rs17599026 was genotyped directly in the RADIOGEN and RAPPER samples. Genotyping was successful in 645 of 654 RADIOGEN samples (correlation between the imputed and genotyped data of 0.88) and 595 of 600 RAPPER samples (correlation between the imputed and genotyped data of 0.85). These results confirmed that the genotypes are in strong agreement with the imputed data. When the direct genotype data was used in analysis, rs17599026 showed an association signal consistent with that seen using the imputed data, though the p-value fell short of the strict threshold for genome-wide significance (OR 2.48, 95% CI 1.68–3.63, p-value = 3.66 × 10− 6). Associations between the top SNPs (meta-analysis p-value ≤ 5 × 10− 6) and toxicity severity were modeled using ordinal logistic regression (Table 9). For all SNPs, the p-value from ordinal logistic regression was similar to that from binary logistic regression. For several SNPs the association signal improved slightly with the ordinal logistic model: rs12497518, rs141044160, rs6999859, and rs4804134 associated with rectal bleeding; rs17599026, rs7366282, rs4534636, rs10101158, rs10209697, rs6003982, and rs7356945 associated with urinary frequency; rs7720298, rs673783, rs62091368, and rs144596911 associated with decreased urine stream.
Table 9

Ordinal logistic regression results showing association between top SNPs and toxicity grade.

Toxicity endpointSNPRAPPER
RADIOGEN
Gene-PARE
CCI
Meta-analysis
ORa(95% CIb)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-ValueOR(95% CI)p-Valuephet
Rectal bleedingrs124975180.66 (0.45,0.95)0.0230.60 (0.41,0.87)5.41 × 10− 30.52 (0.33,0.83)4.37 × 10− 30.58 (0.34,0.96)0.0330.60 (0.48,0.73)9.80 × 10− 70.894
rs1410441602.54 (1.52,4.26)8.70 × 10− 4NAcNA2.62 (1.52,4.52)7.79 × 10− 4NANA2.58 (1.77,3.75)7.47 × 10− 70.942
rs69998591.74 (1.16,2.62)8.37 × 10− 32.10 (1.40,3,14)3.87 × 10− 41.38 (0.86,2.22)0.1901.51 (0.82,2.75)0.1831.70 (1.37,2.13)2.40 × 10− 60.567
rs48041340.64 (0.44,0.91)0.0120.47 (0.32,0.70)1.20 × 10− 40.68 (0.42,1.10)0.1130.69 (0.39,1.22)0.2090.60 (0.48,0.74)3.15 × 10− 60.572
rs3600711.69 (1.19,2.38)2.83 × 10− 31.74 (1.21,2.49)2.72 × 10− 31.43 (0.92,2.22)0.1081.40 (0.83,2.38)0.2131.60 (1.31,1.95)4.34 × 10− 60.851
rs74323282.88 (1.67,4.98)5.29 × 10− 4NANANANA3.97 (1.38,11.39)0.0123.09 (1.90,5.01)5.29 × 10− 60.598
Urinary frequencyrs175990262.56 (1.28,5.12)0.0123.77 (1.79,7.64)6.97 × 10− 44.07 (1.96,8.64)1.71 × 10− 41.84 (0.64,4.93)0.2503.09 (2.11,4.53)8.06 × 10− 90.542
rs115745321.65 (0.65,4.14)0.3083.40 (1.64,6.86)0.0022.62 (1.23,5.53)0.0134.44 (1.41,13.87)0.0122.79 (1.84,4.24)1.41 × 10− 60.534
rs3424420.39 (0.22,0.68)3.57 × 10− 40.56 (0.35,0.88)9.88 × 10− 30.54 (0.35,0.83)4.48 × 10− 30.63 (0.31,1.23)0.1830.52 (0.40,0.67)5.57 × 10− 70.685
rs73662822.44 (1.11,5.36)0.0363.69 (1.62,8.08)2.40 × 10− 33.80 (1.69,8.40)1.44 × 10− 3NANA3.25 (2.05,5.14)4.60 × 10− 70.681
rs45346361.28 (0.71,2.28)0.4172.50 (1.55,4.03)2.05 × 10− 42.07 (1.29,3.34)2.57 × 10− 32.18 (1.06,4.49)0.0352.00 (1.53,2.62)4.75 × 10− 70.361
rs80987013.01 (1.44,6.31)6.05 × 10− 32.59 (1.44,4.54)1.79 × 1031.46 (0.75,2.74)0.2522.90 (0.87,8.97)0.0802.28 (1.60,3.24)4.40 × 10− 60.426
rs101011581.62 (0.94,2.78)0.0771.55 (0.98,2.46)0.0641.96 (1.31,2.94)9.06 × 10− 42.06 (1.12,3.93)0.0201.78 (1.39,2.27)3.67 × 10− 60.830
rs102096973.88 (1.58,9.53)5.55 × 10− 33.86 (1.98,7.40)1.25 × 10− 41.75 (0.86,3.45)0.1191.38 (0.41,4.04)0.5842.63 (1.78,3.90)1.38 × 10− 60.196
rs60039820.65 (0.37,1.17)0.1450.47 (0.27,0.80)4.98 × 1030.39 (0.23,0.65)2.48 × 10− 40.61 (0.30,1.18)0.1440.50 (0.38,0.67)2.74 × 10− 60.557
rs49978230.40 (0.22,0.74)1.89 × 10− 30.35 (0.19,0.64)4.02 × 10− 40.72 (0.44,1.15)0.1700.46 (0.20,0.98)0.0440.50 (0.37,0.67)3.74 × 10− 60.279
rs73569452.52 (1.57,4.04)1.21 × 10− 41.60 (1.06,2.43)0.0261.26 (0.84,1.89)0.2582.37 (1.25,4.63)8.03 × 10− 31.74 (1.38,2.19)2.77 × 10− 60.122
Decreased streamrs77202982.20 (1.25,3.88)7.45 × 10− 33.38 (1.00,11.37)0.058d3.10 (1.96,4.90)9.29 × 10− 7NANA2.75 (1.96,3.88)6.52 × 10− 90.618
rs173629233.58 (1.97,6.51)3.28 × 10− 5NANA1.96 (1.20,3.20)7.75 × 10− 3NANA2.50 (1.71,3.65)2.15 × 10− 60.126
rs762734963.96 (1.83,8.59)1.31 × 10− 33.25 (0.65,16.14)0.191d2.94 (1.37,6.32)7.72 × 10− 3NANA3.39 (2.02,5.68)3.45 × 10− 60.865
rs22032051.71 (1.16,2.54)8.72 × 10− 32.28 (0.94,5.53)0.063d1.63 (1.22,2.19)1.13 × 10− 3NANA1.69 (1.35,2.13)5.35 × 10− 60.785
rs1413427195.11 (2.33,11.21)7.29 × 10− 51.32 (0.22,7.88)0.762d2.58 (1.22,5.45)0.015NANA3.28 (1.96,5.52)6.96 × 10− 60.270
rs6737832.83 (1.50,5.34)1.26 × 10− 32.14 (0.66,6.93)0.210d2.32 (1.38,3.89)1.52 × 10− 3NANA2.47 (1.69,3.61)3.36 × 10− 60.865
rs620913685.93 (2.14,16.41)1.62 × 10− 36.85 (1.70,27.60)0.016d2.81 (1.17,6.73)0.024NANA4.29 (2.36,7.80)1.89 × 10− 60.422
rs1445969114.41 (2.10,9.24)2.24 × 10− 41.07 (0.10,12.00)0.954d3.44 (1.56,7.59)2.60 × 10− 3NANA3.69 (2.18,6.25)1.20 × 10− 60.534

OR, odds ratio.

CI, confidence interval.

NA, not available.

Adjusted for age and biologically effective dose (BED).

Regions of linkage disequilibrium (LD) were defined for the three loci tagged by SNPs reaching genome-wide significance. rs17599026, associated with urinary frequency, tags a 106 kb region of LD (base position 137,657,783–137,763,798; Fig. 2A) containing part of KDM3B (including the promoter region through exon 20), the upstream FAM53C, and part of the upstream CDC25C (the promoter region through exon 6). rs7720298, associated with decreased urine stream, tags a 39 kb region of LD (base position 13,858,328–13,897,362; Fig. 2B) that contains exons 16 through 30 of DNAH5. rs11230328 was not in strong linkage disequilibrium with any other common SNPs found in the more recent release of the 1000 Genomes population data, and this locus may represent a spurious association. rs11230328 lies within a LINE element, and may be difficult to map in the genome due to its location within a region of high homology. Imputation coverage (i.e. the number of SNPs successfully imputed in the study datasets out of common SNPs (MAF ≥ 0.05) within the 1000 Genomes European population) was high within the regions tagged (correlation r2 ≥ 0.5) by rs17599026 (173/183, 94.5%) and rs7720298 (130/136, 95.6%).
Fig. 2

Regional plots for two loci. Loci tagged by rs17599026 (a) and rs7720298 (b) with points representing the tag SNP reported in this paper shown in purple circles. Points representing nearby SNPs are color coded according to linkage disequilibrium r2 value as indicated in the legends. The recombination rate is estimated from samples from the International HapMap project.

SNPs rs17599026 and rs7720298 are located in non-coding regions, as is common in GWAS, and the LD blocks tagged by these SNPs cover coding and non-coding regions. rs17599026, associated with urinary frequency, lies in an intronic region 23 bp downstream of exon 20 of KDM3B (MIM 609373; NM_016604.3:c4753 + 23C > T) encoding the lysine-specific demethylase 3B protein. KDM3B is highly expressed in bladder tissue from the Human Protein Atlas project (Uhlen et al., 2015), suggesting that this gene could be involved in normal bladder function and potentially dysfunction following damage from radiation exposure. rs17599026 itself does not lie in a site of known transcription factor binding or chromatin modification from the Encyclopedia of DNA Elements (ENCODE) catalog (ENCODE Project Consortium, 2012) and there are no significant expression quantitative trail loci (eQTLs) for this SNP in the Genotype-Tissue Expression (GTEx) project (GTEx Consortium, 2013). Given that rs17599026 is very close to exon 20, it could have an effect on splicing. However, no significant splicing motif alteration was detected using Human Splicing Finder (Desmet et al., 2009). ENCODE data show that the large LD block tagged by this SNP contains multiple transcription factor binding sites, DNase hypersensitive sites, histone methylation sites (methylation of lysine 4 at histone 3, H3K4Me3 and methylation of lysine 4 at histone 1, H3K4Me1), and histone acetylation sites (acetylation of lysine 27 at histone 3, H3K27Ac) that may affect regulation of the nearby genes. rs7720298, associated with decreased urine stream, lies in an intronic region just downstream of exon 30 of DNAH5 (MIM 603335; NM_001369.2:c.4950 + 1233G > C) encoding the dynein, axonemal, heavy chain 5 protein that is part of a microtubule-associated motor protein complex. Rare mutations in DNAH5 can result in development of abnormal cilia and flagella in cells that lead to primary ciliary dyskinesia, which is a disorder characterized in part by chronic respiratory tract infections (Escudier et al., 2009). In addition to playing an important role in the lung, DNAH5 is expressed in both kidney and bladder tissue, suggesting a biologic role in normal function of the urinary tract (Uhlen et al., 2015). This SNP does not lie in a site of known transcription factor binding or chromatin modification from the ENCODE catalog nor is it an eQTL based on data from GTEx, but the region of LD tagged by rs7720298 contains several transcription factor binding sites and sites of DNase hypersensitivity measured in ENCODE cell lines, suggesting that it may tag a site of transcriptional regulation.

Discussion & Conclusions

This meta-analysis aimed to identify SNPs associated with late radiotherapy toxicity in a single tumor site. By having a total sample size in a single stage analysis of > 1500 men with prostate cancer (versus ~ 600 in published studies), the study identified two risk loci. This study had ≥ 99% power to identify common SNPs (MAFs ≥ 10%) that confer a relatively large increased risk for developing late toxicity (OR ≥ 2.0). Identification of multiple loci in this single-stage meta-analysis versus single loci in published GWAS that used a staged approach is consistent with the increased power. The meta-analysis showed that heterogeneity in radiotherapy datasets is not a barrier for future multi-cohort radiogenomic studies. The absence of significant SNPs associated with rectal bleeding is consistent with the relatively limited statistical power. Most common variants identified via GWAS have more modest effects (ORs 1.15 to 1.5) than this study was powered to detect. Our group previously reported an excess of associations at the p < 5 × 10− 7 level for rectal bleeding showing many SNPs should be identified as sample sizes increase (Barnett et al., 2014). In addition, our current approach of dichotomizing toxicity at grade 0 versus grade 1 or worse and considering two years of follow-up may not be optimal for rectal bleeding, which could be explored as our radiogenomic cohorts increase. The significant SNPs lie in non-coding regions, as is common in GWAS where ~ 64% of identified SNPs lie in enhancer regions (Hnisz et al., 2013). Gene enhancer elements are noncoding segments of DNA involved in regulating transcriptional programs (Corradin and Scacheri, 2014). Their location is predicted by DNase I hypersensitive regions of open chromatin flanked by sties of histone methylation. These enhancers can be active (associated with the acylation of lysine 27 of histone 3 [H3K27Ac]) or repressed (correlated with histone marks H3K27Me3 and H3K9Me3). Although premature to speculate on the functional consequences of these SNPs, they lie in intronic regions of genes that are expressed in tissues exposed to radiation during treatment of prostate cancer and are involved in the symptoms experienced by men having radiotherapy for this malignancy. Fine mapping studies and functional characterization of causal variants contained within these loci should expand our understanding of the biology underlying the pathogenesis of radiotherapy toxicity. rs17599026 was genotyped to check the quality of the imputed data from RADIOGEN and RAPPER. Direct genotyping confirmed the high quality of the imputation, but we were unable to genotype directly the other two SNPs for which imputed data were used for all four cohorts. rs7720298 had high imputation information scores (> 0.95) in all three studies of urinary decreased stream, lending confidence to this association. However, rs11230328 had a lower imputation quality (< 0.70) in two of the four studies of overall toxicity, and it is not present in the latest 1000 Genomes Project release (v5a; 05/02/2013). Thus, rs11230328 may represent a spurious association. None of the previously studied candidate gene SNPs for radiotherapy toxicity were among the top loci identified in this meta-analysis, consistent with a recent validation study showing no evidence for association between 92 candidate SNPs and radiotherapy toxicity among ~ 1600 individuals treated with radiotherapy for breast or prostate cancer (Barnett et al., 2012a). No SNP discovered in previous GWAS was identified in this de novo analysis, likely due to methodological differences. The published locus on chr2q24.1, identified in RADIOGEN and replicated in RAPPER and Gene-PARE (Fachal et al., 2014), was excluded because the MAFs of the tagging SNPs were < 5%, the threshold set in this meta-analysis to maximize statistical power. Subsequent to the meta-analysis, we tested SNPs in the chr2q24.1 locus for replication in CCI (the only cohort not included in the initial publication) using the methods of the originally published paper. All seven previously reported SNPs showed a consistent direction of association with overall toxicity in CCI, and rs264651 showed a statistically significant association (linear regression beta = 0.66; 95% CI 0.06, 1.27; p-value = 0.03). The locus was also tested for association with the three individual toxicity endpoints investigated in this paper, and appeared to be predominantly associated with urinary toxicity (Table S5). SNPs rs7582141, rs6432512, rs264588, and rs264631 were significantly associated with an increased risk for decreased urinary stream and increased urinary frequency (meta-analysis p-values < 0.05, Table S5). The directions of the associations were consistent across the other three SNPs, with rs264663 and rs264651 reaching statistical significance for urinary frequency. The direction of the association of six of the seven SNPs was consistent for the rectal bleeding endpoint, but none of the SNPs approached statistical significance, suggesting that the urinary endpoints are predominantly driving the association of this locus with overall toxicity. This finding thus supports the initial report that chr2q24.1 is a risk locus for late radiotherapy toxicity in prostate cancer. rs7120482 on chr11q14.3 was previously identified in Gene-PARE and replicated in a combined dataset that included RADIOGEN and CCI (Kerns et al., 2013b), but the association with rectal bleeding was restricted to a recessive inheritance model. In this meta-analysis, dominant and recessive models were not considered in order to avoid an increased burden of multiple comparisons correction. Subsequent to this meta-analysis, we tested rs7120482 for replication in RAPPER (the only cohort not included in that initial publication) using the methods of the originally published paper. The initial association with rectal bleeding was not replicated in RAPPER (odds ratio = 0.74; 95% CI 0.08, 7.33; p-value = 0.79). This lack of replication could be due to differences in the radiotherapy regimen or scoring of toxicity - the incidence of rectal bleeding was low in RAPPER (3%) compared with the other studies (10% to 18%). However, the initial association fell just short of genome-wide significance and might be a false positive result. At this early stage of radiogenomic GWAS we expect some initial associations to be lost as sample sizes increase and false-positive findings are reduced, as seen in other GWAS meta-analyses (Evangelou and Ioannidis, 2013). Finally, Gene-PARE previously identified risk SNPs for erectile dysfunction (Kerns et al., 2010, Kerns et al., 2013a), but this endpoint was not assessed in the other studies in the meta-analysis. Thus, previously identified SNPs remain important, and the meta-analysis reports additional loci. This study has several strengths. First, toxicity was assessed prospectively in all four studies, which minimizes recall bias. Second, baseline information ensures any SNP-toxicity associations identified occur because of the radiotherapy rather than any pre-existing (possibly tumor-related) symptoms. Third, a multivariable approach accounted for relevant treatment and clinical variables. Radiogenomics involves phenotypes that occur in response to an environmental exposure (i.e. therapeutic radiation) that is measurable and can be adjusted for. Fourth, reliable and current imputation methods were used to obtain comparable and dense SNP maps across the four studies, allowing meta-analysis of studies involving different SNP genotyping platforms. Without imputation, only 15,144 SNPs were common to all four datasets, reducing the likelihood of identifying SNPs associated with late radiotherapy toxicity. This study has limitations. First, the four studies were designed independently and involved different methods for recording toxicity. Endpoints needed harmonizing and the approach might be sub-optimal, but serves as a useful guide for future radiogenomics studies involving multiple cohorts. Although endpoints were dichotomized to minimize differences in toxicity grades across the cohorts, the simulation experiment showed minimal loss of statistical power. Second, there was the heterogeneity in radiotherapy protocols. For example, Gene-PARE included individuals who received brachytherapy with EBRT, which increases urinary toxicity (Sanda et al., 2008). Similarly, the proportion of individuals who received androgen deprivation differed across the studies. This heterogeneity was adjusted for using multivariable analysis and by a meta-analysis approach. Importantly, none of the top SNPs showed evidence of heterogeneity in effect size across studies (Table 5, Table 6, Table 7, Table 8). The first SNPs identified are those that rise above such noise in the data, and the significance of our work is that we can find variants despite imperfect datasets. We attempted to control for differences in radiation dose by adjusting for total biologically effective dose in each GWAS. However, this is a surrogate for the doses received by specific normal tissues. Future cohorts with detailed dosimetry data to the different organs at risk will improve on our ability to identify SNPs. The heterogeneity in cohorts should be embraced, as any predictive test must use SNPs that are associated with toxicity across treatment centers and protocols. Last, in order to minimize confounding by population structure, the present analysis was restricted to individuals of European ancestry. It will be important for future studies to focus on other ancestral groups, both for replication of the loci identified here and for discovery of additional loci. While this study was successful in identifying radiosensitivity SNPs via a meta-analysis of GWAS, it is modestly sized compared with GWAS of other diseases and traits, and it should be viewed as the starting point for expanded radiogenomics studies. Given that our study was not powered to detect SNPs with odds ratio < 2, many true positive SNPs will have been missed. Also, though the SNPs identified here reach statistical significance and show consistency across multiple independent studies, there is still a possibility that they will fail to replicate in other cohorts. This is a challenge in GWAS (McCarthy et al., 2008), but meta-analysis and replication studies have proved successful in validating hundreds of risk SNPs for a wide variety of diseases and phenotypes. Future large GWAS meta-analyses will identify additional SNPs, and this work is underway. The Radiogenomics Consortium is a participant in the OncoArray Network, which is a large collaborative effort to gain new insight into the genetic architecture and mechanisms underlying several cancers as well as outcomes related to the their treatment. Approximately 5000 additional DNA samples from individuals treated with radiotherapy for prostate cancer are being genotyped using the customized OncoArray, and meta-analysis of this greatly expanded set of GWAS data using the methods developed in this paper will uncover additional risk SNPs, provide a platform for further validation of the SNPs identified here, and serve as the basis for future post-GWAS analyses on these confirmed loci. The last decade saw a rapid expansion of knowledge of the genetics of disease susceptibility in the general population. Numerous large collaborative GWAS showed that polygenic risk profiles can be built based on multiple SNPs each conferring small effects but together a significant proportion of susceptibility to common diseases. With the rapid decline in costs for genetic testing, there is growing acceptance that risk prediction models incorporating genetic and environmental factors will be important in future healthcare provision (Chatterjee et al., 2016). In 2012 there were an estimated 32.6 million people alive five years after being diagnosed with cancer (http://www.cdc.gov/cancer/international/statistics.htm), and many will be living with the consequences of treatment. Research developing models predicting susceptibility to long-term radiation effects is important and the findings reported here shows that heterogeneous radiotherapy cohorts can be combined to identify common genetic variants associated with toxicity. The work provides a basis for larger collaborative efforts to identify enough variants for a future test involving polygenic risk profiling.

Funding Sources

This work was supported by Cancer Research UK (C1094/A11728 to CMLW and NGB for the RAPPER study, C26900/A8740 to GCB, and C8197/A10865 to AMD), the Royal College of Radiologists (C26900/A8740 to GCB), the National Institute for Health Research (GCB; no grant number), Addenbrooke's Charitable Trust (GCB; no grant number), Institute of Cancer Research (National Institute for Health Research) Biomedical Research Centre (C46/A2131 to DPD and SG), the National Institute for Health Research Cambridge Biomedical Research Centre (NGB; no grant number), UK Medical Research Council (RG70550 to LD), the Joseph Mitchell Trust (AMD; no grant number), the Experimental Cancer Medicine Centre (CMLW; no grant number), Cancer Research UK Program grant Section of Radiotherapy (C33589/A19727 to SLG), the United States National Institutes of Health (1R01CA134444 to BSR and HO, 2P30CA014520-34 to SB, and 1K07CA187546-01A1 to SLK), the American Cancer Society (RSGT-05-200-01-CCE to BSR), the U.S. Department of Defense (PC074201 to BSR and HO), Mount Sinai Tisch Cancer Institute Developmental Fund Award (BSR; no grant number), the Instituto de Salud Carlos III (FIS PI10/00164 and PI13/02030 to AV and PI13/01136 to AC), Fondo Europeo de Desarrollo Regional (FEDER 2007–2013 to AV and AC; no grant number), Instituto de Salud Carlos III (FIS PI10/00164 and PI13/02030 to AV and PI13/01136 to AC), Xunta de Galicia and the European Social Fund (POS-A/2013/034 to LF), and the Alberta Cancer Board Research Initiative Program (103.0393.71760001404 to MP). AMD receives support from the REQUITE study, which is funded by the European Union's Seventh Framework Programme for research, technological development and demonstration under grant agreement no. 601826. Laboratory infrastructure for the RAPPER study was funded by Cancer Research UK [C8197/A10123] and the Manchester Experimental Cancer Medicine Centre. The RAPPER cohort comprises individuals and data recruited into the RT01 and CHHiP UK radiotherapy trials. The RT01 trial was supported by the UK Medical Research Council. The CHHiP trial (CRUK/06/016) was supported by the Department of Health and Cancer Research UK (C8262/A7253); trial recruitment was facilitated within centers by the National Institute for Health Research Cancer Research Network. DPD and SLG acknowledge NHS funding to the NIHR Biomedical Research Centre at the Royal Marsden NHS Foundation Trust and Institute of Cancer Research.

Conflicts of Interest

The authors declare no conflicts of interest. No one was paid to write this article by a pharmaceutical company or agency.

Web Resources

Online Mendelian Inheritance in Man (http://www.omim.org) Genetic Power Calculator (http://pngu.mgh.harvard.edu/~purcell/gpc/) R statistical computing environment (http://www.R-project.org) Centers for Disease Control and Prevention, Global Cancer Statistics (http://www.cdc.gov/cancer/international/statistics.htm)

Author Contributions

SLK, LD, LF and GCB analyzed and interpreted data and drafted the initial manuscript. DPD, SLG, EH, MRS, NGB, GCB and CMLW collected data for the RAPPER study. NGB and CMLW were the principal investigators for the RAPPER study. AMC, AC, AGC, PP and AV collected data for the RADIOGEN study. AV was the principal investigator for the RADIOGEN study. RS, NNS, SLK, HO and BSR collected data for the Gene-PARE study. HO and BSR were the principal investigators for the Gene-PARE study. MS, and MP collected data for the CCI study. MP was the principal investigator for the CCI study. SA designed and performed TaqMan genotyping assays. SLK, LD, LF, SB, PDPP, DRB, KM, JPT, HO, AMD, GCB and CMLW contributed to the data analysis plan and statistical methods used. All authors contributed intellectual input to the study design and interpretation of results, and all authors reviewed the manuscript prior to submission. CMLW approved the final manuscript for submission.
  58 in total

1.  Functional promoter rs2868371 variant of HSPB1 associates with radiation-induced esophageal toxicity in patients with non-small-cell lung cancer treated with radio(chemo)therapy.

Authors:  Jose Luis Lopez Guerra; Qingyi Wei; Xianglin Yuan; Daniel Gomez; Zhensheng Liu; Yan Zhuang; Ming Yin; Minghuan Li; Li-E Wang; James D Cox; Zhongxing Liao
Journal:  Radiother Oncol       Date:  2011-09-19       Impact factor: 6.280

2.  How meta-analysis increases statistical power.

Authors:  Lawrence D Cohn; Betsy J Becker
Journal:  Psychol Methods       Date:  2003-09

Review 3.  Radiation dose-volume effects of the urinary bladder.

Authors:  Akila N Viswanathan; Ellen D Yorke; Lawrence B Marks; Patricia J Eifel; William U Shipley
Journal:  Int J Radiat Oncol Biol Phys       Date:  2010-03-01       Impact factor: 7.038

4.  Genetic predictors of adverse radiotherapy effects: the Gene-PARE project.

Authors:  Alice Y Ho; David P Atencio; Sheila Peters; Richard G Stock; Silvia C Formenti; Jamie A Cesaretti; Sheryl Green; Bruce Haffty; Karen Drumea; Larisa Leitzin; Abraham Kuten; David Azria; Mahmut Ozsahin; Jens Overgaard; Christian N Andreassen; Cynthia S Trop; Janelle Park; Barry S Rosenstein
Journal:  Int J Radiat Oncol Biol Phys       Date:  2006-07-01       Impact factor: 7.038

5.  TGFβ1 SNPs and radio-induced toxicity in prostate cancer patients.

Authors:  Laura Fachal; Antonio Gómez-Caamaño; Manuel Sánchez-García; Ana Carballo; Paula Peleteiro; Ramón Lobato-Busto; Angel Carracedo; Ana Vega
Journal:  Radiother Oncol       Date:  2012-03-03       Impact factor: 6.280

6.  Super-enhancers in the control of cell identity and disease.

Authors:  Denes Hnisz; Brian J Abraham; Tong Ihn Lee; Ashley Lau; Violaine Saint-André; Alla A Sigova; Heather A Hoke; Richard A Young
Journal:  Cell       Date:  2013-10-10       Impact factor: 41.582

7.  A 2-stage genome-wide association study to identify single nucleotide polymorphisms associated with development of erectile dysfunction following radiation therapy for prostate cancer.

Authors:  Sarah L Kerns; Richard Stock; Nelson Stone; Michael Buckstein; Yongzhao Shao; Christopher Campbell; Lynda Rath; Dirk De Ruysscher; Guido Lammering; Rosetta Hixson; Jamie Cesaretti; Mitchell Terk; Harry Ostrer; Barry S Rosenstein
Journal:  Int J Radiat Oncol Biol Phys       Date:  2012-09-26       Impact factor: 7.038

Review 8.  Incorporating Genetic Biomarkers into Predictive Models of Normal Tissue Toxicity.

Authors:  G C Barnett; S L Kerns; D J Noble; A M Dunning; C M L West; N G Burnet
Journal:  Clin Oncol (R Coll Radiol)       Date:  2015-07-10       Impact factor: 4.126

Review 9.  Developing and evaluating polygenic risk prediction models for stratified disease prevention.

Authors:  Nilanjan Chatterjee; Jianxin Shi; Montserrat García-Closas
Journal:  Nat Rev Genet       Date:  2016-05-03       Impact factor: 53.242

10.  STROGAR - STrengthening the Reporting Of Genetic Association studies in Radiogenomics.

Authors:  Sarah L Kerns; Dirk de Ruysscher; Christian N Andreassen; David Azria; Gillian C Barnett; Jenny Chang-Claude; Susan Davidson; Joseph O Deasy; Alison M Dunning; Harry Ostrer; Barry S Rosenstein; Catharine M L West; Søren M Bentzen
Journal:  Radiother Oncol       Date:  2013-08-27       Impact factor: 6.280

View more
  25 in total

1.  Dicentric Dose Estimates for Patients Undergoing Radiotherapy in the RTGene Study to Assess Blood Dosimetric Models and the New Bayesian Method for Gradient Exposure.

Authors:  Jayne Moquet; Manuel Higueras; Ellen Donovan; Sue Boyle; Stephen Barnard; Clare Bricknell; Mingzhu Sun; Lone Gothard; Grainne O'Brien; Lourdes Cruz-Garcia; Christophe Badie; Elizabeth Ainsbury; Navita Somaiah
Journal:  Radiat Res       Date:  2018-09-20       Impact factor: 2.841

2.  Machine Learning on a Genome-wide Association Study to Predict Late Genitourinary Toxicity After Prostate Radiation Therapy.

Authors:  Sangkyu Lee; Sarah Kerns; Harry Ostrer; Barry Rosenstein; Joseph O Deasy; Jung Hun Oh
Journal:  Int J Radiat Oncol Biol Phys       Date:  2018-01-31       Impact factor: 7.038

Review 3.  Radiogenomics and radiotherapy response modeling.

Authors:  Issam El Naqa; Sarah L Kerns; James Coates; Yi Luo; Corey Speers; Catharine M L West; Barry S Rosenstein; Randall K Ten Haken
Journal:  Phys Med Biol       Date:  2017-08-01       Impact factor: 3.609

Review 4.  The future of personalised radiotherapy for head and neck cancer.

Authors:  Jimmy J Caudell; Javier F Torres-Roca; Robert J Gillies; Heiko Enderling; Sungjune Kim; Anupam Rishi; Eduardo G Moros; Louis B Harrison
Journal:  Lancet Oncol       Date:  2017-04-26       Impact factor: 41.316

Review 5.  Radiogenomics: Identification of Genomic Predictors for Radiation Toxicity.

Authors:  Barry S Rosenstein
Journal:  Semin Radiat Oncol       Date:  2017-10       Impact factor: 5.934

6.  Prediction of Results of Radiotherapy With Ku70 Expression and an Artificial Neural Network.

Authors:  Tomokazu Hasegawa; Masanori Someya; Masakazu Hori; Takaaki Tsuchiya; Yuuki Fukushima; Yoshihisa Matsumoto; Koh-Ichi Sakata
Journal:  In Vivo       Date:  2020 Sep-Oct       Impact factor: 2.155

Review 7.  A Review of Prostate Cancer Genome-Wide Association Studies (GWAS).

Authors:  Sarah Benafif; Zsofia Kote-Jarai; Rosalind A Eeles
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2018-01-18       Impact factor: 4.254

Review 8.  Genomics models in radiotherapy: From mechanistic to machine learning.

Authors:  John Kang; James T Coates; Robert L Strawderman; Barry S Rosenstein; Sarah L Kerns
Journal:  Med Phys       Date:  2020-06       Impact factor: 4.071

9.  One Size Fits All: Does the Dogma Stand in Radiation Oncology?

Authors:  David Azria; Celine Bourgier; Muriel Brengues
Journal:  EBioMedicine       Date:  2016-07-20       Impact factor: 8.143

Review 10.  Physical and Biological Characteristics of Particle Therapy for Oncologists.

Authors:  Hwa Kyung Byun; Min Cheol Han; Kyungmi Yang; Jin Sung Kim; Gyu Sang Yoo; Woong Sub Koom; Yong Bae Kim
Journal:  Cancer Res Treat       Date:  2021-06-16       Impact factor: 4.679

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.