Literature DB >> 34836419

Salicylic Acid and Risk of Colorectal Cancer: A Two-Sample Mendelian Randomization Study.

Aayah Nounu1,2, Rebecca C Richmond1, Isobel D Stewart3, Praveen Surendran4,5,6,7, Nicholas J Wareham3, Adam Butterworth4,5,6,8,9, Stephanie J Weinstein10, Demetrius Albanes10, John A Baron11, John L Hopper12,13, Jane C Figueiredo14,15, Polly A Newcomb16,17, Noralane M Lindor18, Graham Casey19, Elizabeth A Platz20, Loïc Le Marchand21, Cornelia M Ulrich22, Christopher I Li16, Fränzel J B van Dujinhoven23, Andrea Gsur24, Peter T Campbell25, Víctor Moreno26,27,28,29, Pavel Vodicka30,31,32, Ludmila Vodickova30,31,32, Efrat Amitay33, Elizabeth Alwers33, Jenny Chang-Claude34,35, Lori C Sakoda16,36, Martha L Slattery37, Robert E Schoen38, Marc J Gunter39, Sergi Castellví-Bel40, Hyeong-Rok Kim41, Sun-Seog Kweon42,43, Andrew T Chan44,45,46,47,48,49, Li Li50, Wei Zheng51, D Timothy Bishop52, Daniel D Buchanan53,54,55, Graham G Giles12,56,57, Stephen B Gruber58, Gad Rennert59,60,61, Zsofia K Stadler62, Tabitha A Harrison23, Yi Lin23, Temitope O Keku63, Michael O Woods64, Clemens Schafmayer65, Bethany Van Guelpen66,67, Steven Gallinger68, Heather Hampel69, Sonja I Berndt10, Paul D P Pharoah7, Annika Lindblom70,71, Alicja Wolk72,73, Anna H Wu74, Emily White23,75, Ulrike Peters23,75, David A Drew45,47, Dominique Scherer76, Justo Lorenzo Bermejo76, Hermann Brenner34,77,78, Michael Hoffmeister34, Ann C Williams2, Caroline L Relton1.   

Abstract

Salicylic acid (SA) has observationally been shown to decrease colorectal cancer (CRC) risk. Aspirin (acetylsalicylic acid, that rapidly deacetylates to SA) is an effective primary and secondary chemopreventive agent. Through a Mendelian randomization (MR) approach, we aimed to address whether levels of SA affected CRC risk, stratifying by aspirin use. A two-sample MR analysis was performed using GWAS summary statistics of SA (INTERVAL and EPIC-Norfolk, N = 14,149) and CRC (CCFR, CORECT, GECCO and UK Biobank, 55,168 cases and 65,160 controls). The DACHS study (4410 cases and 3441 controls) was used for replication and stratification of aspirin-use. SNPs proxying SA were selected via three methods: (1) functional SNPs that influence the activity of aspirin-metabolising enzymes; (2) pathway SNPs present in enzymes' coding regions; and (3) genome-wide significant SNPs. We found no association between functional SNPs and SA levels. The pathway and genome-wide SNPs showed no association between SA and CRC risk (OR: 1.03, 95% CI: 0.84-1.27 and OR: 1.08, 95% CI: 0.86-1.34, respectively). Results remained unchanged upon aspirin use stratification. We found little evidence to suggest that an SD increase in genetically predicted SA protects against CRC risk in the general population and upon stratification by aspirin use.

Entities:  

Keywords:  Mendelian randomization; aspirin; colorectal cancer; salicylic acid

Mesh:

Substances:

Year:  2021        PMID: 34836419      PMCID: PMC8620763          DOI: 10.3390/nu13114164

Source DB:  PubMed          Journal:  Nutrients        ISSN: 2072-6643            Impact factor:   6.706


1. Introduction

Colorectal cancer (CRC) is the fourth most common cancer in the UK and worldwide [1,2]. Although incidence rates among the over 50s have remained relatively stable, rates in younger age groups have increased in both the UK and US populations [3,4]. This highlights a need to find better and complementary prevention strategies to reduce risk of cancer. Salicylic acid (SA) is a dietary metabolite that can be found in various fruits, vegetables, herbs, and spices [5,6,7]. Results from a meta-analysis of 19 cohort studies found that combined intake of fruits and vegetables reduced the risk of colorectal cancer (summary relative risk (RR): 0.90, 95% CI: 0.83–0.98) [8]. Whilst dietary fibre obtained from fruits and vegetables indicates a possible mechanism for decreased risk [9], it has also been suggested that increased levels of SA obtained through their consumption may play a role [7]. In addition, salicylates can be obtained through pharmacological intervention in the form of aspirin (acetylsalicylic acid), a well-known analgesic used to treat fever, inflammation, and acute pain [10], which is rapidly deacetylated to form SA [11,12] (Figure 1), the active form of the aspirin metabolic pathway [13,14]. Whilst SA can be obtained from the diet, the concentrations achieved (male and female median intake from diet 4.4 mg/day and 3.2 mg/day, respectively [6]) are much lower than through aspirin ingestion (aspirin doses ranging between 75 mg to ≥325 mg given daily/alternate days) [15]. Therefore, it is unclear whether concentrations achieved from the diet alone are sufficient to protect against cancer or whether larger doses obtained through pharmacological intervention are required.
Figure 1

Aspirin metabolism pathway. Roughly 10% of aspirin remains unchanged and is excreted in the urine as aspirin. Aspirin is broken down into various metabolites, the most active of them being salicylic acid [14,16]. Various enzymes are involved in the metabolism pathway. The percentages indicate how much of the drug is being metabolised in that pathway. Abbreviations: BChE, butyrylcholinesterase; PAFAH1b2, platelet-activating factor acetylhydrolase 2; PAFAH1b3, platelet-activating factor acetylhydrolase 3; UGT1A6, UDP-glucuronosyltransferase 1–6; ACSM2B, Acyl-CoA Synthetase Medium-Chain Family Member 2B and CYP450, cytochrome P450.

As of yet, no primary prevention trials have been carried out to assess the effect of SA intervention on CRC risk, but the evidence of aspirin as a chemopreventive agent is clear [17]. A long-term follow up of a randomised controlled trial (RCT) in the Women’s Health Study (WHS) showed that alternate day aspirin intake reduced the risk of CRC after a median of 17.5 years follow up (HR: 0.80, 95% confidence intervals (CI): 0.67–0.97) [18] and a meta-analysis of observational studies showed that aspirin is protective against CRC (relative risk (RR): 0.79, 95% CI: 0.74–0.85) [19]. Further evidence comes from RCTs for primary and secondary prevention of vascular events. These showed that aspirin reduces the risk of CRC incidence and mortality (HR: 0.76, 95% CI: 0.60–0.96 and odds ratio (OR): 0.79, 95% CI: 0.68–0.92, respectively) [20,21]. Considering aspirin is rapidly deacetylated to form SA in under 30 min [22], and that evidence in the form of in vivo and in vitro experiments have previously shown SA to be an antiproliferative and antitumour agent [23,24,25], it may be that metabolism of aspirin leading to increased circulating SA levels may partially explain aspirin’s chemopreventive mode of action. Although many observational studies have shown an inverse association between aspirin use and CRC risk, few have directly assessed the association between SA itself and CRC. In order to identify the true effect of SA on CRC risk, conducting an RCT would be the ideal study design. However, RCTs for cancer primary prevention are lengthy and costly, therefore it would be helpful to test this association using statistical methods such as Mendelian Randomization (MR). MR uses genetic variants (mostly single nucleotide polymorphisms (SNPs)) related to modifiable factors (such as metabolite levels) to investigate the causal role of these factors on risk of disease [26,27,28]. Through this method, MR has been likened to RCTs in that genetic variants are randomly allocated at conception the same way that an intervention is randomly allocated at the start of a trial [29,30]. This lends many advantages such as overcoming the issues of confounding and reverse causation, which are commonly encountered in observational epidemiology [29]. MR has previously been useful in predicting trial outcomes such as the case of selenium and prostate cancer in The Selenium and Vitamin E Cancer Prevention Trial (SELECT) [31]. Results from an MR study mimicked the findings of this RCT and may have been useful to inform whether to conduct a trial that cost $114 million and that was weakly associated with increasing high-grade prostate cancer risk [32]. For this reason, we applied an MR approach using genetic “instruments” or proxies for SA to assess the causal effect of this metabolite on the risk of CRC. Since aspirin is rapidly deacetylated to SA [22], and therefore a plausible proxy of increased SA levels, we also stratified our analysis between aspirin users and non-users to test the hypothesis of whether diet-derived levels of SA alone would affect risk of CRC or whether higher concentrations achieved through pharmacological intervention in the form of aspirin was required to identify an effect. Based on previous observational evidence, we hypothesise that a genetically predicted increase in SA levels would reduce the risk of CRC, with a stronger effect observed in aspirin users.

2. Materials and Methods

2.1. Genetic Variants for Salicylic Acid

We applied a two-sample MR study design to test for the association of SA levels (sample 1) with risk of CRC (sample 2). GWAS and meta-analysis of salicylate levels were performed using 5841 participants from the EPIC-Norfolk study [33] and 8455 from the INTERVAL study [34]. The percentage of samples with missing salicylate measurements was low (0.43% and 1.44% in EPIC-Norfolk and INTERVAL respectively), providing a total sample size of 14,149. Salicylate was measured independently in each study as one of many metabolites measured using the Metabolon DiscoveryHD4® platform (Metabolon, Inc., Durham, NC, USA), from non-fasted plasma samples (predominantly non-fasted samples in EPIC-Norfolk) collected at baseline. Measures that were median normalised for run day were natural log transformed, winsorised to 5 standard deviations from the mean, before being regressed against age, sex, and study-specific variables (measurement consignment in EPIC-Norfolk and measurement consignment, INTERVAL centre, plate number, appointment month, lag time between blood donation appointment and sample processing, and the first 5 ancestry principal components in INTERVAL) using linear regression. Residuals from this regression were standardised (mean 0, standard deviation 1) and used for further analysis. Genotyping was performed in both studies using the Affymetrix Axiom UK Biobank genotyping array. In INTERVAL, genotype imputation was performed using the combined UK10K+1000 Genomes Phase 3 reference panel. In EPIC-Norfolk, imputation was performed using the Haplotype Reference Consortium reference panel, with additional variants imputed using the UK10K+1000 Genomes Phase 3 reference panel. Genome-wide association analyses were performed using BOLT-LMM (version 2.2) [35], and variants with a MAF < 0.01% and INFO < 0.3 were excluded. Associations from the two studies were pooled using inverse variance weighted fixed effect meta-analysis implemented in METAL [36], applying a minor allele count threshold in each study of >10. The causal effect of SA on risk of CRC was assessed using 3 sets of genetic variants (SNPs) related to SA: (1) functional SNPs that influence aspirin and SA metabolising enzymes’ activity (derived from Figure 1)—termed “functional SNPs”; (2) pathway SNPs, those that are present in the coding regions of genes that are involved in aspirin and SA metabolism (based on the NCBI Build 37/UCSC hg19 from https://grch37.ensembl.org/index.html (accessed on 14 December 2016), Supplementary Table S1) termed “pathway SNPs”; (3) genome-wide significant SNPs associated with levels of circulating aspirin metabolites—termed “genome-wide SNPs”. Pathways SNPs were defined as having a Bonferroni threshold of association (p value 0.05/2701 = 1.85 × 10−5), a MAF ≥ 0.01%, as well as a consistent direction of effect in both EPIC-Norfolk and INTERVAL. Genome-wide signals were defined as having an association p value < 5 × 10−8 in the meta-analysis, a MAF ≥ 0.01%, consistent direction of effect across the two studies, and association p value < 0.01 in both studies To account for genetic correlation, linkage disequilibrium (LD) clumping at an R2 < 0.001 and 10,000 kb window was performed to retain the SNP most strongly associated with the metabolite for downstream analysis. Since an R2 < 0.001 is considered highly stringent, we also used an R2 < 0.8 to incorporate more variants while accounting for residual correlation in the model (see Statistical Analysis). An F-statistic for each SNP–exposure association was calculated to reflect the strength of the genetic instrument and indicate any possibility of weak instrument bias, usually inferred when F < 10 [37]. Power calculations were conducted using the mRnd online calculator to identify the OR in both directions that could be detected with the sample size available [38].

2.2. Genetic Variants for CRC Incidence

SNP–outcome associations were obtained from the Colon Cancer Family Registry (CCFR), Colorectal Cancer Transdisciplinary Study (CORECT), Genetics and Epidemiology of Colorectal Cancer (GECCO) consortia, and the UK Biobank (55,168 cases and 65,160 controls), hereafter collectively termed as GECCO [39,40,41]. Genetic data from a population-based case-control study from southwestern Germany (Darmkrebs: Chancen der Verhütung durch Screening (DACHS)) was used to assess replication of the findings, and to run an MR analysis stratified by aspirin intake, since this study recorded aspirin use (defined as twice per week for at least a year) [42,43,44]. This study comprised 4410 cases, of which 810 (18.37% of cases) were aspirin users and 3340 (75.74%) were non-users, and 260 cases (5.90%) were excluded as they had reported use of other non-aspirin NSAIDs. This study also contained 3441 controls, of which 779 (22.64%) had recorded aspirin use and 2320 (67.42%) were recorded as non-users, and 342 controls (9.94%) were excluded as they had reported use of other non-aspirin NSAIDs. To assess the causal effect of SA on CRC risk, we tested for association in GECCO, and also stratified the analysis between aspirin users and non-users in DACHS to investigate whether increased SA levels via pharmacological intervention is required to see an effect. We obtained summary association statistics from GECCO and also conducted logistic regression analyses adjusting for age and sex in the DACHS study for all the participants. We then stratified the participants of the DACHS study to aspirin users and non-users before repeating the logistic regression analyses again. Genetic instruments that had a MAF ≤ 0.01 in both GECCO and DACHS (all participants) were excluded from further analyses.

2.3. Statistical Analyses

Analyses were carried out in R version 3.2.3 using the “Two-Sample MR” package [45]. This package allows the formatting, harmonisation, and analysis of summary data from genetic association studies in a semi-automated manner. The Two-Sample MR package automatically assigns effect alleles so that SNP associations with the exposure are positive i.e., so the effect allele is “metabolite-increasing”. The SNPs identified as associated with SA can then be extracted from the outcome datasets. Allele harmonization ensures that the effect (metabolite-increasing) allele in the exposure dataset is also treated as the effect allele in the outcome dataset. When only one SNP was associated with the metabolite, Wald ratios (SNP–outcome estimate ÷ SNP–exposure estimate) were calculated to assess the change in log OR per SD increase in the metabolite. When more than one SNP was available, a weighted mean weighted by the inverse variance of the Wald ratio estimates (inverse-variance weighted (IVW) method) was used to assess the causal effect of increased metabolite levels on risk of CRC incidence [46]. To assess the quality of our instruments, we calculated the variance in SA levels explained by the SNPs and the F statistic. The variance explained for each SNP was calculated using the formula: , where p is the minor allele frequency, b is the SNP effect on the exposure (beta) and var is the variance of the exposure. The F statistic was calculated using the formula: where r is the sum of the variance explained by the set of SNPs, n is the sample size of the exposure GWAS and k is the number of SNPs used to proxy the exposure. In the presence of weak instruments, we conducted an MR robust adjusted profile score (MR RAPS), which is a method that provides robust inference when many weak instruments are present [47]. Furthermore, the presence of one invalid instrument, e.g., one that is associated with exposures other than the exposure of interest (horizontal pleiotropy), may bias the results from the IVW method [48]. For this reason, alternative methods that produce an unbiased estimator even when some of the genetic instruments are invalid were used as a sensitivity analysis when more than 2 SNPs were used as exposure instruments (weighted mode, weighted median, and MR Egger) [45,49,50,51]. The MR Egger test is not constrained to pass through an effect size of 0, unlike the IVW method, allowing the assessment of the presence of directional pleiotropy through the y intercept [48,51]. We also measured the Q statistic to measure the presence of pleiotropy between our instruments. If all the SNPs are valid instruments, then the individual MR estimates for each SNP will only vary by chance. A larger amount of heterogeneity would indicate that one or more of the SNPs are pleiotropic [52]. Due to the presence of a small number of independent SNPs associated with the metabolite, we also conducted a weighted generalised linear regression (WGLR), whereby SNPs in LD (R2 < 0.8) could be used with the incorporation of their correlation as weights in the regression analysis [53]. This was performed using the “LDlinkR” and “MendelianRandomization” packages in R (version 3.5.1). The use of multiple SNPs explains more of the variance in the metabolite levels and therefore improves power to detect an effect [53]. We also assessed the possibility of reverse causation through the use of the MR Steiger test, found in the “TwoSampleMR” package, which does so by comparing the variance explained by the SNPs for the exposure and the outcome [45].

3. Results

3.1. Functional SNPs and CRC Risk

To interrogate the effect of SA on CRC risk, we used three methods to select our exposure instruments (Figure 2). In our first approach, we identified four functional SNPs that have been shown to affect enzyme efficiency in the aspirin metabolic pathway (Figure 1). For BChE (rs6445035), the presence of an A allele increase has been associated with a decrease in aspirin hydrolysis by around 1.2 nmol/mL/min [54]. The UGT1A6 variants, rs2070959 and rs1105879, predict a higher metabolic activity of the enzyme than the wild type [55,56]. Furthermore, a variant in CYP2C9 (rs1799853) encodes an enzyme with reduced activity [57].
Figure 2

Instrument selection for functional, pathway, and genome-wide SNPs. Abbreviations: SA, salicylic acid; EAF, effect allele frequency; BF, Bonferroni.

These SNPs were tested for association with SA in the INTERVAL and (EPIC)-Norfolk study, however none of the SNPs reached nominal significance with the metabolite (Figure 3A) (Supplementary Table S2). For this reason, these SNPs were therefore not taken forward in an MR analysis.
Figure 3

Functional SNP metabolite associations and two-sample pathway MR analysis. (A) Forest plot of single SNP associations with salicylic acid for the functional SNPs. (B) Forest plot of one SD increase in SA and its effect on CRC risk, instrumented by pathway SNPs and applying three methods: IVW after applying an LD threshold of R2 < 0.001 (black), MR RAPS after applying an LD threshold of R2 < 0.001 (grey), IVW after applying an LD threshold of R2 < 0.8 (red) and a WGLR after applying an LD threshold of R2 < 0.8 (green). (C) Forest plot of one SD increase in SA and its effect on CRC risk, instrumented by genome-wide SNPs and applying three methods: WR after applying an LD threshold of R2 < 0.001 (black), MR RAPS after applying an LD threshold of R2 < 0.001 (grey), IVW after applying an LD threshold of R2 < 0.8 (red) and a WGLR after applying an LD threshold of R2 < 0.8 (green). Abbreviations: OR, odds ratio; IVW, inverse variance weighted; WGLR, weighted generalised linear regression; WR, Wald ratio; LD, linkage disequilibrium.

3.2. Pathway SNPs and CRC Risk

We investigated genetic variants within the coding regions of the enzymes involved in aspirin and SA metabolism (Figure 1). These were BChE, PAFAH1b2, PAFAH1b3, UGT1A6, ACSM2B and CYP2C9. We obtained summary statistics for 2701 SNPs within the genetic coding regions of the enzymes for SA. We applied a Bonferroni threshold of association (P value 0.05/2701 = 1.85 × 10−5) for SNPS and restricted to SNPs with a consistent direction of effects in both studies and a minor allele frequency of ≥0.01 in the exposure and outcome studies. This identified 45 SNPs that could be used to instrument SA. These SNPs were then clumped at an R2 < 0.001 and 0.8, providing two and six SNPs, respectively, to instrument SA levels (Figure 2). These explained 0.03% and 0.09% of the variance in SA levels, and had an F statistic of 1.74 and 2.16, respectively (Table 1).
Table 1

Exposure instruments used in the MR analysis.

SNP SetStudyOutcome Sample SizePercentage Cases (%)N SNPsLD R2Variance Explained R2 (%)F StatisticsOR Detected at 80% Power
Decreased RiskIncreased Risk
Pathway SNPsGECCO120,32845.85 55,168/120,328)20.0010.0251.740.901.11
DACHS785156.1720.681.51
DACHS aspirin users1589(4410/7851)20.432.38
DACHS aspirin non-users566050.9820.641.64
GECCO120,328(810/1589)60.80.0922.160.951.06
DACHS785159.0160.811.24
DACHS aspirin users1589(3340/5660)60.631.58
DACHS aspirin non-users566045.85 (55,168/120,328)60.781.30
Genome-wide SNPsGECCO120,32845.85 55,168/120,328)20.0010.0537.440.931.07
DACHS785156.1720.761.32
DACHS aspirin users1589(4410/7851)20.551.83
DACHS aspirin non-users566050.9820.731.42
GECCO120,328(810/1589)60.80.0903.180.951.06
DACHS785159.0160.811.24
DACHS aspirin users1589(3340/5660)60.631.59
DACHS aspirin non-users566045.85 (55,168/120,328)60.781.30

Abbreviations: SA, salicylic acid; LD, linkage disequilibrium; NA, not applicable; OR, odds ratio.

After LD clumping at an R2 < 0.001, 2 SNPs were taken forward in an IVW analysis. We found little evidence of an association between an SD increase in SA and CRC risk (GECCO OR: 1.03, 95% CI: 0.84–1.27 and DACHS OR: 1.10, 95% CI: 0.58–2.07) (Figure 3B). Since aspirin is rapidly deacetylated to form SA [22] and therefore a plausible proxy for increased SA levels, we stratified our analysis between aspirin users and non-users in the DACHS study. Our power calculations show that after stratification, we had 80% power to detect an effect of an SD increase in SA on CRC risk with an OR of ≤0.43 and ≥2.38 in the reciprocal direction for aspirin users (n = 1589). For non-users (n = 5660), we had 80% power to detect an OR of ≤0.64 and ≥1.64 in the reciprocal direction (Table 1). However, our MR analysis showed little evidence of an association between SA and CRC risk (OR: 0.93, 95% CI: 0.23–3.73 and OR: 1.24, 95% CI: 0.57–2.69, respectively) (Figure 3B). The variance explained by these two instruments and their F statistics indicate the possibility of weak instrument bias. For this reason, we conducted MR RAPS, a method that provides robust inference even in the presence of weak instruments [47]. Through this method, no association was found between an SD increase in SA and CRC risk (GECCO OR: 1.04, 95% CI: 0.87–1.23 and DACHS OR:1.10, 95% CI: 0.57–2.12). Results remain unchanged, even after stratification between aspirin users and non-users (OR: 0.93, 95% CI: 0.22–3.87 and OR: 1.24, 95% CI: 0.56–2.76). Since this LD threshold is known to be very stringent, we used a more relaxed threshold (R2 < 0.8) to increase the number of SNPs available to instrument the metabolite and therefore explain more of the variance in SA levels. This provided six SNPs associated with SA shown in Supplementary Table S3, of which SNP associations with the outcome are also provided in Supplementary Table S4. These SNPs showed no association between SA and CRC risk (GECCO OR: 1.01, 95% CI: 0.91–1.12 and DACHS OR:1.14, 95% CI: 0.77–1.68). Stratification between aspirin use and non-use found no association between the metabolite and CRC risk in aspirin users or non-users (OR: 1.02, 95% CI: 0.44–2.40 and OR: 1.26, 95% CI: 0.78–2.01, respectively). Using the alternative MR methods (weighted mode, weighted median, and MR Egger), no other association between SA and CRC in both GECCO and DACHS was observed, regardless of stratification (Supplementary Table S5). Since all the SNPs were found to be on chromosome 16 (Supplementary Table S3), a WGLR method was carried out to account for the SNP correlations and include them as weights into the regression. Through this method, there was no association between SA and CRC risk in DACHS (OR: 0.81, 95% CI: 0.36–1.83) but a positive association in the GECCO sample (OR: 1.11, 95% CI: 1.01–1.21). No association was observed between SA and CRC risk in aspirin users or non-users (OR: 0.35, 95% CI: 0.05–2.47 and OR: 1.10, 95% CI: 0.55–2.16, respectively) (Figure 3B). As a sensitivity analysis, the heterogeneity of the results was appraised through a Q statistic, but no evidence of pleiotropy was observed- i.e., no evidence that the instruments may also be associated with another phenotype (Supplementary Table S6). To identify whether the causal pathway was in the direction from SA to CRC and not the reverse, we performed the MR Steiger method using the functional SNPs [58]. This method suggested that the causal direction was indeed from SA to CRC, because the SNPs explained more variation in SA levels than CRC risk (Supplementary Table S7).

3.3. Genome-Wide Significant SNPs and CRC Risk

Initially, 72 SNPs were associated with SA at genome-wide significance. We restricted our analysis to SNPs with a MAF threshold of ≥0.01 in the exposure and outcome studies, and only included those with a consistent direction of effect in both studies. This resulted in 58 SNPs that were available to instrument SA. After removing SNPs in LD at an R2 < 0.001 and R2 < 0.8, one SNP and four SNPs were available to instrument SA, respectively (Figure 2). These explained 0.05% and 0.09% of the variance in SA levels and had an F statistic of 7.44 and 3.18, respectively (Table 1). Using the one independent SNP associated with SA at genome-wide significance, WR results showed no association between the genetically predicted metabolite levels and cancer risk (GECCO OR: 1.08, 95% CI: 0.86–1.34 and DACHS OR: 1.01, 95% CI: 0.44–2.31). Our power calculations show that after stratification between aspirin users and non-users in the DACHS study, we had 80% power to detect an effect of an SD increase in SA on CRC risk with an OR of ≤0.55 and ≥1.83 in the reciprocal direction for aspirin users (n = 1589). For non-users (n = 5660), we had 80% power to detect an OR of ≤0.73 and ≥1.42 in the reciprocal direction (Table 1), however, we found no association between SA levels and CRC in aspirin users (OR: 0.66, 95% CI: 0.11–4.12) and non-users (OR: 1.12, 95% CI: 0.42–2.97) (Figure 3C). Due to the possibility of weak instrument bias, we also conducted an MR RAPS approach, but results remained unchanged (GECCO OR: 1.08, 95% CI: 0.86–1.36, DACHS OR: 1.01, 95% CI: 0.44–2.36, DACHS aspirin users OR: 0.66, 95% CI: 0.10–4.33 and DACHS aspirin non-users OR: 1.12, 95% CI: 0.41–3.04). To explain more of the variance, we used a less stringent LD threshold of R2 < 0.8, and therefore four SNPs to instrument SA (Supplementary Table S8, associations with CRC are also found in Supplementary Table S9). IVW results also showed no association between the metabolites and CRC risk (GECCO OR: 1.03, 95% CI: 0.92–1.15, and DACHS OR: 1.06, 95% CI: 0.69–1.63) and no association was found upon stratification by aspirin use (users OR: 0.99, 95% CI: 0.38–2.57, non-users OR: 1.10, 95% CI: 0.66–1.84). Using the alternative MR methods (weighted mode, weighted median, and MR Egger), no association between SA and CRC in both GECCO and DACHS was seen, regardless of stratification (Supplementary Table S10). Since these four SNPs were all found on chromosome 16 (Supplementary Table S8), a WGLR method was applied to account for their correlation. We found a positive association between SA and CRC risk in the GECCO sample (OR: 1.13, 95% CI: 1.05–1.22) but no association in the DACHS sample (OR: 0.51, 95% CI: 0.16–1.67), DACHS aspirin users (OR: 0.12, 95% CI: 0.01–2.67) and DACHS aspirin non-users (OR: 0.70, 95% CI: 0.30–1.65) (Figure 3C). As a sensitivity analysis, the heterogeneity of the results was assessed through a Q statistic, but no evidence of heterogeneity was seen (Supplementary Table S11). We repeated the MR Steiger method using the genome-wide significant SNPs, and results again suggested that the causal direction was indeed from SA to CRC, as the SNPs explained more variation in SA levels than CRC risk (Supplementary Table S6).

4. Discussion

In this study, we aimed to assess whether increasing levels of SA affected risk of CRC, using an MR approach, and whether higher levels of SA proxied by pharmacological intervention in the form of aspirin use was required to identify an effect. Our analysis focused on aspirin, since 90% of the drug is rapidly deacetylated to form SA [16], which is the active metabolite of the drug [13,14], and therefore increases SA levels more than would be achieved through the diet. Three different approaches were applied to identify genetic variants (instrument variables) which could serve as proxies for SA and understand the causal nature of their role in determining CRC risk. The three approaches involved selecting (i) functional, (ii) pathway and (iii) genome-wide SNPs each associated with SA. The functional genetic variants were selected through the established role of the genes in aspirin metabolism from various sources of evidence. With regards to the pathway and genome-wide significant SNPs, all were found on chromosome 16, either within or near the coding region for the enzyme ACSM2B, which is the enzyme involved in breaking down SA into its metabolite salicyluric acid, thereby providing a plausible biological link between these SNPs and levels of SA. We found no association between the functional SNPs and levels of SA, therefore did not take them forward to instrument SA levels. Using pathway and genome-wide SNPs, we identified two and one independent SNPs (R2 < 0.001) to proxy for SA levels, respectively, and found no association between increasing metabolite levels and CRC risk using an IVW and MR RAPS approach, regardless of aspirin stratification. Furthermore, due to the small number of instruments, we applied a less stringent LD threshold (R2 < 0.8) and identified six pathway SNPs and four genome-wide SNPs to proxy for an SD increase in SA levels. Using these SNPs, we found consistent null results using the IVW method and alternative MR methods (weighted median, weighted mode, and MR Egger). However, after accounting for SNP correlation using a WGLR method, we found that an SD increase in SA increased the risk of CRC in GECCO (OR: 1.11, 95% CI: 1.01–1.21, p-value: 0.03 and OR: 1.13, 95% CI: 1.05–1.22, p-value: 1.42 × 10−3, respectively). We acknowledge that when the LD clumping threshold was relaxed to 0.8, there may have been some overlap with SNPs used in both the functional and the genome-wide analysis. We also acknowledge that the sensitivity analyses used were limited in detecting heterogeneity due to the low number of SNPs. The Cochran Q statistic also requires a large number of SNPs, otherwise there is little power to detect heterogeneity [59]. Overall, we found little evidence to suggest that SA affects risk of CRC, regardless of stratification. It is thought that one reason why fruit and vegetable consumption may prevent CRC [8] is due to the presence of SA [60], although no formal RCTs have been carried out to confirm this. In vitro studies have also shown that salicylic acid inhibits the growth of colorectal cancer cells [61]. SA is the primary metabolite of aspirin, of which both observational and RCT evidence have shown aspirin as a chemopreventive agent [18,19,20,21]. Our MR results show little evidence of an association between the metabolite and CRC risk, regardless of aspirin use. However, we discuss some of the possible reasons why below. Whilst we found no association between functional SNPs known to affect aspirin metabolism enzymes’ activity and levels of SA, this may be due to a more complex relationship between genotype and metabolite levels, rather than the assumed linear additive model. For example, with regards to the functional SNPs, Nagar et al. (2004) identified that whilst individuals with homozygous mutant alleles of UGT1A6 had the highest metabolic activity, those that were heterozygous for alleles in three SNPs (including rs1105879 and rs2070959) were actually less active than homozygous wildtype enzymes [56], indicating a non-linear association between the alleles and the metabolites, which is a common assumption made in regression analyses [62]. This non-linear association between alleles and enzyme activity needs to also be addressed between alleles and metabolite levels to derive instrumental variables. To our knowledge, our GWAS for SA is the largest performed for this metabolite (n = 14,149), with others having much smaller sample sizes and not being publicly available. By using a much larger sample size, we were able to identify genome-wide significant associations that would have otherwise been missed in smaller studies. However, the variance explained and the strength of the instruments still indicated weak instruments despite strong associations with the metabolite. In order to improve the results and conclusions observed in this study, ideally we would need to identify the SNP associations with SA levels stratified between aspirin users and non-users, similar to what was carried out in our CRC outcome sample. However, to our knowledge, metabolite, genotype, and phenotype data (of aspirin use) are not currently large enough to run this analysis. If a stronger association exists between the SNPs and SA levels in aspirin users, this would provide more strength of the appropriateness of the genetic instruments used to proxy for SA levels. We also acknowledge another limitation in this study is that the measurement of metabolites was through an untargeted metabolomics approach, and so the variables generated are assessed in units of measurement called “ion counts” which are calculated from the area under the curve of the corresponding peak in the mass spectrum. This means that metabolite measurements are quantitative values of relative changes as opposed to the absolute quantification of metabolite concentrations that can be achieved through targeted metabolomics [63]. For this reason, it is important to focus on the direction of effect and strength of association (p-values) in this study, as opposed to the magnitude of effect. This may have also impacted on the calculation of variance explained and the F statistics, which mostly indicate that the instrumental variables used in the MR were weak as they explain little of the variance, and the F-statistic is below the conventionally applied indicative threshold of 10 [64]. However, without carrying out a more targeted metabolomic approach and quantifying the exact effect of these SNPs on the metabolite levels, it is difficult to draw firm conclusions about the strength of the instruments used for MR. Furthermore, larger sample sizes of recorded aspirin use are required as currently, our study may have been underpowered to detect an effect, hence explaining the null results using the IVW approach. Our power calculations show that in aspirin users, we had sufficient power to detect an effect of SA on CRC risk with an OR of ≤0.43 and ≥2.38, whereas observationally, the effect of aspirin on CRC risk is RR: 0.79 (95% CI: 0.74–0.85) [19], and in a long-term observational follow-up of a trail, the hazard ratio was 0.80 (95% CI: 0.67–0.97) [18]. Therefore, it would be useful to repeat this analysis in a larger sample with comprehensive data on aspirin use.

5. Conclusions

Overall, the analyses presented have shown that dietary levels of SA, as well as increased levels proxied by aspirin use, may be insufficient at reducing risk of CRC, although based on the variance explained in SA levels by our SNPs and the F statistic, we acknowledge that the analysis needs to be repeated again with stronger instruments that proxy the metabolite levels.
  56 in total

1.  Salicylate-induced growth arrest is associated with inhibition of p70s6k and down-regulation of c-myc, cyclin D1, cyclin A, and proliferating cell nuclear antigen.

Authors:  B K Law; M E Waltner-Law; A J Entingh; A Chytil; M E Aakre; P Nørgaard; H L Moses
Journal:  J Biol Chem       Date:  2000-12-08       Impact factor: 5.157

Review 2.  Salicylic acid: a link between aspirin, diet and the prevention of colorectal cancer.

Authors:  J R Paterson; J R Lawrence
Journal:  QJM       Date:  2001-08

3.  Genome-wide association study of colorectal cancer identifies six new susceptibility loci.

Authors:  Fredrick R Schumacher; Stephanie L Schmit; Shuo Jiao; Christopher K Edlund; Hansong Wang; Ben Zhang; Li Hsu; Shu-Chen Huang; Christopher P Fischer; John F Harju; Gregory E Idos; Flavio Lejbkowicz; Frank J Manion; Kevin McDonnell; Caroline E McNeil; Marilena Melas; Hedy S Rennert; Wei Shi; Duncan C Thomas; David J Van Den Berg; Carolyn M Hutter; Aaron K Aragaki; Katja Butterbach; Bette J Caan; Christopher S Carlson; Stephen J Chanock; Keith R Curtis; Charles S Fuchs; Manish Gala; Edward L Giovannucci; Stephanie M Gogarten; Richard B Hayes; Brian Henderson; David J Hunter; Rebecca D Jackson; Laurence N Kolonel; Charles Kooperberg; Sébastien Küry; Andrea LaCroix; Cathy C Laurie; Cecelia A Laurie; Mathieu Lemire; David Levine; Jing Ma; Karen W Makar; Conghui Qu; Darin Taverna; Cornelia M Ulrich; Kana Wu; Suminori Kono; Dee W West; Sonja I Berndt; Stéphane Bezieau; Hermann Brenner; Peter T Campbell; Andrew T Chan; Jenny Chang-Claude; Gerhard A Coetzee; David V Conti; David Duggan; Jane C Figueiredo; Barbara K Fortini; Steven J Gallinger; W James Gauderman; Graham Giles; Roger Green; Robert Haile; Tabitha A Harrison; Michael Hoffmeister; John L Hopper; Thomas J Hudson; Eric Jacobs; Motoki Iwasaki; Sun Ha Jee; Mark Jenkins; Wei-Hua Jia; Amit Joshi; Li Li; Noralene M Lindor; Keitaro Matsuo; Victor Moreno; Bhramar Mukherjee; Polly A Newcomb; John D Potter; Leon Raskin; Gad Rennert; Stephanie Rosse; Gianluca Severi; Robert E Schoen; Daniela Seminara; Xiao-Ou Shu; Martha L Slattery; Shoichiro Tsugane; Emily White; Yong-Bing Xiang; Brent W Zanke; Wei Zheng; Loic Le Marchand; Graham Casey; Stephen B Gruber; Ulrike Peters
Journal:  Nat Commun       Date:  2015-07-07       Impact factor: 14.919

4.  Aspirin hydrolysis in plasma is a variable function of butyrylcholinesterase and platelet-activating factor acetylhydrolase 1b2 (PAFAH1b2).

Authors:  Gang Zhou; Gopal K Marathe; Jaana Hartiala; Stanley L Hazen; Hooman Allayee; W H Wilson Tang; Thomas M McIntyre
Journal:  J Biol Chem       Date:  2013-03-18       Impact factor: 5.157

5.  Association of Aspirin and Nonsteroidal Anti-Inflammatory Drugs With Colorectal Cancer Risk by Molecular Subtypes.

Authors:  Efrat L Amitay; Prudence R Carr; Lina Jansen; Viola Walter; Wilfried Roth; Esther Herpel; Matthias Kloor; Hendrik Bläker; Jenny Chang-Claude; Hermann Brenner; Michael Hoffmeister
Journal:  J Natl Cancer Inst       Date:  2019-05-01       Impact factor: 13.506

6.  Mendelian randomization: using genes as instruments for making causal inferences in epidemiology.

Authors:  Debbie A Lawlor; Roger M Harbord; Jonathan A C Sterne; Nic Timpson; George Davey Smith
Journal:  Stat Med       Date:  2008-04-15       Impact factor: 2.373

Review 7.  Dietary fibre, whole grains, and risk of colorectal cancer: systematic review and dose-response meta-analysis of prospective studies.

Authors:  Dagfinn Aune; Doris S M Chan; Rosa Lau; Rui Vieira; Darren C Greenwood; Ellen Kampman; Teresa Norat
Journal:  BMJ       Date:  2011-11-10

8.  Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression.

Authors:  Jack Bowden; George Davey Smith; Stephen Burgess
Journal:  Int J Epidemiol       Date:  2015-06-06       Impact factor: 7.196

9.  Aspirin inhibits colon cancer cell and tumor growth and downregulates specificity protein (Sp) transcription factors.

Authors:  Satya Pathi; Indira Jutooru; Gayathri Chadalapaka; Vijayalekshmi Nair; Syng-Ook Lee; Stephen Safe
Journal:  PLoS One       Date:  2012-10-26       Impact factor: 3.240

10.  Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator.

Authors:  Jack Bowden; George Davey Smith; Philip C Haycock; Stephen Burgess
Journal:  Genet Epidemiol       Date:  2016-04-07       Impact factor: 2.135

View more
  2 in total

1.  Essential metals, vitamins and antioxidant enzyme activities in COVID-19 patients and their potential associations with the disease severity.

Authors:  Iman Al-Saleh; Nujud Alrushud; Hissah Alnuwaysir; Rola Elkhatib; Mohamed Shoukri; Fouad Aldayel; Razan Bakheet; Maha Almozaini
Journal:  Biometals       Date:  2022-01-07       Impact factor: 2.949

Review 2.  An Overview of Methods and Exemplars of the Use of Mendelian Randomisation in Nutritional Research.

Authors:  Derrick A Bennett; Huaidong Du
Journal:  Nutrients       Date:  2022-08-19       Impact factor: 6.706

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.