Alan Z Yang, Luke Jostins-Dean.
Abstract
A combination of genetic susceptibility and environmental exposure is thought to cause inflammatory bowel disease (IBD), but the non-genetic component remains poorly characterized. We therefore undertook a search for environmental variables and gene-environment interactions associated with future IBD diagnosis in a large UK cohort. Using self-report and electronic health records, we identified 1946 Crohn's disease (CD) and 3715 ulcerative colitis (UC) patients after quality control in the UK Biobank. Based on prior literature and biological plausibility, we tested 38 candidate environmental variables for association with CD, UC, and overall IBD using Cox proportional hazards regressions. We also tested whether these variables interacted with polygenic risk in predicting disease, following up significant (FDR < 0.05) results with tests for SNP-environment associations. We performed robustness analyses on all significant results. As in previous reports, appendectomy protected against UC, smoking (both current and previous) elevated risk for CD, current smoking protected against UC, and previous smoking imparted a risk for UC. Childhood antibiotic use was associated with IBD, as was sun exposure during the winter. Socioeconomic deprivation conferred a risk for IBD, CD, and UC. We uncovered negative interactions between polygenic risk and previous oral contraceptive use for IBD and UC. Polygenic risk also interacted negatively with previous smoking in predicting UC. There were no individually significant SNP-environment interactions. Thus, for a limited set of environmental variables, there was strong evidence of association with IBD diagnosis in the UK Biobank, and interaction with polygenic risk was minimal.
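The FDR < 0.05 threshold mentioned in the abstract can be illustrated with a short sketch. The abstract does not say which FDR procedure was used; the code below assumes the Benjamini-Hochberg step-up procedure, and the function name `benjamini_hochberg` and the example p-values are hypothetical, not taken from the study.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Flag which tests are significant at false discovery rate `alpha`.

    Assumption: Benjamini-Hochberg step-up procedure (the source only
    states "FDR < 0.05" and does not name the method).
    Returns a list of booleans in the same order as `p_values`.
    """
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k (1-based) with p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_k = rank

    # Every test ranked at or below max_k is declared significant.
    significant = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_k:
            significant[idx] = True
    return significant


if __name__ == "__main__":
    # Illustrative p-values only.
    example = [0.001, 0.008, 0.039, 0.041, 0.042,
               0.06, 0.074, 0.205, 0.212, 0.216]
    print(benjamini_hochberg(example))
```

Because the procedure is step-up, a p-value that misses its own rank threshold can still be declared significant when a larger p-value further down the ranking meets its threshold.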
Year: 2022 PMID: 35764673 PMCID: PMC9240024 DOI: 10.1038/s41598-022-13222-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1. Number of IBD cases in the UK Biobank based on method of identification. ICD10-coded diagnoses were recorded in hospital episode statistics (HES) while self-reported conditions were gathered by survey at recruitment.
Figure 2. Lag time between self-reported IBD and HES-coded IBD for patients self-reporting IBD after 1997, measured by number of years from earliest recollection of first diagnosis to first instance of relevant ICD10-code in HES records.
Characteristics of environmental variables investigated. PRS = polygenic risk score. OCT = oral contraceptive therapy. HRT = hormone replacement therapy.
| Environmental variable | # of CD cases/# of participants in analysis | # of UC cases/# of participants in analysis | Prospective versus retrospective | Notes on variable definitions | Covariates used in regression |
|---|---|---|---|---|---|
| Diet pattern, 4 variables (frequency per week): red meat, processed meat, fresh fruit, alcohol | ~400/~355,000 (differs slightly for each variable) | ~910/~355,000 (differs slightly for each variable) | Prospective | | PRS, age, sex, 10 genetic principal components (ancestry), UK Biobank assessment center location |
| 24-h dietary recall—17 variables (amount consumed daily based on 24 h recall): fiber, fat, polyunsaturated fats, saturated fats, sugar, alcohol, iron, calcium, potassium, magnesium, protein, vitamin B6, folate, vitamin B12, vitamin C, vitamin D, vitamin E | 10/18,291 | 31/18,291 | Prospective | We did not include intake of vitamin supplements in our analysis because supplemental intake is not quantified in the UK Biobank | PRS, age, sex, 10 genetic principal components (ancestry), UK Biobank assessment center location, daily caloric intake |
| Socioeconomic deprivation (Index of Multiple Deprivation 2010) | 439/353,075 | 961/353,375 | Prospective | | PRS, age, sex, 10 genetic principal components (ancestry) |
| Sun exposure during the summer (hours spent outdoors on a typical day) | 450/361,895 | 990/362,192 | Prospective | ||
| Sun exposure during the winter (hours spent outdoors on a typical day) | 450/361,895 | 990/362,192 | Prospective | ||
| Latitude at recruitment | 446/358,812 | 980/358,812 | Prospective | ||
| Latitude at birth | 1088/345,732 | 1915/345,732 | Retrospective | ||
| Cesarean section | 354/124,664 | 680/124,664 | Retrospective | | PRS, age, sex, 10 genetic principal components (ancestry), UK Biobank assessment center location |
| Breastfed as baby | 1134/364,796 | 1998/364,796 | Retrospective | ||
| Maternal smoking around birth | 1108/359,405 | 1972/359,405 | Retrospective | ||
| Appendectomy | 1946/364,898 | 3714/364,898 | Retrospective | | PRS, age, sex, 10 genetic principal components (ancestry), UK Biobank assessment center location |
| Prolonged exposure to antibiotics during childhood (surveyed) | 288/119,936 | 604/119,927 | Retrospective | ||
| Regular NSAID use | 451/361,849 | 989/361,849 | Prospective | Includes aspirin. Participants were classed as “regular users” if they used NSAIDs 4 or more times a week for the past four weeks at time of survey | |
| Smoking (current use) | 1684/310,960 | 3154/310,960 | Retrospective (time-varying) | Participants who provided ages for starting (or stopping) smoking which did not fall within 5 years of each other were removed. Those who did not smoke for longer than a year were excluded | |
| Smoking (previous use) | 1684/310,960 | 3154/310,960 | Retrospective (time-varying) | ||
| Oral contraceptive therapy (current use) | 942/175,001 | 1627/175,001 | Retrospective (time-varying) | Participants who provided ages for starting (or stopping) OCT which did not fall within 5 years of each other were excluded. Those who did not use OCT for longer than a year were excluded | |
| Oral contraceptive therapy (previous use) | 942/175,001 | 1627/175,001 | Retrospective (time-varying) | ||
| Hormone replacement therapy (current use) | 942/175,001 | 1627/175,001 | Retrospective (time-varying) | Participants who provided ages for starting (or stopping) HRT which did not fall within 5 years of each other were excluded. Those who did not use HRT for longer than a year were excluded | |
| Hormone replacement therapy (previous use) | 942/175,001 | 1627/175,001 | Retrospective (time-varying) | ||
Figure 3. Schematic of the primary analyses we carried out for each variable. All variables were measured at a single time point except 24-h diet recall, which represented the average value from 3 to 5 surveys taken at multiple time points. IBD = inflammatory bowel disease, IMD = Index of Multiple Deprivation (a measure of socioeconomic status), OCT = oral contraceptive therapy.
Figure 4. Forest plot of hazard ratios (dots) and 95% confidence intervals (lines) obtained from Cox regressions. Hazard ratios were adjusted for other covariates, including polygenic risk. Statistically significant results (FDR < 0.05) are represented by filled circles. For continuous variables, hazard ratios are given per standard deviation of the variable. For binary variables, raw hazard ratios are given. Results for 24-h dietary variables are shown in Supplemental Fig. 3. Results for hormone replacement therapy are not displayed because they did not meet the proportional hazards assumption.
Figure 5. Forest plot of hazard ratios (dots) and 95% confidence intervals (lines) for PRSxE interactions obtained from Cox regressions. Hazard ratios were adjusted for other covariates, including polygenic risk. Statistically significant results (FDR < 0.05) are represented by filled circles. The x-axis is truncated at 2. For continuous variables, hazard ratios are given per standard deviation of the variable per standard deviation of PRS. For binary variables, hazard ratios are given per standard deviation of PRS. Results for 24-h dietary variables are shown in Supplemental Fig. 4. Results for hormone replacement therapy are not displayed because they did not meet the proportional hazards assumption.
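One way to read the PRSxE hazard ratios is against a Cox model with a multiplicative interaction term. The form below is the standard textbook parameterization and is assumed here rather than quoted from the paper; the symbols $G_i$ (standardized polygenic risk score), $E_i$ (environmental variable), and $\mathbf{z}_i$ (remaining covariates) are introduced only for illustration.

```latex
h_i(t) = h_0(t)\,\exp\!\left(\beta_G G_i + \beta_E E_i + \beta_{GE}\, G_i E_i + \boldsymbol{\gamma}^{\top}\mathbf{z}_i\right)

\mathrm{HR}_{\text{interaction}} = \exp(\beta_{GE}),
\qquad
\mathrm{HR}_{E \mid G_i = g} = \exp(\beta_E + \beta_{GE}\, g)
```

Under this assumed form, a negative interaction corresponds to $\beta_{GE} < 0$, that is, an interaction hazard ratio below 1: the hazard ratio associated with the exposure shrinks as polygenic risk increases.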
Figure 6. Kaplan–Meier curves for the statistically significant (FDR < 0.05 in Cox regressions conditional on polygenic risk and other covariates) environmental variables with IBD diagnosis as the event. Shading indicates 95% confidence interval. For IMD, hazard ratios are given per standard deviation of the variable. IMD = Index of Multiple Deprivation.
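For readers unfamiliar with how a Kaplan–Meier curve is built, the following is a minimal sketch of the product-limit estimator in plain Python. It is not the authors' code: the function name `kaplan_meier` and the toy data are hypothetical, and confidence bands such as the shading in the figure are not computed.

```python
def kaplan_meier(durations, events):
    """Kaplan-Meier (product-limit) survival estimate.

    durations: follow-up time for each participant
    events:    1 if the event (e.g. diagnosis) occurred at that time,
               0 if the participant was censored
    Returns a list of (time, survival_probability) pairs, one for each
    distinct time at which at least one event occurred.
    """
    data = sorted(zip(durations, events))
    n_at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0   # events observed exactly at time t
        n_leaving = 0  # events plus censorings at time t
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            # Multiply by the conditional probability of surviving past t.
            survival *= 1 - n_events / n_at_risk
            curve.append((t, survival))
        # Everyone observed at t leaves the risk set afterwards.
        n_at_risk -= n_leaving
    return curve


if __name__ == "__main__":
    # Toy data: five participants, three events, two censored.
    for time, s in kaplan_meier([1, 2, 2, 3, 4], [1, 0, 1, 0, 1]):
        print(time, round(s, 3))
```

Censored participants contribute to the risk set up to their censoring time but never reduce the survival estimate directly, which is why the curve only steps down at event times.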
Figure 7. Kaplan–Meier curves for variables with statistically significant PRSxE interactions (FDR < 0.05 in Cox regressions conditional on polygenic risk and other covariates), with IBD diagnosis as the event. “Never” indicates participants never exposed to variable, “previous” refers to participants who started and subsequently stopped exposure, “lowPRS” refers to polygenic risk below median, “highPRS” refers to polygenic risk above median. Curves of current users are omitted to emphasize the interactive effect. OCT = oral contraceptive therapy.