| Literature DB >> 35232999 |
Koldo Garcia-Etxebarria1,2, Olga Merino3, Adrián Gaite-Reguero4, Pedro M Rodrigues5,6,7, Amaia Herrarte8, Ane Etxart8, David Ellinghaus9, Horacio Alonso-Galan8,10, Andre Franke9, Urko M Marigorta4,7, Luis Bujanda5,8, Mauro D'Amato11,7,12.
Abstract
Inflammatory bowel disease (IBD) is characterised by chronic inflammation of the gastrointestinal tract. Although its aetiology remains unknown, environmental and genetic factors are involved in its development. Regarding genetics, more than 200 loci have been associated with IBD but the transferability of those signals to the Basque population living in Northern Spain, a population with distinctive genetic background, remains unknown. We have analysed 5,411,568 SNPs in 498 IBD cases and 935 controls from the Basque population. We found 33 suggestive loci (p < 5 × 10-6) in IBD and its subtypes, namely Crohn's Disease (CD) and Ulcerative Colitis (UC), detecting a genome-wide significant locus located in HLA region in patients with UC. Those loci contain previously associated genes with IBD (IL23R, JAK2 or HLA genes) and new genes that could be involved in its development (AGT, BZW2 or FSTL1). The overall genetic correlation between European populations and Basque population was high in IBD and CD, while in UC was lower. Finally, the use of genetic risk scores based on previous GWAS findings reached area under the curves > 0.68. In conclusion, we report on the genetic architecture of IBD in the Basque population, and explore the performance of European-descent genetic risk scores in this population.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35232999 PMCID: PMC8888637 DOI: 10.1038/s41598-022-07401-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographics and features of the Basque cohort analysed in the present study.
| Inflammatory bowel disease | Crohn’s disease | Ulcerative colitis | Controls | |
|---|---|---|---|---|
| N | 498 | 284 | 208 | 935 |
| Male | 256 (51.4%) | 141 (49.6%) | 113 (54.3%) | 628 (67.2%) |
| Female | 242 (48.6%) | 143 (51.4%) | 95 (45.7%) | 307 (32.8%) |
| Age (SE) | 51.4 (13.9) | 41.46 (11.9) | ||
| Ileal | – | 132 (46.5%) | – | – |
| Colorectal | – | 30 (10.6%) | – | – |
| Ileocolonic | – | 116 (40.8%) | – | – |
| Upper GI | – | 6 (2.1%) | – | – |
| Proctitis | – | – | 24 (11.5%) | – |
| Left-sided | – | – | 105 (50.5%) | – |
| Extensive | – | – | 72 (34.6%) | – |
| Inflammatory | – | 191(67.2%) | – | – |
| Stricturing | – | 61 (21.5%) | – | – |
| Penetring | – | 64 (22.5%) | – | – |
Figure 1Genetic background of the Basque cohort analysed in the present study. (A) Relationship of the Basque cohort within 1000 genomes project European populations, according to Principal Component Analysis. (B) Principal Component Analysis of the Basque cohort, coloured by their ancestry according to Admixture analysis. Graphics were depicted using R language 4.0.5 (https://www.r-project.org) and ggplot2 3.3.5 (https://ggplot2.tidyverse.org).
Basque IBD GWAS association results and annotation, suggestive loci.
| Lead SNP | Position | EA | OA | EAF | IBD (498/935) | CD (284/935) | UC (208/935) | Nearest gene | Previously known | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| p | OR | p | OR | UC | UC | |||||||
| rs871822 | chr1:3312914 | G | T | 0.266 | 0.8 | 0.739 | 1.0 | 0.5 | PRDM16 | |||
| rs10082259 | chr1:4495821 | C | T | 0.832 | 1.5 | 1.9 | 0.294 | 1.2 | RNF220 | |||
| rs17436816 | chr1:6204161 | A | G | 0.112 | 1.6 | 2.0 | 0.739 | 1.1 | CHD5 | |||
| rs77894461 | chr1:19385772 | T | C | 0.051 | 2.2 | 1.9 | 2.7 | UBR4 | ||||
| rs6702829 | chr1:29766241 | C | T | 0.232 | 0.6 | 0.6 | 0.6 | |||||
| rs6660226 | chr1:67744601 | A | G | 0.453 | 0.7 | 0.6 | 0.7 | IL23R | IBD, CD, UC | |||
| rs1710775 | chr1:106081424 | C | A | 0.670 | 1.4 | 1.7 | 0.172 | 1.2 | ||||
| rs12566217 | chr1:230792242 | G | A | 0.144 | 1.5 | 0.103 | 1.3 | 2.0 | COG2 | |||
| rs12185578 | chr2:17609239 | A | T | 0.282 | 1.5 | 1.5 | 1.5 | RAD51AP2 | ||||
| rs148746268 | chr2:25605572 | G | A | 0.022 | 2.7 | 3.8 | 0.563 | 1.3 | DTNB | |||
| rs72949953 | chr2:214026384 | A | G | 0.039 | 2.3 | 1.8 | 3.2 | IKZF2 | ||||
| rs11129387 | chr3:30221063 | A | T | 0.364 | 0.7 | 0.5 | 0.639 | 0.9 | RBMS3 | |||
| rs62266031 | chr3:116539468 | G | C | 0.366 | 0.6 | 0.6 | 0.6 | |||||
| rs2030413 | chr3:120082782 | A | G | 0.464 | 1.5 | 1.6 | 1.3 | LRRC58 | ||||
| rs9307388 | chr4:114075688 | T | A | 0.027 | 2.9 | 3.8 | 2.0 | ANK2 | ||||
| rs143431075 | chr4:173051881 | A | G | 0.025 | 2.5 | 3.6 | 0.623 | 1.2 | GALNTL6 | |||
| rs3121685 | chr5:65662133 | C | T | 0.532 | 1.5 | 1.5 | 1.6 | |||||
| rs2052483 | chr5:102948185 | T | C | 0.214 | 1.6 | 1.8 | 1.4 | NUDT12 | ||||
| rs10515625 | chr5:148647553 | C | T | 0.105 | 0.5 | 0.5 | 4.4E | 0.5 | AFAP1L1 | |||
| rs3910312 | chr6:30008746 | C | A | 0.255 | 1.4 | 0.087 | 1.2 | 1.8 | ZNRD1ASP | IBD, UC | ||
| rs41291790 | chr6:31572664 | A | G | 0.022 | 3.1 | 0.160 | 1.7 | 5.3 | AIF1 | IBD, CD, UC | ||
| rs6946352 | chr7:16706549 | G | T | 0.045 | 2.4 | 1.9 | 3.2 | BZW2 | ||||
| rs10952655 | chr7:146391520 | A | G | 0.782 | 1.4 | 1.8 | 0.951 | 1.0 | CNTNAP2 | |||
| rs4370571 | chr8:138304595 | A | G | 0.582 | 0.8 | 0.573 | 0.9 | 0.6 | LOC101927915 | |||
| rs10119004 | chr9:5071049 | G | A | 0.543 | 0.7 | 0.6 | 0.053 | 0.8 | JAK2 | IBD, CD, UC | ||
| rs75486977 | chr10:63579821 | T | C | 0.015 | 2.7 | 0.781 | 1.1 | 4.9 | CABCOCO1 | |||
| rs17231595 | chr10:68724763 | T | G | 0.017 | 2.8 | 0.244 | 1.6 | 4.8 | CTNNA3 | |||
| rs1826333 | chr11:27850631 | C | A | 0.721 | 1.4 | 1.6 | 1.3 | BDNF | ||||
| rs2806899 | chr13:46483515 | A | G | 0.622 | 0.7 | 0.7 | 0.6 | ZC3H13 | ||||
| rs177206 | chr14:78538806 | C | T | 0.466 | 1.3 | 1.6 | 0.808 | 1.0 | NRXN3 | |||
| rs1998136 | chr14:92280675 | A | G | 0.188 | 0.5 | 0.5 | 0.5 | TC2N | ||||
| rs72755010 | chr15:64304844 | C | T | 0.042 | 1.8 | 0.636 | 1.1 | 2.8 | DAPK2 | |||
| rs11648328 | chr16:4618931 | C | T | 0.265 | 1.5 | 1.8 | 0.246 | 1.2 | C16orf96 | |||
Below the analysis, between brackets, the number of cases and controls analysed.
EA effect allele; OA other allele; EAF effect allele frequency; IBD results from the analysis of all IBD patients; CD results from the analysis of only Crohn’s Disease patients; UC results from the analysis of only Ulcerative Colitis patients; p p-value of the effect allele, in bold nominally significant results; OR odds-ratio of the effect allele; Nearest gene nearest gene within 100 kb from lead SNP (if any).
Basque IBD GWAS association results in each subtype, suggestive loci.
| Lead SNP | Ileal CD (132/935) | Ileocolinic CD (116/935) | Left-sided UC (105/935) | Extensive UC (72/935) | ||||
|---|---|---|---|---|---|---|---|---|
| p | OR | p | OR | p | OR | p | OR | |
| rs871822 | 0.600 | 0.9 | 0.727 | 1.1 | 0.4 | 0.5 | ||
| rs10082259 | 1.7 | 2.0 | 0.055 | 1.4 | 0.971 | 1.0 | ||
| rs17436816 | 2.2 | 2.2 | 0.581 | 1.2 | 0.735 | 0.9 | ||
| rs77894461 | 2.3 | 0.273 | 1.4 | 2.5 | 2.7 | |||
| rs6702829 | 0.140 | 0.8 | 0.6 | 0.6 | 0.373 | 0.8 | ||
| rs6660226 | 0.7 | 0.6 | 0.7 | 0.7 | ||||
| rs1710775 | 1.7 | 1.7 | 0.754 | 1.1 | 0.701 | 1.1 | ||
| rs12566217 | 0.644 | 1.1 | 0.084 | 1.4 | 2.0 | 2.2 | ||
| rs12185578 | 1.4 | 1.6 | 1.4 | 1.5 | ||||
| rs148746268 | 4.2 | 3.3 | 0.870 | 1.1 | 0.414 | 0.4 | ||
| rs72949953 | 0.211 | 1.6 | 2.3 | 3.4 | 3.4 | |||
| rs11129387 | 0.7 | 0.4 | 0.447 | 0.9 | 0.454 | 0.8 | ||
| rs62266031 | 0.7 | 0.6 | 0.6 | 0.081 | 0.7 | |||
| rs2030413 | 1.6 | 1.6 | 0.132 | 1.2 | 1.6 | |||
| rs9307388 | 3.9 | 4.4 | 2.9 | 0.629 | 1.4 | |||
| rs143431075 | 3.9 | 3.6 | 0.291 | 1.7 | 0.373 | 0.4 | ||
| rs3121685 | 1.4 | 1.6 | 1.8 | 0.101 | 1.3 | |||
| rs2052483 | 1.6 | 1.8 | 0.055 | 1.4 | 0.174 | 1.4 | ||
| rs10515625 | 0.4 | 0.171 | 0.7 | 0.4 | 0.079 | 0.6 | ||
| rs3910312 | 0.158 | 1.2 | 0.152 | 1.3 | 1.7 | 1.8 | ||
| rs41291790 | 0.671 | 1.3 | 0.281 | 1.7 | 5.4 | 5.6 | ||
| rs6946352 | 0.075 | 1.8 | 2.1 | 2.0 | 4.7 | |||
| rs10952655 | 1.8 | 1.6 | 0.753 | 0.9 | 0.577 | 1.1 | ||
| rs4370571 | 0.7 | 1.3 | 0.6 | 0.5 | ||||
| rs10119004 | 0.7 | 0.6 | 0.364 | 0.9 | 0.6 | |||
| rs75486977 | 0.486 | 1.5 | 0.396 | 0.4 | 5.7 | 4.7 | ||
| rs17231595 | 0.166 | 2.1 | 0.392 | 0.4 | 3.1 | 8.4 | ||
| rs1826333 | 2.0 | 0.084 | 1.3 | 1.5 | 0.772 | 1.1 | ||
| rs2806899 | 0.6 | 0.763 | 1.0 | 0.073 | 0.8 | 0.5 | ||
| rs177206 | 1.6 | 1.8 | 0.942 | 1.0 | 0.874 | 1.0 | ||
| rs1998136 | 0.5 | 0.5 | 0.5 | 0.070 | 0.6 | |||
| rs72755010 | 0.806 | 1.1 | 0.459 | 1.3 | 2.7 | 3.4 | ||
| rs11648328 | 1.9 | 1.8 | 0.169 | 1.3 | 0.940 | 1.0 | ||
Below the subtype, between brackets, the number of cases and controls analysed. p, p-value of the effect allele, in bold nominally significant results; OR, odds-ratio of the effect allele.
Figure 2Genetic regression of the results of the present study and their counterparts from IIBDGC, for IBD and its subtypes. Circle size and colour depict regression coefficients. Inside the circle the significance of the regression coefficient, ***p < 0.001, **p < 0.01, *p < 0.05; otherwise, not significant.
HLA imputation association results in the Basque IBD cohort, significant alleles.
| HLA allele | MAF | IBD (498/935) | CD (284/935) | UC (208/935) | Previously known | |||
|---|---|---|---|---|---|---|---|---|
| p | OR | p | OR | p | OR | |||
| HLA_A_3301 | 0.016 | 0.4 | 0.087 | 0.4 | 0.163 | 0.5 | ||
| HLA_A_2902 | 0.092 | 0.264 | 1.2 | 1.4 | 0.552 | 0.9 | UC | |
| HLA_A_0201 | 0.262 | 1.4 | 0.336 | 1.1 | 2.0 | UC | ||
| HLA_A_3002 | 0.037 | 0.6 | 0.060 | 0.6 | 0.165 | 0.6 | ||
| HLA_A_6801 | 0.018 | 0.3 | 0.2 | 0.166 | 0.5 | |||
| HLA_C_0102 | 0.035 | 0.6 | 0.5 | 0.411 | 0.8 | |||
| HLA_C_1601 | 0.087 | 0.111 | 1.3 | 1.4 | 0.834 | 1.0 | UC | |
| HLA_C_0702 | 0.125 | 0.470 | 1.1 | 0.605 | 0.9 | 1.4 | UC | |
| HLA_B_0702 | 0.122 | 0.311 | 1.1 | 0.668 | 0.9 | 1.5 | UC | |
| HLA_DRB1_1101 | 0.063 | 0.360 | 1.2 | 0.722 | 0.9 | 1.5 | UC | |
| HLA_DRB1_0301 | 0.107 | 0.7 | 0.6 | 0.175 | 0.8 | CD, UC | ||
| HLA_DRB1_0701 | 0.193 | 0.953 | 1.0 | 1.3 | 0.6 | CD, UC | ||
| HLA_DQA1_0104 | 0.024 | 0.545 | 1.2 | 0.394 | 0.7 | 1.9 | ||
| HLA_DQA1_0201 | 0.193 | 0.890 | 1.0 | 1.3 | 0.7 | CD, UC | ||
| HLA_DQA1_0501 | 0.107 | 0.7 | 0.7 | 0.191 | 0.8 | |||
| HLA_DQB1_0503 | 0.024 | 0.545 | 1.2 | 0.394 | 0.7 | 1.9 | ||
| HLA_DQB1_0201 | 0.106 | 0.7 | 0.6 | 0.191 | 0.8 | CD, UC | ||
| HLA_DQB1_0202 | 0.162 | 0.464 | 0.9 | 0.193 | 1.2 | 0.6 | UC | |
| HLA_DPB1_1101 | 0.062 | 0.230 | 1.2 | 1.5 | 0.735 | 0.9 | UC | |
Below the subtype, between brackets, the number of cases and controls analysed.
MAF minor allele frequency; IBD results from the analysis of all IBD patients; CD results from the analysis of only Crohn’s Disease patients; UC results from the analysis of only Ulcerative Colitis patients; p p-value of the allele, in bold nominally significant results; OR odds-ratio of the allele.
Figure 3Polygenic risk score (PRS) analysis of IBD and its subtypes. T-test p, p-value of the T-test comparing the PRS scores of cases and controls. (A) PRS calculated for all Inflammatory Bowel Disease samples using the PRS derived in Khera et al.[7]. (B) Optimal PRS calculated for all Inflammatory Bowel Disease samples using IIBDGC results as model. (C) Optimal PRS calculated only for Crohn’s Disease samples using IIBDGC results as model. (D) Optimal PRS calculated only for Ulcerative Colitis samples using IIBDGC results as model. (E) Optimal PRS calculated only for Ulcerative Colitis samples, excluding HLA region, using IIBDGC results as model.