| Literature DB >> 35386986 |
Michael A Portelli1, Kamini Rakkar1, Sile Hu2, Yike Guo2, Ian M Adcock3, Ian Sayers1.
Abstract
Asthma affects more than 300 million people globally and is both under diagnosed and under treated. The most recent and largest genome-wide association study investigating moderate to severe asthma to date was carried out in 2019 and identified 25 independent signals. However, as new and in-depth downstream databases become available, the translational analysis of these signals into target genes and pathways is timely. In this study, unique (U-BIOPRED) and publicly available datasets (HaploReg, Open Target Genetics and GTEx) were investigated for the 25 GWAS signals to identify 37 candidate causal genes. Additional traits associated with these signals were identified through PheWAS using the UK Biobank resource, with asthma and eosinophilic traits amongst the strongest associated. Gene expression omnibus dataset examination identified 13 candidate genes with altered expression profiles in the airways and blood of asthmatic subjects, including MUC5AC and STAT6. Gene expression analysis through publicly available datasets highlighted lung tissue cell specific expression, with both MUC5AC and SLC22A4 genes showing enriched expression in ciliated cells. Gene enrichment pathway and interaction analysis highlighted the dominance of the HLA-DQA1/A2/B1/B2 gene cluster across many immunological diseases including asthma, type I diabetes, and rheumatoid arthritis. Interaction and prediction analyses found IL33 and IL18R1 to be key co-localization partners for other genes, predicted that CD274 forms co-expression relationships with 13 other genes, including the HLA-DQA1/A2/B1/B2 gene cluster and that MUC5AC and IL37 are co-expressed. Drug interaction analysis revealed that 11 of the candidate genes have an interaction with available therapeutics. This study provides significant insight into these GWAS signals in the context of cell expression, function, and disease relationship with the view of informing future research and drug development efforts for moderate-severe asthma.Entities:
Keywords: GWAS; SNP; causal genes; eQTL; moderate to severe asthma
Year: 2021 PMID: 35386986 PMCID: PMC8974692 DOI: 10.3389/falgy.2021.738741
Source DB: PubMed Journal: Front Allergy ISSN: 2673-6101
Figure 1Translational analysis pipeline. Twenty-five signals from the Moderate to Severe Asthma GWAS were taken through signal to trait and signal to gene association analyses. Signals were searched for in publicly available databases such as GeneATLAS, PheWAS, Open Target Genetics, GTex, HaploReg and LDLink as well as our U-BIOPRED (UB) datasets of Bronchial Biopsies, Bronchial Brushes, and Nasal Brushes. Gene association was scored as per Table 3 and 37 genes were identified. Candidate causal genes were then searched for in gene expression datasets from subjects with asthma vs. controls and single-cell-RNA lung tissue datasets. Asthma related pathways and interactions between genes were identified using the DAVID Functional Annotation Tool and GeneMANIA prediction server and drug gene interactions were identified using the Drug Gene Interaction Database.
Scoring system for signal to gene analyses.
|
|
|
|---|---|
| GTEX respiratory relevant cell/tissue eQTL ( | 1 |
| GTEX Blood eQTL ( | 1 |
| GTEX respiratory relevant cell/tissue sQTL ( | 1 |
| GTEX blood sQTL ( | 1 |
| OTG eQTL Resp ( | 1 |
| OTG eQTL blood ( | 1 |
| OTG V2G score (presence of) | 1 |
| OTG co-localization study (H4 > 0.8) | 1 |
| HaploReg eQTL ( | 1 |
| Functional variant (presence of in an LD | 1 |
| UBIOPRED brush eQTL ( | 2 |
| UBIOPRED biopsy eQTL ( | 2 |
| UBIOPRED nasal eQTL (P < 0.05) | 2 |
| Literature association (presence of) | 1 |
|
|
|
Each potential candidate causal gene was scored based on supporting evidence. eQTL and sQTL associations were reported on presenting a p-value of <0.05, co-localization studies on presenting an H4 > 0.8 and functional variants if present withing a linkage disequilibrium block with an r.
The 25 signals from the moderate to severe asthma GWAS.
|
|
|
|
|
|
|---|---|---|---|---|
| rs61816761 | FLG | A | G | 2.37% |
| rs7523907 | CD247 | T | C | 54.08% |
| rs12479210 | IL1RL1 | T | C | 38.73% |
| rs34290285 | D2HGDH | G | A | 74.26% |
| rs560026225 | KIAA1109 | GATT | G | 23.60% |
| rs1837253 | TSLP | C | T | 74.16% |
| rs1438673 | WDR36 | C | T | 50.78% |
| rs3749833 | C5orf56 | C | T | 26.08% |
| rs1986009 | RAD50 | A | C | 18.71% |
| rs9273410 | HLA-DQB1 | A | C | 55.30% |
| rs776111176 | HLA-DQA1 | A | AAT | 14.85% |
| rs367983479 | BACH2 | CA | C | 61.50% |
| rs71266076 | MIR5708 | C | CT | 36.93% |
| rs144829310 | IL33 | T | G | 16.40% |
| rs10905284 | GATA3 | C | A | 42.94% |
| rs11603634 | MUC5AC | G | A | 50.36% |
| rs7936312 | C11orf30 | T | G | 47.42% |
| rs7305461 | RPS26 | A | C | 44.61% |
| rs703816 | STAT6 | C | T | 43.41% |
| rs10519068 | RORA | G | A | 87.25% |
| rs72743461 | SMAD3 | A | C | 23.60% |
| rs7203459 | CLEC16A | T | C | 75.44% |
| rs2941522 | IKZF3 | T | C | 48.29% |
| rs112502960 | ZNF652 | A | G | 35.92% |
| rs61840192 | LOC101928272 | G | A | 57.30% |
The 25 signals with the original reported gene (closest), asthma and non-risk allele and asthma risk allele frequency are listed. All analyses conducted in this study are based on the risk allele.
Insertion and deletions (Indels).
SNPs utilized in pipeline analyses.
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| rs61816761 | 988,882 | 20 | rs61816766 | 0.50 | rs61816764 | 0.21 | - | - | - | - |
| rs7523907 | 41,739 | 56 | rs3108155 | 0.83 | rs2056625 | 0.73 | - | - | - | - |
| rs12479210 | 458,394 | 578 | - | - | rs2270298 | 0.47 | rs2241116 | 0.46 | - | - |
| rs34290285 | 69,955 | 157 | - | - | rs34077392 | 0.43 | - | - | - | - |
| rs560026225 | 622,215 | 150 | rs72687036 | 0.66 | - | - | - | - | - | - |
| rs1837253 | 71,462 | 29 | - | - | - | - | - | - | - | - |
| rs1438673 | 227,233 | 353 | - | - | rs2289277 | 0.73 | - | - | rs7524421 | 1.00 |
| rs3749833 | 526,932 | 325 | - | - | - | - | - | - | rs11748326 | 0.76 |
| rs1986009 | 365,939 | 239 | rs12652920 | 1.00 | rs12652920 | 1.0 | rs4705952 | 0.21 | - | - |
| rs9273410 | 554,975 | 5,739 | - | - | - | - | - | - | - | - |
| rs776111176 | 574,227 | 1,447 | rs3997872 | 0.82 | - | - | - | - | - | - |
| rs367983479 | 244,383 | 191 | rs1504215 | 0.85 | rs4142967 | 0.56 | - | - | - | - |
| rs71266076 | 175,000 | 206 | rs7824993 | 0.81 | - | - | - | - | - | - |
| rs144829310 | 496,605 | 234 | - | - | rs1929996 | 0.43 | - | - | - | - |
| rs10905284 | 67,858 | 96 | - | - | - | - | - | - | - | - |
| rs11603634 | 146,648 | 187 | - | - | rs11602802 | 0.40 | - | - | - | - |
| rs7936312 | 316,410 | 277 | rs7936323 | 0.96 | - | - | - | - | - | - |
| rs7305461 | 315,101 | 120 | rs1131017 | 0.75 | - | - | - | - | - | - |
| rs703816 | 284,371 | 165 | - | - | rs167769 | 0.82 | - | - | - | - |
| rs10519068 | 108,343 | 123 | - | - | rs2279292 | 0.91 | rs2279296 | 0.13 | rs2279296 | 0.13 |
| rs72743461 | 30,796 | 45 | - | - | - | - | rs10152544 | 0.30 | rs10152544 | 0.30 |
| rs7203459 | 429,719 | 435 | - | - | rs12919828 | 0.98 | rs6498135 | 0.30 | - | - |
| rs2941522 | 805,248 | 896 | - | - | rs907091 | 0.96 | - | - | - | - |
| rs112502960 | 270,023 | 370 | rs62076439 | 1.00 | - | - | - | - | - | - |
| rs61840192 | 201,710 | 249 | rs1031163 | 1.00 | - | - | - | - | - | - |
Each sentinel SNP reported in the original moderate to severe asthma GWAS corresponds to a haplotype consisting of related SNPs (R.
Indicates the sentinel SNP was not found in 1,000 Genomes Project accessed through LDLink (Phase 3) or HaploReg (Phase 1), ergo values are reflective of the proxy SNP used.
This sentinel SNP (rs71266076) was not found in the 1,000 Genomes Project Phase 3 data accessed through LDLink therefore Phase 1 data accessed through HaploReg was used instead.
MAF, minor allele frequency in the European population from 1,000 Genomes Project Phase 3.
Figure 2PheWAS of signals for moderate-severe asthma. Signals (risk allele in brackets) or their proxies were searched for in the GeneATLAS PheWAS database of the UK Biobank cohort. A Bonferroni adjusted p-value of 6.42 × 10−5 was used. The interaction between signals (top) and PheWAS traits (right) are represented on a grid and the area of the circle represents the -log10 (p-value) of the association. A larger area indicates a lower p-value. Only selected PheWAS traits have been displayed and organized into asthma (blue), blood/immune cell (red), allergy (yellow), other respiratory (green), inflammatory (purple), and auto-immune (gray) groups. The trait “basophil percentage” did not meet the Bonferroni corrected P-value for any of the signals and therefore was not included in the Figure. A full list of PheWAS terms and Beta and P-values are available in Supplementary Table 1. For disease traits (i.e., all except blood cell traits) a black dot represents an odds ratio of <1 with respect to the moderate to severe asthma risk allele. Proxies were used for the following signals which were not present in the database: rs61816761 (rs61816766, R2 = 0.50), rs367983479 (rs1504215, R2 = 0.85), rs71266076 (rs7824993, R2 = 0.81), rs7305461 (rs1131017, R2 = 0.75), rs112502960 (rs62076439, R2 = 1.0), rs61840192 (rs1031163, R2 = 1.0), rs560026225 (rs72687036, R2 = 0.66), and rs776111176 (rs3997872, R2 = 0.82).
Selection of genes of interest relative to signal of association.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| rs61816761 | FLG | FLG | 2 | 0 | 2 |
| TUFT1 | 0 | 1 | 2 | ||
| SELENBP1 | 0 | 1 | 2 | ||
| C1orf68 | 0 | 1 | 2 | ||
| rs7523907 | CD247 | CD247 | 7 | 0 | 7 |
| BRP44 | 0 | 1 | 2 | ||
| CREG1 | 0 | 1 | 2 | ||
| rs12479210 | IL1RL1 | IL1RL1 | 11 | 1 | 13 |
| IL1R1 | 0 | 1 | 2 | ||
| IL18R1 | 6 | 0 | 6 | ||
| rs34290285 | D2HGDH | D2HGDH | 11 | 1 | 13 |
| PDCD1 | 3 | 0 | 3 | ||
| GAL3ST2 | 1 | 0 | 1 | ||
| KIF1A | 1 | 1 | 3 | ||
| rs560026225 | KIAA1109 | KIAA1109 | 4 | 0 | 4 |
| rs1837253 | TSLP | TSLP | 2 | 0 | 2 |
| WDR36 | 1 | 0 | 1 | ||
| rs1438673 | WDR36 | WDR36 | 3 | 0 | 3 |
| CAMK4 | 1 | 0 | 1 | ||
| TMEM232 | 0 | 1 | 2 | ||
| TSLP | 1 | 1 | 3 | ||
| STMN1 | 0 | 1 | 2 | ||
| rs3749833 | C5orf56 | SLC22A5 | 8 | 0 | 8 |
| SLC22A4 | 2 | 0 | 2 | ||
| P4HA2 | 2 | 0 | 2 | ||
| C5orf56 | 1 | 0 | 1 | ||
| IRF1 | 3 | 0 | 3 | ||
| LOC553103 | 0 | 1 | 2 | ||
| SEPT8 | 0 | 1 | 2 | ||
| ANKRD43 | 0 | 1 | 2 | ||
| rs1986009 | RAD50 | SLC22A5 | 3 | 0 | 3 |
| SLC22A4 | 3 | 0 | 3 | ||
| ACSL6 | 0 | 1 | 2 | ||
| IL13 | 0 | 1 | 2 | ||
| IL4 | 0 | 1 | 2 | ||
| rs9273410 | HLA-DQB1 | HLA-DQB1 | 2 | 3 | 8 |
| HLA-DQB2 | 2 | 3 | 8 | ||
| HLA-DQA1 | 2 | 0 | 2 | ||
| HLA-DQA2 | 2 | 0 | 2 | ||
| ATF6B | 1 | 0 | 1 | ||
| rs776111176 | HLA-DQA1 | HLA-DQA1 | 6 | 2 | 8 |
| HLA-DQA2 | 6 | 0 | 6 | ||
| HLA-DQB1 | 6 | 6 | 12 | ||
| HLA-DQB2 | 6 | 0 | 6 | ||
| HLA-DRB3 | 0 | 2 | 2 | ||
| HLA-DOB | 4 | 0 | 4 | ||
| AGER | 0 | 2 | 2 | ||
| LY6G6E | 0 | 2 | 2 | ||
| PFDN6 | 0 | 2 | 2 | ||
| NEU1 | 0 | 2 | 2 | ||
| DOM3Z | 0 | 2 | 2 | ||
| LY6G6D | 0 | 2 | 2 | ||
| COL11A2 | 0 | 2 | 2 | ||
| rs367983479 | BACH2 | BACH2 | 3 | 0 | 3 |
| ANKRD6 | 0 | 1 | 2 | ||
| MAP3K7 | 0 | 1 | 2 | ||
| GABRR2 | 0 | 1 | 2 | ||
| rs71266076 | MIR5708 | ZBTB10 | 1 | 0 | 1 |
| rs144829310 | IL33 | IL33 | 2 | 2 | 6 |
| ERMP1 | 1 | 0 | 1 | ||
| TPD52L3 | 0 | 1 | 2 | ||
| rs10905284 | GATA3 | GATA3 | 2 | 0 | 2 |
| rs11603634 | MUC5AC | MUC5AC | 2 | 2 | 6 |
| TNNT3 | 0 | 1 | 2 | ||
| rs7936312 | C11orf30 | LRRC32 | 4 | 0 | 4 |
| BRCA2 | 1 | 0 | 1 | ||
| rs7305461 | RPS26 | RPS26 | 4 | 0 | 4 |
| SUOX | 4 | 0 | 4 | ||
| RAB5B | 2 | 0 | 2 | ||
| ERBB3 | 2 | 0 | 2 | ||
| ESYT1 | 1 | 0 | 1 | ||
| GDF11 | 1 | 0 | 1 | ||
| RNF41 | 1 | 0 | 1 | ||
| rs703816 | STAT6 | STAT6 | 5 | 0 | 5 |
| NEMP1 | 1 | 0 | 1 | ||
| RBMS2 | 1 | 0 | 1 | ||
| SPRYD4 | 1 | 0 | 1 | ||
| EEF1AKMT3 | 1 | 0 | 1 | ||
| ZBTB39 | 0 | 1 | 2 | ||
| CDK4 | 0 | 1 | 2 | ||
| ESYT1 | 0 | 1 | 2 | ||
| rs10519068 | RORA | RORA | 4 | 0 | 4 |
| ICE2 | 0 | 1 | 2 | ||
| ANXA2 | 0 | 1 | 2 | ||
| FOXB1 | 0 | 1 | 2 | ||
| s72743461 | SMAD3 | SMAD3 | 4 | 0 | 4 |
| AAGAB | 3 | 0 | 3 | ||
| C15orf61 | 0 | 1 | 2 | ||
| BPGM | 0 | 1 | 2 | ||
| MAP2K1 | 0 | 1 | 2 | ||
| rs7203459 | CLEC16A | CLEC16A | 2 | 1 | 4 |
| TEKT5 | 0 | 1 | 2 | ||
| PRM1 | 0 | 1 | 2 | ||
| DEXI | 3 | 0 | 3 | ||
| rs2941522 | IKZF3 | ORMDL3 | 4 | 0 | 4 |
| GSDMB | 5 | 0 | 5 | ||
| GSDMA | 2 | 0 | 2 | ||
| PGAP3 | 3 | 0 | 3 | ||
| MSL1 | 1 | 1 | 3 | ||
| IKZF3 | 2 | 0 | 2 | ||
| ARL5C | 0 | 1 | 2 | ||
| rs112502960 | ZNF652 | ZNF652 | 3 | 0 | 3 |
| GNGT2 | 3 | 0 | 3 | ||
| PHOSPHO1 | 2 | 0 | 2 | ||
| TMEM92 | 0 | 1 | 2 | ||
| HOXB4 | 0 | 1 | 2 | ||
| NCRNA00253 | 0 | 1 | 2 | ||
| rs61840192 | LOC101928272 | GATA3 | 4 | 0 | 4 |
| GATA3-AS1 | 2 | 0 | 2 |
Signals were analyzed using the translational pipeline including publicly available datasets such as Open Target Genetics, GTex, HaploReg and LDLink and our UBIOPRED (UB) datasets of Bronchial Biopsies, Bronchial Brushes, and Nasal Brushes. Each gene was scored based on supporting evidence as laid out in .
Figure 3Bubble plots identifying the eQTL patterns for each selected SNP:Gene pairing as identified in Table 3 and tissue specific eQTLs. Plot identifies each eQTL P-value (x-axis) and B-value (y-axis) stratified based on the asthma risk allele for each identified compartment as denoted by the corresponding color; Blue = Lung, Red = Whole Blood, Green = T-cells, Yellow = U-BIOPRED Bronchial Biopsy, Orange = U-BIOPRED Bronchial Brush and Purple—U-BIOPRED Nasal Brush. The size of the bubble indicates the generated score in Table 3. EQTLs listed are those that achieved a P-value of P < 0.05 in one of the interrogated datasets.
Gene function of candidate causal genes.
|
|
|
|
|---|---|---|
|
| ||
|
| Filaggrin | (1) protein binding |
|
| Mucin 5AC, oligomeric mucus/gel-forming | (1) phosphatidylinositol-mediated signaling |
|
| ||
|
| Gasdermin B | (1) wide pore channel activity |
|
| ||
|
| BTB domain and CNC homolog 2 | (1) sequence-specific double-stranded DNA binding |
|
| GATA binding protein 3 | (1) transcription regulatory region sequence-specific DNA binding |
|
| Interferon regulatory factor 1 | (1) protein binding |
|
| RAR related orphan receptor A | (1) DNA-binding transcription factor activity |
|
| Ribosomal protein S26 | (1) negative regulation of RNA splicing |
|
| SMAD family member 3 | (1) RNA polymerase II core promoter proximal region sequence-specific DNA binding |
|
| Signal transducer and activator of transcription 6 | (1) protein binding |
|
| WD repeat domain 36 | (1) poly(A) RNA binding |
|
| ||
|
| Alpha and gamma adaptin binding protein | (1) protein binding |
|
| C-type lectin domain containing 16A | (1) possible involvement in autophagy and endosomal transport |
|
| Solute carrier family 22 member 4 | (1) carnitine transport |
|
| Solute carrier family 22 member 5 | (1) carnitine transport |
|
| ORMDL sphingolipid biosynthesis regulator 3 | (1) Protein binding |
|
| ||
|
| D-2-hydroxyglutarate dehydrogenase | (1) (R)-2-hydroxyglutarate dehydrogenase activity |
|
| Post-GPI attachment to proteins phospholipase 3 | (1) Possible involvement in GPI anchor metabolic process and hydrolase activity, acting on ester bonds |
|
| MSL complex subunit 1 | (1) Histone H4-K16 acetylation |
|
| Sulfite oxidase | (1) Possible involvement in sulfur compound metabolic processing |
|
| ||
|
| CD247 molecule | (1) Protein binding |
|
| G protein subunit gamma transducin 2 | (1) GTPase activity |
|
| Major histocompatibility complex, class II, DQ beta 1 | (1) Humoral immune response mediated by circulating immunoglobulin |
|
| Interleukin 18 receptor 1 | (1) Protein binding |
|
| Interleukin 1 receptor like 1 | (1) protein binding |
|
| Interleukin 33 | (1) positive regulation of chemokine secretion |
|
| Leucine rich repeat containing 32 | (1) TGF-beta binding and signaling pathway |
|
| Programmed cell death 1 | (1) protein binding |
|
| Thymic stromal lymphopoietin | (1) cytokine activity |
|
| ||
|
| DEXI homolog | Unknown |
|
| KIAA1109 | (1) Protein binding |
|
| Kinesin family member 1A | (1) Identical protein binding |
|
| Zinc finger protein 652 | (1) Protein binding |
|
| Zinc finger and BTB domain containing 10 | (1) Protein binding |
Gene function and biological process for the candidate genes as determined using the Gene 2 Function and UniProt databases, listing associated disease states and gene function annotations.
Figure 4Tissue expression and colocalization analysis of candidate genes. (A) Data was taken from the Human Protein Atlas for both total lung RNA expression scores (bar chart, NX) and single cell RNA expression [heatmap, log10 (median transcripts per million protein coding genes)] in healthy tissue. Bar directly corresponds to gene in heatmap. The Human Protein Atlas was also analyzed for genes which showed enriched/enhanced gene expression in epithelial cells (blue bar) and blood/immune cells (red bar) compared to other cell types. A full list of enriched/enhanced tissue and cell gene expression data per gene is available in Supplementary Table 2. No data was available for FLG and therefore has not been shown. (B) Co-localization analysis (H4 scores) data was taken from the Open Target Genetics (OTG) platform for lung (blue squares) or blood/immune cells (red circles) in studies with the asthma trait (exclusive). Scores represent evidence of association between candidate gene, specific tissue and asthma trait. A higher score indicates greater association. Dotted line shows cut off value of 0.8. Candidate genes AAGAB, CD247, DEXI, FLG, GATA3, HLA-DQA1/A2, B2, IL33, KIF1A, KIAA1109, MSL1, MUC5AC, PGAP3, and ZBTB10 had no association data for lung/blood/immune cells/tissue in OTG and therefore have not been shown.
Figure 5mRNA expression of candidate causal genes in bronchial epithelial cells taken from patients with asthma and controls. Boxes shows the median and IQR and the whiskers show the minimum and maximum data. Bronchial epithelial brush samples were from controls (C, n = 20) and patients with mild-moderate (MA, n = 50) and severe (SA, n = 38) asthma. Data is shown for (A) IL18R1, (B) IL1RL1, (C) MUC5AC, (D) ORMDL3, (E) RPS26, and (F) SLC22A4. Expression values were taken from the dataset and a Kruskal-Wallis test with a two-stage linear step-up procedure of Benjamini, Krieger and Yekutieli used to control the FDR at 5% was performed. *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001.
Figure 6mRNA expression of candidate causal genes in blood cells taken from patients with asthma and controls. Boxes shows the median and IQR and the whiskers show the minimum and maximum data. Blood samples were from controls (C, n = 87) and patients with mild-moderate (MA, n = 77) and severe (SA, n = 246) asthma. Data is shown for (A) BACH2, (B) CD247, (C) HLA-DQA1/A2, (D) ORMDL3, (E) RORA, (F) IL1RL1, and (G) STAT6. Expression values were taken from the dataset and either a Kruskal-Wallis test or Welch's ANOVA (STAT6), both with a two-stage linear step-up procedure of Benjamini, Krieger and Yekutieli used to control the FDR at 5%, was performed. *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001.
DAVID functional annotation tool analysis of the genetic association database of complex diseases and disorders (GAD).
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Asthma | FLG, IL33, GSDMB, SLC22A5, SMAD3, TSLP, GATA3, SUOX, MUC5AC, IL1RL1, IRF1, ORMDL3, STAT6, HLA-DQA2, IL18R1, HLA-DQA1, HLA-DQB1 | 17 (45.95) | 14.53 | 1.78 × 10−15 | 1.07 × 10−12 |
| Diabetes mellitus, type 1 | KIAA1109, GSDMB, CLEC16A, PDCD1, SUOX, HLA-DQA2, BACH2, HLA-DQA1, HLA-DQB1 | 9 (24.32) | 32.76 | 1.58 × 10−10 | 4.76 × 10−08 |
| Celiac disease | IL1RL1, KIAA1109, IRF1, CLEC16A, CD247, BACH2, IL18R1, HLA-DQA1, HLA-DQB1 | 9 (24.32) | 22.97 | 2.72 × 10−09 | 5.44 × 10−07 |
| Nasal polyposis | IL33, IL1RL1, WDR36, HLA-DQA1, HLA-DQB1 | 5 (13.51) | 140.38 | 3.00 × 10−08 | 4.30 × 10−06 |
| Obesity|asthma | IL33, IL1RL1, GSDMB, TSLP, ORMDL3, IL18R1 | 6 (16.22) | 60.47 | 3.57 × 10−08 | 4.30 × 10−06 |
| Ulcerative colitis | SLC22A4, GSDMB, SLC22A5, ORMDL3, STAT6, HLA-DQA1, HLA-DQB1 | 7 (18.92) | 31.27 | 6.45 × 10−08 | 6.46 × 10−06 |
| Arthritis, rheumatoid | SLC22A4, IL1RL1, KIAA1109, CD247, HLA-DQA2, HLA-DQA1, HLA-DQB1 | 7 (18.92) | 19.79 | 9.79 × 10−07 | 8.41 × 10−05 |
| Crohn's disease | GSDMB, SLC22A5, SMAD3, ORMDL3, BACH2, HLA-DQA1 | 6 (16.22) | 23.12 | 4.63 × 10−06 | 3.48 × 10−04 |
| Diabetes, type 1 | SLC22A4, SLC22A5, IRF1, CLEC16A, PDCD1, HLA-DQA1, HLA-DQB1 | 7 (18.92) | 14.48 | 6.02 × 10−06 | 3.62 × 10−04 |
| Rheumatoid arthritis | SLC22A4, SLC22A5, CLEC16A, PDCD1, HLA-DQB2, HLA-DQA1, HLA-DQB1 | 7 (18.92) | 14.48 | 6.02 × 10−06 | 3.62 × 10−04 |
The top 10 genes clusters and disease associations are shown, the full table is available in .
Fold enrichment.
Figure 7Gene ontology (GO) term and pathway analyses. The DAVID bioinformatic tools was used to assess candidate genes in (A) GO Term and (B) KEGG and REACTOME Pathway analyses. -log10 (p-value) was plotted against fold enrichment for gene groups and the area of the circle is proportional to number of genes in the group. 5% FDR. Full analyses can be viewed in Supplementary Tables 5, 6.
Figure 8Gene interaction analysis. GeneMANIA was used to explore and predict gene interactions. Black circles represent inputted candidate genes and gray circles are predicted genes (selected to show top 20 with strongest predicted association). The area of the gray circle represents prediction scores and thickness of line represent interaction scores, with a larger area/thicker line representing stronger prediction/interaction.
Known drug interactions and molecules currently in clinical trials of 37 candidate causal genes and five predicted genes.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| AAGAB | - | - | - | - | - |
| BACH2 | - | - | - | - | - |
| CD247 | MUROMO-B-CD3 | - | 2503348, | 15.95 | - |
| CD247 | BLI-TUMOMAB | - | - | 3.04 | - |
| CD247 | AZACITIDINE | - | 15795105 | 1.25 | - |
| CD274 | AVELUMAB | Antibody (inhibitory) | 28472902, | 23.92 |
|
| CD274 | ATEZOLIZUMAB | Antibody (inhibitory) | 24403232, | 10.63 |
|
| CD274 | DURVALUMAB | Antibody (inhibitory) | 28472902, | 7.97 |
|
| CD274 | LODAPOLIMAB | Antibody (inhibitory) | - | 5.32 | - |
| CD274 | CX-072 | Clinical Trials/ CytomX Therapeutics/Solid tumor | - | 5.32 | NCT03993379, |
| CD274 | BMS-936559 | Antibody (inhibitory) | - | 5.32 | NCT01452334, |
| CD274 | ENVAFOLIMAB | Antibody (inhibitory) | - | 5.32 | NCT04480502 |
| CD274 | PIDILIZUMAB | Antibody (inhibitory) | - | 2.66 | NCT02530125 |
| CD274 | NIVOLUMAB | Antibody (inhibitory) | 26086854, | 2.48 |
|
| CD274 | PEMBROLIZUMAB | Antibody (inhibitory) | 25891174, | 2.17 |
|
| CD274 | M-7824 | Clinical Trials | - | 1.77 |
|
| CLEC16A | - | - | - | - | - |
| D2HGDH | - | - | - | - | - |
| DEXI | - | - | - | - | - |
| FLG | - | - | - | - | - |
| GATA3 | PEGASPARGASE | - | 24141364 | 2.45 | - |
| GATA3 | SB010 | Clinical Trial/ Sterna Biologicals GmbH & Co. KG/Asthma | - | - | NCT01743768 |
| GATA3 | SB011 | Clinical Trial/ Sterna Biologicals GmbH & Co. KG/ Mild to Moderate Atopic Dermatitis | - | - | NCT02079688 |
| GATA3 | SB012 | Clinical Trial/ Sterna Biologicals GmbH & Co. KG/Ulcerative Colitis | - | - | NCT02129439 |
| GNGT2 | - | - | - | - | - |
| GSDMB | - | - | - | - | - |
| HLA-DQA1 | LUMIRACOXIB | - | 20639878 | 7.97 | - |
| HLA-DQA1 | LAPATINIB | - | 24687830, | 2.28 | - |
| HLA-DQA1 | AZATHIOPRINE | - | 25217962 | 1.28 | - |
| HLA-DQA2 | - | - | - | - | - |
| HLA-DQB1 | LUMIRACOXIB | - | 20639878 | 2.66 | - |
| HLA-DQB1 | BUCILLAMINE | - | - | 2.66 | - |
| HLA-DQB1 | CLAVULANIC ACID | - | 10535882, | 2.28 | - |
| HLA-DQB1 | FLOXACILLIN | - | 30664875 | 1.52 | - |
| HLA-DQB2 | - | - | - | - | - |
| IL18R1 | - | - | - | - | - |
| IL1RL1 | MSTT1041A | Clinical Trial/University of Leicester/COPD | - | - | NCT03615040 |
| IL1RL1 | GSK3772847 | Clinical Trial/GSK/Moderately Severe Asthma | NCT03207243 | ||
| IL33 | ITEPEKIMAB | Clinical Trial/Regerenron/COPD | - | - | NCT04701983 |
| IL33 | MSTT1041A | Clinical Trial/ Hoffmann-La Roche/Severe Asthma | - | - | NCT02918019 |
| IL33 | MEDI3506 | Clinical Trial/AstraZenica/Uncontrolled Moderate-Severe Asthma | - | - | NCT04570657 |
| IL33 | ANB020 | Clinical Trial/University of Leicester/AnaptysBio/Asthma | - | - | NCT04256044 |
| IL37 | - | - | - | - | - |
| IRAK4 | PF-06650833 | Inhibitory | - | 3.36 | NCT04575610, |
| IRAK4 | KT-474 | Small molecule degrader | - | - | NCT04772885 |
| IRAK4 | CA-4948 | - | - | NCT04278768, | |
| IRF1 | - | - | - | - | - |
| KIF1A | - | - | - | - | - |
| KIAA1109 | - | - | - | - | - |
| LRRC32 | - | - | - | - | - |
| MSL1 | - | - | - | - | - |
| MUC5AC | ENSITUXIMAB | - | - | 31.9 | - |
| ORMDL3 | - | - | - | - | - |
| PDCD1 | CEMIPLIMAB | Antibody (inhibitory), inhibitor (inhibitory) | 29863979, | 9.11 | - |
| PDCD1 | SPARTALIZUMAB | Antibody (inhibitory) | - | 9.11 | - |
| PDCD1 | TISLELIZUMAB | Antibody (inhibitory) | - | 9.11 | - |
| PDCD1 | PIDILIZUMAB | Antibody (inhibitory) | - | 6.83 | - |
| PDCD1 | AMP-224 | Antibody (inhibitory) | - | 4.56 | - |
| PDCD1 | MGA-012 | - | - | 4.56 | - |
| PDCD1 | BALSTILIMAB | - | - | 4.56 | - |
| PDCD1 | SYM-021 | - | - | 4.56 | - |
| PDCD1 | SASANLIMAB | - | - | 4.56 | - |
| PDCD1 | SINTILIMAB | - | - | 4.56 | - |
| PDCD1 | DOSTARLIMAB | - | - | 4.56 | - |
| PDCD1 | NIVOLUMAB | Inhibitor (inhibitory), antibody (inhibitory) | 23289116 | 2.13 | - |
| PDCD1 | M-7824 | - | - | 1.52 | - |
| PDCD1 | PEMBROLIZUMAB | Antibody (inhibitory), antagonist (inhibitory), inhibitor (inhibitory) | 25685857 | 1.04 | - |
| PDCD1 | Pembrolizumab/ Vibostolimab Coformulation (MK-7684A) | Clinical Trial/Merck Sharp & Dohme Corp./ Metastatic Non-Small Cell Lung Cancer | - | - | NCT04725188 |
| PDCD1LG2 | AMP-224 | Antibody | - | 15.95 | - |
| PDCD1LG2 | PEMBROLIZUMAB | - | 28619999 | 2.9 | - |
| PGAP3 | - | - | - | - | - |
| RORA | CHOLESTEROL | Agonist (activating) | 10592235, | 31.9 | - |
| RORA | T091317 | Agonist (activating) | - | 2.66 | - |
| RORA | MELATONIN | - | 8702678, | 1.77 | - |
| RPS26 | - | - | - | - | - |
| SLC22A4 | IMATINIB | - | 23127916, | 2.81 | - |
| SLC22A5 | CARNITINE | - | 21422191 | 63.79 | - |
| SLC22A5 | IMATINIB | - | 28762371, | 1.41 | - |
| SMAD3 | VACTOSERTIB | Clinical Trial/ MedPacto, Inc./Solid state tumors | - | - | NCT02160106 |
| STAT6 | CHEMBL1374370 | - | - | 5.32 | - |
| STAT6 | CHEMBL363332 | - | - | 2.66 | - |
| SUOX | - | - | - | - | - |
| TSLP | MEDI9929 | Clinical Trial/ MedImmune LLC/Severe Asthma, Atopic Dermatitis | - | - | NCT02698501, |
| TSLP | MRx-4DP0004 | Clinical Trial/ 4D pharma plc/Asthma & COVID-19 | - | - | NCT03851250, |
| WDR36 | - | - | - | - | - |
| ZAP70 | TRIDOLGOSIR | - | 17897956 | 21.26 | |
| ZAP70 | ALOISINE | Inhibitory | 10.63 | ||
| ZBTB10 | - | - | - | - | - |
| ZNF652 | - | - | - | - | - |
Interaction scores for known drug interactions with genes highlighted by our signal to gene analysis from the Drug Gene Interaction Database (DGIdb) are listed. The score is a numeric representation of publication count and source count, the ratio of average known gene partners for all drugs to the known partners for the given drug, and the ratio of average known drug partners for all genes to the known partners for the given gene. In interaction score cut off value of >1.0 were selected. Genes with drugs in clinical trials are also listed, CD274 had over 2,000 entries in .
Too many clinical trials to list, see .