| Literature DB >> 34220961 |
Jana Sticht1,2, Miguel Álvaro-Benito2, Stefan Konigorski1.
Abstract
Type 1 diabetes is an autoimmune disease with rising incidence in high-income countries. Genetic and environmental predisposing factors contribute to the etiology of the disease, although their interaction is not sufficiently understood to allow for preventive action. Strongest known associations with genetic variation map to classical HLA class II genes. Because of its genetic complexity, the HLA region has been under-represented in genome-wide association studies, having potentially hindered the identification of relevant associations underlying the etiology of the disease. Here, we performed a comprehensive HLA-wide genetic association analysis of type 1 diabetes including multi-allelic and rare variants. We used high-density whole-exome sequencing data of the HLA region in the large UK Biobank dataset to apply gene-based association tests with a carefully defined type 1 diabetes phenotype (97 cases and 48,700 controls). Exon-based and single-variant association tests were used to complement the analysis. We replicated the known association of type 1 diabetes with the classical HLA-DQ gene. Tailoring the analysis toward rare variants, we additionally identified the lysine methyl transferase EHMT2 as associated. Deeper insight into genetic variation associated with disease as presented and discussed in detail here can help unraveling mechanistic details of the etiology of type 1 diabetes. More specifically, we hypothesize that genetic variation in EHMT2 could impact autoimmunity in type 1 diabetes development.Entities:
Keywords: GWAS; HLA; UK Biobank; type 1 diabetes; whole exome sequencing
Year: 2021 PMID: 34220961 PMCID: PMC8248358 DOI: 10.3389/fgene.2021.683946
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
List of the 26 genes found to be associated with T1D, sorted by p-value of the association test using the NDR-defined T1D definition and the linear SKAT kernel after Bonferroni correction for 147 tests.
| PRRT1 | 33 | ||||||
| HLA-DQA1 | 2.02·10−02 | 3.61·10−04 | 159 | ||||
| HLA-DQB1 | 7.54·10−03 | 6.77·10−03 | 331 | ||||
| HLA-DRB5 | 436 | ||||||
| HLA-DQA2 | 8.70·10−04 | 1.09·10−02 | 2.22·10−01 | 120 | |||
| HLA-DRB1 | 6.15·10−02 | 579 | |||||
| HLA-DQB2 | 1.25·10−03 | 1.20·10−02 | 186 | ||||
| HSPA1A | 4.37·10−03 | 1.20·10−01 | 71 | ||||
| TSBP1 | 3.99·10−03 | 5.11·10−02 | 216 | ||||
| HLA-DRA | 5.82·10−02 | 6.39·10−02 | 52 | ||||
| ABHD16A | 3.11·10−02 | 1.32·10−01 | 197 | ||||
| CLIC1 | 1.47·10−02 | 5.37·10−03 | 69 | ||||
| AL669918.1 | 5.29·10−03 | 4.74·10−02 | 2.60·10−03 | 387 | |||
| VWA7 | 9.27·10−02 | 1.43·10−03 | 233 | ||||
| MSH5 | 4.77·10−02 | 2.95·10−03 | 236 | ||||
| HLA-DOB | 1.32·10−03 | 6.23·10−02 | 83 | ||||
| TAP2 | 7.76·10−03 | 4.52·10−02 | 5.85·10−03 | 328 | |||
| MSH5-SAPCD1 | 4.18·10−02 | 5.04·10−03 | 283 | ||||
| PRRC2A | 4.61·10−02 | 5.14·10−01 | 517 | ||||
| C6orf47 | 2.12·10−03 | 1.00·10+00 | 42 | ||||
| PSMB9 | 6.10·10−03 | 7.19·10−04 | 3.01·10−01 | 1.31·10−01 | 315 | ||
| BRD2 | 1.54·10−02 | 2.44·10−03 | 4.79·10−04 | 1.34·10−01 | 288 | ||
| TAP1 | 5.72·10−03 | 1.01·10−03 | 1.88·10−01 | 2.72·10−01 | 201 | ||
| CYP21A2 | 2.24·10−02 | 2.43·10−02 | 230 | ||||
| LSM2 | 6.21·10−02 | 3.83·10−01 | 39 | ||||
| BTNL2 | 6.36·10−03 | 6.12·10−04 | 2.76·10−01 | 2.02·10−01 | 172 | ||
| AL662899.3 | 3.77·10−04 | 4.52·10−02 | 2.23·10−01 | 277 | |||
| BAG6 | 4.95·10−04 | 1.16·10−01 | 1.25·10−01 | 276 | |||
| DDAH2 | 5.74·10−04 | 3.98·10−02 | 1.06·10−02 | 3.50·10−04 | 43 | ||
| ATP6V1G2 | 7.55·10−04 | 1.23·10−03 | 6.00·10−02 | 9.75·10−01 | 47 | ||
| DXO | 9.45·10−04 | 1.19·10−03 | 4.09·10−02 | 7.69·10−03 | 89 | ||
| CFB | 9.56·10−04 | 5.17·10−04 | 5.91·10−02 | 1.01·10−03 | 169 | ||
| TNXB | 1.33·10−03 | 1.59·10−02 | 1,022 | ||||
| ATP6V1G2-DDX39B | 1.62·10−03 | 4.28·10−04 | 2.20·10−03 | 4.74·10−02 | 2.04·10−01 | 119 | |
| AGER | 1.82·10−03 | 4.58·10−03 | 1.95·10−01 | 1.53·10−01 | 113 | ||
| MICB | 2.50·10−03 | 7.87·10−04 | 2.02·10−03 | 6.11·10−03 | 2.66·10−01 | 137 | |
| DDX39B | 2.72·10−03 | 8.82·10−04 | 3.47·10−03 | 5.28·10−02 | 1.33·10−01 | 94 | |
| NOTCH4 | 3.78·10−03 | 3.66·10−04 | 6.85·10−03 | 3.95·10−01 | 1.18·10−03 | 404 | |
| VARS1 | 5.56·10−03 | 8.50·10−04 | 1.01·10−01 | 9.05·10−01 | 277 | ||
| ATF6B | 5.97·10−03 | 6.73·10−04 | 3.32·10−03 | 9.56·10−02 | 5.97·10−01 | 201 | |
| GPANK1 | 6.47·10−03 | 7.12·10−04 | 8.25·10−02 | 9.01·10−02 | 106 | ||
| PPT2 | 7.63·10−03 | 1.69·10−02 | 2.01·10−01 | 6.03·10−01 | 88 | ||
| PPT2-EGFL8 | 1.84·10−02 | 3.46·10−04 | 3.65·10−02 | 1.61·10−01 | 8.43·10−02 | 203 | |
| GPSM3 | 1.86·10−02 | 3.97·10−02 | 5.81·10−01 | 1.17·10−01 | 57 | ||
| RNF5 | 1.91·10−02 | 3.36·10−02 | 4.73·10−01 | 1.32·10−01 | 36 | ||
| AL645922.1 | 2.39·10−02 | 4.77·10−04 | 1.06·10−03 | 1.48·10−02 | 8.50·10−02 | 376 | |
| EHMT2 | 2.84·10−02 | 3.57·10−03 | 2.53·10−03 | 2.32·10−02 | 1.95·10−01 | 296 | |
| ZNRD1 | 3.09·10−02 | 2.51·10−03 | 5.60·10−02 | 2.38·10−01 | 1.28·10−01 | 44 | |
| NELFE | 5.03·10−01 | 1.47·10−01 | 2.66·10−01 | 3.79·10−01 | 9.13·10−01 | 91 |
Additionally, results using the linear SKAT kernel with the four alternative T1D definitions (ICD10, weak, stringent, self-reported) as well as using the linear-weighted SKAT kernel with the NDR-defined T1D definition are also shown. The number of SNVs per analyzed gene is given in the last column. Significant p-values after Bonferroni correction (<0.05/147 i.e., 3.40·10.
Figure 1(A) Manhattan plot of all 147 analyzed genes showing the position of the 26 genes found significantly associated after Bonferroni correction using the linear kernel SKAT test (see also Table 1). (B) Manhattan plot for all 1,209 analyzed exons showing the position of the 40 exons found significantly associated after Bonferroni correction using the linear kernel SKAT test (see also Table 2). (C) Manhattan plot for all 20,236 analyzed SNVs showing the position of the 92 SNVs found significantly associated after Bonferroni correction using logistic regression (see also Supplementary Table 6). The line indicates the significance cutoff after Bonferroni correction that is (A) [–log (0.05/147) = 3.47], (B) [–log (0.05/1209) = 4.38] and (C) [–log (0.05/20236) = 5.61]. A circle is drawn for every SNV or at every gene's or exon's start position.
List of exons found to be significantly associated with NDR-defined T1D using the linear as well as the linear-weighted SKAT kernel, after Bonferroni correction for 1,209 tests.
| ENSE00001707124 | PPP1R10 | exon8 | 2 | ||
| ENSE00003635259 | BAG6 | exon16 | 3 | ||
| ENSE00001656329 | COL11A2 | exon55 | 3 | ||
| ENSE00001766857 | HLA-DQA1 | exon1 | 3.64·10−02 | 16 | |
| ENSE00001836503 | SLC44A4 | exon21 | 6 | ||
| ENSE00003786549 | FLOT1 | exon9 | 4 | ||
| ENSE00001792039 | TNXB | exon30 | 11 | ||
| ENSE00001619685 | HLA-DPA1 | exon1 | 14 | ||
| ENSE00003421994 | HLA-DQB1 | exon2 | 2.33·10−02 | 71 | |
| ENSE00001596914 | HLA-DQB1 | exon3 | 3.73·10−01 | 25 | |
| ENSE00001930619 | HLA-DRB1 | exon1 | 9.00·10−04 | 69 | |
| ENSE00001911406 | HLA-DPB1 | exon1 | 8 | ||
| ENSE00003562109 | HLA-DQB1 | exon4 | 4.93·10−01 | 13 | |
| ENSE00001641881 | HLA-DRB5 | exon3 | 5.82·10−05 | 50 | |
| ENSE00001703275 | HLA-DQB2 | exon4 | 1.76·10−02 | 15 | |
| ENSE00003658074 | HLA-DQA1 | exon3 | 5.99·10−01 | 24 | |
| ENSE00001614414 | HLA-DQA2 | exon4 | 9.10·10−02 | 22 | |
| ENSE00001723559 | HLA-DQB2 | exon3 | 6.27·10−02 | 26 | |
| ENSE00001768120 | HLA-DRB5 | exon2 | 1.20·10−01 | 107 | |
| ENSE00001731226 | HLA-DRB1 | exon2 | 1.23·10−01 | 104 | |
| ENSE00001465231 | HLA-DQA1 | exon4 | 1.14·10−02 | 35 | |
| ENSE00003589061 | ABCF1 | exon15 | 2 | ||
| ENSE00001715371 | HLA-DRB5 | exon4 | 2.41·10−01 | 13 | |
| ENSE00001646872 | PSMB8 | exon1 | 8 | ||
| ENSE00003839368 | TNXB | exon44 | 4.41·10−02 | 16 | |
| ENSE00003843744 | CYP21A2 | exon10 | 1.19·10−01 | 42 | |
| ENSE00003744412 | TSBP1 | exon16 | 2.09·10−01 | 2 | |
| ENSE00003555889 | VWA7 | exon8 | 8 | ||
| ENSE00001663669 | HLA-DQB2 | exon5 | 5.00·10−05 | 45 | |
| ENSE00003725416 | TSBP1 | exon32 | 3.27·10−01 | 13 | |
| ENSE00003739283 | TSBP1 | exon1 | 1.09·10−02 | 9 | |
| ENSE00001690505 | HSPA1A | exon1 | 1.20·10−01 | 71 | |
| ENSE00003465858 | AL669918.1 | exon11 | 10 | ||
| ENSE00003685114 | TAP2 | exon11 | 10 | ||
| ENSE00003703225 | HSPA1A | exon2 | 2.27·10−01 | 32 | |
| ENSE00001801024 | HLA-DRB1 | exon3 | 8.41·10−04 | 61 | |
| ENSE00001858405 | CLIC1 | exon1 | 2.63·10−04 | 13 | |
| ENSE00001625708 | HLA-DOB | exon1 | 8 | ||
| ENSE00001775810 | HLA-DQB2 | exon2 | 1.88·10−01 | 9 | |
| ENSE00001691563 | MICB | exon3 | 2.94·10−01 | 14 | |
| ENSE00001727868 | PSMB9 | exon5 | 4.15·10−03 | 10 | |
| ENSE00003580154 | EHMT2 | exon2 | 1.61·10−01 | 9 | |
| ENSE00003463002 | VARS2 | exon26 | 7.16·10−01 | 8 | |
| ENSE00003569101 | SKIV2L | exon8 | 7.73·10−01 | 8 |
Results are sorted by p-value of the association test using the linear SKAT kernel. Significant p-values after Bonferroni correction (<0.05/1,209 i.e., <4.14·10.
Figure 2(A) Venn diagram for genes found to be significantly associated with the T1D definitions NDR, ICD10, weak and stringent using the linear SKAT model. (B) Venn diagram for genes found to be significantly associated with NDR-defined T1D in the single-SNV analysis and the gene- or exon-based test using the linear SKAT model. Six genes (HLA-DQA1, -DQB1, -DRB1, -DQA2, -DQB2, -DRB5) were identified in all tests, eight genes (AL669918.1, CLIC1, CYP21A2, HLA-DOB, HSPA1A, TAP2, TSBP1, and VWA7) in the gene- and exon-based SKAT tests, and three genes (HLA-DRA, PRRT1, and TAP1) in the gene-based SKAT and the single-SNV analysis. TNXB was identified in the exon-based and the single-SNV analysis. (C) Venn diagram for genes found to be significantly associated with NDR-defined T1D in the gene- or exon-based test using the linear-weighted SKAT model. Three genes (HLA-DOB, TNXB, and EHMT2) were identified in both tests.