| Literature DB >> 30863397 |
Manuel Martínez-Bueno1, Marta E Alarcón-Riquelme2.
Abstract
The importance of low frequency and rare variation in complex disease genetics is difficult to estimate in patient populations. Genome-wide association studies are therefore, underpowered to detect rare variation. We have used a combined approach of genome-wide-based imputation with a highly stringent sequence kernel association (SKAT) test and a case-control burden test. We identified 98 candidate genes containing rare variation that in aggregate show association with SLE many of which have recognized immunological function, but also function and expression related to relevant tissues such as the joints, skin, blood or central nervous system. In addition we also find that there is a significant enrichment of genes annotated for disease-causing mutations in the OMIM database, suggesting that in complex diseases such as SLE, such mutations may be involved in subtle or combined phenotypes or could accelerate specific organ abnormalities found in the disease. We here provide an important resource of candidate genes for SLE.Entities:
Keywords: GWAS—genome-wide association study; SLE; aggregated case-control enrichment; imputated rare variation; sequence kernel association test; systemic lupus erythematosus
Mesh:
Year: 2019 PMID: 30863397 PMCID: PMC6399402 DOI: 10.3389/fimmu.2019.00258
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1After QC filtering, imputation provided a set of 5,305,811 markers: 2,595,206 with MAF > 1% (48.93%) and 2,709,605 with MAF < 1% (51.07%). A set of 1,549,436 independent markers was obtained by applying a threshold of r2 < 0.1. Rare variation was less affected by the linkage disequilibrium than common variation, which resulted in 87,853 variants with MAF > 1% (5.56%) and 1,491,583 variants with MAF < 1% (94.44%). This last set of 1,491,583 independent rare-variants constituted our working set.
Number of independent rare variants vs. sum of minor alleles.
| Singletons | 189,893 (14.5%) | 189,893 (0.64%) |
| MAF < 0.01% | 676,621 (51.8%) | 4,671,537 (16.04%) |
| 0.01% > MAF ≤ 1% | 439,810 (33.7%) | 24,266,676 (83.33%) |
| 1,306,324 (100%) | 29,128,106 (100%) |
Rare variation was classified into 3 categories by their minor allele frequencies (MAFs): (1) singletons that in the 8,277 individuals sample means a MAF = 0.006%, (2) “very rare-variants,” 0.006% < MAF < 0.1%, (3) and rare variation in a “strict sense,” 0.1% < MAF ≤ 1%.
Figure 2Numbers of variants were represented by dark columns: in the set of 1,306,324 independent (r2 < 0.1) polymorphic rare variants (MAF < 0.1), 189,893 (14.5%) were singletons (in the 8,277 individuals sample this means a MAF = 0.006%), 676,621 (51.8%) were classified as “very rare-variants” (0.006% < MAF < 0.1%), and 439,810 (33.7%) considered as a rare variation in a “strict sense” (0.1% < MAF ≤ 1%); these numbers of variants were represented by dark columns. Lighter columns represented the sums of alleles of minor frequency in each of the 3 categories of rare variation, these 3 categories sum up 29,128,106 minor alleles: the 189,893 singletons represented only 0.65% of minor alleles; the 676,621 the markers, which were very rare variation, add up to 4,671,537 minor frequency alleles, it was the 16.04%; while the remaining markers, sum up to 24,266,676 of minor frequency alleles representing the 83.33%.
Functional annotation of rare variation with and without r2 filtering.
| Intronic | 2,305,643 (85.09%) | 1,253,088 (84.011%) |
| Intergenic | 175,180 (6.465%) | 101,187 (6.784%) |
| UTR3 | 46,322 (1.71%) | 27,637 (1.853%) |
| Exonic | 46,133 (1.70%) | 30,053 (2.015%) |
| Downstream | 26,690 (0.985%) | 16,688 (1.119%) |
| Upstream | 23,791 (0.878%) | 15,095 (1.012%) |
| UTR5 | 9,872 (0.364%) | 6,215 (0.417%) |
| Splicing | 231 (0.009%) | 171 (0.012%) |
| ncRNA intronic | 68,381 (2.524%) | 36,967 (2.478%) |
| ncRNA exonic | 6,066 (0.224%) | 3,600 (0.241%) |
Figure 3Functional annotation of rare variation. Note that the percentages of the different functional categories in rare variation with and without r2 filtering was similar, being the intronic the most abundant category, 85% of the total.
Figure 4Functional annotation of exonic rare variation. The exonic rare variants represented only 2%, and more than 98% of these exonic mutations were synonymous.
Best gene candidates for SLE association through rare variation in European ancestry population.
| ZEB1 | Zinc finger E-box binding homeobox 1 | 74 | 1077 | 886 | 2.03 | 1.35 | 3.05 | <1.00E-03 | <1.00E-03 |
| PRKAG3 | Protein kinase, AMP-activated, gamma 3 non-catalytic subunit | 4 | 53 | 95 | 0.55 | 0.39 | 0.78 | <1.00E-03 | <1.00E-03 |
| COQ10B | Coenzyme Q10 homolog B (S. cerevisiae) | 3 | 11 | 38 | 0.25 | 0.13 | 0.5 | <1.00E-03 | <1.00E-03 |
| MAD2L2 | MAD2 mitotic arrest deficient-like 2 (yeast) | 19 | 417 | 517 | 0.59 | 0.45 | 0.77 | <1.00E-03 | 3.60E-03 |
| TMEM69 | Transmembrane protein 69 | 5 | 18 | 42 | 0.32 | 0.19 | 0.54 | <1.00E-03 | 4.40E-03 |
| KRTAP9-2 | Keratin associated protein 9-2 | 3 | 4 | 14 | 0.16 | 0.05 | 0.5 | <1.00E-03 | 6.20E-03 |
| SERINC4 | Serine incorporator 4 | 4 | 27 | 4 | 5.98 | 2.23 | 16.03 | <1.00E-03 | 6.40E-03 |
| CCR3 | Chemokine (C-C motif) receptor 3 | 33 | 750 | 431 | 3.74 | 1.74 | 8.02 | <1.00E-03 | 8.40E-03 |
| CYP26B1 | Cytochrome P450, family 26, subfamily B, polypeptide 1 | 20 | 170 | 93 | 1.7 | 1.31 | 2.21 | <1.00E-03 | 1.01E-02 |
| TMEM106B | Transmembrane protein 106B | 24 | 103 | 244 | 0.56 | 0.38 | 0.82 | <1.00E-03 | 1.77E-02 |
| POU3F3 | POU class 3 homeobox 3 | 2 | 66 | 33 | 3.47 | 1.46 | 8.22 | <1.00E-03 | 2.41E-02 |
| TIGD7 | Tigger transposable element derived 7 | 14 | 109 | 55 | 3.04 | 1.58 | 5.86 | 1.80E-03 | 1.39E-02 |
| KLF1 | Kruppel-like factor 1 (erythroid) | 3 | 90 | 48 | 3.01 | 1.46 | 6.2 | 1.80E-03 | 2.29E-02 |
| PSMB8 | Proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional peptidase 7) | 30 | 439 | 490 | 0.64 | 0.48 | 0.84 | 1.80E-03 | 2.46E-02 |
| MAPK15 | Mitogen-activated protein kinase 15 | 6 | 55 | 78 | 0.57 | 0.4 | 0.82 | 1.80E-03 | 2.67E-02 |
| AMACR | Alpha-methylacyl-CoA racemase | 20 | 347 | 261 | 1.64 | 1.17 | 2.29 | 1.80E-03 | 2.77E-02 |
| UBE3A | Ubiquitin protein ligase E3A | 73 | 1164 | 1340 | 0.53 | 0.35 | 0.82 | 1.80E-03 | 3.93E-02 |
| TRIM16L | Tripartite motif containing 16-like | 13 | 179 | 227 | 0.52 | 0.34 | 0.78 | 3.40E-03 | <1.00E-03 |
| CTSK | Cathepsin K | 3 | 23 | 41 | 0.26 | 0.1 | 0.68 | 3.40E-03 | 6.20E-03 |
| HYPK | Huntingtin interacting protein K | 3 | 33 | 6 | 13.76 | 2.45 | 77.3 | 3.40E-03 | 8.30E-03 |
| C4A | Complement component 4A (Rodgers blood group) | 2 | 17 | 33 | 0.43 | 0.23 | 0.79 | 3.40E-03 | 1.40E-02 |
| WNT10A | Wingless-type MMTV integration site family, member 10A | 16 | 153 | 215 | 0.52 | 0.34 | 0.81 | 3.40E-03 | 1.57E-02 |
| OR5B12 | Olfactory receptor, family 5, subfamily B, member 12 | 3 | 98 | 141 | 0.44 | 0.26 | 0.75 | 3.40E-03 | 1.78E-02 |
| TAGLN3 | Transgelin 3 | 27 | 225 | 275 | 0.58 | 0.4 | 0.84 | 3.40E-03 | 2.62E-02 |
| PALM | Paralemmin | 65 | 653 | 740 | 0.52 | 0.34 | 0.79 | 3.40E-03 | 2.84E-02 |
| SPACA1 | Sperm acrosome associated 1 | 25 | 405 | 279 | 2.02 | 1.3 | 3.12 | 4.80E-03 | 6.10E-03 |
| POPDC2 | Popeye domain containing 2 | 36 | 444 | 309 | 2.47 | 1.39 | 4.4 | 4.80E-03 | 6.30E-03 |
| ALG11 | Asparagine-linked glycosylation 11, alpha-1,2-mannosyltransferase homolog (yeast) | 10 | 60 | 91 | 0.59 | 0.42 | 0.83 | 4.80E-03 | 7.90E-03 |
| SF3B4 | Splicing factor 3b, subunit 4, 49kDa | 3 | 58 | 88 | 0.39 | 0.2 | 0.78 | 4.80E-03 | 1.78E-02 |
| DUSP7 | Dual specificity phosphatase 7 | 4 | 32 | 12 | 8.12 | 2.03 | 32.54 | 4.80E-03 | 3.11E-02 |
| MRGPRX4 | MAS-related GPR, member X4 | 9 | 74 | 108 | 0.62 | 0.46 | 0.85 | 6.30E-03 | 7.50E-03 |
| AQP8 | Aquaporin 8 | 9 | 141 | 83 | 2.5 | 1.42 | 4.41 | 6.30E-03 | 1.15E-02 |
| PI3 | Peptidase inhibitor 3, skin-derived | 2 | 20 | 4 | 3.85 | 1.29 | 11.5 | 6.30E-03 | 1.58E-02 |
| FRS3 | Fibroblast growth factor receptor substrate 3 | 28 | 384 | 447 | 0.66 | 0.49 | 0.87 | 6.30E-03 | 2.78E-02 |
| MICB | MHC class I polypeptide-related sequence B | 47 | 1110 | 1166 | 0.62 | 0.45 | 0.84 | 6.30E-03 | 2.97E-02 |
| GPBP1L1 | GC-rich promoter binding protein 1-like 1 | 10 | 172 | 193 | 0.57 | 0.38 | 0.86 | 7.70E-03 | <1.00E-03 |
| DOLK | Dolichol kinase | 1 | 15 | 3 | 5.66 | 1.59 | 20.2 | 7.70E-03 | 9.90E-03 |
| CSTA | Cystatin A (stefin A) | 23 | 266 | 200 | 2.26 | 1.28 | 3.97 | 7.70E-03 | 1.59E-02 |
| KLHL31 | Kelch-like 31 (Drosophila) | 39 | 465 | 518 | 0.51 | 0.32 | 0.81 | 7.70E-03 | 1.61E-02 |
| GPR26 | G protein-coupled receptor 26 | 58 | 934 | 734 | 332 | 1.25 | 2.94 | 7.70E-03 | 1.65E-02 |
| PRDM1 | PR domain containing 1, with ZNF domain | 26 | 278 | 332 | 0.62 | 0.44 | 0.86 | 7.70E-03 | 2.45E-02 |
| COX17 | COX17 cytochrome c oxidase assembly homolog (S. cerevisiae) | 32 | 398 | 280 | 2.13 | 1.31 | 3.46 | 7.70E-03 | 3.26E-02 |
| SAMSN1 | SAM domain, SH3 domain and nuclear localization signals 1 | 129 | 1977 | 1622 | 1.74 | 1.21 | 2.5 | 9.10E-03 | 1.20E-02 |
| ST8SIA2 | ST8 alpha-N-acetyl-neuraminide alpha-2,8-sialyltransferase 2 | 153 | 1583 | 1653 | 0.51 | 0.32 | 0.81 | 9.10E-03 | 1.75E-02 |
| MAS1L | MAS1 oncogene-like | 12 | 141 | 171 | 0.52 | 0.32 | 0.83 | 9.10E-03 | 3.38E-02 |
| TBCB | Tubulin folding cofactor B | 21 | 236 | 174 | 1.66 | 1.14 | 2.42 | 1.05E-02 | <1.00E-03 |
| FGR | Gardner-Rasheed feline sarcoma viral (v-fgr) oncogene homolog | 6 | 154 | 93 | 1.97 | 1.17 | 3.3 | 1.05E-02 | 2.67E-02 |
| WNK4 | WNK lysine deficient protein kinase 4 | 5 | 161 | 116 | 2.06 | 1.25 | 3.41 | 1.05E-02 | 4.41E-02 |
| MGAT5 | Mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyl-glucosaminyltransferase | 426 | 3961 | 3984 | 0.51 | 0.33 | 0.79 | 1.18E-02 | 2.64E-02 |
| FEZF1 | FEZ family zinc finger 1 | 3 | 49 | 74 | 0.37 | 0.18 | 0.79 | 1.18E-02 | 4.02E-02 |
| ZNF461 | Zinc finger protein 461 | 6 | 55 | 80 | 0.37 | 0.18 | 0.77 | 1.32E-02 | 2.34E-02 |
| RAB25 | RAB25, member RAS oncogene family | 8 | 82 | 110 | 0.45 | 0.25 | 0.81 | 1.32E-02 | 3.94E-02 |
| APLNR | Apelin receptor | 5 | 65 | 44 | 2.67 | 1.21 | 5.87 | 1.45E-02 | 1.07E-02 |
| DUS1L | Dihydrouridine synthase 1-like (S. cerevisiae) | 15 | 58 | 89 | 0.38 | 0.19 | 0.75 | 1.45E-02 | 2.60E-02 |
| ASAH2 | N-acylsphingosine amidohydrolase (non-lysosomal ceramidase) 2 | 16 | 256 | 177 | 2.34 | 1.3 | 4.24 | 1.45E-02 | 2.76E-02 |
| MRPS18B | Mitochondrial ribosomal protein S18B | 23 | 554 | 584 | 0.61 | 0.42 | 0.88 | 1.58E-02 | 1.76E-02 |
| IL18RAP | Interleukin 18 receptor accessory protein | 22 | 278 | 213 | 1.62 | 1.12 | 2.34 | 1.58E-02 | 2.77E-02 |
| BNIP2 | BCL2/adenovirus E1B 19kDa interacting protein 2 | 21 | 196 | 246 | 0.59 | 0.4 | 0.87 | 1.58E-02 | 2.93E-02 |
| HAX1 | HCLS1 associated protein X-1 | 2 | 47 | 17 | 2.04 | 1.16 | 3.61 | 1.58E-02 | 3.08E-02 |
| ITFG3 | Integrin alpha FG-GAP repeat containing 3 | 72 | 993 | 1076 | 0.5 | 0.3 | 0.8 | 1.58E-02 | 3.63E-02 |
| ZNF99 | Zinc finger protein 99 | 8 | 18 | 32 | 0.24 | 0.08 | 0.74 | 1.71E-02 | 1.51E-02 |
| ZIK1 | zinc finger protein interacting with K protein 1 homolog (mouse) | 15 | 173 | 116 | 1.9 | 1.18 | 3.06 | 1.71E-02 | 2.43E-02 |
| PDZK1 | PDZ domain containing 1 | 12 | 324 | 376 | 0.67 | 0.48 | 0.92 | 1.84E-02 | 2.43E-02 |
| KRCC1 | Lysine-rich coiled-coil 1 | 28 | 272 | 203 | 1.64 | 1.12 | 2.41 | 1.84E-02 | 4.45E-02 |
| MAP1LC3C | Microtubule-associated protein 1 light chain 3 gamma | 10 | 161 | 206 | 0.58 | 0.38 | 0.89 | 1.97E-02 | 2.24E-02 |
| LRP4 | Low density lipoprotein receptor-related protein 4 | 1 | 12 | 2 | 5.28 | 1.16 | 23.99 | 1.97E-02 | 2.59E-02 |
| OR52K2 | Olfactory receptor, family 52, subfamily K, member 2 | 14 | 158 | 196 | 0.57 | 0.36 | 0.88 | 1.97E-02 | 4.29E-02 |
| GML | Glycosylphosphatidylinositol anchored molecule like protein | 66 | 427 | 496 | 0.45 | 0.25 | 0.8 | 2.10E-02 | 1.30E-02 |
| ARL4A | ADP-ribosylation factor-like 4A | 19 | 163 | 97 | 1.96 | 1.17 | 3.28 | 2.10E-02 | 3.85E-02 |
| ANKRD39 | Ankyrin repeat domain 39 | 3 | 36 | 55 | 0.58 | 0.38 | 0.9 | 2.22E-02 | 1.95E-02 |
| HLA-DRB1 | Major histocompatibility complex, class II, DR beta 1 | 6 | 52 | 71 | 0.4 | 0.19 | 0.83 | 2.22E-02 | 2.51E-02 |
| PNO1 | Partner of NOB1 homolog (S. cerevisiae) | 8 | 79 | 44 | 2.49 | 1.17 | 5.31 | 2.22E-02 | 3.94E-02 |
| LCE2D | Late cornified envelope 2D | 3 | 45 | 49 | 0.38 | 0.17 | 0.86 | 2.22E-02 | 4.75E-02 |
| ANAPC11 | Anaphase promoting complex subunit 11 | 1 | 18 | 33 | 0.52 | 0.29 | 0.94 | 2.35E-02 | 4.86E-02 |
| OR7A17 | Olfactory receptor, family 7, subfamily A, member 17 | 7 | 109 | 73 | 2.15 | 1.16 | 4.01 | 2.47E-02 | 3.93E-02 |
| PTGDS | Prostaglandin D2 synthase 21kDa (brain) | 17 | 166 | 124 | 1.76 | 1.09 | 2.86 | 2.60E-02 | 1.01E-02 |
| CCR1 | Chemokine (C-C motif) receptor 1 | 8 | 149 | 89 | 1.81 | 1.09 | 3 | 2.72E-02 | 7.30E-03 |
| CCDC12 | Coiled-coil domain containing 12 | 29 | 380 | 272 | 1.98 | 1.13 | 3.47 | 2.97E-02 | 1.95E-02 |
| REEP4 | Receptor accessory protein 4 | 10 | 115 | 83 | 2.03 | 1.13 | 3.65 | 3.09E-02 | 3.22E-02 |
| ZNF513 | Zinc finger protein 513 | 3 | 18 | 29 | 0.51 | 0.28 | 0.94 | 3.09E-02 | 3.56E-02 |
| SPATA8 | Spermatogenesis associated 8 | 19 | 145 | 100 | 1.87 | 1.1 | 3.16 | 3.22E-02 | 3.60E-03 |
| TAF15 | TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa | 49 | 439 | 346 | 1.89 | 1.12 | 3.19 | 3.22E-02 | 3.13E-02 |
| POLR2I | Polymerase (RNA) II (DNA directed) polypeptide I, 14.5kDa | 11 | 142 | 96 | 1.7 | 1.07 | 2.71 | 3.22E-02 | 3.84E-02 |
| KCTD5 | Potassium channel tetramerisation domain containing 5 | 24 | 217 | 250 | 0.54 | 0.33 | 0.89 | 3.34E-02 | 1.59E-02 |
| NKX2-5 | NK2 homeobox 5 | 11 | 109 | 74 | 2.06 | 1.11 | 3.79 | 3.46E-02 | 3.23E-02 |
| SCARA3 | Scavenger receptor class A, member 3 | 69 | 593 | 469 | 1.68 | 1.09 | 2.59 | 3.46E-02 | 4.51E-02 |
| EFNA4 | Ephrin-A4 | 7 | 37 | 51 | 0.39 | 0.16 | 0.93 | 3.70E-02 | 2.52E-02 |
| NRM | Nurim (nuclear envelope membrane protein) | 11 | 229 | 233 | 0.65 | 0.44 | 0.95 | 3.82E-02 | 2.26E-02 |
| TMEM55B | Transmembrane protein 55B | 8 | 72 | 45 | 1.57 | 1.06 | 2.31 | 3.94E-02 | 9.30E-03 |
| MPL | Myeloproliferative leukemia virus oncogene | 15 | 83 | 44 | 2.48 | 1.16 | 5.31 | 4.06E-02 | 1.93E-02 |
| PTGS2 | Prostaglandin-endoperoxide synthase 2 (prostaglandin G/H synthase and cyclooxygenase) | 6 | 79 | 109 | 0.29 | 0.1 | 0.86 | 4.18E-02 | 3.61E-02 |
| NT5DC1 | 5'-nucleotidase domain containing 1 | 87 | 736 | 558 | 2.43 | 1.09 | 5.41 | 4.18E-02 | 4.54E-02 |
| MAFG | v-maf musculoaponeurotic fibrosarcoma oncogene homolog G (avian) | 7 | 68 | 90 | 0.69 | 0.5 | 0.97 | 4.54E-02 | 1.11E-02 |
| C5orf28 | Chromosome 5 open reading frame 28 | 10 | 72 | 90 | 0.52 | 0.28 | 0.99 | 4.54E-02 | 3.22E-02 |
| C8orf58 | Chromosome 8 open reading frame 58 | 19 | 347 | 413 | 0.6 | 0.38 | 0.95 | 4.66E-02 | 3.85E-02 |
| SNX5 | Sorting nexin 5 | 63 | 490 | 374 | 1.99 | 1.08 | 3.64 | 4.77E-02 | 2.51E-02 |
| PTER | Phosphotriesterase related | 98 | 846 | 716 | 2.11 | 1.06 | 4.2 | 4.89E-02 | 5.50E-03 |
| ZNF708 | Zinc finger protein 708 | 8 | 68 | 88 | 0.54 | 0.31 | 0.96 | 4.89E-02 | 1.69E-02 |