| Literature DB >> 24587899 |
Paula S Ramos1, Stephanie R Shaftman2, Ralph C Ward2, Carl D Langefeld3.
Abstract
The reasons for the ethnic disparities in the prevalence of systemic lupus erythematosus (SLE) and the relative high frequency of SLE risk alleles in the population are not fully understood. Population genetic factors such as natural selection alter allele frequencies over generations and may help explain the persistence of such common risk variants in the population and the differential risk of SLE. In order to better understand the genetic basis of SLE that might be due to natural selection, a total of 74 genomic regions with compelling evidence for association with SLE were tested for evidence of recent positive selection in the HapMap and HGDP populations, using population differentiation, allele frequency, and haplotype-based tests. Consistent signs of positive selection across different studies and statistical methods were observed at several SLE-associated loci, including PTPN22, TNFSF4, TET3-DGUOK, TNIP1, UHRF1BP1, BLK, and ITGAM genes. This study is the first to evaluate and report that several SLE-associated regions show signs of positive natural selection. These results provide corroborating evidence in support of recent positive selection as one mechanism underlying the elevated population frequency of SLE risk loci and supports future research that integrates signals of natural selection to help identify functional SLE risk alleles.Entities:
Year: 2014 PMID: 24587899 PMCID: PMC3920976 DOI: 10.1155/2014/203435
Source DB: PubMed Journal: Autoimmune Dis ISSN: 2090-0430
Genetic regions with compelling evidence for association with SLE.
| Gene(s) region | Chr | Pos (Mb) |
|---|---|---|
| C1q [ | 1 | 22.96 |
| IL12RB2 [ | 1 | 67.55 |
| PTPN22 [ | 1 | 114.16 |
| FCGR2A, FCGR3A [ | 1 | 159.74 |
| TNFSF4 [ | 1 | 171.42 |
| NMNAT2 [ | 1 | 181.48 |
| NCF2 [ | 1 | 181.79 |
| APOBEC4 [ | 1 | 181.88 |
| CFH [ | 1 | 194.89 |
| CFHR1, CFHR4 [ | 1 | 196.79 |
| CRP [ | 1 | 199.72 |
| IL10 [ | 1 | 205.01 |
| LYST [ | 1 | 233.89 |
| RASGRP3 [ | 2 | 33.51 |
| TET3, DGUOK [ | 2 | 74.21 |
| IFIH1 [ | 2 | 162.83 |
| STAT4 [ | 2 | 191.60 |
| PDCD1 [ | 2 | 242.44 |
| SCN10A [ | 3 | 38.71 |
| TREX1 [ | 3 | 48.48 |
| DNASE1L3 [ | 3 | 58.15 |
| PXK [ | 3 | 58.29 |
| TMEM39A [ | 3 | 120.63 |
| CD80 [ | 3 | 120.73 |
| AFF1 [ | 4 | 88.15 |
| BANK1 [ | 4 | 102.93 |
| LEF1 [ | 4 | 109.19 |
| IL21 [ | 4 | 123.75 |
| PPP2CA [ | 5 | 133.53 |
| TNIP1 [ | 5 | 150.39 |
| PTTG1 [ | 5 | 159.78 |
| C4 [ | 6 | 32.09 |
| HLA-DRB1 [ | 6 | 32.59 |
| ITPR3 [ | 6 | 33.70 |
| UHRF1BP1 [ | 6 | 34.87 |
| BACH2 [ | 6 | 90.69 |
| ATG5, PRDM1 [ | 6 | 106.53 |
| TNFAIP3 [ | 6 | 138.23 |
| ICA1 [ | 7 | 8.12 |
| JAZF1 [ | 7 | 27.84 |
| IKZF1 [ | 7 | 50.31 |
| IRF5, TNPO3 [ | 7 | 128.37 |
| XKR6 [ | 8 | 10.79 |
| BLK [ | 8 | 11.39 |
| LYN [ | 8 | 56.95 |
| ARMC3 [ | 10 | 23.26 |
| LRRC18, WDFY4 [ | 10 | 49.89 |
| ARID5B, RTKN2 [ | 10 | 63.94 |
| SLC29A3 [ | 10 | 72.75 |
| PHRF1, IRF7 [ | 11 | 0.58 |
| CD44, PDHX [ | 11 | 34.94 |
| DDX6 [ | 11 | 118.13 |
| ETS1 [ | 11 | 127.83 |
| CREBL2, GPR19, CDKN1B [ | 12 | 12.66 |
| DRAM1 [ | 12 | 102.27 |
| SLC15A4 [ | 12 | 127.84 |
| ELF1 [ | 13 | 40.40 |
| C2 [ | 14 | 20.75 |
| CSK [ | 15 | 72.86 |
| DNASE1 [ | 16 | 3.64 |
| CLEC16A [ | 16 | 11.04 |
| PRKCB [ | 16 | 23.75 |
| SEZ6L2 [ | 16 | 29.79 |
| ITGAM, ITGAX [ | 16 | 31.18 |
| IRF8 [ | 16 | 84.49 |
| IKZF3, ZPBP2 [ | 17 | 37.91 |
| CD226 [ | 18 | 65.68 |
| TYK2 [ | 19 | 10.32 |
| ICAM1, ICAM4, ICAM5 [ | 19 | 10.40 |
| ACP5 [ | 19 | 11.55 |
| DDA1 [ | 19 | 17.28 |
| UBE2L3 [ | 22 | 20.25 |
| SCUBE1 [ | 22 | 41.93 |
| IRAK1, MECP2 [ | X | 152.93 |
The reference list for each gene region does not intent to be exhaustive; instead, only the first and/or strongest associations reported to date are mentioned. A comprehensive list of all the studies that report each region have been recently reviewed elsewhere [16–18]. Chr: chromosome; Pos: position (in Mega basepairs) according to Human Genome Build hg18.
Regions with evidence for selection on the HapMap populations.
| Gene region | Chr | Mb | iHS |
| FST | Empirical | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Max −log( | Pop | Max −log( | Pop | Max −log( | Value | Pop | Min | Pop | |||
| PTPN22 | 1 | 114.158 | — | — | 3.6 | YRI | 3.2 | 0.75 | YRI versus ASN | — | — |
| TNFSF4 | 1 | 171.419 | 2.5 | ASN | 2.3 | ASN | 2.7 | 0.60 | YRI versus ASN | 0.005 | ASN |
| NMNAT2 | 1 | 181.484 | 2.5 | ASN | 2.4 | CEU | — | — | — | 0.004 | ASN |
| NCF2 | 1 | 181.791 | 2.5 | ASN | — | — | — | 0.65 | CEU versus YRI | 0.004 | ASN |
| APOBEC4 | 1 | 181.882 | 2.5 | ASN | — | — | — | 0.65 | CEU versus YRI | 0.004 | ASN |
| CFH | 1 | 194.888 | — | — | — | — | 3.0 | 0.60 | YRI versus ASN | — | — |
| CFHR1, CFHR4 | 1 | 196.789 | 2.0 | YRI | — | — | 3.0 | 0.60 | YRI versus ASN | — | — |
| TET3, DGUOK | 2 | 74.212 | 2.7 | CEU | 2.6 | ASN | 3.2 | 0.85 | YRI versus ASN | 0.001 | CEU |
| IFIH1 | 2 | 162.832 | — | — | 3.8 | CEU | 2.2 | 0.78 | CEU versus YRI | — | — |
| TREX1 | 3 | 48.481 | 2.4 | ASN | 2.1 | ASN | — | — | — | 0.002 | ASN |
| TNIP1 | 5 | 150.390 | — | — | 3.0 | CEU | — | 0.65 | CEU versus YRI | — | — |
| ITPR3 | 6 | 33.697 | 3.4 | YRI | 3.3 | YRI | 3.3 | 0.60 | YRI versus ASN | — | — |
| UHRF1BP1 | 6 | 34.868 | 2.5 | CEU | 2.4 | YRI | — | 0.50 | — | 0.004 | CEU |
| XKR6 | 8 | 10.791 | 2.7 | ASN | 3.3 | ASN | 2.6 | 0.92 | YRI versus ASN | 0.003 | ASN |
| BLK | 8 | 11.389 | 2.7 | ASN | 3.2 | ASN | 2.6 | 0.92 | YRI versus ASN | 0.005 | ASN |
| ARMC3 | 10 | 23.257 | — | — | 2.5 | CEU | 2.5 | 0.65 | YRI versus ASN | — | — |
| LRRC18, WDFY4 | 10 | 49.893 | — | — | 2.0 | ASN | 2.5 | 0.80 | YRI versus ASN | — | — |
| ITGAM | 16 | 31.179 | — | — | — | — | 3.4 | 0.75 | YRI versus ASN | — | — |
| CD226 | 18 | 65.681 | — | — | 3.1 | CEU | 3.7 | 0.80 | CEU versus YRI | — | — |
Regions were considered to show evidence for selection if the maximum −log(Q) > 3 for either the F ST, D, or iHS, or it had SNPs with FST values in the top 1% within each population comparison, or the empirical P value estimated for the SLE-associated gene and for each population showed significant evidence for selection (P value < 0.01). Cells that did not meet these thresholds or whose −log(Q) > 2 are marked with (—). The table shows the highest −log(Q) value and respective population for the iHS, D, and FST, the FST statistic (value) for SNPs in the top 1% and the population comparison, and the minimum empirical P value in each region. Q is the rank of the observed statistic for a given SNP divided by the total number of SNPs. The statistic that is ranked is obtained independently for each of the three statistics separately for each population. For iHS, for each SNP, 25 SNPs on either side of the SNP are scanned for |iHS| > 2. The proportion of SNPs in this 51 SNP window with |iHS| > 2 is computed. For D, the estimated value of D was used for ranking. For FST, the statistic to be ranked is obtained in a similar manner as that for iHS except for each population comparison, the thresholds for defining a significant FST is based on the top 5% cutoff for each population comparison. See Methods for details. Chr: chromosome, Mb: mega basepairs, Max: maximum, Min: minimum, Pop: population, ASN: East Asian, CEU: European, YRI: African.
Regions with evidence for selection in the HGDP populations.
| Gene region | Chr | Mb |
| iHS | XP-EHH | ||
|---|---|---|---|---|---|---|---|
| Max −log( | Max −log( | Pop | Max −log( | Pop | |||
| PTPN22 | 1 | 114.158 | 2.5 | 3 | Afr | 3.5 | Afr |
| TNFSF4 | 1 | 171.419 | 4.5 | 2.5 | EAsia | 3.5 | EAsia |
| CRP | 1 | 199.719 | 3.5 | — | — | 2.5 | Afr, Eur |
| IL10 | 1 | 205.008 | 4 | 2 | MEast, EAsia | 2.5 | SAsia EAsia |
| TET3, DGUOK | 2 | 74.212 | 2.5 | 2 | SAsia | 3.5 | MEast, SAsia |
| TNIP1 | 5 | 150.390 | 3.5 | 1.5 | MEast | 3 | Amer |
| PTTG1 | 5 | 159.781 | — | 3.5 | Afr | 2.8 | MEast, Afr |
| UHRF1BP1 | 6 | 34.868 | — | 3 | Amer | 3.5 | Amer |
| IKZF1 | 7 | 50.315 | 3.5 | 3 | EAsia | 2.5 | EAsia |
| BLK | 8 | 11.389 | 4 | 3 | SAsia, MEast, Afr | 4 | EAsia |
| ARMC3 | 10 | 23.257 | 2.5 | 2.5 | MEast | 3.5 | MEast |
| SLC15A4 | 12 | 127.844 | 3.5 | — | — | 2.5 | Afr, Eur |
| CLEC16A | 16 | 11.038 | 2 | 4 | Amer | 4 | Amer |
| ITGAM | 16 | 31.179 | 2.5 | 2 | EAsia | 3.5 | EAsia |
| IRF8 | 16 | 84.490 | 2.5 | 2 | SAsia | 4 | SAsia |
| SCUBE1 | 22 | 41.929 | 2.5 | 2 | Oceania | 3 | Oceania |
Regions were considered to show evidence for selection if the maximum −log10(P) > 3 for either the F ST, iHS, or XP-EHH. The table shows the highest −log10(empirical P value) and respective population for the F ST, iHS, and XP-EHH in each region. Regions whose −log10(P) < 2 are marked with (—). See Methods for details. Chr: chromosome, Mb: mega basepairs, Max: maximum, Pop: population. Populations: Bantu-speaking Africans (Afr), Europeans (Eur), Middle Easterners (MEast), Eastern Asians (EAsia), South Asians (SAsia), Americans (Amer), and Oceanians (Oceania).
Figure 1Worldwide distribution of allele frequencies for SLE-associated SNPs rs2736340 in BLK (a), rs9937837 in ITGAM (b), and rs12599402 in CLEC16A (c).
Summary of regions with evidence for selection on both the HapMap and HGDP populations.
| Gene region | HapMap | HGDP | ||||||
|---|---|---|---|---|---|---|---|---|
| iHS |
|
| Min empirical |
| iHS | XP-EHH | ||
| Max −log( | Max −log( | Value | Max −log( | Max −log( | Max −log( | |||
| PTPN22 | 3.6 | 3.2 | 0.75 | 3.0 | 3.5 | |||
| TNFSF4 | 0.6 | 0.005 | 4.5 | |||||
| TET3, DGUOK | 3.2 | 0.85 | 0.001 | 3.5 | ||||
| TNIP1 | 3.0 | 0.65 | 3.5 | 3.0 | ||||
| UHRF1BP1 | |2.28|* | 0.004 | 3.0 | 3.5 | ||||
| BLK | 0.92 | 0.005 | 4.0 | 3.0 | 4.0 | |||
| ITGAM | 3.4 | 0.75 | 3.5 | |||||
Please refer to footnotes on Tables 2 and 3 for details. *iHS = −2.28 for rs11755393.