| Literature DB >> 25479423 |
Diana Chang1, Feng Gao2, Andrea Slavney3, Li Ma4, Yedael Y Waldman2, Aaron J Sams2, Paul Billing-Ross3, Aviv Madar2, Richard Spritz5, Alon Keinan6.
Abstract
Many complex human diseases are highly sexually dimorphic, suggesting a potential contribution of the X chromosome to disease risk. However, the X chromosome has been neglected or incorrectly analyzed in most genome-wide association studies (GWAS). We present tailored analytical methods and software that facilitate X-wide association studies (XWAS), which we further applied to reanalyze data from 16 GWAS of different autoimmune and related diseases (AID). We associated several X-linked genes with disease risk, among which (1) ARHGEF6 is associated with Crohn's disease and replicated in a study of ulcerative colitis, another inflammatory bowel disease (IBD). Indeed, ARHGEF6 interacts with a gastric bacterium that has been implicated in IBD. (2) CENPI is associated with three different AID, which is compelling in light of known associations with AID of autosomal genes encoding centromere proteins, as well as established autosomal evidence of pleiotropy between autoimmune diseases. (3) We replicated a previous association of FOXP3, a transcription factor that regulates T-cell development and function, with vitiligo; and (4) we discovered that C1GALT1C1 exhibits sex-specific effect on disease risk in both IBDs. These and other X-linked genes that we associated with AID tend to be highly expressed in tissues related to immune response, participate in major immune pathways, and display differential gene expression between males and females. Combined, the results demonstrate the importance of the X chromosome in autoimmunity, reveal the potential of extensive XWAS, even based on existing data, and provide the tools and incentive to properly include the X chromosome in future studies.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25479423 PMCID: PMC4257614 DOI: 10.1371/journal.pone.0113684
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
GWAS datasets.
| Dataset | Disease | # SNPs | # Genes (#SNPs in genic regions) | # Cases (males, females) | # Controls (males, females) |
| ALS Finland | Amyotrophic Lateral Sclerosis (ALS) | 207,947 | 970 (72,219) | 400 (198, 202) | 490 (103, 387) |
| ALS Irish | Amyotrophic Lateral Sclerosis (ALS) | 219,300 | 967 (77,043) | 221 (119, 102) | 210 (112, 98) |
| Psoriasis CASP | Psoriasis | 184,246 | 953 (62,106) | 1,209 (588, 621) | 1,271 (585, 686) |
| Celiac Disease CIDR | Celiac Disease | 187,284 | 962 (64,836) | 1,576 (447, 1129) | 504 (225, 279) |
| CD NIDDK | Crohn's Disease (CD) | 176,072 | 837 (58,874) | 791 (378, 413) | 922 (457, 465) |
| CD WT1 | Crohn's Disease (CD) | 150,275 | 930 (49,017) | 1,592 (607, 985) | 1,701 (923, 778) |
| UC WT2 | Ulcerative Colitis (UC) | 196,781 | 963 (67,422) | 2,341 (1148, 1193) | 1,699 (843, 856) |
| MS case control | Multiple Sclerosis (MS) | 183,954 | 842 (61,119) | 943 (312, 631) | 851 (290, 561) |
| MS WT2 | Multiple Sclerosis (MS) | 169,707 | 962 (58,463) | 2,666 (698, 1968) | 1389 (700, 689) |
| Vitiligo GWAS1 | Vitiligo | 157,676 | 958 (54,384) | 1,391 (411, 980) | 4,521 (1985, 2536) |
| Vitiligo GWAS2 | Vitiligo | 187,688 | 962 (64,974) | 415 (144, 271) | 2,552 (973, 1579) |
| T2D GENEVA | Type-2 Diabetes (T2D) | 220,752 | 971 (75,941) | 2,515 (1050, 1465) | 2,850 (1187, 1663) |
| T2D WT1 | Type-2 Diabetes (T2D) | 152,996 | 927 (49,956) | 1,811 (1051, 760) | 1,668 (710, 958) |
| T1D WT1 | Type-1 Diabetes (T1D) | 152,304 | 926 (49,718) | 1,867 (954, 913) | 1,714 (941, 773) |
| RA WT1 | Rheumatoid Arthritis (RA) | 146,907 | 925 (47,880) | 1,772 (443, 1329) | 1,709 (920, 789) |
| AS WT2 | Ankylosing Spondylitis (AS) | 200,042 | 966 (69,441) | 1,472 (976, 496) | 1,260 (665, 595) |
For each of the case-control datasets analyzed in this study, the table lists its name, the disease considered, the number of X-linked SNPs (# SNPs), which include imputed SNPs, the number of genes tested in gene-based tests (# Genes), and the combined number of SNPs mapped to these genes or to within 15 kb of them (# SNPs in genic regions). The number of individuals (# Cases and # Controls) represents the number of samples following QC. The number of males and females in each category is denoted in parenthesis. All datasets consist of individuals of European ancestry.
*As control individuals overlap across these datasets, we only considered non-overlapping control subsets for each of the datasets (Materials and Methods). The size of these subsets is indicated under # Controls.
Figure 1X-linked genes associated with autoimmune disease risk.
All genes that showed evidence of association in a gene-based test and replication, including suggestive replication in any other dataset (see main text) are presented for the a) FMS.comb b) FMF.comb c) FM02 and d) sex-differentiated effect size tests (Materials and Methods). X-axis denotes the different datasets, with their names following the notation from Table 1. Y-axis displays the different gene names. For each gene, the more significant p-value of the truncated tail strength and truncated product methods is displayed on a −log10 scale according to the enclosed color scale. A “*” represents the discovery dataset and “**” indicates datasets in which replication is significant after correcting for the number of genes tested for replication. These appear in grey when the discovery and replication are in datasets of the same disease (or across the related Crohn's disease and ulcerative colitis). Numerical values corresponding to this figure are presented in Tables 2–3.
Gene-based associations replicating in similar diseases.
| Discovery dataset | Gene | p-value (tail, product) | Replication dataset | p-value (tail, product) | combined p-value (tail, product) |
|
| |||||
| Vitiligo GWAS1 | PPP1R3F | 6.60×10−5, 1.39×10−4 | Vitiligo GWAS2 | 8.10×10−3, 2.70×10−3 | 8.26×10−6, 5.93×10−6 |
| Vitiligo GWAS1 | FOXP3 | 1.11×10−4, 2.76×10−4 | Vitiligo GWAS2 | 5.60×10−3, 5.40×10−3 | 9.50×10−6, 2.15×10−5 |
| Vitiligo GWAS1 | GAGE10 | 1.60×10−3, 4.03×10−4 | Vitiligo GWAS2 | 2.80×10−3, 3.80×10−3 | 5.97×10−5, 2.20×10−5 |
| CD WT1 | ARHGEF6 | 1.70×10−3, 3.66×10−4 | UC WT2 | 2.30×10−3, 3.10×10−3 | 5.26×10−5, 1.67×10−5 |
|
| |||||
| Vitiligo GWAS1 | PPP1R3F | 1.14×10−4, 4.96×10−4 | Vitiligo GWAS2 | 3.70×10−3, 5.80×10−3 | 6.61×10−6, 3.96×10−5 |
|
| |||||
| Vitiligo GWAS1 | PPP1R3F | 6.0×10−6, 7.60×10−5 | Vitiligo GWAS2 | 4.80×10−3, 1.30×10−3 | 5.29×10−7, 1.69×10−6 |
| Vitiligo GWAS1 | GAGE12H | 6.34×10−4, 6.34×10−4 | Vitiligo GWAS2 | 4.60×10−3, 4.60×10−3 | 4.01×10−5, 4.01×10−5 |
| Vitiligo GWAS1 | GAGE10 | 1.85×10−3, 2.66×10−4 | Vitiligo GWAS2 | 2.90×10−3, 2.80×10−3 | 7.05×10−5, 1.13×10−5 |
|
| |||||
| CD WT1 | C1GALT1C1 | 1.97×10−3, 2.63×10−4 | UC WT2 | 1.39×10−2, 1.14×10−2 | 3.15×10−4, 4.11×10−5 |
All genes with a discovery nominal P<1×10−3 (in Discovery dataset) that also replicated in a dataset of the same or similar disease (Replicated dataset). Results are presented for each of the 3 tests of association, as well as for the test of sex-differential effect size, as indicated by titles in the table. For both discovery and replication, p-values of both methods of gene-based testing (truncated tail strength and truncated product) are presented. Combined p-values (last column) were calculated using Fisher's method.
Gene-based associations replicating in other diseases.
| Discovery dataset | Gene | p-value (tail, product) | Replication dataset | p-value (tail, product) | combined p-value (tail, product) |
|
| |||||
| ALS Finland | NAP1L2 | 4.51×10−4, 3.80×10−5 | UC WT2 | 5.70×10−3, 3.70×10−3 | 3.57×10−5, 2.36×10−6 |
| Vitiligo GWAS1 | 1.0×10−2, 1.40×10−2 | 6.00×10−5, 8.22×10−6 | |||
| ALS Finland | ITM2A | 2.10×10−3, 4.10×10−4 | Celiac Disease CIDR | 7.90×10−3, 1.06×10−2 | 1.99×10−4, 5.80×10−5 |
| MS case control | FANCB | 5.20×10−5, 1.30×10−3 | RA WT1 | 3.80×10−3, 1.10×10−2 | 3.25×10−6, 1.74×10−4 |
| Vitiligo GWAS1 | CENPI | 2.17×10−4, 1.00×10−3 | ALS Finland | 2.40×10−3, 2.00×10−3 | 8.06×10−6, 2.82×10−5 |
| T2D GENEVA | RP4-562J12.2 | 4.89×10−4, 1.30×10−4 | CD NIDDK | 3.41×10−2, 3.93×10−2 | 2.00×10−4, 5.56×10−4 |
| WT2 AS | 5.60×10−2, 4.30×10−2 | 3.15×10−4, 7.32×10−5 | |||
| T2D WT1 | MAGEC1 | 2.64×10−2, 5.34×10−4 | MS case control | 6.70×10−3, 8.50×10−3 | 1.71×10−3, 6.04×10−5 |
| UC WT21 | NAP1L6 | 1.06×10−3, 5.70×10−5 | ALS Finland | 3.10×10−3, 5.50×10−3 | 4.49×10−5, 5.01×10−6 |
|
| |||||
| CASP | NLGN4X | 8.87×10−4, 1.66×10−2 | Vitiligo GWAS2 | 1.21×10−2, 1.31×10−2 | 1.34×10−4, 2.05×10−3 |
| CIDR Celiac Disease | 5.10×10−2, 4.90×10−2 | 4.98×10−4, 6.66×10−3 | |||
| Celiac CIDR | CENPI | 2.90×10−3, 5.23×10−4 | ALS Finland | 1.12×10−2, 1.00×10−3 | 3.68×10−4, 8.09×10−6 |
| ALS Irish | 2.68×10−2, 1.64×10−2 | 8.13×10−4, 1.09×10−4 | |||
| Vitiligo GWAS1 | 1.55×10−4, 2.60×10−3 | 7.02×10−6, 1.97×10−5 | |||
| Vitiligo GWAS1 | BEND2 | 1.80×10−3, 7.90×10−5 | T2D WT1 | 9.30×20−3, 1.29×10−2 | 2.01×10−4, 1.51×10−5 |
| Vitiligo GWAS1 | CENPI | 1.55×10−4, 2.60×10−3 | ALS Finland | 1.12×10−2, 1.00×10−3 | 2.48×10−5, 3.60×10−5 |
| Celiac CIDR | 2.90×10−3, 5.23×10−4 | 7.02×10−6, 1.97×10−5 | |||
| Vitiligo GWAS2 | MCF2 | 1.70×10−4, 5.76×10−4 | MS WT2 | 2.31×10−2, 2.50×10−2 | 5.28×10−5, 1.75×10−4 |
| CD WT1 | LINC00892 | 1.30×10−3, 8.80×10−5 | MS WT2 | 2.42×10−2, 1.99×10−2 | 3.58×10−4, 2.50×10−5 |
| T2D WT1 | MAGEC1 | 2.75×10−2, 1.81×10−4 | MS case control | 1.42×10−2, 1.50×10−2 | 3.46×10−3, 3.75×10−5 |
| MS WT2 | MAGEE1 | 7.06×10−4, 2.30×10−3 | ALS Finland | 3.23×10−2, 2.36×10−2 | 2.67×10−4, 5.87×10−4 |
|
| |||||
| ALS Finland | NAP1L2 | 5.7×10−4, 1.15×10−4 | UC WT2 | 8.30×10−3, 7.1×10−3 | 6.27×10−5, 1.23×10−5 |
| ALS Finland | ITM2A | 8.43×10−4, 3.07×10−4 | Celiac CIDR | 6.5×10−3, 1.13×10−2 | 7.19×10−5, 4.71×10−5 |
| ALS Finland | CENPI | 1.27×10−3, 1.75×10−4 | Vitiligo GWAS1 | 1.60×10−3, 5.90×10−3 | 2.89×10−5, 1.53×10−5 |
| ALS Finland | TMEM35 | 2.78×10−3, 3.45×10−4 | Vitiligo GWAS1 | 3.80×10−3, 6.20×10−3 | 1.31×10−4, 3.01×10−5 |
| CD WT1 | LINC00892 | 1.73×10−3, 5.29×10−4 | MS WT2 | 6.30×10−3, 6.40×10−3 | 1.35×10−4, 4.60×10−5 |
| Vitiligo GWAS1 | 2.30×10−2, 2.89×10−2 | 4.41×10−4, 1.85×10−4 | |||
| UC WT2 | GPR34 | 2.62×10−4, 1.62×10−4 | MS WT2 | 5.60×10−3, 1.10×10−2 | 2.12×10−5, 2.54×10−5 |
| UC WT2 | NAP1L6 | 1.19×10−3, 4.29×10−4 | ALS Finland | 4.00×10−3, 1.06×10−2 | 6.31×10−5, 6.05×10−5 |
| MS case control | RP11-265P11.2 | 3.03×10−3, 8.55×10−4 | T2D WT1 | 4.42×10−2, 4.68×10−2 | 1.32×10−3, 4.45×10−4 |
| T2D GENEVA | SNORA35 | 2.12×10−3, 4.54×10−4 | AS WT2 | 2.40×10−3, 6.70×10−3 | 6.71×10−5, 4.17×10−5 |
| T2D GENEVA | IL13RA1 | 6.35×10−3, 8.59×10−4 | AS WT2 | 6.20×10−3, 7.20×10−3 | 4.39×10−4, 8.04×10−5 |
| T2D WT1 | MAGEC1 | 2.63×10−2, 6.80×10−5 | MS case control | 1.00×10−2, 1.54×10−2 | 2.43×10−3, 1.55×10−5 |
|
| |||||
| ALS Finland | MAGEE2 | 6.5×10−4, 1.94×10−3 | Vitiligo GWAS1 | 3.08×10−2, 1.64×10−2 | 2.37×10−4, 3.61×10−4 |
| ALS Finland | NDP | 1.41×10−3, 9.34×10−4 | CD WT1 | 8.60×10−3, 1.33×10−2 | 1.49×10−4, 1.53×10−4 |
| CASP | NLGN4X | 2.34×10−4, 1.65×10−2 | Vitiligo GWAS1 | 4.52×10−2, 4.33×10−2 | 1.32×10−4, 5.89×10−3 |
| Celiac CIDR | CENPI | 4.4×10−3, 2.08×10−4 | ALS Finland | 2.03×10−2, 1.78×10−2 | 9.22×10−4, 5.00×10−5 |
| ALS Irish | 9.80×10−3, 4.40×10−3 | 4.88×10−4, 1.36×10−5 | |||
| Vitiligo GWAS1 | BEND2 | 3.99×10−3, 1.28×10−4 | MS case control | 4.60×10−2, 5.20×10−2 | 1.76×10−3, 8.60×10−5 |
| Vitiligo GWAS2 | MCF2 | 7.00×10−4, 1.93×10−3 | MS WT2 | 2.38×10−2, 2.12×10−2 | 2.00×10−4, 4.54×10−4 |
| T2D GENEVA | EFHC2 | 6.09×10−4, 1.12×10−3 | RA WT1 | 1.58×10−2, 1.40×10−3 | 1.21×10−4, 2.42×10−5 |
| RA WT1 | MIR320D2 | 8.69×10−3, 5.68×10−4 | ALS Irish | 2.39×10−2, 2.64×10−2 | 1.97×10−3, 1.82×10−4 |
All genes with a discovery nominal P<1×10−3 that also replicated in a dataset of a different disease (see main text). The table mirrors Table 2, with the only difference being whether replication is in the same disease (Table 2) or a different one (this table). Cases in which the same association is replicated in multiple datasets span several rows.
Gene set associations.
| Dataset | Statistic | P-value |
| XY homologs gene set | ||
| Psoriasis CASP | FMF.comb |
|
| Celiac disease CIDR | FMF.comb | 0.0467 |
| Vitiligo GWAS1 | FMF.comb |
|
| Vitiligo GWAS1 | FM02 | 0.0329 |
| Vitiligo GWAS2 | FMF.comb | 0.0346 |
| CD NIDDK | FM02 | 0.017 |
| CD WT1 | FM02 | 0.0234 |
| T1D WT1 | FMS.comb | 0.0302 |
| Panther immune gene set | ||
| Vitiligo GWAS1 | FM02 |
|
| Vitiligo GWAS1 | FMF.comb | 0.0387 |
| Vitiligo GWAS1 | FMS.comb |
|
| Vitiligo GWAS2 | FM02 |
|
| Vitiligo GWAS2 | FMF.comb | 0.0448 |
| Vitiligo GWAS2 | FMS.comb |
|
| T2D GENEVA | FMS.comb |
|
| KEGG/GO immune gene set | ||
| Vitiligo GWAS1 | FMF.comb |
|
| Vitiligo GWAS1 | FMs.comb |
|
Three curated gene sets were tested for association with disease risk. Displayed are datasets for which P<0.05 for association with the gene set indicated in header rows (XY homologs, Panther, KEGG/GO; Materials and Methods). Bold p-values indicate significant associations after multiple testing correction. P-values are the minimum between that based on the truncated tail strength method and the one based on the truncated product method. Results for all datasets and tests are presented in Table S7.
Figure 2X-linked autoimmune disease risk genes are differentially expressed between tissues.
X-axis presents 13 of the associated X-linked genes for which gene expression data was available for analysis. For each, a z-score is presented for the deviation of expression in each of 74 tissues (y-axis) from the average expression of that gene across all tissues (Materials and Methods). For comparison, the last column shows average z-scores across all 504 X-linked genes that were tested as part of the entire XWAS for which expression data was available. Several associated genes exhibit significantly higher expression in immune-related tissues (see main text and Figure 3).
Figure 3Three X-linked disease risk genes show high expression in immune-related tissues and cells.
ARHGEF6 (a), IL13RA1 (b), and ITM2A (c) show expression greater than 4 standard deviations above the average expression of these genes in T-cells (highest in CD4+ in purple), CD14+ monocytes (blue), and the thymus (red), respectively. Y-axis follows the respective tissues from Figure 2 and x-axis denotes a z-score for the deviation of expression in each tissue from the average expression of that gene. The title of each panel includes the name of the gene and the tissue with the highest expression for that gene.
Figure 4Interactome of X-linked disease risk genes.
All 22 X-linked protein-coding genes that showed evidence of association and replication (Figure 1) are denoted by black diamonds and are presented together with genes that interact with them (grey circles) (Materials and Methods). Physical interactions refer to documented protein-protein interactions. Genetic interactions represent genes where perturbations to one gene affect another. Predicted interactions were obtained from orthology to interactions present in other organisms [159]. All but four of these 22 genes share interacting partners according to these known and predicted interactions. Results of a pathway analysis based on this interactome are presented in Table 5.
Gene-enrichment analysis of the interactome.
| Pathway | Genes | P-value |
| Regulation of actin cytoskeleton |
| 5.55×10−14 |
| T-cell receptor signaling pathway |
| 2.75×10−13 |
| Axon guidance |
| 4.97×10−11 |
| Wnt signaling |
| 4.74×10−9 |
| Systemic lupus erythematosus |
| 4.34×10−8 |
| Chemokine signaling |
| 4.52×10−7 |
| Focal adhesion |
| 6.28×10−7 |
| TGF-beta signaling |
| 7.87×10−7 |
| Pathways in cancer |
| 1.74×10−6 |
| Pancreatic cancer |
| 6.17×10−6 |
Genes we discovered and replicated as associated with any disease tested, and their interacting genes (Figure 4) were enriched for several immune related pathways. We display the ten most significantly enriched pathways. Genes within each pathway that were also within our query set are listed. Displayed p-values are adjusted for multiple testing (Materials and Methods).