Literature DB >> 27723757

Genome-wide association studies of autoimmune vitiligo identify 23 new risk loci and highlight key pathways and regulatory variants.

Ying Jin1,2, Genevieve Andersen1, Daniel Yorgov3, Tracey M Ferrara1, Songtao Ben1, Kelly M Brownson1, Paulene J Holland1, Stanca A Birlea1,4, Janet Siebert5, Anke Hartmann6, Anne Lienert6, Nanja van Geel7, Jo Lambert7, Rosalie M Luiten8, Albert Wolkerstorfer8, J P Wietze van der Veen8,9, Dorothy C Bennett10, Alain Taïeb11, Khaled Ezzedine11, E Helen Kemp12, David J Gawkrodger12, Anthony P Weetman12, Sulev Kõks13, Ele Prans13, Külli Kingo14, Maire Karelson14, Margaret R Wallace15, Wayne T McCormack16, Andreas Overbeck17, Silvia Moretti18, Roberta Colucci18, Mauro Picardo19, Nanette B Silverberg20,21, Mats Olsson22, Yan Valle23, Igor Korobko23,24, Markus Böhm25, Henry W Lim26, Iltefat Hamzavi26, Li Zhou26, Qing-Sheng Mi26, Pamela R Fain1,2, Stephanie A Santorico1,3,27, Richard A Spritz1,2.   

Abstract

Vitiligo is an autoimmune disease in which depigmented skin results from the destruction of melanocytes, with epidemiological association with other autoimmune diseases. In previous linkage and genome-wide association studies (GWAS1 and GWAS2), we identified 27 vitiligo susceptibility loci in patients of European ancestry. We carried out a third GWAS (GWAS3) in European-ancestry subjects, with augmented GWAS1 and GWAS2 controls, genome-wide imputation, and meta-analysis of all three GWAS, followed by an independent replication. The combined analyses, with 4,680 cases and 39,586 controls, identified 23 new significantly associated loci and 7 suggestive loci. Most encode immune and apoptotic regulators, with some also associated with other autoimmune diseases, as well as several melanocyte regulators. Bioinformatic analyses indicate a predominance of causal regulatory variation, some of which corresponds to expression quantitative trait loci (eQTLs) at these loci. Together, the identified genes provide a framework for the genetic architecture and pathobiology of vitiligo, highlight relationships with other autoimmune diseases and melanoma, and offer potential targets for treatment.

Entities:  

Mesh:

Year:  2016        PMID: 27723757      PMCID: PMC5120758          DOI: 10.1038/ng.3680

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


In previous genome-wide linkage and association studies, we identified 27 vitiligo susceptibility loci[3-6] in EUR subjects, principally encoding immunoregulatory proteins, many of which are associated with other autoimmune diseases[7]. Several other vitiligo-associated genes encode melanocyte components that regulate normal pigmentary variation[8] and in some cases are major vitiligo autoimmune antigens, with an inverse association of variation at these loci with vitiligo versus malignant melanoma[4,6]. To detect additional vitiligo-associations with lower odds ratios (ORs), as well as uncommon risk alleles with higher ORs, we conducted a third GWAS (GWAS3) of EUR subjects. We augmented the number of population controls in our previous GWAS1 and GWAS2 and performed genome-wide imputation of all three EUR vitiligo GWAS. After quality control procedures, the augmented studies included 1,381 cases and 14,518 controls (GWAS1), 413 cases and 5,209 controls (GWAS2), and 1,059 cases and 17,678 controls (GWAS3), with genomic inflation factors 1.068, 1.059, and 1.013, respectively. We performed a fixed-effects meta-analysis of the three GWAS datasets for 8,966,411 markers (GWAS123; Online Methods). Replication used an additional 1,827 EUR vitiligo cases and 2,181 controls. Results for the three individual GWAS, the meta-analysis, and the replication study are presented in Table 1, Supplementary Table 1, and Fig. 1. Twenty-three new loci achieved genome-wide significance (P < 5 × 10−8) for association with vitiligo and demonstrated subsequent replication; of these, 21 are completely novel (FASLG, PTPRC, PPP4R3B, BCL2L11, FARP2-STK25, UBE2E2, FBXO45-NRROS, PPP3CA, IRF4, SERPINB9, CPVL, NEK6, ARID5B, a multigenic segment that includes BAD, TNFSF11, KAT2A-HSPB9-RAB5C, TNFRSF11A, SCAF1-IRF3-BCL2L12, a multigenic segment that includes ASIP, PTPN1, and IL1RAPL1), while two, CTLA4 and TICAM1, were suggestive in our previous studies. One previously significant locus, CLNK, was no longer significant (Supplementary Table 1). Another potential new locus, PVT1, exceeded genome-wide significance in the discovery meta-analysis (P = 7.74 × 10−9), but could not be successfully genotyped in the replication study and so remains uncertain. Two other loci, FLI1 and LOC101060498, exceeded genome-wide significance in the discovery meta-analysis (P = 3.76 × 10−8 and P = 3.60 × 10−11, respectively), but did not demonstrate replication. Seven additional novel loci achieved suggestive significance (P < 10−5) in the discovery meta-analysis (STAT4, PPARGC1B, c7orf72, PARP12, FADS2, CBFA2T3, and a chr17 locus in the vicinity of AFMID) and gave evidence of replication, but failed to achieve genome-wide significance (Supplementary Table 1).
Table 1

Allelic associations at vitiligo susceptibility loci following GWAS meta-analysis and replication study

GWAS123 meta-analysisGWAS3 replication studyGWAS123 & GWAS3 replicationstudy meta-analysis



Chr.VariantPosition(Build 37)LocusEA/OAP valueOddsratioP valueOddsratioP valueOdds ratio(95% CI)Heritabilityexplained*(%)
1rs3018078484823REREA/G1.84 × 10−121.224.09 × 10−041.174.14 × 10−151.21 (1.15–1.27)0.003
1rs2476601114377568PTPN22A/G2.21 × 10−141.391.08 × 10−051.361.21 × 10−181.38 (1.29–1.49)0.003
1rs78037977172715702FASLGG/A1.39 × 10−131.338.95 × 10−051.296.74 × 10−171.32 (1.24–1.41)0.003
1rs16843742198672299PTPRCC/T8.84 × 10−090.821.87 × 10−020.881.02 × 10−090.83 (0.79–0.88)0.002
2rs1020015955845109PPP4R3BC/T3.35 × 10−131.483.70 × 10−071.553.73 × 10−191.51 (1.38–1.66)0.003
2rs4308124112010486BCL2L11-MIR4435-2HGC/T4.99 × 10−081.171.67 × 10−021.123.96 × 10−091.15 (1.10–1.21)0.002
2rs2111485163110536IFIH1A/G2.69 × 10−220.758.58 × 10−050.836.40 × 10−250.77 (0.73–0.81)0.008
2rs231725204740675CTLA4A/G2.25 × 10−081.181.57 × 10−031.161.49 × 10−101.18 (1.12–1.24)0.002
2rs41342147242407588FARP2-STK25A/G8.03 × 10−070.801.25 × 10−030.803.70 × 10−090.80 (0.74–0.86)0.003
3rs3516162623512312UBE2E2I/D7.34 × 10−070.871.09 × 10−020.893.13 × 10−080.87 (0.83–0.92)0.001
3rs3434664571557945FOXP1A/C6.11 × 10−140.804.23 × 10−060.817.99 × 10−190.80 (0.76–0.84)0.004
3rs148136154119283468CD80-ADPRHC/T5.02 × 10−151.371.74 × 10−021.174.58 × 10−151.31 (1.22–1.40)0.003
3rs13076312188089254LPPT/C3.58 × 10−221.323.48 × 10−101.331.61 × 10−301.32 (1.26–1.38)0.009
3rs6583331196347253FBXO45-NRROSA/T1.39 × 10−070.863.62 × 10−020.912.53 × 10−080.87 (0.83–0.92)0.002
4rs1031034102223386PPP3CAA/C4.78 × 10−060.862.14 × 10−030.863.43 × 10−080.86 (0.81–0.91)0.001
6rs12203592396321IRF4T/C1.03 × 10−090.773.17 × 10−080.688.86 × 10−160.75 (0.70–0.80)0.001
6rs785216992908591SERPINB9G/A3.33 × 10−060.792.27 × 10−030.802.54 × 10−080.79 (0.73–0.86)0.001
6rs6013126129937335HLA-AD/I2.63 × 10−481.538.01 × 10−201.541.56 × 10−661.54 (1.46–1.61)0.016
6rs927159732591291HLA-DRB1/DQA1A/T3.15 × 10−891.77ndndndnd0.042
6rs7292803890976768BACH2A/G1.12 × 10−111.282.04 × 10−041.251.00 × 10−141.27 (1.19–1.35)0.003
6rs2247314167370230RNASET2-FGFR1OP-CCR6C/T1.97 × 10−130.791.56 × 10−060.791.72 × 10−180.79 (0.75–0.84)0.003
7rs11774408129132279CPVLG/A3.74 × 10−221.951.88 × 10−061.668.72 × 10−261.84 (1.64–2.06)0.004
8rs2687812133931055TG-SLA-WISP1A/T1.98 × 10−111.211.69 × 10−031.152.19 × 10−131.19 (1.14–1.25)0.007
9rs10986311127071493NEK6C/T5.45 × 10−071.165.10 × 10−031.141.01 × 10−081.15 (1.10–1.21)0.001
10rs7067796098824IL2RAC/T1.30 × 10−240.749.25 × 10−050.847.20 × 10−270.77 (0.73–0.81)0.012
10rs7150890363779871ARID5BT/C1.09 × 10−061.181.52 × 10−031.196.93 × 10−091.18 (1.12–1.25)0.001
10rs12771452115488331CASP7A/G9.16 × 10−080.838.42 × 10−060.794.43 × 10−120.82 (0.78–0.87)0.002
11rs104310135274829CD44-SLC1A2G/A2.08 × 10−131.244.20 × 10−061.245.26 × 10−181.23 (1.18–1.29)0.003
11rs1242161564021605PPP1R14B-PLCB3-BAD-GPR137-KCNK4-TEX40-ESRRA-TRMT112-PRDX5A/G3.38 × 10−060.873.78 × 10−030.874.81 × 10−080.87 (0.83–0.91)0.001
11rs112680989017961TYRA/G7.13 × 10−320.672.54 × 10−130.681.16 × 10−430.67 (0.63–0.71)0.012
11rs1102123295320808Gene desertC/T1.01 × 10−211.383.81 × 10−041.222.10 × 10−231.34 (1.26–1.41)0.005
12rs201744556407072IKZF4A/G3.81 × 10−201.311.22 × 10−121.406.62 × 10−311.33 (1.27–1.40)0.005
12rs10774624111833788SH2B3-ATXN2A/G1.88 × 10−140.801.52 × 10−100.756.22 × 10−230.79 (0.75–0.83)0.004
13rs3586023443070206TNFSF11G/T2.82 × 10−061.163.45 × 10−041.204.76 × 10−091.17 (1.11–1.23)0.001
14rs819291725102160GZMBC/T1.37 × 10−101.231.23 × 10−061.298.91 × 10−161.25 (1.18–1.32)0.002
15rs163516828535266OCA2-HERC2A/C6.97 × 10−131.437.45 × 10−031.258.78 × 10−141.37 (1.26–1.49)0.003
16rs426874890026512MC1R?C/T1.63 × 10−200.738.23 × 10−150.662.88 × 10−330.71 (0.67–0.75)0.013
17rs1107903540289012KAT2A-HSPB9-RAB5CA/G3.20 × 10−061.183.19 × 10−051.286.77 × 10−101.21 (1.14–1.29)0.001
18rs808351160028655TNFRSF11AC/A9.42 × 10−101.243.23 × 10−021.132.81 × 10−101.21 (1.14–1.28)0.002
19rs48070004831878TICAM1A/G1.58 × 10−091.192.11 × 10−061.241.94 × 10−141.21 (1.15–1.26)0.002
19rs230420650168871SCAF1-IRF3-BCL2L12A/G6.45 × 10−090.824.52 × 10−020.902.36 × 10−090.84 (0.80–0.89)0.002
20rs605965532665748RALY-EIF252-ASIP-AHCY-ITCHA/G3.58 × 10−130.633.08 × 10−080.571.04 × 10−190.61 (0.55–0.68)0.004
20rs601295349123043PTPN1G/A1.18 × 10−071.161.74 × 10−021.119.47 × 10−091.15 (1.10–1.20)0.002
21rs1248290443851828UBASH3AA/T5.74 × 10−291.431.16 × 10−031.185.84 × 10−291.35 (1.28–1.43)0.010
22rs22952737581485C1QTNF6A/C1.40 × 10−241.341.15 × 10−071.271.14 × 10−301.32 (1.26–1.38)0.006
22rs961156541767486ZC3H7B-TEFC/T1.99 × 10−120.783.34 × 10−040.823.13 × 10−150.79 (0.75–0.84)0.003
xrs7345641129737404IL1RAPL1T/G1.57 × 10−071.725.90 × 10−031.627.34 × 10−101.77 (1.47–2.13)0.001
xrs595255349392721CCDC22-FOXP3-GAGEC/T1.81 × 10−080.853.48 × 10−020.921.05 × 10−090.86 (0.82–0.90)0.001

Heritability explained by all independent signals of the locus. Chr., chromosome; CI, confidence interval; nd, not determined; EA, effect allele; OA, other allele. Bold, novel significant vitiligo susceptibility loci. The chromosome 16 association peak spans a large number of genes, including MC1R.

Figure 1

Genome-wide meta-analysis results. The genome-wide distribution of −log10 (P values) from the Cochran-Mantel-Haenszel meta-analysis for 8,966,411 genotyped and imputed markers from GWAS1, GWAS2, and GWAS3 is shown across the chromosomes. The dotted line indicates the threshold for genome-wide significance (P < 5 × 10−8).

Together, the most significantly associated variants at the 48 loci (Table 1) identified by meta-analyses of the three GWAS account for 17.4% of vitiligo heritability (h2 ~ 0.75). To assess whether additional independent variants at these loci might account for additional vitiligo heritability, we performed logistic regression conditional on the most significant SNP at each locus. Eight loci (FARP2-STK25, IFIH1, IL2RA, LPP, MC1R, SLA/TG, TYR, UBASH3A) and the MHC showed evidence of additional independent associations, accounting for an additional 5.1% of vitiligo heritability, for a total of 22.5%. In general, the ORs for the 23 new confirmed loci were lower than those for loci detected previously[6], 1.15 to 1.27, excepting CPVL (OR = 1.84), RALY-EIF252-ASIP-AHCY-ITCH (OR = 1.64), and IL1RAPL1 (OR = 1.77); for these three signals the associated alleles are uncommon (minor allele frequencies 0.03, 0.07, and 0.01, respectively) and thus were not detected in the previous GWAS due to power limitations. To screen for functional relationships among proteins encoded at the 48 confirmed vitiligo-associated loci, we included all genes under the association peaks at these loci in unsupervised pathway analyses using g:PROFILER[9], PANTHER[10], and STRING[11]. PANTHER and gPROFILER identified an enriched network of BioGRID interactions, most significant for the GO categories immune response, immune system process, positive regulation of response to stimulus, positive regulation of biological process, and regulation of response to stimulus. STRING identified a large potential interaction network (Fig. 2), with a predominance of proteins involved in immunoregulation, T-cell receptor repertoire, apoptosis, antigen processing and presentation, and melanocyte function.
Figure 2

Bioinformatic functional interaction network analysis of proteins encoded by all positional candidate genes at all confirmed and suggestive vitiligo candidate loci. As a first step, unsupervised functional interaction network analysis was carried out using STRING v10.0[11], considering each protein as a node and permitting ≤ 5 second-order interactions to maximize connectivity. Nodes that shared no edges with other nodes were then excluded from the network. Edge colors are per STRING: teal, interactions from curated databases; purple, experimentally determined interactions; green, gene neighborhood; blue, databases; red, gene fusions; dark blue, gene co-occurrence; pale green, text-mining; black, co-expression; lavender, protein homology. Note that SMEK2 is an alternative name for PPP4R3B.

Considering proteins encoded at the 23 newly confirmed vitiligo candidate loci, at least twelve (CTLA4, TICAM1, PTPRC, FARP2, UBE2E2, NRROS, CPVL, ARID5B, PTPN1, TNFSF11, TNFRSF11A, IRF3, and perhaps also IL1RAPL1) play roles in immune regulation, and PPP3CA may regulate FOXP3 via NFATC2 and is associated with canine lupus[12]. Six (FASLG, BCL2L11, BCL2L12, SERPINB9, NEK6, BAD) are regulators of apoptosis, particularly involving immune cells. ASIP is a regulator of melanocyte gene expression, and IRF4 is a key transcription factor for both immune cells and melanocytes. Strikingly, several vitiligo-associated genes encode proteins that interact physically and functionally. BCL2L11 and BAD are binding partners that promote apoptosis[13]. CD80 binds to CTLA4 to inhibit T cell activation[14]. BCL2L12 binds to and neutralizes caspase 7 (CASP7)[15]. SERPINB9 binds to and specifically inhibits granzyme B (GZMB)[16]. Eos (IKZF4) binds and is an obligatory co-repressor of FOXP3 in regulatory T cells[17]. RANK (TNFRSF11A) binds to RANKL (TNFSF11) to regulate many aspects of immune cell function, including interactions of T cells and dendritic cells and thymic tolerization[18]. Agouti signaling protein (ASIP) binds to the melanocortin-1 receptor (MC1R) to down-regulate production of brown-black eumelanin[19]. IRF4 cooperates with MITF to activate transcription of TYR[20]. And the vitiligo-associated HLA-A*02:01:01:01 subtype presents peptide antigens derived from several different melanocyte proteins, including tyrosinase (TYR), OCA2, and MC1R[4,6,21]. Together, these relationships appear to highlight key pathways of vitiligo pathogenesis that are beginning to coalesce. An unexpected finding from vitiligo GWAS has been an inverse relationship between vitiligo and malignant melanoma risk for genes that encode melanocyte structural and regulatory proteins. TYR, OCA2, and MC1R, encode functional components of the melanocyte and are key vitiligo autoantigens. IRF4 encodes a transcription factor for melanocytes as well as lymphoid, myeloid, and dendritic cells[22], controlled by alternative tissue-specific enhancers[23]. ASIP and PPARGC1B encode paracrine regulators of melanocyte gene expression. All six loci play important roles in normal pigmentary variation[8,24], and for all six the specific SNPs associated with vitiligo risk are also associated with melanoma protection, and vice-versa[25-27]. The inverse genetic relationship of susceptibility to vitiligo versus melanoma suggests that vitiligo may represent enhanced immune surveillance against melanoma[27,28], consistent with the threefold reduction in melanoma incidence among vitiligo patients[29,30] and prolonged survival of melanoma patients who develop vitiligo during immunotherapy[31]. Vitiligo is epidemiologically associated with several other autoimmune diseases, including autoimmune thyroid disease, pernicious anemia, rheumatoid arthritis, adult-onset type 1 diabetes, Addison’s disease, and lupus[2,32]. We searched the NHGRI-EBI GWAS Catalog and PubMed for the 48 genome-wide significant and 7 suggestive vitiligo susceptibility loci for associations with other autoimmune, inflammatory, and immune-related disorders. As shown in Fig. 3, of the 23 novel genome-wide significant vitiligo loci, FASLG has been associated with celiac disease[33] and Crohn’s disease[34]; PTPRC with ulcerative colitis[35]; BCL2L11 with primary sclerosing cholangitis[36]; CTLA4 with alopecia areata[37], rheumatoid arthritis[38], autoimmune thyroid disease[39,40], myasthenia gravis[41], and type 1 diabetes autoantibody production[42]; TNFRSF11A with myasthenia gravis[41]; and ARID5B with systemic lupus erythematosus[43]. Of the seven suggestive loci, STAT4 has been associated with Behḉet’s disease[44], Sjögren’s syndrome[45], and lupus[46]; and c7orf72 with lupus[47]. These concordant associations for vitiligo and other autoimmune and inflammatory diseases add to those involving previously identified vitiligo susceptibility loci, which include RERE, PTPN22, IFIH1, CD80, LPP, BACH2, RNASET2-FGFR1OP-CCR6, TG/SLA, IL2RA, CD44, a chr11q21 gene desert, IKZF4, SH2B3-ATXN2, UBASH3A, and C1QTNF6[4,6]. Nevertheless, in most cases it remains uncertain whether apparent shared locus associations for different autoimmune diseases reflect shared or different underlying causal variants.
Figure 3

Concordant associations for vitiligo and other autoimmune and inflammatory diseases. We searched the NHGRI-EBI GWAS Catalog and PubMed for associations of the 48 genome-wide significant and 7 suggestive vitiligo susceptibility loci with other autoimmune, inflammatory, and immune-related disorders, and for association with normal human pigmentation variation. Only reported associations that achieved genome-wide significance (P < 5 × 10−8) are included. RA, rheumatoid arthritis; T1D, type 1 diabetes mellitus; AITD, autoimmune thyroid disease; SLE, systemic lupus erythematosus; IBD, inflammatory bowel disease; MS, multiple sclerosis; MG, myasthenia gravis; AI hepatitis, autoimmune hepatitis.

A majority of loci associated with complex traits involve causal variants that are regulatory in nature[48-52], often corresponding to apparent expression quantitative trait loci (eQTLs)[52]. For TYR[21], GZMB[53], and MC1R[7], principal vitiligo risk derives from missense substitutions, whereas for OCA2[6] and the MHC class I[54] and class II[55] loci principal vitiligo risk is associated with causal variation in nearby transcriptional regulatory elements. To assess the fraction of vitiligo-associated loci for which causal variation is likely regulatory, we carried out conditional logistic regression analysis of all loci to define independent association signals, and for each signal we compiled all variants that could not be statistically distinguished. All variants were then annotated against all available ENCODE datasets for immune-related and melanocyte-related cells (Supplementary Table 2). Overall, at approximately 58% of loci, the most significant variants (or statistically indistinguishable variants) are within a transcriptional regulatory element predicted by ENCODE data[56,57]. Only about 15% are in coding regions, several resulting in missense substitutions. To further assess the general functional categories of apparent causal variants for vitiligo, we applied stratified LD score regression[51] to the GWAS meta-analysis summary statistics. As shown in Fig. 4, greatest enrichment of heritability was observed for markers in regulatory functional categories, with considerably less enrichment of markers in protein coding regions.
Figure 4

Enrichment estimates for functional annotations. The combined CMH GWAS123 summary statistics were analyzed using the stratified LD score regression method utilizing the full baseline model[51]. Regulatory, yellow; protein coding, blue; intron, green. Bar height represents enrichment which is defined to be the proportion of SNP heritability in the category divided by the proportion of SNPs in that category. Error bars represent jackknife standard error around the enrichment. For each category, percentage of the total markers in the category is in parentheses. Dashed line represents a ratio of 1 (no enrichment). Asterisks indicate enrichment significant at P < 0.05 after Bonferroni correction for the 20 categories tested (the categories conserved, repressed, transcribed, and promoter flanking were removed and considered insufficiently specific). CTCF, CCCTC-binding factor; DGF, digital genomic footprint; DHS, DNase hypersensitivity site; TFBS, transcription factor binding site; TSS, transcriptional start site; 5’ and 3’ UTR, 5’ and 3’ untranslated regions. H3K4me1, H3K4me3, H3K9ac, and H3K27ac are regulatory chromatin marks[56,57].

We utilized two approaches to assess correspondence of vitiligo association signals with expression of genes in the vicinity. We used PrediXcan[58] to predict expression of 11,553 genes in whole blood for each study subject and then tested association of predicted expression of each gene with vitiligo affection status. We used a Bayesian method to assess co-localization of cis eQTL signals in purified blood monocytes with the confirmed vitiligo association signals. The PrediXcan analysis found 83 genes with significant differential predicted expression in vitiligo cases versus controls after Bonferroni correction (Supplementary Table 3); of these, 75 were located within 1 Mb of one of the 48 confirmed vitiligo susceptibility loci, demonstrating highly significant enrichment compared with locations of genes non-significant for PrediXcan (P value < 0.00001). The eQTL analysis found that 8 of the confirmed vitiligo association signals showed significant co-localization with eQTL association signals identified in purified monocytes (Supplementary Fig. 1 and Supplementary Table 4). Of the confirmed vitiligo-associated genes that could be tested using both methods, 6 were significant in both analyses (CASP7, HERC2-OCA2, ZC3H7B-TEF, TICAM1, RERE, RNASET2-FGFR1OP-CCR6). For all of these except CASP7, one or more of the most associated SNPs not distinguishable by logistic regression was located within or very close to an ENCODE element likely to regulate gene expression in immune cell types, melanocytes, or both (Supplementary Table 2). Like a jigsaw puzzle, the pieces of the vitiligo pathogenome are thus beginning to fit together, revealing a complex network of immunoregulatory proteins, apoptotic regulators, and melanocyte components that mediate both autoimmune targeting of melanocytes in vitiligo and susceptibility to melanoma. For vitiligo as for other complex diseases, there is enrichment of causal variation in regions that regulate gene expression. This may bode well for identifying potential therapeutic targets, as pharmacologic modulation of dysregulated biological pathways may prove more tractable than attempting to target proteins impacted by amino acid substitutions.

ONLINE METHODS

Subjects

The genome-wide portion of this study included unrelated cases from our three generalized vitiligo GWAS: GWAS1[4] (n = 1514), GWAS2[6] (n = 450), and the current GWAS3 (n = 1090). All cases were of non-Hispanic-Latino European-derived white ancestry (EUR) from North America and Europe, and met strict clinical criteria for generalized vitiligo[59]. All controls were EUR individuals not specifically known to have any autoimmune disease or malignant melanoma, for whom genome-wide genotypes were obtained from the NCBI database of Genotypes and Phenotypes (dbGaP; phs000092.v1.p1, phs000125.v1.p1, phs000138.v2.p1, phs000142.v1.p1, phs000168.v1.p1, phs000169.v1.p1, phs000206.v3.p2, phs000237.v1.p1, phs000346.v1.p1, and phs000439.v1.p1 for GWAS1; phs000203.v1.p1, and phs000289.v2.p1 for GWAS2; phs000196.v2.p1, phs000303.v1.p1, phs000304.v1.p1, phs000368.v1.p1, phs000381.v1.p1, phs000387.v1.p1, phs000389.v1.p1, phs000395.v1.p1, phs000408.v1.p1, phs000421.v1.p1, phs000494.v1.p1, and phs000524.v1.p1 for GWAS3). Control datasets were matched to each of the three GWAS case datasets based on platforms used for genotyping. The independent replication study included 1827 unrelated EUR vitiligo cases and 2181 unrelated EUR controls not included in any of the GWAS. All subjects provided written informed consent. This study was carried out under the jurisdiction of each local IRB with overall oversight of the Colorado Multiple Institutional Review Board (COMIRB).

Genome-wide genotyping

Saliva specimens were obtained using a DNA self-collection kit (Oragene, DNA Genotek), and DNA was prepared using either the Maxwell apparatus/16 LEV Blood DNA kit (Promega) or the DNA Genotek Oragene Purifier protocol. DNA concentrations were measured using either the Qubit dsDNA BR Assay kit and Qubit 2.0 Fluorometer (Invitrogen) or the Promega QuantiFluor ONE dsDNA kit and GloMax®-Multi+ Detection System (Promega). Genome-wide genotyping for the GWAS3 cases was performed for 716,503 variants using Illumina Human OmniExpress BeadChips by the Center for Inherited Disease Research (CIDR). Genotype data for GWAS3 were deposited in dbGaP (phs000224.v3.p1). GWAS1[4] and GWAS2[6] have been described previously.

Genome-wide quality control procedures

Quality control filtering of genome-wide genotype data was carried out using PLINK[60], version 1.9. For each case/control dataset, DNA strand calls were reversed as needed. Cases were excluded on the basis of SNP call rates <98.5%, discordance between reported and observed sex, or inadvertent subject duplication, and controls were excluded on the basis of SNP call rates < 98%. SNPs were excluded on the basis of genotype missing rate > 2% for SNPs with observed minor allele frequency (MAF) ≥ 0.01, and for SNPs with MAF < 0.01 exclusion criteria were genotype missing rate >1% and < 5 minor alleles observed, or significant (P < 10−4) deviation from Hardy-Weinberg equilibrium. For X chromosome SNPs, Hardy-Weinberg equilibrium tests were performed in females, and SNPs with P < 10−4 were excluded from the final analysis. For each GWAS, only SNPs that existed in all case and control datasets were retained for imputation. Within each GWAS, subjects were excluded based on cryptic relatedness identified by pairwise identity-by-descent estimations (pi-hat > 0.0625), in which case the individual with lower SNP call rate was excluded. For each of the three GWAS, the cleaned case dataset was combined with one cleaned control dataset at a time and the genotype data of 270 subjects of Phase I and II of the International HapMap Project from 4 populations, and principal components analysis (PCA) was performed with EIGENSOFT[59] based on tag-SNPs (within which no pair were correlated with r2 >0.2) selected from genotyped SNPs. The first two eigenvectors were used to produce a PCA plot. A PCA plot was first made for cases and HapMap samples, and cases that were clearly separated from the main cluster of cases and HapMap EUR samples were excluded as outliers. A PCA plot of controls and HapMap samples was then made, and the same x and y coordinates that separated the case outliers from the main cluster of cases and HapMap EUR samples were used to identify control outliers. After all QC procedures, the final number of genotyped SNPs remaining in GWAS1, GWAS2, and GWAS3 were 464,902, 494,043, and 483,609, respectively. For autosomal analyses, the final numbers of cases and controls in GWAS1, GWAS2, and GWAS3 were 1,381 and 14,518, 413 and 5,209, and 1,059 and 17,678, respectively, whereas for X chromosome analyses, the final numbers of cases and controls in GWAS1, GWAS2, and GWAS3 were 1,380 and 9,439, 413 and 5,209, and 1,059 and 14,220, respectively. This sample size provided at least 85% power to detect associations with OR ≥ 1.22 at genome-wide significance (P = 5 × 10−8) for MAF ≥ 0.25.

Genome-wide Genotype Imputation

For each GWAS, we used SHAPEIT version2 to pre-phase genotypes to produce best-guess haplotypes, and then performed imputation with these estimated haplotypes using IMPUTE2 and the 1000 Genomes Project phase I integrated variant set version 3 (March, 2012) as the reference panel. All cryptic related individuals and outliers from each GWAS were included in the process to improve imputation accuracy, but were removed for the final analyses. Only genotypes with imputation INFO ≥ 0.5 were retained, which were combined with prior SNP genotype data. Imputed genotypes for variants with MAF ≥ 0.01 calculated from all 3 GWAS combined and without significant (P > 10−5) deviation from Hardy-Weinberg equilibrium were used in the final analysis, which included 8,721,242 autosome variants and 245,169 chromosome X variants.

Replication study genotyping and quality control procedures

For the replication study, genotyping was attempted for 379 variants using a custom Illumina Golden Gate array by CIDR. 71 SNPs were excluded on the basis of genotype missing rate > 2% (which includes apparent technical failures), or significant (P < 10−4) deviation from Hardy-Weinberg equilibrium. For X chromosome SNPs, Hardy-Weinberg equilibrium tests were performed in females. Subjects were excluded on the basis of SNP call rates <95%, or discordance between reported and observed sex. Unintended duplicate samples were identified by pairwise identity-by-descent estimations (pi-hat > 0.99), in which case the individual with lower SNP call rate was excluded. The final numbers of remaining cases and controls were 1,827 and 2,181, respectively, providing at least 80% power to replicate associations at P = 0.05 with Bonferroni correction for up to 48 independent tests for OR ≥ 1.23 for MAF ≥ 0.25.

Statistical analyses

To control for the effects of population stratification, we assigned cases and controls of each GWAS to homogenous clusters using GemTools[60], and performed Cochran-Mantel- Haenszel (CMH) analysis to test for association for each GWAS and the combined GWAS data, with the cluster variable defined by the case-control clusters from each GWAS. After removing variants within the extended MHC, the genomic inflation factor for GWAS1, GWAS2, and GWAS3 was 1.068, 1.059, and 1.013, respectively. For the combined GWAS1–GWAS2–GWAS3 genotype data for shared SNPs, the genomic inflation factor was 1.019. For the replication study, after quality control procedures we compared allele frequencies for the remaining 308 SNPs in the remaining 1,827 cases and 2,181 controls using the Cochran-Armitage trend test. ORs and 95% confidence limits were calculated by logistic regression analysis. We used CMH analysis to obtain ORs and P values for the combined GWAS plus the replication study data, with the cluster variable defined by the case-control clusters from each GWAS and the replication study data as one cluster. To analyze X chromosome SNPs, we assumed complete X-inactivation and similar effect size between males and females, with the effect of having an A allele in a male equal to the effect of having two A alleles in a female[63]. We thus coded males as homozygous for the allele carried for each variant and tested for association by CMH analysis to obtain ORs and P values for each GWAS, the combined GWAS, and the combined GWAS plus the replication study data, and by the Cochran- Armitage trend test for the replication study data. To test heterogeneity of associations across the three GWAS and the replication study data, we performed the Cochran Q test. The analysis was done with PLINK, version 1.07, using the ORs and standard errors estimated from the CMH analysis of each GWAS, and from logistic regression analysis of the replication study data. The I2 statistic from the Q test quantifies heterogeneity and ranges from 0% to 100%[64], with a value of 75% or greater typically taken to indicate a high degree of heterogeneity[65]. To test for multiple independent signals at each locus, we performed logistic regression analysis of each locus conditional on the most significantly associated variant, including as covariates in the model the significant principal components for each GWAS derived from GemTools[62] to control for population stratification, and used a stepwise procedure to select additional variants, one by one, until no additional variants showed conditional P values ≤ 1.0 × 10−5. If a tested variant and the conditional variant could not improve each other significantly (P ≥ 0.05 when comparing the two SNP model to a single SNP model), then both variants were considered to represent the same signal. We calculated the variance explained by a specific variant or a set of variants from the combined GWAS as the Pseudo R2 of a logistic regression model which included the specific variants tested.

Bioinformatic pathway and functional enrichment analyses

To screen for functional relationships among the vitiligo candidate genes, we carried out pathway analysis of the protein products of all positional candidate genes at all 48 confirmed loci and the seven suggestive loci using g:PROFILER[10], PANTHER[11], and STRING[12]. To assess enrichment of association signals in different functional genomic categories contributing to vitiligo heritability, we applied stratified LD score regression[51] to the combined CMH GWAS123 summary statistics. The regression model contained 24 overlapping functional categories, including coding, UTR, promoter and intronic regions, annotations for different histone marks, DNase I hypersensitivity site (DHS) regions, combined ChromHMM and Segway predictions, conserved regions in mammals, super-enhancers and FANTOM5 enhancers. For each of the 24 categories, a 500-bp window was used. Linkage disequilibrium data were provided by the LD score software, estimated from the EUR samples in the 1000 Genomes Project Phase 1. Enrichment per category was calculated by the ratio of the estimated proportion of heritability explained by the category over the proportion of the markers in the category.

PrediXcan and Monocyte eQTL Co-Localization analyses

We carried out a gene-based test of association of vitiligo with “imputed” expression profiles for 11,553 autosomal genes in whole blood using PrediXcan[58]. The analysis included 2,853 cases and 37,412 controls from the combined GWAS. Association testing between expression estimates for each gene and affection status for vitiligo was performed by generalized logistic regression. P values were adjusted for the number of genes tested (n = 11,553). NRROS, ZC3H7B, TNFRSF11A, BCL2L12, RALY, ASIP, OCA2, and TYR were excluded from the PrediXcan analysis due to poor prediction of gene expression in blood cells. We derived expression quantitative trait loci (eQTLs) in peripheral blood monocytes from 414 EUR subjects with paired genotyping and gene expression data[66]. SHAPEIT version2 was used to pre-phase genotypes to produce best-guess haplotypes with imputation performed using IMPUTE2 and the 1000 Genomes Project phase I integrated variant set version 3 (March, 2012) as reference panel. We tested for co-localization of eQTL and vitiligo GWAS autosomal association patterns as described[67,68]. Vitiligo susceptibility loci were defined by windows of robust association plus an added 100 kb buffer on both sides. eQTL probes were selected by choosing probes that resided within these windows. Probe quality annotation was performed using ReMOAT[69] and all probes with an annotation of “bad” were removed. After removing non-autosomal loci and duplicate probe IDs, a total of 904 probes remained. All vitiligo susceptibility loci contained at least one probe with the exception of the gene desert 3’ of TYR, for which the only probe that intersected the locus was excluded due to ReMOAT annotation of “bad”. Within each locus window, all SNPs were tested for association with all probes using linear regression. P values, MAF for each SNP and respective sample sizes were used as input to test for co- localization, simultaneously testing five mutually exclusive hypotheses by generating 5 corresponding posterior probabilities (PP): Posterior probabilities were calculated using the R package “coloc” using default settings for prior probabilities of association. Co-localization was assessed as per Guo et al.[68]; significant co-localization was PP3+PP4 > 0.99 and PP4:PP3 > 5, and suggestive co-localization was PP3+PP4 > 0.95 and PP4:PP3 > 3. H0 (PP0): There is no association with either the GWAS or the eQTL. H1 (PP1): There is association for the GWAS only. H2 (PP2): There is association for the eQTL only. H3 (PP3): There is association for both the GWAS and the eQTL, but the associated variants are different for the GWAS and the eQTL. H4 (PP4): The associated variants are the same for both the GWAS and the eQTL (co-localization).
  68 in total

1.  Quantifying heterogeneity in a meta-analysis.

Authors:  Julian P T Higgins; Simon G Thompson
Journal:  Stat Med       Date:  2002-06-15       Impact factor: 2.373

2.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

3.  The definition and assessment of vitiligo: a consensus report of the Vitiligo European Task Force.

Authors:  Alain Taïeb; Mauro Picardo
Journal:  Pigment Cell Res       Date:  2007-02

4.  Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci.

Authors:  Espen Melum; Andre Franke; Christoph Schramm; Tobias J Weismüller; Daniel Nils Gotthardt; Felix A Offner; Brian D Juran; Jon K Laerdahl; Verena Labi; Einar Björnsson; Rinse K Weersma; Liesbet Henckaerts; Andreas Teufel; Christian Rust; Eva Ellinghaus; Tobias Balschun; Kirsten Muri Boberg; David Ellinghaus; Annika Bergquist; Peter Sauer; Euijung Ryu; Johannes Roksund Hov; Jochen Wedemeyer; Björn Lindkvist; Michael Wittig; Robert J Porte; Kristian Holm; Christian Gieger; H-Erich Wichmann; Pieter Stokkers; Cyriel Y Ponsioen; Heiko Runz; Adolf Stiehl; Cisca Wijmenga; Martina Sterneck; Severine Vermeire; Ulrich Beuers; Andreas Villunger; Erik Schrumpf; Konstantinos N Lazaridis; Michael P Manns; Stefan Schreiber; Tom H Karlsen
Journal:  Nat Genet       Date:  2010-12-12       Impact factor: 38.330

Review 5.  A symbiotic concept of autoimmunity and tumour immunity: lessons from vitiligo.

Authors:  P K Das; R M van den Wijngaard; A Wankowicz-Kalinska; I C Le Poole
Journal:  Trends Immunol       Date:  2001-03       Impact factor: 16.687

6.  Common variants in FOXP1 are associated with generalized vitiligo.

Authors:  Ying Jin; Stanca A Birlea; Pamela R Fain; Christina M Mailloux; Sheri L Riccardi; Katherine Gowan; Paulene J Holland; Dorothy C Bennett; Margaret R Wallace; Wayne T McCormack; E Helen Kemp; David J Gawkrodger; Anthony P Weetman; Mauro Picardo; Giovanni Leone; Alain Taïeb; Thomas Jouary; Khaled Ezzedine; Nanny van Geel; Jo Lambert; Andreas Overbeck; Richard A Spritz
Journal:  Nat Genet       Date:  2010-06-06       Impact factor: 38.330

7.  PGC-1 coactivators regulate MITF and the tanning response.

Authors:  Jonathan Shoag; Rizwan Haq; Mingfeng Zhang; Laura Liu; Glenn C Rowe; Aihua Jiang; Nicole Koulisis; Caitlin Farrel; Christopher I Amos; Qingyi Wei; Jeffrey E Lee; Jiangwen Zhang; Thomas S Kupper; Abrar A Qureshi; Rutao Cui; Jiali Han; David E Fisher; Zoltan Arany
Journal:  Mol Cell       Date:  2012-11-29       Impact factor: 17.970

8.  Markedly reduced incidence of melanoma and nonmelanoma skin cancer in a nonconcurrent cohort of 10,040 patients with vitiligo.

Authors:  Andrea Paradisi; Stefano Tabolli; Biagio Didona; Luciano Sobrino; Nicoletta Russo; Damiano Abeni
Journal:  J Am Acad Dermatol       Date:  2014-09-19       Impact factor: 11.527

9.  Genome-wide association scan in north Indians reveals three novel HLA-independent risk loci for ulcerative colitis.

Authors:  Garima Juyal; Sapna Negi; Ajit Sood; Aditi Gupta; Pushplata Prasad; Sabyasachi Senapati; Jacques Zaneveld; Shalini Singh; Vandana Midha; Suzanne van Sommeren; Rinse K Weersma; Jurg Ott; Sanjay Jain; Ramesh C Juyal; B K Thelma
Journal:  Gut       Date:  2014-05-16       Impact factor: 23.059

10.  Re-Annotator: Annotation Pipeline for Microarray Probe Sequences.

Authors:  Janine Arloth; Daniel M Bader; Simone Röh; Andre Altmann
Journal:  PLoS One       Date:  2015-10-01       Impact factor: 3.240

View more
  78 in total

Review 1.  Vitiligo: Focus on Clinical Aspects, Immunopathogenesis, and Therapy.

Authors:  Katia Boniface; Julien Seneschal; Mauro Picardo; Alain Taïeb
Journal:  Clin Rev Allergy Immunol       Date:  2018-02       Impact factor: 8.667

Review 2.  Genetics of Vitiligo.

Authors:  Richard A Spritz; Genevieve H L Andersen
Journal:  Dermatol Clin       Date:  2017-04       Impact factor: 3.478

3.  Dynamic demethylation of the IL2RA promoter during in vitro CD4+ T cell activation in association with IL2RA expression.

Authors:  Marie-Pierre Belot; Anne-Laure Castell; Sophie Le Fur; Pierre Bougnères
Journal:  Epigenetics       Date:  2018-08-10       Impact factor: 4.528

Review 4.  The genetic architecture of vitiligo.

Authors:  Genevieve H L Roberts; Stephanie A Santorico; Richard A Spritz
Journal:  Pigment Cell Melanoma Res       Date:  2019-12-04       Impact factor: 4.693

5.  Family Clustering of Autoimmune Vitiligo Results Principally from Polygenic Inheritance of Common Risk Alleles.

Authors:  Genevieve H L Roberts; Subrata Paul; Daniel Yorgov; Stephanie A Santorico; Richard A Spritz
Journal:  Am J Hum Genet       Date:  2019-07-18       Impact factor: 11.025

Review 6.  The convergence theory for vitiligo: A reappraisal.

Authors:  Roopal V Kundu; Julia M Mhlaba; Stephanie M Rangel; I Caroline Le Poole
Journal:  Exp Dermatol       Date:  2018-06-28       Impact factor: 3.960

7.  Macrophage Migration Inhibitory Factor in Alopecia Areata and Vitiligo: A Case-Controlled Serological Study.

Authors:  Fatma Eldesouky; Al-Shimaa M Ibrahim; Samar M Sharaf
Journal:  J Clin Aesthet Dermatol       Date:  2020-10-01

8.  Cross-disorder analysis of schizophrenia and 19 immune-mediated diseases identifies shared genetic risk.

Authors:  Jennie G Pouget; Buhm Han; Yang Wu; Emmanuel Mignot; Hanna M Ollila; Jonathan Barker; Sarah Spain; Nick Dand; Richard Trembath; Javier Martin; Maureen D Mayes; Lara Bossini-Castillo; Elena López-Isac; Ying Jin; Stephanie A Santorico; Richard A Spritz; Hakon Hakonarson; Constantin Polychronakos; Soumya Raychaudhuri; Jo Knight
Journal:  Hum Mol Genet       Date:  2019-10-15       Impact factor: 6.150

9.  Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels.

Authors:  Wei Zhou; Lars G Fritsche; Sayantan Das; He Zhang; Jonas B Nielsen; Oddgeir L Holmen; Jin Chen; Maoxuan Lin; Maiken B Elvestad; Kristian Hveem; Goncalo R Abecasis; Hyun Min Kang; Cristen J Willer
Journal:  Genet Epidemiol       Date:  2017-09-01       Impact factor: 2.135

10.  Deep genotype imputation captures virtually all heritability of autoimmune vitiligo.

Authors:  Genevieve H L Roberts; Stephanie A Santorico; Richard A Spritz
Journal:  Hum Mol Genet       Date:  2020-03-27       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.