Literature DB >> 20169177

Genome-wide association study in Asian populations identifies variants in ETS1 and WDFY4 associated with systemic lupus erythematosus.

Wanling Yang1, Nan Shen, Dong-Qing Ye, Qiji Liu, Yan Zhang, Xiao-Xia Qian, Nattiya Hirankarn, Dingge Ying, Hai-Feng Pan, Chi Chiu Mok, Tak Mao Chan, Raymond Woon Sing Wong, Ka Wing Lee, Mo Yin Mok, Sik Nin Wong, Alexander Moon Ho Leung, Xiang-Pei Li, Yingyos Avihingsanon, Chun-Ming Wong, Tsz Leung Lee, Marco Hok Kung Ho, Pamela Pui Wah Lee, Yuk Kwan Chang, Philip H Li, Ruo-Jie Li, Lu Zhang, Wilfred Hing Sang Wong, Irene Oi Lin Ng, Chak Sing Lau, Pak Chung Sham, Yu Lung Lau.   

Abstract

Systemic lupus erythematosus is a complex and potentially fatal autoimmune disease, characterized by autoantibody production and multi-organ damage. By a genome-wide association study (320 patients and 1,500 controls) and subsequent replication altogether involving a total of 3,300 Asian SLE patients from Hong Kong, Mainland China, and Thailand, as well as 4,200 ethnically and geographically matched controls, genetic variants in ETS1 and WDFY4 were found to be associated with SLE (ETS1: rs1128334, P = 2.33x10(-11), OR = 1.29; WDFY4: rs7097397, P = 8.15x10(-12), OR = 1.30). ETS1 encodes for a transcription factor known to be involved in a wide range of immune functions, including Th17 cell development and terminal differentiation of B lymphocytes. SNP rs1128334 is located in the 3'-UTR of ETS1, and allelic expression analysis from peripheral blood mononuclear cells showed significantly lower expression level from the risk allele. WDFY4 is a conserved protein with unknown function, but is predominantly expressed in primary and secondary immune tissues, and rs7097397 in WDFY4 changes an arginine residue to glutamine (R1816Q) in this protein. Our study also confirmed association of the HLA locus, STAT4, TNFSF4, BLK, BANK1, IRF5, and TNFAIP3 with SLE in Asians. These new genetic findings may help us to gain a better understanding of the disease and the functions of the genes involved.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20169177      PMCID: PMC2820522          DOI: 10.1371/journal.pgen.1000841

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Systemic lupus erythematosus (SLE) is a prototype autoimmune disease characterized by auto-antibody production and multi-organ damage. Genetic factors are known to play an important role in the disease, with the monozygotic twin concordance rate between 20–59, and the risk for siblings of affected individuals 30 times higher than that for the general population [1]–[3]. There are also population differences for the disease both in terms of genetic susceptibility and disease manifestations. African Americans, Hispanics and Asians all have higher disease prevalence than Caucasians; with Asians known to have more lupus nephritis than patients of European ancestry [4],[5]. Genome-Wide Association studies (GWAS) have dramatically changed the landscape of SLE genetics, with a pace of discovery the field has never seen before. In less than two years time, STAT4 [6], ITGAM [7]–[9], BLK [8], PXK and KIAA1542 [7], BANK1 [10] and TNFAIP3 [11],[12] and several other genes have been identified as associated with SLE [13]–[16]. More susceptibility loci were reported recently in two other GWAS on this disease [17],[18]. Despite varied disease prevalence and severity across different populations, it is noteworthy that most previous studies were conducted on patients of European ancestry with under-representation of other ethnicities. We previously examined some of the GWAS findings from populations of European origin [19]. In addition to confirming the association of STAT4 and BLK with SLE in our population, our data indicated differences between the Asian and Caucasian populations. For example, our study did not detect any significant disease association for PXK, a result that was later confirmed by an independent study on a Korean population [20]. Data from our study indicated that, although the risk alleles in ITGAM are rare in Asians (<2%), they are risk factors in our populations and are closely related to lupus nephritis in particular [21]. In this study, we first genotyped 320 SLE patients collected in Hong Kong by the Illumina 610-Quad Beadchip and analyzed the data against 1500 control individuals genotyped on the same platform. Selected SNPs were then replicated in four independent sample collections from Hong Kong, Shanghai and Anhui, (China), as well as Bangkok (Thailand). Genetic variants in and around two genes, ETS1 and WDFY4, were identified as associated with SLE with genome-wide significance. Functional characterization of the risk alleles also supported potential roles of these genetic variants in disease pathogenesis.

Results

The association of HLA, STAT4, BLK, BANK1, IRF5, TNFAIP3 with SLE

The whole-genome genotyping data was thoroughly examined by quality control measures and by population substructure analysis. Analysis of principal component using Eigenstrat [22] did reveal that the samples collected in Hong Kong clustered together, suggesting that confounding population substructure or admixture is not a major concern if Hong Kong controls were used in association analyses. It did indicate, however, that samples collected in Taiwan (obtained from deCODE Genetics) and Han Chinese in Beijing (HCB, available from HapMap) cluster very differently from Hong Kong samples, suggesting population substructure among Chinese living in different geographical regions and potential pitfalls in association studies when cases and controls are not well-matched (Figure 1).
Figure 1

Principal component analysis of Chinese samples collected in Hong Kong, Taiwan, and Beijing.

The red dots denote SLE patient samples collected in Hong Kong used in our GWAS study and the blue dots are unaffected samples from Beijing (A), Taiwan (B), and Hong Kong (C). The data on Taiwan samples was received from deCODE Genetics; the data on Han Chinese in Beijing is available from HapMap. The data on Hong Kong controls were from other studies in the University of Hong Kong that were genotyped by the same platform. Repeat of Eigenstrat analysis for 20 times using randomly chosen 100,000 SNPs each time from all the available SNPs produced similar results.

Principal component analysis of Chinese samples collected in Hong Kong, Taiwan, and Beijing.

The red dots denote SLE patient samples collected in Hong Kong used in our GWAS study and the blue dots are unaffected samples from Beijing (A), Taiwan (B), and Hong Kong (C). The data on Taiwan samples was received from deCODE Genetics; the data on Han Chinese in Beijing is available from HapMap. The data on Hong Kong controls were from other studies in the University of Hong Kong that were genotyped by the same platform. Repeat of Eigenstrat analysis for 20 times using randomly chosen 100,000 SNPs each time from all the available SNPs produced similar results. Genome-wide association analysis confirmed significant association of some established susceptibility genes in our population, including SNPs in the HLA locus, STAT4, TNFSF4, BANK1, TNFAIP3, IRF5 and BLK (Table 1 and Figure 2). Similar to the Caucasian findings [12],[23], the risk allele for rs2230926 in TNFAIP3 is low in frequency but with a relatively large effect size in disease association. SNP rs9271366 located between HLA-DRB1 and HLA-DQA1 is the most significantly associated SNP in the whole genome in our study. Although the HLA locus has been consistently shown to be the locus conferring the largest effect size with SLE association, there is little overlap between previous GWAS findings from populations of European ancestry and our results. The most significant SNPs in the Caucasian data [7],[8] are either monomorphic in our population or are not associated with disease susceptibility based on our GWAS result, something worth further pursuit in future studies. Association of STAT4, BLK, IRF5, BANK1 and TNFSF4 with the disease has been reported in our population previously [19],[21],[24],[25].
Table 1

Confirmation of susceptibility genes in the Hong Kong Chinese population by GWAS.

GenesSNPCHRPOSITIONA1F_AF_UA2POR
TNFSF4 rs12343151171445086T0.53940.4678C1.04E-031.33
rs7048401171492818G0.49050.4204T1.21E-031.33
STAT4 rs75748652191672878T0.45890.3356G4.22E-091.68
rs101682662191644049T0.45430.3369C2.17E-081.64
BANK1 rs45228654102934911G0.38010.4577A3.56E-040.73
rs105164874102970099T0.13920.1900C2.82E-030.69
HLA-DRB1/HLA-DQA1 rs9271366632694832G0.22310.1283A7.67E-101.95
HLA-DQB1/HLA-DQA2 rs9275328632774800T0.16560.2594C5.73E-070.57
TNFAIP3 rs22309266138237759G0.05990.0225T3.42E-072.77
rs37571736138231847C0.06780.0356T2.08E-041.97
IRF5 rs7293027128356196C0.25390.3446A1.01E-050.65
rs47281427128361203A0.17030.1201G6.13E-041.50
BLK rs2736340811381382C0.20980.2974T8.56E-060.63
rs2254546811381089A0.18770.2720G1.03E-050.62

A1: minor allele; F_A: minor allele frequency in cases; F_U: minor allele frequency in controls.

Figure 2

Confirmation of association of HLA locus, and TNFSF4, STAT4, TNFAIP3, IRF5, BLK with SLE.

Shown are association results comparing SLE patients with controls collected in Hong Kong analyzed by Plink (−log10(P-value) of SNPs). The best SNP in Chromosome 11 is around ETS1 gene (rs6590330), which is in high LD with rs1128334. And the best SNP in Chromosome 10 is in WDFY4 (rs877819).

Confirmation of association of HLA locus, and TNFSF4, STAT4, TNFAIP3, IRF5, BLK with SLE.

Shown are association results comparing SLE patients with controls collected in Hong Kong analyzed by Plink (−log10(P-value) of SNPs). The best SNP in Chromosome 11 is around ETS1 gene (rs6590330), which is in high LD with rs1128334. And the best SNP in Chromosome 10 is in WDFY4 (rs877819). A1: minor allele; F_A: minor allele frequency in cases; F_U: minor allele frequency in controls. To answer the question on whether there are still other genetic variants contributing to disease susceptibility, we reexamined the Q-Q plot comparing expected and observed P values by removing all the SNPs in the known susceptibility loci mentioned above. After removal of these SNPs, we still observed an excess of association signal (Figure 3), suggesting involvement of additional susceptibility loci for this disease. Since our GWAS involved a limited number of patients, and is therefore prone to false positive and false negative findings, we selected SNPs for replication based on both their significance in GWAS results as well as the function and expression pattern of the nearby genes. Selected SNPs were replicated first using Sequenom genotyping on limited number of additional samples (360 cases and 360 controls), and variants with significant association in the Sequenom data were then examined by TaqMan genotyping on a much expanded sample collection from four independent cohorts. SNPs in and around two genes, v-ets erythroblastosis virus E26 oncogene homolog 1 (avian) (ETS1) and the WDFY family member 4 (WDFY4) regions were chosen based both on initial GWAS data as well as their known function (in the case of ETS1) and expression pattern (in the case of WDFY4). Table 2 displays the SNPs in these two loci that showed disease association with a P value<0.01 from our GWAS data. Initial replication by Sequenom showed consistent results with the GWAS trend for these two genes and they were further tested in the remaining samples.
Figure 3

Quantile-Quantile plot of expected (x-axis) and observed (y-axis) −log10(P value) distribution in our GWAS analysis.

(A) Considering all the available SNPs. (B) SNPs in and around HLA locus, and TNFSF4, STAT4, TNFAIP3, IRF5, BLK, as well as BANK1 were excluded from analysis.

Table 2

SNPs in and around ETS1 and WDFY4 that showed significant association with SLE in the GWAS data.

SNPCHRPOSITIONA1F_AF_UA2POR
ETS1 rs659033011127816269A0.44010.3466G8.92E-061.48
rs793208811127822378T0.33120.3893G6.24E-030.78
rs1089387211127830763T0.51420.4299C1.04E-041.40
rs493733311127835730T0.51100.4279C1.28E-041.40
rs1222394311127900838T0.10090.1409G7.25E-030.68
WDFY4 rs105089081049643864A0.50470.4356G1.47E-031.32
rs115950651049647888G0.51420.4379A4.54E-041.36
rs65375751049649705T0.44320.3819C4.08E-031.29
rs29473441049654696T0.52840.4574C1.14E-031.33
rs26630311049657309A0.52840.4574G1.14E-031.33
rs24485411049684822G0.44160.3532A2.76E-051.45
rs24485391049685217A0.23100.1768C1.49E-031.40
rs108576501049700029G0.36440.2735A4.63E-061.52
rs8778191049712957A0.26180.1758G5.66E-071.66
rs79221691049715462T0.35490.2678G9.83E-061.50
rs111015351049734437T0.46690.3960C9.73E-041.34
rs26630491049736767C0.35170.2654T1.12E-051.50
rs26630521049739401C0.35170.2651T1.03E-051.50
rs26208811049753117A0.33390.2609G2.42E-041.42
rs26716921049767825A0.32810.2594G4.14E-041.39
rs29432441049770895C0.24760.1798A8.10E-051.50
rs26630411049777543C0.35020.2722T7.96E-051.44
rs111015581049783035A0.33120.2631G4.85E-041.39
rs19135171049789060A0.33280.2617G2.70E-041.41
rs70946101049792187C0.33440.2780A4.44E-031.30

Quantile-Quantile plot of expected (x-axis) and observed (y-axis) −log10(P value) distribution in our GWAS analysis.

(A) Considering all the available SNPs. (B) SNPs in and around HLA locus, and TNFSF4, STAT4, TNFAIP3, IRF5, BLK, as well as BANK1 were excluded from analysis.

Association of ETS1 with SLE

Making use of the remaining samples from Hong Kong not included in GWAS, and sample collections from Shanghai and Anhui, China, and Bangkok, Thailand, we went on to replicate the whole-genome findings on these two loci. SNPs in ETS1 listed in Table 2, rs12223943, rs7932088 and rs10893872, were examined in the expanded samples. SNPs rs10893872 and rs4937333 have absolute LD with each other, so only rs10893872 was chosen for replication. SNP rs1128334 was chosen in the place of rs6590330 for replication due to its high LD with rs6590330 (r2 = 0.97, Figure 4) and its relative position to the gene (3′-UTR) and predicted effect on microRNA binding. For SNPs rs12223943 and rs7932088, the association seen from whole-genome data was inconclusive (data not shown) in the replication stage and were not further pursued. We genotyped rs10893872 and rs1128334 in all the samples from the four cohorts and both were found to be highly associated with SLE (Table 3).
Figure 4

Linkage disequilibrium and sequence conservation of SNPs in ETS1.

Shown are LD for significant SNPs in and around ETS1 detected by GWAS (A), and replicated SNPs in the 3′-UTR and downstream of the gene (B), and sequence conservation for sequences around SNP rs1128334 among different species (C).

Table 3

SNPs showed significant association with SLE in a joint analysis of four independent Asian cohorts.

SNPChrNearby GeneLocation relative to geneMINOR ALLELE FREQUENCIES (MAF, %)JOINT P VALUEOR95% CI OF OR
Hong KongShanghaiBangkokHefei, China
Case N = 1073Cont N = 1742Case N = 920Cont N = 1053Case N = 314Cont N = 519Case N = 951Cont N = 860
rs112833411 ETS1 3′-UTR40.635.138.734.535.131.539.829.62.3E-111.291.20–1.39
rs1089387211 ETS1 downstream47.342.845.642.149.142.344.038.11.8E-71.211.13–1.31
rs709739710 WDFY4 Coding, R1816Q35.327.537.234.341.933.839.233.68.15E-121.301.21–1.40
rs87781910 WDFY4 intron23.416.417.318.628.322.220.617.35.57E-71.261.15–1.37

Linkage disequilibrium and sequence conservation of SNPs in ETS1.

Shown are LD for significant SNPs in and around ETS1 detected by GWAS (A), and replicated SNPs in the 3′-UTR and downstream of the gene (B), and sequence conservation for sequences around SNP rs1128334 among different species (C). Independence test by logistic regression by Plink pointed to a major contribution from rs1128334. And indeed, the sequence around rs1128334 has high sequence conservation among different species (Figure 4). Haplotype analysis indicated that the TA haplotype formed by the two SNPs (rs10893872 and rs1128334) is the major risk haplotype, whilst the CG haplotype is the major protective haplotype (Table 4) with other haplotypes having low allele frequencies. Subphenotype analysis was also performed for these SNPs, and both SNPs were found to have larger effect sizes for patients with lupus nephritis in all four cohorts, although no statistical significance was reached in any cases. Analysis of other subphenotypes showed insignificant, or inconsistent results among different cohorts.
Table 4

ETS1 Haplotype analysis on SLE association.

rs10893872-rs1128334F_A (%)F_U(%) P - Value
TA haplotypeHong Kong40.3533.25.89E-06
Anhui36.9928.511.25E-04
Bangkok35.5831.020.06
Shanghai37.9332.930.0023
CG haplotypeHong Kong51.8656.630.0035
Anhui53.7162.310.00022
Bangkok50.7657.280.0113
Shanghai53.3556.060.11

Allelic expression of ETS1 in PBMC

Since rs1128334 is located at the 3′-UTR region of the gene, it is predicted that it may have an effect on the expression level of ETS1. Therefore, we examined allelic expression of ETS1 gene for the two alleles of rs1128334, “A” and “G”, from healthy individuals heterozygous for this SNP (N = 33). This assay assesses directly whether the two alleles of the SNP correlate with different steady state mRNA levels. Pyrosequencing results from PBMC of healthy individuals heterozygous on the SNP showed a significantly higher expression from the “G” allele than from the risk “A” allele, with a P value<0.0001 (Figure 5).
Figure 5

Allelic expression of ETS1 on SNP rs1128334 in PBMC of healthy individuals.

PBMC cDNA processed from 33 healthy individuals heterozygous on rs1128334 were used for allelic expression detection of ETS1 by pyrosequencing. (A) A case example of detection on the “A” allele and the “G” allele from both DNA and cDNA samples from the same individual. (B) The ratio of G/A allelic detection for both DNA and cDNA samples. The median G/A ratio for DNA is 1.09 (95% CI: 1.08–1.11) and the median G/A ratio for cDNA expression is 1.32 (95% CI: 1.21–1.43), P<0.0001 by paired student's t test.

Allelic expression of ETS1 on SNP rs1128334 in PBMC of healthy individuals.

PBMC cDNA processed from 33 healthy individuals heterozygous on rs1128334 were used for allelic expression detection of ETS1 by pyrosequencing. (A) A case example of detection on the “A” allele and the “G” allele from both DNA and cDNA samples from the same individual. (B) The ratio of G/A allelic detection for both DNA and cDNA samples. The median G/A ratio for DNA is 1.09 (95% CI: 1.08–1.11) and the median G/A ratio for cDNA expression is 1.32 (95% CI: 1.21–1.43), P<0.0001 by paired student's t test.

Association of WDFY4 with SLE

Two of the SNPs in WDFY4 that showed the most significant association with the disease in our GWAS, SNPs rs10857650 and rs877819, were selected for further replication. In addition, three nonsynonymous SNPs in this gene not genotype by the Illumina 610-Quad Beadchip, rs2170132 (Ser1528Pro), rs7097397 (Arg1816Gln) and rs2292584 (Pro3118Leu), were also selected to test for disease association, aiming at identifying functional variants in this gene. SNPs rs2170132 and rs2292584 did not show significant difference between the cases and the controls in the Hong Kong cohort (Table 5) and Thai samples (data not shown) and were not further tested in other cohorts. Genotyping results on rs10857650 using TaqMan showed significant discordance with results from the Illumina Beadchip, and was thus removed from further analysis.
Table 5

Replication of disease association for WDFY4 SNPs in the Hong Kong samples.

SNPPosition to the geneBPA1F_AF_UA2CHISQPOR
Rs2170132Ser1528Pro50013402T0.16790.1617C0.27930.59711.046
rs7097397Arg1816Gln50025396G0.35340.2753A2.88E+017.88E-081.438
rs877819intron50042951A0.23390.1639G3.12E+012.33E-081.557
Rs2292584Pro3118Leu50816415T0.36790.3416C3.0750.07951.122
SNP rs7097397 and rs877819 were confirmed to have significant association with the disease (Table 3). Conditional logistic regression test indicate that the nonsynonymous SNP coded by rs7097397 is probably the functional variant, with P = 1.01×10−5 when controlling the effect of rs877819. Independent contribution from rs877819 is questionable, with a P value of 0.088 considering the effect of rs7097397 in the same test. The two SNPs have intermediate LD (r2 = 0.44, Figure 6A). The genetic result is consistent with the fact that the arginine residue at 1816 in WDFY4 protein is well conserved among orthologs in different mammals (Figure 6B). Preliminary analysis on subphenotype stratification suggests that this nonsynonymous change may be more closely involved in male and early onset patients in a case only analysis (onset age 12 and below vs. above, OR = 1.96, P = 0.0021; male vs. female: OR = 1.52, P = 0.023).
Figure 6

LD among WDFY4 SNPs examined in this study (A) and sequence conservation of the three nonsynonymous variations among various species (B).

Association of the SNPs in the two genes with disease risk was also corrected by logistic regression using age and sex as covariates, and the associations found in this study all remain highly significant after all the corrections.

Discussion

Several GWAS on SLE have been conducted on populations of European ancestry [7],[8], but populations of Asian or African ancestry were seriously underrepresented. Only during the process of submission of our current work, a GWAS study on Chinese populations was reported [18]. Considering the population differences in both disease prevalence and clinical manifestations, GWAS on non-Caucasian populations may have novel findings and help to elucidate the differences between populations. An interesting analysis result from our GWAS data is the difference between Hong Kong samples and samples collected in Taiwan and Beijing, shown by principal component analysis (Figure 1). It suggests population substructure for Chinese living in different regions, which may cause spurious findings in association studies when cases and controls are not well matched. With most of the genetic variants of relatively larger effect sizes already being identified, GWAS becomes more susceptible to effects from mismatches between cases and controls in dealing with SNPs of smaller effect sizes. Our analysis echoed two very recent reports delineating population substructures in Chinese populations living in different geographical regions [26],[27]. Ets-1 is a member of the ETS family of transcription factors that share a unique Ets DNA binding domain. They control a wide variety of cellular processes including cell proliferation and differentiation [28]. Ets-1-deficient mice develop lupus-like disease characterized by high titers of IgM and IgG autoantibodies, immune complex-mediated glomerulonephritis, and local activation of complement [29]. Ets-1 is also involved in many cellular abnormalities that are known to participate in SLE pathogenesis as illustrated in Figure 7.
Figure 7

Involvement of ETS1 in Th17 cell and B lymphocyte development and autoimmunity.

Ets-1 is a negative regulator of terminal differentiation of B cells and plays critical roles in maintaining B cell identity [30]–[32]. Ets-1-deficient B cells were present in normal numbers but have a large proportion of IgM plasma cells [33]. Ets-1 blocks the function of B-lymphocyte-induced maturation protein 1 (Blimp-1), an essential transcription factor for plasma cells [34]. The number and frequency of plasma cells were known to correlate with disease activity and the titer of anti-dsDNA antibodies in SLE [35],[36]. Ets-1 is also a negative regulator of Th17 cell differentiation, and naïve CD4+ T cells deficient in Ets-1 undergo greatly enhanced differentiation into Th17 cells when cultured in vitro under Th17-skewing conditions [37]. Th17 cells with specificity for self-antigens are known to be highly pathogenic and lead to the development of inflammation and severe autoimmunity [38]. Higher plasma IL-17, IL-23 and higher number of Th17 cells in SLE patients were reported and correlated positively with SLE disease index (SLEDAI) [39]–[41]. IL-17 and IL-21 produced by Th17 cells may also induce B cell terminal differentiation [42]–[44]. In this study, we found that in the PBMC of healthy individuals, expression of ETS1 from the risk “A” allele is reduced comparing to that from the “G” allele. The expression level of ETS1 may be tightly regulated. It was shown that resting T cells express high levels of ETS1 mRNA and protein, which decreased to very low levels upon T cell activation [45]. Lower expression of ETS1 for the risk allele carriers may play a role in disease pathogenesis through increased differentiation and activity of both plasma cells and Th17 cells. It is likely that the association SNPs identified in this study may affect the response of ETS1 gene to other upstream signals. SNP rs1128334 locates in the 3′-UTR of ETS1, and rs10893872 is in absolute LD with rs4937333, another SNP that is also located in the 3′-UTR of the gene and both SNPs are on putative microRNA (miRNA) binding sites. In a recent study by Du et al, the expression level of a microRNA, miR-326, was found to be related to disease severity in patients with multiple sclerosis and mice with experimental autoimmune encephalomyelitis. ETS1 was shown to be the major target of miR-326, through downregulation of which miR-326 promoted the generation of Th17 cells both in vitro and in vivo [46]. Another microRNA, miRNA-146a, was also found to be involved in SLE pathogenesis [47]. WDFY family member 4 (WDFY4, NCBI GeneID: 57705) codes for a huge protein (3184 amino acid residues) with unknown function. Its closest paralog is WD repeat and FYVE domain containing 3 (WDFY3, NCBI GeneID: 23001). Similar to WDFY3, WDFY4 does contain WD40 domains and a BEACH (Beige and chediak-kigashi) domain, although FYVE zinc finger domain is truncated. Its sequence is well conserved among various species. For example, there is an 84% sequence similarity between the human protein and its orthologs from Bos Taurus and Canis lupus familiaris. And there is a 42% identity between the human protein and protein XP_701288.3 in Danio Rerio (zebra fish). Although protein XP_701288.3 is annotated as WDFY3 in NCBI, it has a higher sequence similarity with human WDFY4 than with WDFY3. WD40 domain is found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly, while the BEACH domains are implicated in membrane trafficking. While very little is known about the potential function of this well conserved protein, an interesting phenomenon is that the gene is predominantly expressed in immune tissues such as lymph node, spleen, thymus and tonsil (UniGene Hs.287379, http://www.ncbi.nlm.nih.gov/UniGene/ESTProfileViewer.cgi?uglist=Hs.287379), unlike WDFY3, which is expressed in a wide variety of tissues and organs. After submission of our work, Han et al reported their GWAS work on SLE on Chinese populations [18] and reported genome-wide significant association signals on ETS1 (rs6590330, downstream of the gene) and WDFY4 (rs1913517, in the intron of both WDFY4 and LRRC18, leucine rich repeat containing 18). Independent identification of these two genes in SLE susceptibility underlined the validity of the findings. Identifying susceptibility genes provided a good start in our effort to elucidate the disease mechanisms and the functions of the genes involved, although many questions remain to be answered. Characterization of the functions of the genes in autoimmunity may eventually help us to translate genetic findings into clinical and therapeutic applications.

Materials and Methods

Ethics statement

This study was conducted according to the principles expressed in the Declaration of Helsinki. The Hong Kong study was approved by the Institutional Review Board of the University of Hong Kong and Hospital Authority, Hong Kong West Cluster, New Territory West Cluster, and Hong Kong East Cluster. The study on Shanghai, Anhui and Thai samples was approved by the Institutional Review Board of Renji Hospital, Research Ethics Committee of Anhui Medical University and the Ethics Committee of the Faculty of Medicine, Chulalongkorn University, respectively. All patients provided written informed consent for the collection of samples and subsequent analysis.

Subjects

1073 SLE samples collected in Hong Kong were from four hospitals in Hong Kong Island and the New Territories: Queen Mary Hospital, Tuen Mun Hospital, Queen Elizabeth Hospital and Pamela Youde Nethersole Eastern Hospital. The patients were all of self-reported Chinese ethnicity living in Hong Kong. The average onset age was 28 years old and the ratio of female to male patients was 9∶1. About half of the patients had renal involvement, and about 70% tested positive for anti-dsDNA antibodies. The SLE samples collected in Shanghai were patients attending Renji Hospital of Jiaotong University Medical School, a tertiary referral hospital covering Shanghai and the surrounding areas. There is an 8∶1 female to male patient ratio, and about 52% of patients have lupus nephritis. 951 SLE patients collected in Anhui were all self-reported Chinese ethnicity living in Anhui province, central China. They were recruited from the Departments of Rheumatology at Anhui Provincial Hospital and the First Affiliated Hospital of Anhui Medical University, both located in Hefei, Anhui province, about 450 km from Shanghai. The average onset age was 31 years old and the ratio of female to male patients was 17∶1. 314 Thai patients with SLE (female∶male ratio = 14∶1) attending King Chulalongkorn Memorial Hospital, a tertiary referral center in Bangkok were also recruited in this study. Medical records were reviewed to confirm that all subjects met the revised criteria of the American College of Rheumatology for SLE diagnoses [48]. Controls used in the GWAS stage were from both healthy individuals and from other studies conducted in the same institution genotyped with the same platform. For the replication stage, Hong Kong controls were healthy blood donors kindly contributed by the Hong Kong Red Cross and were all of self-reported Chinese ethnicity living in Hong Kong. Controls from Shanghai and Anhui were selected from a pool of healthy blood donors recruited from Renji Hospital (Shanghai) and Hefei City (Anhui), respectively, with an effort to match for the age and sex of corresponding SLE patients. Thai controls were recruited from unrelated voluntary healthy donors from the same ethnic background and geographic area as the Thai SLE patients.

Genome-wide association study

320 (27 males, 293 females) SLE patients were genotyped by Illumina 610-Quad Human Beadchip with a total number of SNPs reaching 620,901. 24 individuals were removed due to low call rate or hidden first degree relationship. A total number of 104,395 SNPs were also removed from the initial analysis on the ground of low genotyping call rate (<90%, 30,133 SNPs) and/or low minor allele frequency (MAF) (<0.005, 102,970 SNPs). 2,285 SNPs were also removed due to violation of Hardy-Weinberg equilibrium in controls (P values<0.0001). After quality control measures, 314 case (27 males, 287 females) and 1484 controls (840 males and 644 females) were analyzed on a total of 514,221 SNPs. The call rate for the remaining SNPs reached 0.999 with a genome-wide inflation factor of 1.03, which is an indication of good match between the cases and controls. To overcome any potential effect from the heterogeneity in the controls, three independent comparisons were initially conducted by separating the controls into three different subsets and only SNPs reaching significance in all subsets in association analysis were selected for further replication. The SNPs were analyzed for association with the disease by means of comparison of the minor allele frequency in patients and controls (basic allelic test) as well as other tests using Plink [49]. Linkage disequilibrium (LD) patterns were analyzed and displayed by HaploView [50]. Association of the SNPs with disease risk was also corrected by logistic regression using age and sex as covariates. Average odds ratios (OR) and P values jointly analyzed from four sample collections were obtained by Cochran-Mantel-Haenszel (CMH) test of disease association conditional on SNP frequency differences among different populations. Test of independent contributions of a SNP controlling for the effect of other SNPs in the same locus was done by conditional logistic regression as well as haplotype analyses. Subphenotype stratification was performed by comparing cases with and without a given subphenotype.

Genotyping in replication stage

SNPs rs1128334, rs10893872, rs7097397, rs10857650, and rs877819 were genotyped by TaqMan SNP genotyping method using assay-on-demand probes and primers (Applied Biosystems, Foster City, CA94404, USA). Some of the initial screening was also done using Sequenom MassARRAY iPLEX Gold system. Genotyping accuracy was confirmed by direct sequencing of PCR products for some randomly chosen samples. Genotyping concordance between Illumina Human 610-Quad Beadchip and TaqMan SNP genotyping method was also examined on selected samples and probes: rs877819 has a concordance rate of 99.64% (1 out of 277 differed by the two platforms); rs10893872 has a concordance rate of 99.16% (1 out of 119 samples differed). SNP rs1128334 and rs7097397 were examined by direct sequencing of selected samples and showed complete consistence between TaqMan and sequencing (50 samples each). The results of rs10857650 were discarded due to low concordance between the Illumina Beadchip data and the TaqMan results.

Allele-specific transcription quantification

Thirty-three healthy individuals heterozygous for rs1128334 were chosen to assess the relative ETS1 mRNA levels from the two alleles, “A” and “G”, by pyrosequencing [51],[52]. In the meantime, DNA detection ratio was used as a control for amplification efficiency. Briefly, total RNA was extracted from peripheral blood mononuclear cell (PBMC) from each individual. RNA samples were then treated with DNAase to eliminate genomic DNA contamination before being reverse-transcribed into cDNA using oligo-dT primer. cDNA was then amplified by PCR using transcript-specific primers, together with DNA from the same individuals. The cDNA and DNA PCR products were purified using the Qiaquick PCR purification kit, and then subjected to allele quantitative pyrosequencing. The sequencing primer was designed using Pyrosequencing Assay Design Software v.1.0. Reactions were performed on a Biotage PSQ96MA machine, and allele quantification was analyzed using PSQMA 2.1 software. The average G/A cDNA expression ratio of each individual was normalized by the G/A DNA ratio from the same sample. Paired student' t test was used to compare the normalized expression level from the “A” and “G” alleles.
  52 in total

1.  Single-nucleotide polymorphism analysis by pyrosequencing.

Authors:  A Ahmadian; B Gharizadeh; A C Gustafsson; F Sterky; P Nyrén; M Uhlén; J Lundeberg
Journal:  Anal Biochem       Date:  2000-04-10       Impact factor: 3.365

2.  Structural studies of Ets-1/Pax5 complex formation on DNA.

Authors:  C W Garvie; J Hagman; C Wolberger
Journal:  Mol Cell       Date:  2001-12       Impact factor: 17.970

Review 3.  Ets-1 flips for new partner Pax-5.

Authors:  Miles A Pufall; Barbara J Graves
Journal:  Structure       Date:  2002-01       Impact factor: 5.006

4.  Activation of the early B-cell-specific mb-1 (Ig-alpha) gene by Pax-5 is dependent on an unmethylated Ets binding site.

Authors:  Holly Maier; Jeff Colbert; Daniel Fitzsimmons; Dawn R Clark; James Hagman
Journal:  Mol Cell Biol       Date:  2003-03       Impact factor: 4.272

5.  Genomic dissection of population substructure of Han Chinese and its implication in association studies.

Authors:  Shuhua Xu; Xianyong Yin; Shilin Li; Wenfei Jin; Haiyi Lou; Ling Yang; Xiaohong Gong; Hongyan Wang; Yiping Shen; Xuedong Pan; Yungang He; Yajun Yang; Yi Wang; Wenqing Fu; Yu An; Jiucun Wang; Jingze Tan; Ji Qian; Xiaoli Chen; Xin Zhang; Yangfei Sun; Xuejun Zhang; Bailin Wu; Li Jin
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

Review 6.  Lupus in Hong Kong Chinese.

Authors:  C C Mok; C S Lau
Journal:  Lupus       Date:  2003       Impact factor: 2.911

Review 7.  Studies in familial systemic lupus erythematosus.

Authors:  F C Arnett; L E Shulman
Journal:  Medicine (Baltimore)       Date:  1976-07       Impact factor: 1.889

8.  Correlation between circulating CD27high plasma cells and disease activity in patients with systemic lupus erythematosus.

Authors:  Annett M Jacobi; Marcus Odendahl; Karin Reiter; Anne Bruns; Gerd R Burmester; Andreas Radbruch; Günter Valet; Peter E Lipsky; Thomas Dörner
Journal:  Arthritis Rheum       Date:  2003-05

Review 9.  The biology of the Ets1 proto-oncogene.

Authors:  Jürgen Dittmer
Journal:  Mol Cancer       Date:  2003-08-20       Impact factor: 27.401

Review 10.  B cell abnormalities in systemic lupus erythematosus.

Authors:  Amrie C Grammer; Peter E Lipsky
Journal:  Arthritis Res Ther       Date:  2003-12-02       Impact factor: 5.156

View more
  183 in total

Review 1.  The epigenetics of autoimmunity.

Authors:  Francesca Meda; Marco Folci; Andrea Baccarelli; Carlo Selmi
Journal:  Cell Mol Immunol       Date:  2011-01-31       Impact factor: 11.530

2.  Meta-analysis of genetic polymorphisms in granulomatosis with polyangiitis (Wegener's) reveals shared susceptibility loci with rheumatoid arthritis.

Authors:  Sharon A Chung; Gang Xie; Delnaz Roshandel; Richard Sherva; Jeffrey C Edberg; Megan Kravitz; Paul F Dellaripa; Gary S Hoffman; Alfred D Mahr; Philip Seo; Ulrich Specks; Robert F Spiera; E William St Clair; John H Stone; Robert M Plenge; Katherine A Siminovitch; Peter A Merkel; Paul A Monach
Journal:  Arthritis Rheum       Date:  2012-10

Review 3.  Genetic susceptibility to lupus: the biological basis of genetic risk found in B cell signaling pathways.

Authors:  Samuel E Vaughn; Leah C Kottyan; Melissa E Munroe; John B Harley
Journal:  J Leukoc Biol       Date:  2012-06-29       Impact factor: 4.962

4.  Identification of a New Susceptibility Locus for Systemic Lupus Erythematosus on Chromosome 12 in Individuals of European Ancestry.

Authors:  F Yesim Demirci; Xingbin Wang; Jennifer A Kelly; David L Morris; M Michael Barmada; Eleanor Feingold; Amy H Kao; Kathy L Sivils; Sasha Bernatsky; Christian Pineau; Ann E Clarke; Rosalind Ramsey-Goldman; Timothy J Vyse; Patrick M Gaffney; Susan Manzi; M Ilyas Kamboh
Journal:  Arthritis Rheumatol       Date:  2016-01       Impact factor: 10.995

5.  Negative Correlation Between miR-326 and Ets-1 in Regulatory T Cells from new-Onset SLE Patients.

Authors:  Xiao-Ge Sun; Jin-Hui Tao; Nan Xiang; Xiao-Mei Li; Guo-Sheng Wang; Xuan Fang; Chao Dai; Min Zhang; Zhu Chen; Xiang-Pei Li
Journal:  Inflammation       Date:  2016-04       Impact factor: 4.092

6.  Genome-Wide Association Meta-Analysis Reveals Novel Juvenile Idiopathic Arthritis Susceptibility Loci.

Authors:  Laura A McIntosh; Miranda C Marion; Marc Sudman; Mary E Comeau; Mara L Becker; John F Bohnsack; Tasha E Fingerlin; Thomas A Griffin; J Peter Haas; Daniel J Lovell; Lisa A Maier; Peter A Nigrovic; Sampath Prahalad; Marilynn Punaro; Carlos D Rosé; Carol A Wallace; Carol A Wise; Halima Moncrieffe; Timothy D Howard; Carl D Langefeld; Susan D Thompson
Journal:  Arthritis Rheumatol       Date:  2017-10-12       Impact factor: 10.995

7.  C-reactive protein (CRP) polymorphisms and haplotypes are associated with SLE susceptibility and activity but not with serum CRP levels in Mexican population.

Authors:  Yemil Atisha-Fregoso; Guadalupe Lima; Eduardo Carrillo-Maravilla; Rosalinda Posadas-Sánchez; Nonanzit Pérez-Hernández; Miguel Baños-Peláez; Alejandra Iturralde-Chávez; Nora Hernández-Díaz; Juan Jakez-Ocampo; José Manuel Rodríguez-Pérez; Gilberto Vargas-Alarcón; Luis Llorente; Juanita Romero-Díaz
Journal:  Clin Rheumatol       Date:  2018-03-20       Impact factor: 2.980

8.  A genome-wide association study in Han Chinese identifies a susceptibility locus for primary Sjögren's syndrome at 7q11.23.

Authors:  Yongzhe Li; Kunlin Zhang; Hua Chen; Fei Sun; Juanjuan Xu; Ziyan Wu; Ping Li; Liuyan Zhang; Yang Du; Haixia Luan; Xi Li; Lijun Wu; Hongbin Li; Huaxiang Wu; Xiangpei Li; Xiaomei Li; Xiao Zhang; Lu Gong; Lie Dai; Lingyun Sun; Xiaoxia Zuo; Jianhua Xu; Huiping Gong; Zhijun Li; Shengquan Tong; Min Wu; Xiaofeng Li; Weiguo Xiao; Guochun Wang; Ping Zhu; Min Shen; Shengyun Liu; Dongbao Zhao; Wei Liu; Yi Wang; Cibo Huang; Quan Jiang; Guijian Liu; Bin Liu; Shaoxian Hu; Wen Zhang; Zhuoli Zhang; Xin You; Mengtao Li; Weixin Hao; Cheng Zhao; Xiaomei Leng; Liqi Bi; Yongfu Wang; Fengxiao Zhang; Qun Shi; Wencheng Qi; Xuewu Zhang; Yuan Jia; Jinmei Su; Qin Li; Yong Hou; Qingjun Wu; Dong Xu; Wenjie Zheng; Miaojia Zhang; Qian Wang; Yunyun Fei; Xuan Zhang; Jing Li; Ying Jiang; Xinping Tian; Lidan Zhao; Li Wang; Bin Zhou; Yang Li; Yan Zhao; Xiaofeng Zeng; Jurg Ott; Jing Wang; Fengchun Zhang
Journal:  Nat Genet       Date:  2013-10-06       Impact factor: 38.330

Review 9.  Recent insights into the genetic basis of systemic lupus erythematosus.

Authors:  Ornella Josephine Rullo; Betty P Tsao
Journal:  Ann Rheum Dis       Date:  2012-12-19       Impact factor: 19.103

10.  Genetic variants in STAT4 and HLA-DQ genes confer risk of hepatitis B virus-related hepatocellular carcinoma.

Authors:  De-Ke Jiang; Jielin Sun; Guangwen Cao; Yao Liu; Dongxin Lin; Yu-Zhen Gao; Wei-Hua Ren; Xi-Dai Long; Hongxing Zhang; Xiao-Pin Ma; Zhong Wang; Wei Jiang; Tao-Yang Chen; Yong Gao; Liang-Dan Sun; Ji-Rong Long; Hui-Xing Huang; Dan Wang; Hongjie Yu; Pengyin Zhang; Li-Sha Tang; Bo Peng; Hao Cai; Ting-Ting Liu; Ping Zhou; Fang Liu; Xiaoling Lin; Sha Tao; Bo Wan; He-Xi Ge Sai-Yin; Lun-Xiu Qin; Jianhua Yin; Li Liu; Chen Wu; Yan Pei; Yuan-Feng Zhou; Yun Zhai; Pei-Xin Lu; Aihua Tan; Xian-Bo Zuo; Jia Fan; Jiang Chang; Xiaoli Gu; Neng-Jin Wang; Yang Li; Yin-Kun Liu; Kan Zhai; Hongwei Zhang; Zhibin Hu; Jun Liu; Qing Yi; Yongbing Xiang; Rong Shi; Qiang Ding; Wei Zheng; Xiao-Ou Shu; Zengnan Mo; Yin Yao Shugart; Xue-Jun Zhang; Gangqiao Zhou; Hongbing Shen; S Lilly Zheng; Jianfeng Xu; Long Yu
Journal:  Nat Genet       Date:  2012-12-16       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.