Literature DB >> 20018027

Conditional analysis of the major histocompatibility complex in rheumatoid arthritis.

Kimberly E Taylor1, Lindsey A Criswell.   

Abstract

We performed a whole-genome association study of rheumatoid arthritis susceptibility using Illumina 550k single-nucleotide polymorphism (SNP) genotypes of 868 cases and 1194 controls from the North American Rheumatoid Arthritis Consortium (NARAC). Structured association analysis with adjustment for potential population stratification yielded 200 SNPs with p < 1 x 10-8 for association with RA, all of which were on chromosome 6 in a 2.7-Mb region of the major histocompatibility complex (MHC). Given the extensive linkage equilibrium in the region and known risk of HLA-DRB1 alleles, we then applied conditional analyses to ascertain independent signals for RA susceptibility among these 200 candidate SNPs. Conditional analyses incorporating risk categories of the HLA-DRB1 "shared epitope" revealed three SNPs having independent associations with RA (conditional p < 0.001). This supports the presence of significant effects on RA susceptibility in the MHC in addition to the shared epitope.

Entities:  

Year:  2009        PMID: 20018027      PMCID: PMC2795934          DOI: 10.1186/1753-6561-3-s7-s36

Source DB:  PubMed          Journal:  BMC Proc        ISSN: 1753-6561


Background

Rheumatoid arthritis (RA) is a chronic systemic autoimmune disease characterized by damage to synovial joints as well as extraarticular manifestations. The strongest known genetic risk factor is the HLA-DRB1 gene on chromosome 6, namely a set of alleles sharing a common sequence known as the shared epitope (SE) [1]. Recent whole-genome association studies have revealed new risk genes outside of the HLA region [2-4], and some studies have also provided evidence of additional influences from the HLA class III and class I regions [5,6]. In this analysis we sought first to identify and/or validate RA risk alleles throughout the genome, and then to identify independent associations with RA susceptibility in the major histocompatibility complex (MHC) in addition to the SE.

Methods

Illumina 550k genotyping data from a whole-genome association study by North American Rheumatoid Arthritis Consortium (NARAC) [3] was used for this study as part of the Genetic Analysis Workshop 16; duplicated and contaminated samples had been removed previously. Using the computer program PLINK [7], subjects were filtered who had less than 90% genotyping, and single-nucleotide polymorphisms (SNPs) were filtered that had less than 90% genotyping, Hardy-Weinberg equilibrium in controls p < 0.0001, or minor allele frequency (MAF) < 0.05. Using the computer program EIGENSTRAT [8], population outliers were removed who were >6 standard deviations from any of the first five principal components (PCs) identified in PC analysis. First we analyzed the whole-genome data using structured association analyses of EIGENSTRAT; although the NARAC cases and controls are Caucasian, differences in intra-European ancestry [9] can produce false-positive associations. SNPs used for the PCA were filtered to remove regions of extended local associations (chr 8: 8-12 Mb, chr 6: 24-36 Mb, chr 11: 42-58 Mb, chr 5: 44-51.5 Mb, and chr 17: 40-43 Mb) and pruned to have r2 < 0.2 within a sliding window of 1 kb with a step size of 100, similar to methods of Fellay et al. [10] and Hom et al. [11]. We included correction for the first six PCs (see Results). We used a conservative genome-wide significance threshold of p = 1 × 10-8. Because all SNPs exceeding this threshold (i.e., lower p-value) were in the MHC in a region of extended linkage disequilibrium (LD), we proceeded with conditional analyses to attempt to establish signals that were independent of the shared epitope and each other. We modeled the SE as a multi-allelic marker with values corresponding to negative, low-risk, or high-risk. SE alleles were considered high risk if they were one of DRB1*0401, 0404, 0405, 0408, or 0409. Table 1 shows the case-control ratios for each risk category.
Table 1

HLA-DRB1 risk levels. Definitions and case-control ratios for shared-epitope (SE) categories.

DRB1 risk levelDefinitionCase-control ratio
3 = Highest-risk SEDRB1*0401, 0404, 0405, 0408, or 040972%-28%
2 = Lower-risk SEOther SE alleles52%-48%
1 = No SENot an SE allele23%-77%
HLA-DRB1 risk levels. Definitions and case-control ratios for shared-epitope (SE) categories. Our conditional analyses, using the computer program Whap [12], proceeded as follows. Starting with the SE as the top marker, we tested each two-SNP marker (SE plus each other SNP) for independence of the other SNP; in particular this uses a likelihood-ratio test to determine the significance of the difference between the two-SNP "alternate" model versus the one-SNP (SE in this case) "null" model. As long as the most significant SNP was <0.001, we added this best SNP to the list of independent SNPs, and proceeded to test all three-marker combinations compared to our best two-SNP model; and so on with larger haplotypes. In the final list, we also tested each locus for a significant addition to the model containing all other top SNPs.

Results

After quality control filtering (above), 486,078 SNPs remained for the whole-genome analyses. All subjects had genotyping >90%, and eight controls were removed as outliers detected by EIGENSTRAT, leaving 868 cases and 1186 controls for the final analyses. In structured association analysis with EIGENSTRAT, we corrected for the first six PCs because the scree graph of eigenvalues levelled off at the sixth component. PCs one, two, four, and five were all highly significant in association tests with cases and controls (all p ≤ 10-8). In this whole-genome analysis we identified exactly 200 SNPs with p < 10-8, all between 30.38 Mb and 33.08 Mb in the MHC region. Table 2 shows the significance of associations in this dataset for known RA risk alleles outside of the MHC [2,3,13-17]; for SNPs not in the Illumina 550k panel, there were perfect proxy SNPs (r2 = 1) in the HapMap CEU population [18]. Although not reaching our 10-8 threshold, we observed p-values from 10-5 to 5 × 10-6 for PTPN22 and TRAF1-C5 SNPs, and p = 0.03 for STAT4. This dataset was underpowered to detect any of these risk alleles at a genome-wide level for their published odds ratios (ORs); for example, we had approximately 70% power to detect the highest OR of 1.75 (PTPN22, MAF = 11%) at p = 10-8, and only 50% power to detect the lowest OR of 1.15 (CD40, MAF = 25%) at p = 0.05.
Table 2

Significance of associations with published RA risk alleles

SNP (GENE)ReferenceProxy (r2 = 1)EIGENSTRAT adjusted p-value
rs2476601 (PTPN22)[16]N/A5.3 × 10-6
rs7574865 (STAT4)[2]N/A0.034
rs3761847 (TRAF1-C5)[3]N/A1.1 × 10-5
rs1953126 (TRAF1-C5)[17]N/A8.0 × 10-6
rs10499194 (TNFAIP3)[18]rs131928410.25
rs6920220 (TNFAIP3)[19]rs69334040.17
rs4810485 (CD40)[20]rs15697230.12
Significance of associations with published RA risk alleles Results of our conditional analyses are shown in Table 3. Loci are shown in the order added by the algorithm (see Methods), i.e., rs261946 has the lowest p-value conditional on the SE, rs2074488 has the lowest p-value conditional on both SE and rs261946, and so on. Out of the 200 SNPs, three independent signals were evident in addition to the SE risk levels. One signal is located in the classical HLA class II region between genes BTNL2 and HLA-DRA, and two signals are in the classical HLA class I region (see Figure 1) near TRIM39 and HLA-C.
Table 3

Independent SNPs in conditional analysis of RA susceptibility

Locus (sequence #)aUnadjusted single-markerp-value fromhaploviewbSingle-marker EIGENSTRATp-value (rank)p-value conditional on loci abovecp-value conditional on other 3 locicLocation (kb)Closest gene(s)
Shared epitope (#129)1.9 × 10-188N/AN/A8.6 × 10-7432,655-32,666In DRB1
rs261946 (#1)7.2 × 10-171.9 × 10-9 (185)0.000190.002730,379TRIM39
rs2074488 (#5)2.3 × 10-245.0 × 10-12 (151)6.6 × 10-50.001131,348HLA-C
rs2395175 (#119)1.4 × 10-1171.9 × 10-80 (2)0.000310.0003132,513Between BTNL2 and HLA-DRA

a# indicates marker position in Figure 1. Loci are in order added by conditional algorithm (see Methods).

bFor shared epitope, multi-allelic SNP test in Whap using No-Low-High risk.

cWhap log ratio test using all loci as alternate model versus null model without this locus.

Figure 1

Region with LD of SNPs studied in conditional analyses. Haploview diagram displays increasing red with higher D'. Three independently significant RA SNPs and HLA-DRB1 locations are indicated.

Region with LD of SNPs studied in conditional analyses. Haploview diagram displays increasing red with higher D'. Three independently significant RA SNPs and HLA-DRB1 locations are indicated. Independent SNPs in conditional analysis of RA susceptibility a# indicates marker position in Figure 1. Loci are in order added by conditional algorithm (see Methods). bFor shared epitope, multi-allelic SNP test in Whap using No-Low-High risk. cWhap log ratio test using all loci as alternate model versus null model without this locus. Table 4 shows the case-control frequencies and ORs for the final haplotypes, with the most common haplotype (ACG-1) as the reference haplotype. The highest-risk haplotype of these SNPs and the SE level had OR = 27.2 (95% CI, 16.7-44.4) in comparison to OR = 7.3 (95% CI, 4.7-11.1) overall comparing the SE risk levels alone (data not shown).
Table 4

Frequencies and odds ratios for haplotypes of three SNPs and shared epitope proxy risk level

HaplotypeFrequency in casesFrequency in controlsOR (95% CI)
GAA-39.80%1.90%27.2 (16.7-44.4)
AAA-36.50%1.50%22.2 (12.6-39)
ACA-317.10%4.40%19.9 (13.4-29.7)
GCG-25.30%2.40%14.6 (8.6-24.8)
GCA-312.60%4.20%13.1 (9-19.1)
ACG-33.40%1.80%6.2 (3.7-10.4)
ACG-213.80%9.50%5.2 (3.8-7.1)
AAG-11.80%2.70%2.5 (1.4-4.5)
GAG-11.40%2.40%NS
GCG-16.00%13.00%NS
ACA-11.50%2.50%NS
ACG-120.80%53.80%(Reference group)
Frequencies and odds ratios for haplotypes of three SNPs and shared epitope proxy risk level

Conclusion

In our analysis of the Genetic Analysis Workshop 16 dataset, there was insufficient power to detect known associations with RA susceptibility at a genome-wide significance level outside of the MHC; the most significant association was p = 5.3 × 10-6 for PTPN22. Clearly, the MHC is the most influential genetic region in RA susceptibility, but extensive LD makes isolating the precise loci difficult. We have used conditional analyses as a tool to investigate the presence of multiple RA risk factors in the MHC region in addition to the SE. Out of 200 candidate SNPs having unconditional p-values < 10-8, we have identified an additional HLA class II marker and two HLA class I markers which have significant associations with RA susceptibility that are not fully explained by LD with HLA-DRB1. A better understanding of these genetic influences can be helpful in elucidating the complex genetic components of RA. Previous studies of MHC effects on RA susceptibility beyond the SE have identified additional independent signals but have been largely inconsistent, due at least in part to the difficulty of narrowing down regions of association in the presence of extended LD [1,13]. Multiple studies have implicated the TNF-lymphotoxin locus in class III [1], which were not significant in our conditional analysis. Other studies also using NARAC cases have observed signals in class I [5,14], including HLA-C, our second SNP added in conditional analysis. Our first SNP is in the gene TRIM39, also in class I but not previously implicated. Our third SNP, in class II, is 150 kb upstream from HLA-DRB1 between the BTNL2 and HLA-DRA genes. BTNL2 has been associated with RA, systemic lupus erythematosus, and type 1 diabetes [15]; this is attributed to its association with predisposing HLA DQB1-DRB1 haplotypes, which may explain its presence in our data as well. It is important to note that the NARAC population is primarily Caucasian. Other populations could have quite different distributions of these haplotypes as well as other haplotypes and allele frequencies. A similar analysis in other ethnic groups could be very informative.

List of abbreviations used

LD: Linkage disequilibrium; MAF: Minor allele frequency; MHC: Major histocompatibility complex; NARAC: North American Rheumatoid Arthritis Consortium; OR: Odds ratio; PCA: Principal components analysis; RA: Rheumatoid arthritis; SE: Shared epitope; SNP: Single-nucleotide polymorphism

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

KET performed statistical analyses and drafted the manuscript. LAC recruited patients as part of the NARAC collaboration. LAC and KET designed the study, revised the manuscript, and read and approved the final manuscript.
  19 in total

1.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

2.  Two independent alleles at 6q23 associated with risk of rheumatoid arthritis.

Authors:  Robert M Plenge; Chris Cotsapas; Leela Davies; Alkes L Price; Paul I W de Bakker; Julian Maller; Itsik Pe'er; Noel P Burtt; Brendan Blumenstiel; Matt DeFelice; Melissa Parkin; Rachel Barry; Wendy Winslow; Claire Healy; Robert R Graham; Benjamin M Neale; Elena Izmailova; Ronenn Roubenoff; Alexander N Parker; Roberta Glass; Elizabeth W Karlson; Nancy Maher; David A Hafler; David M Lee; Michael F Seldin; Elaine F Remmers; Annette T Lee; Leonid Padyukov; Lars Alfredsson; Jonathan Coblyn; Michael E Weinblatt; Stacey B Gabriel; Shaun Purcell; Lars Klareskog; Peter K Gregersen; Nancy A Shadick; Mark J Daly; David Altshuler
Journal:  Nat Genet       Date:  2007-11-04       Impact factor: 38.330

3.  Analysis of a functional BTNL2 polymorphism in type 1 diabetes, rheumatoid arthritis, and systemic lupus erythematosus.

Authors:  Gisela Orozco; Peter Eerligh; Elena Sánchez; Sasha Zhernakova; Bart O Roep; Miguel A González-Gay; Miguel A López-Nevot; Jose L Callejas; Carmen Hidalgo; Dora Pascual-Salcedo; Alejandro Balsa; María F González-Escribano; Bobby P C Koeleman; Javier Martín
Journal:  Hum Immunol       Date:  2006-03-09       Impact factor: 2.850

4.  Association of systemic lupus erythematosus with C8orf13-BLK and ITGAM-ITGAX.

Authors:  Geoffrey Hom; Robert R Graham; Barmak Modrek; Kimberly E Taylor; Ward Ortmann; Sophie Garnier; Annette T Lee; Sharon A Chung; Ricardo C Ferreira; P V Krishna Pant; Dennis G Ballinger; Roman Kosoy; F Yesim Demirci; M Ilyas Kamboh; Amy H Kao; Chao Tian; Iva Gunnarsson; Anders A Bengtsson; Solbritt Rantapää-Dahlqvist; Michelle Petri; Susan Manzi; Michael F Seldin; Lars Rönnblom; Ann-Christine Syvänen; Lindsey A Criswell; Peter K Gregersen; Timothy W Behrens
Journal:  N Engl J Med       Date:  2008-01-20       Impact factor: 91.245

5.  STAT4 and the risk of rheumatoid arthritis and systemic lupus erythematosus.

Authors:  Elaine F Remmers; Robert M Plenge; Annette T Lee; Robert R Graham; Geoffrey Hom; Timothy W Behrens; Paul I W de Bakker; Julie M Le; Hye-Soon Lee; Franak Batliwalla; Wentian Li; Seth L Masters; Matthew G Booty; John P Carulli; Leonid Padyukov; Lars Alfredsson; Lars Klareskog; Wei V Chen; Christopher I Amos; Lindsey A Criswell; Michael F Seldin; Daniel L Kastner; Peter K Gregersen
Journal:  N Engl J Med       Date:  2007-09-06       Impact factor: 91.245

6.  A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis.

Authors:  Ann B Begovich; Victoria E H Carlton; Lee A Honigberg; Steven J Schrodi; Anand P Chokkalingam; Heather C Alexander; Kristin G Ardlie; Qiqing Huang; Ashley M Smith; Jill M Spoerke; Marion T Conn; Monica Chang; Sheng-Yung P Chang; Randall K Saiki; Joseph J Catanese; Diane U Leong; Veronica E Garcia; Linda B McAllister; Douglas A Jeffery; Annette T Lee; Franak Batliwalla; Elaine Remmers; Lindsey A Criswell; Michael F Seldin; Daniel L Kastner; Christopher I Amos; John J Sninsky; Peter K Gregersen
Journal:  Am J Hum Genet       Date:  2004-06-18       Impact factor: 11.025

7.  Genetic association of the major histocompatibility complex with rheumatoid arthritis implicates two non-DRB1 loci.

Authors:  Charlotte Vignal; Aruna T Bansal; David J Balding; Michael H Binks; Marion C Dickson; Doug S Montgomery; Anthony G Wilson
Journal:  Arthritis Rheum       Date:  2009-01

Review 8.  Recent advances in the genetics of autoimmune disease.

Authors:  Peter K Gregersen; Lina M Olsson
Journal:  Annu Rev Immunol       Date:  2009       Impact factor: 28.527

9.  Common variants at CD40 and other loci confer risk of rheumatoid arthritis.

Authors:  Soumya Raychaudhuri; Elaine F Remmers; Annette T Lee; Rachel Hackett; Candace Guiducci; Noël P Burtt; Lauren Gianniny; Benjamin D Korman; Leonid Padyukov; Fina A S Kurreeman; Monica Chang; Joseph J Catanese; Bo Ding; Sandra Wong; Annette H M van der Helm-van Mil; Benjamin M Neale; Jonathan Coblyn; Jing Cui; Paul P Tak; Gert Jan Wolbink; J Bart A Crusius; Irene E van der Horst-Bruinsma; Lindsey A Criswell; Christopher I Amos; Michael F Seldin; Daniel L Kastner; Kristin G Ardlie; Lars Alfredsson; Karen H Costenbader; David Altshuler; Tom W J Huizinga; Nancy A Shadick; Michael E Weinblatt; Niek de Vries; Jane Worthington; Mark Seielstad; Rene E M Toes; Elizabeth W Karlson; Ann B Begovich; Lars Klareskog; Peter K Gregersen; Mark J Daly; Robert M Plenge
Journal:  Nat Genet       Date:  2008-09-14       Impact factor: 38.330

10.  A candidate gene approach identifies the TRAF1/C5 region as a risk factor for rheumatoid arthritis.

Authors:  Fina A S Kurreeman; Leonid Padyukov; Rute B Marques; Steven J Schrodi; Maria Seddighzadeh; Gerrie Stoeken-Rijsbergen; Annette H M van der Helm-van Mil; Cornelia F Allaart; Willem Verduyn; Jeanine Houwing-Duistermaat; Lars Alfredsson; Ann B Begovich; Lars Klareskog; Tom W J Huizinga; Rene E M Toes
Journal:  PLoS Med       Date:  2007-09       Impact factor: 11.069

View more
  4 in total

1.  Construction and phenotypic analysis of mice carrying a duplication of the major histocompatibility class I (MHC-I) locus.

Authors:  Olga Ermakova; Ekaterina Salimova; Lukasz Piszczek; Cornelius Gross
Journal:  Mamm Genome       Date:  2012-07-07       Impact factor: 2.957

2.  CIITA is not associated with risk of developing rheumatoid arthritis.

Authors:  P G Bronson; P P Ramsay; M F Seldin; P K Gregersen; L A Criswell; L F Barcellos
Journal:  Genes Immun       Date:  2011-01-20       Impact factor: 2.676

3.  Haplotype-based analysis: a summary of GAW16 Group 4 analysis.

Authors:  Elizabeth Hauser; Nadine Cremer; Rebecca Hein; Harshal Deshmukh
Journal:  Genet Epidemiol       Date:  2009       Impact factor: 2.135

4.  RNA-seq analysis of synovial fibroblasts brings new insights into rheumatoid arthritis.

Authors:  Daniel P Heruth; Margaret Gibson; Dmitry N Grigoryev; Li Qin Zhang; Shui Qing Ye
Journal:  Cell Biosci       Date:  2012-12-21       Impact factor: 7.133

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.