Literature DB >> 18466513

Evaluating gene x gene and gene x smoking interaction in rheumatoid arthritis using candidate genes in GAW15.

Ling Mei1, Xiaohui Li, Kai Yang, Jinrui Cui, Belle Fang, Xiuqing Guo, Jerome I Rotter.   

Abstract

We examined the potential gene x gene interactions and gene x smoking interactions in rheumatoid arthritis (RA) using the candidate gene data sets provided by Genetic Analysis Workshop 15 Problem 2. The multifactor dimensionality reduction (MDR) method was used to test gene x gene interactions among candidate genes. The case-only sample was used to test gene x smoking interactions. The best predictive model was the single-locus model with single-nucleotide polymorphism (SNP) rs2476601 in gene PTPN22. However, no clear gene x gene interaction was identified. Substantial departure from multiplicativity was observed between smoking and SNPs in genes CTLA4, PADI4, MIF, and SNPs on chromosome 5 and one haplotype of PTPN22. The strongest evidence of association was identified between the PTPN22 gene and RA status, which was consistently detected in single SNP association, gene x gene interaction and gene x smoking interaction analyses.

Entities:  

Year:  2007        PMID: 18466513      PMCID: PMC2367546          DOI: 10.1186/1753-6561-1-s1-s17

Source DB:  PubMed          Journal:  BMC Proc        ISSN: 1753-6561


Background

Rheumatoid arthritis (RA) is a complex autoimmune disease. The etiology of the disease is not clearly understood. Risk factors of RA include genetic factors, race (Native American), female gender, obesity, old age, and smoking [1,2]. However, like most complex diseases, few studies of gene × gene interaction and gene × environmental interaction have been performed because a large sample size is required to identify such effects in traditional statistical paradigms. Logistic regression is commonly used in detecting interactive effects between genes or environmental factors in epidemiologic studies. However, the parameters cannot be accurately estimated when there are many independent variables while the sample size is not large enough [3]. Recently, Ritchie et al. [4] introduced a multifactor dimensionality reduction (MDR) method for identifying gene × gene interaction or gene × environmental interaction to overcome this limitation of traditional logistic regression [3-5]. This approach enumerates all possible combinations of genotype or environmental factors associated with high risk and low risk of disease, and it may enable us to find interactions between genes in the absence of main effects [3-5]. To detect potential epistasis in RA, we evaluated 1) disease associations using single SNPs (single-nucleotide polymorphisms) from 15 candidate genes and haplotypes of the PTPN22 gene, 2) gene × gene interactions among the candidate genes using the MDR method and logistic regression, and 3) gene × environmental (smoking) interactions using a case-only study design.

Methods

Materials

The data sets for the candidate gene studies of RA were provided by Genetic Analysis Workshop 15 (GAW15) Problem 2. There were two case-control data sets. The first one included 855 unrelated controls and 839 cases, as well as genotype data on 20 SNPs from 15 candidate genes, which were selected from previously published associations with RA or other autoimmune disorders by Plenge et al. [6]. The second data set included 1519 unrelated controls and 1393 cases, and genotype data on 14 SNPs from the PTPN22 gene. Additional phenotype data, including smoking history, age of onset, sex, and body mass index, were available for cases only in both data sets. There were 408 and 720 affected sibling pairs among cases in the two data sets, respectively.

Statistical analysis

Single SNP and haplotype (PTPN22 only) associations with disease status were first evaluated. To account for the dependency among family members, the generalized estimating equations methods (GEE1) [7] as implemented in the GENMOD procedure of SAS 9.0 was utilized in the association analysis by using family as the cluster factor, i.e., members from the same family were assumed to be correlated and those from different families were assumed to be independent. The haplotype block structure of PTPN22 was evaluated by Haploview [8]. Individual haplotypes were reconstructed using the PHASE 2.0 by assigning each haplotype with maximum probability [9]. Seventy-four percent of haplotype assignments had probabilities of 100% and 93% had probabilities of 80% or better. Individuals whose haplotype assignment had probability below 80% were excluded from subsequent analysis. Association analysis was carried out for each common haplotype in turn. For each haplotype, a dominant model was assumed, i.e., carriers of the particular haplotype versus non-carriers were compared for their RA status. To test gene × gene interactions, MDR was used to determine the genetic model that could most successfully predict the disease status or phenotype from several loci. SNP rs2240340 on the PADI4 gene was excluded from analysis due to its large amount of missing data. One thousand three hundred and thirty case-control samples with completed marker data on 19 SNPs from 14 candidate genes were utilized in the MDR analysis. Cross-validation (CV) consistency and balanced accuracy estimates were calculated for each combination of a pool of genetic polymorphisms. The model with the highest accuracy and maximal CV was considered to be the best [5]. We determined statistical significance by comparing the accuracy of the observed data with the distribution of accuracy under the null hypothesis of no associations derived empirically from 1000 replicates of permutations [10]. The null hypothesis was rejected when the p-value derived from the permutation test was 0.05 or less. As a follow-up, logistic regression analysis was conducted if there was suggestive interaction. We also examined the interaction between SNPs and smoking history in RA cases. The logistic function in the GENMOD procedure was used to quantify departure from multiplicativity. Odds ratios and 95% CIs were estimated. To adjust for multiple tests, empirical p-values were obtained from 1000 permutations. For the PTPN22 gene, interaction effects between PTPN22 haplotypes and smoking among cases were evaluated for RA status.

Results

1. Single SNP and PTPN22 haplotype association

Table 1 lists the association analysis results between disease and individual markers. One SNP from each of the five genes, HAVCRI, CTLA4, SUMO4, MAP3K7IP2, and PTPN22, were found significantly associated with RA.
Table 1

Association between SNPs and RA

Candidate geneSNPp-value (11 vs. 12 vs. 22)p-value (11/12 vs. 22)
HAVCRI5509_5511delCAA0.0660.034a
HAVCRIrs61493070.1890.068
CTLA4CT600.0160.005
CARD15HugotSNP12ms3--b0.754
CARD15HugotSNP8ms20.8380.553
CARD15Hugot_SNP13ms2--0.695
Chr 5IGR2096ms10.4730.243
Chr 5IGR3084ms10.8190.713
Chr 5IGR3138ms10.8610.732
IL3rs314800.6180.384
SUMO4rs2370250.0003<0.0001
MAP3K7IP2rs5770010.0020.001
MIFrs7556220.8420.979
TNFRFF1brs10616220.7040.684
DLG5rs12486960.2690.129
SLC22A4rs20738380.9040.771
PADI4rs22403400.5740.330
IL4rs22432500.3110.147
RUNX1rs22682770.550.583
PTPN22rs2476601<0.0001<0.0001

Allele "1" is the putative susceptibility allele, and allele "2" is the non-susceptibility allele

aBold text indicates p < 0.05.

b--, Results are not available because there is no homozygote "11" in some groups.

Association between SNPs and RA Allele "1" is the putative susceptibility allele, and allele "2" is the non-susceptibility allele aBold text indicates p < 0.05. b--, Results are not available because there is no homozygote "11" in some groups. Five common haplotypes of the PTPN22 with frequency >10% were constructed. Of the two haplotypes with significant associations with RA, one was a risk haplotype (11222221122221; 1: minor allele, 2: major allele; frequency: 11.6%), with a higher carrier frequency in cases than in controls (30.0% vs. 14.9%, p < 0.0001); whereas the other was protective (22122222222222; frequency: 10.9%), with a lower carrier frequency in cases than in controls (16.4% vs. 24.7%, p < 0.0001).

2. Gene × gene interaction

Table 2 lists the results from MDR. The one-locus model with SNP rs2476601 on gene PTPN22 had a maximum test accuracy (p = 0.004) and a maximum CV consistency of 10 out of 10, indicating that this was the best model. The second-best model was a two-locus model consisting of rs1248696 on the DLG5 gene and rs2476601 on PTPN22 (p = 0.013). The combination of rs1248696_22 and rs2476601_22 was associated with being in the low-risk group when compared to others (OR = 0.46, 95%CI: 0.36, 0.60). However, we could not confirm the interactive effect between these two markers in the follow-up logistic regression analysis under the GEE model. No better models were identified for three and/or more locus models.
Table 2

Multilocus interaction model for RA selected from MDR

ModelBalanced accuracyCV consistencyp-value
rs2476601 (PTPN22)0.574710/100.004a
rs2476601 (PTPN22) rs1248696 (DLG5)0.57058/100.013
rs2476601 (PTPN22) rs6149307 (HAVCRI) rs2243250 (IL4)0.55347/100.09
rs2476601 (PTPN22) IGR2096ms1 (chr 5) rs237025 (SUMO4) rs2268277 (RUNX1)0.52436/100.475

aBold text indicates p < 0.05.

Multilocus interaction model for RA selected from MDR aBold text indicates p < 0.05.

3. Gene × smoking interaction

Two categories of environmental exposure, ever smoked and current smoking, were used to test for gene × environmental interactions. No significant departure from multiplicativity was observed between current smoking and markers. Interactive effects with ever smoking were found in the primary analysis for five SNPs, including CT60 on the CTLA4 gene, rs2240340 on PADI4, IGR3084ms1 and IGR3138ms1 on chromosome 5, and rs755622 on the MIF gene. The empirical p-values derived from the 1000 permutations were similar to the nominal ones (Table 3).
Table 3

Gene × smoking interactions

Ever smoked

MarkerHaplotypeNoYesORint95%CIp-ValueEmpirical p
CT60 (CTLA4)116758
12/222893931.691.11, 2.550.0250.023
rs2240340 (PADI4)114029
12/221221622.761.24, 6.160.0260.019
IGR3084ms1 (chr 5)112347
12/223273930.590.36, 0.970.0390.042
IGR3138ms1 (chr 5)113669
12/223143740.620.40, 0.960.0330.04
rs755622 (MIF)22234329
11/121201280.730.53, 0.990.0460.052
Gene × smoking interactions One of the common haplotype of PTPN22 (22222222211221, frequency: 18%) was found to interact with ever smoking at borderline significant level (OR = 0.78, 95%CI: 0.60–1.01, p = 0.06); however, the risk and the protective haplotypes that were identified previously in the case-control sample did not show any departure from multiplicativity with smoking in the case-only study.

Discussion

We explored gene × gene and gene × smoking interactions using the candidate gene data set provided by GAW15. The best predictive model for RA status is the single-locus model containing rs2476601 on gene PTPN22. SNP rs2476601 is a well known functional SNP that is associated with increased risk of RA. The best combination model selected by MDR consisted of rs2476601 on PTPN22 and rs1248696 on DLG5. However, the susceptibility interaction was not confirmed in the following logistic regression analysis. The possible reason for the inconsistent results is that in MDR, we actually did not test statistical interaction which was defined as 'deviation from multiplicativity' as in logistic regression. The significant results from MDR only implies that the combination of the markers contributes to an increased or decreased risk of disease and the effect between the markers could be either multiplicative or deviation from multiplicative. The case-only study has its particular advantage in testing gene × environmental interaction and it requires smaller sample size [11]. It allows us to test interactive effects in the absence of the information from controls under the assumption that the two risk factors are independently distributed in the population at risk [10]. In GAW15 Problem 2, we used this design to identify a gene × smoking interaction in RA because no smoking information was available from controls. We assumed genetic polymorphism and smoking exposure are independent of one another in controls. Substantial departure from multiplicativity was observed between ever smoking and markers from CTLA4, PADI4, MIF, and chromosome 5. Among these markers, only SNP CT60 from gene CTLA4 showed a main effect with RA in the single SNP analysis. One possible explanation for this phenomenon is that the existence of gene × smoking interactions could mask the true genetic effect if we only test the marginal association, especially when the gene status modifies the smoking effect in the opposite directions in the total sample. Another possible explanation is the difference in the tested samples: only cases were used in the gene × smoking interaction studies, while the single SNP association was evaluated in the case-control sample. PTPN22 has been reported to be associated with RA [6,12]. In this study, we tested single gene association, gene × gene interactions and gene × smoking interactions using three different methods. In single SNP analysis, PTPN22 showed the strongest association with RA status (p < 0.0001). In the following gene × gene interaction analyses by MDR, both the best single and the best combined models included PTPN22 gene. Furthermore, haplotype analysis using the second data set identified two haplotypes of the PTPN22 associated with RA and more importantly, there was a trend toward interaction between this gene and smoking. Therefore, the consistent findings here provide further evidence of the genetic involvement of PTPN22 in the etiology of RA.

Conclusion

In conclusion, our analyses confirmed the role of genetic and environmental factors in rheumatoid arthritis. Strong evidence of association was identified for the PTPN22 gene, which was observed in all three analyses. Other genes (HAVCRI, CTLA4, SUMO4, MAP3K7IP2, PAID4, chromosome 5 locus, MIF) may also contribute to the development of rheumatoid arthritis directly or within the context of smoking.

Competing interests

The author(s) declare that they have no competing interests.
  11 in total

1.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

Review 2.  Reporting of model validation procedures in human studies of genetic interactions.

Authors:  Christopher S Coffey; Patricia R Hebert; Harlan M Krumholz; Thomas M Morgan; Scott M Williams; Jason H Moore
Journal:  Nutrition       Date:  2004-01       Impact factor: 4.008

Review 3.  New strategies for identifying gene-gene interactions in hypertension.

Authors:  Jason H Moore; Scott M Williams
Journal:  Ann Med       Date:  2002       Impact factor: 4.709

4.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

Review 5.  Computational analysis of gene-gene interactions using multifactor dimensionality reduction.

Authors:  Jason H Moore
Journal:  Expert Rev Mol Diagn       Date:  2004-11       Impact factor: 5.225

6.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

7.  Longitudinal data analysis for discrete and continuous outcomes.

Authors:  S L Zeger; K Y Liang
Journal:  Biometrics       Date:  1986-03       Impact factor: 2.571

8.  PTPN22 genetic variation: evidence for multiple variants associated with rheumatoid arthritis.

Authors:  Victoria E H Carlton; Xiaolan Hu; Anand P Chokkalingam; Steven J Schrodi; Rhonda Brandon; Heather C Alexander; Monica Chang; Joseph J Catanese; Diane U Leong; Kristin G Ardlie; Daniel L Kastner; Michael F Seldin; Lindsey A Criswell; Peter K Gregersen; Ellen Beasley; Glenys Thomson; Christopher I Amos; Ann B Begovich
Journal:  Am J Hum Genet       Date:  2005-08-10       Impact factor: 11.025

9.  Replication of putative candidate-gene associations with rheumatoid arthritis in >4,000 samples from North America and Sweden: association of susceptibility with PTPN22, CTLA4, and PADI4.

Authors:  Robert M Plenge; Leonid Padyukov; Elaine F Remmers; Shaun Purcell; Annette T Lee; Elizabeth W Karlson; Frederick Wolfe; Daniel L Kastner; Lars Alfredsson; David Altshuler; Peter K Gregersen; Lars Klareskog; John D Rioux
Journal:  Am J Hum Genet       Date:  2005-11-01       Impact factor: 11.025

10.  Smoking, obesity, alcohol consumption, and the risk of rheumatoid arthritis.

Authors:  L F Voigt; T D Koepsell; J L Nelson; C E Dugowson; J R Daling
Journal:  Epidemiology       Date:  1994-09       Impact factor: 4.822

View more
  10 in total

1.  Permutation and parametric bootstrap tests for gene-gene and gene-environment interactions.

Authors:  Petra Bůžková; Thomas Lumley; Kenneth Rice
Journal:  Ann Hum Genet       Date:  2011-01       Impact factor: 1.670

Review 2.  Ethnogenetic heterogeneity of rheumatoid arthritis-implications for pathogenesis.

Authors:  Yuta Kochi; Akari Suzuki; Ryo Yamada; Kazuhiko Yamamoto
Journal:  Nat Rev Rheumatol       Date:  2010-03-16       Impact factor: 20.543

3.  Peptidyl arginine deiminase type IV (PADI4) haplotypes interact with shared epitope regardless of anti-cyclic citrullinated peptide antibody or erosive joint status in rheumatoid arthritis: a case control study.

Authors:  So-Young Bang; Tae-Un Han; Chan-Bum Choi; Yoon-Kyoung Sung; Sang-Cheol Bae; Changwon Kang
Journal:  Arthritis Res Ther       Date:  2010-06-10       Impact factor: 5.156

4.  Supervised machine learning and logistic regression identifies novel epistatic risk factors with PTPN22 for rheumatoid arthritis.

Authors:  F B S Briggs; P P Ramsay; E Madden; J M Norris; V M Holers; T R Mikuls; T Sokka; M F Seldin; P K Gregersen; L A Criswell; L F Barcellos
Journal:  Genes Immun       Date:  2010-01-21       Impact factor: 2.676

Review 5.  Genetic studies of rheumatoid arthritis.

Authors:  Kazuhiko Yamamoto; Yukinori Okada; Akari Suzuki; Yuta Kochi
Journal:  Proc Jpn Acad Ser B Phys Biol Sci       Date:  2015       Impact factor: 3.493

6.  Rheumatoid arthritis-associated gene-gene interaction network for rheumatoid arthritis candidate genes.

Authors:  Chien-Hsun Huang; Lei Cong; Jun Xie; Bo Qiao; Shaw-Hwa Lo; Tian Zheng
Journal:  BMC Proc       Date:  2009-12-15

7.  Detecting single-nucleotide polymorphism by single-nucleotide polymorphism interactions in rheumatoid arthritis using a two-step approach with machine learning and a Bayesian threshold least absolute shrinkage and selection operator (LASSO) model.

Authors:  Oscar González-Recio; Evangelina López de Maturana; Andrés T Vega; Corinne D Engelman; Karl W Broman
Journal:  BMC Proc       Date:  2009-12-15

8.  Allelic based gene-gene interactions in rheumatoid arthritis.

Authors:  Jeesun Jung; Joon Jin Song; Deukwoo Kwon
Journal:  BMC Proc       Date:  2009-12-15

9.  Evaluating epistatic interaction signals in complex traits using quantitative traits.

Authors:  Odity Mukherjee; Krishna Rao Sanapala; Padmanabhan Anbazhagana; Saurabh Ghosh
Journal:  BMC Proc       Date:  2009-12-15

10.  Contributions of renin-angiotensin system-related gene interactions to obesity in a Chinese population.

Authors:  Jian-Bo Zhou; Chang Liu; Wen-Yan Niu; Zhong Xin; Mei Yu; Jian-Ping Feng; Jin-Kui Yang
Journal:  PLoS One       Date:  2012-08-06       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.