| Literature DB >> 20018074 |
Xueying Liang1, Ying Gao, Tram K Lam, Qizhai Li, Cathy Falk, Xiaohong R Yang, Alisa M Goldstein, Lynn R Goldin.
Abstract
Although several genes (including a strong effect in the human leukocyte antigen (HLA) region) and some environmental factors have been implicated to cause susceptibility to rheumatoid arthritis (RA), the etiology of the disease is not completely understood. The ability to screen the entire genome for association to complex diseases has great potential for identifying gene effects. However, the efficiency of gene detection in this situation may be improved by methods specifically designed for high-dimensional data. The aim of this study was to compare how three different statistical approaches, multifactor dimensionality reduction (MDR), random forests (RF), and an omnibus approach, worked in identifying gene effects (including gene-gene interaction) associated with RA. We developed a test set of genes based on previous linkage and association findings and tested all three methods. In the presence of the HLA shared-epitope factor, other genes showed weaker effects. All three methods detected SNPs in PTPN22 and TRAF1-C5 as being important. But we did not detect any new genes in this study. We conclude that the three high-dimensional methods are useful as an initial screening for gene associations to identify promising genes for further modeling and additional replication studies.Entities:
Year: 2009 PMID: 20018074 PMCID: PMC2795981 DOI: 10.1186/1753-6561-3-s7-s79
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Selected genes and regions
| Source | Gene/region | Chromosome | Location (Mb) | No. selected SNPs | |
|---|---|---|---|---|---|
| Association study | 1 | 114.0-114.2 | 21 | [ | |
| 9 | 120.7-121.0 | 38 | [ | ||
| 20 | 44.1-44.2 | 30 | [ | ||
| chr 1 | 1 | 114.0-114.1 | 11 | [ | |
| chr 7 | 7 | 130.8-130.9 | 20 | [ | |
| chr 10 | 10 | 6.0-6.2 | 65 | [ | |
| Linkage study | chr 2 | 2 | 191.8-192.3 | 100 | [ |
| chr 11 | 11 | 40.7-41.2 | 101 | [ | |
| Total | 386 (378)a |
a8 SNPs on chromosome 1 overlapped between two studies; therefore 378 SNPs were in the final list.
Best models detected by MDR analyses
| No. locus/loci in model | Best model | Accuracy (%) | Cross validation | |
|---|---|---|---|---|
| With SE | ||||
| 1-locus | SE | 76.24 | 100% | < 0.0001 |
| 2-locus | SE chr 2 linkage (rs1517835) | 75.54 | 40% | < 0.0001 |
| Without SE | ||||
| 1-locus | 56.27 | 100% | < 0.0001 | |
| 2-locus | 57.64 | 100% | < 0.0001 |
p-Value for each gene/region considering SE as known risk gene in the omnibus method
| Gene 1 | No. SNPs | Score test | |
|---|---|---|---|
| chr 1/ | 9 | 37.48 | |
| chr 2/linkage | 49 | 49.07 | 0.8957 |
| chr 7 | 11 | 17.04 | 0.3417 |
| chr 9/ | 13 | 44.01 | |
| chr 10 | 33 | 61.15 | 0.1100 |
| chr 11/linkage | 41 | 44.96 | 0.7077 |
| chr 20/ | 19 | 32.22 | 0.1920 |
a Bold font indicates p < 0.05.
Detection of important factors using RF
| Ranking by importance | ||||
|---|---|---|---|---|
| Marker | Gene | Chr | MDA | MDG |
| SE | Shared epitope | N/A | 1 | 1 |
| rs8177685 | 10 | 2 | 2 | |
| rs2476601 | 1 | 3 | 4 | |
| rs2274037 | 10 | 4 | 111 | |
| rs1569723 | 20 | 5 | 5 | |
| rs7559874 | linkage | 2 | 6 | 6 |
| rs2416810 | 9 | 7 | 65 | |
| rs7795093 | association | 7 | 8 | 3 |
| rs1179766 | 9 | 9 | 121 | |