| Literature DB >> 20018006 |
Min Zhang1, Yanzhu Lin, Libo Wang, Vitara Pungpapong, James C Fleet, Dabao Zhang.
Abstract
Currently, genome-wide association studies (GWAS) are conducted by collecting a massive number of SNPs (i.e., large p) for a relatively small number of individuals (i.e., small n) and associations are made between clinical phenotypes and genetic variation one single-nucleotide polymorphism (SNP) at a time. Univariate association approaches like this ignore the linkage disequilibrium between SNPs in regions of low recombination. This results in a low reliability of candidate gene identification. Here we propose to improve the case-control GWAS approach by implementing linear discriminant analysis (LDA) through a penalized orthogonal-components regression (POCRE), a newly developed variable selection method for large p small n data. The proposed POCRE-LDA method was applied to the Genetic Analysis Workshop 16 case-control data for rheumatoid arthritis (RA). In addition to the two regions on chromosomes 6 and 9 previously associated with RA by GWAS, we identified SNPs on chromosomes 10 and 18 as potential candidates for further investigation.Entities:
Year: 2009 PMID: 20018006 PMCID: PMC2795913 DOI: 10.1186/1753-6561-3-s7-s17
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1SNPs identified using POCRE-LDA. The x-axis indicates the physical location of each SNP on the chromosome, and the y-axis represents the absolute value of the estimated coefficient, i.e., |b|. Genetic regions with multiple SNPs are identified in chromosomes 6, 8, 9, 10, and 18.
Genomic regions and candidate genes identified for case-control study of rheumatoid arthritis in GAW16
| Chromosome | Genomic region (Mb) | SNP with the largest effect | Number of genes | Candidate genes |
|---|---|---|---|---|
| 6p21.33 | 31.55-31.62 | rs2523647 | 4 | |
| 6p21.32 | 32.33-32.41 | rs10484560 | 1 | |
| 6p21.32 | 32.47-32.69 | rs3135363 | 6 | |
| 6p21.32 | 37.74-32.79 | rs9275601 | 1 | |
| 6p21.32 | 32.87-32.97 | rs9380326 | 5 | |
| 6p21.32 | 33.21-33.29 | rs3130237 | 5 | |
| 9q33.1 | 12.05-12.12 | rs2900180 | 3 | |
| 10q11.22 | 49.64-49.79 | rs2671692 | 2 | |
| 18q12.1 | 26.82-26.88 | rs2852003 | 1 |
aThe identified SNPs are in linkage disequilibrium with these genes.