| Literature DB >> 20018003 |
Qiuying Sha1, Rui Tang, Shuanglin Zhang.
Abstract
With the recent rapid improvements in high-throughout genotyping techniques, researchers are facing a very challenging task of large-scale genetic association analysis, especially at the whole-genome level, without an optimal solution. In this study, we propose a new approach for genetic association analysis based on a variable-sized sliding-window framework. This approach employs principal component analysis to find the optimal window size. Using the bisection algorithm in window size searching, the proposed method tackles the exhaustive computation problem. It is more efficient and effective than currently available approaches. We conduct the genome-wide association study in Genetic Analysis Workshop 16 (GAW16) Problem 1 data using the proposed method. Our method successfully identified several susceptibility genes that have been reported by other researchers and additional candidate genes for follow-up studies.Entities:
Year: 2009 PMID: 20018003 PMCID: PMC2795910 DOI: 10.1186/1753-6561-3-s7-s14
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
The distribution of the window sizes based on 531,501 windows
| Window size | Percentage of total windows |
|---|---|
| 4 | 18.32 |
| 5 | 11.92 |
| 6 | 11.69 |
| 7 | 11.57 |
| 8 | 8.94 |
| 9 | 7.34 |
| 10 | 6.34 |
| 11 | 4.67 |
| 12 | 3.73 |
| 13-29 | 15.48 |
Genetic and physical map locations of window region identified using PC-sliding-window analysis based on the Bonferroni correction
| Window ID | Chra | Physical location | Genesb | CRASGc |
|---|---|---|---|---|
| 1 | 6 | 30014670, 33187144 | ||
| 2 | 1 | 792429, 1101089 | ||
| 3 | 2 | 172768404, 172807000 | ||
| 4 | 12 | 46666298, 46718200 | ||
| 5 | 13 | 113656958, 113861908 | ||
| 6 | 7 | 154133201, 154241160 | ||
| 7 | 17 | 68283979, 68361160 | ||
| 8 | 2 | 98261543, 98370780 | ||
| 9 | 13 | 49336428, 49340230 | ||
| 10 | 1 | 2243956, 3359357 | ||
| 11 | 17 | 66647226, 66750860 | ||
| 12 | 16 | 67482002, 67660490 | ||
| 13 | 18 | 75300466, 75314140 | ||
| 14 | 13 | 74883232, 74941310 | ||
| 15 | 9 | 123211883, 123248000 | ||
| 16 | 20 | 57796484, 57832810 | ||
| 17 | 1 | 15181683, 151846600 | ||
| 18 | 20 | 35438689, 35501280 | ||
| 19 | 22 | 28164734, 28237820 | ||
| 20 | 11 | 64593946, 64661480 | ||
| 21 | 11 | 45207308, 45314100 | ||
| 22 | 8 | 20327035, 20435200 | ||
| 23 | 5 | 137614229, 137825000 | ||
| 24 | 7 | 129525353, 129580000 | ||
| 25 | 3 | 134954925, 134966000 | ||
| 26 | 9 | 104811123, 104815000 | ||
| 27 | 12 | 6924169, 6932652 | ||
| 28 | 10 | 10531181, 10542841 | ||
| 29 | 11 | 3335218, 3524620 | ||
| 30 | 19 | 19106771, 19154190 |
aChr, chromosome
bWe found the significant genes using the NCBI dbSNP database http://www.ncbi.nlm.nih.gov/sites/entrez?db=snp.
cThe confirmed RA susceptibility genes (CRASG) are shown in the last column if they are within or near our significant region.