| Literature DB >> 18667089 |
Javier Gayán1, Antonio González-Pérez, Fernando Bermudo, María Eugenia Sáez, Jose Luis Royo, Antonio Quintas, Jose Jorge Galan, Francisco Jesús Morón, Reposo Ramirez-Lorca, Luis Miguel Real, Agustín Ruiz.
Abstract
BACKGROUND: The difficulty in elucidating the genetic basis of complex diseases roots in the many factors that can affect the development of a disease. Some of these genetic effects may interact in complex ways, proving undetectable by current single-locus methodology.Entities:
Mesh:
Year: 2008 PMID: 18667089 PMCID: PMC2533022 DOI: 10.1186/1471-2164-9-360
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1HFCC data groups and algorithm. HFCC data groups are divided into cases (F), controls (C) and control filter (CF). The number of replication groups is experiment-specific and depends on the available dataset (sample size and number of genetic markers). The HFCC algorithm is defined by a particular analysis flow. For each marker variable (single-locus markers, two-locus marker combinations, etc. depending on the type of study), a sequential number of tests is performed. Case-controls comparisons are performed on each replication group, and control-control comparisons are carried out in each control filter group. If any of these tests is not beyond a statistical threshold, the marker variable is dropped, and the next marker variable is analyzed. Marker variables over the statistical threshold in all case-control tests, and below the threshold in all control-control tests, are selected.
Power Analysis.
| Power (%) | |||
| Noise | Model 1 | Model 2 | Model 3 |
| None | 100 | 100 | 99 |
| GE | 100 | 100 | 96 |
| MS | 100 | 100 | 97 |
| PC | 89 | 100 | 49 |
| GH | 100 | 100 | 82 |
| GE+GH+PC+MS | 51 | 71 | 34 |
Power of HFCC to detect a two-locus interaction at an alpha level of 0.01 under 3 genetic epistasis models and different sources of noise: no noise (None), genotyping error (GE), missing data (MS), phenocopy (PC), genetic heterogeneity (GH), and all sources of noise simultenously (GE+GH+PC+MS). For the datasets with genetic heterogeneity, we reported the power to detect either of the two simulated two-locus effects. Model 1 is a heterozygote-homozygote risk (i.e., risk genotypes ABCC, ABDD, AACD and BBCD); Model 2 represents a "2 and only 2"-allele risk (i.e., A and C) from any of two loci (i.e., risk genotypes AADD, ABCD, and BBCC); Model 3 represents a protective double-heterozygote (i.e., protective genotype ABCD).
HFCC analysis: Preliminary results and effect of noise filters.
| Applied Filters | Unfiltered Results | After Control Filter | After Direction Filter | After Tracking Filter | Selected two-locus SNP pairs | ||
| Simultaneous | Conditional | Epistatic | |||||
| Method I | |||||||
| N | 418,535 | - | 320,265 | - | 71,332 | 248,898 | 35 |
| % | 100% | - | 76.5% | - | 17.0% | 59.5% | 0.01% |
| Method II | |||||||
| N | 418,535 | - | 320,265 | 26,371 | 19,630 | 6,706 | 35 |
| % | 100.0% | - | 76.5% | 6.3% | 6.1% | 2.1% | 0.01% |
| Method III | |||||||
| N | 418,535 | 340,043 | 320,188 | 26,347 | 19,611 | 6,701 | 35 |
| % | 100.0% | 81.2% | 76.5% | 6.3% | 6.1% | 2.1% | 0.01% |
Method I: Only Direction Filter applied.
Method II: Direction and Tracking filters applied.
Method III: All filters applied (Control, Direction, and Tracking filters sequentially in that order).
N: Number of selected two-locus SNP pairs.
%: Percentage of selected two-locus SNP pairs over the preliminary set (unfiltered HFCC results).
A total of 36 SNPs were identified as tracking SNPs in the two methods using tracking filter.
Selected Parkinson's disease two-locus combinations.
| SNP1 | SNP2 | Effect | Genetic Model | Odds Ratio | 2-locus X2 | Case-only X2 |
| rs6542522 | rs3923511 | E | TC*GA | 0.28 | 24.97 | 13.07 |
| rs10799573 | rs1341622 | E | CC*CT | 5.22 | 23.08 | 10.61 |
| rs12520264 | rs2992630 | E | AG*GA | 0.22 | 23.86 | 23.12 |
| rs324454 | rs12672177 | E | GG*AG | 0.21 | 24.31 | 11.55 |
| rs6656554 | rs3898966 | E | GT*TC | 3.83 | 23.44 | 22.34 |
| rs7650598 | rs12353255 | E | AG*AG | 3.99 | 23.94 | 9.40 |
| rs2439525 | rs1327918 | E | TT*CC | 2.39 | 22.04 | 7.13 |
| rs10201616 | rs4495512 | E | CT*TT | 5.04 | 24.08 | 8.63 |
| rs11167062 | rs1270919 | E | TC*GT | 0.20 | 21.83 | 20.65 |
| rs2419117 | rs10512174 | E | GT*CC | 5.55 | 23.33 | 9.33 |
| rs1370699 | rs1673130 | C | CC*CC | 0.35 | 25.50 | 30.15 |
| rs1590957 | rs8043401 | C | TT*AG | 0.32 | 24.17 | 28.18 |
| rs1557615 | rs2582597 | C | TC*AG | 0.35 | 21.76 | 26.38 |
| rs11781101 | rs4775501 | C | CC*CC | 0.22 | 23.99 | 25.69 |
| rs1370699 | rs7897163 | C | CC*CC | 0.34 | 30.65 | 25.68 |
| rs13197142 | rs2206699 | C | GG*AA | 2.33 | 21.85 | 25.16 |
| rs732594 | rs12373417 | C | AA*GG | 3.62 | 22.31 | 25.16 |
| rs10498269 | rs1551355 | C | TC*CC | 0.38 | 23.42 | 25.00 |
| rs2555614 | rs331617 | C | AA*TT | 0.35 | 24.15 | 24.22 |
| rs3894377 | rs10774863 | C | AG*CT | 0.31 | 25.47 | 23.84 |
| rs4799327 | rs2301661 | S | AG*CC | 4.32 | 20.65 | 33.62 |
| rs2336865 | rs7973385 | S | GA*TC | 3.22 | 30.71 | 27.77 |
| rs6779648 | rs270406 | S | GG*AG | 0.26 | 26.57 | 22.49 |
| rs12599027 | rs767055 | S | TT*TT | 0.23 | 26.23 | 24.34 |
| rs3891371 | rs4724620 | S | AG*CC | 0.29 | 35.76 | 23.92 |
| rs2297518 | rs660454 | S | AG*TT | 0.12 | 23.92 | 23.72 |
| rs357968 | rs1159145 | S | AA*AA | 0.26 | 30.29 | 23.47 |
| rs1476097 | rs5766305 | S | TT*TT | 0.37 | 23.93 | 23.19 |
| rs2955005 | rs2169793 | S | GT*AG | 3.29 | 25.74 | 22.76 |
| rs2560790 | rs9390939 | S | AA*CA | 0.32 | 23.35 | 22.62 |
A selection of two-locus marker combinations consistently and significantly associated to Parkinson's disease (p < 10-6) and displaying an epistatic (E) effect, or conditional (C) or simultaneous (S) effects with significant deviations of a multiplicative model. The genetic model tested is specified as SNP1 genotype * SNP2 genotype.
Validation analysis.
| Sample | Preliminary Results | Simultaneous | Conditional | Epistatic |
| Original | 418,535 | 19,630 | 6,706 | 35 |
| Validation 1 | 396,708 | 20,855 | 6,897 | 23 |
| Validation 2 | 283,584 | 19,270 | 6,219 | 18 |
| Validation 3 | 299,846 | 19,261 | 6,000 | 27 |
| Validation 4 | 403,475 | 19,640 | 6,660 | 21 |
| Validation 5 | 337,422 | 19,575 | 6,463 | 22 |
| Validation 6 | 320,406 | 18,879 | 6,230 | 20 |
| Validation 7 | 313,653 | 18,667 | 5,943 | 21 |
| Validation 8 | 294,007 | 19,255 | 6,360 | 26 |
| Validation 9 | 322,754 | 18,478 | 5,772 | 22 |
| Validation 10 | 363,194 | 19,317 | 5,529 | 28 |
| Average Count (St. Dev.) | 341,235 (47,125) | 19,348 (629) | 6,253 (419) | 24 (5) |
| Average % | 100% | 7.92% | 2.56% | 0.010% |
Number of total preliminary results, and number of selected simultaneous, conditional and epistatic two-locus combinations in the original sample, and in ten random-placement validation samples. Average numbers (and standard deviations) for the 11 samples, and average percentage of each category, are included.