| Literature DB >> 20691091 |
Lorenzo Beretta1, Alessandro Santaniello, Piet L C M van Riel, Marieke J H Coenen, Raffaella Scorza.
Abstract
BACKGROUND: Epistasis is recognized as a fundamental part of the genetic architecture of individuals. Several computational approaches have been developed to model gene-gene interactions in case-control studies, however, none of them is suitable for time-dependent analysis. Herein we introduce the Survival Dimensionality Reduction (SDR) algorithm, a non-parametric method specifically designed to detect epistasis in lifetime datasets.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20691091 PMCID: PMC2928804 DOI: 10.1186/1471-2105-11-416
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Survival Dimensionality Reduction (SDR) process. Step 1, survival estimates are calculated in the whole population by the Kaplan-Meier method. Step 2, survival estimates are computed in each multidimensional cell. Step 3, time-point differences in survival estimates between the whole population and each multidimensional cell are calculated. Step 4, time-point differences are normalized to 1 to take into account negative values and then averaged via the geometric mean (GM). Step 5, individual data from multidimensional cells with GM ≤ 1 ("high-risk") are pooled and compared via log rank-test statistic, to individual data from multidimensional cells with GM > 1 ("low-risk").
Power for the Survival Dimensionality Reduction (SDR) algorithm in models with cumulative prevalence K(tn) = 0.750
| Model | RCR | 0.10 | 0.15 | 0.20 | 0.25 |
|---|---|---|---|---|---|
| 50% | 52 | 81 | 95 | 97 | |
| 70% | 59 | 73 | 93 | 93 | |
| 50% | 52 | 87 | 97 | 98 | |
| 70% | 48 | 77 | 87 | 96 | |
| 50% | 51 | 88 | 92 | 99 | |
| 70% | 50 | 87 | 91 | 98 | |
| 50% | 46 | 74 | 93 | 97 | |
| 70% | 43 | 72 | 91 | 95 | |
| 50% | 50 | 82 | 95 | 99 | |
| 70% | 47 | 80 | 90 | 96 | |
Power of the SDR algorithm in simulated datasets modelled using the 5 classes of survival distribution described by the logistic-exponential equation. Different degrees of right-censoring rate (RCR), and cumulative heritability at the survival time [H2(t5)] are considered. UBT, upside-down bathtub-shaped failure rate; DFR, decreasing failure rate; IFR, increasing failure rate; BT bathtub-shaped failure rate; EXP, exponential.
Figure 2Trend of the power for the survival dimensionality reduction algorithm in the different models. Relationship between power and cumulative broad-sense heritability (H2) according to the different degrees of censorship of the sample datasets (squares = 70%; triangles = 50%). The line represents the power estimated by the logistic function (R2 = 0.846).
Survival dimensionality reduction (SDR) model for the rheumatoid arthritis (RA) dataset.
| IBS | ||||
|---|---|---|---|---|
| n-way | SNPs (genes) in each dimension | Training | Testing | p |
| 1 | rs2327832 (TNFAIP3, OLIG3) | 0.263 | 0.2366 | - |
| 3 | rs1801274 (FcγRIIa), rs10954213 (IRF5), rs3761847 (TRAF1) | 0.2219 | 0.2393 | - |
Selection of the best combination of attributes by the SDR method. The model with the minimum testing IBS value in the cross-validated testing sets is indicated in boldface type. p values associated with the log-rank test statistic calculated by the 100-fold permutation test. SNP, single nucleotide polymorphism.
Figure 3Survival dimensionality reduction (SDR) model for the rheumatoid arthritis dataset. a. Multidimensional matrix for the single nucleotide polymorphisms (SNPs) interaction; GM, geometric mean of the differences; n, number of cases; e, percentage of events. b. Kaplan-Meier estimated survival patterns associated with "high-" and "low-risk" assignments. P values are those associated with a chi-square distribution with 1 degree of freedom.