| Literature DB >> 27980659 |
Ellen E Quillen1, John Blangero2, Laura Almasy2.
Abstract
BACKGROUND: The application of pathway and gene-set based analyses to high-throughput data is increasingly common and represents an effort to understand underlying biology where single-gene or single-marker analyses have failed. Many such analyses rely on the a priori identification of genes associated with the trait of interest. In contrast, this variance-component-based approach creates a similarity matrix of individuals based on the expression of genes in each pathway.Entities:
Year: 2016 PMID: 27980659 PMCID: PMC5133490 DOI: 10.1186/s12919-016-0053-6
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Percentage of simulated phenotypes associated with positive control matrices by similarity calculation method
| Simulated DBP | Q1 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Matrix | Average (%) | bhja (%) | corr (%) | dive (%) | ejac (%) | eucl (%) | bhja (%) | corr (%) | dive (%) | ejac (%) | eucl (%) |
| 1 | 29.6 | 34.2 | 65.0 | 34.0 | 61.5 | 36.0 | 1.0 | 5.5 | 3.5 | 3.5 | 1.5 |
| 2 | 18.1 | 23.5 | 55.0 | 14.0 | 50.0 | 25.0 | 0.0 | 3.5 | 1.5 | 2.5 | 0.0 |
| 3 | 13.3 | 13.2 | 34.5 | 18.5 | 31.5 | 15.0 | 1.5 | 6.5 | 1.0 | 5.5 | 2.0 |
| 4 | 10.6 | 4.0 | 35.0 | 4.5 | 28.5 | 8.5 | 0.5 | 4.5 | 0.0 | 3.5 | 1.0 |
| 5 | 8.1 | 4.5 | 16.0 | 4.5 | 14.5 | 4.5 | 0.5 | 4.5 | 1.5 | 5.0 | 0.5 |
| 6 | 6.0 | 1.0 | 7.0 | 3.0 | 6.0 | 1.0 | 1.0 | 7.0 | 0.5 | 4.5 | 0.5 |
| 7 | 5.2 | 1.5 | 7.0 | 4.5 | 6.0 | 2.0 | 0.5 | 3.0 | 1.5 | 3.0 | 0.0 |
| 8 | 3.4 | 0.0 | 2.5 | 2.5 | 2.5 | 0.5 | 0.5 | 3.5 | 0.0 | 2.5 | 0.5 |
| 9 | 2.5 | 0.0 | 2.5 | 2.0 | 2.5 | 0.0 | 1.5 | 4.0 | 1.5 | 4.5 | 1.5 |
| 10 | 1.0 | 0.0 | 0.5 | 1.0 | 1.0 | 0.0 | 1.0 | 3.5 | 1.5 | 3.0 | 1.0 |
bhja Bhjattacharyya distance, corr correlation, dive divergence distance, ejac extended Jaccard distance, eucl euclidean distance
Formulas for selected distance matrix calculations
| Method | Formula |
|---|---|
| Bhjattacharyya distance | sqrt(sumi (sqrt(xi)–sqrt(yi))2) |
| Correlation | xy/sqrt(xx * yy) for centered x,y |
| Divergence distance | sumi (xi-yi)^2/(xi + yi)2 |
| Euclidean distance | sqrt(sumi (xi-yi)2)) |
| Extended Jaccard distance | xy/(xx + yy-xy) |
Fig. 1Expected vs. observed χ2 values for correlation matrices associated with (a) Q1, (b) simulated DBP, and (c) real DBP
Fig. 2Pathway size vs. chi-squared value in real data