| Literature DB >> 21846375 |
Stacey J Winham1, Alison A Motsinger-Reif.
Abstract
BACKGROUND: A breadth of high-dimensional data is now available with unprecedented numbers of genetic markers and data-mining approaches to variable selection are increasingly being utilized to uncover associations, including potential gene-gene and gene-environment interactions. One of the most commonly used data-mining methods for case-control data is Multifactor Dimensionality Reduction (MDR), which has displayed success in both simulations and real data applications. Additional software applications in alternative programming languages can improve the availability and usefulness of the method for a broader range of users.Entities:
Year: 2011 PMID: 21846375 PMCID: PMC3177775 DOI: 10.1186/1756-0381-4-24
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Summary table for MDR fit with 5-fold cross-validation
| | | | | ||
|---|---|---|---|---|---|
| | | | | ||
| * | | | | | |
| | | | | ||
'*' indicates overall best model
Summary table for MDR fit with three-way split validation
| | | | | ||
|---|---|---|---|---|---|
| | | | | ||
| | | | | ||
| * | | | | | |
'*' indicates overall best model
Figure 1The result of a sample call to 'plot' after an MDR fit with 5-fold cross-validation on a simulated dataset with 250 individuals genotyped at 25 SNPs.
Figure 2The result of a sample call to 'plot' after an MDR fit with three-way split on a simulated dataset with 250 individuals genotyped at 25 SNPs.
Sample run time in seconds for the package 'MDR' and for the GUI version
| Time (seconds) | 'MDR' | 'MDR' | GUI |
|---|---|---|---|
| 221.85 | 41.87 | 1.253 | |
| 1951.05 | 345.67 | 3.902 | |
| 2138.25 | 375.42 | 6.329 |