| Literature DB >> 31245719 |
Aaron Kusmec1, Patrick S Schnable1,2.
Abstract
Genomewide association studies (GWAS) are computationally demanding analyses that use large sample sizes and dense marker sets to discover associations between quantitative trait variation and genetic variants. FarmCPU is a powerful new method for performing GWAS. However, its performance is hampered by details of its implementation and its reliance on the R programming language. In this paper, we present an efficient implementation of FarmCPU, called FarmCPUpp, that retains the R user interface but improves memory management and speed through the use of C++ code and parallel computing.Entities:
Keywords: bioinformatics; genomewide association study; quantitative trait; software
Year: 2018 PMID: 31245719 PMCID: PMC6508500 DOI: 10.1002/pld3.53
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
Bin sizes used for model selection at different marker densities. Sizes are given in base‐pairs
| Simulation | Bin sizes |
|---|---|
| 10,000 markers | 500; 1,000; 1,500; 2,000 |
| 50,000 markers | 2,500; 5,000; 7,500; 10,000 |
| 60,000 markers | 2,500; 5,000; 7,500; 10,000 |
| 100,000 markers | 5,000; 10,000; 15,000; 20,000 |
| 500,000 markers | 25,000; 50,000; 75,000; 100,000 |
| 1,000,000 markers | 50,000; 100,000; 150,000; 200,000 |
| 5,000,000 markers | 250,000; 500,000; 750,000; 1,000,000 |
Figure 1Effect of sample size on the runtime of FarmCPU and FarmCPUpp. (a) Total runtime in seconds. (b) Time for single‐marker regression in seconds. Dots represent the time spent in each iteration at the indicated sample size
Figure 2Effect of number of markers on the runtime of FarmCPU and FarmCPUpp. (a) Total runtime in seconds. (b) Time for single‐marker regression in seconds. Dots represent the time spent in each iteration at the indicated number of markers
Figure 3Effects of sample size and number of CPU cores on the runtime of FarmCPU and FarmCPUpp in hours. The full dataset has 4,890 samples, 2,452,207 markers, and three population structure covariates