| Literature DB >> 27721650 |
Carlos Montemuiño1, Antonio Espinosa1, Juan C Moure1, Gonzalo Vera2, Porfidio Hernández1, Sebastián Ramos-Onsins2.
Abstract
The msParSm application is an evolution of msPar, the parallel version of the coalescent simulation program ms, which removes the limitation for simulating long stretches of DNA sequences with large recombination rates, without compromising the accuracy of the standard coalescence. This work introduces msParSm, describes its significant performance improvements over msPar and its shared memory parallelization details, and shows how it can get better, if not similar, execution times than <span class="Gene">MaCS. Two case studies with different mutation rates were analyzed, one approximating the <span class="Species">human average and the other approximating the Drosophila melanogaster average. Source code is available at https://github.com/cmontemuino/msparsm.Entities:
Keywords: HPC; MPI; coalescence; recombination; sequential Markov coalescent
Year: 2016 PMID: 27721650 PMCID: PMC5047705 DOI: 10.4137/EBO.S40268
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Speedup analysis of case study 1 in function of the number of cores used for computation, grouped by a scaled recombination rate (A to D), registered in case study 1. Subfigures show speedup of the application increases adding more computational cores, slowly decaying when considering recombination rates of 4Nρ = 8000.
Figure 2Obtained speedup of case study 2 in function of the number of cores used for computation, grouped by used recombination rate (A to D). As in the previous case study 1, this figure shows increasing speedups when adding more computational resources.
Figure 3Average consumed memory for case study 1 in function increasing the number of computational cores and recombination values, registered in case study 1. Each subfigure shows the consumed memory values when running the simulation using 12, 24, 48, and 96 cores, starting with a scaled recombination of 4Nρ = 1000 in (A), and doubling it through the remaining subfigures (C to D) until reaching a scaled recombination of 4Nρ = 8000.
Figure 4Average consumed memory for case study 2 in function increasing the number of cores used for computation. Each subfigure (A to D) shows the same information as in Figure 3, but with data registered when running the case study 2.
Comparison of average time cost (in minutes) between ms, MaCS, msPar, and msParSm. The execution times of both msParSm and msPar are grouped in function of the number of cores used for computation. A combination of font styles and background color is used to facilitate the reading: values in italics are related to msPar, while bold style is used for msParSm; shaded table cells indicate cases that the execution time of either msParSm or msPar is worse than MaCS.
| Rho | Ms | MaCS | msParSm/ | 48 CORES | 24 CORES | 12 CORES | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1000 | 3.81 | 3.82 | ||||||||
| 2000 | 29.77 | 7.16 | ||||||||
| 4000 | 322.80 | 13.76 | ||||||||
| 8000 | 2605.58 | 26.86 | ||||||||
| 1000 | 3.95 | 5.80 | ||||||||
| 2000 | 34.85 | 9.14 | ||||||||
| 4000 | 320.30 | 15.88 | ||||||||
| 8000 | 2602.00 | 29.32 | ||||||||