Ilya Shlyakhter1, Pardis C Sabeti1, Stephen F Schaffner2. 1. Broad Institute of MIT and Harvard, MA 02142 and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA. 2. Broad Institute of MIT and Harvard, MA 02142 and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
Abstract
MOTIVATION: Efficient simulation of population genetic samples under a given demographic model is a prerequisite for many analyses. Coalescent theory provides an efficient framework for such simulations, but simulating longer regions and higher recombination rates remains challenging. Simulators based on a Markovian approximation to the coalescent scale well, but do not support simulation of selection. Gene conversion is not supported by any published coalescent simulators that support selection.

RESULTS: We describe cosi2, an efficient simulator that supports both exact and approximate coalescent simulation with positive selection. cosi2 improves on the speed of existing exact simulators, and permits further speedup in approximate mode while retaining support for selection. cosi2 supports a wide range of demographic scenarios, including recombination hot spots, gene conversion, population size changes, population structure and migration. cosi2 implements coalescent machinery efficiently by tracking only a small subset of the Ancestral Recombination Graph, sampling only relevant recombination events, and using augmented skip lists to represent tracked genetic segments. To preserve support for selection in approximate mode, the Markov approximation is implemented not by moving along the chromosome but by performing a standard backwards-in-time coalescent simulation while restricting coalescence to node pairs with overlapping or near-overlapping genetic material. We describe the algorithms used by cosi2 and present comparisons with existing selection simulators.

AVAILABILITY AND IMPLEMENTATION: A free C++ implementation of cosi2 is available at http://broadinstitute.org/mpg/cosi2.
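The restriction used in approximate mode (coalescence only between node pairs with overlapping or near-overlapping genetic material) can be illustrated with a small Python sketch. This is not cosi2's code: the names `Segment`, `gap_between`, `eligible_pairs` and the `max_gap` threshold are hypothetical, the proximity rule (smallest distance between any two segments) is an assumed reading of "near-overlapping", and plain lists stand in for the augmented skip lists that cosi2 uses to represent tracked segments.

```python
from itertools import combinations

# Illustrative only: a segment is a half-open interval [start, end) on a
# chromosome scaled to [0, 1). cosi2 itself stores segments in skip lists.
Segment = tuple[float, float]


def gap_between(a: list[Segment], b: list[Segment]) -> float:
    """Smallest distance between any segment of a and any segment of b.

    Returns 0.0 when some pair of segments overlaps or touches.
    """
    best = float("inf")
    for a_start, a_end in a:
        for b_start, b_end in b:
            # Positive for disjoint segments, zero or negative otherwise.
            distance = max(a_start, b_start) - min(a_end, b_end)
            best = min(best, max(0.0, distance))
    return best


def eligible_pairs(
    nodes: dict[str, list[Segment]], max_gap: float
) -> list[tuple[str, str]]:
    """Node pairs allowed to coalesce under the assumed proximity rule.

    max_gap = 0.0 admits only overlapping or touching material; a larger
    value also admits near-overlapping pairs. A very large max_gap admits
    every pair, i.e. no restriction at all.
    """
    return [
        (x, y)
        for x, y in combinations(sorted(nodes), 2)
        if gap_between(nodes[x], nodes[y]) <= max_gap
    ]
```

In a backwards-in-time simulation, a coalescence event would then be drawn only from the pairs returned by `eligible_pairs`, so the chosen threshold controls the trade-off between speed and fidelity to the exact coalescent.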