Ekaterina Noskova1, Vladimir Ulyantsev1, Klaus-Peter Koepfli1,2, Stephen J O'Brien1,3, Pavel Dobrynin1,2. 1. Computer Technologies Laboratory, ITMO University, 49 Kronverkskiy Pr., St. Petersburg 197101, Russian Federation. 2. Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, 3001 Connecticut Ave., NW Washington, D.C. 20008, USA. 3. Guy Harvey Oceanographic Center, Nova Southeastern University Ft. Lauderdale, 8000 North Ocean Drive, Ft. Lauderdale, Florida 33004, USA.
Abstract
BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.
BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.
Authors: Daniel M Portik; Adam D Leaché; Danielle Rivera; Michael F Barej; Marius Burger; Mareike Hirschfeld; Mark-Oliver Rödel; David C Blackburn; Matthew K Fujita Journal: Mol Ecol Date: 2017-08-24 Impact factor: 6.185
Authors: Simon Gravel; Brenna M Henn; Ryan N Gutenkunst; Amit R Indap; Gabor T Marth; Andrew G Clark; Fuli Yu; Richard A Gibbs; Carlos D Bustamante Journal: Proc Natl Acad Sci U S A Date: 2011-07-05 Impact factor: 11.205
Authors: Laura Buggiotti; Andrey A Yurchenko; Nikolay S Yudin; Christy J Vander Jagt; Nadezhda V Vorobieva; Mariya A Kusliy; Sergei K Vasiliev; Andrey N Rodionov; Oksana I Boronetskaya; Natalia A Zinovieva; Alexander S Graphodatsky; Hans D Daetwyler; Denis M Larkin Journal: Mol Biol Evol Date: 2021-07-29 Impact factor: 16.240
Authors: Samuel D Payet; Morgan S Pratchett; Pablo Saenz-Agudelo; Michael L Berumen; Joseph D DiBattista; Hugo B Harrison Journal: Evol Appl Date: 2022-08-05 Impact factor: 4.929