Literature DB >> 32644216

Deep learning for population size history inference: Design, comparison and combination with Approximate Bayesian Computation.

Théophile Sanchez1, Jean Cury1, Guillaume Charpiat1, Flora Jay1.   

Abstract

For the past decades, simulation-based likelihood-free inference methods have enabled researchers to address numerous population genetics problems. As the richness and amount of simulated and real genetic data keep increasing, the field has a strong opportunity to tackle tasks that current methods hardly solve. However, high data dimensionality forces most methods to summarize large genomic datasets into a relatively small number of handcrafted features (summary statistics). Here we propose an alternative to summary statistics, based on the automatic extraction of relevant information using deep learning techniques. Specifically, we design artificial neural networks (ANNs) that take as input single nucleotide polymorphic sites (SNPs) found in individuals sampled from a single population and infer the past effective population size history. First, we provide guidelines to construct artificial neural networks that comply with the intrinsic properties of SNP data such as invariance to permutation of haplotypes, long scale interactions between SNPs and variable genomic length. Thanks to a Bayesian hyperparameter optimization procedure, we evaluate the performance of multiple networks and compare them to well established methods like Approximate Bayesian Computation (ABC). Even without the expert knowledge of summary statistics, our approach compares fairly well to an ABC approach based on handcrafted features. Furthermore, we show that combining deep learning and ABC can improve performance while taking advantage of both frameworks. Finally, we apply our approach to reconstruct the effective population size history of cattle breed populations. This article is protected by copyright. All rights reserved.

Entities:  

Year:  2020        PMID: 32644216     DOI: 10.1111/1755-0998.13224

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  6 in total

1.  Revisiting the out of Africa event with a deep-learning approach.

Authors:  Francesco Montinaro; Vasili Pankratov; Burak Yelmen; Luca Pagani; Mayukh Mondal
Journal:  Am J Hum Genet       Date:  2021-10-08       Impact factor: 11.025

2.  Deep learning from phylogenies to uncover the epidemiological dynamics of outbreaks.

Authors:  J Voznica; A Zhukova; V Boskova; E Saulnier; F Lemoine; M Moslonka-Lefebvre; O Gascuel
Journal:  Nat Commun       Date:  2022-07-06       Impact factor: 17.694

3.  Efficient ancestry and mutation simulation with msprime 1.0.

Authors:  Franz Baumdicker; Gertjan Bisschop; Daniel Goldstein; Graham Gower; Aaron P Ragsdale; Georgia Tsambos; Sha Zhu; Bjarki Eldon; E Castedo Ellerman; Jared G Galloway; Ariella L Gladstein; Gregor Gorjanc; Bing Guo; Ben Jeffery; Warren W Kretzschumar; Konrad Lohse; Michael Matschiner; Dominic Nelson; Nathaniel S Pope; Consuelo D Quinto-Cortés; Murillo F Rodrigues; Kumar Saunack; Thibaut Sellinger; Kevin Thornton; Hugo van Kemenade; Anthony W Wohns; Yan Wong; Simon Gravel; Andrew D Kern; Jere Koskela; Peter L Ralph; Jerome Kelleher
Journal:  Genetics       Date:  2022-03-03       Impact factor: 4.402

4.  Congruent evolutionary responses of European steppe biota to late Quaternary climate change.

Authors:  Philipp Kirschner; Manolo F Perez; Eliška Záveská; Isabel Sanmartín; Laurent Marquer; Birgit C Schlick-Steiner; Nadir Alvarez; Florian M Steiner; Peter Schönswetter
Journal:  Nat Commun       Date:  2022-04-08       Impact factor: 14.919

5.  Neural networks for self-adjusting mutation rate estimation when the recombination rate is unknown.

Authors:  Klara Elisabeth Burger; Peter Pfaffelhuber; Franz Baumdicker
Journal:  PLoS Comput Biol       Date:  2022-08-03       Impact factor: 4.779

6.  fastsimcoal2: demographic inference under complex evolutionary scenarios.

Authors:  Laurent Excofffier; Nina Marchi; David Alexander Marques; Remi Matthey-Doret; Alexandre Gouy; Vitor C Sousa
Journal:  Bioinformatics       Date:  2021-06-23       Impact factor: 6.937

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.