Literature DB >> 33682305

Chromosome-scale inference of hybrid speciation and admixture with convolutional neural networks.

Paul D Blischak1,2, Michael S Barker1, Ryan N Gutenkunst2.   

Abstract

Inferring the frequency and mode of hybridization among closely related organisms is an important step for understanding the process of speciation and can help to uncover reticulated patterns of phylogeny more generally. Phylogenomic methods to test for the presence of hybridization come in many varieties and typically operate by leveraging expected patterns of genealogical discordance in the absence of hybridization. An important assumption made by these tests is that the data (genes or SNPs) are independent given the species tree. However, when the data are closely linked, it is especially important to consider their nonindependence. Recently, deep learning techniques such as convolutional neural networks (CNNs) have been used to perform population genetic inferences with linked SNPs coded as binary images. Here, we use CNNs for selecting among candidate hybridization scenarios using the tree topology (((P1 , P2 ), P3 ), Out) and a matrix of pairwise nucleotide divergence (dXY ) calculated in windows across the genome. Using coalescent simulations to train and independently test a neural network showed that our method, HyDe-CNN, was able to accurately perform model selection for hybridization scenarios across a wide breath of parameter space. We then used HyDe-CNN to test models of admixture in Heliconius butterflies, as well as comparing it to phylogeny-based introgression statistics. Given the flexibility of our approach, the dropping cost of long-read sequencing and the continued improvement of CNN architectures, we anticipate that inferences of hybridization using deep learning methods like ours will help researchers to better understand patterns of admixture in their study organisms.
© 2021 John Wiley & Sons Ltd.

Entities:  

Keywords:  admixture; convolutional neural networks; deep learning; gene flow; hybridization; model selection

Mesh:

Year:  2021        PMID: 33682305      PMCID: PMC8675098          DOI: 10.1111/1755-0998.13355

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  60 in total

1.  Isolation by Distance.

Authors:  S Wright
Journal:  Genetics       Date:  1943-03       Impact factor: 4.562

2.  Ancestral population genomics: the coalescent hidden Markov model approach.

Authors:  Julien Y Dutheil; Ganesh Ganapathy; Asger Hobolth; Thomas Mailund; Marcy K Uyenoyama; Mikkel H Schierup
Journal:  Genetics       Date:  2009-07-06       Impact factor: 4.562

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  A Three-Sample Test for Introgression.

Authors:  Matthew W Hahn; Mark S Hibbins
Journal:  Mol Biol Evol       Date:  2019-12-01       Impact factor: 16.240

5.  The Timing and Direction of Introgression Under the Multispecies Network Coalescent.

Authors:  Mark S Hibbins; Matthew W Hahn
Journal:  Genetics       Date:  2019-01-22       Impact factor: 4.562

6.  A universal SNP and small-indel variant caller using deep neural networks.

Authors:  Ryan Poplin; Pi-Chuan Chang; David Alexander; Scott Schwartz; Thomas Colthurst; Alexander Ku; Dan Newburger; Jojo Dijamco; Nam Nguyen; Pegah T Afshar; Sam S Gross; Lizzie Dorfman; Cory Y McLean; Mark A DePristo
Journal:  Nat Biotechnol       Date:  2018-09-24       Impact factor: 54.908

7.  Genomic relationships and speciation times of human, chimpanzee, and gorilla inferred from a coalescent hidden Markov model.

Authors:  Asger Hobolth; Ole F Christensen; Thomas Mailund; Mikkel H Schierup
Journal:  PLoS Genet       Date:  2006-11-30       Impact factor: 5.917

8.  Deep Learning for Population Genetic Inference.

Authors:  Sara Sheehan; Yun S Song
Journal:  PLoS Comput Biol       Date:  2016-03-28       Impact factor: 4.475

9.  Chromosome-scale inference of hybrid speciation and admixture with convolutional neural networks.

Authors:  Paul D Blischak; Michael S Barker; Ryan N Gutenkunst
Journal:  Mol Ecol Resour       Date:  2021-03-08       Impact factor: 7.090

10.  No evidence for maintenance of a sympatric Heliconius species barrier by chromosomal inversions.

Authors:  John W Davey; Sarah L Barker; Pasi M Rastas; Ana Pinharanda; Simon H Martin; Richard Durbin; W Owen McMillan; Richard M Merrill; Chris D Jiggins
Journal:  Evol Lett       Date:  2017-06-14
View more
  2 in total

1.  Reference-free discovery of nuclear SNPs permits accurate, sensitive identification of Carya (hickory) species and hybrids.

Authors:  Robert A Literman; Brittany M Ott; Jun Wen; L J Grauke; Rachel S Schwartz; Sara M Handy
Journal:  Appl Plant Sci       Date:  2022-01-20       Impact factor: 1.936

2.  Chromosome-scale inference of hybrid speciation and admixture with convolutional neural networks.

Authors:  Paul D Blischak; Michael S Barker; Ryan N Gutenkunst
Journal:  Mol Ecol Resour       Date:  2021-03-08       Impact factor: 7.090

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.