Literature DB >> 33045078

Learning Retention Mechanisms and Evolutionary Parameters of Duplicate Genes from Their Expression Data.

Michael DeGiorgio1,2, Raquel Assis1,2.   

Abstract

Learning about the roles that duplicate genes play in the origins of novel phenotypes requires an understanding of how their functions evolve. A previous method for achieving this goal, CDROM, employs gene expression distances as proxies for functional divergence and then classifies the evolutionary mechanisms retaining duplicate genes from comparisons of these distances in a decision tree framework. However, CDROM does not account for stochastic shifts in gene expression or leverage advances in contemporary statistical learning for performing classification, nor is it capable of predicting the parameters driving duplicate gene evolution. Thus, here we develop CLOUD, a multi-layer neural network built on a model of gene expression evolution that can both classify duplicate gene retention mechanisms and predict their underlying evolutionary parameters. We show that not only is the CLOUD classifier substantially more powerful and accurate than CDROM, but that it also yields accurate parameter predictions, enabling a better understanding of the specific forces driving the evolution and long-term retention of duplicate genes. Further, application of the CLOUD classifier and predictor to empirical data from Drosophila recapitulates many previous findings about gene duplication in this lineage, showing that new functions often emerge rapidly and asymmetrically in younger duplicate gene copies, and that functional divergence is driven by strong natural selection. Hence, CLOUD represents a major advancement in classifying retention mechanisms and predicting evolutionary parameters of duplicate genes, thereby highlighting the utility of incorporating sophisticated statistical learning techniques to address long-standing questions about evolution after gene duplication.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Ornstein–Uhlenbeck; gene duplication; neofunctionalization; neural network; subfunctionalization

Mesh:

Year:  2021        PMID: 33045078      PMCID: PMC7947822          DOI: 10.1093/molbev/msaa267

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  81 in total

1.  The evolution of gene expression levels in mammalian organs.

Authors:  David Brawand; Magali Soumillon; Anamaria Necsulea; Philippe Julien; Gábor Csárdi; Patrick Harrigan; Manuela Weier; Angélica Liechti; Ayinuer Aximu-Petri; Martin Kircher; Frank W Albert; Ulrich Zeller; Philipp Khaitovich; Frank Grützner; Sven Bergmann; Rasmus Nielsen; Svante Pääbo; Henrik Kaessmann
Journal:  Nature       Date:  2011-10-19       Impact factor: 49.962

2.  Gene expression divergence recapitulates the developmental hourglass model.

Authors:  Alex T Kalinka; Karolina M Varga; Dave T Gerrard; Stephan Preibisch; David L Corcoran; Julia Jarrells; Uwe Ohler; Casey M Bergman; Pavel Tomancak
Journal:  Nature       Date:  2010-12-09       Impact factor: 49.962

3.  Gene Dosage Balance Immediately following Whole-Genome Duplication in Arabidopsis.

Authors:  Sunil K Kenchanmane Raju
Journal:  Plant Cell       Date:  2020-03-17       Impact factor: 11.277

4.  Evolution at two levels in fire ants: the relationship between patterns of gene expression and protein sequence evolution.

Authors:  Brendan G Hunt; Lino Ometto; Laurent Keller; Michael A D Goodisman
Journal:  Mol Biol Evol       Date:  2012-10-10       Impact factor: 16.240

5.  Gene duplication and the evolution of phenotypic diversity in insect societies.

Authors:  Linh M Chau; Michael A D Goodisman
Journal:  Evolution       Date:  2017-09-29       Impact factor: 3.694

6.  New genes in Drosophila quickly become essential.

Authors:  Sidi Chen; Yong E Zhang; Manyuan Long
Journal:  Science       Date:  2010-12-17       Impact factor: 47.728

7.  A genome-wide view of the spectrum of spontaneous mutations in yeast.

Authors:  Michael Lynch; Way Sung; Krystalynne Morris; Nicole Coffey; Christian R Landry; Erik B Dopman; W Joseph Dickinson; Kazufusa Okamoto; Shilpa Kulkarni; Daniel L Hartl; W Kelley Thomas
Journal:  Proc Natl Acad Sci U S A       Date:  2008-06-26       Impact factor: 11.205

8.  High mutational rates of large-scale duplication and deletion in Daphnia pulex.

Authors:  Nathan Keith; Abraham E Tucker; Craig E Jackson; Way Sung; José Ignacio Lucas Lledó; Daniel R Schrider; Sarah Schaack; Jeffry L Dudycha; Matthew Ackerman; Andrew J Younge; Joseph R Shaw; Michael Lynch
Journal:  Genome Res       Date:  2015-10-30       Impact factor: 9.043

9.  Supervised machine learning reveals introgressed loci in the genomes of Drosophila simulans and D. sechellia.

Authors:  Daniel R Schrider; Julien Ayroles; Daniel R Matute; Andrew D Kern
Journal:  PLoS Genet       Date:  2018-04-23       Impact factor: 5.917

10.  The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference.

Authors:  Lex Flagel; Yaniv Brandvain; Daniel R Schrider
Journal:  Mol Biol Evol       Date:  2019-02-01       Impact factor: 16.240

View more
  1 in total

1.  BLAST from the Past: Impacts of Evolving Approaches on Studies of Evolution by Gene Duplication.

Authors:  Frédéric J J Chain; Raquel Assis
Journal:  Genome Biol Evol       Date:  2021-07-06       Impact factor: 3.416

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.