Literature DB >> 33600408

Bayesian parameter estimation for automatic annotation of gene functions using observational data and phylogenetic trees.

George G Vega Yon1, Duncan C Thomas1, John Morrison1, Huaiyu Mi2, Paul D Thomas2, Paul Marjoram1.   

Abstract

Gene function annotation is important for a variety of downstream analyses of genetic data. But experimental characterization of function remains costly and slow, making computational prediction an important endeavor. Phylogenetic approaches to prediction have been developed, but implementation of a practical Bayesian framework for parameter estimation remains an outstanding challenge. We have developed a computationally efficient model of evolution of gene annotations using phylogenies based on a Bayesian framework using Markov Chain Monte Carlo for parameter estimation. Unlike previous approaches, our method is able to estimate parameters over many different phylogenetic trees and functions. The resulting parameters agree with biological intuition, such as the increased probability of function change following gene duplication. The method performs well on leave-one-out cross-validation, and we further validated some of the predictions in the experimental scientific literature.

Entities:  

Year:  2021        PMID: 33600408      PMCID: PMC7924801          DOI: 10.1371/journal.pcbi.1007948

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  28 in total

1.  Phylogenetic logistic regression for binary dependent variables.

Authors:  Anthony R Ives; Theodore Garland
Journal:  Syst Biol       Date:  2009-11-04       Impact factor: 15.683

2.  Predicting microbial traits with phylogenies.

Authors:  Marta Goberna; Miguel Verdú
Journal:  ISME J       Date:  2015-09-15       Impact factor: 10.302

3.  Genome-scale phylogenetic function annotation of large and diverse protein families.

Authors:  Barbara E Engelhardt; Michael I Jordan; John R Srouji; Steven E Brenner
Journal:  Genome Res       Date:  2011-07-22       Impact factor: 9.043

Review 4.  Model organism data evolving in support of translational medicine.

Authors:  Douglas G Howe; Judith A Blake; Yvonne M Bradford; Carol J Bult; Brian R Calvi; Stacia R Engel; James A Kadin; Thomas C Kaufman; Ranjana Kishore; Stanley J F Laulederkind; Suzanna E Lewis; Sierra A T Moxon; Joel E Richardson; Cynthia Smith
Journal:  Lab Anim (NY)       Date:  2018-09-17       Impact factor: 12.625

5.  ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R.

Authors:  Emmanuel Paradis; Klaus Schliep
Journal:  Bioinformatics       Date:  2019-02-01       Impact factor: 6.937

6.  A linear-time algorithm for Gaussian and non-Gaussian trait evolution models.

Authors:  Lam si Tung Ho; Cécile Ané
Journal:  Syst Biol       Date:  2014-02-04       Impact factor: 15.683

7.  Evolutionary trees from DNA sequences: a maximum likelihood approach.

Authors:  J Felsenstein
Journal:  J Mol Evol       Date:  1981       Impact factor: 2.395

8.  GIGA: a simple, efficient algorithm for gene tree inference in the genomic age.

Authors:  Paul D Thomas
Journal:  BMC Bioinformatics       Date:  2010-06-09       Impact factor: 3.169

9.  PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements.

Authors:  Huaiyu Mi; Xiaosong Huang; Anushya Muruganujan; Haiming Tang; Caitlin Mills; Diane Kang; Paul D Thomas
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

10.  A Fast Likelihood Method to Reconstruct and Visualize Ancestral Scenarios.

Authors:  Sohta A Ishikawa; Anna Zhukova; Wataru Iwasaki; Olivier Gascuel
Journal:  Mol Biol Evol       Date:  2019-09-01       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.