Literature DB >> 21873298

Statistics and truth in phylogenomics.

Sudhir Kumar1, Alan J Filipski, Fabia U Battistuzzi, Sergei L Kosakovsky Pond, Koichiro Tamura.   

Abstract

Phylogenomics refers to the inference of historical relationships among species using genome-scale sequence data and to the use of phylogenetic analysis to infer protein function in multigene families. With rapidly decreasing sequencing costs, phylogenomics is becoming synonymous with evolutionary analysis of genome-scale and taxonomically densely sampled data sets. In phylogenetic inference applications, this translates into very large data sets that yield evolutionary and functional inferences with extremely small variances and high statistical confidence (P value). However, reports of highly significant P values are increasing even for contrasting phylogenetic hypotheses depending on the evolutionary model and inference method used, making it difficult to establish true relationships. We argue that the assessment of the robustness of results to biological factors, that may systematically mislead (bias) the outcomes of statistical estimation, will be a key to avoiding incorrect phylogenomic inferences. In fact, there is a need for increased emphasis on the magnitude of differences (effect sizes) in addition to the P values of the statistical test of the null hypothesis. On the other hand, the amount of sequence data available will likely always remain inadequate for some phylogenomic applications, for example, those involving episodic positive selection at individual codon positions and in specific lineages. Again, a focus on effect size and biological relevance, rather than the P value, may be warranted. Here, we present a theoretical overview and discuss practical aspects of the interplay between effect sizes, bias, and P values as it relates to the statistical inference of evolutionary truth in phylogenomics.

Entities:  

Mesh:

Year:  2011        PMID: 21873298      PMCID: PMC3258035          DOI: 10.1093/molbev/msr202

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  181 in total

1.  A Dirichlet process model for detecting positive selection in protein-coding DNA sequences.

Authors:  John P Huelsenbeck; Sonia Jain; Simon W D Frost; Sergei L Kosakovsky Pond
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-10       Impact factor: 11.205

2.  From phylogenetics to phylogenomics: the evolutionary relationships of insect endosymbiotic gamma-Proteobacteria as a test case.

Authors:  Iñaki Comas; Andrés Moya; Fernando González-Candelas
Journal:  Syst Biol       Date:  2007-02       Impact factor: 15.683

3.  Calculating bootstrap probabilities of phylogeny using multilocus sequence data.

Authors:  Tae-Kun Seo
Journal:  Mol Biol Evol       Date:  2008-02-14       Impact factor: 16.240

4.  Codon-based tests of positive selection, branch lengths, and the evolution of mammalian immune system genes.

Authors:  Austin L Hughes; Robert Friedman
Journal:  Immunogenetics       Date:  2008-06-26       Impact factor: 2.846

Review 5.  Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis.

Authors:  J A Eisen
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence.

Authors:  Michael E Alfaro; Stefan Zoller; François Lutzoni
Journal:  Mol Biol Evol       Date:  2003-02       Impact factor: 16.240

7.  28-way vertebrate alignment and conservation track in the UCSC Genome Browser.

Authors:  Webb Miller; Kate Rosenbloom; Ross C Hardison; Minmei Hou; James Taylor; Brian Raney; Richard Burhans; David C King; Robert Baertsch; Daniel Blankenberg; Sergei L Kosakovsky Pond; Anton Nekrutenko; Belinda Giardine; Robert S Harris; Svitlana Tyekucheva; Mark Diekhans; Thomas H Pringle; William J Murphy; Arthur Lesk; George M Weinstock; Kerstin Lindblad-Toh; Richard A Gibbs; Eric S Lander; Adam Siepel; David Haussler; W James Kent
Journal:  Genome Res       Date:  2007-11-05       Impact factor: 9.043

8.  Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks.

Authors:  Koichiro Tamura; Sankar Subramanian; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2003-08-29       Impact factor: 16.240

9.  Timing major conflict between mitochondrial and nuclear genes in species relationships of Polygonia butterflies (Nymphalidae: Nymphalini).

Authors:  Niklas Wahlberg; Elisabet Weingartner; Andrew D Warren; Sören Nylin
Journal:  BMC Evol Biol       Date:  2009-05-07       Impact factor: 3.260

Review 10.  Recent evolutions of multiple sequence alignment algorithms.

Authors:  Cédric Notredame
Journal:  PLoS Comput Biol       Date:  2007-08       Impact factor: 4.475

View more
  65 in total

1.  Target-capture phylogenomics provide insights on gene and species tree discordances in Old World treefrogs (Anura: Rhacophoridae).

Authors:  Kin Onn Chan; Carl R Hutter; Perry L Wood; L Lee Grismer; Rafe M Brown
Journal:  Proc Biol Sci       Date:  2020-12-09       Impact factor: 5.349

Review 2.  A beginner's guide to phylogenetics.

Authors:  Roy D Sleator
Journal:  Microb Ecol       Date:  2013-04-28       Impact factor: 4.552

3.  Inferring ancient divergences requires genes with strong phylogenetic signals.

Authors:  Leonidas Salichos; Antonis Rokas
Journal:  Nature       Date:  2013-05-08       Impact factor: 49.962

Review 4.  Evolutionary genomics of Borrelia burgdorferi sensu lato: findings, hypotheses, and the rise of hybrids.

Authors:  Wei-Gang Qiu; Che L Martin
Journal:  Infect Genet Evol       Date:  2014-04-03       Impact factor: 3.342

5.  Simultaneous Bayesian estimation of alignment and phylogeny under a joint model of protein sequence and structure.

Authors:  Joseph L Herman; Christopher J Challis; Ádám Novák; Jotun Hein; Scott C Schmidler
Journal:  Mol Biol Evol       Date:  2014-06-04       Impact factor: 16.240

6.  Terrace Aware Data Structure for Phylogenomic Inference from Supermatrices.

Authors:  Olga Chernomor; Arndt von Haeseler; Bui Quang Minh
Journal:  Syst Biol       Date:  2016-04-26       Impact factor: 15.683

7.  mPartition: A Model-Based Method for Partitioning Alignments.

Authors:  Thu Le Kim; Vinh Le Sy
Journal:  J Mol Evol       Date:  2020-08-31       Impact factor: 2.395

Review 8.  Microbial sequence typing in the genomic era.

Authors:  Marcos Pérez-Losada; Miguel Arenas; Eduardo Castro-Nallar
Journal:  Infect Genet Evol       Date:  2017-09-21       Impact factor: 3.342

9.  Functional, Morphological, and Evolutionary Characterization of Hearing in Subterranean, Eusocial African Mole-Rats.

Authors:  Sonja J Pyott; Marcel van Tuinen; Laurel A Screven; Katrina M Schrode; Jun-Ping Bai; Catherine M Barone; Steven D Price; Anna Lysakowski; Maxwell Sanderford; Sudhir Kumar; Joseph Santos-Sacchi; Amanda M Lauer; Thomas J Park
Journal:  Curr Biol       Date:  2020-09-03       Impact factor: 10.834

10.  Biases of STRUCTURE software when exploring introduction routes of invasive species.

Authors:  Eric Lombaert; Thomas Guillemaud; Emeline Deleury
Journal:  Heredity (Edinb)       Date:  2018-01-17       Impact factor: 3.821

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.