Compositional heterogeneity and phylogenomic inference of metazoan relationships.

Maximilian P Nesnidal, Martin Helmkampf, Iris Bruchhaus, Bernhard Hausdorf.

Abstract

Compositional heterogeneity of sequences between taxa may cause systematic error in phylogenetic inference. The potential influence of such bias might be mitigated by strategies that reduce compositional heterogeneity in the data set or by phylogeny reconstruction methods that account for compositional heterogeneity. We adopted several of these strategies to analyze a large ribosomal protein data set representing all major metazoan taxa. Posterior predictive tests revealed that there is compositional bias in this data set. Only a few taxa with strongly deviating amino acid composition had to be excluded to reduce this bias. Excluding such taxa is therefore a good solution if they are not central to the phylogenetic question at hand. Deleting individual proteins from the data matrix may be an appropriate method if compositional heterogeneity among taxa is concentrated in a few proteins. However, half of the ribosomal proteins had to be excluded to reduce the compositional heterogeneity to a degree that the CAT model was no longer significantly violated. Recoding of amino acids into groups is another alternative, but it causes a loss of information and may result in poorly resolved trees, as demonstrated by the present data set. Bayesian inference with the CAT-BP model directly accounts for compositional heterogeneity between lineages by introducing breakpoints along the branches of the phylogeny at which the amino acid composition is allowed to change, but it is computationally expensive. Finally, a neighbor joining tree based on equal input distances that consider pattern and rate heterogeneity showed several unusual groupings, which are most likely artifacts, probably caused by the loss of information resulting from the transformation of the sequence data into distances.
As long as no more efficient phylogenetic inference methods are available that can directly account for compositional heterogeneity in large data sets, using methods for reducing compositional heterogeneity in the data in combination with methods that assume a stationary amino acid composition remains an option for controlling systematic errors in tree reconstruction that result from compositional bias. Our analyses indicated that the paraphyly of Deuterostomia in some analyses is the result of systematic errors that also affected the relationships of Entoprocta and Ectoprocta.
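
Two of the data-reduction strategies named in the abstract, flagging taxa with deviating amino acid composition and recoding amino acids into groups, can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The deviation score is a simple descriptive statistic (the sum of absolute differences from the mean composition across taxa), not the posterior predictive test used in the study. The six-group Dayhoff scheme is an assumption, since the abstract does not name the recoding scheme. The function names and the toy alignment are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Assumed recoding scheme: Dayhoff six groups (the abstract does not name one).
DAYHOFF6 = {
    "A": "1", "G": "1", "P": "1", "S": "1", "T": "1",
    "D": "2", "E": "2", "N": "2", "Q": "2",
    "H": "3", "K": "3", "R": "3",
    "I": "4", "L": "4", "M": "4", "V": "4",
    "F": "5", "W": "5", "Y": "5",
    "C": "6",
}


def composition(seq):
    """Relative amino acid frequencies, ignoring gaps and ambiguity codes."""
    counts = Counter(c for c in seq.upper() if c in AMINO_ACIDS)
    total = sum(counts.values())
    return {aa: (counts[aa] / total if total else 0.0) for aa in AMINO_ACIDS}


def composition_deviation(alignment):
    """Per-taxon sum of absolute differences from the mean composition.

    `alignment` maps taxon name -> aligned protein sequence.
    A descriptive score only; it carries no significance test.
    """
    comps = {taxon: composition(seq) for taxon, seq in alignment.items()}
    n = len(comps)
    mean = {aa: sum(c[aa] for c in comps.values()) / n for aa in AMINO_ACIDS}
    return {
        taxon: sum(abs(c[aa] - mean[aa]) for aa in AMINO_ACIDS)
        for taxon, c in comps.items()
    }


def recode(seq, scheme=DAYHOFF6):
    """Map each residue to its group label; anything unmapped becomes '-'."""
    return "".join(scheme.get(c, "-") for c in seq.upper())


if __name__ == "__main__":
    # Hypothetical toy alignment, for illustration only.
    toy = {"taxon_a": "AGDKIL", "taxon_b": "AGEKIV", "taxon_c": "WWWWFY"}
    scores = composition_deviation(toy)
    for taxon in sorted(scores, key=scores.get, reverse=True):
        print(taxon, round(scores[taxon], 3), recode(toy[taxon]))
```

Recoding collapses substitutions within a group, which is the source of the information loss that the abstract describes: in the toy alignment, taxon_a and taxon_b become identical once recoded.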

Year:  2010        PMID: 20382658     DOI: 10.1093/molbev/msq097

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240

