ModelOMatic: fast and automated model selection between RY, nucleotide, amino acid, and codon substitution models.

Simon Whelan, James E Allen, Benjamin P Blackburne, David Talavera.

Abstract

Molecular phylogenetics is a powerful tool for inferring both the process and pattern of evolution from genomic sequence data. Statistical approaches, such as maximum likelihood and Bayesian inference, are now established as the preferred methods of inference. The choice of models that a researcher uses for inference is of critical importance, and there are established methods for model selection conditioned on a particular type of data, such as nucleotides, amino acids, or codons. A major limitation of existing model selection approaches is that they can only compare models acting upon a single type of data. Here, we extend model selection to allow comparisons between models describing different types of data by introducing the idea of adapter functions, which project aggregated models onto the originally observed sequence data. These projections are implemented in the program ModelOMatic and used to perform model selection on 3722 families from the PANDIT database, 68 genes from an arthropod phylogenomic data set, and 248 genes from a vertebrate phylogenomic data set. For the PANDIT and arthropod data, we find that amino acid models are selected for the overwhelming majority of alignments, with progressively smaller numbers of alignments selecting codon and nucleotide models, and no families selecting RY-based models. In contrast, nearly all alignments from the vertebrate data set select codon-based models. The sequence divergence, the number of sequences, and the degree of selection acting upon the protein sequences may contribute to explaining this variation in model selection. Our ModelOMatic program is fast, with most families from PANDIT taking fewer than 150 s to complete, and should therefore be easily incorporated into existing phylogenetic pipelines. ModelOMatic is available at https://code.google.com/p/modelomatic/.
© The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
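
The idea of projecting an aggregated model onto the originally observed data can be illustrated with a small sketch. This is not ModelOMatic's code. It assumes one simple form of adapter, in which the log-likelihood of an aggregated model (here RY) is corrected by the empirical log-probability of each original state (nucleotide) given its aggregated class, so that its AIC can be compared with that of a model fitted directly to the original states. The function names, the toy alignment column, the log-likelihoods, and the parameter counts are all hypothetical.

```python
import math
from collections import Counter

def aic(log_likelihood, n_params):
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_likelihood

def adapter_log_correction(states, aggregate_of):
    """Sum over observed states of log P(original state | aggregated class).

    Probabilities are empirical frequencies within each class. This is an
    assumed, simplified adapter, not the published implementation.
    """
    state_counts = Counter(states)
    class_counts = Counter(aggregate_of[s] for s in states)
    return sum(
        n * math.log(n / class_counts[aggregate_of[s]])
        for s, n in state_counts.items()
    )

# Toy data: eight nucleotides, each making up half of its R or Y class.
RY = {"A": "R", "G": "R", "C": "Y", "T": "Y"}
nucleotides = list("AAGGCCTT")
correction = adapter_log_correction(nucleotides, RY)  # equals 8 * log(0.5)

# Hypothetical log-likelihoods and parameter counts, for illustration only.
ry_loglik, ry_params = -4.0, 1
nt_loglik, nt_params = -10.0, 3
adapter_params = 2  # assumed: one free within-class frequency per class

ry_projected_aic = aic(ry_loglik + correction, ry_params + adapter_params)
nt_aic = aic(nt_loglik, nt_params)
best = "RY" if ry_projected_aic < nt_aic else "nucleotide"
print(f"RY (projected) AIC = {ry_projected_aic:.2f}, nucleotide AIC = {nt_aic:.2f}")
print("selected:", best)
```

Without the correction term, the RY log-likelihood would describe a two-state recoding of the data and could not be compared fairly with a likelihood computed on four states.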

Keywords:  AIC; model selection; phylogenetics; substitution models

Year:  2014        PMID: 25209223     DOI: 10.1093/sysbio/syu062

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  13 in total

1.  Revised Phylogeny of the Cellulose Synthase Gene Superfamily: Insights into Cell Wall Evolution.

Authors:  Alan Little; Julian G Schwerdt; Neil J Shirley; Shi F Khor; Kylie Neumann; Lisa A O'Donovan; Jelle Lahnstein; Helen M Collins; Marilyn Henderson; Geoffrey B Fincher; Rachel A Burton
Journal:  Plant Physiol       Date:  2018-05-20       Impact factor: 8.340

2.  New data on Henneguya postexilis Minchew, 1977, a parasite of channel catfish Ictalurus punctatus, with notes on resolution of molecular markers for myxozoan phylogeny.

Authors:  Ethan T Woodyard; Thomas G Rosser; Justin M Stilwell; Alvin C Camus; Lester H Khoo; Geoffrey Waldbieser; W Walter Lorenz; Matt J Griffin
Journal:  Syst Parasitol       Date:  2022-01-14       Impact factor: 1.431

3.  Rhodopsin-bestrophin fusion proteins from unicellular algae form gigantic pentameric ion channels.

Authors:  Andrey Rozenberg; Igor Kaczmarczyk; Donna Matzov; Johannes Vierock; Takashi Nagata; Masahiro Sugiura; Kota Katayama; Yuma Kawasaki; Masae Konno; Yujiro Nagasaka; Mako Aoyama; Ishita Das; Efrat Pahima; Jonathan Church; Suliman Adam; Veniamin A Borin; Ariel Chazan; Sandra Augustin; Jonas Wietek; Julien Dine; Yoav Peleg; Akira Kawanabe; Yuichiro Fujiwara; Ofer Yizhar; Mordechai Sheves; Igor Schapiro; Yuji Furutani; Hideki Kandori; Keiichi Inoue; Peter Hegemann; Oded Béjà; Moran Shalev-Benami
Journal:  Nat Struct Mol Biol       Date:  2022-06-16       Impact factor: 18.361

4.  ModelFinder: fast model selection for accurate phylogenetic estimates.

Authors:  Subha Kalyaanamoorthy; Bui Quang Minh; Thomas K F Wong; Arndt von Haeseler; Lars S Jermiin
Journal:  Nat Methods       Date:  2017-05-08       Impact factor: 28.547

5.  The effects of repeated whole genome duplication events on the evolution of cytokinin signaling pathway.

Authors:  Elisabeth Kaltenegger; Svetlana Leng; Alexander Heyl
Journal:  BMC Evol Biol       Date:  2018-05-29       Impact factor: 3.260

6.  Unifying the global phylogeny and environmental distribution of ammonia-oxidising archaea based on amoA genes.

Authors:  Ricardo J Eloy Alves; Bui Quang Minh; Tim Urich; Arndt von Haeseler; Christa Schleper
Journal:  Nat Commun       Date:  2018-04-17       Impact factor: 14.919

7.  Phylogenomic proof of Recurrent Demipolyploidization and Evolutionary Stalling of the "Triploid Bridge" in Arundo (Poaceae).

Authors:  Wuhe Jike; Mingai Li; Nicola Zadra; Enrico Barbaro; Gaurav Sablok; Giorgio Bertorelle; Omar Rota-Stabelli; Claudio Varotto
Journal:  Int J Mol Sci       Date:  2020-07-24       Impact factor: 5.923

8.  Ungulate malaria parasites.

Authors:  Thomas J Templeton; Masahito Asada; Montakan Jiratanh; Sohta A Ishikawa; Sonthaya Tiawsirisup; Thillaiampalam Sivakumar; Boniface Namangala; Mika Takeda; Kingdao Mohkaew; Supawan Ngamjituea; Noboru Inoue; Chihiro Sugimoto; Yuji Inagaki; Yasuhiko Suzuki; Naoaki Yokoyama; Morakot Kaewthamasorn; Osamu Kaneko
Journal:  Sci Rep       Date:  2016-03-21       Impact factor: 4.379

9.  Relative Model Fit Does Not Predict Topological Accuracy in Single-Gene Protein Phylogenetics.

Authors:  Stephanie J Spielman
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

10.  Big data analysis of human mitochondrial DNA substitution models: a regression approach.

Authors:  Keren Levinstein Hallak; Shay Tzur; Saharon Rosset
Journal:  BMC Genomics       Date:  2018-10-19       Impact factor: 3.969
