Literature DB >> 31198636

CAM: an alignment-free method to recover phylogenies using codon aversion motifs.

Justin B Miller1, Lauren M McKinnon1, Michael F Whiting1,2, Perry G Ridge1.   

Abstract

BACKGROUND: Common phylogenomic approaches for recovering phylogenies are often time-consuming and require annotations for orthologous gene relationships that are not always available. In contrast, alignment-free phylogenomic approaches typically use structure and oligomer frequencies to calculate pairwise distances between species. We have developed an approach to quickly calculate distances between species based on codon aversion.
METHODS: Utilizing a novel alignment-free character state, we present CAM, an alignment-free approach to recover phylogenies by comparing differences in codon aversion motifs (i.e., the set of unused codons within each gene) across all genes within a species. Synonymous codon usage is non-random and differs between organisms, between genes, and even within a single gene, and many genes do not use all possible codons. We report a comprehensive analysis of codon aversion within 229,742,339 genes from 23,428 species across all kingdoms of life, and we provide an alignment-free framework for its use in a phylogenetic construct. For each species, we first construct a set of codon aversion motifs spanning all genes within that species. We define the pairwise distance between two species, A and B, as one minus the number of shared codon aversion motifs divided by the total codon aversion motifs of the species, A or B, containing the fewest motifs. This approach allows us to calculate pairwise distances even when substantial differences in the number of genes or a high rate of divergence between species exists. Finally, we use neighbor-joining to recover phylogenies.
RESULTS: Using the Open Tree of Life and NCBI Taxonomy Database as expected phylogenies, our approach compares well, recovering phylogenies that largely match expected trees and are comparable to trees recovered using maximum likelihood and other alignment-free approaches. Our technique is much faster than maximum likelihood and similar in accuracy to other alignment-free approaches. Therefore, we propose that codon aversion be considered a phylogenetically conserved character that may be used in future phylogenomic studies. AVAILABILITY: CAM, documentation, and test files are freely available on GitHub at https://github.com/ridgelab/cam.

Entities:  

Keywords:  Alignment-free; Codon aversion; Codon usage bias; Maximum likelihood; Phylogenetics; Phylogenomics; Phylogeny; Systematics; Taxonomy; Tree of life

Year:  2019        PMID: 31198636      PMCID: PMC6555396          DOI: 10.7717/peerj.6984

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


  45 in total

1.  Genomic signature: characterization and classification of species assessed by chaos game representation of sequences.

Authors:  P J Deschavanne; A Giron; J Vilain; G Fagot; B Fertil
Journal:  Mol Biol Evol       Date:  1999-10       Impact factor: 16.240

2.  Introducing RefSeq and LocusLink: curated human genome resources at the NCBI.

Authors:  K D Pruitt; K S Katz; H Sicotte; D R Maglott
Journal:  Trends Genet       Date:  2000-01       Impact factor: 11.639

Review 3.  The role of phylogenetics in comparative genetics.

Authors:  Douglas E Soltis; Pamela S Soltis
Journal:  Plant Physiol       Date:  2003-08       Impact factor: 8.340

4.  A genomic schism in birds revealed by phylogenetic analysis of DNA strings.

Authors:  Scott V Edwards; Bernard Fertil; Alain Giron; Patrick J Deschavanne
Journal:  Syst Biol       Date:  2002-08       Impact factor: 15.683

5.  General nature of the genetic code for proteins.

Authors:  F H CRICK; L BARNETT; S BRENNER; R J WATTS-TOBIN
Journal:  Nature       Date:  1961-12-30       Impact factor: 49.962

6.  The average common substring approach to phylogenomic reconstruction.

Authors:  Igor Ulitsky; David Burstein; Tamir Tuller; Benny Chor
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

7.  Phylogenomics of nonavian reptiles and the structure of the ancestral amniote genome.

Authors:  Andrew M Shedlock; Christopher W Botka; Shaying Zhao; Jyoti Shetty; Tingting Zhang; Jun S Liu; Patrick J Deschavanne; Scott V Edwards
Journal:  Proc Natl Acad Sci U S A       Date:  2007-02-16       Impact factor: 11.205

8.  Convergent host-parasite codon usage between honeybee and bee associated viral genomes.

Authors:  Panuwan Chantawannakul; Robert W Cutler
Journal:  J Invertebr Pathol       Date:  2008-03-07       Impact factor: 2.841

9.  Exploration of phylogenetic data using a global sequence analysis method.

Authors:  Charles Chapus; Christine Dufraigne; Scott Edwards; Alain Giron; Bernard Fertil; Patrick Deschavanne
Journal:  BMC Evol Biol       Date:  2005-11-09       Impact factor: 3.260

10.  Database resources of the National Center for Biotechnology Information.

Authors:  David L Wheeler; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael DiCuccio; Ron Edgar; Scott Federhen; Lewis Y Geer; Yuri Kapustin; Oleg Khovayko; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; James Ostell; Vadim Miller; Kim D Pruitt; Gregory D Schuler; Edwin Sequeira; Steven T Sherry; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Roman L Tatusov; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko
Journal:  Nucleic Acids Res       Date:  2006-12-14       Impact factor: 16.971

View more
  6 in total

1.  CUBAP: an interactive web portal for analyzing codon usage biases across populations.

Authors:  Matthew W Hodgman; Justin B Miller; Taylor E Meurs; John S K Kauwe
Journal:  Nucleic Acids Res       Date:  2020-11-04       Impact factor: 16.971

2.  Plastome evolution of Aeonium and Monanthes (Crassulaceae): insights into the variation of plastomic tRNAs, and the patterns of codon usage and aversion.

Authors:  Shiyun Han; Ran Yi; Hengwu Ding; Longhua Wu; Xianzhao Kan
Journal:  Planta       Date:  2022-07-09       Impact factor: 4.540

3.  Codon Pairs are Phylogenetically Conserved: A comprehensive analysis of codon pairing conservation across the Tree of Life.

Authors:  Justin B Miller; Lauren M McKinnon; Michael F Whiting; John S K Kauwe; Perry G Ridge
Journal:  PLoS One       Date:  2020-05-13       Impact factor: 3.240

4.  A comprehensive analysis of the phylogenetic signal in ramp sequences in 211 vertebrates.

Authors:  Lauren M McKinnon; Justin B Miller; Michael F Whiting; John S K Kauwe; Perry G Ridge
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

5.  Plastomes of Bletilla (Orchidaceae) and Phylogenetic Implications.

Authors:  Shiyun Han; Rongbin Wang; Xin Hong; Cuilian Wu; Sijia Zhang; Xianzhao Kan
Journal:  Int J Mol Sci       Date:  2022-09-05       Impact factor: 6.208

6.  Molecular epidemiology of carbapenem-resistance plasmids using publicly available sequences.

Authors:  Galen E Card; Brandon D Pickett; Perry G Ridge; Richard A Robison
Journal:  Genome       Date:  2019-09-06       Impact factor: 2.449

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.