Literature DB >> 33560364

PhyKIT: a broadly applicable UNIX shell toolkit for processing and analyzing phylogenomic data.

Jacob L Steenwyk1, Thomas J Buida2, Abigail L Labella1, Yuanning Li1, Xing-Xing Shen3, Antonis Rokas1.   

Abstract

MOTIVATION: Diverse disciplines in biology process and analyze multiple sequence alignments (MSAs) and phylogenetic trees to evaluate their information content, infer evolutionary events and processes, and predict gene function. However, automated processing of MSAs and trees remains a challenge due to the lack of a unified toolkit. To fill this gap, we introduce PhyKIT, a toolkit for the UNIX shell environment with 30 functions that process MSAs and trees, including but not limited to estimation of mutation rate, evaluation of sequence composition biases, calculation of the degree of violation of a molecular clock, and collapsing bipartitions (internal branches) with low support.
RESULTS: To demonstrate the utility of PhyKIT, we detail three use cases: (1) summarizing information content in MSAs and phylogenetic trees for diagnosing potential biases in sequence or tree data; (2) evaluating gene-gene covariation of evolutionary rates to identify functional relationships, including novel ones, among genes; and (3) identify lack of resolution events or polytomies in phylogenetic trees, which are suggestive of rapid radiation events or lack of data. We anticipate PhyKIT will be useful for processing, examining, and deriving biological meaning from increasingly large phylogenomic datasets. AVAILABILITY: PhyKIT is freely available on GitHub (https://github.com/JLSteenwyk/PhyKIT), PyPi (https://pypi.org/project/phykit/), and the Anaconda Cloud (https://anaconda.org/JLSteenwyk/phykit) under the MIT license with extensive documentation and user tutorials (https://jlsteenwyk.com/PhyKIT). SUPPLEMENTARY INFORMATION: Supplementary data are available on figshare (doi: 10.6084/m9.figshare.13118600) and are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 33560364      PMCID: PMC8388027          DOI: 10.1093/bioinformatics/btab096

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  62 in total

1.  Testing hybridization hypotheses based on incongruent gene trees.

Authors:  T Sang; Y Zhong
Journal:  Syst Biol       Date:  2000-09       Impact factor: 15.683

2.  ERC analysis: web-based inference of gene function via evolutionary rate covariation.

Authors:  Nicholas W Wolfe; Nathan L Clark
Journal:  Bioinformatics       Date:  2015-08-04       Impact factor: 6.937

3.  Roles of the RAM signaling network in cell cycle progression in Saccharomyces cerevisiae.

Authors:  Lydia M Bogomolnaya; Ritu Pathak; Jinbai Guo; Michael Polymenis
Journal:  Curr Genet       Date:  2006-03-22       Impact factor: 3.886

4.  DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets.

Authors:  Julio Rozas; Albert Ferrer-Mata; Juan Carlos Sánchez-DelBarrio; Sara Guirao-Rico; Pablo Librado; Sebastián E Ramos-Onsins; Alejandro Sánchez-Gracia
Journal:  Mol Biol Evol       Date:  2017-12-01       Impact factor: 16.240

5.  Efficient comparative phylogenetics on large trees.

Authors:  Stilianos Louca; Michael Doebeli
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

Review 6.  Phylogenetic tree building in the genomic age.

Authors:  Paschalia Kapli; Ziheng Yang; Maximilian J Telford
Journal:  Nat Rev Genet       Date:  2020-05-18       Impact factor: 53.242

7.  Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Authors:  Peter J A Cock; Tiago Antao; Jeffrey T Chang; Brad A Chapman; Cymon J Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartek Wilczynski; Michiel J L de Hoon
Journal:  Bioinformatics       Date:  2009-03-20       Impact factor: 6.937

8.  The root of the mammalian tree inferred from whole mitochondrial genomes.

Authors:  Matthew J Phillips; David Penny
Journal:  Mol Phylogenet Evol       Date:  2003-08       Impact factor: 4.286

9.  Phylogenomic Resolution of the Phylogeny of Laurasiatherian Mammals: Exploring Phylogenetic Signals within Coding and Noncoding Sequences.

Authors:  Meng-Yun Chen; Dan Liang; Peng Zhang
Journal:  Genome Biol Evol       Date:  2017-08-01       Impact factor: 3.416

10.  One thousand plant transcriptomes and the phylogenomics of green plants.

Authors: 
Journal:  Nature       Date:  2019-10-23       Impact factor: 49.962

View more
  4 in total

1.  BioKIT: a versatile toolkit for processing and analyzing diverse types of sequence data.

Authors:  Jacob L Steenwyk; Thomas J Buida; Carla Gonçalves; Dayna C Goltz; Grace Morales; Matthew E Mead; Abigail L LaBella; Christina M Chavez; Jonathan E Schmitz; Maria Hadjifrangiskou; Yuanning Li; Antonis Rokas
Journal:  Genetics       Date:  2022-07-04       Impact factor: 4.402

2.  An evolutionary genomic approach reveals both conserved and species-specific genetic elements related to human disease in closely related Aspergillus fungi.

Authors:  Matthew E Mead; Jacob L Steenwyk; Lilian P Silva; Patrícia A de Castro; Nauman Saeed; Falk Hillmann; Gustavo H Goldman; Antonis Rokas
Journal:  Genetics       Date:  2021-06-24       Impact factor: 4.562

3.  Examination of Gene Loss in the DNA Mismatch Repair Pathway and Its Mutational Consequences in a Fungal Phylum.

Authors:  Megan A Phillips; Jacob L Steenwyk; Xing-Xing Shen; Antonis Rokas
Journal:  Genome Biol Evol       Date:  2021-10-01       Impact factor: 3.416

4.  Examination of Genome-Wide Ortholog Variation in Clinical and Environmental Isolates of the Fungal Pathogen Aspergillus fumigatus.

Authors:  Maria Augusta C Horta; Jacob L Steenwyk; Matthew E Mead; Luciano H Braz Dos Santos; Shu Zhao; John G Gibbons; Marina Marcet-Houben; Toni Gabaldón; Antonis Rokas; Gustavo H Goldman
Journal:  mBio       Date:  2022-06-29       Impact factor: 7.786

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.