| Literature DB >> 34097064 |
Shuangbin Xu1, Zehan Dai1, Pingfan Guo1, Xiaocong Fu1, Shanshan Liu1, Lang Zhou1, Wenli Tang1, Tingze Feng1, Meijun Chen1, Li Zhan1, Tianzhi Wu1, Erqiang Hu1, Yong Jiang2, Xiaochen Bo3, Guangchuang Yu1,2,4.
Abstract
We present the ggtreeExtra package for visualizing heterogeneous data with a phylogenetic tree in a circular or rectangular layout (https://www.bioconductor.org/packages/ggtreeExtra). The package supports more data types and visualization methods than other tools. It supports using the grammar of graphics syntax to present data on a tree with richly annotated layers and allows evolutionary statistics inferred by commonly used software to be integrated and visualized with external data. GgtreeExtra is a universal tool for tree data visualization. It extends the applications of the phylogenetic tree in different disciplines by making more domain-specific data to be available to visualize and interpret in the evolutionary context.Entities:
Keywords: data integration; data visualization; phylogeny; software
Mesh:
Year: 2021 PMID: 34097064 PMCID: PMC8382893 DOI: 10.1093/molbev/msab166
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.The design and features of the ggtreeExtra package. (A) The overall design of the ggtreeExtra package; (B) comparison of visualization methods for tree annotation (i.e., tree and data graphic alignment) supported by ggtreeExtra and other tools; (C) visualizing associated data (e.g., distribution of species abundance as boxplot) with a phylogenetic tree side by side or on the external ring (inset on the left); (D) using subplots and images as insets on a phylogenetic tree to present taxon-specific structural feature and summary statistics; (E) illustration of representing multidimensional data sets on an inward circular phylogenetic tree with chord diagram incorporated to display inter-relationships. The ggtreeExtra package supports both rectangular and circular layouts and allows transformation between different layouts (C). Multiple data sets can be integrated and a variable can be mapped to visual characteristics to visualize another type of data (CDE), such as using taxon information to color silhouette images (D).