| Literature DB >> 31609424 |
Jingwen Yang1,2, Hang Ruan3, Wenjie Xu1, Xun Gu4.
Abstract
Recent innovations of next-generation sequencing such as RNA-seq have generated an enormous amount of comparative transcriptome data, which have shed lights on our understanding of the complexity of transcriptional regulatory systems. Despite numerous RNA-seq analyses, statistical methods and computational tools designed for phylogenetic transcriptome analysis and evolution have not been well developed. In response to this need, we developed software TreeExp2 specifically for RNA-seq data. The R-package TreeExp2 has implemented a suite of advanced, recently developed methods for transcriptome evolutionary analysis. Its main functions include the ancestral transcriptome inference, estimation of the strength of expression conservation, new expression distance, and the relative expression rate test. TreeExp2 provides an integrated, statistically sound framework for phylogenetic transcriptome analysis. It will considerably enhance our analytical capability for exploring the evolution and selection at the transcriptome level. The current version of TreeExp2 is available under GPLv3 license at the Github developer site https://github.com/jingwyang/TreeExp; last accessed November 12, 2019, and its online tutorial which describes the biological theories in details and fully worked case studies with real data can be found at https://jingwyang.github.io/TreeExp-Tutorial; last accessed November 12, 2019.Entities:
Keywords: R-package; high-throughput analysis; phylogenomics; transcriptome evolution
Mesh:
Year: 2019 PMID: 31609424 PMCID: PMC6934891 DOI: 10.1093/gbe/evz222
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—RNA-seq data from multiple species and tissues, illustrated by expression levels = (x1, x2, …, x) of an orthologous gene over n species. TreeExp2 can perform the following analyses. 1) Infer the ancestral expression state (node y in red brown as example) of a gene in a tissue, which is a (phylogeny-dependent linear) combination of . 2) Estimate the strength of expression conservation (W) for any gene when is given in a tissue. 3) Calculate expression distance that is linear to the evolutionary time. And 4) detect lineage-specific fast-evolving expression divergence in species A or B (yellow branches) using species C (purple branch) as outgroup.
. 2.—The evolutionary phylogeny for comparative transcriptome analysis (A) under Ornstein–Uhlenbeck (OU) model. (B) Phylogeny when considering the origin of the tissue (node Z) to the root (node O) of the species tree. When the tissue origin is so ancient that τ → ∞, it is called the stationary OU model along the species phylogeny.
Expression Distance Estimates between Human and Macaque (t = 29 Ma)
| Tissues | Brain | Cerebellum | Liver | Kidney | Heart | Testis |
|---|---|---|---|---|---|---|
|
| 0.901 | 0.893 | 0.896 | 0.876 | 0.708 | 0.744 |
| Pearson expression distance | ||||||
| Expression distance | 0.089 | 0.107 | 0.104 | 0.124 | 0.292 | 0.256 |
| Rate of transcriptome evolution | 1.53 | 1.84 | 1.79 | 2.14 | 5.03 | 4.41 |
| Constant- | ||||||
| Expression distance | 0.104 | 0.113 | 0.110 | 0.132 | 0.345 | 0.296 |
| Rate of transcriptome evolution | 1.79 | 1.95 | 1.90 | 2.28 | 5.95 | 5.10 |
| Variable-μ expression distance, | ||||||
| Estimated | 0.354 | 0.384 | 0.401 | 0.377 | 0.327 | 0.392 |
| Expression distance | 0.148 | 0.191 | 0.191 | 0.222 | 0.569 | 0.547 |
| Rate of transcriptome evolution | 2.41 | 3.29 | 3.29 | 3.83 | 9.81 | 9.43 |