Literature DB >> 29309527

myTAI: evolutionary transcriptomics with R.

Hajk-Georg Drost^1,2, Alexander Gabel², Jialin Liu³, Marcel Quint⁴, Ivo Grosse^2,5.

Abstract

Motivation: Next Generation Sequencing (NGS) technologies generate a large amount of high quality transcriptome datasets enabling the investigation of molecular processes on a genomic and metagenomic scale. These transcriptomics studies aim to quantify and compare the molecular phenotypes of the biological processes at hand. Despite the vast increase of available transcriptome datasets, little is known about the evolutionary conservation of those characterized transcriptomes.
Results: The myTAI package implements exploratory analysis functions to infer transcriptome conservation patterns in any transcriptome dataset. Comprehensive documentation of myTAI functions and tutorial vignettes provide step-by-step instructions on how to use the package in an exploratory and computationally reproducible manner. Availability and implementation: The open source myTAI package is available at https://github.com/HajkD/myTAI and https://cran.r-project.org/web/packages/myTAI/index.html. Contact: hgd23@cam.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities: Species

Mesh：

Year: 2018 PMID： 29309527 PMCID： PMC5925770 DOI： 10.1093/bioinformatics/btx835

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 Introduction

To investigate phenotypic changes, diseases, environmental stresses, or developmental processes, transcriptome studies are often the approach of choice. Although transcriptomics is based on a solid methodology, little is known about the evolutionary conservation and dynamics of transcriptomes across species (Drost ). Understanding the evolutionary processes that change transcriptomes over time, however, might lead to new insights on how diseases emerge or how phenotypic changes are caused by changes in transcriptomes. For this purpose, evolutionary transcriptomics studies aim to capture and quantify the evolutionary conservation of transcriptomes during specific stages of the biological process of interest (Domazet-Lošo and Tautz, 2010). Here, we present the exploratory analysis package myTAI, which can combine evolutionary information of genes with their transcript levels to infer transcriptome conservation patterns. Evolutionary information is given as input to the package and can range from classical phylogenetic or orthology relationships between genes to more recent approaches such as phylogenetic comparative methods (PCMs) (Dunn ), phylogenetic reconciliation methods (Doyon ), or phylostratigraphy (Domazet-Lošo ). In summary, starting with a pre-computed table of gene age information and a transcriptome dataset, the R package myTAI can be used to screen for stages of high or low transcriptome conservation within a biological process of interest. If highly conserved or variable transcriptomes were found in particular stages or treatments, more specialized experimental studies could subsequently be designed to investigate the functions and mechanistic implications of these conserved or variable stages.

2 Implementation

The R package myTAI is released under the GNU General Public License within the CRAN project (R Core Team). The package can be downloaded from https://cran.r-project.org/web/packages/myTAI/index.html. The source code is publicly available at https://github.com/HajkD/myTAI. Internal myTAI functions are implemented in C++ and integrated via the Rcpp (Eddelbuettel, 2013) Application Programming Interface (API) and unit tested using testthat. The myTAI package furthermore depends on the R packages nortest, fitdistrplus (Delignette-Muller and Dutang, 2015), dplyr, RColorBrewer, taxize (Chamberlain and Szöcs, 2013), reshape2, ggplot2 (Wickham, 2009), biomartr (Drost and Paszkowski, 2017), readr, tibble, scales and gridExtra.

3 Functions and Examples

More than fifty functions are provided by the myTAI package. We recently used myTAI to investigate the developmental hourglass model of embryo development (Raff, 1996) on the transcriptomic level (Quint ; Drost ). Others used myTAI to investigate transcriptome conservation in plant organ development (Lei ). To illustrate an example workflow with myTAI, we here use the developmental transcriptome of Arabidopsis thaliana embryo development (Quint ) (see Supplementary Material for more details about data formats): # Import the myTAI package and load example dataset library(myTAI); data(PhyloExpressionSetExample) One metric to quantify transcriptome conservation on a global scale is the Transcriptome Age Index (TAI) (Domazet-Lošo and Tautz, 2010), which denotes the average transcriptome age throughout the biological process of interest. # Plot the Transcriptome Age Index of A. thaliana embryo development PlotSignature (PhyloExpressionSetExample) To quantify the transcript level of each gene age category to the overall transcriptome for each developmental stage, the gene expression level distributions for each gene age category can be visualized by: # Plot gene expression level distributions PlotCategoryExpr (PhyloExpressionSetExample, legendName = “PS”, log.expr = TRUE) A linear transformation of the mean expression levels into the interval [0, 1] enables the comparison of mean expression level patterns between gene age categories independent of their actual mean expression magnitude. A relative expression level of 0 denotes the minimum mean expression level compared to all other stages, and a relative expression level of 1 denotes the maximum mean expression level compared to all other stages: # Plot relative expression levels PlotRE (PhyloExpressionSetExample, Groups = list (c(1: 3), c(4: 12)), legendName = “PS”, adjust.range = TRUE) Finally, we compare relative expression levels between groups of age categories and quantify their difference: # Compare relative expression levels between groups of age categories PlotBarRE (PhyloExpressionSetExample, Groups = list(group_1 = 1: 3, group_2 = 4: 12), xlab = “Ontogeny”, ylab = “Mean Relative Expression”, cex = 1.5) In addition to these exploratory functions, myTAI provides functionality for taxonomic information retrieval, gene age enrichment analyses, differential gene expression analyses of age categories, and additional metrics for quantifying trancriptome conservation. A detailed description and interpretation of myTAI functions is available at https://github.com/HajkD/myTAI#tutorials and also in the Supplementary Material.

4 Conclusions

Evolutionary transcriptomics studies can serve as a first approach to screen in silico for the potential existence of evolutionary constraints within a biological process of interest. This is achieved by quantifying transcriptome conservation patterns and their underlying gene sets in biological processes. The exploratory analysis functions implemented in myTAI provide users with a standardized, automated and optimized framework to investigate evolutionary signatures in any transcriptome dataset of interest.

Funding

We thank ERC grant EVOBREED [grant number 322621], the SKWP Research Foundation, and the German Science Foundation (grants Qu 141/5-1, Qu 141/6-1, Qu 141/7-1, GR 3526/6-1, GR 3526/7-1, GR 3526/8-1, and FZT 118) for financial support. Conflict of Interest: none declared. Click here for additional data file.

9 in total

Review 1. Models, algorithms and programs for phylogeny reconciliation.

Authors: Jean-Philippe Doyon; Vincent Ranwez; Vincent Daubin; Vincent Berry
Journal: Brief Bioinform Date: 2011-09 Impact factor: 11.622

2. A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns.

Authors: Tomislav Domazet-Lošo; Diethard Tautz
Journal: Nature Date: 2010-12-09 Impact factor: 49.962

3. A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages.

Authors: Tomislav Domazet-Loso; Josip Brajković; Diethard Tautz
Journal: Trends Genet Date: 2007-11 Impact factor: 11.639

4. Phylogenetic analysis of gene expression.

Authors: Casey W Dunn; Xi Luo; Zhijin Wu
Journal: Integr Comp Biol Date: 2013-06-07 Impact factor: 3.326

Review 5. Cross-kingdom comparison of the developmental hourglass.

Authors: Hajk-Georg Drost; Philipp Janitza; Ivo Grosse; Marcel Quint
Journal: Curr Opin Genet Dev Date: 2017-03-24 Impact factor: 5.578

6. A transcriptomic hourglass in plant embryogenesis.

Authors: Marcel Quint; Hajk-Georg Drost; Alexander Gabel; Kristian Karsten Ullrich; Markus Bönn; Ivo Grosse
Journal: Nature Date: 2012-09-05 Impact factor: 49.962

7. taxize: taxonomic search and retrieval in R.

Authors: Scott A Chamberlain; Eduard Szöcs
Journal: F1000Res Date: 2013-09-18

8. Plant organ evolution revealed by phylotranscriptomics in Arabidopsis thaliana.

Authors: Li Lei; Joshua G Steffen; Edward J Osborne; Christopher Toomajian
Journal: Sci Rep Date: 2017-08-08 Impact factor: 4.379

9. Biomartr: genomic data retrieval with R.

Authors: Hajk-Georg Drost; Jerzy Paszkowski
Journal: Bioinformatics Date: 2017-04-15 Impact factor: 6.937

9 in total

1. Pervasive convergent evolution and extreme phenotypes define chaperone requirements of protein homeostasis.

Authors: Yasmine Draceni; Sebastian Pechmann
Journal: Proc Natl Acad Sci U S A Date: 2019-09-16 Impact factor: 11.205

2. Oxytocin receptor expression patterns in the human brain across development.

Authors: Jaroslav Rokicki; Tobias Kaufmann; Ann-Marie G de Lange; Dennis van der Meer; Shahram Bahrami; Alina M Sartorius; Unn K Haukvik; Nils Eiel Steen; Emanuel Schwarz; Dan J Stein; Terje Nærland; Ole A Andreassen; Lars T Westlye; Daniel S Quintana
Journal: Neuropsychopharmacology Date: 2022-03-28 Impact factor: 8.294

3. Developmental Constraints on Genome Evolution in Four Bilaterian Model Species.

Authors: Jialin Liu; Marc Robinson-Rechavi
Journal: Genome Biol Evol Date: 2018-09-01 Impact factor: 3.416

4. Elucidating the endogenous synovial fluid proteome and peptidome of inflammatory arthritis using label-free mass spectrometry.

Authors: Shalini M Mahendran; Edward C Keystone; Roman J Krawetz; Kun Liang; Eleftherios P Diamandis; Vinod Chandran
Journal: Clin Proteomics Date: 2019-05-30 Impact factor: 3.988

5. Gene expression variation in the brains of harvester ant foragers is associated with collective behavior.

Authors: Daniel Ari Friedman; Ryan Alexander York; Austin Travis Hilliard; Deborah M Gordon
Journal: Commun Biol Date: 2020-03-05

6. Relative qPCR to quantify colonization of plant roots by arbuscular mycorrhizal fungi.

Authors: Natacha Bodenhausen; Gabriel Deslandes-Hérold; Jan Waelchli; Alain Held; Marcel G A van der Heijden; Klaus Schlaeppi
Journal: Mycorrhiza Date: 2021-01-21 Impact factor: 3.387

7. A comparative transcriptional landscape of maize and sorghum obtained by single-molecule sequencing.

Authors: Bo Wang; Michael Regulski; Elizabeth Tseng; Andrew Olson; Sara Goodwin; W Richard McCombie; Doreen Ware
Journal: Genome Res Date: 2018-04-30 Impact factor: 9.043

8. Embryo-Like Features in Developing Bacillus subtilis Biofilms.

Authors: Momir Futo; Luka Opašić; Sara Koska; Nina Čorak; Tin Široki; Vaishnavi Ravikumar; Annika Thorsell; Maša Lenuzzi; Domagoj Kifer; Mirjana Domazet-Lošo; Kristian Vlahoviček; Ivan Mijakovic; Tomislav Domazet-Lošo
Journal: Mol Biol Evol Date: 2021-01-04 Impact factor: 16.240

9. New Genes Interacted With Recent Whole-Genome Duplicates in the Fast Stem Growth of Bamboos.

Authors: Guihua Jin; Peng-Fei Ma; Xiaopei Wu; Lianfeng Gu; Manyuan Long; Chengjun Zhang; De-Zhu Li
Journal: Mol Biol Evol Date: 2021-12-09 Impact factor: 16.240

9 in total