Literature DB >> 29028262

TSIS: an R package to infer alternative splicing isoform switches for time-series data.

Wenbin Guo1,2, Cristiane P G Calixto2, John W S Brown2,3, Runxuan Zhang1.   

Abstract

SUMMARY: An alternative splicing isoform switch is where a pair of transcript isoforms reverse their relative expression abundances in response to external or internal stimuli. Although computational methods are available to study differential alternative splicing, few tools for detection of isoform switches exist and these are based on pairwise comparisons. Here, we provide the TSIS R package, which is the first tool for detecting significant transcript isoform switches in time-series data. The main steps of TSIS are to search for the isoform switch points in the time-series, characterize the switches and filter the results with user input parameters. All the functions are integrated into a Shiny App for ease of implementation of the analysis.
AVAILABILITY AND IMPLEMENTATION: The TSIS package is available on GitHub: https://github.com/wyguo/TSIS. CONTACT: runxuan.zhang@hutton.ac.uk.
© The Author 2017. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29028262      PMCID: PMC5860037          DOI: 10.1093/bioinformatics/btx411

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Regulation of gene expression by alternative splicing (AS) generates changes in abundance of different transcript isoforms. One particular splicing phenotype is isoform switching where the relative abundance of different isoforms of the same gene is reversed in different cell types or in response to stimuli. Isoform switches often play pivotal roles in re-programming of gene expression and isoform switches of functionally different transcript isoforms between normal and tumor tissues provide signatures for cancer diagnostics and prognostics (Sebestyen ). There are limited tools designed for inference of isoform switches and currently there is no software available for detecting alternative splicing isoform switches for time-series data. Isoform switch detection tools, such as iso-kTSP (Sebestyen ), spliceR (Vitting-Seerup ) and SwitchSeq (Gonzàlez-Porta and Brazma, 2014), only perform pairwise comparisons (Fig. 1a). Time-series RNA-seq data greatly enhances the resolution of changes in expression and AS during development or in responses to external or internal cues. Identification of isoform switches in time-series data presents specific challenges in that (i) switch points can happen between any time-points, and (ii) the isoform pairs may undergo a number of switches during the time course (Fig. 1b). To detect and characterize temporal and complex isoform switches, we developed the time-series isoform switch (TSIS) R package, which incorporates score schemes from current methods and includes a number of new metrics which capture the characteristics of the isoform switches.
Fig. 1

Analyzes of isoform switches. In (a) and (b), expression data with three replicates for each condition/time-point is simulated for isoforms and . The points in the plots represent the samples and the black lines connect the average of samples. (a) A scheme plot for iso-kTSP that shows an isoform switch between two conditions and . (b) A scheme plot for TSIS where two isoforms show three switches at different time-points. In (c) and (d), TSIS-generated output files are shown for real time-course RNA-seq data. (c) Histogram of isoform switches identified in 30 different genes. (d) Example of two transcript isoforms from gene G30 showing multiple switches, where user input parameter on the region for investigation has been labeled. TPM, transcripts per million

Analyzes of isoform switches. In (a) and (b), expression data with three replicates for each condition/time-point is simulated for isoforms and . The points in the plots represent the samples and the black lines connect the average of samples. (a) A scheme plot for iso-kTSP that shows an isoform switch between two conditions and . (b) A scheme plot for TSIS where two isoforms show three switches at different time-points. In (c) and (d), TSIS-generated output files are shown for real time-course RNA-seq data. (c) Histogram of isoform switches identified in 30 different genes. (d) Example of two transcript isoforms from gene G30 showing multiple switches, where user input parameter on the region for investigation has been labeled. TPM, transcripts per million

2 Methods and application

TSIS detects pairs of AS transcripts with one or more isoform switches and genes with multiple pairs of transcripts which show isoform switches. By defining five metrics of the isoform switch, the method comprehensively captures and describes the isoform switches occurring at different points in time-series data. TSIS analysis can be carried out using command lines as well as through a graphic interface using a Shiny App (https://CRAN.R-project.org/package=shiny) where the analysis can be implemented easily.

2.1 Determine the switch points

We have offered two approaches to search for the switch points in TSIS. The first approach takes the average expression values of the replicates for each time-point for each isoform and searches for the cross points. The second approach uses natural spline curves to fit the time-series data for each transcript isoform using the R package ‘splines’ (version 3.3.2) and finds cross points of the fitted curves for each pair of isoforms. The spline method is useful to find global trends of time-series data when the data is noisy. However, it may lack details of isoform switches in the local region. It is recommended that users use both average and spline methods to search for the switch points and examine manually when inconsistent results were produced by the above two methods.

2.2 Define the switch metrics

The intersection points determined in Section 2.1 divide the time-series frame into intervals and each switch point is flanked by an interval before the switch and after the switch (Fig. 1b). We define the switch of two isoforms and by (i) the switch point , (ii) time-points between switch points and as interval before switch and (iii) time-points between switch points and as interval after the switch (Fig. 1b). Each isoform switch is described by five metrics. Metric 1: represents the probability of the abundance switch and is calculated as the sum of the frequencies of two possible scenarios that one isoform is more or less abundant than the other in the two intervals adjacent to a switch point, as used in iso-kTSP (Sebestyen ). Where and are the frequencies/probabilities that the samples of one isoform is greater or less than in the other in corresponding intervals. Metric 2: is the sum of average abundance differences of the two isoforms in both intervals. Where is the average difference of abundances between and in interval defined as is the number of samples in interval and is the expression of of sample in interval . Metric 2 indicates the magnitude of the switch. Higher values mean larger changes in abundances before and after the switch. Metric 3 measures the significance of the differences between the isoform abundances before and after the switch using paired t-tests to generate P-values for each interval. Metric 4 is a measure of whether the effect of the switch is transient or long lived (reflecting the number of time-points in the flanking intervals). Metric 5: Isoforms with high negative correlations across the time-points may identify important regulation in alternative splicing. Thus we also calculated the Pearson correlation of two isoforms across the whole time-series.

2.3 Filter and visualize the results

TSIS provides histograms that show the number of switches happening at each time-point as well as interactive visualizations of the isoform switch profiles (Fig. 1c, d). TSIS also allows regions of interest to be defined (Fig. 1d) or switches involving the most abundant isoforms or any predefined list of isoforms to be selected as outputs. Known IS in Arabidopsis circadian clock genes AT1G01060 (G2), AT5G37260 (G29) and AT3G09600 (G12) (Fig. 1c) (Filichkin ; James , 2012b) were successfully detected by TSIS. The example dataset (used in Fig. 1c, d) and details to run the tool are shown in the user manual on the Github page.
  5 in total

1.  Environmental stresses modulate abundance and timing of alternatively spliced circadian transcripts in Arabidopsis.

Authors:  Sergei A Filichkin; Jason S Cumbie; Palitha Dharmawardhana; Pankaj Jaiswal; Jeff H Chang; Saiprasad G Palusa; A S N Reddy; Molly Megraw; Todd C Mockler
Journal:  Mol Plant       Date:  2015-01-08       Impact factor: 13.164

2.  Detection of recurrent alternative splicing switches in tumor samples reveals novel signatures of cancer.

Authors:  Endre Sebestyén; Michał Zawisza; Eduardo Eyras
Journal:  Nucleic Acids Res       Date:  2015-01-10       Impact factor: 16.971

3.  Thermoplasticity in the plant circadian clock: how plants tell the time-perature.

Authors:  Allan B James; Naeem Hasan Syed; John W S Brown; Hugh G Nimmo
Journal:  Plant Signal Behav       Date:  2012-08-20

4.  Alternative splicing mediates responses of the Arabidopsis circadian clock to temperature changes.

Authors:  Allan B James; Naeem Hasan Syed; Simon Bordage; Jacqueline Marshall; Gillian A Nimmo; Gareth I Jenkins; Pawel Herzyk; John W S Brown; Hugh G Nimmo
Journal:  Plant Cell       Date:  2012-03-09       Impact factor: 11.277

5.  spliceR: an R package for classification of alternative splicing and prediction of coding potential from RNA-seq data.

Authors:  Kristoffer Vitting-Seerup; Bo Torben Porse; Albin Sandelin; Johannes Waage
Journal:  BMC Bioinformatics       Date:  2014-03-23       Impact factor: 3.169

  5 in total
  11 in total

1.  A dynamic intron retention program regulates the expression of several hundred genes during pollen meiosis.

Authors:  Agnieszka A Golicz; Annapurna D Allu; Wei Li; Neeta Lohani; Mohan B Singh; Prem L Bhalla
Journal:  Plant Reprod       Date:  2021-05-21       Impact factor: 3.767

2.  A high-resolution single-molecule sequencing-based Arabidopsis transcriptome using novel methods of Iso-seq analysis.

Authors:  Runxuan Zhang; Richard Kuo; Max Coulter; Cristiane P G Calixto; Juan Carlos Entizne; Wenbin Guo; Yamile Marquez; Linda Milne; Stefan Riegler; Akihiro Matsui; Maho Tanaka; Sarah Harvey; Yubang Gao; Theresa Wießner-Kroh; Alejandro Paniagua; Martin Crespi; Katherine Denby; Asa Ben Hur; Enamul Huq; Michael Jantsch; Artur Jarmolowski; Tino Koester; Sascha Laubinger; Qingshun Quinn Li; Lianfeng Gu; Motoaki Seki; Dorothee Staiger; Ramanjulu Sunkar; Zofia Szweykowska-Kulinska; Shih-Long Tu; Andreas Wachter; Robbie Waugh; Liming Xiong; Xiao-Ning Zhang; Ana Conesa; Anireddy S N Reddy; Andrea Barta; Maria Kalyna; John W S Brown
Journal:  Genome Biol       Date:  2022-07-07       Impact factor: 17.906

3.  Rapid and Dynamic Alternative Splicing Impacts the Arabidopsis Cold Response Transcriptome.

Authors:  Cristiane P G Calixto; Wenbin Guo; Allan B James; Nikoleta A Tzioutziou; Juan Carlos Entizne; Paige E Panter; Heather Knight; Hugh G Nimmo; Runxuan Zhang; John W S Brown
Journal:  Plant Cell       Date:  2018-05-15       Impact factor: 11.277

4.  Cold-Dependent Expression and Alternative Splicing of Arabidopsis Long Non-coding RNAs.

Authors:  Cristiane P G Calixto; Nikoleta A Tzioutziou; Allan B James; Csaba Hornyik; Wenbin Guo; Runxuan Zhang; Hugh G Nimmo; John W S Brown
Journal:  Front Plant Sci       Date:  2019-02-28       Impact factor: 5.753

5.  Temporal Splicing Switches in Elements of the TNF-Pathway Identified by Computational Analysis of Transcriptome Data for Human Cell Lines.

Authors:  Nikolai Genov; Alireza Basti; Mónica Abreu; Angela Relógio
Journal:  Int J Mol Sci       Date:  2019-03-08       Impact factor: 5.923

6.  Hybrid sequencing reveals insight into heat sensing and signaling of bread wheat.

Authors:  Xiaoming Wang; Siyuan Chen; Xue Shi; Danni Liu; Peng Zhao; Yunze Lu; Yanbing Cheng; Zhenshan Liu; Xiaojun Nie; Weining Song; Qixin Sun; Shengbao Xu; Chuang Ma
Journal:  Plant J       Date:  2019-04-23       Impact factor: 6.417

7.  Integrated multi-omics framework of the plant response to jasmonic acid.

Authors:  Mark Zander; Mathew G Lewsey; Natalie M Clark; Lingling Yin; Anna Bartlett; J Paola Saldierna Guzmán; Elizabeth Hann; Amber E Langford; Bruce Jow; Aaron Wise; Joseph R Nery; Huaming Chen; Ziv Bar-Joseph; Justin W Walley; Roberto Solano; Joseph R Ecker
Journal:  Nat Plants       Date:  2020-03-13       Impact factor: 15.793

8.  Rice Transcriptome Analysis Reveals Nitrogen Starvation Modulates Differential Alternative Splicing and Transcript Usage in Various Metabolism-Related Genes.

Authors:  Saurabh Chaudhary; Meenu Kalkal
Journal:  Life (Basel)       Date:  2021-03-27

9.  3D RNA-seq: a powerful and flexible tool for rapid and accurate differential expression and alternative splicing analysis of RNA-seq data for biologists.

Authors:  Wenbin Guo; Nikoleta A Tzioutziou; Gordon Stephen; Iain Milne; Cristiane Pg Calixto; Robbie Waugh; John W S Brown; Runxuan Zhang
Journal:  RNA Biol       Date:  2020-12-19       Impact factor: 4.652

10.  Deriving Ranges of Optimal Estimated Transcript Expression due to Nonidentifiability.

Authors:  Hongyu Zheng; Cong Ma; Carl Kingsford
Journal:  J Comput Biol       Date:  2022-01-17       Impact factor: 1.549

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.