Literature DB >> 19933160

MADS+: discovery of differential splicing events from Affymetrix exon junction array data.

Shihao Shen1, Claude C Warzecha, Russ P Carstens, Yi Xing.   

Abstract

MOTIVATION: The Affymetrix Human Exon Junction Array is a newly designed high-density exon-sensitive microarray for global analysis of alternative splicing. Contrary to the Affymetrix exon 1.0 array, which only contains four probes per exon and no probes for exon-exon junctions, this new junction array averages eight probes per probeset targeting all exons and exon-exon junctions observed in the human mRNA/EST transcripts, representing a significant increase in the probe density for alternative splicing events. Here, we present MADS+, a computational pipeline to detect differential splicing events from the Affymetrix exon junction array data. For each alternative splicing event, MADS+ evaluates the signals of probes targeting competing transcript isoforms to identify exons or splice sites with different levels of transcript inclusion between two sample groups. MADS+ is used routinely in our analysis of Affymetrix exon junction arrays and has a high accuracy in detecting differential splicing events. For example, in a study of the novel epithelial-specific splicing regulator ESRP1, MADS+ detects hundreds of exons whose inclusion levels are dependent on ESRP1, with a RT-PCR validation rate of 88.5% (153 validated out of 173 tested). AVAILABILITY: MADS+ scripts, documentations and annotation files are available at http://www.medicine.uiowa.edu/Labs/Xing/MADSplus/.

Entities:  

Mesh:

Year:  2009        PMID: 19933160      PMCID: PMC2804303          DOI: 10.1093/bioinformatics/btp643

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Alternative splicing of precursor mRNAs is a prevalent regulatory mechanism in higher eukaryotes. Recent data suggest that over 90% of multi-exon human genes are alternatively spliced (Blencowe et al., 2009). Use of splicing-sensitive microarrays is a popular approach for global profiling of alternative splicing events. One widely used commercial platform is the Affymetrix exon 1.0 array (Clark et al., 2007). This array allocates on average four probes per exon for all transcript-confirmed and predicted exons in a mammalian genome, allowing for concurrent profiling of gene expression levels and alternative splicing events. Since its release, a variety of algorithms for splicing analysis using the exon 1.0 array have been proposed [for a summary of these algorithms, see (Laajala et al., 2009)]. However, as a first-generation exon array platform, the design of the Affymetrix exon 1.0 array has its limitations. Most importantly, the array has only four probes per exon and no probes targeting exon–exon junctions. To address these limitations, Affymetrix recently released its second-generation exon array, the Human Exon Junction Array (HJAY), with a significantly improved probe design. This new array averages eight probes per probeset for 315 137 exons and 260 488 exon–exon junctions in the human genome, covering all alternative splicing events with mRNA/EST evidence in the UCSC/Ensembl databases (Yamamoto et al., 2009). The increased probe density, inclusion of exon–exon junction probes and comprehensive coverage of known alternative splicing events make the HJAY a powerful platform for analysis of alternative splicing. Building on our previous work on the exon 1.0 array (Xing et al., 2008), we have developed a pipeline, MADS+, for splicing analysis using the new Affymetrix exon junction array.

2 DESCRIPTIONS

MADS+ is implemented in Python/R and can be run on any Linux/Unix-like environment with installation of Python, R and required packages. An overview of the MADS+ pipeline is provided below: Preprocessing: raw junction array CEL files are processed by the ProbeEffects program (Kapur et al., 2007). A variety of background-correction and normalization methods are implemented in ProbeEffects. In particular, we employ a linear model with 80 parameters to correct for the background intensities of individual probes based on nucleotides at each position of a 25mer probe (Kapur et al., 2007). The background-corrected intensities are used for downstream analysis. Expression Index Calculation: for each gene, the background corrected and normalized intensities of its exon probes are used to derive an index for overall gene expression levels. The ProbeSelect program implements an iterative probe selection algorithm to select a subset of exon probes for each gene with highly correlated intensities across all samples (Xing et al., 2006). These probes are treated as reliable indicators of overall gene expression levels and their probe intensities are fitted to the Li-Wong model to calculate expression indexes (Li et al., 2001; Xing et al., 2006). Alternative Splicing Analysis: the alternative splicing analysis in MADS+ consists of several steps. First, for each probe in a particular sample, we calculate a ‘splicing index’, represented as the background corrected probe intensity divided by the estimated gene expression index. Two separate one-sided t-tests are performed to test if the splicing index of a probe is substantially higher in one sample group over the other group. Then, for each probeset, the P-values of individual probes are summarized using Fisher's method to obtain a probeset-level P-value for differential splicing (Xing et al., 2008). Next, MADS+ reads precompiled annotation files of alternative splicing events queried by the exon junction array. For each alternative splicing event, MADS+ evaluates the P-values of multiple probesets targeting exons and exon–exon junctions, requiring opposite trends for probesets targeting competing isoforms as evidence of differential splicing. For example, for each exon skipping event, we require that the ‘skip’ probeset and at least one of the three ‘include’ probesets (see Fig. 1A) have P-values below a predefined significance cutoff (default is 0.001), and that significant ‘include’ and ‘skip’ probesets display opposite directions of change in splicing index between the two sample groups. Finally, for candidate differential splicing events, MADS+ plots the background-corrected intensities of individual probes as well as the estimated gene expression indexes, allowing users to directly inspect the evidence for splicing differences between sample groups (see Fig. 1C for an example). It also produces annotated text output grouped according to the types of alternative splicing patterns (e.g. exon skipping, alternative splice site usage, mutually exclusive exon usage). This facilitates downstream analysis of identified differential splicing events.
Fig. 1.

Detection of a differentially spliced cassette exon using MADS+. (A) The HJAY probe design for cassette exons. Each probeset has eight probes. (B) Exon 13 of SLK is positively regulated by ESRP1. (C) The HJAY data indicate significantly elevated exon inclusion of SLK exon 13 in response to ESRP1 overexpression. In each panel, the dashed red line indicates the average gene expression levels of SLK in four replicate samples transduced with EV or ESRP1. The individual blue lines indicate the average background-corrected intensities of individual probes in a probeset. MADS+ P-value is shown below the name of each probeset. The ‘+’ and ‘−’ signs indicate the direction of change in splicing indexes from EV to ESRP1.

Detection of a differentially spliced cassette exon using MADS+. (A) The HJAY probe design for cassette exons. Each probeset has eight probes. (B) Exon 13 of SLK is positively regulated by ESRP1. (C) The HJAY data indicate significantly elevated exon inclusion of SLK exon 13 in response to ESRP1 overexpression. In each panel, the dashed red line indicates the average gene expression levels of SLK in four replicate samples transduced with EV or ESRP1. The individual blue lines indicate the average background-corrected intensities of individual probes in a probeset. MADS+ P-value is shown below the name of each probeset. The ‘+’ and ‘−’ signs indicate the direction of change in splicing indexes from EV to ESRP1.

3 EXAMPLES

The MADS+ pipeline has been used routinely in our exon-level analysis of the Affymetrix HJAY data. Figure 1 shows a candidate differential exon skipping event identified by MADS+. In this study, in order to identify splicing events regulated by the novel epithelial-specific splicing regulator ESRP1 (Warzecha et al., 2009), a cDNA for mouse ESRP1 was ectopically expressed in the MDA-MB-231 mesenchymal human breast cancer cell line by viral transduction compared with empty vector (EV) control and profiled by the Affymetrix HJAY (four biological replicates each condition). As shown in Figure 1, exon 13 of the SLK gene was positively regulated by ESRP1. Its exon inclusion level was 2% after transduction by EV and 80% after transduction by ESRP1. From the HJAY data of this exon, we found significantly elevated splicing indexes of the exon inclusion probesets (upstream junction, downstream junction and exon) and significantly reduced splicing indexes of the exon skipping probeset in the ESRP1 group as compared with the EV group (Fig. 1C), which was validated by RT-PCR (Fig. 1B). In sum, among ESRP1-dependent exon skipping events identified by MADS+, we obtained a RT-PCR validation rate of 88.5% (153 exons validated out of 173 tested so far, Warzecha,C.C. et al., unpublished data). This example demonstrates that MADS+ is a useful tool for global analysis and visualization of differential alternative splicing.
  9 in total

1.  Alternative pre-mRNA splicing switches modulate gene expression in late erythropoiesis.

Authors:  Miki L Yamamoto; Tyson A Clark; Sherry L Gee; Jeong-Ah Kang; Anthony C Schweitzer; Amittha Wickrema; John G Conboy
Journal:  Blood       Date:  2009-02-04       Impact factor: 22.113

Review 2.  Current-generation high-throughput sequencing: deepening insights into mammalian transcriptomes.

Authors:  Benjamin J Blencowe; Sidrah Ahmad; Leo J Lee
Journal:  Genes Dev       Date:  2009-06-15       Impact factor: 11.361

3.  MADS: a new and improved method for analysis of differential alternative splicing by exon-tiling microarrays.

Authors:  Yi Xing; Peter Stoilov; Karen Kapur; Areum Han; Hui Jiang; Shihao Shen; Douglas L Black; Wing Hung Wong
Journal:  RNA       Date:  2008-06-19       Impact factor: 4.942

4.  ESRP1 and ESRP2 are epithelial cell-type-specific regulators of FGFR2 splicing.

Authors:  Claude C Warzecha; Trey K Sato; Behnam Nabet; John B Hogenesch; Russ P Carstens
Journal:  Mol Cell       Date:  2009-03-13       Impact factor: 17.970

5.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection.

Authors:  C Li; W H Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2001-01-02       Impact factor: 11.205

6.  Probe selection and expression index computation of Affymetrix Exon Arrays.

Authors:  Yi Xing; Karen Kapur; Wing Hung Wong
Journal:  PLoS One       Date:  2006-12-20       Impact factor: 3.240

7.  Probe-level estimation improves the detection of differential splicing in Affymetrix exon array studies.

Authors:  Essi Laajala; Tero Aittokallio; Riitta Lahesmaa; Laura L Elo
Journal:  Genome Biol       Date:  2009-07-16       Impact factor: 13.583

8.  Discovery of tissue-specific exons using comprehensive human exon microarrays.

Authors:  Tyson A Clark; Anthony C Schweitzer; Tina X Chen; Michelle K Staples; Gang Lu; Hui Wang; Alan Williams; John E Blume
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

9.  Exon arrays provide accurate assessments of gene expression.

Authors:  Karen Kapur; Yi Xing; Zhengqing Ouyang; Wing Hung Wong
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

  9 in total
  25 in total

1.  Genome-wide determination of a broad ESRP-regulated posttranscriptional network by high-throughput sequencing.

Authors:  Kimberly A Dittmar; Peng Jiang; Juw Won Park; Karine Amirikian; Ji Wan; Shihao Shen; Yi Xing; Russell P Carstens
Journal:  Mol Cell Biol       Date:  2012-02-21       Impact factor: 4.272

2.  An ESRP-regulated splicing programme is abrogated during the epithelial-mesenchymal transition.

Authors:  Claude C Warzecha; Peng Jiang; Karine Amirikian; Kimberly A Dittmar; Hezhe Lu; Shihao Shen; Wei Guo; Yi Xing; Russ P Carstens
Journal:  EMBO J       Date:  2010-08-13       Impact factor: 11.598

Review 3.  Metastasis: tumor cells becoming MENAcing.

Authors:  Frank Gertler; John Condeelis
Journal:  Trends Cell Biol       Date:  2010-11-09       Impact factor: 20.808

4.  Exon Array Biomarkers for the Differential Diagnosis of Schizophrenia and Bipolar Disorder.

Authors:  Marquis Philip Vawter; Robert Philibert; Brandi Rollins; Patricia L Ruppel; Terry W Osborn
Journal:  Mol Neuropsychiatry       Date:  2018-04-10

5.  Genome wide array analysis indicates that an amyotrophic lateral sclerosis mutation of FUS causes an early increase of CAMK2N2 in vitro.

Authors:  Paolo Convertini; Jiayu Zhang; Pierre de la Grange; Lawrence J Hayward; Haining Zhu; Stefan Stamm
Journal:  Biochim Biophys Acta       Date:  2013-03-29

6.  The cardiotonic steroid digitoxin regulates alternative splicing through depletion of the splicing factors SRSF3 and TRA2B.

Authors:  Erik S Anderson; Chia-Ho Lin; Xinshu Xiao; Peter Stoilov; Christopher B Burge; Douglas L Black
Journal:  RNA       Date:  2012-03-28       Impact factor: 4.942

7.  Evolution of alternative splicing in primate brain transcriptomes.

Authors:  Lan Lin; Shihao Shen; Peng Jiang; Seiko Sato; Beverly L Davidson; Yi Xing
Journal:  Hum Mol Genet       Date:  2010-05-11       Impact factor: 6.150

8.  EGFR mutation-induced alternative splicing of Max contributes to growth of glycolytic tumors in brain cancer.

Authors:  Ivan Babic; Erik S Anderson; Kazuhiro Tanaka; Deliang Guo; Kenta Masui; Bing Li; Shaojun Zhu; Yuchao Gu; Genaro R Villa; David Akhavan; David Nathanson; Beatrice Gini; Sergey Mareninov; Rui Li; Carolina Espindola Camacho; Siavash K Kurdistani; Ascia Eskin; Stanley F Nelson; William H Yong; Webster K Cavenee; Timothy F Cloughesy; Heather R Christofk; Douglas L Black; Paul S Mischel
Journal:  Cell Metab       Date:  2013-05-23       Impact factor: 27.287

9.  SNORD116 and SNORD115 change expression of multiple genes and modify each other's activity.

Authors:  Marina Falaleeva; Justin Surface; Manli Shen; Pierre de la Grange; Stefan Stamm
Journal:  Gene       Date:  2015-07-26       Impact factor: 3.688

10.  Experimental and Computational Considerations in the Study of RNA-Binding Protein-RNA Interactions.

Authors:  Eric L Van Nostrand; Stephanie C Huelga; Gene W Yeo
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.