Literature DB >> 15961480

RASE: recognition of alternatively spliced exons in C.elegans.

G Rätsch1, S Sonnenburg, B Schölkopf.   

Abstract

MOTIVATION: Eukaryotic pre-mRNAs are spliced to form mature mRNA. Pre-mRNA alternative splicing greatly increases the complexity of gene expression. Estimates show that more than half of the human genes and at least one-third of the genes of less complex organisms, such as nematodes or flies, are alternatively spliced. In this work, we consider one major form of alternative splicing, namely the exclusion of exons from the transcript. It has been shown that alternatively spliced exons have certain properties that distinguish them from constitutively spliced exons. Although most recent computational studies on alternative splicing apply only to exons which are conserved among two species, our method only uses information that is available to the splicing machinery, i.e. the DNA sequence itself. We employ advanced machine learning techniques in order to answer the following two questions: (1) Is a certain exon alternatively spliced? (2) How can we identify yet unidentified exons within known introns?
RESULTS: We designed a support vector machine (SVM) kernel well suited for the task of classifying sequences with motifs having positional preferences. In order to solve the task (1), we combine the kernel with additional local sequence information, such as lengths of the exon and the flanking introns. The resulting SVM-based classifier achieves a true positive rate of 48.5% at a false positive rate of 1%. By scanning over single EST confirmed exons we identified 215 potential alternatively spliced exons. For 10 randomly selected such exons we successfully performed biological verification experiments and confirmed three novel alternatively spliced exons. To answer question (2), we additionally used SVM-based predictions to recognize acceptor and donor splice sites. Combined with the above mentioned features we were able to identify 85.2% of skipped exons within known introns at a false positive rate of 1%. AVAILABILITY: Datasets, model selection results, our predictions and additional experimental results are available at http://www.fml.tuebingen.mpg.de/~raetsch/RASE SUPPLEMENTARY INFORMATION: http://www.fml.tuebingen.mpg.de/raetsch/RASE.

Entities:  

Mesh:

Year:  2005        PMID: 15961480     DOI: 10.1093/bioinformatics/bti1053

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  40 in total

1.  mGene: accurate SVM-based gene finding with an application to nematode genomes.

Authors:  Gabriele Schweikert; Alexander Zien; Georg Zeller; Jonas Behr; Christoph Dieterich; Cheng Soon Ong; Petra Philips; Fabio De Bona; Lisa Hartmann; Anja Bohlen; Nina Krüger; Sören Sonnenburg; Gunnar Rätsch
Journal:  Genome Res       Date:  2009-06-29       Impact factor: 9.043

2.  Discriminative prediction of mammalian enhancers from DNA sequence.

Authors:  Dongwon Lee; Rachel Karchin; Michael A Beer
Journal:  Genome Res       Date:  2011-08-29       Impact factor: 9.043

3.  Prediction of alternatively spliced exons using support vector machines.

Authors:  Jing Xia; Doina Caragea; Susan J Brown
Journal:  Int J Data Min Bioinform       Date:  2010       Impact factor: 0.667

4.  Improved identification of conserved cassette exons using Bayesian networks.

Authors:  Rileen Sinha; Michael Hiller; Rainer Pudimat; Ulrike Gausmann; Matthias Platzer; Rolf Backofen
Journal:  BMC Bioinformatics       Date:  2008-11-12       Impact factor: 3.169

5.  KIRMES: kernel-based identification of regulatory modules in euchromatic sequences.

Authors:  Sebastian J Schultheiss; Wolfgang Busch; Jan U Lohmann; Oliver Kohlbacher; Gunnar Rätsch
Journal:  Bioinformatics       Date:  2009-04-23       Impact factor: 6.937

6.  Alternative splicing and the steady-state ratios of mRNA isoforms generated by it are under strong stabilizing selection in Caenorhabditis elegans.

Authors:  Sergio Barberan-Soler; Alan M Zahler
Journal:  Mol Biol Evol       Date:  2008-08-20       Impact factor: 16.240

7.  Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics.

Authors:  Nico Pfeifer; Andreas Leinenbach; Christian G Huber; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2007-11-30       Impact factor: 3.169

8.  Genome-wide chromatin remodeling identified at GC-rich long nucleosome-free regions.

Authors:  Karin Schwarzbauer; Ulrich Bodenhofer; Sepp Hochreiter
Journal:  PLoS One       Date:  2012-11-05       Impact factor: 3.240

9.  Deciphering the plant splicing code: experimental and computational approaches for predicting alternative splicing and splicing regulatory elements.

Authors:  Anireddy S N Reddy; Mark F Rogers; Dale N Richardson; Michael Hamilton; Asa Ben-Hur
Journal:  Front Plant Sci       Date:  2012-02-07       Impact factor: 5.753

10.  POIMs: positional oligomer importance matrices--understanding support vector machine-based signal detectors.

Authors:  Sören Sonnenburg; Alexander Zien; Petra Philips; Gunnar Rätsch
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.