Literature DB >> 10842737

Modeling splice sites with Bayes networks.

D Cai1, A Delcher, B Kao, S Kasif.   

Abstract

MOTIVATION: The main goal in this paper is to develop accurate probabilistic models for important functional regions in DNA sequences (e.g. splice junctions that signal the beginning and end of transcription in human DNA). These methods can subsequently be utilized to improve the performance of gene-finding systems. The models built here attempt to model long-distance dependencies between non-adjacent bases.
RESULTS: An efficient modeling method is described which models biological data more accurately than a first-order Markov model without increasing the number of parameters. Intuitively, a small number of parameters helps a learning system to avoid overfitting. Several experiments with the model are presented, which show a small improvement in the average accuracy as compared with a simple Markov model. These experiments suggest that single long distance dependencies do not help the recognition problem, thus confirming several previous studies which have used more heuristic modeling techniques. AVAILABILITY: This software is available for downloaded and as a web resource at http://www.ai.uic.edu/software CONTACT: kasif@eecs.uic.edu

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10842737     DOI: 10.1093/bioinformatics/16.2.152

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Determinants of the inherent strength of human 5' splice sites.

Authors:  Xavier Roca; Ravi Sachidanandam; Adrian R Krainer
Journal:  RNA       Date:  2005-05       Impact factor: 4.942

2.  Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis.

Authors:  Jens Keilwagen; Jan Grau; Stefan Posch; Ivo Grosse
Journal:  BMC Bioinformatics       Date:  2010-03-22       Impact factor: 3.169

3.  Unifying generative and discriminative learning principles.

Authors:  Jens Keilwagen; Jan Grau; Stefan Posch; Marc Strickert; Ivo Grosse
Journal:  BMC Bioinformatics       Date:  2010-02-22       Impact factor: 3.169

4.  Improved identification of conserved cassette exons using Bayesian networks.

Authors:  Rileen Sinha; Michael Hiller; Rainer Pudimat; Ulrike Gausmann; Matthias Platzer; Rolf Backofen
Journal:  BMC Bioinformatics       Date:  2008-11-12       Impact factor: 3.169

5.  Splice site prediction with quadratic discriminant analysis using diversity measure.

Authors:  Lirong Zhang; Liaofu Luo
Journal:  Nucleic Acids Res       Date:  2003-11-01       Impact factor: 16.971

6.  Human-mouse gene identification by comparative evidence integration and evolutionary analysis.

Authors:  Lingang Zhang; Vladimir Pavlovic; Charles R Cantor; Simon Kasif
Journal:  Genome Res       Date:  2003-05-12       Impact factor: 9.043

7.  A Study of Domain Adaptation Classifiers Derived From Logistic Regression for the Task of Splice Site Prediction.

Authors:  Nic Herndon; Doina Caragea
Journal:  IEEE Trans Nanobioscience       Date:  2016-01-28       Impact factor: 2.935

8.  A new approach to bias correction in RNA-Seq.

Authors:  Daniel C Jones; Walter L Ruzzo; Xinxia Peng; Michael G Katze
Journal:  Bioinformatics       Date:  2012-01-28       Impact factor: 6.937

9.  A method for identifying alternative or cryptic donor splice sites within gene and mRNA sequences. Comparisons among sequences from vertebrates, echinoderms and other groups.

Authors:  Katherine M Buckley; Liliana D Florea; L Courtney Smith
Journal:  BMC Genomics       Date:  2009-07-16       Impact factor: 3.969

10.  PCRPi: Presaging Critical Residues in Protein interfaces, a new computational tool to chart hot spots in protein interfaces.

Authors:  Salam A Assi; Tomoyuki Tanaka; Terence H Rabbitts; Narcis Fernandez-Fuentes
Journal:  Nucleic Acids Res       Date:  2009-12-11       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.