Literature DB >> 15262783

Splice site identification by idlBNs.

Robert Castelo1, Roderic Guigó.   

Abstract

MOTIVATION: Computational identification of functional sites in nucleotide sequences is at the core of many algorithms for the analysis of genomic data. This identification is based on the statistical parameters estimated from a training set. Often, because of the huge number of parameters, it is difficult to obtain consistent estimators. To simplify the estimation problem, one imposes independent assumptions between the nucleotides along the site. However, this can potentially limit the minimum value of the estimation error.
RESULTS: In this paper, we introduce a novel method in the context of identifying functional sites, that finds a reasonable set of independence assumptions supported by the data, among the nucleotides, and uses it to perform the identification of the sites by their likelihood ratio. More importantly, in many practical situations it is capable of improving its performance as the training sample size increases. We apply the method to the identification of splice sites, and further evaluate its effect within the context of exon and gene prediction.

Mesh:

Year:  2004        PMID: 15262783     DOI: 10.1093/bioinformatics/bth932

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis.

Authors:  Jens Keilwagen; Jan Grau; Stefan Posch; Ivo Grosse
Journal:  BMC Bioinformatics       Date:  2010-03-22       Impact factor: 3.169

2.  Genome-wide association between branch point properties and alternative splicing.

Authors:  André Corvelo; Martina Hallegger; Christopher W J Smith; Eduardo Eyras
Journal:  PLoS Comput Biol       Date:  2010-11-24       Impact factor: 4.475

3.  Splice site identification using probabilistic parameters and SVM classification.

Authors:  A K M A Baten; B C H Chang; S K Halgamuge; Jason Li
Journal:  BMC Bioinformatics       Date:  2006-12-18       Impact factor: 3.169

4.  VOMBAT: prediction of transcription factor binding sites using variable order Bayesian trees.

Authors:  Jan Grau; Irad Ben-Gal; Stefan Posch; Ivo Grosse
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

5.  Vertebrate gene finding from multiple-species alignments using a two-level strategy.

Authors:  David Carter; Richard Durbin
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

6.  An empirical study of ensemble-based semi-supervised learning approaches for imbalanced splice site datasets.

Authors:  Ana Stanescu; Doina Caragea
Journal:  BMC Syst Biol       Date:  2015-09-01

7.  MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations.

Authors:  Jens Keilwagen; Jan Baumbach; Thomas A Kohl; Ivo Grosse
Journal:  Genome Biol       Date:  2009-05-01       Impact factor: 13.583

8.  Computational predictions provide insights into the biology of TAL effector target sites.

Authors:  Jan Grau; Annett Wolf; Maik Reschke; Ulla Bonas; Stefan Posch; Jens Boch
Journal:  PLoS Comput Biol       Date:  2013-03-14       Impact factor: 4.475

9.  Fast splice site detection using information content and feature reduction.

Authors:  A K M A Baten; S K Halgamuge; B C H Chang
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

10.  Effective transcription factor binding site prediction using a combination of optimization, a genetic algorithm and discriminant analysis to capture distant interactions.

Authors:  Victor G Levitsky; Elena V Ignatieva; Elena A Ananko; Igor I Turnaev; Tatyana I Merkulova; Nikolay A Kolchanov; T C Hodgman
Journal:  BMC Bioinformatics       Date:  2007-12-19       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.