Literature DB >> 12463853

DNA splice site detection: a comparison of specific and general methods.

Won Kim1, W John Wilbur.   

Abstract

In an era when whole organism genomes are being routinely sequenced, the problem of gene finding has become a key issue on the road to understanding. For eukaryotic organisms a large part of locating the genes is accomplished by predicting the likely location of splice sites on a DNA strand. This problem of splice site location has been ap- proached using a number of machine learning or statistical methods tailored more or less specifically to the nature of the problem. Recently large margin classifiers and boosting methods have been found to give improvements over more traditional methods in a number of areas. Here we compare large margin classifiers (SVM and CMLS) and boosted decision trees with the three most common models used for splice site detection (WMM, WAM, and MDT). We find that the newer methods compare favorably in all cases and can yield significant improvement in some cases.

Mesh:

Substances:

Year:  2002        PMID: 12463853      PMCID: PMC2244387     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  14 in total

1.  GeneSplicer: a new computational method for splice site prediction.

Authors:  M Pertea; X Lin; S L Salzberg
Journal:  Nucleic Acids Res       Date:  2001-03-01       Impact factor: 16.971

2.  A decision tree system for finding genes in DNA.

Authors:  S Salzberg; A L Delcher; K H Fasman; J Henderson
Journal:  J Comput Biol       Date:  1998       Impact factor: 1.479

3.  Boosting naïve Bayesian learning on a large subset of MEDLINE.

Authors:  W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

4.  Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

Authors:  E C Uberbacher; R J Mural
Journal:  Proc Natl Acad Sci U S A       Date:  1991-12-15       Impact factor: 11.205

5.  Identification of protein coding regions in the human genome by quadratic discriminant analysis.

Authors:  M Q Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  1997-01-21       Impact factor: 11.205

6.  GeneID in Drosophila.

Authors:  G Parra; E Blanco; R Guigó
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

7.  Finding genes in DNA using decision trees and dynamic programming.

Authors:  S Salzberg; X Chen; J Henderson; K Fasman
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1996

8.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

9.  Computer methods to locate signals in nucleic acid sequences.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

10.  Prediction of human mRNA donor and acceptor sites from the DNA sequence.

Authors:  S Brunak; J Engelbrecht; S Knudsen
Journal:  J Mol Biol       Date:  1991-07-05       Impact factor: 5.469

View more
  1 in total

1.  Effective automated feature construction and selection for classification of biological sequences.

Authors:  Uday Kamath; Kenneth De Jong; Amarda Shehu
Journal:  PLoS One       Date:  2014-07-17       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.