Literature DB >> 8877520

Finding genes in DNA using decision trees and dynamic programming.

S Salzberg1, X Chen, J Henderson, K Fasman.   

Abstract

This study demonstrates the use of decision tree classifiers as the basis for a general gene-finding system. The system uses a dynamic programming algorithm that finds the optimal segmentation of a DNA sequence into coding and non-coding regions (exons and introns). The optimality property is dependent on a separate scoring function that takes a subsequence and assigns to it a score reflecting the probability that the sequence is an exon. In this study, the scoring functions were sets of decision trees and rules that were combined to give the probability estimate. Experimental results on a newly collected database of human DNA sequences are encouraging, and some new observations about the structure of classifiers for the gene-finding problem have emerged from this study. We also provide descriptions of a new probability chain model that produces very accurate filters to find donor and acceptor sites.

Entities:  

Mesh:

Year:  1996        PMID: 8877520

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  1 in total

1.  DNA splice site detection: a comparison of specific and general methods.

Authors:  Won Kim; W John Wilbur
Journal:  Proc AMIA Symp       Date:  2002
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.