K Murakami (1), T Takagi (1). (1) Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokane-dai Minato-ku, Tokyo 108-8639 and (2) Central Research Laboratory, Hitachi Ltd, 1-280 Higashi-Koigakubo, Kokubunji-shi, Tokyo 185-8601, Japan.
Abstract
MOTIVATION: A number of programs have been developed to predict eukaryotic gene structures in DNA sequences. However, gene finding remains a challenging problem. RESULTS: We explored how effective it is to re-analyze and combine the results of several gene-finding programs. We studied several combination methods using four programs (FEXH, GeneParser3, GENSCAN and GRAIL2). With the HIGHEST-policy combination method or the BOUNDARY method, approximate correlation (AC) improved by 3-5% compared with the best single gene-finding program. From another viewpoint, the OR-based combination of the four programs is the most reliable for determining whether a candidate exon overlaps a real exon, although it is less sensitive than GENSCAN at exon-intron boundaries. Our methods can easily be extended to combine other programs. AVAILABILITY: We have developed a server program (Shirokane System) and a client program (GeneScope) for using the methods. GeneScope is available through a WWW site (http://gf.genome.ad.jp/). CONTACT: (katsu,takagi)@ims.u-tokyo.ac.jp
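To make the OR-based combination and the AC measure concrete, the following is a minimal Python sketch. It is not the Shirokane System or GeneScope implementation. It assumes that each program's prediction is given as a list of 1-based inclusive exon intervals, that OR-based combination marks a nucleotide as coding when at least one program does, and that AC is the commonly used nucleotide-level approximate correlation (the average of the four conditional probabilities, rescaled to the range -1 to 1). The function names `coding_positions`, `or_combine` and `approximate_correlation` are hypothetical; the HIGHEST-policy and BOUNDARY methods are not sketched because the abstract does not define them.

```python
from typing import Iterable, List, Set, Tuple

# Assumption: an exon is a 1-based, inclusive (start, end) interval.
Interval = Tuple[int, int]


def coding_positions(exons: Iterable[Interval]) -> Set[int]:
    """Expand exon intervals into the set of nucleotide positions they cover."""
    positions: Set[int] = set()
    for start, end in exons:
        positions.update(range(start, end + 1))
    return positions


def or_combine(predictions: List[List[Interval]]) -> Set[int]:
    """Assumed OR rule: a position is coding if any program predicts it."""
    combined: Set[int] = set()
    for exons in predictions:
        combined |= coding_positions(exons)
    return combined


def approximate_correlation(predicted: Set[int], actual: Set[int],
                            seq_len: int) -> float:
    """Nucleotide-level AC = (ACP - 0.5) * 2.

    ACP averages TP/(TP+FN), TP/(TP+FP), TN/(TN+FP) and TN/(TN+FN);
    terms with a zero denominator are skipped (an assumption of this sketch).
    """
    tp = len(predicted & actual)
    fp = len(predicted - actual)
    fn = len(actual - predicted)
    tn = seq_len - tp - fp - fn
    ratios = [
        num / den
        for num, den in ((tp, tp + fn), (tp, tp + fp),
                         (tn, tn + fp), (tn, tn + fn))
        if den > 0
    ]
    if not ratios:
        return 0.0
    acp = sum(ratios) / len(ratios)
    return (acp - 0.5) * 2


# Toy data (made up for illustration, not taken from the paper).
real_exon = coding_positions([(11, 20)])
program_a = [(11, 15)]
program_b = [(16, 25)]
combined = or_combine([program_a, program_b])
ac_combined = approximate_correlation(combined, real_exon, seq_len=100)
ac_a_alone = approximate_correlation(coding_positions(program_a),
                                     real_exon, seq_len=100)
```

In this toy case the OR combination recovers every real coding nucleotide at the cost of extra false positives, which is the kind of sensitivity and specificity trade-off the abstract describes for the OR-based method.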