Literature DB >> 11465033

Comparing the success of different prediction software in sequence analysis: a review.

V B Bajić1.   

Abstract

The abundance of computer software for different types of prediction in DNA and protein sequence analyses raises the problem of adequate ranking of prediction program quality. A single measure of success of predictor software, which adequately ranks the predictors, does not exist. A typical example of such an incomplete measure is the so-called correlation coefficient. This paper provides an overview and short analysis of several different measures of prediction quality. Frequently, some of these measures give results contradictory to each other even when they relate to the same prediction scores. This may lead to confusion. In order to overcome some of the problems, a few new measures are proposed including some variants of a 'generalised distance from the ideal predictor score'; these are based on topological properties, rather than on statistics. In order to provide a sort of a balanced ranking, the averaged score measure (ASM) is introduced. The ASM provides a possibility for the selection of the predictor that probably has the best overall performance. The method presented in the paper applies to the ranking problem of any prediction software whose results can be properly represented in a true positive-false positive framework, thus providing a natural set-up for linear biological sequence analysis.

Mesh:

Year:  2000        PMID: 11465033     DOI: 10.1093/bib/1.3.214

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  18 in total

1.  SITECON: a tool for detecting conservative conformational and physicochemical properties in transcription factor binding site alignments and for site recognition.

Authors:  D Y Oshchepkov; E E Vityaev; D A Grigorovich; E V Ignatieva; T M Khlebodarova
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

Review 2.  Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment.

Authors:  Vladimir B Bajic; Michael R Brent; Randall H Brown; Adam Frankish; Jennifer Harrow; Uwe Ohler; Victor V Solovyev; Sin Lam Tan
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

3.  HMCan-diff: a method to detect changes in histone modifications in cells with different genetic characteristics.

Authors:  Haitham Ashoor; Caroline Louis-Brennetot; Isabelle Janoueix-Lerosey; Vladimir B Bajic; Valentina Boeva
Journal:  Nucleic Acids Res       Date:  2017-05-05       Impact factor: 16.971

4.  Dragon gene start finder: an advanced system for finding approximate locations of the start of gene transcriptional units.

Authors:  Vladimir B Bajic; Seng Hong Seah
Journal:  Genome Res       Date:  2003-07-17       Impact factor: 9.043

5.  E2F5 status significantly improves malignancy diagnosis of epithelial ovarian cancer.

Authors:  Narasimhan Kothandaraman; Vladimir B Bajic; Pang N K Brendan; Chan Y Huak; Peh B Keow; Khalil Razvi; Manuel Salto-Tellez; Mahesh Choolani
Journal:  BMC Cancer       Date:  2010-02-24       Impact factor: 4.430

6.  Simplified method to predict mutual interactions of human transcription factors based on their primary structure.

Authors:  Sebastian Schmeier; Boris Jankovic; Vladimir B Bajic
Journal:  PLoS One       Date:  2011-07-05       Impact factor: 3.240

Review 7.  EGASP: the human ENCODE Genome Annotation Assessment Project.

Authors:  Roderic Guigó; Paul Flicek; Josep F Abril; Alexandre Reymond; Julien Lagarde; France Denoeud; Stylianos Antonarakis; Michael Ashburner; Vladimir B Bajic; Ewan Birney; Robert Castelo; Eduardo Eyras; Catherine Ucla; Thomas R Gingeras; Jennifer Harrow; Tim Hubbard; Suzanna E Lewis; Martin G Reese
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

8.  GC-compositional strand bias around transcription start sites in plants and fungi.

Authors:  Shigeo Fujimori; Takanori Washio; Masaru Tomita
Journal:  BMC Genomics       Date:  2005-02-28       Impact factor: 3.969

9.  HMCan: a method for detecting chromatin modifications in cancer samples using ChIP-seq data.

Authors:  Haitham Ashoor; Aurélie Hérault; Aurélie Kamoun; François Radvanyi; Vladimir B Bajic; Emmanuel Barillot; Valentina Boeva
Journal:  Bioinformatics       Date:  2013-09-09       Impact factor: 6.937

10.  Determining promoter location based on DNA structure first-principles calculations.

Authors:  J Ramon Goñi; Alberto Pérez; David Torrents; Modesto Orozco
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.