Literature DB >> 10637326

Positional characterisation of false positives from computational prediction of human splice sites.

T A Thanaraj1.   

Abstract

The performance of computational tools that can predict human splice sites are reviewed using a test set of EST-confirmed splice sites. The programs (namely HMMgene, NetGene2, HSPL, NNSPLICE, SpliceView and GeneID-3) differ from one another in the degree of discriminatory information used for prediction. The results indicate that, as expected, HMMgene and NetGene2 (which use global as well as local coding information and splice signals) followed by HSPL (which uses local coding information and splice signals) performed better than the other three programs (which use only splice signals). For the former three programs, one in every three false positive splice sites was predicted in the vicinity of true splice sites while only one in every 12 was expected to occur in such a region by chance. The persistence of this observation for programs (namely FEXH, GRAIL2, MZEF, GeneID-3, HMMgene and GENSCAN) that can predict all the potential exons (including optimal and sub-optimal) was assessed. In a high proportion (>50%) of the partially correct predicted exons, the incorrect exon ends were located in the vicinity of the real splice sites. Analysis of the distribution of proximal false positives indicated that the splice signals used by the algorithms are not strong enough to discriminate particularly those false predictions that occur within +/- 25 nt around the real sites. It is therefore suggested that specialised statistics that can discriminate real splice sites from proximal false positives be incorporated in gene prediction programs.

Entities:  

Mesh:

Year:  2000        PMID: 10637326      PMCID: PMC102552          DOI: 10.1093/nar/28.3.744

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  13 in total

1.  Prediction of gene structure.

Authors:  R Guigó; S Knudsen; N Drake; T Smith
Journal:  J Mol Biol       Date:  1992-07-05       Impact factor: 5.469

2.  Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

Authors:  E C Uberbacher; R J Mural
Journal:  Proc Natl Acad Sci U S A       Date:  1991-12-15       Impact factor: 11.205

3.  Intron-exon structures of eukaryotic model organisms.

Authors:  M Deutsch; M Long
Journal:  Nucleic Acids Res       Date:  1999-08-01       Impact factor: 16.971

4.  Evaluation of gene structure prediction programs.

Authors:  M Burset; R Guigó
Journal:  Genomics       Date:  1996-06-15       Impact factor: 5.736

5.  Improved splice site detection in Genie.

Authors:  M G Reese; F H Eeckman; D Kulp; D Haussler
Journal:  J Comput Biol       Date:  1997       Impact factor: 1.479

6.  Analysis of donor splice sites in different eukaryotic organisms.

Authors:  I B Rogozin; L Milanesi
Journal:  J Mol Evol       Date:  1997-07       Impact factor: 2.395

7.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

8.  Identification of protein coding regions in genomic DNA.

Authors:  E E Snyder; G D Stormo
Journal:  J Mol Biol       Date:  1995-04-21       Impact factor: 5.469

9.  Prediction of human mRNA donor and acceptor sites from the DNA sequence.

Authors:  S Brunak; J Engelbrecht; S Knudsen
Journal:  J Mol Biol       Date:  1991-07-05       Impact factor: 5.469

10.  Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames.

Authors:  V V Solovyev; A A Salamov; C B Lawrence
Journal:  Nucleic Acids Res       Date:  1994-12-11       Impact factor: 16.971

View more
  12 in total

1.  Determinants of the inherent strength of human 5' splice sites.

Authors:  Xavier Roca; Ravi Sachidanandam; Adrian R Krainer
Journal:  RNA       Date:  2005-05       Impact factor: 4.942

2.  Human GC-AG alternative intron isoforms with weak donor sites show enhanced consensus at acceptor exon positions.

Authors:  T A Thanaraj; F Clark
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

3.  Evolution of the exon-intron structure and alternative splicing of the MAGE-A family of cancer/testis antigens.

Authors:  Irena I Artamonova; Mikhail S Gelfand
Journal:  J Mol Evol       Date:  2004-11       Impact factor: 2.395

4.  Inferring alternative splicing patterns in mouse from a full-length cDNA library and microarray data.

Authors:  Hiromi Kochiwa; Ryosuke Suzuki; Takanori Washio; Rintaro Saito; Hidemasa Bono; Piero Carninci; Yasushi Okazaki; Rika Miki; Yoshihide Hayashizaki; Masaru Tomita
Journal:  Genome Res       Date:  2002-08       Impact factor: 9.043

5.  Comprehensive splice-site analysis using comparative genomics.

Authors:  Nihar Sheth; Xavier Roca; Michelle L Hastings; Ted Roeder; Adrian R Krainer; Ravi Sachidanandam
Journal:  Nucleic Acids Res       Date:  2006-08-12       Impact factor: 16.971

6.  Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization.

Authors:  Igor Vorechovský
Journal:  Nucleic Acids Res       Date:  2006-09-08       Impact factor: 16.971

7.  Current awareness on comparative and functional genomics.

Authors: 
Journal:  Yeast       Date:  2000-09-30       Impact factor: 3.239

8.  Complex inheritance in Pulmonary Arterial Hypertension patients with several mutations.

Authors:  Guillermo Pousada; Adolfo Baloira; Diana Valverde
Journal:  Sci Rep       Date:  2016-09-15       Impact factor: 4.379

9.  TrueSight: a new algorithm for splice junction detection using RNA-seq.

Authors:  Yang Li; Hongmei Li-Byarlay; Paul Burns; Mark Borodovsky; Gene E Robinson; Jian Ma
Journal:  Nucleic Acids Res       Date:  2012-12-18       Impact factor: 16.971

10.  Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization.

Authors:  Emanuele Buratti; Martin Chivers; Jana Královicová; Maurizio Romano; Marco Baralle; Adrian R Krainer; Igor Vorechovsky
Journal:  Nucleic Acids Res       Date:  2007-06-18       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.