Literature DB >> 7816600

Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames.

V V Solovyev1, A A Salamov, C B Lawrence.   

Abstract

A new method which predicts internal exon sequences in human DNA has been developed. The method is based on a splice site prediction algorithm that uses the linear discriminant function to combine information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotides in protein coding and intron regions. The accuracy of our splice site recognition function is 97% for donor splice sites and 96% for acceptor splice sites. For exon prediction, we combine in a discriminant function the characteristics describing the 5'-intron region, donor splice site, coding region, acceptor splice site and 3'-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79%. The recognition quality computed at the level of individual nucleotides is 89% for exon sequences and 98% for intron sequences. This corresponds to a correlation coefficient for exon prediction of 0.87. The precision of this approach is better than other methods and has been tested on a larger data set. We have also developed a means for predicting exon-exon junctions in cDNA sequences, which can be useful for selecting optimal PCR primers.

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 7816600      PMCID: PMC332054          DOI: 10.1093/nar/22.24.5156

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  16 in total

1.  The prediction of exons through an analysis of spliceable open reading frames.

Authors:  G B Hutchinson; M R Hayden
Journal:  Nucleic Acids Res       Date:  1992-07-11       Impact factor: 16.971

2.  Comparison of the predicted and observed secondary structure of T4 phage lysozyme.

Authors:  B W Matthews
Journal:  Biochim Biophys Acta       Date:  1975-10-20

Review 3.  Assessment of protein coding measures.

Authors:  J W Fickett; C S Tung
Journal:  Nucleic Acids Res       Date:  1992-12-25       Impact factor: 16.971

4.  gm: a practical tool for automating DNA sequence analysis.

Authors:  C A Fields; C A Soderlund
Journal:  Comput Appl Biosci       Date:  1990-07

5.  Determination of eukaryotic protein coding regions using neural networks and information theory.

Authors:  R Farber; A Lapedes; K Sirotkin
Journal:  J Mol Biol       Date:  1992-07-20       Impact factor: 5.469

6.  Finding protein coding regions in genomic sequences.

Authors:  R Staden
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

7.  Splice junctions, branch point sites, and exons: sequence statistics, identification, and applications to genome project.

Authors:  P Senapathy; M B Shapiro; N L Harris
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

8.  Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks.

Authors:  E E Snyder; G D Stormo
Journal:  Nucleic Acids Res       Date:  1993-02-11       Impact factor: 16.971

9.  Assignment of position-specific error probability to primary DNA sequence data.

Authors:  C B Lawrence; V V Solovyev
Journal:  Nucleic Acids Res       Date:  1994-04-11       Impact factor: 16.971

10.  Prediction of splice junctions in mRNA sequences.

Authors:  K Nakata; M Kanehisa; C DeLisi
Journal:  Nucleic Acids Res       Date:  1985-07-25       Impact factor: 16.971

View more
  59 in total

1.  Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome.

Authors:  F Almazán; J M González; Z Pénzes; A Izeta; E Calvo; J Plana-Durán; L Enjuanes
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

2.  Positional characterisation of false positives from computational prediction of human splice sites.

Authors:  T A Thanaraj
Journal:  Nucleic Acids Res       Date:  2000-02-01       Impact factor: 16.971

3.  GeneSplicer: a new computational method for splice site prediction.

Authors:  M Pertea; X Lin; S L Salzberg
Journal:  Nucleic Acids Res       Date:  2001-03-01       Impact factor: 16.971

4.  Modulation of exon skipping by high-affinity hnRNP A1-binding sites and by intron elements that repress splice site utilization.

Authors:  M Blanchette; B Chabot
Journal:  EMBO J       Date:  1999-04-01       Impact factor: 11.598

5.  An intronic splicing enhancer binds U1 snRNPs to enhance splicing and select 5' splice sites.

Authors:  A J McCullough; S M Berget
Journal:  Mol Cell Biol       Date:  2000-12       Impact factor: 4.272

Review 6.  Computational gene finding in plants.

Authors:  Mihaela Pertea; Steven L Salzberg
Journal:  Plant Mol Biol       Date:  2002-01       Impact factor: 4.076

7.  Computational analysis of candidate intron regulatory elements for tissue-specific alternative pre-mRNA splicing.

Authors:  M Brudno; M S Gelfand; S Spengler; M Zorn; I Dubchak; J G Conboy
Journal:  Nucleic Acids Res       Date:  2001-06-01       Impact factor: 16.971

8.  Identification of programmed translational -1 frameshifting sites in the genome of Saccharomyces cerevisiae.

Authors:  Michaël Bekaert; Hugues Richard; Bernard Prum; Jean-Pierre Rousset
Journal:  Genome Res       Date:  2005-10       Impact factor: 9.043

9.  Sequence conservation at human and mouse orthologous common fragile regions, FRA3B/FHIT and Fra14A2/Fhit.

Authors:  T Shiraishi; T Druck; K Mimori; J Flomenberg; L Berk; H Alder; W Miller; K Huebner; C M Croce
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-24       Impact factor: 11.205

10.  Sequencing and comparative analysis of a conserved syntenic segment in the Solanaceae.

Authors:  Ying Wang; Adam Diehl; Feinan Wu; Julia Vrebalov; James Giovannoni; Adam Siepel; Steven D Tanksley
Journal:  Genetics       Date:  2008-08-24       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.