Literature DB >> 9149143

Prediction of complete gene structures in human genomic DNA.

C Burge1, S Karlin.   

Abstract

We introduce a general probabilistic model of the gene structure of human genomic sequences which incorporates descriptions of the basic transcriptional, translational and splicing signals, as well as length distributions and compositional features of exons, introns and intergenic regions. Distinct sets of model parameters are derived to account for the many substantial differences in gene density and structure observed in distinct C + G compositional regions of the human genome. In addition, new models of the donor and acceptor splice signals are described which capture potentially important dependencies between signal positions. The model is applied to the problem of gene identification in a computer program, GENSCAN, which identifies complete exon/intron structures of genes in genomic DNA. Novel features of the program include the capacity to predict multiple genes in a sequence, to deal with partial as well as complete genes, and to predict consistent sets of genes occurring on either or both DNA strands. GENSCAN is shown to have substantially higher accuracy than existing methods when tested on standardized sets of human and vertebrate genes, with 75 to 80% of exons identified exactly. The program is also capable of indicating fairly accurately the reliability of each predicted exon. Consistently high levels of accuracy are observed for sequences of differing C + G content and for distinct groups of vertebrates.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9149143     DOI: 10.1006/jmbi.1997.0951

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  1339 in total

1.  Campomelic dysplasia translocation breakpoints are scattered over 1 Mb proximal to SOX9: evidence for an extended control region.

Authors:  D Pfeifer; R Kist; K Dewar; K Devon; E S Lander; B Birren; L Korniszewski; E Back; G Scherer
Journal:  Am J Hum Genet       Date:  1999-07       Impact factor: 11.025

2.  ExInt: an Exon/Intron database.

Authors:  M Sakharkar; M Long; T W Tan; S J de Souza
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  A cell plate-specific callose synthase and its interaction with phragmoplastin.

Authors:  Z Hong; A J Delauney; D P Verma
Journal:  Plant Cell       Date:  2001-04       Impact factor: 11.277

Review 4.  Annotating sequence data using Genotator.

Authors:  N L Harris
Journal:  Mol Biotechnol       Date:  2000-11       Impact factor: 2.695

5.  PipMaker--a web server for aligning two genomic DNA sequences.

Authors:  S Schwartz; Z Zhang; K A Frazer; A Smit; C Riemer; J Bouck; R Gibbs; R Hardison; W Miller
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

6.  Positional characterisation of false positives from computational prediction of human splice sites.

Authors:  T A Thanaraj
Journal:  Nucleic Acids Res       Date:  2000-02-01       Impact factor: 16.971

7.  First pass annotation of promoters on human chromosome 22.

Authors:  M Scherf; A Klingenhoff; K Frech; K Quandt; R Schneider; K Grote; M Frisch; V Gailus-Durner; A Seidel; R Brack-Werner; T Werner
Journal:  Genome Res       Date:  2001-03       Impact factor: 9.043

8.  GeneSplicer: a new computational method for splice site prediction.

Authors:  M Pertea; X Lin; S L Salzberg
Journal:  Nucleic Acids Res       Date:  2001-03-01       Impact factor: 16.971

9.  Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS.

Authors:  M Yano; Y Katayose; M Ashikari; U Yamanouchi; L Monna; T Fuse; T Baba; K Yamamoto; Y Umehara; Y Nagamura; T Sasaki
Journal:  Plant Cell       Date:  2000-12       Impact factor: 11.277

10.  An effective approach for analyzing "prefinished" genomic sequence data.

Authors:  P M Kuehl; J M Weisemann; J W Touchman; E D Green; M S Boguski
Journal:  Genome Res       Date:  1999-02       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.