Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

E C Uberbacher, R J Mural.

Abstract

Genes in higher eukaryotes may span tens or hundreds of kilobases with the protein-coding regions accounting for only a few percent of the total sequence. Identifying genes within large regions of uncharacterized DNA is a difficult undertaking and is currently the focus of many research efforts. We describe a reliable computational approach for locating protein-coding portions of genes in anonymous DNA sequence. Using a concept suggested by robotic environmental sensing, our method combines a set of sensor algorithms and a neural network to localize the coding regions. Several algorithms that report local characteristics of the DNA sequence, and therefore act as sensors, are also described. In its current configuration the "coding recognition module" identifies 90% of coding exons of length 100 bases or greater with less than one false positive coding exon indicated per five coding exons indicated. This is a significantly lower false positive rate than any method of which we are aware. This module demonstrates a method with general applicability to sequence-pattern recognition problems and is available for current research efforts.
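The combination of sensors and a neural network described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the individual sensors, so `gc_content` and `period3_signal` are hypothetical stand-ins, the window size and step are assumed values, and the network weights are hand-picked placeholders rather than trained parameters. The sketch only shows the general shape of the idea: several algorithms each report a local property of a sequence window, and a small feed-forward network merges those readings into one coding score per position.

```python
import math

def gc_content(window):
    """Illustrative sensor 1: fraction of G or C bases in the window."""
    return sum(base in "GC" for base in window) / len(window)

def period3_signal(window):
    """Illustrative sensor 2: how unevenly each base is spread over the
    three codon positions (0 = perfectly even, 1 = fully position-specific)."""
    score = 0.0
    for base in "ACGT":
        counts = [0, 0, 0]
        for i, b in enumerate(window):
            if b == base:
                counts[i % 3] += 1
        total = sum(counts)
        if total:
            score += (max(counts) - min(counts)) / total
    return score / 4

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Placeholder weights chosen by hand for illustration only (an assumption);
# a real module would learn them from sequences with known coding regions.
HIDDEN_WEIGHTS = [[2.0, 4.0], [-1.0, 3.0]]
HIDDEN_BIASES = [-2.0, -1.0]
OUTPUT_WEIGHTS = [2.5, 1.5]
OUTPUT_BIAS = -2.0

def combine(sensor_values):
    """Feed the sensor readings through one hidden layer to a single output."""
    hidden = [
        sigmoid(sum(w * v for w, v in zip(weights, sensor_values)) + bias)
        for weights, bias in zip(HIDDEN_WEIGHTS, HIDDEN_BIASES)
    ]
    return sigmoid(sum(w * h for w, h in zip(OUTPUT_WEIGHTS, hidden)) + OUTPUT_BIAS)

def scan(sequence, window=99, step=3):
    """Slide a window along the sequence and return (start, score) pairs."""
    results = []
    for start in range(0, len(sequence) - window + 1, step):
        segment = sequence[start:start + window]
        sensors = [gc_content(segment), period3_signal(segment)]
        results.append((start, combine(sensors)))
    return results

if __name__ == "__main__":
    for start, score in scan("ATG" * 50)[:3]:
        print(start, round(score, 3))
```

In this sketch, windows whose scores exceed a chosen threshold would be reported as candidate coding regions; the threshold, like the weights, would have to be calibrated on annotated data.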

Year: 1991    PMID: 1763041    PMCID: PMC53114    DOI: 10.1073/pnas.88.24.11261

Source DB: PubMed    Journal: Proc Natl Acad Sci U S A    ISSN: 0027-8424    Impact factor: 11.205


