Literature DB >> 2014171

Training back-propagation neural networks to define and detect DNA-binding sites.

M C O'Neill1.   

Abstract

A three layered back-propagation neural network was trained to recognize E. coli promoters of the 17 base spacing class. To this end, the network was presented with 39 promoter sequences and derivatives of those sequences as positive inputs; 60% A + T random sequences and sequences containing 2 promoter-down point mutations were used as negative inputs. The entire promoter sequence of 58 bases, approximately -50 to +8, was entered as input. The network was asked to associate an output of 1.0 with promoter sequence input and 0.0 with non-promoter input. Generally, after 100,000 input cycles, the network was virtually perfect in classifying the training set. A trained network was about 80% effective in recognizing 'new' promoters which were not in the training set, with a false positive rate below 0.1%. Network searches on pBR322 and on the lambda genome were also performed. Overall the results were somewhat better than the best rule-based procedures. The trained network can be analyzed both for its choice of base and relative weighting, positive and negative, at each position of the sequence. This method, which requires only appropriate input/output training pairs, can be used to define and search for any DNA regulatory sequence for which there are sufficient exemplars.

Entities:  

Mesh:

Substances:

Year:  1991        PMID: 2014171      PMCID: PMC333596          DOI: 10.1093/nar/19.2.313

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  19 in total

1.  Information content of binding sites on nucleotide sequences.

Authors:  T D Schneider; G D Stormo; L Gold; A Ehrenfeucht
Journal:  J Mol Biol       Date:  1986-04-05       Impact factor: 5.469

2.  Analysis of E. coli promoter sequences.

Authors:  C B Harley; R P Reynolds
Journal:  Nucleic Acids Res       Date:  1987-03-11       Impact factor: 16.971

3.  Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli.

Authors:  D J Galas; M Eggert; M S Waterman
Journal:  J Mol Biol       Date:  1985-11-05       Impact factor: 5.469

4.  Promoters recognized by Escherichia coli RNA polymerase selected by function: highly efficient promoters from bacteriophage T5.

Authors:  R Gentz; H Bujard
Journal:  J Bacteriol       Date:  1985-10       Impact factor: 3.490

5.  Computer methods to locate signals in nucleic acid sequences.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

6.  Escherichia coli promoter sequences predict in vitro RNA polymerase selectivity.

Authors:  M E Mulligan; D K Hawley; R Entriken; W R McClure
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

7.  Revised sequence of the tetracycline-resistance gene of pBR322.

Authors:  K W Peden
Journal:  Gene       Date:  1983 May-Jun       Impact factor: 3.688

8.  Sequence determinants of promoter activity.

Authors:  P Youderian; S Bouvier; M M Susskind
Journal:  Cell       Date:  1982-10       Impact factor: 41.582

9.  Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli.

Authors:  G D Stormo; T D Schneider; L Gold; A Ehrenfeucht
Journal:  Nucleic Acids Res       Date:  1982-05-11       Impact factor: 16.971

10.  Search algorithm for pattern match analysis of nucleic acid sequences.

Authors:  R Harr; M Häggström; P Gustafsson
Journal:  Nucleic Acids Res       Date:  1983-05-11       Impact factor: 16.971

View more
  22 in total

Review 1.  Computational gene finding in plants.

Authors:  Mihaela Pertea; Steven L Salzberg
Journal:  Plant Mol Biol       Date:  2002-01       Impact factor: 4.076

2.  An assessment of neural network and statistical approaches for prediction of E. coli promoter sites.

Authors:  P B Horton; M Kanehisa
Journal:  Nucleic Acids Res       Date:  1992-08-25       Impact factor: 16.971

3.  Molecular characterization of mutations affecting expression level and growth rate-dependent regulation of the Escherichia coli zwf gene.

Authors:  D L Rowley; W P Fawcett; R E Wolf
Journal:  J Bacteriol       Date:  1992-01       Impact factor: 3.490

4.  Protein classification artificial neural system.

Authors:  C Wu; G Whitson; J McLarty; A Ermongkonchai; T C Chang
Journal:  Protein Sci       Date:  1992-05       Impact factor: 6.725

5.  Isolation of DNA damage-inducible promoters in Escherichia coli: regulation of polB (dinA), dinG, and dinH by LexA repressor.

Authors:  L K Lewis; M E Jenkins; D W Mount
Journal:  J Bacteriol       Date:  1992-05       Impact factor: 3.490

6.  A general procedure for locating and analyzing protein-binding sequence motifs in nucleic acids.

Authors:  M C O'Neill
Journal:  Proc Natl Acad Sci U S A       Date:  1998-09-01       Impact factor: 11.205

7.  The Haemophilus influenzae dprABC genes constitute a competence-inducible operon that requires the product of the tfoX (sxy) gene for transcriptional activation.

Authors:  S Karudapuram; G J Barcak
Journal:  J Bacteriol       Date:  1997-08       Impact factor: 3.490

8.  Experimentally determined weight matrix definitions of the initiator and TBP binding site elements of promoters.

Authors:  R J Kraus; E E Murray; S R Wiley; N M Zink; K Loritz; G W Gelembiuk; J E Mertz
Journal:  Nucleic Acids Res       Date:  1996-04-15       Impact factor: 16.971

9.  Analysis of eukaryotic promoter sequences reveals a systematically occurring CT-signal.

Authors:  N I Larsen; J Engelbrecht; S Brunak
Journal:  Nucleic Acids Res       Date:  1995-04-11       Impact factor: 16.971

10.  Structural organization, nucleotide sequence, and regulation of the Haemophilus influenzae rec-1+ gene.

Authors:  J J Zulty; G J Barcak
Journal:  J Bacteriol       Date:  1993-11       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.