Literature DB >> 7937158

Back-propagation and counter-propagation neural networks for phylogenetic classification of ribosomal RNA sequences.

C Wu1, S Shivakumar.   

Abstract

A neural network system has been developed for rapid and accurate classification of ribosomal RNA sequences according to phylogenetic relationship. The molecular sequences are encoded into neural input vectors using an n-gram hashing method. A SVD (singular value decomposition) method is used to compress and reduce the size of long and sparse n-gram input vectors. The neural networks used are three-layered, feed-forward networks that employ supervised learning paradigms, including the back-propagation algorithm and a modified counter-propagation algorithm. A pedagogical pattern selection strategy is used to reduce the training time. After trained with ribosomal RNA sequences of the RDP (Ribosomal Database Project) database, the system can classify query sequences into more than one hundred phylogenetic classes with a 100% accuracy at a rate of less than 0.3 CPU second per sequence on a workstation. When compared to other sequence similarity search methods, including Similarity Rank, Blast and Fasta, the neural network method has a higher classification accuracy at a speed of about an order of magnitude faster. The software tool will be made available to the biology community, and the system may be extended into a gene identification system for classifying indiscriminately sequenced DNA fragments.

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 7937158      PMCID: PMC331947          DOI: 10.1093/nar/22.20.4291

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  15 in total

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  Determination of eukaryotic protein coding regions using neural networks and information theory.

Authors:  R Farber; A Lapedes; K Sirotkin
Journal:  J Mol Biol       Date:  1992-07-20       Impact factor: 5.469

3.  Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach.

Authors:  E C Uberbacher; R J Mural
Journal:  Proc Natl Acad Sci U S A       Date:  1991-12-15       Impact factor: 11.205

Review 4.  Bacterial evolution.

Authors:  C R Woese
Journal:  Microbiol Rev       Date:  1987-06

5.  Improvements in protein secondary structure prediction by an enhanced neural network.

Authors:  D G Kneller; F E Cohen; R Langridge
Journal:  J Mol Biol       Date:  1990-07-05       Impact factor: 5.469

Review 6.  Phylogenies from molecular sequences: inference and reliability.

Authors:  J Felsenstein
Journal:  Annu Rev Genet       Date:  1988       Impact factor: 16.830

7.  Predicting the secondary structure of globular proteins using neural network models.

Authors:  N Qian; T J Sejnowski
Journal:  J Mol Biol       Date:  1988-08-20       Impact factor: 5.469

8.  Self-organized neural maps of human protein sequences.

Authors:  E A Ferrán; B Pflugfelder; P Ferrara
Journal:  Protein Sci       Date:  1994-03       Impact factor: 6.725

9.  Escherichia coli promoters: neural networks develop distinct descriptions in learning to search for promoters of different spacing classes.

Authors:  M C O'Neill
Journal:  Nucleic Acids Res       Date:  1992-07-11       Impact factor: 16.971

10.  A new family of powerful multivariate statistical sequence analysis techniques.

Authors:  M van Heel
Journal:  J Mol Biol       Date:  1991-08-20       Impact factor: 5.469

View more
  7 in total

1.  Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles.

Authors:  Pedro A Reche; John-Paul Glutting; Hong Zhang; Ellis L Reinherz
Journal:  Immunogenetics       Date:  2004-09-03       Impact factor: 2.846

2.  Phylogenetic analysis of the bacterial communities in marine sediments.

Authors:  J P Gray; R P Herwig
Journal:  Appl Environ Microbiol       Date:  1996-11       Impact factor: 4.792

3.  Cold-adapted alanine dehydrogenases from two antarctic bacterial strains: gene cloning, protein characterization, and comparison with mesophilic and thermophilic counterparts.

Authors:  A Galkin; L Kulakova; H Ashida; Y Sawa; N Esaki
Journal:  Appl Environ Microbiol       Date:  1999-09       Impact factor: 4.792

4.  Cold-active serine alkaline protease from the psychrotrophic bacterium Shewanella strain ac10: gene cloning and enzyme purification and characterization.

Authors:  L Kulakova; A Galkin; T Kurihara; T Yoshimura; N Esaki
Journal:  Appl Environ Microbiol       Date:  1999-02       Impact factor: 4.792

5.  Computational analysis and modeling of cleavage by the immunoproteasome and the constitutive proteasome.

Authors:  Carmen M Diez-Rivero; Esther M Lafuente; Pedro A Reche
Journal:  BMC Bioinformatics       Date:  2010-09-23       Impact factor: 3.169

6.  A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

Authors:  Ai-bing Zhang; Jie Feng; Robert D Ward; Ping Wan; Qiang Gao; Jun Wu; Wei-zhong Zhao
Journal:  PLoS One       Date:  2012-02-20       Impact factor: 3.240

7.  Comparison of Diagnosis Accuracy between a Backpropagation Artificial Neural Network Model and Linear Regression in Digestive Disease Patients: an Empirical Research.

Authors:  Wei Wei; Xu Yang
Journal:  Comput Math Methods Med       Date:  2021-02-27       Impact factor: 2.238

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.