Literature DB >> 3330960

Recognition of characteristic patterns in sets of functionally equivalent DNA sequences.

G Mengeritsky1, T F Smith.   

Abstract

An algorithm has been developed for the identification of unknown patterns which are distinctive for a set of short DNA sequences believed to be functionally equivalent. A pattern is defined as being a string, containing fully or partially specified nucleotides at each position of the string. The advantage of this 'vague' definition of the pattern is that it imposes minimum constraints on the characterization of patterns. A new feature of the approach developed here is that it allows a 'fair' simultaneous testing of patterns of all degrees of degeneracy. This analysis is based on an evaluation of inhomogeneity in the empirical occurrence distribution of any such pattern within a set of sequences. The use of the nonparametric kernel density estimation of Parzen allows one to assess small disturbances among the sequence alignments. The method also makes it possible to identify sequence subsets with different characteristic patterns. This algorithm was implemented in the analysis of patterns characteristic of sets of promoters, terminators and splice junction sequences. The results are compared with those obtained by other methods.

Mesh:

Substances:

Year:  1987        PMID: 3330960     DOI: 10.1093/bioinformatics/3.3.223

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  11 in total

1.  UTRdb and UTRsite: specialized databases of sequences and functional elements of 5' and 3' untranslated regions of eukaryotic mRNAs.

Authors:  G Pesole; S Liuni; G Grillo; F Licciulli; A Larizza; W Makalowski; C Saccone
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

3.  UTRdb and UTRsite: specialized databases of sequences and functional elements of 5' and 3' untranslated regions of eukaryotic mRNAs. Update 2002.

Authors:  Graziano Pesole; Sabino Liuni; Giorgio Grillo; Flavio Licciulli; Flavio Mignone; Carmela Gissi; Cecilia Saccone
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

4.  WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences.

Authors:  G Pesole; N Prunella; S Liuni; M Attimonelli; C Saccone
Journal:  Nucleic Acids Res       Date:  1992-06-11       Impact factor: 16.971

5.  SQUIRREL: Sequence QUery, Information Retrieval and REporting Library. A program package for analyzing signals in nucleic acid sequences for the VAX.

Authors:  C J Gartmann; U Grob
Journal:  Nucleic Acids Res       Date:  1991-11-11       Impact factor: 16.971

6.  UTRdb: a specialized database of 5' and 3' untranslated regions of eukaryotic mRNAs.

Authors:  G Pesole; S Liuni; G Grillo; M Ippedico; A Larizza; W Makalowski; C Saccone
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

7.  UTRdb: a specialized database of 5'- and 3'-untranslated regions of eukaryotic mRNAs.

Authors:  G Pesole; S Liuni; G Grillo; C Saccone
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

8.  Computer-assisted prediction, classification, and delimitation of protein binding sites in nucleic acids.

Authors:  K Frech; G Herrmann; T Werner
Journal:  Nucleic Acids Res       Date:  1993-04-11       Impact factor: 16.971

9.  Discover 1: a new program to search for unusually represented DNA motifs.

Authors:  G Valle
Journal:  Nucleic Acids Res       Date:  1993-11-11       Impact factor: 16.971

10.  Multi-alphabet consensus algorithm for identification of low specificity protein-DNA interactions.

Authors:  A V Ulyanov; G D Stormo
Journal:  Nucleic Acids Res       Date:  1995-04-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.