Literature DB >> 8670622

Identification of functional elements in unaligned nucleic acid sequences by a novel tuple search algorithm.

F Wolfertstetter1, K Frech, G Herrmann, T Werner.   

Abstract

We present an algorithm to identify potential functional elements like protein binding sites in DNA sequences, solely from nucleotide sequence data. Prerequisites are a set of at least seven not closely related sequences with a common biological function which is correlated to one or more unknown sequence elements present in most but not necessarily all of the sequences. The algorithm is based on a search for n-tuples which occur at least in a minimum percentage of the sequences with no or one mismatch, which may be at any position of the tuple. In contrast to functional tuples, random tuples show no preferred pattern of mismatch locations within the tuple nor is the conservation extended beyond the tuple. Both features of functional tuples are used to eliminate random tuples. Selection is carried out by maximization of the information content first for the n-tuple, then for a region containing the tuple and finally for the complete binding site. Further matches are found in an additional selection step, using the ConsInd method previously described. The algorithm is capable of identifying and delimiting elements (e.g. protein binding sites) represented by single short cores (e.g. TATA box) in sets of unaligned sequences of about 500 nucleotides using no information other than the nucleotide sequences. Furthermore, we show its ability to identify multiple elements in a set of complete LTR sequences (more than 600 nucleotides per sequence).

Mesh:

Substances:

Year:  1996        PMID: 8670622     DOI: 10.1093/bioinformatics/12.1.71

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  19 in total

1.  SELEX_DB: an activated database on selected randomized DNA/RNA sequences addressed to genomic sequence annotation.

Authors:  J V Ponomarenko; G V Orlova; M P Ponomarenko; S V Lavryushev; A S Frolov; S V Zybova; N A Kolchanov
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

3.  Integrated functional and bioinformatics approach for the identification and experimental verification of RNA signals: application to HIV-1 INS.

Authors:  Horst Wolff; Ruth Brack-Werner; Markus Neumann; Thomas Werner; Ralf Schneider
Journal:  Nucleic Acids Res       Date:  2003-06-01       Impact factor: 16.971

Review 4.  Computational approaches to identify promoters and cis-regulatory elements in plant genomes.

Authors:  Stephane Rombauts; Kobe Florquin; Magali Lescot; Kathleen Marchal; Pierre Rouzé; Yves van de Peer
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

5.  Predicting gene regulatory elements in silico on a genomic scale.

Authors:  A Brazma; I Jonassen; J Vilo; E Ukkonen
Journal:  Genome Res       Date:  1998-11       Impact factor: 9.043

6.  DNA elements regulating alpha1-tubulin gene induction during regeneration of eukaryotic flagella.

Authors:  G Periz; L R Keller
Journal:  Mol Cell Biol       Date:  1997-07       Impact factor: 4.272

7.  Characterisation of complementary DNAs from the expressed sequence tag analysis of life cycle stages of Laminaria digitata (Phaeophyceae).

Authors:  F Crépineau; T Roscoe; R Kaas; B Kloareg; C Boyen
Journal:  Plant Mol Biol       Date:  2000-07       Impact factor: 4.076

8.  Proteomic analysis of native hepatocyte nuclear factor-4α (HNF4α) isoforms, phosphorylation status, and interactive cofactors.

Authors:  Kenji Daigo; Takeshi Kawamura; Yoshihiro Ohta; Riuko Ohashi; Satoshi Katayose; Toshiya Tanaka; Hiroyuki Aburatani; Makoto Naito; Tatsuhiko Kodama; Sigeo Ihara; Takao Hamakubo
Journal:  J Biol Chem       Date:  2010-11-03       Impact factor: 5.157

9.  Direct vs 2-stage approaches to structured motif finding.

Authors:  Maria Federico; Mauro Leoncini; Manuela Montangero; Paolo Valente
Journal:  Algorithms Mol Biol       Date:  2012-08-21       Impact factor: 1.405

10.  Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites.

Authors:  Soumyadeep Nandi; Ilya Ioshikhes
Journal:  BMC Genomics       Date:  2012-08-22       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.