Literature DB >> 9520502

Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm.

I Rigoutsos1, A Floratos.   

Abstract

MOTIVATION: The discovery of motifs in biological sequences is an important problem.
RESULTS: This paper presents a new algorithm for the discovery of rigid patterns (motifs) in biological sequences. Our method is combinatorial in nature and able to produce all patterns that appear in at least a (user-defined) minimum number of sequences, yet it manages to be very efficient by avoiding the enumeration of the entire pattern space. Furthermore, the reported patterns are maximal: any reported pattern cannot be made more specific and still keep on appearing at the exact same positions within the input sequences. The effectiveness of the proposed approach is showcased on a number of test cases which aim to: (i) validate the approach through the discovery of previously reported patterns; (ii) demonstrate the capability to identify automatically highly selective patterns particular to the sequences under consideration. Finally, experimental analysis indicates that the algorithm is output sensitive, i.e. its running time is quasi-linear to the size of the generated output.

Mesh:

Substances:

Year:  1998        PMID: 9520502     DOI: 10.1093/bioinformatics/14.1.55

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  101 in total

1.  Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis.

Authors:  H J Bussemaker; H Li; E D Siggia
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

2.  Dictionary-driven prokaryotic gene finding.

Authors:  Tetsuo Shibuya; Isidore Rigoutsos
Journal:  Nucleic Acids Res       Date:  2002-06-15       Impact factor: 16.971

3.  The role of intercalating residues in chromosomal high-mobility-group protein DNA binding, bending and specificity.

Authors:  Janet Klass; Frank V Murphy; Susan Fouts; Melissa Serenil; Anita Changela; Jessica Siple; Mair E A Churchill
Journal:  Nucleic Acids Res       Date:  2003-06-01       Impact factor: 16.971

4.  In silico pattern-based analysis of the human cytomegalovirus genome.

Authors:  Isidore Rigoutsos; Jiri Novotny; Tien Huynh; Stephen T Chin-Bow; Laxmi Parida; Daniel Platt; David Coleman; Thomas Shenk
Journal:  J Virol       Date:  2003-04       Impact factor: 5.103

5.  Re-evaluation and in silico annotation of the Tupaia herpesvirus proteins.

Authors:  Udo Bahr; Gholamreza Darai
Journal:  Virus Genes       Date:  2004-01       Impact factor: 2.332

6.  The web server of IBM's Bioinformatics and Pattern Discovery group.

Authors:  Tien Huynh; Isidore Rigoutsos; Laxmi Parida; Daniel Platt; Tetsuo Shibuya
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

7.  Structural details (kinks and non-alpha conformations) in transmembrane helices are intrahelically determined and can be predicted by sequence pattern descriptors.

Authors:  Isidore Rigoutsos; Peter Riek; Robert M Graham; Jiri Novotny
Journal:  Nucleic Acids Res       Date:  2003-08-01       Impact factor: 16.971

8.  Dictionary-driven protein annotation.

Authors:  Isidore Rigoutsos; Tien Huynh; Aris Floratos; Laxmi Parida; Daniel Platt
Journal:  Nucleic Acids Res       Date:  2002-09-01       Impact factor: 16.971

9.  Genomic analysis of immunity in a Urochordate and the emergence of the vertebrate immune system: "waiting for Godot".

Authors:  Kaoru Azumi; Rosaria De Santis; Anthony De Tomaso; Isidore Rigoutsos; Fumiko Yoshizaki; Maria Rosaria Pinto; Rita Marino; Kazuhito Shida; Makoto Ikeda; Masami Ikeda; Masafumi Arai; Yasuhito Inoue; Toshio Shimizu; Nori Satoh; Daniel S Rokhsar; Louis Du Pasquier; Masanori Kasahara; Masanobu Satake; Masaru Nonaka
Journal:  Immunogenetics       Date:  2003-10-07       Impact factor: 2.846

10.  The web server of IBM's Bioinformatics and Pattern Discovery group: 2004 update.

Authors:  Tien Huynh; Isidore Rigoutsos
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.