Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A system for pattern matching applications on biosequences.

Literature DB >> 8324630

A system for pattern matching applications on biosequences.

Abstract

ANREP is a system for finding matches to patterns composed of (i) spacing constraints called 'spacers', and (ii) approximate matches to 'motifs' that are, recursively, patterns composed of 'atomic' symbols. A user specifies such patterns via a declarative, free-format and strongly typed language called A that is presented here in a tutorial style through a series of progressively more complex examples. The sample patterns are for protein and DNA sequences, the application domain for which ANREP was specifically created. ANREP provides a unified framework for almost all previously proposed biosequence patterns and extends them by providing approximate matching, a feature heretofore unavailable except for the limited case of individual sequences. The performance of ANREP is discussed and an appendix gives a concise specification of syntax and semantics. A portable C software package implementing ANREP is available via anonymous remote file transfer.

Mesh：

Year: 1993 PMID： 8324630 DOI： 10.1093/bioinformatics/9.3.299

Source DB: PubMed Journal: Comput Appl Biosci ISSN： 0266-7061

Keyword Cloud
Cited

11 in total

1. HyPaLib: a database of RNAs and RNA structural elements defined by hybrid patterns.

Authors: S Gräf; D Strothmann; S Kurtz; G Steger
Journal: Nucleic Acids Res Date: 2001-01-01 Impact factor: 16.971

2. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences.

Authors: Scott Schwartz; Laura Elnitski; Mei Li; Matt Weirauch; Cathy Riemer; Arian Smit; Eric D Green; Ross C Hardison; Webb Miller
Journal: Nucleic Acids Res Date: 2003-07-01 Impact factor: 16.971

3. PepPat, a pattern-based oligopeptide homology search method and the identification of a novel tachykinin-like peptide.

Authors: Ying Jiang; Ge Gao; Gang Fang; Eric L Gustafson; Maureen Laverty; Yanbin Yin; Yong Zhang; Jingchu Luo; Jonathan R Greene; Marvin L Bayne; Joseph A Hedrick; Nicholas J Murgolo
Journal: Mamm Genome Date: 2003-05 Impact factor: 2.957

4. SMOTIF: efficient structured pattern and profile motif search.

Authors: Yongqiang Zhang; Mohammed J Zaki
Journal: Algorithms Mol Biol Date: 2006-11-21 Impact factor: 1.405

5. Protein sequence similarity searches using patterns as seeds.

Authors: Z Zhang; A A Schäffer; W Miller; T L Madden; D J Lipman; E V Koonin; S F Altschul
Journal: Nucleic Acids Res Date: 1998-09-01 Impact factor: 16.971

6. Palingol: a declarative programming language to describe nucleic acids' secondary structures and to scan sequence database.

Authors: B Billoud; M Kontic; A Viari
Journal: Nucleic Acids Res Date: 1996-04-15 Impact factor: 16.971

A system for pattern matching applications on biosequences.

1. HyPaLib: a database of RNAs and RNA structural elements defined by hybrid patterns.

2. MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences.

3. PepPat, a pattern-based oligopeptide homology search method and the identification of a novel tachykinin-like peptide.

4. SMOTIF: efficient structured pattern and profile motif search.

5. Protein sequence similarity searches using patterns as seeds.

6. Palingol: a declarative programming language to describe nucleic acids' secondary structures and to scan sequence database.

7. PRINTS--a database of protein motif fingerprints.

8. ModuleOrganizer: detecting modules in families of transposable elements.

9. SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs.

10. PCAS--a precomputed proteome annotation database resource.