Literature DB >> 11882250

tacg--a grep for DNA.

Harry J Mangalam1.   

Abstract

BACKGROUND: Pattern matching is the core of bioinformatics; it is used in database searching, restriction enzyme mapping, and finding open reading frames. It is done repeatedly over increasingly long sequences, thus codes must be efficient and insensitive to sequence length. Such patterns of interest include simple motifs with IUPAC degeneracies, regular expressions, patterns allowing mismatches, and probability matrices.
RESULTS: I describe a small application which allows searching for all the above pattern types individually, which further allows these atomic motifs to be assembled into logical rules for more sophisticated analysis.
CONCLUSION: tacg is small, portable, faster and more capable than most alternatives, relatively easy to modify, and freely available in source code.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11882250      PMCID: PMC99049          DOI: 10.1186/1471-2105-3-8

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  12 in total

1.  The TRANSFAC system on gene expression regulation.

Authors:  E Wingender; X Chen; E Fricke; R Geffers; R Hehl; I Liebich; M Krull; V Matys; H Michael; R Ohnhäuser; M Prüss; F Schacherer; S Thiele; S Urbach
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  AFLP: a new technique for DNA fingerprinting.

Authors:  P Vos; R Hogers; M Bleeker; M Reijans; T van de Lee; M Hornes; A Frijters; J Pot; J Peleman; M Kuiper
Journal:  Nucleic Acids Res       Date:  1995-11-11       Impact factor: 16.971

4.  Hidden Markov Models of the G-protein-coupled receptor family.

Authors:  P Baldi; Y Chauvin
Journal:  J Comput Biol       Date:  1994       Impact factor: 1.479

5.  SEALS: a system for easy analysis of lots of sequences.

Authors:  D R Walker; E V Koonin
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1997

6.  Searching for patterns in genomic data.

Authors:  M Dsouza; N Larsen; R Overbeek
Journal:  Trends Genet       Date:  1997-12       Impact factor: 11.639

7.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

8.  'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.

Authors:  C Marck
Journal:  Nucleic Acids Res       Date:  1988-03-11       Impact factor: 16.971

9.  Profile analysis.

Authors:  M Gribskov
Journal:  Methods Mol Biol       Date:  1994

10.  Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies.

Authors:  J van Helden; B André; J Collado-Vides
Journal:  J Mol Biol       Date:  1998-09-04       Impact factor: 5.469

View more
  4 in total

1.  The mosaic nature of intergenic 16S-23S rRNA spacer regions suggests rRNA operon copy number variation in Clostridium difficile strains.

Authors:  Nourkhoda Sadeghifard; Volker Gürtler; Michael Beer; Robert J Seviour
Journal:  Appl Environ Microbiol       Date:  2006-09-15       Impact factor: 4.792

2.  Clustering of DNA sequences in human promoters.

Authors:  Peter C FitzGerald; Andrey Shlyakhtenko; Alain A Mir; Charles Vinson
Journal:  Genome Res       Date:  2004-07-15       Impact factor: 9.043

3.  PatMatch: a program for finding patterns in peptide and nucleotide sequences.

Authors:  Thomas Yan; Danny Yoo; Tanya Z Berardini; Lukas A Mueller; Dan C Weems; Shuai Weng; J Michael Cherry; Seung Y Rhee
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

4.  Comparative genomics of Drosophila and human core promoters.

Authors:  Peter C FitzGerald; David Sturgill; Andrey Shyakhtenko; Brian Oliver; Charles Vinson
Journal:  Genome Biol       Date:  2006       Impact factor: 13.583

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.