| Literature DB >> 21330290 |
Charles E Grant1, Timothy L Bailey, William Stafford Noble.
Abstract
UNLABELLED: A motif is a short DNA or protein sequence that contributes to the biological function of the sequence in which it resides. Over the past several decades, many computational methods have been described for identifying, characterizing and searching with sequence motifs. Critical to nearly any motif-based sequence analysis pipeline is the ability to scan a sequence database for occurrences of a given motif described by a position-specific frequency matrix.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21330290 PMCID: PMC3065696 DOI: 10.1093/bioinformatics/btr064
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Comparison of motif search functionality
| Method | Scans DNA | Scans proteins | Supports custom backgrounds | Reports | Performs multiple testing correction | Source code freely available | Web accessible | GFF/WIG output | XML/HTML output |
|---|---|---|---|---|---|---|---|---|---|
| MotifScanner | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| MotifViz | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| STORM | ✓ | ✓ | ✓ | ||||||
| TRED | ✓ | ✓ | ✓ | ||||||
| RSAT | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| Patser | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |||
| PoSSuMsearch | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| MATCH | ✓ | ✓ | ✓ | ||||||
| FIMO | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
References for the motif scanning algorithms are provided in the supplement. Note that FIMO only supports zero-order custom backgrounds.
Fig. 1.Using FIMO to identify candidate CTCF binding sites in the human genome. (A) Sample FIMO HTML output, showing the locations of the top-scoring occurrences of the CTCF motif in the human genome. (B) A precision-recall curve created by comparing FIMO's ranked list of CTCF sites with a gold standard derived from a ChIP-seq experiment.