| Literature DB >> 21513505 |
Alexandra M Carvalho1, Arlindo L Oliveira.
Abstract
BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied.Entities:
Year: 2011 PMID: 21513505 PMCID: PMC3112114 DOI: 10.1186/1748-7188-6-13
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Definition of terms used in describing the algorithms presented in Methods.
| Symbol | Meaning |
|---|---|
| Σ | alphabet (usually DNA or IUPAC) |
| input sequences | |
| number of input sequences | |
| length of | |
| motif size | |
| ℓ | number of priors (it can be zero) |
| minimum number of motifs expected to be returned by a RISOTTO run | |
| maximum number of motifs expected to be returned by a RISOTTO run | |
| number of top motifs post-processed from RISOTTO output | |
| the set with the | |
| motif of size | |
| motif | |
| empty motif (its BIS score is the minimum possible value) | |
| prior probability at the | |
| annotated position for | |
| probability distribution given by the PSSM induced by | |
| the weight of the | |
| coefficient to balance priors and over-representation contribution | |
Comparison of GRISOTTO with state-of-the-art methods over ChiP-chip data.
| Algorithm | Description | Successes | % | Ref |
|---|---|---|---|---|
| PhyloCon | Local alignment of conserved regions | 19 | 12% | [ |
| PhyME | Alignment-based with EM | 21 | 13% | [ |
| MEME:OOPS | MEME with OOPS model | 36 | 23% | [ |
| MEME:ZOOPS | MEME with ZOOPS model | 39 | 25% | [ |
| MEME-c | MEME without conserved bases masked | 49 | 31% | [ |
| PhyloGibbs | Alignment-based with Gibbs Sampling | 54 | 35% | [ |
| Kellis | Alignment-based | 56 | 36% | [ |
| CompareProspector | Alignment-based with Gibbs sampling | 64 | 41% | [ |
| Converge | Alignment-based with EM | 68 | 44% | [ |
| MEME:OOPS- | MEME with OOPS model and | 73 | 47% | [ |
| PRIORITY- | Gibbs sampler with | 77 | 49% | [ |
| MEME:ZOOP- | MEME with ZOOPS model and | 81 | 52% | [ |
| GRISOTTO with | - | |||
| PRIORITY- | Gibbs sampler with | 70 | 45% | [ |
| GRISOTTO with | - | |||
| PRIORITY- | Gibbs sampler with | 70 | 45% | [ |
| GRISOTTO with | - | |||
| GRISOTTO with combined priors | - | |||
The results of motif discoverers were taken from R. Gordân et al. [16] and T. L. Bailey et al. [17].
All priors used were devised by R. Gordân, A. J. Hartemink and L. Narlikar [11,14-16].
Figure 1Comparison of GRISOTTO-. Motifs reported by Chen et al. [21] and MEME-[17] are shown along side motifs found by GRISOTTO- for the 13 mouse ChiP-seq data. Chen et al. only reported 12 out of the 13 motifs, excluding the E2f1 motif, so in this case the TRANSFAC [40] motif is shown instead.