Jacques van Helden1. 1. SCMBB, Université Libre de Bruxelles, Campus Plaine CP 263, Boulevard du Triomphe, B-1050 Bruxelles, Belgium. jvanheld@ucmb.ulb.ac.be
Abstract
MOTIVATION: Upstream sequences contain short motifs, which mediate transcriptional regulation by specifically binding different transcription factors. The presence of common motifs in the regulatory regions of two genes might be considered as a clue for a potential co-regulation. A pattern count-based (dis)similarity metric between sequences could thus be used to classify genes according to their putative regulatory properties. RESULTS: We present here several metrics which rely on probability theory, and which aim at comparing sequences on the basis of pattern counts. We compare these metrics to several classical dissimilarity and similarity metrics, and illustrate their behaviour with a biological example.
MOTIVATION: Upstream sequences contain short motifs, which mediate transcriptional regulation by specifically binding different transcription factors. The presence of common motifs in the regulatory regions of two genes might be considered as a clue for a potential co-regulation. A pattern count-based (dis)similarity metric between sequences could thus be used to classify genes according to their putative regulatory properties. RESULTS: We present here several metrics which rely on probability theory, and which aim at comparing sequences on the basis of pattern counts. We compare these metrics to several classical dissimilarity and similarity metrics, and illustrate their behaviour with a biological example.
Authors: Florian A Karreth; Yvonne Tay; Daniele Perna; Ugo Ala; Shen Mynn Tan; Alistair G Rust; Gina DeNicola; Kaitlyn A Webster; Dror Weiss; Pedro A Perez-Mancera; Michael Krauthammer; Ruth Halaban; Paolo Provero; David J Adams; David A Tuveson; Pier Paolo Pandolfi Journal: Cell Date: 2011-10-14 Impact factor: 41.582
Authors: Kai Song; Jie Ren; Gesine Reinert; Minghua Deng; Michael S Waterman; Fengzhu Sun Journal: Brief Bioinform Date: 2013-09-23 Impact factor: 11.622
Authors: Nathan L Clement; Quinn Snell; Mark J Clement; Peter C Hollenhorst; Jahnvi Purwar; Barbara J Graves; Bradley R Cairns; W Evan Johnson Journal: Bioinformatics Date: 2009-10-27 Impact factor: 6.937