A Klingenhoff1, K Frech, K Quandt, T Werner. 1. Institute of Mammalian Genetics, GSF-National Research Center for Environment and Health, Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany.
Abstract
MOTIVATION: Gene regulation often depends on functional modules which feature a detectable internal organization. Overall sequence similarity of these modules is often insufficient for detection by general search methods like FASTA or even Gapped BLAST. However, it is of interest to evaluate whether modules, often known from experimental analysis of single sequences, are present in other regulatory sequences. RESULTS: We developed a new method (FastM) which combines a search algorithm for individual transcription factor binding sites (MatInspector) with a distance correlation function. FastM allows fast definition of a model of correlated binding sites derived from as little as a single promoter or enhancer. ModelInspector results are suitable for evaluation of the significance of the model. We used FastM to define a model for the experimentally verified NFkappaB/IRF1 regulatory module from the major histocompatibility complex (MHC) class I HLA-B gene promoter. Analysis of a test set of sequences as well as database searches with this model showed excellent correlation of the model with the biological function of the module. These results could not be obtained by searches using FASTA or Gapped BLAST, which are based on sequence similarity. We were also able to demonstrate association of a hypothetical GRE-GRE module with viral sequences based on analysis of several GenBank sections with this module. AVAILABILITY: The WWW version of FastM is accessible at: http://www.gsf.de/cgi-bin/fastm. pl and http://genomatix.gsf.de/cgi-bin/fastm2/fastm.pl
MOTIVATION: Gene regulation often depends on functional modules which feature a detectable internal organization. Overall sequence similarity of these modules is often insufficient for detection by general search methods like FASTA or even Gapped BLAST. However, it is of interest to evaluate whether modules, often known from experimental analysis of single sequences, are present in other regulatory sequences. RESULTS: We developed a new method (FastM) which combines a search algorithm for individual transcription factor binding sites (MatInspector) with a distance correlation function. FastM allows fast definition of a model of correlated binding sites derived from as little as a single promoter or enhancer. ModelInspector results are suitable for evaluation of the significance of the model. We used FastM to define a model for the experimentally verified NFkappaB/IRF1 regulatory module from the major histocompatibility complex (MHC) class I HLA-B gene promoter. Analysis of a test set of sequences as well as database searches with this model showed excellent correlation of the model with the biological function of the module. These results could not be obtained by searches using FASTA or Gapped BLAST, which are based on sequence similarity. We were also able to demonstrate association of a hypothetical GRE-GRE module with viral sequences based on analysis of several GenBank sections with this module. AVAILABILITY: The WWW version of FastM is accessible at: http://www.gsf.de/cgi-bin/fastm. pl and http://genomatix.gsf.de/cgi-bin/fastm2/fastm.pl
Authors: M Scherf; A Klingenhoff; K Frech; K Quandt; R Schneider; K Grote; M Frisch; V Gailus-Durner; A Seidel; R Brack-Werner; T Werner Journal: Genome Res Date: 2001-03 Impact factor: 9.043
Authors: Laura Saba; Sanjiv V Bhave; Nicholas Grahame; Paula Bice; Razvan Lapadat; John Belknap; Paula L Hoffman; Boris Tabakoff Journal: Mamm Genome Date: 2006-06-12 Impact factor: 2.957
Authors: Ryan P McMullin; Albert Dobi; Laura N Mutton; András Orosz; Shilpi Maheshwari; Cooduvalli S Shashikant; Charles J Bieberich Journal: Proc Natl Acad Sci U S A Date: 2009-12-14 Impact factor: 11.205
Authors: Benjamin Müller; Christian Prante; Cornelius Knabbe; Knut Kleesiek; Christian Götting Journal: Glycoconj J Date: 2012-08-11 Impact factor: 2.916