| Literature DB >> 18841238 |
Ambika Shyam Sundar1, Susan Mary Varghese, Khader Shameer, Nataraja Karaba, Makarla Udayakumar, Ramanathan Sowdhamini.
Abstract
UNLABELLED: The expressions of proteins in the cell are carefully regulated by transcription factors that interact with their downstream targets in specific signal transduction cascades. Our understanding of the regulation of functional genes responsive to stress signals is still nascent. Plants like Arabidopsis thaliana, are convenient model systems to study fundamental questions related to regulation of the stress transcriptome in response to stress challenges. Microarray results of the Arabidopsis transcriptome indicate that several genes could be upregulated during multiple stresses, such as cold, salinity, drought etc. Experimental biochemical validations have proved the involvement of several transcription factors could be involved in the upregulation of these stress responsive genes. In order to follow the intricate and complicated networks of transcription factors and genes that respond to stress situations in plants, we have developed a computer algorithm that can identify key transcription factor binding sites upstream of a gene of interest. Hidden Markov models of the transcription factor binding sites enable the identification of predicted sites upstream of plant stress genes. The search algorithm, STIF, performs very well, with more than 90% sensitivity, when tested on experimentally validated positions of transcription factor binding sites on a dataset of 60 stress upregulated genes. AVAILABILITY: Supplementary data is available at http://caps.ncbs.res.in/download/stif.Entities:
Keywords: Arabidopsis thaliana; HMM based algorithm; gene regulation; stress genes; transcription factor binding site prediction
Year: 2008 PMID: 18841238 PMCID: PMC2561162 DOI: 10.6026/97320630002431
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Construction of a Hidden Markov Model of transcription factor binding sites given the experimentally observed nucleotide patterns.
Figure 2Flow chart diagram of STIF search algorithm.
Figure 3(a) The validation set of 11 stress responsive genes when searched for 100 base pairs with its 5′UTR with 11 stress responsive genes. The total number of false positives obtained during the search is compared against the total number of false negatives for various Z-score thresholds applied for the statistical tests. (b) Same as Figure 3a but for a validation set of 29 stress genes where search for TFBS was performed 1000 base pairs with its 5′UTR.