| Literature DB >> 19146695 |
Jan Baumbach1, Sven Rahmann, Andreas Tauch.
Abstract
BACKGROUND: Transcriptional regulation of gene activity is essential for any living organism. Transcription factors therefore recognize specific binding sites within the DNA to regulate the expression of particular target genes. The genome-scale reconstruction of the emerging regulatory networks is important for biotechnology and human medicine but cost-intensive, time-consuming, and impossible to perform for any species separately. By using bioinformatics methods one can partially transfer networks from well-studied model organisms to closely related species. However, the prediction quality is limited by the low level of evolutionary conservation of the transcription factor binding sites, even within organisms of the same genus.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19146695 PMCID: PMC2653031 DOI: 10.1186/1752-0509-3-8
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Scheme of the transfer workflow. a, Simplified structure of a typical transcriptional gene regulatory interaction database. Using genetic upstream sequences and transcription factor binding site annotations the TFBSs can be re-adjusted and modeled as PWMs for subsequent TFBS predictions. Sequence clustering tools can be applied to the stored genome annotation and gene/protein sequences to gain information about homologous genes/proteins. b, A regulatory interaction can be transferred from a model organism to a closely related species if the regulator as well as the target gene are orthologous and a matching TFBS can be found in the upstream sequence of the orthologous target gene.
Comparison of the original and the transferred database content of CoryneRegNet.
| TFs | TFsC | TFsK | TFsCK | Regulations | |||
| CG | 128 | 69 | 530 | ||||
| original | transferred | transferred | original | transferred | |||
| CD | 63 | 49 (77.8%) | 2 (3.2%) | 20 (×10) | 20 (40.1%) | 46 | 193 (×4.2) |
| CE | 103 | 77 (74.8%) | 5 (4.9%) | 28 (×5.6) | 28 (36.4%) | 64 | 348 (×5.4) |
| CJ | 55 | 31 (56.4%) | 1 (1.8%) | 13 (×13) | 13 (41.9%) | 51 | 150 (×2.9) |
| Av | 69.6% | 3.3% | x9.5 | 39.7% | x4.2 | ||
Abrev.: CG = The model organism Corynebacterium glutamicum, CD = C. diphtheriae, CE = C. efficiens, CJ = C. jeikeium, Av = Average, TFs = Transcription factors, TFsC = Common transcription factors with C. glutamicum, TFsK = Transcription factors with knowledge, TFsCK = Transcription factors common with C. glutamicum with knowledge, Regulations = Transcriptional regulatory interactions, original = Original database content, transferred = Database content after network transfer. Percentages/factors: TFsC = relative to column TFs, TFsK/original = relative to column TFs, TFsK/transferred = relative to column TFsK/orig., TFsCK = relative to column TFsC, Regulations/transferred = relative to column Regulations/original.
Figure 2Illustration of the gene regulatory network for PcaR. a, A comparative visualization of the known gene regulatory network of PcaR in the model organism C. glutamicum (right) transferred to C. efficiens (left). Nodes correspond to genes, directed (red) edges to negative transcriptional regulatory interactions, and undirected (black) edges to a sequence-based similarity that indicates a putative homology. b/c, The sequence logos computed from the PcaR binding sites in C. efficiens (predicted, left) and in C. glutamicum (evidenced, right).