| Literature DB >> 23032607 |
Joseph P Cornish1, Fialelei Matthews, Julien R Thomas, Ivan Erill.
Abstract
The assumption of basic properties, like self-regulation, in simple transcriptional regulatory networks can be exploited to infer regulatory motifs from the growing amounts of genomic and meta-genomic data. These motifs can in principle be used to elucidate the nature and scope of transcriptional networks through comparative genomics. Here we assess the feasibility of this approach using the SOS regulatory network of Gram-positive bacteria as a test case. Using experimentally validated data, we show that the known regulatory motif can be inferred through the assumption of self-regulation. Furthermore, the inferred motif provides a more robust search pattern for comparative genomics than the experimental motifs defined in reference organisms. We take advantage of this robustness to generate a functional map of the SOS response in Gram-positive bacteria. Our results reveal definite differences in the composition of the LexA regulon between Firmicutes and Actinobacteria, and confirm that regulation of cell-division inhibition is a widespread characteristic of this network among Gram-positive bacteria.Entities:
Keywords: LexA; SOS response; comparative genomics; motif discovery; transcription factor; transcriptional network
Year: 2012 PMID: 23032607 PMCID: PMC3422134 DOI: 10.4137/EBO.S9205
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Schematic representation of the comparative genomics approach used in this work. (A) Motif discovery. Self-regulation is assumed for the transcription factor of interest, which is identified univocally in a particular genome. A uniform sample of genome sequences from a given phylogenetic group is selected and multiple homologues of the transcription factor are identified in these genome sequences as best-bidirectional BLAST hits using the TF of interest as the starting query. The upstream sequences of the genes coding for the TF homologues are retrieved and a motif discovery algorithm is applied to them. The resulting best motif model is refined by exploiting its palindromic nature and the collection of sites that compose it is expanded by re-searching the gene upstream regions for additional sites. (B) Comparative genomics.
Notes: The expanded site collection is used in subsequent genome-wide searches against a selected subset of genome sequences. Results on each genome are first filtered with an initial threshold (T0), which is revised (T1...T) when further instances of regulation are discovered for a given gene. Genes showing instances of regulation for more than one species are reported as putative elements of the TF regulon.
Figure 2LexA-binding motifs. (A) Best-scoring motif reported by MEME based on 50 sites, (B) search-expanded motif encompassing 71 sites, (C) motif from experimental sites in B. subtilis, (D) motif from experimental sites in C. glutamicum, (E) motif from experimental sites in M. tuberculosis and (F) motif from experimental sites in L. monocytogenes.
Figure 3ROC curves for search efficiency with experimentally-validated and MEME-derived collections on four different genomes corresponding to the Firmicutes (left) and Actinobacteria (right).
Notes: Sensitivity corresponds to the fraction of experimentally validated binding sites detected by the search algorithm. Specificity is the fraction of the rest of genomic positions reported by the search algorithm. Only the high-specificity region of the ROC curve is shown to illustrate the differences between the different collections used to define the search motif on different genomes. Expanded ROC curves showing a larger segment of specificity are reported as Supplementary data 10.
Composition of the LexA regulon in Gram-positive bacteria.
| Gene | Reference | Firmicutes | Actinobacteria | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| ||||||||||||
| Bsu | Cac | Efa | Lmo | Sau | Ace | Cgl | Lxy | Mtu | Nfa | Sgr | |||
| Core | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ||
| ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | ■ | |||
| TLS | ■ | ■ | ■ | □ | □ | □ | □ | □ | ■ | ||||
| □ | □ | ■ | ■ | ■ | |||||||||
| □ | ■ | ■ | □ | □ | |||||||||
| DNA repair | ■ | ■ | ■ | ■ | |||||||||
| ■ | ■ | □ | ■ | ■ | |||||||||
| ■ | □ | □ | ■ | □ | □ | □ | □ | □ | □ | □ | |||
| ■ | ■ | ■ | ■ | ■ | □ | □ | □ | □ | □ | □ | |||
| ■ | ■ | ■ | ■ | ||||||||||
| ■ | □ | □ | ■ | ■ | □ | ||||||||
| □ | ■ | ■ | ■ | ■ | ■ | ||||||||
| ■ | ■ | ■ | |||||||||||
| □ | □ | □ | ■ | ■ | □ | ||||||||
| Cell division | ■ | ■ | |||||||||||
| ■ | |||||||||||||
| ■ | |||||||||||||
| Rv2719c | ■ | ||||||||||||
| ■ | |||||||||||||
| ■ | ■ | ||||||||||||
Notes: Known or predicted LexA regulation is indicated by filled squares, whereas an open square denotes lack of putative LexA-binding sites. Empty cells indicate that a given gene is absent from a particular genome. A list of references in which SOS regulation has been experimentally verified is provided for each gene. Genes are sorted according to the following categories. A richer version of this table, with gene annotation information and with the sequence and location of identified LexA-binding sites is available as supplementary data (Supplementary data 11).
Abbreviations: Core, core regulon members, TSL, Translesion synthesis, DNA repair, genes involved in DNA repair, and Cell division, genes involved in cell division suppression. Species abbreviations are as follows: Bsu, B. subtilis; Cac, C. acetobutylicum; Efa, E. faecalis; Lmo, L. monocytogenes; Sau, S. aureus; Ace, A. cellulolyticus; Cgl, C. glutamicum; Lxy, L. xyli; Mtu, M. tuberculosis; Nfa, N. farcinica; Sgr, S. griseus.
Figure 4Schematic representation of the genomic region encompassing the lexA gene and divergently paired putative and known (yneA, divS, Rv2719c) cell division inhibitors.
Notes: Genes are represented by arrows. Solid grey arrows indicate nonconserved genes. Red circles denote LexA-binding sites. NCBI GenBank accession numbers are provided relevant genes. The figure illustrates the conservation of synteny in both Firmicutes and Actinobacteria. In L. xyli, a recent genetic rearrangement involving a transposase gene (tnp) and the his operon has visibly displaced nrdR from the vicinity of the LysM domain-containing protein encoded by Lxx15870. The divS and nrdR genes, as well as yneA, yneB and ynzC, are known to constitute operons.32,60