| Literature DB >> 23039862 |
Olga V Tsoy1, Mikhail A Pyatnitskiy, Marat D Kazanov, Mikhail S Gelfand.
Abstract
BACKGROUND: The exponential growth of the number of fully sequenced genomes at varying taxonomic closeness allows one to characterize transcriptional regulation using comparative-genomics analysis instead of time-consuming experimental methods. A transcriptional regulatory unit consists of a transcription factor, its binding site and a regulated gene. These units constitute a graph which contains so-called "network motifs", subgraphs of a given structure. Here we consider genomes of closely related Enterobacteriales and estimate the fraction of conserved network motifs and sites as well as positions under selection in various types of non-coding regions.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23039862 PMCID: PMC3735044 DOI: 10.1186/1471-2148-12-200
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Types of feed-forward loops (FFLs).
Figure 2Five types of alignment positions for sequences from two clades ( and spp.).
Distribution of neutrally evolving positions
| Variable positions in the | |||
| Conserved positions in the | not relevant | ||
| Total | not relevant |
Figure 3Fraction of positions under selection for different types of intergenic regions. Whiskers designate 95%-confidence interval for the mean
Figure 4Fraction of positions under selection for different functional DNA regions. Whiskers designate the 95%-confidence interval for the mean
Figure 5The fraction of conserved links depending on the number of studied strains.A. global non-motif links; (B) links from global FFLs; (C) local non-motif links; (D) links from local FFLs
The number of events in TRN
| Non-motif connections | |||||
| No TF | 0 | 73 | 0 | 240 | 45 |
| No regulated gene | 31 | 8 | 607 | 674 | 26 |
| No TFBS (at least 1) | 7 | 62 | 708 | 1201 | 14 |
| No TFBS (all) | 1 | 25 | 596 | 746 | 11 |
| Conserved links | 51 | 321 | 840 | 2522 | 250 |
| Total links | 89 | 452 | 2155 | 4637 | 335 |
| FFL | |||||
| No TF | 0 | 189 | 0 | 208 | 44 |
| No regulated gene | 281 | 82 | 572 | 193 | 15 |
| No TFBS (at least 1) | 186 | 117 | 828 | 395 | 17 |
| No TFBS (all) | 105 | 44 | 536 | 186 | 8 |
| Conserved links | 290 | 483 | 1310 | 596 | 110 |
| Total links | 757 | 871 | 2710 | 1371 | 186 |
The number of events in TRN
| C1 FFLs | ||||
| No TF | 0 | 72 | 0 | 138 |
| No regulated gene | 157 | 15 | 131 | 32 |
| No TFBS (at least 1) | 50 | 40 | 282 | 184 |
| No TFBS (all) | 10 | 14 | 178 | 77 |
| Conserved links | 87 | 107 | 483 | 147 |
| Total links | 294 | 234 | 896 | 501 |
| I1 FFLs | ||||
| No TF | 0 | 193 | 0 | 100 |
| No regulated gene | 135 | 50 | 245 | 92 |
| No TFBS (at least 1) | 154 | 58 | 367 | 131 |
| No TFBS (all) | 57 | 30 | 166 | 26 |
| Conserved links | 271 | 258 | 712 | 276 |
| Total links | 560 | 559 | 1324 | 599 |
Distribution of allelic variants
| 1 allele in | 17541 + 3824 | 1087 + 245 | 33 + 4 | 0 |
| 2 alleles in | 594 + 126 | 153 | 7 | 0 |
| 3 alleles in | 24 + 4 | 9 | 1 | 0 |
| 4 alleles in | 1 | 0 | 0 | 0 |
The first number in cells from the first row and the first column reflects cases where the single allelic variant in one lineage is among the allelic variants from the other lineage, and the second number reflects the cases where the allelic variants are different. The mean values for all parameters were calculated for 100 bootstrap samples of 15 E. coli strains and 15 Salmonella strains.