| Literature DB >> 15927055 |
Karthikeyan Sivaraman1, Aswin Sai Narain Seshasayee, Krishnakumar Swaminathan, Geetha Muthukumaran, Gautam Pennathur.
Abstract
BACKGROUND: Transcription is the first step in cellular information processing. It is regulated by cis-acting elements such as promoters and operators in the DNA, and trans-acting elements such as transcription factors and sigma factors. Identification of cis-acting regulatory elements on a genomic scale requires computational analysis.Entities:
Mesh:
Year: 2005 PMID: 15927055 PMCID: PMC1166578 DOI: 10.1186/1742-4682-2-20
Source DB: PubMed Journal: Theor Biol Med Model ISSN: 1742-4682 Impact factor: 2.432
Figure 1(A) A schematic representation of the procedure used for profiling, incorporating the definition of the four fragments F1, F2, F3 and F4 used in this study. (B) Comparison of the occurrence distribution in the random control (series 1), F4 (series 2), F3 (series 3), F2 (series 2) and F1 (series 4). (C) Number of words whose occurrence is greater than μ+Nσ, where N is on the x axis. (D) Distribution of the three classes of oligonucleotides in the four fragments: TATAAT for class 1, AGGAGG for class 2 and AAAAAA for class 3.
| Threshold(C)/Region(R) | %>μ+2 | %>μ+3 | %>μ+4 | %>200 |
| σ | σ | σ | % | |
| Control1 | 184 | 42 | 18 | 1 |
| Control2 | 155 | 30 | 9 | 0 |
| Control3 | 164 | 46 | 16 | 0 |
| F1 | 229 | 66 | 24 | 7 |
| F2 | 341 | 116 | 44 | 0 |
| F3 | 662 | 387 | 236 | 14 |
| F4 | 1112 | 834 | 634 | 190 |
Occurrence of single base relatives of TATAAT in E.coli genome. F1:-301 to -400; F2:-201 to -300; F3: -101 to -200; F4: -1 to -100. Those elements that are enriched (> = 200%) are marked by an asterisk in the last column.
| Hex | Total Occ. | F4 | F3 | F2 | F1 | Occ. % in F1 | ENR |
| TCTAAT | 595 | 29 | 30 | 47 | 46 | 7.731092 | |
| TAGAAT | 507 | 25 | 23 | 36 | 71 | 14.00394 | * |
| TATAGT | 681 | 19 | 38 | 30 | 87 | 12.77533 | * |
| TATAAG | 879 | 45 | 32 | 67 | 102 | 11.6041 | * |
| TACAAT | 903 | 42 | 56 | 71 | 109 | 12.07087 | * |
| TATAAC | 1590 | 65 | 80 | 94 | 121 | 7.610063 | |
| TATGAT | 1539 | 63 | 111 | 94 | 127 | 8.252112 | |
| TATACT | 620 | 39 | 40 | 62 | 131 | 21.12903 | * |
| CATAAT | 2448 | 90 | 106 | 115 | 132 | 5.392157 | |
| TATAAT | 1036 | 58 | 68 | 78 | 134 | 12.93436 | * |
| TGTAAT | 1870 | 74 | 80 | 129 | 140 | 7.486631 | |
| TATCAT | 2082 | 101 | 96 | 128 | 164 | 7.877041 | |
| TATATT | 1943 | 87 | 130 | 145 | 168 | 8.646423 | |
| TATTAT | 2280 | 118 | 124 | 155 | 178 | 7.807018 | |
| GATAAT | 3735 | 125 | 140 | 172 | 201 | 5.381526 | |
| TATAAA | 2304 | 98 | 144 | 177 | 236 | 10.24306 | * |
| TTTAAT | 3671 | 182 | 205 | 219 | 249 | 6.782893 | |
| TAAAAT | 2947 | 142 | 159 | 204 | 268 | 9.093994 | |
| AATAAT | 4132 | 188 | 201 | 287 | 287 | 6.945789 |
Figure 2Addressed promoter model. (A) Occurrence distribution of TATAAT, AGGAGG and AAAAAA within the -100 region using a 30-base window: -1 to -30, -10 to -40, -20 to -50, ..., -70 to -100. (B) A schematic comparison of the classical and the addressed promoter models. Blue peaks represent the canonical promoter. Red background (where present) represents the address.