| Literature DB >> 15918906 |
Mythily Ganapathi1, Pragya Srivastava, Sushanta Kumar Das Sutar, Kaushal Kumar, Dipayan Dasgupta, Gajinder Pal Singh, Vani Brahmachari, Samir K Brahmachari.
Abstract
BACKGROUND: Global regulatory mechanisms involving chromatin assembly and remodelling in the promoter regions of genes is implicated in eukaryotic transcription control especially for genes subjected to spatial and temporal regulation. The potential to utilise global regulatory mechanisms for controlling gene expression might depend upon the architecture of the chromatin in and around the gene. In-silico analysis can yield important insights into this aspect, facilitating comparison of two or more classes of genes comprising of a large number of genes within each group.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15918906 PMCID: PMC1173084 DOI: 10.1186/1471-2105-6-126
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Distribution of putative S/MARs in housekeeping and tissue specific genes.
| presence of S/MAR | 26.1 | 34.1 | 19.1 | 20.6 |
| absence of S/MAR | 26.1 | 19.2 | 34.5 | 25.1 |
#Housekeeping genes, §Tissue specific genes,
* ChrClass and MAR Finder programs were used for prediction of S/MARs in housekeeping and tissue specific genes regulatory regions (5' & 3' regions – 2000 bp each). The common predictions of both the programs were used for the analysis. The data is represented as percentage of genes with predicted S/MARs in 5' and 3' regions of 525 housekeeping and 532 tissue specific genes.
Distribution of poly (dA.dT) repeats of various lengths in the 5' upstream regions of housekeeping and tissue specific genes.
| § | |||||
| >10 | 443 | 345 | 268 (51.0) | 240 (43.0) | 1.31E-04 |
| >11 | 381 | 297 | 243 (46.3) | 214 (38.4) | 3.25E-04 |
| >12 | 339 | 248 | 226 (43.1) | 184 (33.0) | 6.10E-05 |
| >13 | 295 | 207 | 209 (39.8) | 156 (28.0) | 4.29E-05 |
| >14 | 251 | 168 | 188 (35.8) | 128 (22.9) | 2.77E-05 |
| >15 | 209 | 140 | 164 (31.2) | 111 (19.9) | 8.83E-05 |
| >16 | 180 | 116 | 146 (27.8) | 99 (17.7) | 7.58E-05 |
| >17 | 155 | 103 | 134 (25.5) | 88 (15.8) | 2.42E-04 |
| >18 | 138 | 79 | 120 (22.9) | 71 (12.7) | 2.23E-05 |
| >19 | 112 | 66 | 101 (19.2) | 59 (10.6) | 2.61E-04 |
| >20 | 100 | 58 | 92 (17.5) | 53 (9.5) | 5.32E-04 |
#Housekeeping genes, *Tissue specific genes. A total of 525 housekeeping and 558 tissue specific genes were analysed. The numbers in parentheses (4th & 5th columns) represent the percentage of genes containing the repeat stretch.
§Difference in the distribution of poly (dA.dT) stretches in Hkg and Tsg analysed by applying t-test (for normalizing the difference in sample size). The repeat lengths from >12 to >18 bp are showing very significantly different distributions between Hkg and Tsg. The distributions were examined in 2000 bp upstream region from the gene start site.
Figure 1Nucleosome formation potential score distributions for 5' regions of housekeeping and tissue specific genes. The 5' sequences of human housekeeping and tissue specific genes were analysed by Recon for distribution of nucleosome formation potential scores. Frequency distribution histograms were plotted for scores in various intervals (range -3.2 to +3.2). (A) and (B) show the distribution of nucleosome formation potential scores at 400 and 2000 bp upstream from the gene start site respectively. Nucleosomal density is significantly lower for housekeeping genes as compared to tissue specific ones, in regions close to the gene start site.
Figure 2Nucleosome formation potential score distributions for 5' regions at different positions from the gene start site in housekeeping and tissue specific genes. The 5' regions of 800, 1200 and 1600 bp from the gene start site of housekeeping and tissue specific genes were taken for the analysis. Frequency distribution histograms were plotted for Recon scores in various intervals (range -3.2 to +3.2). (A), (B) and (C) show the distribution of nucleosome formation potential scores at 800, 1200 and 1600 bp upstream from the gene start site respectively. As we move upstream from the gene start site, the difference in the nucleosome formation potentials between housekeeping and tissue specific genes gradually fades away.
t-test P-values for the difference in the distribution of nucleosome formation potential scores between housekeeping and tissue specific genes.
| 400 | 3.53E-13 | 8.73E-17 | 1.28E-17 | 4.45E-23 |
| 800 | 1.16E-24 | 6.27E-24 | 6.64E-12 | 6.65E-22 |
| 1200 | 6.91E-26 | 1.44E-24 | 2.10E-09 | 1.64E-18 |
| 1600 | 1.72E-24 | 6.99E-24 | 2.72E-07 | 5.63E-15 |
| 2000 | 2.55E-25 | 1.84E-25 | 3.22E-05 | 6.71E-13 |
*denotes the length of 5' upstream region from the gene start site taken for the analysis. The scores were compared in the four Recon score intervals of relevance -1.2 to -1, -1 to -0.8, 0.8 to 1 and 1 to 1.2.
Correlation coefficients of total expression levels (log10) with nucleosome formation potential scores in housekeeping (Hkg) and tissue specific genes (Tsg).
| Hkg# | 0.10 | 0.14 | 0.04 | -0.01 |
| Tsg* | -0.10 | -0.12 | 0.15 | 0.17 |
#Housekeeping genes, *Tissue specific genes.
The correlation was drawn in the four Recon score intervals of relevance -1.2 to -1, -1 to -0.8, 0.8 to 1 and 1 to 1.2.
Comparison of the level of correlation between nucleosome formation potential scores and contrasting expression levels of genes.
| Hkg ↑↑ | 0.17 | 0.30 | -0.10 | -0.26 |
| Hkg ↓↓ | 0.36 | 0.35 | -0.28 | -0.39 |
| Tsg ↑↑ | 0.03 | 0.02 | 0.26 | 0.11 |
| Tsg ↓↓ | -0.12 | -0.14 | 0.15 | 0.16 |
*High and low expression level genes were categorised in Hkg (housekeeping genes) and Tsg (tissue specific genes) groups separately. # Recon score intervals. The genes classified as high and low expression genes in both Hkg and Tsg had atleast a 10-fold difference in their expression levels. The up (↑↑) and down (↓↓) arrows denote high expression and low expression respectively.
The distribution of Alu repeats in 5' upstream regions of housekeeping (Hkg) and tissue specific genes (Tsg) is represented in terms of the number of copies and basepairs covered by Alu repeats.
| Alu | 866 | 575 | 20.1 | 12.3 |
#Housekeeping genes, *Tissue specific genes.
t-test P-values for the difference in the distribution of Alu repeats in 5' upstream regions of housekeeping and tissue specific genes.
| Alu | 3.02E-08 | 7.63E-11 |
Figure 3A model for chromatin landscape in 5' regions of tissue specific and housekeeping genes. (A) depicts the repressive role of chromatin in maintaining tissue specific gene expression profiles in a chromosome. The chromatin organisation in the 5' regions of Tsg1 and Tsg2, two different tissue specific genes dispersed in the chromosome is shown. Nucleosome formation potentials and S/MARs – the boundary elements, are enriched in their upstream regions and might play a major role in facilitating tissue specific expression. This is likely to be a local effect since neighbouring genes might have a different expression pattern. (B) depicts the chromatin organisation in the 5' regions of Hkg1, Hkg2 and Hkg3, three housekeeping genes clustered in the chromosome. The presence of low nucleosome formation potential regions and enrichment of nucleosome destabilising elements ensure an open chromatin configuration in this domain. As Hkg generally cluster together, they are depleted in S/MARs relative to tissue specific genes as shown in the present analysis by the significant absence of predicted S/MARs in both 5' and 3' regions of housekeeping genes as compared to tissue specific genes.