| Literature DB >> 24595272 |
Hao Tong1, Jan Mrázek2.
Abstract
Many prokaryotic and eukaryotic genomes feature a characteristic periodic signal in distribution of short runs of A or T (A-tracts) phased with the DNA helical period of ∼10-11 bp. Such periodic spacing of A-tracts has been associated with intrinsic DNA curvature. In eukaryotes, this periodicity is a major component of the nucleosome positioning signal but its physiological role in prokaryotes is not clear. One hypothesis centers on possible role of intrinsic DNA bends in nucleoid compaction. We use comparative genomics to investigate possible relationship between the A-tract periodicity and nucleoid-associated proteins in prokaryotes. We found that genomes with DNA-bridging proteins tend to exhibit stronger A-tract periodicity, presumably indicative of more prevalent intrinsic DNA curvature. A weaker relationship was detected for nucleoid-associated proteins that do not form DNA bridges. We consider these results an indication that intrinsic DNA curvature acts collaboratively with DNA-bridging proteins in maintaining the compact structure of the nucleoid, and that previously observed differences among prokaryotic genomes in terms DNA curvature-related sequence periodicity may reflect differences in nucleoid organization. We subsequently investigated the relationship between A-tract periodicity and presence of CRISPR elements and we found that genomes with CRISPR tend to have stronger A-tract periodicity. This result is consistent with our earlier hypothesis that extensive A-tract periodicity could help protect the chromosome against integration of prophages, possibly due to its role in compaction of the nucleoid.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24595272 PMCID: PMC3940949 DOI: 10.1371/journal.pone.0090940
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A list of nucleoid associated proteins.
| K Number | Gene | DNA bridging |
| K05516 | CbpA, curved-DNA-binding protein A | N/A |
| K04047 | Dps, DNA protection from starvation | N/A |
| K03557 | Fis, factor for inversion stimulation | + |
| K03746 | H-NS, histone-like nucleoid-structuring | + |
| K05787 | hupA, DNA-binding protein HU-alpha | N/A |
| K03530 | hupB, DNA-binding protein HU-beta | N/A |
| K04764 | ihfA, integration host factor alpha | N/A |
| K05788 | ihfB, integration host factor beta | N/A |
| K03719 | Lrp, leucine-responsive regulatory protein | + |
| K03632 | MukB, chromosome partition protein | + |
| K11685 | stpA, DNA-binding protein | + |
K Number and Gene are the KEGG Orthology number and abbreviated name of protein; DNA bridging:+indicates that the protein has DNA bridging function in a model organism, while N/A means that the protein is not known to form DNA bridges. The classification was adopted from ref. [1].
Summary MaxQ* statistics in genomes having and lacking specific NAPs in all analyzed genomes.
| Interaction mode | NAP | “A2T2” method | ||||
| Genomes with the NAP | Genomes lacking the NAP |
| ||||
| N | Mean MaxQ* ± SD | N | Mean MaxQ* ± SD | |||
| Bridging | H-NS | 130 | 3.08±0.76 | 443 | 2.78±0.71 | <10−4 |
| Bridging | StpA | 10 | 3.90±0.49 | 563 | 2.83±0.72 | <10−4 |
| Bridging | MukB | 47 | 3.50±0.52 | 526 | 2.79±0.72 | <10−12 |
| Bridging | Lrp | 255 | 3.00±0.73 | 318 | 2.72±0.72 | <10−5 |
| Br+Be | Fis | 143 | 3.08±0.76 | 430 | 2.77±0.71 | <10−5 |
| Bending | IHF-β | 256 | 2.91±0.73 | 317 | 2.79±0.74 | 0.028 |
| Bending | IHF-α | 250 | 2.92±0.73 | 323 | 2.78±0.73 | 0.010 |
| Bending | HU-α | 67 | 3.32±0.74 | 506 | 2.78±0.71 | <10−7 |
| Bending | HU-β | 456 | 2.88±0.72 | 117 | 2.70±0.78 | 0.006 |
| None | Dps | 306 | 2.91±0.73 | 267 | 2.77±0.73 | 0.009 |
| None | CbpA | 150 | 2.98±0.79 | 423 | 2.80±0.71 | 0.019 |
Mean MaxQ* values assessed by the “A2T2” method and standard deviations are shown for groups of genomes possessing and lacking a homolog of each NAP. N signifies the number of genomes in each group. Presence of absence of each NAP in a genome is based on data from the KEGG database ([17], http://www.genome.jp/linkdb/). ‘Interaction mode’ specifies whether the NAP forms bridges or bends upon interaction with DNA (adapted from [1]). Statistical significance of the differences was assessed by Man-Whitney U-test. Data corresponding to p-values ≥0.05 are in parentheses. DNA-bridging NAPs are in the top part of the table.
The ratio of genomes with and without the NAP is unbalanced (>10 or <0.1).
Fis can form both bridges and bends.
Dps and CbpA have not been confirmed to form bridges or bends.
Summary MaxQ* statistics in genomes having and lacking specific NAPs in proteobacteria.
| Interaction mode | NAP | “A2T2” method | ||||
| Genomes with the NAP | Genomes lacking the NAP |
| ||||
| N | Mean MaxQ* ± SD | N | Mean MaxQ* ± SD | |||
| Bridging | H-NS | 127 | 3.07±0.76 | 143 | 2.83±0.69 | 0.02 |
| Bridging | StpA | 10 | 3.90±0.49 | 260 | 2.91±0.71 | <10−4 |
| Bridging | MukB | 47 | 3.50±0.52 | 223 | 2.83±0.72 | <10−7 |
| Bridging | Lrp | 172 | 3.08±0.73 | 98 | 2.72±0.68 | <10−3 |
| Br+Be | Fis | 143 | 3.08±0.76 | 127 | 2.79±0.66 | 0.002 |
| Bending | IHF-β | 237 | (2.95) ±0.71 | 33 | (2.92) ±0.85 | 0.95 |
| Bending | IHF-α | 243 | (2.94) ±0.72 | 27 | (2.83) ±0.69 | 0.4 |
| Bending | HU-α | 67 | 3.32±0.74 | 203 | 2.82±0.69 | <10−5 |
| Bending | HU-β | 244 | (2.94) ±0.72 | 26 | (2.99) ±0.81 | 0.69 |
| None | Dps | 178 | (2.99) ±0.75 | 92 | (2.86) ±0.68 | 0.16 |
| None | CbpA | 104 | (3.02) ±0.78 | 166 | (2.90) ±0.70 | 0.28 |
See legend to Table 2.
The ratio of genomes with and without the NAP is unbalanced (>10 or <0.1).
Fis can form both bridges and bends.
Dps and CbpA have not been confirmed to form bridges or bends.
Summary MaxQ* statistics in genomes having and lacking specific NAPs in γ-proteobacteria.
| Interaction mode | NAP | “A2T2” method | ||||
| Genomes with the NAP | Genomes lacking the NAP |
| ||||
| N | Mean MaxQ* ± SD | N | Mean MaxQ* ± SD | |||
| Bridging | H-NS | 77 | 3.39±0.73 | 31 | 2.83±0.80 | 0.002 |
| Bridging | StpA | 10 | 3.90±0.49 | 98 | 3.16±0.78 | 0.002 |
| Bridging | MukB | 46 | 3.51±0.52 | 62 | 3.02±0.89 | 0.005 |
| Bridging | Lrp | 90 | 3.38±0.73 | 18 | 2.49±0.68 | <10−4 |
| Br+Be | Fis | 99 | 3.29±0.76 | 9 | 2.51±0.75 | 0.008 |
| Bending | IHF-β | 100 | 3.29±0.76 | 8 | 2.43±0.80 | 0.008 |
| Bending | IHF-α | 101 | 3.30±0.76 | 7 | 2.20±0.50 | 0.001 |
| Bending | HU-α | 66 | (3.33) ±0.74 | 42 | (3.07) ±0.85 | 0.14 |
| Bending | HU-β | 92 | (3.24) ±0.78 | 16 | (3.15) ±0.85 | 0.67 |
| None | Dps | 75 | 3.35±0.77 | 33 | 2.97±0.79 | 0.019 |
| None | CbpA | 45 | (3.14) ±0.89 | 63 | (3.29) ±0.71 | 0.44 |
See legend to Table 2.
The ratio of genomes with and without the NAP is unbalanced (>10 or <0.1).
Fis can form both bridges and bends.
Dps and CbpA have not been confirmed to form bridges or bends.
Figure 1The ‘box and whiskers’ plots showing the distribution of MaxQ* values in genomes that contain different numbers of the investigated DNA-bridging (left) and non-bridging (right) NAPs.
The bold bar shows the median, the box covers the range between 25th and 75th percentiles, and the top and bottom bars indicate the maximum and minimum excluding outliers.
Figure 2Correlation between the number of predicted DNA bends and the number of predicted binding sites in E. coli K12 for a) H-NS, b) Fis, c) IHF and d) Lrp.
Fisher exact test between NAPs and CRISPRs.
| NAP | N1 | N2 | N3 | N4 |
|
| IHF-β | 98 | 177 | 88 | 79 | 0.0005 (+) |
| HU-β | 205 | 70 | 135 | 32 | 0.132 |
| cbpA | 73 | 202 | 41 | 126 | 0.656 |
| stpA | 7 | 268 | 0 | 167 | 0.048 |
| MukB | 27 | 248 | 9 | 158 | 0.109 |
| Fis | 61 | 214 | 40 | 127 | 0.726 |
| Lrp | 109 | 166 | 71 | 96 | 0.618 |
| H-NS | 53 | 222 | 37 | 130 | 0.543 |
| IHF-α | 95 | 180 | 86 | 81 | 0.0005 (+) |
| HU-α | 33 | 242 | 17 | 150 | 0.643 |
| Dps | 111 | 164 | 106 | 61 | 0.000002 (+) |
N1 is the number of genomes with BOTH the NAP and CRISPR; N2 is the number of genomes with ONLY CRISPR; N3 is the number of genomes with ONLY the NAP; N4 is the number of genomes with NEITHER the NAP NOR CRISPR; (+) indicates a positive correlation significant with p<0.01.
Correlations between CRISPR and MaxQ* while controlling for the relationship between CRISPR and NAPs.
| NAP | Genomes with CRISPR | Genomes lacking CRISPR |
| ||
| N | Mean MaxQ* ± SD | N | Mean MaxQ* ± SD | ||
| IHF-β | 98 | 3.01±0.77 | 88 | 2.67±0.67 | 0.002 |
| IHF-α | 95 | 3.03±0.76 | 86 | 2.67±0.68 | 0.002 |
| Dps | 111 | 3.09±0.74 | 106 | 2.68±0.71 | <10−4 |
The Mann-Whitney U test was performed using only genomes that contain the specified NAP. The analysis was performed for three NAPS that show significant relationship with CRISPR (Table 5). See legend to Table 2.
NAP distribution in different phyla.
| Phyla | Sample size | H-NS | Fis | Lrp | MukB | stpA | IHF-α | IHF-β | HU-α | HU-β | Dps | CbpA |
| γ-proteobacteria | 108 | 71% | 92% | 83% | 43% | 9% | 94% | 93% | 61% | 85% | 69% | 42% |
| β-proteobacteria | 46 | 80% | 89% | 85% | 98% | 98% | 98% | 80% | 48% | |||
| α-proteobacteria | 80 | 14% | 45% | 94% | 88% | 1% | 91% | 64% | 3% | |||
| δ-proteobacteria | 21 | 5% | 10% | 29% | 5% | 100% | 100% | 90% | 19% | 90% | ||
| ε-proteobacteria | 15 | 93% | 67% | 100% | ||||||||
| Cyanobacteria | 10 | 10% | 100% | 80% | 90% | |||||||
| Firmicutes | 87 | 29% | 97% | 57% | 7% | |||||||
| Actinobacteria | 48 | 65% | 83% | 75% | 13% | |||||||
| Tenericutes | 21 | 90% | 10% | |||||||||
| Bacteroidetes | 14 | 36% | 93% | 64% | 43% | |||||||
| Chlorobi | 10 | 30% | 100% | 10% | 100% | |||||||
| Chloroflexi | 6 | 83% | 67% | 50% | ||||||||
| Chlamydiae | 7 | 14% | ||||||||||
| Thermotogae | 9 | 100% | ||||||||||
| Euryarchaeota | 35 | 17% | ||||||||||
| Crenarchaeota | 17 | |||||||||||
| Spirochaetes | 13 | 8% | 62% | 8% | 85% | 54% |
Phyla represented by more than 5 genomes in our dataset are shown. Sample size is the number of genomes representing each particular phylum. Bridging NAPs are shown in bold face. The numbers in the cells indicate the percentage of genomes in the phylum that have the specific NAP, blank cell means 0.