| Literature DB >> 16614447 |
Alfred Cioffi1, Tomara J Fleury, Arnold Stein.
Abstract
The large amount of non-coding DNA present in mammalian genomes suggests that some of it may play a structural or functional role. We provide evidence that it is possible to predict computationally, from the DNA sequence, loci in mouse liver nuclei that possess distinctive nucleosome arrays. We tested the hypothesis that a 100 kb region of DNA possessing a strong, in-phase, dinucleosome period oscillation in the motif period-10 non-T, A/T, G, should generate a nucleosome array with a nucleosome repeat that is one-half of the dinucleosome oscillation period value, as computed by Fourier analysis of the sequence. Ten loci with short repeats, that would be readily distinguishable from the pervasive bulk repeat, were predicted computationally and then tested experimentally. We estimated experimentally that less than 20% of the chromatin in mouse liver nuclei has a nucleosome repeat length that is 15 bp, or more, shorter than the bulk repeat value of 195 +/- bp. All 10 computational predictions were confirmed experimentally with high statistical significance. Nucleosome repeats as short as 172 +/- 5 bp were observed for the first time in mouse liver chromatin. These findings may be useful for identifying distinctive chromatin structures computationally from the DNA sequence.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16614447 PMCID: PMC1435979 DOI: 10.1093/nar/gkl078
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Computational predictions of nucleosome repeats base upon the DNA sequence and the results of the experimental tests of these predictions in mouse liver nuclei
| Locus number | Chromosome number-probe size (bp) | Nucleosome repeat | Experiment fit param. | Fourier signal characteristics (Gaussian) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Theoretical prediction | Experimental value | Y-intercept | SD of fit (bp) | Peak SD (bp) | Peak height (VWG count) | Ratio | Window (kb) | Predicted range | ||
| 1 | 4–419 | 337/2 = (169) | 172 | −0.6 | 11 | 9.0 | 1174 | 2.2 | 100 | 90–110 |
| 2 | 7–500 | 348/2 = 171 | 172 | −1.4 | 5.8 | 8.0 | 773 | 2.1 | 100 | 90–110 |
| 3 | 3–413 | 347/2 = 174 | 174 | 0.7 | 8.5 | 7.0 | 1653 | 4.5 | 120 | 40–500 |
| 4 | 7–474 | 348/2 = 174 | 178 | −4.6 | 7.2 | 15 | 918 | 6.6 | 100 | 90–200 |
| 5 | 10–336 | 358/2 = 179 | 176 | −5.4 | 6.5 | 9.0 | 975 | 3.1 | 100 | 70–120 |
| 6 | 2–375 | 358/2 = 179 | 180 | −0.6 | 13 | 17 | 811 | 9.3 | 90 | 80–100 |
| 7 | 2–450 | 357/2 = 179 | 183 | −1.3 | 12 | 16 | 960 | 7.0 | 100 | 90–170 |
| 8 | 12–573 | 359/2 = 180 | 179 | −14 | 2.8 | 7.0 | 1368 | 3.4 | 100 | 70–130 |
| 9 | 15–407 | 360/2 = 180 | 180 | −23 | 11 | 14 | 788 | 5.0 | 100 | 90–110 |
| 10 | 3–552 | 360/2 = 180 | 181 | −2.1 | 4.9 | 7.5 | 931 | 3.7 | 90 | 70–110 |
| Mada | 2–820 | 370/2 = 185 | 183 | −3.8 | 9.2 | 7.5 | 1011 | 2.2 | 110 | 70–130 |
| 3–342 | 381/2 = 191 | 190 | −17 | 9.2 | 10.5 | 855 | 3.8 | 120 | 100–130 | |
| 19–395 | 420/2 = (210) | 187 | −11 | 5.7 | 14 | 1052 | 2.6 | 100 | 90–110 | |
| 1–678 | bulk | 191 | −12 | 9.2 | ||||||
| 9–403 | bulk | 192 | −7.8 | 12 | ||||||
| 18–735 | bulk | 194 | −42 | 8.1 | ||||||
| Et br | bulk | 194 | −33 | 10 | ||||||
aDinucleosome period mean value (of predominant Gaussian)/2 = predicted nucleosome repeat.
bAll probes exactly centered on the computation window of best ratio.
cRatio of the signal peak area to the next largest peak area in the physiological region.
dRange over which peak area of interest is at least two times greater than the next highest Gaussian in the physiological range.
eParentheses denote extreme values that might not be physiological for mouse liver chromatin.
fSee reference (17).
gNo signal.
hnon-specific probe.
Figure 1Predicted and experimentally determined nucleosome arrangements for two loci. (A) FTs of the curves of the oscillations of period-10 VWG with nucleotide number for 100 kb windows. Probe 9–403 (left) was from the center of a locus on chromosome 9 that does not possess a predominant in-phase period-10 VWG oscillation, thus predicting that nucleosome arrays in this locus should have the properties of the bulk chromatin. Probe 4–419 (right) was from the center of a locus on chromosome 4 that exhibits a predominant in-phase period-10 VWG oscillation at a period of 337 bp (arrow), thus predicting a nucleosome array with a very short nucleosome repeat value of 337 bp/2 = 169 bp. The Fourier curve (black points and spline curve) was represented by five Gaussian peaks (blue curves) that summed (orange circles) to fit the computed Fourier points in the physiological dinucleosome region. (B) Southern blots of agarose gel electrophoresis of the DNA fragments obtained from MNase digests of nuclei. Portions of the same digests were probed with probe 9–403 (left) or probe 4–419 (right), and the resultant nucleosome ladders were compared. Lanes 1 were from a 1.5 min digest and lanes 2 were from a 2 min digest. Lanes D were from a HindIII + PstI digestion of purified mouse liver genomic DNA to assess the specificity of the probes, lanes M1 were labeled 100 bp ladders (sizes of selected fragments are indicated on the right) for nucleosome oligomer sizing, and lanes M2 were labeled size markers (see Materials and Methods) for restriction fragment sizing. The nucleosome 2, 4 and 6mers detected by each probe are identified. (C) Analysis of the nucleosome repeat length from the ladders of the 2 min digest detected with each probe. Plots of nucleosome oligomer size (Base pairs) versus nucleosome oligomer number are shown. Probe 9–403, squares; probe 4–419, circles. The equations of the best straight line fits and the R2 values are shown. The nucleosome repeat lengths are the slopes of the lines.
Figure 2Effect of varying the window size on the FT of the period-10 VWG oscillations for locus 1. All windows were centered on the midpoint of probe 4–419. The superimposed curves for windows ranging from 7 to 120 kb can be distinguished from each other by their colors.
Figure 3Simulated nucleosome ladders having two different repeats illustrating the vernier effect. A ladder having 195 bp spacing is shown adjacent to a ladder having 180 bp spacing. At least 10 bands can be resolved for each. The superposition of these two ladders (merged) leads to a loss of resolution after the 5mer due to a vernier effect.
Figure 4Mixing experiment providing an estimation of the percentage of mouse liver chromatin that possesses a short repeat. (A) Nuclei from mouse liver (ML) or Matthiola petals (MP) were digested with MNase, and the purified DNA fragments were run on an agarose gel which was stained with ethidium bromide to visualize the nucleosome ladders from total genomic DNA for each sample (lanes ML and MP). The shorter nucleosome repeat (183 ± 5 bp) of the MP chromatin compared to the ML chromatin (195 ± 5 bp) is evident. The DNA from the two chromatin samples was mixed together in the proportions indicated (%MP) and analyzed on the same gel. Nucleosome oligomer bands for the MP chromatin are indicated. The brackets denote the upper region of the gel containing the oligomer DNA fragments greater than 5mers. (B) The 20% MP lane is shown adjacent to the ML lane, and the photograph was expanded for comparing the upper region of the gel. The lanes were precisely aligned using the 100 bp ladder markers immediately flanking the gel. (C) Densitometer scan of the ML and 20% MP lanes shown in (B).