| Literature DB >> 23143266 |
Abstract
The 3D higher order organization of chromatin within the nucleus of eukaryotic cells has so far remained elusive. A wealth of relevant information, however, is increasingly becoming available from chromosome conformation capture (3C) and related experimental techniques, which measure the probabilities of contact between large numbers of genomic sites in fixed cells. Such contact probabilities (CPs) can in principle be used to deduce the 3D spatial organization of chromatin. Here, we propose a computational method to recover an ensemble of chromatin conformations consistent with a set of given CPs. Compared with existing alternatives, this method does not require conversion of CPs to mean spatial distances. Instead, we estimate CPs by simulating a physically realistic, bead-chain polymer model of the 30-nm chromatin fiber. We then use an approach from adaptive filter theory to iteratively adjust the parameters of this polymer model until the estimated CPs match the given CPs. We have validated this method against reference data sets obtained from simulations of test systems with up to 45 beads and 4 loops. With additional testing against experiments and with further algorithmic refinements, our approach could become a valuable tool for researchers examining the higher order organization of chromatin.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23143266 PMCID: PMC3592477 DOI: 10.1093/nar/gks1029
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Main components of the proposed computational approach to recover a conformation ensemble from a given set of reference CPs.
Figure 2.Schematic representations of (a) restrained bead-chain polymer model used for BD simulations of a 30-nm chromatin fiber subjected to looping constraints and (b) application of the LMS algorithm to the optimization of the parameters in the general linear model (Equation 9) used to predict restraint spring constants from reference CPs.
Parameter values used to simulate the restrained bead-chain polymer model of chromatin and to provide a physically realistic approximation of the mechanical properties of chromatin, as currently known from experiments
| Parameter | Symbol | Reduced units | SI units |
|---|---|---|---|
| Thermal energy | 1.0 | ||
| Bead mass | 1.0 | ||
| Lennard–Jones size parameter | 1.0 | 30 nm | |
| Lennard–Jones energy parameter | |||
| Bead separation | 30 nm | ||
| Contact distance | 45 nm | ||
| Bond spring constant | |||
| Persistence length | 120 nm | ||
| Bending energy constant | |||
| Time step/damping constant |
aEnergy per bead per degree of freedom at T = 300 K.
bRepresentative value based on the experimental measurement of 23.3 MDa for a 15.5-kb fragment of 30-nm chromatin upstream of the chicken -globin locus (42).
cFollowing Rosa et al. (35), equivalent to assuming that contacts between chromatin fibers are mediated by proteins of 15-nm diameter.
dFrom experiments, the stretching modulus is 5–150 pN (43), hence ranges from to .
eFrom experiments, 30 – 200 nm (43).
fTo maximize conformation sampling efficiency, we used the largest value of found to maintain stability of the BD simulations. A lower bound for can be estimated by considering a chromatin sphere of radius r = 15 nm and using with the viscosity of water = 890 µPa s at 25°C and 1 bar (44). Then, 18 ns.
Figure 6.Initial conformations used in eight trials of the ensemble recovery procedure for a chain with 35 beads and 6 restraints (third row in Table 2), shown before (top) and after (bottom) minimization of the potential energy, Equation 1. Images were generated using UCSF Chimera (48).
Validation of the conformation ensemble recovery procedure using reference CPs estimated by simulating test systems of increasing complexity
aCharacteristics of test systems used to generate conformation ensembles from which reference CPs were estimated.
bResults of ensemble recovery procedure applied to reference CPs.
cRMSD between recovered and reference values of restraint spring constants (k), CPs (p) and mean inter-bead distances (), achieved when using a general linear model (Equation 9) with the specified number of parameters per spring constant.
dLabel used to identify test system in Figure 3.
eNumber of beads in the chain.
fNumber of restraints used to induce the loops in the bead chain.
gNumber and type of induced loops.
hNumber of restraints found by peak detection algorithm.
iNumber of trials performed to select the optimal ensemble.
jAverage computation time per trial in hours when performing each trial with n + 1 parameters on one core of a 2.2-GHz AMD Opteron Processor 2427.
Figure 3.Energy-minimized conformations of the test systems used to generate reference CPs for validating the proposed computational method. The systems are labeled as in Table 2. Images were generated using UCSF Chimera (48).
Figure 4.Heat maps representing (a) reference and (b) recovered CPs for a chain of 45 beads with (left) four free loops or (right) four tied loops. Free loops result from connecting loop end-beads with harmonic restraints (gray arcs in top-left schematic), while tied loops result from connecting middle beads across free loops (dotted arcs in top-right schematic). Blue circles on the maps identify pairs of beads that were restrained (a) when generating reference CPs and (b) when performing the ensemble recovery procedure. Test systems with two and three loops (Table 2) yielded a similarly good visual match between reference and recovered CP maps (data not shown).
Figure 5.Variation of bead CPs with mean inter-bead distance in reference ensembles for chain (a) without restraints, (b) with four free loops and (c) with four tied loops. Each point represents one of the possible bead pairs in the chain. Error bars are standard deviations over 10 independent simulations. The dashed line in (a) is a fit of the power law , giving . Inset in (a): looping probability versus loop length in ensemble for chain without restraints. The curve in this inset is a fit of Equation 3 from (26). The dashed curves in (b) and (c) are power laws with exponents −8 and −3. Distances are in units of .
Figure 7.Plots of RMSD of mean inter-bead distances (Equation 17) against RMSD of CPs (Equation 15) for all trials of the ensemble recovery procedure and for all tested systems. Each point represents RMSD values obtained from an ensemble of 106 conformations at the end of a particular trial. Inset: enlarged view of boxed area. Distances are in units of .
Figure 8.Comparison of (a,b) bead CPs, (c,d) mean inter-bead distances and (e,f) standard deviation of inter-bead distance determined from optimal recovered ensemble to respective quantities determined from reference ensemble for a chain with 45 beads and (a,c,e) 4 free loops or (b,d,f) 4 tied loops. Each point represents one of the possible bead pairs in the chain. The dashed lines are plots of y = x, not linear fits. Distances are in units of . Better correlations were observed for test systems with three and two loops (data not shown).
Figure 9.Spatial representation of reference (left) and recovered (right) conformation ensembles for bead chains of 45 beads with (a) four free loops and (b) four tied loops. Each depicted ensemble consists of 100 conformations extracted at equal intervals from 108 steps of BD simulation and aligned on the beads that were connected by harmonic restraints in the simulations used to generate the reference ensembles. The coloring order along each chain is red, yellow, green, cyan and blue. Images were generated using UCSF Chimera (48).