| Literature DB >> 35403917 |
Kunhe Li1, Nestor Norio Oiwa2, Sujeet Kumar Mishra3, Dieter W Heermann4.
Abstract
No systematic method exists to derive inter-nucleosomal potentials between nucleosomes along a chromosome consistently across a given genome. Such potentials can yield information on nucleosomal ordering, thermal as well as mechanical properties of chromosomes. Thus, indirectly, they shed light on a possible mechanical genomic code along a chromosome. To develop a method yielding effective inter-nucleosomal potentials between nucleosomes, a generalized Lennard-Jones potential for the parameterization is developed based on nucleosomal positioning data. This approach eliminates some of the problems that the underlying nucleosomal positioning data have, rendering the extraction difficult on the individual nucleosomal level. Furthermore, patterns on which to base a classification along a chromosome appear on larger domains, such as hetero- and euchromatin. An intuitive selection strategy for the noisy optimization problem is employed to derive effective exponents for the generalized potential. The method is tested on the Candida albicans genome. Applying k-means clustering based on potential parameters and thermodynamic compressibilities, a genome-wide clustering of nucleosome sequences is obtained for C. albicans. This clustering shows that a chromosome beyond the classical dichotomic categories of hetero- and euchromatin is more feature-rich.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35403917 PMCID: PMC9001623 DOI: 10.1140/epje/s10189-022-00185-3
Source DB: PubMed Journal: Eur Phys J E Soft Matter ISSN: 1292-8941 Impact factor: 1.624
Fig. 1Steps to derive inter-nucleosomal potentials from nucleosomal positioning data. Panel A shows schematically the distribution of nucleosomes in a section of chromosome 2 of C. albicans. We split the chromosome into sections, typically of size 50,000 bp. The lower part of Panel A shows the density after applying a rolling mean averaging with window size 5000 bp, and the typical section size is chosen to be 10 times of this scale. Step 1 takes the red binary data. Based on this data, the radial distribution function (RDF) is computed. This step enables us to obtain a coarse-grained representation of the chromosome that allows for an effective and efficient simulation of a chromosome. There is also a 12,500 bp extra intersection at each end with its neighbor. This resolves the boundaries between the sections. Once the radial distribution is computed, we apply a cut-off to the potential. Using a reverse Monte Carlo simulation, we estimate a potential from the RDF. We employ an intuitive selection strategy, i.e., a noisy optimization technique to find the best fit for the generalized Lennard-Jones exponents (see Panel C)
Fig. 2Effective pair-potential, genome-wide classification, and compressibility. Panel A: Shown is the result for C. albicans. Each chromosome is partitioned into several sections, each containing 50, 000 base pairs with two additional 12, 500 bp intersections on both sides. The curves are the effective potentials, which quantify the global interaction pattern between nucleosomes. Their coloring is adjusted to be consistent with panel B. Panel B shows the classification of the sections based on the pair potentials and compressibilities for the whole genome. This classification is based on a k-means clustering into 3 clusters. They are intentionally classified to be comparable with the classification of heterochromatin, euchromatin, and differently organized. The dashed lines are the compressibility results. The two yellow and the two blue lines mark the position of known characterization. Panel C: This panel shows the reduced isothermal compressibility employing the block density method. The plot displays the process for chr. 2. The x-axis is the number of blocks . The linked dots are the compressibilities of block . By extrapolating their linear regressions, we obtain the intercepts as the compressibility, marked by triangles. Panel D: For a better representation of the complex structure, we calculated the distribution of the compressibility )