| Literature DB >> 22618872 |
Satoshi Yamasaki1, Tohru Terada, Hidetoshi Kono, Kentaro Shimizu, Akinori Sarai.
Abstract
Proteins recognize a specific DNA sequence not only through direct contact (direct readout) with base pairs but also through sequence-dependent conformation and/or flexibility of DNA (indirect readout). However, it is difficult to assess the contribution of indirect readout to the sequence specificity. What is needed is a straightforward method for quantifying its contributions to specificity. Using Bayesian statistics, we derived the probability of a particular sequence for a given DNA structure from the trajectories of molecular dynamics (MD) simulations of DNAs containing all possible tetramer sequences. Then, we quantified the specificity of indirect readout based on the information entropy associated with the probability. We tested this method with known structures of protein-DNA complexes. This method enabled us to correctly predict those regions where experiments suggested the involvement of indirect readout. The results also indicated new regions where the indirect readout mechanism makes major contributions to the recognition. The present method can be used to estimate the contribution of indirect readout without approximations to the distributions in the conformational ensembles of DNA, and would serve as a powerful tool to study the mechanism of protein-DNA recognition.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22618872 PMCID: PMC3458528 DOI: 10.1093/nar/gks462
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Scatter plot of PSTnative [P(s|Θ) with the native sequence for s] against I calculated in dimeric ATGC space (a) and in tetrameric RY space (b). Triangles correspond to the tetramer sequence steps for which experimental data obtained by base mutations suggested the involvement of the indirect readout. The labeled points (‘a’ to ‘c’) are the examples discussed in the manuscript.
Figure 2.Schematic representation of the crystal structure of homing endonuclease I-PpoI in complex with DNA (PDB ID: 1A74) (top). Plots of PSTnative [PSTnative({ATGC}4): solid line, PSTnative({RY}2): dashed line, PSTnative({RY}4): dotted line], specificity scores [I({ATGC}2): solid line, I({RY}2): dashed line, I({RY}4): dotted line], number of contacts (direct: solid line with plus marks, water-mediated: dashed line with x marks) versus DNA sequence (bottom). Positions of Steps a, b and c are indicated with boxes in the sequence, and positions of their central base-pair steps are indicated with gray dashed lines in the structure image and in the plots.
Figure 3.Scatter plots of ΔI1 = I({ATGC}4) – I({RY}2) (a) and of ΔI2 = I({RY}4) – I({RY}2) (b) against I({RY}2). Diagonal dotted lines represent ΔI1 = I({RY}2) and ΔI2 = I({RY}2). Dotted lines downward to the right represent I({ATGC}4) = 1 and I({RY}4) = 1.