| Literature DB >> 21586590 |
Peter C Holmquist1, Gerald P Holmquist, Michael L Summers.
Abstract
We show that the cAMP receptor protein (Crp) binds to DNA as several different conformers. This situation has precluded discovering a high correlation between any sequence property and binding affinity for proteins that bend DNA. Experimentally quantified affinities of Synechocystis sp. PCC 6803 cAMP receptor protein (SyCrp1), the Escherichia coli Crp (EcCrp, also CAP) and DNA were analyzed to mathematically describe, and make human-readable, the relationship of DNA sequence and binding affinity in a given system. Here, sequence logos and weight matrices were built to model SyCrp1 binding sequences. Comparing the weight matrix model to binding affinity revealed several distinct binding conformations. These Crp/DNA conformations were asymmetrical (non-palindromic).Entities:
Mesh:
Substances:
Year: 2011 PMID: 21586590 PMCID: PMC3159480 DOI: 10.1093/nar/gkr369
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.DNA binding substrates used and distinct His-SyCrp1 affinities for endogenous and chimeric substrates containing either (+13)AAAA or (+13)GGCC. (A) DNA binding substrate sequences and nomenclature. (B and C) Titration curves for His-SyCrp1 binding to the endogenous wild-type slr1351 (closed circles; 1) and sll1268 (closed circles; 3) substrates are compared with their respective chimeric substrates slr1351/sll1268 (open circles; 2) and sll1268/slr1351 (open circles; 4) listed in panel A. The fraction of bound DNA is shown as a function of the concentration of His-SyCrp1. The solid and dashed lines are obtained by a nonlinear regression best fit to the three-parameter Hill equation, n = 3 ± SE (see also Supplementary Table S31). Binding reactions and electrophoresis were performed at 22°C. Reactions contained His-SyCrp1 at the indicated concentrations, 0.1 nM radiolabeled substrate, 20 µM cAMP and reaction buffer only. The non-specific competitor Rndm. was omitted.
Figure 2.Three distinct SyCrp1/DNA conformer logos (left) with the corresponding conformer model (right) contributing to the major binding mode PSSM #3 model. The ∼10.6 [black, wide peaks (5,41,42)] and 8.5 (red, narrow peaks) bp/period sine curves are aligned to the contour of the logos. (A)PSSM #1.3, a (+)monad (+13)bend conformer (non-planar, 17.7 bits positions −20 to +19). The strand contributing to the logo is the highest scoring strand orientation collected with the scanning PSSM #1.2 (Supplementary Table S32). In the conformer model (Right), DNA indicated with white fill is less conserved than DNA indicated with grey fill. Note: the protein's F-helix (small grey circle) on the left does not necessarily contact the major groove. (B) Dyad conformer (planar, 22.9848 bits positions −20 to +19). The sequences aligned are endogenous class II SyCrp1 substrates previously suggested (46) and upstream of the open reading frames sll1247, sll1520, sll1941, slr1667, slr1732, slr0442, sll1268, sll1371, sll1261, slr0869, slr1805, sll1924, slr0316 and slr2127. Sequences were aligned in the conventional strand orientation as shown (see ‘Materials and Methods’ section). (C) Dyad (+13)bend conformer (non-planar, 21.7 bits positions −20 to +19) aligned in the conventional strand orientation (see Supplementary Introduction). Slr1351 is the highest affinity known SyCrp1 binding locus and also a Dyad (+13)bend conformer. (D) PSSM #3 (Supplementary Table S34) (neither planar nor non-planar, 16.0 bits positions from −14 to +13). The PSSM #3 model considered here estimates the major binding mode. PSSM #3 includes both conventional and reverse complement strands for each of 70 loci giving 140 aligned sequences. Thus, all conformer models (right) are represented in both conventional and reverse complement orientations. PSSM #3 calculating positive W for all sequences in each functional conformer is represented by the PSSM #3 conformer model (right) showing all conformers overlaid. In the schematic conformer models, the DNA is labeled, the flanking flexible bends (if present) are labeled and SyCrp1 consists of representative F-helix circles atop homodimeric ovoids. (E) ‘Filling in the hole’, a HS algorithm. The ‘hole’ is at position +5. An endogenous DNA substrate sequence containing a flanking flexible bend is operated upon by the HS algorithm to generate a ψ-sequence changing primary kink-associated positions +3, +5 and +7 (yellow bars in panels C and E) to the most frequently occurring base identity at those positions. The primary kink positions −5 and −6 (yellow arrow) and a canonical flanking flexible bend (green arrow, green box, positions from +11 to +14) are shown. The dashed arrows indicate the dyad axis of symmetry.
Figure 3.Affinity (ΔΔG) versus weight score (W) comparison of all 10 published endogenous SyCrp1 binding sites validating the PSSM #3 model. Traditional (open symbols) and Ψ-plots (closed symbols) for both dyad and dyad (+13)bend His-SyCrp1/DNA conformers are shown. Substrate ΔΔG values relative to ΔG for ICAP were obtained from experiments performed by Omagari et al. (20). PSSM #3 was the scanning matrix for calculating all scores. The highest score fitting to PSSM #3 (either strand orientation) is shown using all 10 input sequences as the background p. (A) PSSM #3 used to calculate W for all known SyCrp1 sequence-specific binding substrates (open circles). (B) PSSM #3 calculated ψ-scores (W) of ψ (closed circles and crosses) by application of a HS algorithm. Specific ψ-sequences, ΔΔG values and PSSM #3 calculated ψ-scores are clearly listed in Supplementary Table S35. Outliers (crosses) are labeled vertically and not included in the R2 value because they are distinct from the major mode trendline. The R2 value for the major mode trendline (dotted) was obtained by fitting to y = y0 + alnx. ΔΔG is the same for any given substrate in each plot. Substrates are labeled as previously (20). Here, 4 = slr1351 and 2 = sll1268 substrates from Figure 1A. Note logarithmic scaling of the abscissa.
Figure 4.Affinity (ΔΔG) versus weight score (W) comparison validating the HS model. Traditional (open symbols) and Ψ-plots (closed symbols) for dyad (triangles) and dyad (+10)bend (circles and crosses) EcCrp/XD-DNA sequence-specific conformers. Substrate ΔΔG values relative to ΔG for ICAP were obtained from experiments performed by Lindemose et al. (34). PSSM #A was the scanning matrix for calculating all scores. The highest score fitting to PSSM #A (either strand orientation) is shown. (A) PSSM #A calculated W of dyad (open triangles) and dyad (+10)bend substrates (open circles). (B) PSSM #A calculated ψ-scores (W) of ψ for dyad (closed triangles) and dyad (+10)bend substrates (closed circles and crosses). Each set of conformer-specific sequences scored (e.g. the circles) make up the input sequence background p for scoring that conformer. Specific ψ-sequences, ΔΔG values, and PSSM #A calculated ψ-scores are clearly listed in Supplementary Table S36. Outliers (crosses) are labeled vertically and not included in the R2 value because they are distinct from a trendline. R2 shown for trend lines (dotted) was obtained by fitting to y = y0 + alnx. ΔΔG is the same for any given substrate in each graph. The point for the highest affinity substrate is G8.85 = 85 and labels correspond to the G8.## clones as described (34). Note logarithmic scaling of the abscissa. For the XD-DNA shown, G = X and A = D. (C) The entire sequence distribution constituting the major binding mode illustrated as PSSM #A (25.7 bits). All 49 unique sequences of the SELEX system make up this logo. The EcCrp/XD-DNA complex affinities for 26 dsDNA substrates with these unique sequences were determined, leaving 23 sequences having unknown affinities. (D) The dyad (+10)bend conformer illustrated as PSSM #A1 (33.5 bits). The 14 sequences that contain (+10)CCCC make up this logo. A total of 12 affinities were determined (circles and crosses in A and B). (E) The dyad conformer illustrated as PSSM #A2 (31.2 bits). The seven sequences that contain ≤2 X/C bp's total in both (+ and −10)NNNN tracts make up this logo. A total of four affinities were determined (triangles in panels A and B). The reported bit values span between positions −16 and +16. The logos in C and D are not ψ-sequences. Base positions and arbitrary regions shaded in gray are labeled as in Figure 1A. Flanking flexible bend proximal primary kink associated positions +3, +5 and +7 (yellow bars) are where ψ-sequence changes were performed.
Figure 5.Schematic folding funnel for SyCrp1 conformational selection with linear B-form DNA substrates in vitro. Planar (class II) dyad and non-planar dyad (+13)bend or non-planar (+)monad (+13)bend conformers are represented as in Figure 2. The cAMP molecule is represented with small grey circled black dots. In this model, SyCrp1 cAMP-independent binding has not yet been rejected, but has been rejected for EcCrp due to cAMP-dependent induced fit changes to the C-helix secondary structure. Attaining a bound state is dynamic in this model. SyCrp1 must exhibit conformational entropy (61) leading to conformational-capture (33) because the DNA sequence determines the sequential minor mode folding paths (arrows) leading to the lowest energy conformer. In vivo DNA topology landscapes impose additional influences due to imposed supercoiling and flanking chromatin.