Mariano Dellarole1, Ignacio E Sánchez, Gonzalo de Prat Gay. 1. Protein Structure-Function and Engineering Laboratory, Fundación Instituto Leloir and IIBBA-Conicet, Patricias Argentinas 435, Buenos Aires, Argentina.
Abstract
Binding cooperativity guides the formation of protein-nucleic acid complexes, in particular those that are highly regulated such as replication origins and transcription sites. Using the DNA binding domain of the origin binding and transcriptional regulator protein E2 from human papillomavirus type 16 as model, and through isothermal titration calorimetry analysis, we determined a positive, entropy-driven cooperativity upon binding of the protein to its cognate tandem double E2 site. This cooperativity is associated with a change in DNA structure, where the overall B conformation is maintained. Two homologous E2 domains, those of HPV18 and HPV11, showed that the enthalpic-entropic components of the reaction and DNA deformation can diverge. Because the DNA binding helix is almost identical in the three domains, the differences must lie dispersed throughout this unique dimeric β-barrel fold. This is in surprising agreement with previous results for this domain, which revealed a strong coupling between global dynamics and DNA recognition.
Binding cooperativity guides the formation of protein-nucleic acid complexes, in particular those that are highly regulated such as replication origins and transcription sites. Using the DNA binding domain of the origin binding and transcriptional regulator protein E2 from human papillomavirus type 16 as model, and through isothermal titration calorimetry analysis, we determined a positive, entropy-driven cooperativity upon binding of the protein to its cognate tandem double E2 site. This cooperativity is associated with a change in DNA structure, where the overall B conformation is maintained. Two homologous E2 domains, those of HPV18 and HPV11, showed that the enthalpic-entropic components of the reaction and DNA deformation can diverge. Because the DNA binding helix is almost identical in the three domains, the differences must lie dispersed throughout this unique dimeric β-barrel fold. This is in surprising agreement with previous results for this domain, which revealed a strong coupling between global dynamics and DNA recognition.
Eukaryotic DNA replication begins with the recognition of defined sequences termed the origin (ori) by origin-binding proteins (OBPs).1 These defined sequences are often tandem repeats of a specific binding site. After binding, OBPs recruit most often hexameric helicases and other proteins, leading to the activation of cellular DNA synthesis machinery. The initiation of replication usually represses transcription (1) to prevent the collision of the two machineries (2).Abbreviations: HPV, human papillomavirus; ori, replication origin; OBP, origin-binding protein; E2C, E2 DNA binding domain; BS, DNA binding site; DBS, DNA double binding site; ITC, isothermal titration calorimetry; CD, circular dichroism; PDB, Protein Data Bank.Viruses are the smallest self-replicating entities, with small yet complex genomes that encode many functions. Because eukaryotic viral transcription and replication rely heavily on the host machineries and share common mechanisms with host cells, the study of viral molecular genetics has yielded many insights into transcription and replication of eukaryotic genomes (3,4). ori recognition by an OBP is essential for genome replication in double-stranded DNA viruses. This function is fulfilled by the Large T antigen in SV40 (5). LANA1 is the counterpart in Kaposi’s sarcoma-associated herpesvirus (6) and EBNA1 in Epstein-Barr virus (7), and the E1 and E2 proteins are the counterparts in papillomaviruses (8). In most cases, binding of the OBP to its target DNA sequence is actively involved in the regulation of viral transcription (7,9−11) and chromosomal segregation (6,12−14).Most DNA tumor viruses bear multiple copies of the OBP target DNA sequence at the origin of replication (12,15−18). Cooperative binding (19) of regulatory proteins to these multiple target sequences is essential for the precise and economical regulation of the recruitment of the transcription and/or replication machineries (20). In vitro experiments such as electrophoretic mobility shift assays showed in all cases tested that binding to multiple target sequences at viral origins of replication takes place with positive cooperativity (15,21−25). The thermodynamic and structural changes that accompany cooperative protein−DNA binding are key to the recognition, bending, and unwinding of DNA sites during transcription and replication but are still poorly understood.Human papillomavirus (HPV) types that infect mucosal epithelia are highly infectious pathogens causing a variety of lesions that go from innocuous to malignant, where the most widespread is cervical cancer (26). Of the more than 100 types of HPV, there are more than 50 mucosal HPV types, all of them belonging to the alpha genus (27). Types 11 (low cancer risk) and types 16 and 18 (high cancer risk) are among the most prevalent and therefore are used as model systems for papillomavirus research. The noncoding upstream regulatory region (URR) of the HPV genome is ∼900 bp long and contains the target DNA sequences for the HPV E1 and E2 proteins (Figure 1A). Binding of E2 to its adjacent binding sites BS1 and BS2 proximal to the transcription start and ori sites is fundamental for the recruitment of E1 helicase, which in turn unwinds the origin of replication and recruits the host replication machinery (8,28,29). E2 also regulates the expression of viral proteins: binding to tandem sites BS1 and BS2 (double binding site, DBS) leads to repression of oncogene transcription by steric displacement of Sp1 and TATA binding protein (TBP) from their binding sites (Figure 1A) (30−32). Binding of E2 to the more distant BS4 activates viral transcription and DNA looping between BS4 and DBS (33−35). The sequence and relative positions of E2, Sp1, and TBP binding sites are highly conserved in the alpha genus (18) (Figure 1B), underscoring the importance of this region and of the DBS in the life cycle of the virus. Because the switch between replication and transcription in HPV is spatially and temporally regulated by the E2 protein along the virus life cycle, E2 is considered as a master regulator.
Figure 1
E2 binding sites in the alpha papillomavirus upstream regulatory region. (A) Schematic view of the upstream regulatory region of a prototypical alpha papillomavirus genome. Shown are flanking ORFs L1 and E6, E2 binding sites BS1, BS2, BS3, and BS4, the E1 binding site, the Sp1 binding site, and the TATA box. (B) Sequence logo (57,58) of the cis-responsive elements of the E6 promoter. (C) Model of the complex of two E2C homodimers with two adjacent sites separated by one base. Under our experimental conditions, E2C is always dimeric. The model was constructed by aligning two copies of a HPV6 E2C−DNA complex (PDB entry 2ayg) using PyMol (Delano Scientific LLC). (D) Sequence of the double-stranded oligonucleotide DBS from HPV16, with BS1 colored blue and BS2 red.
E2 binding sites in the alpha papillomavirus upstream regulatory region. (A) Schematic view of the upstream regulatory region of a prototypical alpha papillomavirus genome. Shown are flanking ORFs L1 and E6, E2 binding sites BS1, BS2, BS3, and BS4, the E1 binding site, the Sp1 binding site, and the TATA box. (B) Sequence logo (57,58) of the cis-responsive elements of the E6 promoter. (C) Model of the complex of two E2C homodimers with two adjacent sites separated by one base. Under our experimental conditions, E2C is always dimeric. The model was constructed by aligning two copies of a HPV6 E2C−DNA complex (PDB entry 2ayg) using PyMol (Delano Scientific LLC). (D) Sequence of the double-stranded oligonucleotideDBS from HPV16, with BS1 colored blue and BS2 red.The E2 protein is highly conserved among papillomavirus and consists of an ∼400-amino acid chain, with an N-terminal transactivation domain, a C-terminal dimeric DNA binding domain (E2C), and a flexible “hinge domain” (36). E2C is a homodimeric β-barrel protein (36−41), which binds to a highly conserved pseudopalindromic target site (Figure 1B). Interestingly, this unique fold has been found only in the DNA binding domain of Epstein-Barr nuclear antigen 1 (EBNA1), which acts as an OBP, but with otherwise no amino acid or DNA site sequence homology (42). The structure of the EBNA1 DNA binding domain bound to its target DNA was modeled on two adjacent sites, and it was proposed that a 20° unwinding in the DNA was required to explain the positive cooperativity, although no quantitative analysis was conducted (25,43). When bound to E2, the target DNA is in the B form and presents a global bend angle of 24−43°, as revealed by the structures of HPV18 and HPV6 E2C domains with DNA (44,45).We have been using HPV16E2C as a model for folding and protein−DNA interaction mechanisms (36). Detailed mapping of the binding interface allowed us to determine that all individual side chain−base interactions are additive as opposed to cooperative in energetic terms, with none of the interface interactions acting as a “hot spot” as frequently found in protein−protein interfaces (46). Formation of the complex between HPV16E2C and BS2 can take place via two alternative kinetic routes (47−49). One of them can be described as two-state, while the other populates at least two intermediates (48,49); we were able to describe a DNA−protein interaction landscape for each of the routes (48,49). In addition, we determined that recognition of DNA by E2C is an enthalpically driven process (50−52). More recently, we showed that specific recognition of DNA by E2C implies a coupling among histidine protonation, global structural cooperativity, and dynamics, through an “indirect readout” effect on the protein side (52).Using the binding of E2C to its tandem double-site cognate sequence as a model, we set out to investigate the thermodynamic basis for the cooperative recognition of this viral replication origin as a model for replication initiation and transcription repression, and in particular for DNA tumor virus tandem site origins. In addition, we intended to address conformational changes in the DNA that must take place for DNA replication to start. We determined the thermodynamic basis for binding cooperativity using isothermal titration calorimetry (ITC), a challenging task considering the complexity of the system, and correlated it most likely to partial local unstacking (53). Given the possibility of different but highly related HPVE2C domains, we compared binding of E2C domains from HPV11, HPV16, and HPV18 (E2C-11, E2C-16, and E2C-18) to individual single sites and the double site from the HPV16 genome. This allowed us to test for the consequences of sequence divergence among representative viral types on binding cooperativity. Besides a model system, gene expression and replication control in HPV are essential for cancer progression and a pursued antiviral target.
Experimental Procedures
Proteins
E2C-11 and E2C-18 were recombinantly expressed, purified, and stored as previously described for E2C-16 (54). The molecular weight and purity of recombinant proteins were tested by matrix-assisted laser desorption ionization (MALDI) and by sodium dodecyl sulfate−polyacrylamide gel electrophoresis (SDS−PAGE). Protein concentrations were determined spectroscopically using the molar extinction coefficients (ε280): 30940, 41940, and 33920 μM−1 cm−1 for E2C-11, E2C-16, and E2C-18, respectively. The three domains are known to form homodimers in solution with subnanomolar dissociation constants (44,55,56). Thus, the concentration of E2C monomers in our experiments can be considered negligible.
DNA
Oligonucleotides were purchased from Integrated DNA Technologies (Coralville, IA). Sequences are shown in Figure 1D. We performed annealing by mixing equal amounts of the oligos in 10 mM BisTris buffer (pH 7.0) and 50 mM NaCl, incubating the mixture for 5 min at 95 °C, and slowly cooling the mixture to 25 °C over 16 h. This yielded a double-stranded oligonucleotide. No detectable single-stranded oligonucleotide was present as judged by 20% polyacrylamide gel electrophoresis. The double-stranded oligonucleotide concentration was calculated using molar extinction coefficients (ε260) obtained from the nucleotide composition of 369300, 352400, and 715100 μM−1 cm−1 for BS1, BS2, and DBS, respectively.
Sequence Logo
Sequence logos were generated with WebLogo (57,58) and the aligned DNA. All alpha papillomavirus genomes were obtained from the International Committee on Taxonomy of Viruses data Base (59) (taxonomy ID 151340) and manipulated as described previously (18). We used the alignment editors BioEdit version 7.0.8 (T. Hall, Ibis Biosciences) and Jalview version 2.2.1 (60) for sequence manipulation. Nucleotide positions with a proportion of ≥60% gaps were removed for the sake of clarity.
Isothermal Titration Calorimetry (ITC)
Proteins and DNA were extensively dialyzed at 4 °C against the measurement buffer prior to the experiment. Binding experiments were performed using the VP-ITC calorimeter (MicroCal). The volume of each injection was 8 or 10 μL, except that of the first injection, which was 2 μL. Syringe rotation was fixed to 290 rpm. Titrations were continued beyond saturation levels to allow for determination of heats of ligand dilution. ORIGIN version 5.0, supplied with the calorimeter, was used to subtract the heat of dilution. Measurements were performed in the presence of 200 mM phosphate so that the binding constants could be determined from the ITC measurements (51,61).
Binary Complexes
BS1 or BS2 at concentrations in the range of 80−100 μM was titrated into the ITC cell containing 8−10 μM E2C. Global fits to a single-site binding model of data obtained from at least two titrations for each site were conducted using SEDPHAT (62).
Ternary Complexes
In “direct” titrations of the ternary complex, DBS at concentrations in the range of 32.9−40.5 μM was titrated into the ITC cell containing 4.5−4.9 μM E2C. In reverse titrations of the ternary complex, E2C at concentrations in the range of 41.2−48.1 μM was titrated into the ITC cell containing 3.3−4.4 μM DBS. Global fits to a two-site binding model of data obtained from direct and reverse titrations of each ternary complex were conducted using SEDPHAT (62).
Spectroscopy
Circular dichroism and absorbance spectra were recorded in a Jasco (Japan) J815 instrument in a 0.1 cm path length cuvette. All complexes were studied at a concentration of 15 μM. All complexes were incubated for 30 min at 298.15 K and centrifuged for 15 min at 13000 rpm before measurement. Ten scans were accumulated and averaged for each measurement. The spectrum of the buffer was recorded and subtracted from all sample spectra. Data analysis was performed in pro Fit (Quantum Soft).
Kinetics
Kinetics of structural changes were recorded in an Applied Photophysics SX18-MV spectrophotometer. Absorbance was measured at 270 nm (4 nm slit). One volume of 2 μM DNA was mixed with 1 volume of protein to yield final protein:DNA stoichiometries of 2:1, 4:1, and 8:1.
Population Analysis
Fractional populations of free DBS, DBS bound to a single E2C domain, and the ternary complex were calculated as a function of E2C concentration using the measured values for KBS1, KBS2, and Kcoop as follows:
Results
Binding Model and Considerations
The first step in investigating the cooperative recognition of the HPV origin used as a model is to determine the thermodynamic basis for the recognition of the separate cognate DNA binding sites BS1 and BS2 that are configured in tandem in the HPV genome (DBS) to two E2C homodimers (Figure 1). We make use of a general binding model (Scheme 1) (21)in which E2C:DBS1 denotes binding of an E2C homodimer to BS1 within DBS to form a binary complex, with an equilibrium constant KBS1, and E2C:DBS2 denotes binding of an E2C dimer to BS2 within DBS to form a binary complex, with an equilibrium constant KBS2. The presence of flanking nonspecific DNA does not affect E2C binding (49). Finally, E2C:DBS:E2C denotes binding of two E2C homodimers to DBS to form a ternary complex, with an equilibrium constant KBS1KBS2Kcoop. The cooperativity constant Kcoop is 1 for independent binding of the two E2C homodimers. A Kcoop of >1 implies positive cooperativity; that is, binding of the second E2C molecule is favored when an E2C molecule is already bound to DBS (19). A Kcoop of <1 implies negative cooperativity; that is, binding of the second E2C molecule is disfavored when an E2C molecule is already bound to DBS (21).
Scheme 1
For the sake of clarity, we have built an informative scheme of the ternary complex between DBS and two E2C molecules (Figure 1C) by manually superimposing the most distal bases of two E2C−DNA complexes so that the distance between BS1 and BS2 is one base, as in the HPV16 genome (Figure 1D). Albeit rough, this approximation clearly shows that independent bending to the two sites would position the two E2C molecules at opposite sides of the DNA. Thus, on the basis of this model, additive binding of E2C to DBS can in principle be expected in the HPV16 genome, and protein−protein contacts seem unlikely. This work aims to answer these questions in mechanistic and thermodynamic terms.
Binding of E2C to Isolated BS1 and BS2
We conducted the analysis of the cognate interaction between E2C-16 and DBS (Figure 1D). Throughout this work, all experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. To determine all binding constants in Scheme 1 with reliability, we first investigated binding of E2C-16 to the isolated sites BS1 and BS2 (Figure 1D) by ITC. Figure 2 shows integrated, concentration-normalized data for injection of BS1 (blue circles) or BS2 (red circles) into the cell containing E2C-16 (Figure 2A). The observed reaction stoichiometry is ≥0.8, and the c value is >10 (Table 1), validating fitting a 1:1 binding model to the data. The fit yields the binding free energies (ΔG) and their enthalpic and entropic components (Table 1). Binding of E2C-16 to the individual cognate sites was enthalpy-driven in both cases, where binding to BS2 was significantly stronger than binding to BS1 (4-fold difference in the equilibrium binding constant, K).
Figure 2
Binding of E2C to BS1 and BS2. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. The top panels show integrated, concentration-normalized binding isotherms of BS1 (blue) or BS2 (red) injected into the ITC cell containing protein. The solid lines represent global fits (62) to a single-site binding model of data points of at least two independent experiments for each homologous protein: (A) E2C-16, (D) E2C-11, and (G) E2C-18. The middle panels show difference CD spectra for binding of E2C to BS1 (blue) and BS2 (red): (B) E2C-16, (E) E2C-11, and (H) E2C-18. The spectra of E2C and BS1 or BS2 were subtracted from the spectra of each 1:1 E2C complex. All data are shown to the same scale for direct comparison. The bottom panels show difference absorbance spectra for binding of E2C to BS1 (blue) and BS2 (red): (C) E2C-16, (F) E2C-11, and (I) E2C-18. The spectra of E2C and BS1 or BS2 were subtracted from the spectra of each 1:1 E2C complex.
Table 1
Thermodynamics of Binding of E2C to BS1 and BS2
DNA
parameter
E2C-16
E2C-11
E2C-18
BS1
KBS1 (M−1)
(4.0 ± 0.5) × 106
(1.7 ± 0.6) × 106
(2.5 ± 0.1) × 106
ΔG (kcal/mol)
−9.0 ± 0.1
−8.5 ± 0.2
−8.7 ± 0.1
ΔH (kcal/mol)
−15.2 ± 0.4
−5.8 ± 0.7
−19.5 ± 0.9
−TΔS (kcal/mol)
6.2 ± 0.3
−2.7 ± 0.5
10.8 ± 0.8
Na
0.8 ± 0.1
0.8 ± 0.1
0.8 ± 0.1
c valueb
28 ± 5
11 ± 3
21 ± 1
BS2
KBS2 (M−1)
(1.1 ± 0.4) × 107
(4.6 ± 0.7) × 106
(7.9 ± 0.3) × 106
ΔG (kcal/mol)
−9.6 ± 0.2
−9.1 ± 0.1
−9.4 ± 0.1
ΔH (kcal/mol)
−22.7 ± 0.4
−7.9 ± 0.1
−24.1 ± 0.3
−TΔS (kcal/mol)
13.1 ± 0.2
−1.2 ± 0.1
14.7 ± 0.3
Na
0.8 ± 0.1
0.9 ± 0.1
0.9 ± 0.1
c valueb
82 ± 29
29 ± 2
64 ± 5
Stoichiometry of the reaction calculated from the incompetent fraction of the ligand (62).
Ratio of the ligand concentration to the product of the dissociation constant and the measured stoichiometry (61). Errors were derived from the standard deviation of at least two single experiments.
Binding of E2C to BS1 and BS2. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. The top panels show integrated, concentration-normalized binding isotherms of BS1 (blue) or BS2 (red) injected into the ITC cell containing protein. The solid lines represent global fits (62) to a single-site binding model of data points of at least two independent experiments for each homologous protein: (A) E2C-16, (D) E2C-11, and (G) E2C-18. The middle panels show difference CD spectra for binding of E2C to BS1 (blue) and BS2 (red): (B) E2C-16, (E) E2C-11, and (H) E2C-18. The spectra of E2C and BS1 or BS2 were subtracted from the spectra of each 1:1 E2C complex. All data are shown to the same scale for direct comparison. The bottom panels show difference absorbance spectra for binding of E2C to BS1 (blue) and BS2 (red): (C) E2C-16, (F) E2C-11, and (I) E2C-18. The spectra of E2C and BS1 or BS2 were subtracted from the spectra of each 1:1 E2C complex.Stoichiometry of the reaction calculated from the incompetent fraction of the ligand (62).Ratio of the ligand concentration to the product of the dissociation constant and the measured stoichiometry (61). Errors were derived from the standard deviation of at least two single experiments.Formation of E2C−DNA complexes induces bending of the DNA toward the minor groove and small rearrangements in the protein, as indicated by crystallographic evidence (44,45) and solution measurements (38,54). Because of the large difference in extinction coefficients in favor of DNA, the contribution of the protein to the spectra is minor and easily subtractable in absorbance and negligible in near-UV CD, which allows us to determine that spectral changes correspond mostly to DNA conformation (53,63). Thus, we may attribute the changes in CD and absorbance spectra to DNA bending and torsion. We recorded spectra of isolated DNA sites, E2C-16, and the 1:1 protein−DNA complexes. The CD spectra of free and bound BS1 and BS2 show a positive peak at 275 nm, a crossover at 260 nm, and a negative peak at 245 nm (Figure S1 of the Supporting Information), characteristic of B-form DNA (53,63,64), as expected from structural data (44,45). We calculated difference CD and absorbance spectra by subtracting the individual spectra of the free DNA and protein from the spectra of the corresponding complexes. The difference spectra represent the combined spectral changes mainly in DNA upon binding. As expected from previous results (38,44,45,54), we observed an increase in both the magnitude of the CD signal at 275 nm (Figure 2B) and the absorbance at 265 nm (Figure 2C) for the binary complex of E2C-16 with either BS1 or BS2.As stated above, the possibility of evaluating E2C domains from related HPV types not only provides insight into the different mechanisms among the types related to the virus biology but also more importantly allows us to compare a differential behavior related to our reference data on HPV16E2C with evolutionarily related variants of the domain. Binding of E2C-11 (Figure 2D) and E2C-18 (Figure 2G) to BS1 and BS2 is also enthalpy-driven, and binding to BS2 is significantly stronger than binding to BS1, as in the case of HPV16 (Table 1). This feature is in agreement with our previous proposal that sequence specificity is conserved across E2C domains from alpha papillomaviruses (18). There are differences in enthalpic and entropic contributions to binding energy among the three proteins that merit further detailed investigation with more HPV-type variants. Binding of E2C-11 and E2C-18 to BS1 and BS2 followed by CD (Figure 2E,H) and absorbance (Figure 2F,I) leads to spectroscopic changes comparable to those observed for E2C-16. This indicates that the conformational changes in solution that take place in BS1 and BS2 upon binding of the three domains are similar, in agreement with previously reported crystallographic data for E2C-18 (44) and E2C-6 (45).
Thermodynamics of Binding of E2C to the Tandem DNA Site DBS
Binding of E2C-16 to the tandem DBS was investigated by ITC (Figure 1C). Figure 3A shows the experiment conducted by direct titration, i.e., injecting DNA into a cell containing the protein. In the first injections, the large excess of protein leads to predominant formation of the ternary E2C−DBS−E2C complex. The apparent DNA:protein stoichiometry, Ndirect, is on the order of 1:2, and the apparent enthalpy of binding (Figure 3A, approximately −38 kcal/mol) is close to the sum of the ΔH values for binding to isolated BS1 and BS2 (see Table 1). Figure 3B shows the experiment conducted by reverse titration, i.e., injecting protein into a cell containing DNA. As expected for a multisite system, the observed shape of the isotherms depends on the order of addition in the titration experiment (see Figure S2 of the Supporting Information for an explicative scheme). In the first injections, the large excess of DNA leads to predominant formation of the binary E2C−DBS1 and E2C−DBS2 complexes. The apparent protein:DNA stoichiometry Nreverse is on the order of 2:1, and the apparent enthalpy of binding is close to the average of the ΔH values for binding to isolated BS1 and BS2 (compare the ΔH of −20 kcal/mol from Figure 3B with data from Table 1).
Figure 3
Binding of E2C to the tandem DNA site DBS. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. Each panel shows the ITC data in both raw and integrated, concentration-normalized form. The top panels show direct ITC titrations, i.e., injecting DBS into a cell containing E2C: (A) E2C-16, (C) E2C-11, and (E) E2C-18. The bottom panels show reverse ITC titrations, i.e., injecting E2C into a cell containing DBS: (B) E2C-16, (D) E2C-11, and (F) E2C-18. Lines are global fits (62) to the two-site binding model of direct and reverse titrations for each homologous protein.
Binding of E2C to the tandem DNA site DBS. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. Each panel shows the ITC data in both raw and integrated, concentration-normalized form. The top panels show direct ITC titrations, i.e., injecting DBS into a cell containing E2C: (A) E2C-16, (C) E2C-11, and (E) E2C-18. The bottom panels show reverse ITC titrations, i.e., injecting E2C into a cell containing DBS: (B) E2C-16, (D) E2C-11, and (F) E2C-18. Lines are global fits (62) to the two-site binding model of direct and reverse titrations for each homologous protein.We have globally fitted (62) the binding model shown in Scheme 1 to the direct and reverse titrations (see Experimental Procedures). The binding constants and enthalpies of binding of E2C-16 to BS1 and BS2 within DBS were held constant during fitting to the values measured for the isolated sites (Table 1). The fit yields parameters for the cooperativity constant Kcoop (and, thus, the free energy of cooperativity ΔGcoop) and its enthalpic and entropic components (Table 2). The binding model (Scheme 1) fit the data very well (Figure 3A,B), and the observed stoichiometry is close to the expected value (Table 2), thus validating our approach. We confirmed direct and reverse stoichiometries using fluorescence and anisotropy titrations and electrophoresis mobility shift assays (Figure S3 of the Supporting Information). E2C-16 binds to its cognate tandem DNA site DBS with positive cooperativity [Kcoop = 2.6 ± 0.3 (Table 2) (ΔGcoop = −0.6 ± 0.1 kcal/mol)]. Fitting a model with two nonidentical independent sites (Kcoop = 1) to the data resulted in a 2-fold higher χ2 value (Figure S4 of the Supporting Information), confirming that the formation of the ternary complex is cooperative. The dissection of ΔG into its enthalpic and entropic components is shown in Figure 4A. Interestingly, positive cooperativity is governed by entropy for E2C-16, with a −TΔScoop of −1.5 ± 0.5 kcal/mol (Table 2).
Table 2
Thermodynamics of Binding of E2C to the Tandem DNA Site DBS
parameter
E2C-16
E2C-11
E2C-18
Ndirecta
0.4 ± 0.1
0.3 ± 0.2
0.5 ± 0.1
Nreverseb
1.6 ± 0.1
1.9 ± 0.1
1.3 ± 0.1
Kcoop
2.6 ± 0.3
3.4 ± 0.8
0.8 ± 0.2
ΔGcoop (kcal/mol)
−0.6 ± 0.1
−0.7 ± 0.1
0.1 ± 0.1
ΔHcoop (kcal/mol)
0.9 ± 0.5
−3.1 ± 0.5
2.3 ± 1.1
−TΔScoop (kcal/mol)
−1.5 ± 0.5
2.3 ± 0.6
−2.1 ± 1.2
Stoichiometry of the reaction for direct titration, i.e., injecting DNA into a cell containing protein.
Stoichiometry of the reaction for reverse titration, i.e., injecting protein into a cell containing DNA. Stoichiometries were calculated from the incompetent fraction of ligand (62). Errors were derived from Monte Carlo analysis of global fits (62).
Figure 4
Cooperative binding of E2C to the tandem site DBS. The top panels show thermodynamic dissection of (A) E2C-16, (C) E2C-11, or (E) E2C-18 binding to DBS. Negative values represent a favorable additional contribution to binding, whereas positive values represent an unfavorable additional contribution to binding. The bottom panels show the fractional population of free DBS, DBS with one site occupied, and DBS with both sites occupied as a function of (B) E2C-16, (D) E2C-11, or (F) E2C-18 concentration. Solid lines were calculated using the measured values of KBS1, KBS2, and Kcoop. Dotted lines were calculated using the measured values of KBS1, KBS2, and Kcoop set to 1.
Cooperative binding of E2C to the tandem site DBS. The top panels show thermodynamic dissection of (A) E2C-16, (C) E2C-11, or (E) E2C-18 binding to DBS. Negative values represent a favorable additional contribution to binding, whereas positive values represent an unfavorable additional contribution to binding. The bottom panels show the fractional population of free DBS, DBS with one site occupied, and DBS with both sites occupied as a function of (B) E2C-16, (D) E2C-11, or (F) E2C-18 concentration. Solid lines were calculated using the measured values of KBS1, KBS2, and Kcoop. Dotted lines were calculated using the measured values of KBS1, KBS2, and Kcoop set to 1.Stoichiometry of the reaction for direct titration, i.e., injecting DNA into a cell containing protein.Stoichiometry of the reaction for reverse titration, i.e., injecting protein into a cell containing DNA. Stoichiometries were calculated from the incompetent fraction of ligand (62). Errors were derived from Monte Carlo analysis of global fits (62).Binding of E2C-11 to DBS shows a Kcoop of 3.4 ± 0.8 (Table 2) (ΔGcoop = −0.7 ± 0.1 kcal/mol), similar to that of E2C-16. However, the positive cooperativity of E2C-11 bears an enthalpic origin, with a ΔHcoop of −3.1 ± 0.5 kcal/mol (Figure 4C). E2C-18 binds to the tandem DNA site DBS in a noncooperative manner with a Kcoop of 0.8 ± 0.2 (ΔGcoop = 0.1 ± 0.1 kcal/mol). The χ2 value is only slightly improved by inclusion of cooperativity in the binding model (Figure S4 of the Supporting Information). Surprisingly, the near-zero ΔGcoop for E2C-18 originates in compensating enthalpic and entropic contributions, which are similar in magnitude to those observed for E2C-11 and E2C-16 (Figure 4E).We illustrate the consequences of binding cooperativity by simulating the fractional population of free DBS, DBS with one site occupied, and DBS with both sites occupied (Figure 4, bottom panels) (see Experimental Procedures). Solid lines were calculated using the measured values of KBS1, KBS2, and Kcoop, and dashed lines were calculated using the measured values of KBS1, KBS2, with Kcoop set to 1. Positive binding cooperativity for E2C-16 and E2C-11 has two linked effects (Figure 4B,D). Relative to additive binding, the ternary complex is populated at lower E2C concentrations and the binary complexes are populated to a lesser degree.
DNA Conformational Changes Associated with Binding of E2C to DBS
We used CD and absorbance spectroscopy as reporters of changes in the DNA conformation of DBS upon formation of the cognate ternary complex with E2C-16 and compared them to those associated with individual single-site binding. We recorded spectra for the isolated DBS and for the ternary E2C−DBS−E2C complex. The CD spectra of the free DBS and of the ternary complex exhibit a positive peak at 275 nm, a crossover at 260, and a negative peak at 245 nm (Figure S5 of the Supporting Information), characteristic of B-form DNA (53,63,64). This shows that the DNA of DBS forms a B type of helix both in its free form and in the ternary complex with E2C-16.We obtained difference CD and absorbance spectra for the ternary complex by subtracting the spectra of DBS and of E2C-16 (in a 1:2 molar ratio) from the spectra of the E2C−DBS−E2C complex. These difference spectra represent the combined CD and absorbance changes in DBS DNA upon binding (solid lines in Figure 5A,B). We then compared the difference spectra (both CD and absorbance) for the ternary complex with the sum of difference spectra for each individual E2C−BS1 and E2C−BS2 binary complex (dashed lines in Figure 5A,B). Interestingly, there are evident differences between the two spectra, indicating that formation of the ternary complex involves a conformational rearrangement in DBS DNA that goes beyond those observed for single-site binding (Figure 2B,C).
Figure 5
Spectroscopic changes associated with binding of E2C to the tandem DNA site DBS. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. Solid lines are difference spectra for binding of E2C to DBS. The spectra of DBS and two E2C molecules were subtracted from the spectra of the E2C−DBS−E2C ternary complex. Dashed lines are sums of difference spectra for the E2C−BS1 and E2C−BS2 complexes. The top panels show difference CD spectra: (A) E2C-16, (C) E2C-11, and (E) E2C-18. The bottom panels show difference absorbance spectra: (B) E2C-16, (D) E2C-11, and (F) E2C-18.
Spectroscopic changes associated with binding of E2C to the tandem DNA site DBS. Experiments were performed in 200 mM sodium phosphate (pH 7) and 0.2 mM DTT at 298 K. Solid lines are difference spectra for binding of E2C to DBS. The spectra of DBS and two E2C molecules were subtracted from the spectra of the E2C−DBS−E2C ternary complex. Dashed lines are sums of difference spectra for the E2C−BS1 and E2C−BS2 complexes. The top panels show difference CD spectra: (A) E2C-16, (C) E2C-11, and (E) E2C-18. The bottom panels show difference absorbance spectra: (B) E2C-16, (D) E2C-11, and (F) E2C-18.We have also calculated the respective difference spectra for the ternary complexes of DBS with E2C-11 and E2C-18 (solid lines in Figure 5C−F) and the sum of difference spectra for the binary complexes of E2C-11 and E2C-18 with BS1 and BS2 (dashed lines in Figure 5C−F). The considerable differences between the solid and dashed lines for both E2C-11 and E2C-18 show that, as in the case of E2C-16, formation of the ternary complexes involves conformational rearrangements in DBS, in addition to the ones observed for single-site binding. Remarkably, the difference between the conformation of DBS (solid lines) and the sum of individual sites BS1 and BS2 (dashed lines) varies among the three E2C domains. This suggests that the structural changes in DBS associated with formation of the ternary complex depend on the E2C domain variant.
Kinetics of E2C−DBS Interaction: Time-Dependent DNA Structural Changes
Finally, the kinetics of formation of the ternary complex of E2C-16 with DBS was investigated by stopped-flow techniques (see Experimental Procedures). We mixed equal volumes of 2 μM E2C-16 and 1 μM DBS and followed the changes in absorbance. The E2C−DNA association events previously determined by us and others showed a kon of 0.1−1.4 × 109 M−1 s−1[45,47]. Thus, under these conditions that require high concentrations of the reactants to produce a good signal-to-noise ratio, the two molecules associate within the ∼2 ms dead time of the instrument and any further observed signal changes should correspond to unimolecular conformational rearrangements. Under these conditions, the changes in the absorbance signal observed at equilibrium (Figure 5B) indeed occur within the dead time of the instrument (data not shown). Thus, in the case of E2C-16, the DNA conformational rearrangements in DBS take place on the submillisecond time scale, most likely in parallel with the bimolecular association event.Conversely, changes in DNA conformation upon binding of E2C-11 are observed within the experimental window (Figure 6). The kinetics is described well by a single-exponential function with an observed rate constant of 44.9 ± 0.5 s−1. This rate constant is independent of E2C concentration in the range of 2−8 μM (Figure S6 of the Supporting Information), indicating that the rate-limiting step of the reaction under these conditions is a unimolecular process. Thus, for E2C-11, the ternary complex undergoes an observable conformational transition in DBS after the association step, which on the basis of the known association rates, takes place within the experimental dead time (45,47). We did not observe any changes in absorbance upon binding of E2C-18 to DBS, as expected from the spectra of the complex and the separate components (Figure S6F of the Supporting Information).
Figure 6
Kinetics of structural changes upon binding of E2C to the tandem DNA site DBS. DBS (1 μM) was mixed with 2 μM E2C-11 in 200 mM sodium phosphate (pH 7) and 1 mM DTT at 298 K, and the changes in absorbance at 270 nm were followed. Shown are the normalized data and the fit to a single-exponential function.
Kinetics of structural changes upon binding of E2C to the tandem DNA site DBS. DBS (1 μM) was mixed with 2 μM E2C-11 in 200 mM sodium phosphate (pH 7) and 1 mM DTT at 298 K, and the changes in absorbance at 270 nm were followed. Shown are the normalized data and the fit to a single-exponential function.
Discussion
To complement the structure and biology of eukaryotic DNA origin replication sites, and in particular those of tumor DNA viruses, we undertook a thermodynamic and conformational dissection of the complex between the DNA recognition domain of the master regulator protein, E2, and two DNA sites arranged in tandem at the ori and cis-regulatory region of HPV type 16 (Figure 1). The thermodynamic analysis of data for the formation of ternary complexes by ITC is a challenging task (refs (65) and (66) and reviewed in ref (67)), especially so in the case of protein−DNA interaction where it is common in the literature to find incomplete cooperative ITC analysis (e.g., refs (68) and (69)). The usual technical strategies for addressing cooperative protein−DNA recognition are EMSA (e.g., ref (21)) and fluorescence spectroscopic assays (e.g., refs (70) and (71)). However, our binding system presents a scenario in which recognition of isolated DNA sites is not equivalent (Figure 2) and binding to tandem sites is not independent (Figure 3), increasing the degree of complexity. To address the cooperative interaction of E2C and DBS, we started by using EMSA and fluorescence spectroscopy, but we could not interpret the population of the species involved beyond the verification of the stoichiometry (Figure S3 of the Supporting Information), because of the intricate nature of the binding model. Because the quantitative discrimination of the individual complexes is not possible on the basis of the fluorescence signal, we concentrated on the ITC data.The fact that we could measure the enthalpies of each individual process independently and accurately (Figure 2) allowed us to tackle the binding analysis of the ternary E2C−DBS−E2C complex by ITC. We first measured binding to the isolated sites BS1 and BS2 separately (Figure 2) and subsequently global fitting of direct and reverse titrations of the tandem site (Figure 3) (62). Our data for E2C-16 produce the best fit with a cooperativity constant of 2.6 ± 0.3 (Table 2). This value is in line with the low cooperativity constant of 1.7 measured earlier via an EMSA (21). However, modeling of the EMSA data involves a larger error and assumed that the affinity of E2C for BS1 and BS2 was the same (21), which was shown to be incorrect (Figure 2, Table 1, and ref (72)).Formation of the binary complexes of BS1 and BS2 with E2C-16 is associated with small changes in the CD signal at 275 nm of approximately 1 mdeg and changes of 0.04−0.06 absorbance unit at 270 nm (Figure 2), in agreement with previous results (38,54). The spectral changes upon binding of E2C domains to DBS differ from those expected in the case of independent binding by ∼4 mdeg at 275 nm for CD and ∼0.3 absorbance unit at 270 nm (Figure 5). We hypothesize that the additional structural changes in DBS associated with binding of E2C-16 are larger than those observed upon binding to the isolated sites, so that the actual structure of the ternary complex likely differs significantly from the model shown in Figure 1C. It is worth noting that (1) the shape of the circular dichroism spectrum shows that distortion of the DNA from the B form is small, (2) the increase in the magnitude of the CD signal at 275 nm suggests a decrease in the DNA helix winding angle and an increase in the base pair twist angle (73), and (3) the increase in the absorbance signal could be indicative of base unstacking (74). From this evidence, we suggest that the change in DNA structure involves significant perturbation of base conformation but small backbone rearrangements because CD indicates that overall B conformation is maintained. We can visualize this as a small bubble without perturbing overall backbone conformation, but this primary event is crucial to helicase activity and subsequent assembly of the cellular DNA replication machinery. Our kinetic experiments show that there is a very small free energy barrier for this change in structure, because it takes place on the submillisecond time scale. From a functional point of view, simultaneous binding of two E2C-16 molecules to DBS may assist in the fast unwinding of the double helix at the start of replication fork progression (8,28,29,75), repress transcription trough displacement of the TATA binding protein and other regulatory proteins (30−32,75), and/or facilitate long-range interactions via DNA looping through its N-terminal domain (34,35).Moreover, binding of the homologous E2C domains from HPV types 11, 16, and 18 to DBS provides us with an excellent model for studying how protein sequence divergence can modulate the thermodynamics of binding. The overall sequence identities for the DNA-binding domains are 54% for the 16/18 pair, 61% for the 16/11 pair, and 49% for the 11/18 pair. E2C−DNA binding is determined to a great degree by the side chains of the six highly conserved helix 1 residues involved in direct readout of the DNA bases (18), which are identical in the domains studied here. In agreement with this, the free energies of binding of the three proteins to BS1 and BS2 are very similar, differing by at most 0.5 kcal/mol (Table 1). This result contrasts with the variable Kcoop for association with DBS, with E2C-11 and E2C-16 showing cooperative binding and E2C-18 showing additive binding (Table 2). Thus, the protein regions involved in cooperative binding must be variable among the three homologous proteins. Our cumulative folding and DNA binding studies on this domain (36) and recent NMR work show an unusual delocalized coupling among folding, dynamics, and DNA binding, which led us to conclude an “indirect readout from the protein side” (52), in agreement with the results presented in this work.Binding cooperativity as described by Kcoop is similar for E2C-11 and E2C-16. However, cooperativity is enthalpy-driven for E2C-11 and entropy-driven for E2C-16 (Figure 4). In addition, the associated spectroscopic changes are clearly different for the two domains (Figure 5). In our view, depicting the binding of E2C-11 and E2C-16 to DBS as “similarly cooperative” would not be adequate. According to Kcoop, E2C-18 binds to DBS in an additive, noncooperative manner. In the absence of other data, this would suggest that binding of the first protein molecule is indistinguishable from binding of the second one (Figure 4). Nevertheless, the lack of cooperativity for E2C-18 is due to compensating changes in enthalpy and entropy and is likely associated with sizable changes in the structure of the DNA that go beyond those expected for additive binding (Figure 5). Thus, we conclude that describing the binding of E2C-18 to DBS simply as noncooperative is not whole. Whereas changes in ΔG are less than 5%, differences in the enthalpic and entropic components are much larger and even of the opposite sign.The activation of DNA replication is often linked to the transcription program. In the case of double-stranded DNA viruses, genome replication represses the expression of early proteins. This includes those participating directly in replication, such as HPV E2. Consequently, late protein expression is triggered, including the structural elements of the capsid, the shell that will transport the amplified viral DNA to new hosts (3,4). Our detailed structural and thermodynamic characterization of an important binding event for the coupling between replication and transcription (20) may aid the understanding of this process in papillomaviruses and in double-stranded DNA viruses in general.
Authors: Jon C D Houtman; Patrick H Brown; Brent Bowden; Hiroshi Yamaguchi; Ettore Appella; Lawrence E Samelson; Peter Schuck Journal: Protein Sci Date: 2007-01 Impact factor: 6.725