Mammalian global genomic nucleotide excision repair requires lesion recognition by XPC, whose detailed binding mechanism remains to be elucidated. Here we have delineated the dynamic molecular pathway and energetics of lesion-specific and productive binding by the Rad4/yeast XPC lesion recognition factor, as it forms the open complex [Min, J. H., and Pavletich, N. P. (2007) Nature 449, 570-575; Chen, X., et al. (2015) Nat. Commun. 6, 5849] that is required for excision. We investigated extensively a cis-syn cyclobutane pyrimidine dimer in mismatched duplex DNA, using high-level computational approaches. Our results delineate a preferred correlated motion mechanism, which provides for the first time an atomistic description of the sequence of events as Rad4 productively binds to the damaged DNA.
Mammalian global genomic nucleotide excision repair requires lesion recognition by XPC, whose detailed binding mechanism remains to be elucidated. Here we have delineated the dynamic molecular pathway and energetics of lesion-specific and productive binding by the Rad4/yeastXPC lesion recognition factor, as it forms the open complex [Min, J. H., and Pavletich, N. P. (2007) Nature 449, 570-575; Chen, X., et al. (2015) Nat. Commun. 6, 5849] that is required for excision. We investigated extensively a cis-syncyclobutane pyrimidine dimer in mismatched duplex DNA, using high-level computational approaches. Our results delineate a preferred correlated motion mechanism, which provides for the first time an atomistic description of the sequence of events as Rad4 productively binds to the damaged DNA.
Global genomic nucleotide excision
repair (NER) is a key cellular defense mechanism against a broad range
of DNA lesions, such as certain ones derived from polycyclic aromatic
chemical pollutants, reactive oxygen species,[3] and the ultraviolet (UV) components of sunlight.[4] The latter are among the most prevalent of DNA lesions
and are responsible for the majority of humanskin cancers, including
melanoma.[5] The vital importance of NER
is demonstrated in the humanxeroderma pigmentosum (XP) diseases,
caused by mutations in NER genes. The XP-C variant stems from mutations
in the NER lesion recognition protein, XPC; these mutations cause
extraordinary sensitivity to sunlight with a great proclivity for
skin cancers at an early age. The failure to repair UV-derived lesions,
predominantly cis-syncyclobutane pyrimidine dimers
(CPD) (Figure A),
is responsible for this cancer-prone syndrome.[5,6]
Figure 1
(A) CPD
lesion (chemical and three-dimensional structures, the
latter taken from the crystal structure of PDB entry 1T4I(34)) and DNA sequence context taken from the crystal structure
of PDB entry 2QSG. (B) Productive binding
model of Rad4 and CPD-containing DNA, with the sequence shown in panel
A, based on crystal structures of PDB entries 2QSG and 1T4I.[34] The protein is shown
as a cartoon, the CPD lesion as sticks, and the DNA as a cartoon with
a surface overlay. Hydrogen atoms are not shown for the sake of clarity.
The XPC-RAD23B protein complex has been identified as the critical
factor that recognizes NER-susceptible lesions[7−9] in mammals:
it subsequently recruits TFIIH, whose XPD helicase verifies the lesion.[4,10−12] Other NER factors are then recruited to ultimately
excise a 24–32-mer oligonucleotide containing the damage.[9,13−15] When the CPD lesion contains normal partner bases
adenine, the auxiliary proteins UV-DDB, composed of the heterodimer
of DDB1 and DDB2, first recognize the damage site and then hand it
off to XPC.[4,16] However, when faced with mismatched
partners, XPC can recognize the lesion site directly.[15,17]How the broad range of DNA lesions differing greatly in chemical
structure are recognized by XPC has been elucidated by an X-ray crystallographic
structure of Saccharomyces cerevisiaeRad4-Rad23,
the yeast ortholog of humanXPC-RAD23B, in complex with a CPD-containing
DNA duplex with mismatched thymine partners [Protein Data Bank (PDB[18]) entry 2QSG].[1] The β-hairpin
of the Rad4 BHD3 domain is inserted into the DNA helix from the major
groove side while BHD2 contacts the minor groove, the CPD lesion (unresolved)
is extruded and not contacted by the protein, and the two mismatched
thymine partners are flipped out into Rad4 pockets. This structure,
the productive open complex that correctly recruits TFIIH, strongly
supports the proposal that the NER lesion recognition factor does
not recognize the lesion itself, but the local distortions, dynamics,
and destabilizations in the DNA associated with them.[15,19−33] We have used molecular modeling and molecular dynamics (MD) simulations
(detailed in the Supporting Information) to model unresolved residues in BHD2 and the extruded CPD lesion
of the crystal structure (PDB entry 2QSG), yielding a completed productive open
complex (Figure B).(A) CPD
lesion (chemical and three-dimensional structures, the
latter taken from the crystal structure of PDB entry 1T4I(34)) and DNA sequence context taken from the crystal structure
of PDB entry 2QSG. (B) Productive binding
model of Rad4 and CPD-containing DNA, with the sequence shown in panel
A, based on crystal structures of PDB entries 2QSG and 1T4I.[34] The protein is shown
as a cartoon, the CPD lesion as sticks, and the DNA as a cartoon with
a surface overlay. Hydrogen atoms are not shown for the sake of clarity.It has been observed that the
wide array of DNA lesions are repaired
by NER at very different rates,[21,24,28] and the origin of this diversity remains poorly understood. It is
hypothesized that differences in lesion-induced local DNA duplex destabilization
may play a key role in governing the rate of formation of the productive
open complex for lesion recognition by XPC,[35] which ultimately leads to excision. Of particular interest are certain
bulky DNA lesions that stabilize the DNA duplex and are NER-resistant,[36,37] because they persist and can cause cancer-initiating mutations.A kinetic gating mechanism has recently been proposed on the basis
of essentially identical crystal structures of Rad4-Rad23 complexed
with damaged[1] and undamaged DNA,[2] and temperature-jump perturbation spectroscopy
(T-jump) experiments: the probability of forming the productive open
complex for lesion recognition is a result of the kinetic competition
between Rad4 residence time and the rate of forming the open complex
at the lesion site.[2] At a lesion, the free
energy barrier to form the productive open complex, compared with
the residence time of XPC, determines its recognition for NER; furthermore,
it is proposed that distorting and destabilizing lesions decrease
the opening time for productive binding and increase the residence
time of XPC. To understand how lesions signal their presence in the
face of predominantly undamaged DNA, it is necessary to characterize
the dynamic molecular pathway and energetics of Rad4/XPC lesion-specific
and productive binding to damaged DNA. The present study addresses
this vital gap in our knowledge by investigating the mismatched CPD
lesion. Our work provides a complement to experimental investigations,
such as single molecule[38] and T-jump[2] studies that are elucidating binding dynamics
of the XPC NER factor.We have hypothesized two types of mechanisms
for binding of Rad4-Rad23
suggested by the crystal structures of its apoprotein without substrate,
and in a complex with a CPD lesion (PDB entries 2QSF and 2QSG):[1] (1) conformational capture[39] and (2) correlated motion (they are illustrated in Figure ). These are representative
of current understanding of protein–DNA recognition mechanisms,
whose balance is believed to depend on the particular system.[39] An example of the conformational capture mechanism
has been observed crystallographically for the AlkD DNA glycosylase.[40] In this mechanism, the fraction of the population
with flipped-out partners is governed by the local thermodynamic stability:
a lower flipping free energy barrier in more destabilized cases produces
a larger flipped out population (Figure A). No such population is required for the
correlated motion mechanism, where flipping occurs in concert with
other lesion recognition events (Figure B).
Figure 2
Hypothesized conformational capture and correlated
motion mechanisms
for lesion recognition by Rad4. (A) Conformational capture mechanism.
The ensemble of damaged duplexes contains a population that has one
or two bases opposite the lesion flipped out. The Rad4 scans the damaged
DNA, flipped out bases in a locally destabilized region are captured
by the BHD2 and BHD3 domains, and the BHD3 β-hairpin inserts
with eviction of the lesion. (B) Correlated motion mechanism. Scanning
is accompanied by probing for a locally destabilized region by BHD2
and BHD3 domains. Once a sufficiently destabilized region is sensed,
in correlated motion the BHD3 β-hairpin inserts, the lesion
is evicted, and the partner bases flip out to associate with BHD2
and BHD3 domains; the order of these events may be lesion-dependent.
Hypothesized conformational capture and correlated
motion mechanisms
for lesion recognition by Rad4. (A) Conformational capture mechanism.
The ensemble of damaged duplexes contains a population that has one
or two bases opposite the lesion flipped out. The Rad4 scans the damaged
DNA, flipped out bases in a locally destabilized region are captured
by the BHD2 and BHD3 domains, and the BHD3 β-hairpin inserts
with eviction of the lesion. (B) Correlated motion mechanism. Scanning
is accompanied by probing for a locally destabilized region by BHD2
and BHD3 domains. Once a sufficiently destabilized region is sensed,
in correlated motion the BHD3 β-hairpin inserts, the lesion
is evicted, and the partner bases flip out to associate with BHD2
and BHD3 domains; the order of these events may be lesion-dependent.Here we have used state-of-the-art
simulations to characterize
on an atomistic and energetic level the lesion recognition mechanism
for forming the productive open complex between Rad4-Rad23 and a CPD-containing
DNA duplex with mismatched thymine partners[1] (Figure ). To obtain
the molecular pathways and their free energy profiles, we first used
umbrella sampling[41] with restrained molecular
dynamics (MD) simulations to generate structural ensembles along selected
reaction coordinates. We used the AMBER12[42] or AMBER14[43] packages with the ff12SB
force field and a customized force field for the CPD lesion (Table S1), explicit water, and counterions. Then,
free energy profiles were calculated using the variational free energy
profile (vFEP) method[44] for the correlated
motion mechanism. Full details of modeling, reaction coordinates,
and MD simulations are given in the Supporting Information.[45−47]Via exploration of many possible combinations
of pathways for productive
binding to the open complex, we determine that the energetically preferred
pathway utilizes a correlated motion mechanism involving three stages
(Figures and 4 and Figures S2–S4): (1) initial association of the BHD2 and BHD3 domains induces DNA
duplex distortions in bending, unwinding, and stretching, to initiate
lesion-partner base flipping; (2) the two partner bases opposite the
lesion flip into the protein in a correlated fashion; and (3) the
BHD3 β-hairpin inserts into the denatured lesion site to stabilize
the open complex.
Figure 3
Correlated motion pathway. Probing model (PM), intermediate (INT),
transition state (TS), and productively bound state (PB) structures
are rendered as cartoons with CPD as sticks and F556, F597, and F599
residues as spheres. The BHD2 β-hairpin of PB is transparent
to reveal BHD3 β-hairpin (green) insertion. See the Supporting Movie for the dynamic pathway.
Figure 4
Free energy profile for the correlated motion
pathway. The indicated
states correspond to those in Figure . The statistical error for the transition state (TS)
is estimated by averaging the free energy difference between 10–35
and 35–60 ns shown in Figure S4.
For stage (1), we first docked the apo Rad4-Rad23TGD and BHD1
domains to the DNA duplex positioned as in the crystal structure of
the complex,[1] but without engagement of
the BHD2 and BHD3 domains (docking model, Figure S1); then these domains were guided to approach the lesion
site (Figure S2). A structural ensemble
we term the probing model (PM) (Figure ) was obtained, whose energy is ∼1 kcal/mol
lower than that of the docking model (Figure S2). In this structural ensemble, the DNA duplex bends slightly in
the direction of the productive binding model, the BHD2 β-hairpin
tip gains initial interactions with the DNA backbone on the minor
groove side, and the CPD and its partner bases maintain their stacked-in
conformation as in the docking model (Figure S1). As BHD2 and BHD3 associate further, an intermediate state (INT)
was identified with an energy 0.8 kcal/mol higher than that of the
PM structure (Figures and 4 and Figure S2). This intermediate state was confirmed by two unrestrained MD simulations
[initiated from two different structures (detailed in the Supporting Information), which converged to INT
(Figure S2)]. The approach of BHD2 and
BHD3 to the lesion site in proceeding from PM to INT (Figure ) is correlated with further
DNA bending and untwisting around the lesion site; these are accompanied
by gradual displacement of the CPD from its stacked-in position in
the PM (Figure S5). The distortions of
the DNA duplex cause the two partner thymines to approach Phe599 and
enter a “flipping pathway” comprised of three Phe residues
(Phe599, Phe597, and Phe556) that direct the partner bases into their
binding pocket at the BHD2-BHD3/DNA interface (Figure ).For stage (2), the 5′ partner
thymine is guided to approach
its position in the binding pocket (Figure S3). This process is the most energetically demanding one in the preferred
pathway of the correlated motion mechanism. As the 5′ thymine
flips into its binding pocket, the free energy gradually increases
to 4.2 kcal/mol, which is denoted as a transition state (TS) in the
free energy profile (Figure ). In the TS ensemble, the 5′ thymine is in its binding
pocket positioned further along the Phe flipping path, stacked with
Phe597; this is accompanied by unguided flipping of the 3′
thymine to stack on the edge of Phe597, but it is not totally inserted
into its binding pocket in BHD3 (Figure S3). The lesion site is now denatured, and Phe599 in the BHD3 β-hairpin
has reoriented for insertion of the hairpin into the duplex to fill
the cavity (Figure and Figure S5). These structural changes
result in further unwinding and stretching around the lesion site
(Figure S5).For stage (3), after
both 5′ and 3′ partner thymines
are flipped out with the 5′ thymine in its binding pocket,
the insertion of the BHD3 β-hairpin is achieved by guiding the
hairpin to approach the center of the denatured lesion site (Figure S4). From the TS, the energy drops to
−0.8 kcal/mol as the BHD3 β-hairpin inserts, providing
an energy gain of ∼5 kcal/mol (Figure ). The ensemble at −0.8 kcal/mol exhibits
a stably inserted BHD3 hairpin and converges with an unrestrained
MD initiated from our productive binding (PB) model (Figure and Figure S4). Hence, this structural ensemble has achieved the productive
binding state. The hairpin insertion is a spontaneous process once
the local denaturation at the lesion site has crossed the maximal
energy barrier at the TS, where both partner bases are flipped. Along
the insertion path of the BHD3 β-hairpin, in correlated motion,
the CPD lesion extrudes out of the major groove (Figure S5), the lesion site further unwinds (Figure S5), the 3′ thymine follows the PHE flipping
path to insert completely into its binding pocket in BHD3, stacking
with Phe556 (Figure ), and the bend angle of the DNA duplex adjusts toward the value
of 47° (Figure S5) as in our productive
binding model (Figure A).Correlated motion pathway. Probing model (PM), intermediate (INT),
transition state (TS), and productively bound state (PB) structures
are rendered as cartoons with CPD as sticks and F556, F597, and F599
residues as spheres. The BHD2 β-hairpin of PB is transparent
to reveal BHD3 β-hairpin (green) insertion. See the Supporting Movie for the dynamic pathway.Free energy profile for the correlated motion
pathway. The indicated
states correspond to those in Figure . The statistical error for the transition state (TS)
is estimated by averaging the free energy difference between 10–35
and 35–60 ns shown in Figure S4.The complete free energy profile
of our preferred correlated motion
mechanism is shown in Figure . The free energy barrier is 4.2 ± 0.2 kcal/mol, where
flipping of partner bases accounts for most of this energetic cost.
We also explored other possibilities for the order of events, which
showed higher energy barriers and/or led to nonproductive binding
states (Figures S6 and S7). Compared with
the free energy barrier of our conformational capture mechanism of
9.6 kcal/mol (detailed in Figures S8–S11 of the Supporting Information), the correlated motion mechanism
presents a much lower free energy barrier for the Rad4-CPD-containing
DNA duplex to achieve the productive binding state. We hypothesize
that the energetic relationship between the two mechanisms may depend
on the nature of the lesion; e.g., some lesions, such as the benzo[a]pyrene (B[a]P)-derived cis-B[a]P-dG adduct, whose normal partner C is preflipped
in a base-displaced intercalated conformation,[48] may reveal lower barriers for the conformational capture
mechanism.Our mechanism sheds light on the function of BHD2
in lesion sensing.
It has been shown experimentally with humanXPC that deletion of the
BHD3 domain does not entirely destroy the XPC DNA-damage sensor function;[49] this is consistent with the initial stage of
our correlated motion mechanism, in which the PM structure shows engagement
first of BHD2 with the minor groove of the DNA damage site, which
leads to further DNA bending at INT (Figure ) that facilitates partner base flipping.Chen et al. have used temperature-jump perturbation spectroscopy
to obtain an estimate of the activation enthalpy for forming the open
complex when Rad4 is bound to mismatched duplexes, and obtained a
value of 7.9 ± 4.7 kcal/mol2 for the most relevant
mismatch sequence investigated. To the best of our knowledge, this
is the only experimental estimate available to compare with our free
energy barrier of 4.2 kcal/mol, although the systems are not the same
and our work provides the free energy; nonetheless, it is interesting
that our computed barrier is of the same order as this experimental
quantity.Our results have identified a PHE “flipping
path”
in BHD3 (Figure ),
which can direct destabilized partner bases into their binding pocket
for productive binding. The loss of this aromatic pathway would impede
the flipping of the partner bases into the protein for productive
binding, which is subject to experimental testing.While the
NER mechanism recognizes a wide array of DNA lesions,
they are repaired at very different rates, but the origin of this
diversity is not understood. It will be interesting to determine if
lesions with different local thermodynamic stabilities present different
free energy barriers for forming the productive open complex, to help
explain this diversity. Furthermore, stabilizing, repair-resistant
bulky lesions[36,37] might manifest very high barriers
that inhibit formation of the productive open complex, which would
better explain the origin of their NER resistance on an energetic
and molecular level.
Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971
Authors: Muwen Kong; Lili Liu; Xuejing Chen; Katherine I Driscoll; Peng Mao; Stefanie Böhm; Neil M Kad; Simon C Watkins; Kara A Bernstein; John J Wyrick; Jung-Hyun Min; Bennett Van Houten Journal: Mol Cell Date: 2016-10-06 Impact factor: 17.970
Authors: Kevin Hauser; Bernard Essuman; Yiqing He; Evangelos Coutsias; Miguel Garcia-Diaz; Carlos Simmerling Journal: Nucleic Acids Res Date: 2015-12-15 Impact factor: 16.971