Alice R Walker1, G Andrés Cisneros1. 1. Department of Chemistry, University of North Texas , 1155 Union Circle, Denton, Texas 76203, United States.
Abstract
Genetic information is vital in the cell cycle of DNA-based organisms. DNA polymerases (DNA Pols) are crucial players in transactions dealing with these processes. Therefore, the detailed understanding of the structure, function, and mechanism of these proteins has been the focus of significant effort. Computational simulations have been applied to investigate various facets of DNA polymerase structure and function. These simulations have provided significant insights over the years. This perspective presents the results of various computational studies that have been employed to research different aspects of DNA polymerases including detailed reaction mechanism investigation, mutagenicity of different metal cations, possible factors for fidelity synthesis, and discovery/functional characterization of cancer-related mutations on DNA polymerases.
Genetic information is vital in the cell cycle of DNA-based organisms. DNA polymerases (DNA Pols) are crucial players in transactions dealing with these processes. Therefore, the detailed understanding of the structure, function, and mechanism of these proteins has been the focus of significant effort. Computational simulations have been applied to investigate various facets of DNA polymerase structure and function. These simulations have provided significant insights over the years. This perspective presents the results of various computational studies that have been employed to research different aspects of DNA polymerases including detailed reaction mechanism investigation, mutagenicity of different metal cations, possible factors for fidelity synthesis, and discovery/functional characterization of cancer-related mutations on DNA polymerases.
The
accurate synthesis and maintenance of DNA in cells depend heavily
on a complex network of proteins, of which DNA polymerases are crucial
players. Each DNA polymerase has a unique task to ensure the successful
replication and repair of the genome of a cell. The number of potential
errors, mismatches, and damage to DNA is vast, with far-reaching consequences
for cell function that can result in disease or, in many cases, death.[1−4] These polymerases have complex structures, mechanisms, and dynamics,
which in many cases are not fully understood.[5−7] Computational
investigations of these systems require a variety of techniques and
a balance between rigorous theory and computational resources, especially
considering the relatively large size of polymerase systems and their
multifarious functions.[8]DNA polymerases
are divided into several families, each of which
has similarities in active site structure, processivity, and potential
for additional activity (such as exonuclease or lyase capabilities).[9−12] In brief, family A polymerases perform excision repair with relatively
poor processivity; family B performs DNA repair with the potential
to bypass lesions and has variable processivity; family C performs
DNA synthesis with a proofreading function and a high level of processivity;
family X is involved several types of DNA repair, including base excision
repair (BER) and non-homologous end joining (NHEJ); and family Y performs
translesion DNA synthesis with a low level of accuracy.[8,13−15]Even within families there can be substantial
differences in structure
and function of the various polymerases. However, overall the polymerase
subdomain structures are generally similar and are usually described
as a right or left-hand. Within the hand there are the finger, thumb,
and palm domains. The palm domain is where the DNA is bound while
the polymerase is active, and also where the active site for phosphoryl
transfer/nucleotide addition takes place, which is discussed more
thoroughly below. The thumb domain is usually thought to be involved
in the position and movement of the DNA through the polymerase, and
the finger domain is involved with the alignment of the incoming nucleotide
with the template strand and, potentially, recognition of the correct
incoming nucleotide.[16]Computational
simulations based on detailed classical and/or quantum
analysis have been applied to gain atomic-level insight into this
important class of enzymes. Molecular dynamics (MD) simulations allow
for the investigation of large and small-scale structural differences
over time and can provide better understanding regarding how large,
flexible polymerases adapt to relatively small structural changes,
such as mutagenic lesions, and how they maintain fidelity.[17−21] These simulations can also provide information on large scale motions
and structural changes such as switching between polymerase and exonuclease
activities.[22,23] As a recent example, Kim et al.
have provided a novel explanation for how DNA polymerase β (Polβ)
becomes inactivated by oxidized guanine by applying targeted MD to
obtain the specifics of the transition from the open active state
of Polβ to the inactive closed state by disruption of a water
network and, eventually, the active site.[24] Yang et al. have also used targeted MD to investigate how Polβ
changes between correctly paired terminal bases versus incorrectly
paired ones, and show aspects of the overall motion of Polβ,
and of a rotation of Arg258 in the active site that has been correlated
to the shift from the active to inactive state.[25] Additionally, molecular mechanics (MM) work by Jia et al.
uncovered the N-clasp structural feature of Polκ that allows
for nearly error-free bypass of mutagenic lesions.[26]Quantum mechanical (QM) and hybrid QM/molecular mechanics
(QM/MM)
methods have also frequently been used to investigate polymerase function
and activity. QM methods alone are highly accurate but are difficult
or impossible to apply to large enzymatic systems, and so usually
QM/MM is used to obtain information on the complex electronic structure
of the metal ions and accurate energetics for reaction mechanisms.[7,27−30] QM/MM subdivides the calculated system into a small region of 100
atoms or less that is calculated with QM, usually the active site
of the polymerase along with relevant ions and cofactors, and the
MM region, containing the rest of the polymerase and solvent. This
allows for insight into particulars of the reaction mechanisms and
metal catalysis that differ between specific polymerase families and
sometimes within polymerases themselves, as discussed further in section and throughout
the perspective. These cannot be investigated with MD alone since
they require investigation into bond breaking and forming and the
movement of electrons. Family X and Y polymerases in particular have
been the object of a large number of QM/MM[31] simulations.[19,32−39] To name a few examples, Hummer et al. have applied combined QM/MM
techniques to investigate a variety of aspects about B. halodurans ribonuclease (RNase), including the reactivity of specific metals
and the overall mechanism, gaining insights into nucleotide cleavage
and transfer reactions.[40] Combined QM/MM
simulations and structural/biochemical experiments, reported by Perera
et al., have given fascinating insight into the role of metal ions
in the active site of Polβ, and their dynamics of facilitation
and inhibition in relation to pyrophosphorolysis, or the reverse of
the usual addition reaction of polymerases.[41] QM/MM has also been used to gain mechanistic insight into several
Y family polymerases, which are unique in their ability to bypass
bulky chemical lesions. Dpo4, which can perform error-prone translesion
synthesis, has a particularly solvent-exposed active site; Wang et
al. have shown a corresponding mechanism for nucleotidyl-transfer
reactions through water-dependent pathways using QM/MM calculations.[42] Additionally, QM/MM work by Hoffmann et al.
shows a similar water-mediated transfer reaction for Polκ, which
has a much higher fidelity as compared to Dpo4.[43] Further discussion of mutagenic lesions and Dpo4 is found
later in the perspective in section .The use of advanced force fields for classical
or QM/MM simulations
applied to complex biomolecular systems also provides new insights
and results, showing new avenues for exploring the role of electronic
polarization and improved treatments of permanent electrostatics.[44−48] These advanced methods can improve solvent boundaries for QM/MM[49] and the description of metal cations.[50−52]We have developed a variety of methods for simulations and
analysis,
which have been applied to various biochemical systems including DNA
polymerases. In this perspective, we discuss our studies on DNA polymerase
catalysis in section , metal mutagenicity in section , fidelity determinants in section , mutagenic lesion bypass in section , and cancer mutations and
their effects on the structure and function of polymerases in section .
Computational Investigation of DNA Polymerase
Reaction Mechanisms
As briefly touched on in the Introduction, DNA polymerases possess a remarkable
degree of specificity and
variability in function. That said, the reaction mechanism of DNA
synthesis is very consistent across polymerases. The reaction mechanism
for DNA synthesis by DNA polymerases involves a nucleophilic attack
on the alpha phosphate (Pα) of the incoming nucleotide by the
O3′ of the primer-terminus nucleotide with the concomitant
formation of pyrophosphate (PPi). This reaction results in the formation
of an O–P bond and therefore the addition of the incoming nucleotide,
as shown schematically in Figure . In general, this mechanism is considered to have
two metal ions, though that can vary, as will be discussed later in
the perspective.[53]
Figure 1
Mechanism of polymerase
addition reaction for Pol λ. Reproduced
from Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A.
(2013) Toward a Deeper Understanding of Enzyme Reactions Using the
Coupled ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160.
Copyright 2013 American Chemical Society.[54]
Mechanism of polymerase
addition reaction for Pol λ. Reproduced
from Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A.
(2013) Toward a Deeper Understanding of Enzyme Reactions Using the
Coupled ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160.
Copyright 2013 American Chemical Society.[54]Several reaction mechanisms have
been investigated to assess the
particulars of proton transfer from the O3′ and seem to vary
depending on the particular DNA polymerase. The proton transfer from
the nucleophilic O may involve a direct proton transfer to an oxygen
on the Pα, indirectly through an ordered water, or to one of
the conserved aspartate/glutamic acid residues in the active site.[36,37,55] We have performed various theoretical
studies on the reaction mechanism of DNA polymerase λ (Polλ).[56] Polλ is an X family polymerase involved
in NHEJ and can fill small DNA gaps (1–2 nucleotides). QM/MM
simulations of the reaction mechanism for the three H+ transfer
possibilities in Polλ suggest that in this case the proton is
transferred to the conserved D490 in the active site.[56]DNA polymerases have been shown to be able to perform
their catalytic
activity using different divalent cations in the active site.[4,57] Polλ can employ either Mn2+ or Mg2+ to
synthesize DNA.[58] We performed QM/MM simulations
using both metal cations to determine the differences in the reaction
mechanism in Polλ. Our results show that the energy barriers
for both Mn2+ and Mg2+ are in good agreement
with experimental estimates.[56] In both
cases, the reaction is calculated to proceed through a two step mechanism
involving the initial proton transfer from O3′ to D490, followed
by the nucleophilic attack of O3′ on the alpha phosphate of
the incoming nucleotide. The energy barrier associated with the reaction
is ∼2 kcal/mol smaller for the Mn2+ catalyzed reaction
than for Mg2+, in agreement with experiment.Further
insights into the reaction were provided by investigating
the impact of individual residues on the critical points along the
reaction path. To this end, the non-bonded interaction energy between
each residue in the protein and the active site can be analyzed by
means of an energy decomposition analysis (EDA). This analysis provides
a qualitative assessment of the role of each amino acid on the reaction
and can be used to compare to available mutagenesis experiments, or
to predict possible mutation sites for subsequent experimental analysis.
The residues found to be important for catalysis in Polλ are
shown in Figure .[56] None of the predicted residues had been experimentally
investigated in Polλ prior to our simulations. Only three homologous
residues had been mutated experimentally in Polβ, and all exhibited
impact on catalysis. Following our computational simulations, mutations
on Polλ were performed on R386[59] and
K427,[60] which confirmed the role of these
residues on activity and stabilization of the DNA substrate. These
residues located in the so-called second shell around the active site
also gave rise to the analysis of cancer SNPs as described in subsection .
Figure 2
Catalytically relevant
residues in Polλ. Residues colored
in red have large energetic contributions to transition state stability
or are present in all transition states, residues in green correspond
to the QM subsystem. Reprinted from Cisneros, G. A., Perera, L., García-Díaz,
M., Bebenek, K., Kunkel, T. A., and Pedersen, L. G. (2008) Catalytic
mechanism of human DNA polymerase λ with Mg2+ and Mn2+ from
ab initio quantum mechanical/molecular mechanical studies. In DNA
Repair, Vol. 7, pp 1824–1834, Copyright 2008, with permission
from Elsevier.[56]
Catalytically relevant
residues in Polλ. Residues colored
in red have large energetic contributions to transition state stability
or are present in all transition states, residues in green correspond
to the QM subsystem. Reprinted from Cisneros, G. A., Perera, L., García-Díaz,
M., Bebenek, K., Kunkel, T. A., and Pedersen, L. G. (2008) Catalytic
mechanism of humanDNA polymerase λ with Mg2+ and Mn2+ from
ab initio quantum mechanical/molecular mechanical studies. In DNA
Repair, Vol. 7, pp 1824–1834, Copyright 2008, with permission
from Elsevier.[56]E. coliDNA polymerase III (Pol III) is
a complex
that is relatively large composed of 10 subunits with multiple functions
including DNA replication, post-replicative repair, and exonucleolytic
proofreading.[61,62] We have investigated the ϵ
subunit in particular and its function as an exonuclease.[63] The ϵ subunit (termed ϵ in the subsequent
discussion) preferentially removes incorrectly paired nucleotides
and is a crucial component of DNA Pol III’s high rate of fidelity.[64,65]One metal and two metal mechanisms have been proposed[66] for the proofreading mechanism; our theoretical
work focused on the two-metal mechanism.[67] Additionally, the reaction catalyzed by ϵ in vivo has been reported to be faster with Mn2+ than with Mg2+, so the mechanisms for both of these metals were explored.[66] In both cases, the two metal mechanism involves
a nucleophilic attack on the base to be excised by an attacking hydroxide
ion; essentially, the reverse of the standard polymerase addition
reaction. The formation of this ion is facilitated by the catalytic
metal, Me1, as seen in Figure .
Figure 3
Reaction mechanism for the ϵ subunit of DNA polymerase III
holoenzyme.
Reaction mechanism for the ϵ subunit of DNA polymerase III
holoenzyme.The ϵ subunit can
associate with an additional θ subunit,
which has been proposed to promote its activity.[68] The θ subunit also has a smaller homologue from bacteriophage
P1, HOT, which can also stabilize ϵ and perform a similar function.[69] Crystal structures of the ϵ–HOT
complex with two metals in the active site were used to explore the
mechanism using QM/MM and similar techniques as described for Polλ.
Our simulations involved three different systems including the ϵ–HOT
complex with Mg2+ or Mn2+, and free–ϵ
with Mg2+. The calculated energy barriers suggest that
ϵ is slightly more active with Mn2+ than with Mg2+; the ϵ–HOT complex shows a slightly lower activation
barrier compared with free-ϵ consistent with experiment.A particularly interesting aspect of this study involves the number
of ligands coordinated to the catalytic metal in the active site.
The reported crystal structure for ϵ–HOT, pdbid 2IDO,
suggests that the catalytic metal is pentacoordinated.[70] This is an unusual coordination number, especially
for Mg2+, which is considered to be the natural divalent
cation for both DNA polymerases and exonucleases. Moreover, this pentacoordination
is preferentially maintained along the reaction path according to
our QM/MM simulations.[67] The more typical
hexacoordination was also investigated but did not result in chemically
reasonable structures for the catalytic step. This was observed for
both Mn2+, which can maintain a pentacoordinated structure
in the first coordination shell, and for Mg2+, for which
it is less common but still present in other enzymes. This unusual
coordination sphere has been subsequently reported in other enzymes.[40,71,72]Additionally, we performed
EDA in a similar manner to Polλ
to determine residues that either stabilize or destabilize the TS
for the proofreading mechanism (Figure ).[67] Several residues near
the active site and on HOT were shown to have a significant effects
on catalysis. Four of these residues had been studied experimentally
previously. All four showed mutator effects proportional to their
(de)stabilization energies, with D103 and D129 showing particularly
strong mutagenic effects.
Figure 4
Catalytically important residues in ϵ–HOT.
Top: Residues
colored in red have large energetic contributions to transition state
stability and/or are present in all transition states, residues in
blue are active site residues. Reproduced from Cisneros, G. A., Perera,
L., Schaaper, R. M., Pedersen, L. C., London, R. E., Pedersen, L.
G., and Darden, T. A. (2009) Reaction Mechanism of the ε Subunit
of E. coli DNA Polymerase III: Insights into Active
Site Metal Coordination and Catalytically Significant Residues. J. Am. Chem. Soc.131, 1550–1556.
Copyright 2009 American Chemical Society.[67]
Catalytically important residues in ϵ–HOT.
Top: Residues
colored in red have large energetic contributions to transition state
stability and/or are present in all transition states, residues in
blue are active site residues. Reproduced from Cisneros, G. A., Perera,
L., Schaaper, R. M., Pedersen, L. C., London, R. E., Pedersen, L.
G., and Darden, T. A. (2009) Reaction Mechanism of the ε Subunit
of E. coliDNA Polymerase III: Insights into Active
Site Metal Coordination and Catalytically Significant Residues. J. Am. Chem. Soc.131, 1550–1556.
Copyright 2009 American Chemical Society.[67]We have also developed several
methods to improve simulation accuracy
and enable deeper analyses. One of these involves the combination
of two quantum interpretative techniques, electron localization function
(ELF) and noncovalent interaction (NCI) analysis, to investigate covalent
and non-covalent regions. This combination was originally developed
to investigate organic reactions in gas phase[73] and subsequently extended by us to enzymatic reaction mechanisms.[54] We applied ELF/NCI analysis to further examine
the role of the electronic structure on the intermolecular interactions
and metal coordinations related to the reactions catalyzed by Polλ
and ϵ. For Polλ, the ELF/NCI analysis revealed that the
interaction between O3′ on the primer and Mg2+ was
weaker than for Mn2+, as indicated by the associated NCI
surface. Concomitantly, the interaction between O3′ and the
phosphateoxygen coordinated to the metal ions was stronger for Mg2+ than for Mn2+ (Figure ). These results are consistent with our
previous QM/MM (and experimental) results where the barrier for proton
transfer is higher for Mg2+;[63] further details about the ELF results can be seen in Subsection .
Figure 5
NCIplot for Pol λ
reactant structures, with strong attractive
forces denoted in blue, strong repulsive forces denoted in red, and
weak forces denoted in green. Reproduced from Fang, D., Chaudret,
R., Piquemal, J.-P., and Cisneros, G. A. (2013) Toward a Deeper Understanding
of Enzyme Reactions Using the Coupled ELF/NCI Analysis: Application
to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160. Copyright 2013 American Chemical Society.[54]
NCIplot for Pol λ
reactant structures, with strong attractive
forces denoted in blue, strong repulsive forces denoted in red, and
weak forces denoted in green. Reproduced from Fang, D., Chaudret,
R., Piquemal, J.-P., and Cisneros, G. A. (2013) Toward a Deeper Understanding
of Enzyme Reactions Using the Coupled ELF/NCI Analysis: Application
to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160. Copyright 2013 American Chemical Society.[54]Similarly, for the ϵ subunit of Pol III, the Mn2+ ion shows electronic density splitting in the surfaces associated
with the metal ions. This splitting is not present for Mg2+, indicating that the Mn2+ ion has stronger interactions
with pertinent residues in the active site as compared to Mg2+. Interestingly, the catalytic metal exhibits five splitting basins
as shown in Figure , which is also consistent with the pentacoordination of Me1 as explained
above. Further insights into metal mutagenicity using ELF/NCI are
discussed below in section .
Figure 6
Combined NCI and ELF surfaces for the ϵ subunit of Pol III.
ELF surfaces are shown in transparent purple, and NCI surfaces shown
as solid red/green/blue surfaces. Panels a–c show the reactant,
transition state and product structures for Mg2+, while
panels d–f show the same for Mn2+. Reproduced from
Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2013)
Toward a Deeper Understanding of Enzyme Reactions Using the Coupled
ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem.
Theo. Comp.9, 2156–2160. Copyright
2013 American Chemical Society.[54]
Combined NCI and ELF surfaces for the ϵ subunit of Pol III.
ELF surfaces are shown in transparent purple, and NCI surfaces shown
as solid red/green/blue surfaces. Panels a–c show the reactant,
transition state and product structures for Mg2+, while
panels d–f show the same for Mn2+. Reproduced from
Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2013)
Toward a Deeper Understanding of Enzyme Reactions Using the Coupled
ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem.
Theo. Comp.9, 2156–2160. Copyright
2013 American Chemical Society.[54]
Third Metal in Polλ
Mechanism
As explained above, the synthesis of DNA by DNA
polymerases involves
a general two metal ion mechanism. Recently, Nakamura et al. reported
the existence of a transient third metal ion in the in crystalo reaction of Polμ based on time-resolved X-ray crystallography.[74] Wilson and co-workers have published similar
evidence for Polβ.[75] On the basis
of these results, Perera et al. have performed extensive computational
simulations to investigate the role of this third cation in the reaction
mechanism of Polβ.[27,41] Their results suggest
that this third metal does not affect the forward chemical step (DNA
synthesis) and significantly impairs the pyrophosphorolysis (backward)
reaction.Polλ is 35% identical to Polβ and thus
the question arises of whether this transient third metal is also
present in the Polλ reaction. We have explored the possible
role of the third metal ion in Polλ using our own QM/MM program,
LICHEM.[76] As reported in ref (74), the comparison of the
calculated energies of Polλ reactant and product structures
with a third metal included in the active site results in a stabilization
of the product relative to the reactant (Figure ). This is in contrast to our original reported
path for the mechanism of Polλ, where the calculated reaction
energy is endoergic by around 4–5 kcal/mol. Thus, our initial
calculations are consistent with the work of Perera et al. and Genna
et al. in that the third metal stabilizes the product structure and
thus reduces the likelihood of the backward reaction.[27,77] In addition, the resulting distances for the active site in the
optimized structures in Polλ are similar to those reported by
Perera et al., with the third metal coordinating to the apical oxygen
on Pα with distances around 2.1 Å for reactant and product.
Figure 7
Close-up
of the active site for the optimized reactant (left) and
product (right) structures of Polλ with the third metal in the
active site.
Close-up
of the active site for the optimized reactant (left) and
product (right) structures of Polλ with the third metal in the
active site.
Metal Mutagenicity
in DNA Polymerases
The requirement of metal cations to facilitate
the reaction in
DNA Pols gives rise to the possibility of replacement of the ”natural”
cation, Mg2+, by other metals. It is known that several
cations are carcinogenic or genotoxic, and these effects are due to
the inhibition of DNA repair transactions.[78] Indeed, the replacement of Mg2+ ions in the active site
of DNA polymerases by metals such as Cr(II), Cd(II), Ni(II), Ca(II),
Na(+), or Mn(II) results in either a decrease in synthesis fidelity
or outright inhibition.[57] Pelletier et
al. investigated the effect of different metals in ternary structures
of DNA Polβ.[79] Their results show
that metals other than Mg2+ result in changes in the active
site geometry.On the basis of our previous results on the reaction
mechanism
of Polλ, we investigated the effect of nine different cations
including Mg2+, Na+, Ca2+, Zn2+, Co2+, Cr2+, Cu2+, Mn2+, and Ni2+ using QM/MM methods.[80] In addition to the structural effects, we performed electron
localization function (ELF) analyses on all the resulting systems
to gain insights into the effects of the metals on the electronic
structure.The structures obtained from our simulations indicate
that the
overall arrangement of the active site is largely unaltered regardless
of the cation, with only subtle differences arising such as very slight
increases of 0.2–0.4 Å between the metals and the atoms
in the first coordination shell. Similarly, the nucleophilic attack
distance (O3′ to Pα) increases slightly, ∼0.3
Å, for two of the three inhibitor cations, Na+ and
Ca2+.To further understand the differences between
these systems given
the very slight changes in geometry, we turned to the analysis of
the electronic structure via ELF. The results from the ELF analysis
are striking in that out of the nine tested cations, significant differences
were observed between the calculated systems with inhibitor and mutagenic
cations compared to the Mg2+ system.[80] In particular, the ELF analysis indicates that when two
Mg2+ occupy the active site, the cations and incoming nucleotide
experience a particular electronic polarization. By contrast, the
inhibitory (Ca+, Na2+, and Zn2+)
and mutagenic (Co2+, Cr2+, Cu2+,
Mn2+, and Ni2+ cations exert a different (hyper)polarization
on the oxygen atoms on the triphosphate of the incoming nucleotide.
In addition, the enzyme environment also significantly affects the
metals in that only the Mg2+ shows a single basin around
the metals, whereas all other metals are (hyper-)polarized in such
a way that the metal exhibits multiple basins (Figure ).
Figure 8
ELF analysis for the active site of Polλ
with Mg2+ (left) and Ni2+ (right) occupying
the catalytic and nucleotide
binding metal sites. Reproduced from Chaudret, R., Piquemal, J.-P.,
and Cisneros, G. A. (2011) Correlation between electron localization
and metal ion mutagenicity in DNA synthesis from QM/MM calculations. Phys. Chem. Chem. Phys.13, 11239–11247,
with permission from the PCCP Owner Societies.[80]
ELF analysis for the active site of Polλ
with Mg2+ (left) and Ni2+ (right) occupying
the catalytic and nucleotide
binding metal sites. Reproduced from Chaudret, R., Piquemal, J.-P.,
and Cisneros, G. A. (2011) Correlation between electron localization
and metal ion mutagenicity in DNA synthesis from QM/MM calculations. Phys. Chem. Chem. Phys.13, 11239–11247,
with permission from the PCCP Owner Societies.[80]Thus, the detailed analysis of
the electronic structure of Polλ
suggests that the replacement of the natural metals by other cations
results in a change in polarization that may be responsible for the
observed inhibitory or mutagenic effects of the replacing cations
depending on their identity.[80]
DNA Synthesis Fidelity Checking
The accurate replication
of the genomic data is crucial for organismal
survival. DNA polymerases are arguably some of the most important
enzymes in the transactions involved in replication. For example,
the replication error rates in E. coli are around
1 in 10–10.[64] This very
low error rate (high fidelity) is due to at least three steps that
are involved in the replication of DNA: (1) base selection, (2) proofreading,
and (3) mismatch repair. DNA Pol I is a key player in the achievement
of the low error rates due to its participation in two of the three
steps via three enzymatic activities: DNA polymerase, 5′ →3′
exonuclease, and 3′ →5′ exonuclease.[81]The Klenow fragment (KF) is a functional
fragment of the POLA gene, which codes for PolI,
and contains the polymerase
and 3′ →5′ exonuclease activities.[82−91] The KF has provided a very useful system to learn a great deal about
DNA polymerases including reasons for nucleotide misinsertion discrimination
and decrease in synthesis fidelity rates, among other insights.[84,91] The KF is a member of the A family of polymerases, which includes
various other highly studied polymerases such as DNA PolI from Thermus aquaticus, Klentaq fragment, and Bacillus
stearothermophilus [Bacillus fragment (BF)].The discrimination
of correct versus incorrect nucleotide insertion
in KF has been investigated with single-molecule Förster resonance
energy transfer (FRET) by Christian et al.[92] The single-molecule results indicated a movement along the DNA following
synthesis that had not been previously observed. The authors proposed
that KF translocates two bases downstream along the DNA strand following
a nucleotide incorporation before backtracking for the next round
of incorporation (Figure ).[92] On the basis of these results,
Christian et al. proposed a possible post-instertion fidelity checking
site to check that the newly incorporated base is not mismatched with
the templating base. The checking site is assumed to be located immediately
behind the pre-insertion site (n – 1 position
along the DNA)
Figure 9
Schematic diagram of the proposed fidelity checking process
in
DNA polymerase I. Following nucleotide insertion, the DNA is translocated
to the checking site. If the inserted base pair is a correct pair,
the polymerase continues. If not, the DNA is shifted to the exonuclease
domain and the incorrect base pair is excised. Reproduced from Graham,
S.E., Syeda, F., and Cisneros, G. A. (2012) Computational Prediction
of Residues Involved in Fidelity Checking for DNA Synthesis in DNA
Polymerase I. Biochemistry51, 2569–2578.
Copyright 2012 American Chemical Society.[93]
Schematic diagram of the proposed fidelity checking process
in
DNA polymerase I. Following nucleotide insertion, the DNA is translocated
to the checking site. If the inserted base pair is a correct pair,
the polymerase continues. If not, the DNA is shifted to the exonuclease
domain and the incorrect base pair is excised. Reproduced from Graham,
S.E., Syeda, F., and Cisneros, G. A. (2012) Computational Prediction
of Residues Involved in Fidelity Checking for DNA Synthesis in DNA
Polymerase I. Biochemistry51, 2569–2578.
Copyright 2012 American Chemical Society.[93]We employed various computational
techniques to investigate the
location of potential residues that could be involved in the putative
checking site on KF, BF, and Klentaq.[93,94] MD simulations
were performed on all three polymerases containing correctly or incorrectly
paired bases with either a blunt end (KF, BF)[93] or a two-base overhang template (Klentaq).[94] The resulting ensembles were subjected to various analyses including
energy decomposition (EDA), electrostatic free energy response (EFER),[95,96] and non-covalent interaction (NCI)[97,98] to investigate
whether/if any residues show changes in interaction with correctly
or incorrectly paired bases.The interaction analyses revealed
that there are a number of residues
around the putative checking site (Figure ). Correlation of EDA, EFER, and NCI analyses
for all three systems (KF, BF, and Klentaq) suggests that there are
six residues with altered interactions with mismatched bases compared
to correctly paired bases in the putative checking site. Figure shows a condensed
sequence alignment for A family polymerases that indicates the six
residues with altered interactions.
Figure 10
Positions of residues with altered interactions
in KF and BF (orange
sticks). The structure and numbering correspond to BF. Polymerase
subdomains are colored red, blue, yellow, and mauve for the palm,
fingers, thumb, and exonuclease, respectively. DNA bases (T:A) in
the preinsertion site are shown as balls and sticks. Reproduced from
Graham, S.E., Syeda, F., and Cisneros, G. A. (2012) Computational
Prediction of Residues Involved in Fidelity Checking for DNA Synthesis
in DNA Polymerase I. Biochemistry51, 2569–2578. Copyright 2012 American Chemical Society.[93]
Figure 11
Condensed alignment for family A DNA polymerases with the common
residues with altered interactions in the putative (n – 1) checking site indicated. The numbering corresponds to
Klentaq. Reproduced from Elias, A. A., and Cisneros, G. A. (2014)
Computational Study of Putative Residues Involved in DNA Synthesis
Fidelity Checking in Thermus aquaticus DNA Polymerase
I. In Adv. Protein. Chem. Struct. Biol.96, 39–75, Copyright 2014, with permission from Elsevier.[94]
Positions of residues with altered interactions
in KF and BF (orange
sticks). The structure and numbering correspond to BF. Polymerase
subdomains are colored red, blue, yellow, and mauve for the palm,
fingers, thumb, and exonuclease, respectively. DNA bases (T:A) in
the preinsertion site are shown as balls and sticks. Reproduced from
Graham, S.E., Syeda, F., and Cisneros, G. A. (2012) Computational
Prediction of Residues Involved in Fidelity Checking for DNA Synthesis
in DNA Polymerase I. Biochemistry51, 2569–2578. Copyright 2012 American Chemical Society.[93]Condensed alignment for family A DNA polymerases with the common
residues with altered interactions in the putative (n – 1) checking site indicated. The numbering corresponds to
Klentaq. Reproduced from Elias, A. A., and Cisneros, G. A. (2014)
Computational Study of Putative Residues Involved in DNA Synthesis
Fidelity Checking in Thermus aquaticusDNA Polymerase
I. In Adv. Protein. Chem. Struct. Biol.96, 39–75, Copyright 2014, with permission from Elsevier.[94]Various residues that show altered interactions from our
computational
analysis have been reported to result in changes to DNA fidelity synthesis
in KF and Klentaq. In particular, residues R668 and R682 in KF (R615,
R629 in BF and R573, R587 in Klentaq) have been reported to be important
in the partitioning of the DNA primer terminus between the polymerase
and exonuclease active sites as well as mispair discrimination for
synthesis fidelity.[88,89] The correlation between experimental
mutagenesis studies indicating the role of these residues in fidelity
and the possible involvement of these residues in the fidelity-checking
step provide possible targets for further investigation of this intriguing
hypothesis.
Effects of Mutagenic Lesions on Structure and
Function of DNA Polymerase IV
Y-family polymerases, such
as DNA polymerase IV (Dpo4), have large,
flexible, and solvent exposed active sites that can accommodate bulky
lesions, which is likely crucial to their ability to perform translesion
synthesis (TLS). DNA can be damaged in a variety of ways, including
the incorporation of endogenously produced molecules such as polycyclic
aromatic hydrocarbons (PAHs).[99] Benzo[a]pyrene
(B[a]P)is a particularly bulky adduct that can add to DNA in several
conformations with different effects on DNA replication.[5,100]We have recently carried out computational simulations to
aid in
the understanding of biochemical and single molecule FRET experiments
for Dpo4 with B[a]P damage. The experimental results indicate that
the cis conformation of B[a]P adducted to guanine exhibits two different
behaviors depending on the solvent environment; in a pure water system,
the adduct takes the place of the guanine residue, intercalated within
the DNA helix, and replication does not proceed past that point.[101] In a 10% dimethyl sulfoxide (DMSO)/90% water
mixture, two different FRET states are observed and Dpo4 is able to
continue the DNA synthesis (TLS). Additionally, a G-G mismatch is
more stable than the correct G-C base pairing, resulting in increased
errors in TLS.We performed classical MD simulations on Dpo4
with B[a]P incorporated
to the DNA in pure water and in the water/DMSO mixture.[101] Our results indicate that the B[a]P indeed
intercalates with the DNA and does not allow the adduct to be everted
from the double strand when in pure water. Conversely, in the water/DMSO
mixture, the B[a]P adduct flips out into an open space next to the
active site underneath the finger domain in a solvent exposed conformation,
allowing replication to continue (Figure ). This is enabled by DMSO reducing the
dielectric constant of the solvent around the B[a]P adduct (effectively
microsolvating the B[a]P) and allowing it to be everted from the double
helix and thus avoiding the arrest of the polymerase.
Figure 12
Microsolvation of the
flipped out conformation of cis-benzo[a]pyrene (blue
surface) with dimethyl sulfoxide (yellow and
green surfaces).
Microsolvation of the
flipped out conformation of cis-benzo[a]pyrene (blue
surface) with dimethyl sulfoxide (yellow and
green surfaces).
Discovery
and Characterization of Cancer Mutants
on DNA Pols
The discovery of various residues involved in
catalyis in the “2nd-shell”
of Polλ (Figure ) raised a question about how many of these residues are conserved
among all human polymerases. A detailed sequence–structure
alignment reveals that five of the nine catalytically relevant second-shell
residues are at least partially conserved (Figure ). This points to the evolutionary importance
of these residues for polymerase activity and/or function.
Figure 13
Second-shell
residues conserved among 14 different human polymerases.
Second-shell
residues conserved among 14 different human polymerases.On the basis of these results, we wondered if there
are any natural
variants associated with a disease state that result in mutations
of these second-shell residues and how these may affect the structure
and function. Several polymerases have been linked specifically to
cancer, such as Polη, which has a well-documented relationship
with skin cancer.[3,102,103] To this end, we developed a methodology that combines a targeted
search for single nucleotide polymorphisms (SNPs) on selected genes,
statistical validation of the uncovered SNPs, and computational simulations
of the wild-type and SNP variants to determine if/whether the disease
variant shows a difference in its structure or function. The search
algorithm was implemented in a computer program called Hypothesis
Driven–SNP–Search, or HyDn–SNP–S.[104]The first application of HyDn–SNP–S
was performed
to uncover SNPs on all DNA polymerase genes for four different genome-wide
association studies comprising melanoma, lung, prostate, and breast
cancer (Figure ).
Following logistical regression, 79 exonic non-synonymous SNPs were
found to be statistically significantly associated with disease status.[104] Further analysis to determine derived haplotypes
of the uncovered SNPs with the selected cancer phenotypes resulted
in two new haplotypes for breast and prostate cancer. The first haplotype
is composed of two SNPs on POLL, rs3730477 and rs3730463,
and results in a three-fold increase in risk of breast cancer. The
second haplotype is constructed from three SNPs on POLG, rs3087374, rs2351000, and rs2247233. The resulting haplotype indicates
having two copies of the G–C–A haplotype (p = 0.008) results in a 9.64-fold increase in prostate cancer disease
status.[104]
Figure 14
Edge–node network
of the HyDn–SNPs results for DNA
polymerases. Phenotypes and polymerases are shown as nodes, edges
are weighted by total number of SNPs connecting each phenotype to
each polymerase. Reproduced from Swett, R. J., Elias, A., Miller,
J. A., Dyson, G. E., and Cisneros, G. A. (2013) Hypothesis driven
single nucleotide polymorphism search (HyDnSNP-S). In DNA
Repair12, 733–740, Copyright 2013,
with permission from Elsevier.[104]
Edge–node network
of the HyDn–SNPs results for DNA
polymerases. Phenotypes and polymerases are shown as nodes, edges
are weighted by total number of SNPs connecting each phenotype to
each polymerase. Reproduced from Swett, R. J., Elias, A., Miller,
J. A., Dyson, G. E., and Cisneros, G. A. (2013) Hypothesis driven
single nucleotide polymorphism search (HyDnSNP-S). In DNA
Repair12, 733–740, Copyright 2013,
with permission from Elsevier.[104]In addition to uncovering a large
number of previously unknown
cancer missense SNPs on DNA polymerases, we performed computational
simulations on one particular SNP. One of the SNPs on Polλ,
rs3730477, is associated with breast cancer status and results in
the R438W variant. R438 is located around 12 Å away from the
active site (Figure ). This residue sits at the end of loop1, which is crucial for Polλ
fidelity, and the SNP variant has been shown to give rise to chromosomal
abnormalities.[105,106] We performed MD simulations
on the binary and ternary structures of wild-type and the R438W variant
to determine the possible impact of the breast-cancer mutation on
the structure or function of Polλ. Our MD simulations indicate
that the mutation of R by W at position 438 results in a change in
the dynamics of loop 1, without affecting the overall motions of the
rest of the protein. Our results suggest that the alteration of the
motion of loop 1, which is experimentally known to be crucial for
fidelity, could affect the function of this critical structural feature
and help explain the observed decrease in fidelity.[104] Recently, on the basis of our work, Sweasy and co-workers
have confirmed that rs3730477 can be used as a biomarker for breast
cancer, and, more importantly for individuals with this particular
mutation, estrogen treatment may increase the risk of breast cancer
status.[107]
Figure 15
(A) Overlay of Polλ
in the binary and ternary conformations.
DNA is shown in light blue, and Loop 1 is shown in purple. (B) Differences
in Loop 1 orientation between the two conformations. Distance between
position 438 and Loop 1 following an interpolation between the two
structures at its (D) furthest and (C) closest approaches. Reproduced
from Swett, R. J., Elias, A., Miller, J. A., Dyson, G. E., and Cisneros,
G. A. (2013) Hypothesis driven single nucleotide polymorphism search
(HyDnSNP-S). In DNA Repair12, 733–740,
Copyright 2013, with permission from Elsevier.[104]
(A) Overlay of Polλ
in the binary and ternary conformations.
DNA is shown in light blue, and Loop 1 is shown in purple. (B) Differences
in Loop 1 orientation between the two conformations. Distance between
position 438 and Loop 1 following an interpolation between the two
structures at its (D) furthest and (C) closest approaches. Reproduced
from Swett, R. J., Elias, A., Miller, J. A., Dyson, G. E., and Cisneros,
G. A. (2013) Hypothesis driven single nucleotide polymorphism search
(HyDnSNP-S). In DNA Repair12, 733–740,
Copyright 2013, with permission from Elsevier.[104]
Summary
and Perspective
Computational simulations of DNA polymerases
using classical, quantum
mechanical, and hybrid QM/MM methods have enabled a deeper understanding
of a variety of facets related to their structure, function, and mechanism.
Detailed analysis at the atomic and electronic structure level enabled
by these methods has been shown to be useful to gain insights into
factors of metal mutagenicity, nucleotide discrimination, and impact
of protein residues on catalytic activity. The application of novel
bioinformatic techniques to uncover cancer-associated SNPs in DNA
polymerases has resulted in a large number of statistically significantly
associated biomarkers on DNA polymerases for four distinct cancer
phenotypes. The use of computational simulations on these cancer mutants
has provided possible explanations for the impact of these cancer
mutations on the function of the proteins.In the future, it
is expected that the improvement of simulation
methods, including more accurate classical potentials, coupled with
better levels of QM theory, enabled by larger computing capacity will
provide more opportunities for deeper insights into these systems.
For example, longer simulation times will enable the possibility to
study the transfer of ssDNA from the polymerase to the exonuclease
to investigate steps required for proofreading. Continuing the use
of computational simulations to investigate the impact of cancer-associated
SNPs on DNA polymerase structure, function, and dynamics will provide
the possibility to investigate the effect of these cancer mutations
at an atomic level. It is clear that the use of computational simulations
of DNA polymerases is a robust and well-established field that has
provided important contributions and will continue to aid in the detailed
understanding of these important enzymes.
Authors: Elizabeth H Z Thompson; Michael F Bailey; Edwin J C van der Schans; Catherine M Joyce; David P Millar Journal: Biochemistry Date: 2002-01-22 Impact factor: 3.162
Authors: Samir Hamdan; Esther M Bulloch; Phillip R Thompson; Jennifer L Beck; Ji Yeon Yang; Jeffrey A Crowther; Penelope E Lilley; Paul D Carr; David L Ollis; Susan E Brown; Nicholas E Dixon Journal: Biochemistry Date: 2002-04-23 Impact factor: 3.162
Authors: Hailey L Gahlon; Alice R Walker; G Andrés Cisneros; Meindert H Lamers; David S Rueda Journal: Phys Chem Chem Phys Date: 2018-10-31 Impact factor: 3.676