Literature DB >> 28877429

Computational Simulations of DNA Polymerases: Detailed Insights on Structure/Function/Mechanism from Native Proteins to Cancer Variants.

Abstract

Genetic information is vital in the cell cycle of DNA-based organisms. DNA polymerases (DNA Pols) are crucial players in transactions dealing with these processes. Therefore, the detailed understanding of the structure, function, and mechanism of these proteins has been the focus of significant effort. Computational simulations have been applied to investigate various facets of DNA polymerase structure and function. These simulations have provided significant insights over the years. This perspective presents the results of various computational studies that have been employed to research different aspects of DNA polymerases including detailed reaction mechanism investigation, mutagenicity of different metal cations, possible factors for fidelity synthesis, and discovery/functional characterization of cancer-related mutations on DNA polymerases.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2017 PMID： 28877429 PMCID： PMC5696005 DOI： 10.1021/acs.chemrestox.7b00161

Source DB: PubMed Journal: Chem Res Toxicol ISSN： 0893-228X Impact factor: 3.739

Introduction

The accurate synthesis and maintenance of DNA in cells depend heavily on a complex network of proteins, of which DNA polymerases are crucial players. Each DNA polymerase has a unique task to ensure the successful replication and repair of the genome of a cell. The number of potential errors, mismatches, and damage to DNA is vast, with far-reaching consequences for cell function that can result in disease or, in many cases, death.[1−4] These polymerases have complex structures, mechanisms, and dynamics, which in many cases are not fully understood.[5−7] Computational investigations of these systems require a variety of techniques and a balance between rigorous theory and computational resources, especially considering the relatively large size of polymerase systems and their multifarious functions.[8] DNA polymerases are divided into several families, each of which has similarities in active site structure, processivity, and potential for additional activity (such as exonuclease or lyase capabilities).[9−12] In brief, family A polymerases perform excision repair with relatively poor processivity; family B performs DNA repair with the potential to bypass lesions and has variable processivity; family C performs DNA synthesis with a proofreading function and a high level of processivity; family X is involved several types of DNA repair, including base excision repair (BER) and non-homologous end joining (NHEJ); and family Y performs translesion DNA synthesis with a low level of accuracy.[8,13−15] Even within families there can be substantial differences in structure and function of the various polymerases. However, overall the polymerase subdomain structures are generally similar and are usually described as a right or left-hand. Within the hand there are the finger, thumb, and palm domains. The palm domain is where the DNA is bound while the polymerase is active, and also where the active site for phosphoryl transfer/nucleotide addition takes place, which is discussed more thoroughly below. The thumb domain is usually thought to be involved in the position and movement of the DNA through the polymerase, and the finger domain is involved with the alignment of the incoming nucleotide with the template strand and, potentially, recognition of the correct incoming nucleotide.[16] Computational simulations based on detailed classical and/or quantum analysis have been applied to gain atomic-level insight into this important class of enzymes. Molecular dynamics (MD) simulations allow for the investigation of large and small-scale structural differences over time and can provide better understanding regarding how large, flexible polymerases adapt to relatively small structural changes, such as mutagenic lesions, and how they maintain fidelity.[17−21] These simulations can also provide information on large scale motions and structural changes such as switching between polymerase and exonuclease activities.[22,23] As a recent example, Kim et al. have provided a novel explanation for how DNA polymerase β (Polβ) becomes inactivated by oxidized guanine by applying targeted MD to obtain the specifics of the transition from the open active state of Polβ to the inactive closed state by disruption of a water network and, eventually, the active site.[24] Yang et al. have also used targeted MD to investigate how Polβ changes between correctly paired terminal bases versus incorrectly paired ones, and show aspects of the overall motion of Polβ, and of a rotation of Arg258 in the active site that has been correlated to the shift from the active to inactive state.[25] Additionally, molecular mechanics (MM) work by Jia et al. uncovered the N-clasp structural feature of Polκ that allows for nearly error-free bypass of mutagenic lesions.[26] Quantum mechanical (QM) and hybrid QM/molecular mechanics (QM/MM) methods have also frequently been used to investigate polymerase function and activity. QM methods alone are highly accurate but are difficult or impossible to apply to large enzymatic systems, and so usually QM/MM is used to obtain information on the complex electronic structure of the metal ions and accurate energetics for reaction mechanisms.[7,27−30] QM/MM subdivides the calculated system into a small region of 100 atoms or less that is calculated with QM, usually the active site of the polymerase along with relevant ions and cofactors, and the MM region, containing the rest of the polymerase and solvent. This allows for insight into particulars of the reaction mechanisms and metal catalysis that differ between specific polymerase families and sometimes within polymerases themselves, as discussed further in section and throughout the perspective. These cannot be investigated with MD alone since they require investigation into bond breaking and forming and the movement of electrons. Family X and Y polymerases in particular have been the object of a large number of QM/MM[31] simulations.[19,32−39] To name a few examples, Hummer et al. have applied combined QM/MM techniques to investigate a variety of aspects about B. halodurans ribonuclease (RNase), including the reactivity of specific metals and the overall mechanism, gaining insights into nucleotide cleavage and transfer reactions.[40] Combined QM/MM simulations and structural/biochemical experiments, reported by Perera et al., have given fascinating insight into the role of metal ions in the active site of Polβ, and their dynamics of facilitation and inhibition in relation to pyrophosphorolysis, or the reverse of the usual addition reaction of polymerases.[41] QM/MM has also been used to gain mechanistic insight into several Y family polymerases, which are unique in their ability to bypass bulky chemical lesions. Dpo4, which can perform error-prone translesion synthesis, has a particularly solvent-exposed active site; Wang et al. have shown a corresponding mechanism for nucleotidyl-transfer reactions through water-dependent pathways using QM/MM calculations.[42] Additionally, QM/MM work by Hoffmann et al. shows a similar water-mediated transfer reaction for Polκ, which has a much higher fidelity as compared to Dpo4.[43] Further discussion of mutagenic lesions and Dpo4 is found later in the perspective in section . The use of advanced force fields for classical or QM/MM simulations applied to complex biomolecular systems also provides new insights and results, showing new avenues for exploring the role of electronic polarization and improved treatments of permanent electrostatics.[44−48] These advanced methods can improve solvent boundaries for QM/MM[49] and the description of metal cations.[50−52] We have developed a variety of methods for simulations and analysis, which have been applied to various biochemical systems including DNA polymerases. In this perspective, we discuss our studies on DNA polymerase catalysis in section , metal mutagenicity in section , fidelity determinants in section , mutagenic lesion bypass in section , and cancer mutations and their effects on the structure and function of polymerases in section .

Computational Investigation of DNA Polymerase Reaction Mechanisms

As briefly touched on in the Introduction, DNA polymerases possess a remarkable degree of specificity and variability in function. That said, the reaction mechanism of DNA synthesis is very consistent across polymerases. The reaction mechanism for DNA synthesis by DNA polymerases involves a nucleophilic attack on the alpha phosphate (Pα) of the incoming nucleotide by the O3′ of the primer-terminus nucleotide with the concomitant formation of pyrophosphate (PPi). This reaction results in the formation of an O–P bond and therefore the addition of the incoming nucleotide, as shown schematically in Figure . In general, this mechanism is considered to have two metal ions, though that can vary, as will be discussed later in the perspective.[53]

Figure 1

Mechanism of polymerase addition reaction for Pol λ. Reproduced from Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2013) Toward a Deeper Understanding of Enzyme Reactions Using the Coupled ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160. Copyright 2013 American Chemical Society.[54] Several reaction mechanisms have been investigated to assess the particulars of proton transfer from the O3′ and seem to vary depending on the particular DNA polymerase. The proton transfer from the nucleophilic O may involve a direct proton transfer to an oxygen on the Pα, indirectly through an ordered water, or to one of the conserved aspartate/glutamic acid residues in the active site.[36,37,55] We have performed various theoretical studies on the reaction mechanism of DNA polymerase λ (Polλ).[56] Polλ is an X family polymerase involved in NHEJ and can fill small DNA gaps (1–2 nucleotides). QM/MM simulations of the reaction mechanism for the three H+ transfer possibilities in Polλ suggest that in this case the proton is transferred to the conserved D490 in the active site.[56] DNA polymerases have been shown to be able to perform their catalytic activity using different divalent cations in the active site.[4,57] Polλ can employ either Mn2+ or Mg2+ to synthesize DNA.[58] We performed QM/MM simulations using both metal cations to determine the differences in the reaction mechanism in Polλ. Our results show that the energy barriers for both Mn2+ and Mg2+ are in good agreement with experimental estimates.[56] In both cases, the reaction is calculated to proceed through a two step mechanism involving the initial proton transfer from O3′ to D490, followed by the nucleophilic attack of O3′ on the alpha phosphate of the incoming nucleotide. The energy barrier associated with the reaction is ∼2 kcal/mol smaller for the Mn2+ catalyzed reaction than for Mg2+, in agreement with experiment. Further insights into the reaction were provided by investigating the impact of individual residues on the critical points along the reaction path. To this end, the non-bonded interaction energy between each residue in the protein and the active site can be analyzed by means of an energy decomposition analysis (EDA). This analysis provides a qualitative assessment of the role of each amino acid on the reaction and can be used to compare to available mutagenesis experiments, or to predict possible mutation sites for subsequent experimental analysis. The residues found to be important for catalysis in Polλ are shown in Figure .[56] None of the predicted residues had been experimentally investigated in Polλ prior to our simulations. Only three homologous residues had been mutated experimentally in Polβ, and all exhibited impact on catalysis. Following our computational simulations, mutations on Polλ were performed on R386[59] and K427,[60] which confirmed the role of these residues on activity and stabilization of the DNA substrate. These residues located in the so-called second shell around the active site also gave rise to the analysis of cancer SNPs as described in subsection .

Figure 2

Catalytically relevant residues in Polλ. Residues colored in red have large energetic contributions to transition state stability or are present in all transition states, residues in green correspond to the QM subsystem. Reprinted from Cisneros, G. A., Perera, L., García-Díaz, M., Bebenek, K., Kunkel, T. A., and Pedersen, L. G. (2008) Catalytic mechanism of human DNA polymerase λ with Mg2+ and Mn2+ from ab initio quantum mechanical/molecular mechanical studies. In DNA Repair, Vol. 7, pp 1824–1834, Copyright 2008, with permission from Elsevier.[56] E. coli DNA polymerase III (Pol III) is a complex that is relatively large composed of 10 subunits with multiple functions including DNA replication, post-replicative repair, and exonucleolytic proofreading.[61,62] We have investigated the ϵ subunit in particular and its function as an exonuclease.[63] The ϵ subunit (termed ϵ in the subsequent discussion) preferentially removes incorrectly paired nucleotides and is a crucial component of DNA Pol III’s high rate of fidelity.[64,65] One metal and two metal mechanisms have been proposed[66] for the proofreading mechanism; our theoretical work focused on the two-metal mechanism.[67] Additionally, the reaction catalyzed by ϵ in vivo has been reported to be faster with Mn2+ than with Mg2+, so the mechanisms for both of these metals were explored.[66] In both cases, the two metal mechanism involves a nucleophilic attack on the base to be excised by an attacking hydroxide ion; essentially, the reverse of the standard polymerase addition reaction. The formation of this ion is facilitated by the catalytic metal, Me1, as seen in Figure .

Figure 3

Reaction mechanism for the ϵ subunit of DNA polymerase III holoenzyme.

Reaction mechanism for the ϵ subunit of DNA polymerase III holoenzyme. The ϵ subunit can associate with an additional θ subunit, which has been proposed to promote its activity.[68] The θ subunit also has a smaller homologue from bacteriophage P1, HOT, which can also stabilize ϵ and perform a similar function.[69] Crystal structures of the ϵ–HOT complex with two metals in the active site were used to explore the mechanism using QM/MM and similar techniques as described for Polλ. Our simulations involved three different systems including the ϵ–HOT complex with Mg2+ or Mn2+, and free–ϵ with Mg2+. The calculated energy barriers suggest that ϵ is slightly more active with Mn2+ than with Mg2+; the ϵ–HOT complex shows a slightly lower activation barrier compared with free-ϵ consistent with experiment. A particularly interesting aspect of this study involves the number of ligands coordinated to the catalytic metal in the active site. The reported crystal structure for ϵ–HOT, pdbid 2IDO, suggests that the catalytic metal is pentacoordinated.[70] This is an unusual coordination number, especially for Mg2+, which is considered to be the natural divalent cation for both DNA polymerases and exonucleases. Moreover, this pentacoordination is preferentially maintained along the reaction path according to our QM/MM simulations.[67] The more typical hexacoordination was also investigated but did not result in chemically reasonable structures for the catalytic step. This was observed for both Mn2+, which can maintain a pentacoordinated structure in the first coordination shell, and for Mg2+, for which it is less common but still present in other enzymes. This unusual coordination sphere has been subsequently reported in other enzymes.[40,71,72] Additionally, we performed EDA in a similar manner to Polλ to determine residues that either stabilize or destabilize the TS for the proofreading mechanism (Figure ).[67] Several residues near the active site and on HOT were shown to have a significant effects on catalysis. Four of these residues had been studied experimentally previously. All four showed mutator effects proportional to their (de)stabilization energies, with D103 and D129 showing particularly strong mutagenic effects.

Figure 4

Catalytically important residues in ϵ–HOT. Top: Residues colored in red have large energetic contributions to transition state stability and/or are present in all transition states, residues in blue are active site residues. Reproduced from Cisneros, G. A., Perera, L., Schaaper, R. M., Pedersen, L. C., London, R. E., Pedersen, L. G., and Darden, T. A. (2009) Reaction Mechanism of the ε Subunit of E. coli DNA Polymerase III: Insights into Active Site Metal Coordination and Catalytically Significant Residues. J. Am. Chem. Soc.131, 1550–1556. Copyright 2009 American Chemical Society.[67] We have also developed several methods to improve simulation accuracy and enable deeper analyses. One of these involves the combination of two quantum interpretative techniques, electron localization function (ELF) and noncovalent interaction (NCI) analysis, to investigate covalent and non-covalent regions. This combination was originally developed to investigate organic reactions in gas phase[73] and subsequently extended by us to enzymatic reaction mechanisms.[54] We applied ELF/NCI analysis to further examine the role of the electronic structure on the intermolecular interactions and metal coordinations related to the reactions catalyzed by Polλ and ϵ. For Polλ, the ELF/NCI analysis revealed that the interaction between O3′ on the primer and Mg2+ was weaker than for Mn2+, as indicated by the associated NCI surface. Concomitantly, the interaction between O3′ and the phosphate oxygen coordinated to the metal ions was stronger for Mg2+ than for Mn2+ (Figure ). These results are consistent with our previous QM/MM (and experimental) results where the barrier for proton transfer is higher for Mg2+;[63] further details about the ELF results can be seen in Subsection .

Figure 5

NCIplot for Pol λ reactant structures, with strong attractive forces denoted in blue, strong repulsive forces denoted in red, and weak forces denoted in green. Reproduced from Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2013) Toward a Deeper Understanding of Enzyme Reactions Using the Coupled ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160. Copyright 2013 American Chemical Society.[54] Similarly, for the ϵ subunit of Pol III, the Mn2+ ion shows electronic density splitting in the surfaces associated with the metal ions. This splitting is not present for Mg2+, indicating that the Mn2+ ion has stronger interactions with pertinent residues in the active site as compared to Mg2+. Interestingly, the catalytic metal exhibits five splitting basins as shown in Figure , which is also consistent with the pentacoordination of Me1 as explained above. Further insights into metal mutagenicity using ELF/NCI are discussed below in section .

Figure 6

Combined NCI and ELF surfaces for the ϵ subunit of Pol III. ELF surfaces are shown in transparent purple, and NCI surfaces shown as solid red/green/blue surfaces. Panels a–c show the reactant, transition state and product structures for Mg2+, while panels d–f show the same for Mn2+. Reproduced from Fang, D., Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2013) Toward a Deeper Understanding of Enzyme Reactions Using the Coupled ELF/NCI Analysis: Application to DNA Repair Enzymes. J. Chem. Theo. Comp.9, 2156–2160. Copyright 2013 American Chemical Society.[54]

Third Metal in Polλ Mechanism

As explained above, the synthesis of DNA by DNA polymerases involves a general two metal ion mechanism. Recently, Nakamura et al. reported the existence of a transient third metal ion in the in crystalo reaction of Polμ based on time-resolved X-ray crystallography.[74] Wilson and co-workers have published similar evidence for Polβ.[75] On the basis of these results, Perera et al. have performed extensive computational simulations to investigate the role of this third cation in the reaction mechanism of Polβ.[27,41] Their results suggest that this third metal does not affect the forward chemical step (DNA synthesis) and significantly impairs the pyrophosphorolysis (backward) reaction. Polλ is 35% identical to Polβ and thus the question arises of whether this transient third metal is also present in the Polλ reaction. We have explored the possible role of the third metal ion in Polλ using our own QM/MM program, LICHEM.[76] As reported in ref (74), the comparison of the calculated energies of Polλ reactant and product structures with a third metal included in the active site results in a stabilization of the product relative to the reactant (Figure ). This is in contrast to our original reported path for the mechanism of Polλ, where the calculated reaction energy is endoergic by around 4–5 kcal/mol. Thus, our initial calculations are consistent with the work of Perera et al. and Genna et al. in that the third metal stabilizes the product structure and thus reduces the likelihood of the backward reaction.[27,77] In addition, the resulting distances for the active site in the optimized structures in Polλ are similar to those reported by Perera et al., with the third metal coordinating to the apical oxygen on Pα with distances around 2.1 Å for reactant and product.

Figure 7

Close-up of the active site for the optimized reactant (left) and product (right) structures of Polλ with the third metal in the active site.

Metal Mutagenicity in DNA Polymerases

The requirement of metal cations to facilitate the reaction in DNA Pols gives rise to the possibility of replacement of the ”natural” cation, Mg2+, by other metals. It is known that several cations are carcinogenic or genotoxic, and these effects are due to the inhibition of DNA repair transactions.[78] Indeed, the replacement of Mg2+ ions in the active site of DNA polymerases by metals such as Cr(II), Cd(II), Ni(II), Ca(II), Na(+), or Mn(II) results in either a decrease in synthesis fidelity or outright inhibition.[57] Pelletier et al. investigated the effect of different metals in ternary structures of DNA Polβ.[79] Their results show that metals other than Mg2+ result in changes in the active site geometry. On the basis of our previous results on the reaction mechanism of Polλ, we investigated the effect of nine different cations including Mg2+, Na+, Ca2+, Zn2+, Co2+, Cr2+, Cu2+, Mn2+, and Ni2+ using QM/MM methods.[80] In addition to the structural effects, we performed electron localization function (ELF) analyses on all the resulting systems to gain insights into the effects of the metals on the electronic structure. The structures obtained from our simulations indicate that the overall arrangement of the active site is largely unaltered regardless of the cation, with only subtle differences arising such as very slight increases of 0.2–0.4 Å between the metals and the atoms in the first coordination shell. Similarly, the nucleophilic attack distance (O3′ to Pα) increases slightly, ∼0.3 Å, for two of the three inhibitor cations, Na+ and Ca2+. To further understand the differences between these systems given the very slight changes in geometry, we turned to the analysis of the electronic structure via ELF. The results from the ELF analysis are striking in that out of the nine tested cations, significant differences were observed between the calculated systems with inhibitor and mutagenic cations compared to the Mg2+ system.[80] In particular, the ELF analysis indicates that when two Mg2+ occupy the active site, the cations and incoming nucleotide experience a particular electronic polarization. By contrast, the inhibitory (Ca+, Na2+, and Zn2+) and mutagenic (Co2+, Cr2+, Cu2+, Mn2+, and Ni2+ cations exert a different (hyper)polarization on the oxygen atoms on the triphosphate of the incoming nucleotide. In addition, the enzyme environment also significantly affects the metals in that only the Mg2+ shows a single basin around the metals, whereas all other metals are (hyper-)polarized in such a way that the metal exhibits multiple basins (Figure ).

Figure 8

ELF analysis for the active site of Polλ with Mg2+ (left) and Ni2+ (right) occupying the catalytic and nucleotide binding metal sites. Reproduced from Chaudret, R., Piquemal, J.-P., and Cisneros, G. A. (2011) Correlation between electron localization and metal ion mutagenicity in DNA synthesis from QM/MM calculations. Phys. Chem. Chem. Phys.13, 11239–11247, with permission from the PCCP Owner Societies.[80] Thus, the detailed analysis of the electronic structure of Polλ suggests that the replacement of the natural metals by other cations results in a change in polarization that may be responsible for the observed inhibitory or mutagenic effects of the replacing cations depending on their identity.[80]

DNA Synthesis Fidelity Checking

The accurate replication of the genomic data is crucial for organismal survival. DNA polymerases are arguably some of the most important enzymes in the transactions involved in replication. For example, the replication error rates in E. coli are around 1 in 10–10.[64] This very low error rate (high fidelity) is due to at least three steps that are involved in the replication of DNA: (1) base selection, (2) proofreading, and (3) mismatch repair. DNA Pol I is a key player in the achievement of the low error rates due to its participation in two of the three steps via three enzymatic activities: DNA polymerase, 5′ →3′ exonuclease, and 3′ →5′ exonuclease.[81] The Klenow fragment (KF) is a functional fragment of the POLA gene, which codes for PolI, and contains the polymerase and 3′ →5′ exonuclease activities.[82−91] The KF has provided a very useful system to learn a great deal about DNA polymerases including reasons for nucleotide misinsertion discrimination and decrease in synthesis fidelity rates, among other insights.[84,91] The KF is a member of the A family of polymerases, which includes various other highly studied polymerases such as DNA PolI from Thermus aquaticus, Klentaq fragment, and Bacillus stearothermophilus [Bacillus fragment (BF)]. The discrimination of correct versus incorrect nucleotide insertion in KF has been investigated with single-molecule Förster resonance energy transfer (FRET) by Christian et al.[92] The single-molecule results indicated a movement along the DNA following synthesis that had not been previously observed. The authors proposed that KF translocates two bases downstream along the DNA strand following a nucleotide incorporation before backtracking for the next round of incorporation (Figure ).[92] On the basis of these results, Christian et al. proposed a possible post-instertion fidelity checking site to check that the newly incorporated base is not mismatched with the templating base. The checking site is assumed to be located immediately behind the pre-insertion site (n – 1 position along the DNA)

Figure 9

Schematic diagram of the proposed fidelity checking process in DNA polymerase I. Following nucleotide insertion, the DNA is translocated to the checking site. If the inserted base pair is a correct pair, the polymerase continues. If not, the DNA is shifted to the exonuclease domain and the incorrect base pair is excised. Reproduced from Graham, S.E., Syeda, F., and Cisneros, G. A. (2012) Computational Prediction of Residues Involved in Fidelity Checking for DNA Synthesis in DNA Polymerase I. Biochemistry51, 2569–2578. Copyright 2012 American Chemical Society.[93] We employed various computational techniques to investigate the location of potential residues that could be involved in the putative checking site on KF, BF, and Klentaq.[93,94] MD simulations were performed on all three polymerases containing correctly or incorrectly paired bases with either a blunt end (KF, BF)[93] or a two-base overhang template (Klentaq).[94] The resulting ensembles were subjected to various analyses including energy decomposition (EDA), electrostatic free energy response (EFER),[95,96] and non-covalent interaction (NCI)[97,98] to investigate whether/if any residues show changes in interaction with correctly or incorrectly paired bases. The interaction analyses revealed that there are a number of residues around the putative checking site (Figure ). Correlation of EDA, EFER, and NCI analyses for all three systems (KF, BF, and Klentaq) suggests that there are six residues with altered interactions with mismatched bases compared to correctly paired bases in the putative checking site. Figure shows a condensed sequence alignment for A family polymerases that indicates the six residues with altered interactions.

Figure 10

Figure 11

Condensed alignment for family A DNA polymerases with the common residues with altered interactions in the putative (n – 1) checking site indicated. The numbering corresponds to Klentaq. Reproduced from Elias, A. A., and Cisneros, G. A. (2014) Computational Study of Putative Residues Involved in DNA Synthesis Fidelity Checking in Thermus aquaticus DNA Polymerase I. In Adv. Protein. Chem. Struct. Biol.96, 39–75, Copyright 2014, with permission from Elsevier.[94]

Positions of residues with altered interactions in KF and BF (orange sticks). The structure and numbering correspond to BF. Polymerase subdomains are colored red, blue, yellow, and mauve for the palm, fingers, thumb, and exonuclease, respectively. DNA bases (T:A) in the preinsertion site are shown as balls and sticks. Reproduced from Graham, S.E., Syeda, F., and Cisneros, G. A. (2012) Computational Prediction of Residues Involved in Fidelity Checking for DNA Synthesis in DNA Polymerase I. Biochemistry51, 2569–2578. Copyright 2012 American Chemical Society.[93] Condensed alignment for family A DNA polymerases with the common residues with altered interactions in the putative (n – 1) checking site indicated. The numbering corresponds to Klentaq. Reproduced from Elias, A. A., and Cisneros, G. A. (2014) Computational Study of Putative Residues Involved in DNA Synthesis Fidelity Checking in Thermus aquaticus DNA Polymerase I. In Adv. Protein. Chem. Struct. Biol.96, 39–75, Copyright 2014, with permission from Elsevier.[94] Various residues that show altered interactions from our computational analysis have been reported to result in changes to DNA fidelity synthesis in KF and Klentaq. In particular, residues R668 and R682 in KF (R615, R629 in BF and R573, R587 in Klentaq) have been reported to be important in the partitioning of the DNA primer terminus between the polymerase and exonuclease active sites as well as mispair discrimination for synthesis fidelity.[88,89] The correlation between experimental mutagenesis studies indicating the role of these residues in fidelity and the possible involvement of these residues in the fidelity-checking step provide possible targets for further investigation of this intriguing hypothesis.

Effects of Mutagenic Lesions on Structure and Function of DNA Polymerase IV

Y-family polymerases, such as DNA polymerase IV (Dpo4), have large, flexible, and solvent exposed active sites that can accommodate bulky lesions, which is likely crucial to their ability to perform translesion synthesis (TLS). DNA can be damaged in a variety of ways, including the incorporation of endogenously produced molecules such as polycyclic aromatic hydrocarbons (PAHs).[99] Benzo[a]pyrene (B[a]P)is a particularly bulky adduct that can add to DNA in several conformations with different effects on DNA replication.[5,100] We have recently carried out computational simulations to aid in the understanding of biochemical and single molecule FRET experiments for Dpo4 with B[a]P damage. The experimental results indicate that the cis conformation of B[a]P adducted to guanine exhibits two different behaviors depending on the solvent environment; in a pure water system, the adduct takes the place of the guanine residue, intercalated within the DNA helix, and replication does not proceed past that point.[101] In a 10% dimethyl sulfoxide (DMSO)/90% water mixture, two different FRET states are observed and Dpo4 is able to continue the DNA synthesis (TLS). Additionally, a G-G mismatch is more stable than the correct G-C base pairing, resulting in increased errors in TLS. We performed classical MD simulations on Dpo4 with B[a]P incorporated to the DNA in pure water and in the water/DMSO mixture.[101] Our results indicate that the B[a]P indeed intercalates with the DNA and does not allow the adduct to be everted from the double strand when in pure water. Conversely, in the water/DMSO mixture, the B[a]P adduct flips out into an open space next to the active site underneath the finger domain in a solvent exposed conformation, allowing replication to continue (Figure ). This is enabled by DMSO reducing the dielectric constant of the solvent around the B[a]P adduct (effectively microsolvating the B[a]P) and allowing it to be everted from the double helix and thus avoiding the arrest of the polymerase.

Figure 12

Microsolvation of the flipped out conformation of cis-benzo[a]pyrene (blue surface) with dimethyl sulfoxide (yellow and green surfaces).

Discovery and Characterization of Cancer Mutants on DNA Pols

The discovery of various residues involved in catalyis in the “2nd-shell” of Polλ (Figure ) raised a question about how many of these residues are conserved among all human polymerases. A detailed sequence–structure alignment reveals that five of the nine catalytically relevant second-shell residues are at least partially conserved (Figure ). This points to the evolutionary importance of these residues for polymerase activity and/or function.

Figure 13

Second-shell residues conserved among 14 different human polymerases.

Second-shell residues conserved among 14 different human polymerases. On the basis of these results, we wondered if there are any natural variants associated with a disease state that result in mutations of these second-shell residues and how these may affect the structure and function. Several polymerases have been linked specifically to cancer, such as Polη, which has a well-documented relationship with skin cancer.[3,102,103] To this end, we developed a methodology that combines a targeted search for single nucleotide polymorphisms (SNPs) on selected genes, statistical validation of the uncovered SNPs, and computational simulations of the wild-type and SNP variants to determine if/whether the disease variant shows a difference in its structure or function. The search algorithm was implemented in a computer program called Hypothesis Driven–SNP–Search, or HyDn–SNP–S.[104] The first application of HyDn–SNP–S was performed to uncover SNPs on all DNA polymerase genes for four different genome-wide association studies comprising melanoma, lung, prostate, and breast cancer (Figure ). Following logistical regression, 79 exonic non-synonymous SNPs were found to be statistically significantly associated with disease status.[104] Further analysis to determine derived haplotypes of the uncovered SNPs with the selected cancer phenotypes resulted in two new haplotypes for breast and prostate cancer. The first haplotype is composed of two SNPs on POLL, rs3730477 and rs3730463, and results in a three-fold increase in risk of breast cancer. The second haplotype is constructed from three SNPs on POLG, rs3087374, rs2351000, and rs2247233. The resulting haplotype indicates having two copies of the G–C–A haplotype (p = 0.008) results in a 9.64-fold increase in prostate cancer disease status.[104]

Figure 14

Edge–node network of the HyDn–SNPs results for DNA polymerases. Phenotypes and polymerases are shown as nodes, edges are weighted by total number of SNPs connecting each phenotype to each polymerase. Reproduced from Swett, R. J., Elias, A., Miller, J. A., Dyson, G. E., and Cisneros, G. A. (2013) Hypothesis driven single nucleotide polymorphism search (HyDnSNP-S). In DNA Repair12, 733–740, Copyright 2013, with permission from Elsevier.[104] In addition to uncovering a large number of previously unknown cancer missense SNPs on DNA polymerases, we performed computational simulations on one particular SNP. One of the SNPs on Polλ, rs3730477, is associated with breast cancer status and results in the R438W variant. R438 is located around 12 Å away from the active site (Figure ). This residue sits at the end of loop1, which is crucial for Polλ fidelity, and the SNP variant has been shown to give rise to chromosomal abnormalities.[105,106] We performed MD simulations on the binary and ternary structures of wild-type and the R438W variant to determine the possible impact of the breast-cancer mutation on the structure or function of Polλ. Our MD simulations indicate that the mutation of R by W at position 438 results in a change in the dynamics of loop 1, without affecting the overall motions of the rest of the protein. Our results suggest that the alteration of the motion of loop 1, which is experimentally known to be crucial for fidelity, could affect the function of this critical structural feature and help explain the observed decrease in fidelity.[104] Recently, on the basis of our work, Sweasy and co-workers have confirmed that rs3730477 can be used as a biomarker for breast cancer, and, more importantly for individuals with this particular mutation, estrogen treatment may increase the risk of breast cancer status.[107]

Figure 15

(A) Overlay of Polλ in the binary and ternary conformations. DNA is shown in light blue, and Loop 1 is shown in purple. (B) Differences in Loop 1 orientation between the two conformations. Distance between position 438 and Loop 1 following an interpolation between the two structures at its (D) furthest and (C) closest approaches. Reproduced from Swett, R. J., Elias, A., Miller, J. A., Dyson, G. E., and Cisneros, G. A. (2013) Hypothesis driven single nucleotide polymorphism search (HyDnSNP-S). In DNA Repair12, 733–740, Copyright 2013, with permission from Elsevier.[104]

Summary and Perspective

Computational simulations of DNA polymerases using classical, quantum mechanical, and hybrid QM/MM methods have enabled a deeper understanding of a variety of facets related to their structure, function, and mechanism. Detailed analysis at the atomic and electronic structure level enabled by these methods has been shown to be useful to gain insights into factors of metal mutagenicity, nucleotide discrimination, and impact of protein residues on catalytic activity. The application of novel bioinformatic techniques to uncover cancer-associated SNPs in DNA polymerases has resulted in a large number of statistically significantly associated biomarkers on DNA polymerases for four distinct cancer phenotypes. The use of computational simulations on these cancer mutants has provided possible explanations for the impact of these cancer mutations on the function of the proteins. In the future, it is expected that the improvement of simulation methods, including more accurate classical potentials, coupled with better levels of QM theory, enabled by larger computing capacity will provide more opportunities for deeper insights into these systems. For example, longer simulation times will enable the possibility to study the transfer of ssDNA from the polymerase to the exonuclease to investigate steps required for proofreading. Continuing the use of computational simulations to investigate the impact of cancer-associated SNPs on DNA polymerase structure, function, and dynamics will provide the possibility to investigate the effect of these cancer mutations at an atomic level. It is clear that the use of computational simulations of DNA polymerases is a robust and well-established field that has provided important contributions and will continue to aid in the detailed understanding of these important enzymes.

97 in total

Review 1. Carcinogenicity of metal compounds: possible role of DNA repair inhibition.

Authors: A Hartwig
Journal: Toxicol Lett Date: 1998-12-28 Impact factor: 4.372

2. Determinants of DNA mismatch recognition within the polymerase domain of the Klenow fragment.

Authors: Elizabeth H Z Thompson; Michael F Bailey; Edwin J C van der Schans; Catherine M Joyce; David P Millar
Journal: Biochemistry Date: 2002-01-22 Impact factor: 3.162

Review 3. Considering the cancer consequences of altered DNA polymerase function.

Authors: Thomas A Kunkel
Journal: Cancer Cell Date: 2003-02 Impact factor: 31.743

4. Computer simulation studies of the fidelity of DNA polymerases.

Authors: Jan Florián; Myron F Goodman; Arieh Warshel
Journal: Biopolymers Date: 2003-03 Impact factor: 2.505

Review 5. The proofreading 3'-->5' exonuclease activity of DNA polymerases: a kinetic barrier to translesion DNA synthesis.

Authors: Vineeta Khare; Kristin A Eckert
Journal: Mutat Res Date: 2002-12-29 Impact factor: 2.433

6. Presence of 18-A long hydrogen bond track in the active site of Escherichia coli DNA polymerase I (Klenow fragment). Its requirement in the stabilization of enzyme-template-primer complex.

Authors: Kamalendra Singh; Mukund J Modak
Journal: J Biol Chem Date: 2003-01-09 Impact factor: 5.157

7. Infidelity of DNA synthesis in vitro: screening for potential metal mutagens or carcinogens.

Authors: M A Sirover; L A Loeb
Journal: Science Date: 1976-12-24 Impact factor: 47.728

8. Functional mutation of DNA polymerase beta found in human gastric cancer--inability of the base excision repair in vitro.

Authors: A Iwanaga; M Ouchida; K Miyazaki; K Hori; T Mukai
Journal: Mutat Res Date: 1999-10-22 Impact factor: 2.433

9. Polymerase beta simulations suggest that Arg258 rotation is a slow step rather than large subdomain motions per se.

Authors: Linjing Yang; William A Beard; Samuel H Wilson; Suse Broyde; Tamar Schlick
Journal: J Mol Biol Date: 2002-04-12 Impact factor: 5.469

10. Hydrolysis of the 5'-p-nitrophenyl ester of TMP by the proofreading exonuclease (epsilon) subunit of Escherichia coli DNA polymerase III.

Authors: Samir Hamdan; Esther M Bulloch; Phillip R Thompson; Jennifer L Beck; Ji Yeon Yang; Jeffrey A Crowther; Penelope E Lilley; Paul D Carr; David L Ollis; Susan E Brown; Nicholas E Dixon
Journal: Biochemistry Date: 2002-04-23 Impact factor: 3.162

5 in total

1. Phosphorylation Induced Conformational Transitions in DNA Polymerase β.

Authors: Amit Srivastava; Haitham Idriss; Kamal Taha; Sungmun Lee; Dirar Homouz
Journal: Front Mol Biosci Date: 2022-06-13

2. Exploring the Role of the Third Active Site Metal Ion in DNA Polymerase η with QM/MM Free Energy Simulations.

Authors: David R Stevens; Sharon Hammes-Schiffer
Journal: J Am Chem Soc Date: 2018-07-09 Impact factor: 15.419

3. Combining Evolutionary Conservation and Quantum Topological Analyses To Determine Quantum Mechanics Subsystems for Biomolecular Quantum Mechanics/Molecular Mechanics Simulations.

Authors: Mark A Hix; Emmett M Leddin; G Andrés Cisneros
Journal: J Chem Theory Comput Date: 2021-06-04 Impact factor: 6.578

4. Reduced structural flexibility for an exonuclease deficient DNA polymerase III mutant.

Authors: Hailey L Gahlon; Alice R Walker; G Andrés Cisneros; Meindert H Lamers; David S Rueda
Journal: Phys Chem Chem Phys Date: 2018-10-31 Impact factor: 3.676

Review 5. Building better polymerases: Engineering the replication of expanded genetic alphabets.

Authors: Zahra Ouaray; Steven A Benner; Millie M Georgiadis; Nigel G J Richards
Journal: J Biol Chem Date: 2020-10-01 Impact factor: 5.157

5 in total