Literature DB >> 25775267

Human DNA polymerase θ grasps the primer terminus to mediate DNA repair.

Karl E Zahn1, April M Averill1, Pierre Aller2, Richard D Wood3, Sylvie Doublié1.   

Abstract

DNA polymerase θ protects against genomic instability via an alternative end-joining repair pathway for DNA double-strand breaks. Polymerase θ is overexpressed in breast, lung and oral cancers, and reduction of its activity in mammalian cells increases sensitivity to double-strand break-inducing agents, including ionizing radiation. Reported here are crystal structures of the C-terminal polymerase domain from human polymerase θ, illustrating two potential modes of dimerization. One structure depicts insertion of ddATP opposite an abasic-site analog during translesion DNA synthesis. The second structure describes a cognate ddGTP complex. Polymerase θ uses a specialized thumb subdomain to establish unique upstream contacts to the primer DNA strand, including an interaction with the 3'-terminal phosphate from one of five distinctive insertion loops. These observations demonstrate how polymerase θ grasps the primer to bypass DNA lesions or extend poorly annealed DNA termini to mediate end-joining.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25775267      PMCID: PMC4385486          DOI: 10.1038/nsmb.2993

Source DB:  PubMed          Journal:  Nat Struct Mol Biol        ISSN: 1545-9985            Impact factor:   15.369


Introduction

Human polymerase θ (pol θ) is a multi-domain protein of 2590 amino acids, with homologs throughout multicellular organisms[1-3]. The human POLQ gene encodes the pol θ protein, which conserves superfamily 2 helicase (SF2) and family A DNA polymerase domains at the N- and C-termini, respectively, linked by a large central region[4]. In vertebrates, pol θ and polymerase ν (pol ν)[5] are the only family A polymerases present in the cell nucleus. The mitochondrial polymerase γ (pol γ) is distantly related to pol θ, and 5’-deoxyribose phosphate (5’-dRP) lyase activity has been attributed to both pol θ and pol γ[6,7]. Pol θ, however, does not synthesize DNA with the accuracy or processivity of the mitochondrial replicase[8,9]. Several conserved insertion loops, absent from bacterial homologs, intervene within the family A polymerase fold of pol θ. Deletion of residues 2264–2315, which include much of the penultimate insertion loop, was shown to abrogate pol θ’s ability to extend unannealed single-stranded oligonucleotides[10], and bypass AP sites or thymine glycol[11]. Early biochemical characterization of pol θ hinted at a role in translesion DNA synthesis (TLS), given pol θ’s lesion bypass activity[12,13], and the enzyme indeed appears to be involved in TLS of oxidative DNA damage in vivo[14]. Assays utilizing chicken DT40 cells deduced overlapping roles for pol θ and polymerase β (pol β) in base excision repair (BER)[15]. Pol θ can also substitute for pol β during BER in C. elegans[16]. It has been suggested that pol θ is involved in somatic hypermutation of immunoglobulin genes, but any role appears minimal[17]. Pol θ does, however, participate in a subset of immunoglobulin gene class-switch recombination events in mouse B cells, dependent on an alternative (Ku-independent) DNA end-joining pathway[18]. Pol θ contributes to alternative end-joining of double-strand breaks (DSBs) in D. melanogaster[19] and mammalian cells[18]. In C. elegans, pol θ limits extensive deletions at DNA replication fork barriers, but generates small indels, templated by DNA adjacent to the excision site[20,21]. Pol θ also produces templated indels in Drosophila[22,23] and mice[18]. Furthermore, pol θ interacts with the origin recognition complex in human cells during G1, and may affect DNA replication timing[24]. POLQ codes for the only nuclear DNA polymerase over-expressed in breast cancer, and higher expression correlates with unsuccessful treatment[25,26]. Elevated POLQ expression also occurs in oral squamous cell carcinomas[27]. Higher POLQ expression associates with disease free survival of lung cancers, however[28]. Knockout or knockdown of POLQ in mouse[18,29] and human[30] cells increases sensitivity to double-strand break inducing agents, including ionizing radiation. Complementation of this sensitivity phenotype requires the polymerase activity of pol θ, but not the ATPase activity of the helicase domain[18]. The presence of distinctive insertion loops in pol θ has encouraged speculation that these loops bestow unique properties to the enzyme. However, the molecular mechanisms that confer on pol θ an ability to prime DNA synthesis from non-optimal base pairing in the context of alternative end-joining have remained elusive until now. We set out to determine the crystal structures of two active constructs, comprising the entire C-terminal polymerase domain and its associated inactivated exonuclease-like subdomain, which are reported here. The structural work, in conjunction with a biochemical analysis of point variants, illuminates specific interactions from the specialized thumb subdomain of pol θ to the primer terminus that are necessary for TLS, and contribute to elongation of single-stranded DNA oligonucleotides.

Results

Human pol θ inserts ddATP opposite an abasic site

To identify structural components providing for the unique enzymatic activities of pol θ, we determined the crystal structures of two DNA polymerase domain constructs. Molecular replacement[31], using a ternary complex of Taq DNA polymerase (1QSY)[32] as the search model, sufficed to place the four similar molecules in the crystal asymmetric unit (ASU) of the THFddATP complex, which captured human pol θ inserting ddATP opposite tetrahydrofuran (THF), a stable abasic (AP) site analog. Refinement[33] at 3.9 Å to an RFree of 30.2 % (RWork= 24.1 %; Table 1) revealed an overall fold reminiscent of bacterial homologs—with exonuclease, thumb and fingers subdomains oriented about a right hand palm subdomain (Supplementary Table 1)—but disrupted by five unique insertion loops (Fig. 1a–c and Supplementary Table 2). The THFddATP complex showed clearly the nascent base pair in the polymerase active site: The strictly conserved catalytic aspartate and glutamate residues (D2330, D2540, and E2541) of the palm subdomain coordinate a divalent Ca2+ ion, associated with the triphosphate tail of the ddATP nucleotide (Fig. 1d and Supplementary Fig. 1). As a known inhibitor of DNA polymerases, Ca2+ was essential for trapping the closed complex, because the primer strand retains a 3’-hydroxyl moiety for nucleophilic attack (Supplementary Fig. 2). The highly conserved lysine (K2383) and arginine (R2379) residues of the fingers subdomain O-helix contact non-bridging oxygens of the α and γ phosphates, respectively, as seen in other closed ternary complexes of family A polymerases (Supplementary Fig. 2)[34]. The conserved O-helix residue Y2391 was fully displaced from its template-occluding position in open or “ajar” structures[35] (Fig. 1d and Supplementary Fig. 1), demonstrating that the pol θ structure represents the first fully closed model of any family A DNA polymerase inserting adenine opposite a non-templating DNA lesion. Previously, trapping ternary complexes destabilized by templating THF has required the purine analog 5-nitro-1-indolyl-2’-deoxyribose-5’-triphosphate (5-NITP), because of its enhanced capacity for base stacking[36,37]. The 5-NITP and ddATP double rings show substantial overlap when the current pol θ model is superimposed onto these previous structures[38].
Table 1

Data collection and refinement statistics

Data collection and refinement statistics (molecular replacement)

Pol θ Ca2+THF•ddATPPol θ Mg2+dCMP•ddGTPPol θ SeMet Mg2+(dCMP•ddGTP)
Data collection
Space groupP212121P21221P21221
Cell dimensions
a, b, c (Å)126.9, 137.0, 248.0100.7, 135.8, 161.9100.4, 134.8, 166.5
    α, β, γ (°)90, 90, 9090, 90, 9090, 90, 90
Resolution (Å)50–3.9(4.0–3.9)*40–3.9(4.0–3.9)40–4.6(4.8–4.6)
Rmerge (%)20.5(100)19.6(72.2)15.5(31.5)
RFriedel (%)8.4(79.2)9.0 (39.4)16.9(27.4)
II16.7(2.1)12.5(1.9)8.7(4.3)
Completeness (%)99.7(99.1)85.6(64.0)78.1(53.1)
Redundancy16.6(14.0)22.6(13.4)4.8(3.7)
Refinement
Resolution (Å)30–3.91(3.96–3.91)30–3.90(3.96–3.90)
No. reflections74685(2403)31372(1987)
Rwork/Rfree (%)24.1(39.4)/30.2(43.9)25.8(35.0)/31.6(45.4)
No. atoms2244810902
  Protein199009950
  DNA2428890
  Ligand/ion12062
B-factors (Å2)
  Protein190144
  DNA215166
  Ligand/ion217100
R.m.s. deviations
  Bond lengths (Å)0.0150.014
  Bond angles (°)1.3621.169

Values in parentheses are for highest-resolution shells. Pol θ Ca2+ and Mg2+ datasets are the product of merging 3 and 4 crystals, respectively. The pol θ Semet Mg2+ data set was collected on a single crystal.

R = Σ(|I+ − I−|)/Σ

Fig. 1

(a) The overall THF–ddATP pol θ structure is shown in comparison to Taq polymerase (circle insert; PDBID 1QSY[32]), identifying the 5 insertion loops. Loops exo1 and exo2 (cyan) extend from the β-sheet of the exonuclease-like subdomain. Insert 1 (brown) occupies the tip of the thumb subdomain. Insert 2 (yellow) joins the thumb subdomain, adjacent to the active site. Insert 3 (purple) of the palm subdomain lies in close proximately to the exonuclease-like subdomain. The dotted lines represent regions of the loops that were not seen in the electron density map and thus not built in the crystallographic model (b) A putty representation of pol θ, in the same orientation as panel a, displays the tube radius of the backbone trace proportionally to the refined atomic displacement parameters. Peaks from an NCS averaged anomalous difference electron density map (yellow mesh, contoured at 4 σ) pinpoint the location of methionines. (c) The full-length human pol θ domain architecture schematic describes the crystallization constructs, which encompass the entire C-terminal polymerase domain (residues 1819 to 2590) and vestigial exonuclease-like domain (residues 1819 to 2090). (d) A close-up view of the pol θ active site shows ddATP opposite THF in the closed conformation. Contacts (black dashes) are mediated from the O-helix residue Q2384 to the incoming nucleobase. (e) The THF–ddATP (dark pink, dark blue, and yellow-green) and dCMP–ddGTP (lighter hues) models appear superimposed, based on palm subdomain residues. Subtle rearrangements with cognate dCMP–ddGTP in the active site reposition the C-terminal end of the O-helix, forming a putative salt bridge from R2254 of insert 2 to D2376 of the fingers. All molecular illustrations were made with PyMOL (The PyMOL Molecular Graphics System, Version 1.5.0.4 Schrödinger, LLC).

Pol θ and pol ν both contain unique signature residues at mutable O-helix sites[39], adjacent to the essential arginine and lysine side chains that contact the triphosphate tail and make pyrophosphate a better leaving group[40]. A glutamine residue occupies position 2384 of pol θ (Q2384), where pol ν conserves a lysine (K679) and bacterial polymerases generally harbor an alanine or threonine. Q2384 appeared in the vicinity of the Hoogsteen face of the incoming nucleotide, within hydrogen bonding distance to the exocyclic N6 of the ddATP molecule (Fig. 1d). Mutation of the analogous residue to alanine in pol ν (K679A) has reduced both bypass of Tg, and pol ν’s propensity for dTMP–dGTP mismatch formation[39]. An additional contact to the incoming nucleotide was established in pol θ between the β-phosphate and O-helix residue Y2387 (Supplementary Fig. 1), which is also conserved in DNA polymerases from T-odd bacteriophages and pol ν[34,39].

The fingers subdomain closes on a cognate base pair

We pursued a related pol θ structure in the presence of Mg2+, the likely physiological divalent metal ion, and with a cognate dCMPddGTP base pair in the active site. This crystallization construct encompassed an additional 27 amino acid residue truncation from the N-terminus (Fig. 1c), and the DNA sequence was blunted to remove a 3’ template overhang. We also modified the 5’ sequence of the template to complement consecutive incoming ddGTP molecules, in order to promote enzymatic chain termination of the primer strand before trapping the closed ternary complex. These crystals diffracted to similar resolution as the Ca2+ crystals, although the diffraction data was deemed inferior, due to an off-origin native Patterson map peak and corresponding non-crystallographic symmetrical (NCS) translation, relating the two protein molecules of the ASU. Coupled with anisotropic diffraction, skewed intensity distribution statistics led to systematic data loss and reduced data completeness, even for highly redundant data sets compiled with extensive cross-crystal data merging (Table 1). A molecular replacement solution was readily obtained, however, based on the Ca2+ model, and refined to an RFree of 31.6 % (RWork= 25.8 %; Table 1). Subsequently, this approach yielded crystals (Table 1) of a selenomethionyl variant in the presence of Mg2+, from which a 4.6 Å anomalous difference Fourier map pinpointing the incorporated Se atoms was calculated. Superimposing these anomalous peaks onto the Ca2+ model added confidence to the overall trace of the model, and in assignment of side chains in poorly conserved regions, by anchoring the pol θ sequence (Fig. 1c). The Mg2+ complex appeared globally similar to the complex obtained with Ca2+. The former depicted subtle rearrangements in the fingers subdomain, adjusting the relative position of helix O to the primer–template and palm subdomain, as the O-helix appears to have moved closer to the cognate dCMPddGTP than the THFddATP pair (Fig. 1e).

Unique insertion loops decorate pol θ

The locations of the three loop elements previously described in the palm and thumb subdomains[12] are now firmly established by the current pol θ structure, in addition to two previously unreported insertion elements present in the N-terminal vestigial exonuclease-like subdomain (Fig. 1a and Supplementary Data Table 2). Intrinsic flexibility prevented modeling any of the pol θ insertion loops in their entirety (Fig. 1b). Nevertheless, the molecular replacement solutions yielded electron density for the base of each loop, which served to orient them about the global polymerase fold and provided a partial description of their structures. The tip of the thumb subdomain in pol θ resembles that of other homologous bacterial and phage polymerase structures, with two long helical segments separated by several flexible loops and smaller helices. Often in crystal structures of family A DNA polymerases, the tip of the thumb develops relatively high B-factors, due to the flexibility of these loops[32,34], and pol θ provided no exception (Fig. 1b). Refinement produced electron density only for the first residues on either side of insert 1 (residues P2144-F2177), although the proximity of insert 1 to the DNA minor groove demonstrates potential for contacts to the DNA[9]. The location of insert 1 resembles that of both the thioredoxin-binding motif[12] in T7 DNA polymerase[34], and the binding site of the pol γ processivity factor[8], which is consistent with the ascribed role of insert 1 in the processivity of pol θ[11]. Insert 2 (residues R2254-S2313) departs the palm subdomain near the primer terminus binding site to join the thumb subdomain, where electron density for two β-strands was observed exiting the active site. These strands constitute the most highly conserved fragments of insertion loop 2 in pol θ and mus308-like proteins, from which a highly retained basic residue (R2254 in human pol θ; Fig. 2a) contacts the 3’-terminal phosphate of the primer DNA (Fig. 2b and Supplementary Fig. 2). A second contact of interest involving R2254 appeared only in the Mg2+ crystal form, where the O-helix has closed farther towards the active site. These subtle shifts reposition the acidic C-terminal end of the O-helix, from which a well conserved aspartate residue (D2376) engages in a salt bridge with R2254 of insertion loop 2 (Fig. 1e), pinning the guanidinium moiety between the terminal phosphate and O-helix. Unlike insert 3, which resides on the opposite site of the protein from the primer–template DNA, insert 2 is poised to align a poorly matched primer terminus for nucleophilic attack. Both inserts 2 and 3 have been shown necessary for TLS by pol θ, with insert 2 required for single-stranded oligonucleotide extension[10,11].
Fig. 2

(a) Structure-based sequence alignment illustrates the conservation of K2181, R2202, and R2254 (blue triangles) in multicellular organisms. Pol θ, mus308, and pol ν conserve an arginine or lysine at position 2254, whereas bacterial polymerases retain a hydrophobic side chain. Only pol θ-like enzymes (top nine proteins) conserve basic residues at positions 2181 and 2202. (b) Superposition of pol θ (warm pink), Taq (1QSY[32], cyan), and Klenow (1KLN[42], light grey) illustrates how insertion loop 2 (yellow) departs from the canonical family A polymerase fold. R2254 in human pol θ contacts the primer 3’-terminal (n-1) phosphate. (c) The palm (warm pink) and specialized thumb subdomain (green) of pol θ are illustrated to display unique contacts (black dashes) to the primer DNA (orange sticks). In addition to R2254 at the n-1 phosphate, R2202 contacts the n-2 and n-3 phosphates of the primer DNA. K2181 establishes a putative salt bridge to the n-5 phosphate, just after insertion loop 1 (brown).

The N-terminal exonuclease domains of family A DNA polymerases are generally divergent in sequence, despite conservation of the overall fold. A subset of these enzymes harbor 3’–5’ exonuclease activity in this domain, such as E. coli pol I, whereas this activity has been inactivated in other polymerases, such as Taq DNA polymerase. The crystallization construct used in this study includes approximately 300 residues upstream of the polymerase core, which is homologous to the E. coli pol I 3’–5’ exonuclease domain. The catalytic residues necessary for metal ion binding and primer degradation (D355, E357, D424 in E. coli pol I) are absent from the N-terminal subdomain of pol θ, and thus exonuclease activity has not been observed. Density modification with cross-crystal NCS averaging provided detail to confidently place many side chains in regions of well-ordered secondary structure[41], and this approach proved indispensable in modeling the N-terminal subdomain at 3.9 Å resolution, given the characteristic sequence divergence from available models. The current study identified two unreported insertions in the N-terminal subdomain of pol θ, in addition to the three inserts of the core polymerase subdomains. The second insert of the N-terminal subdomain (loop exo2; Q1918-D1936) extends an established surface loop, which is only 10 amino acids in bacterial pol I (386–396 in E. coli pol I). Residues K1858-L1899 form the first insertion (loop exo1), of which only 8 amino acids could be modeled. Loops exo1 and exo2 appear in close proximity, where together they extend from the conserved β-sheet of the N-terminal subdomain (Fig. 1a and Fig. 3a,b). In full-length pol θ, these insertion elements might provide contacts to the helicase domain or central region, considering their proximity to the extreme N-terminus of the polymerase domain crystallization construct.
Fig. 3

(a) The N-terminal inactivated exonuclease-like subdomain of pol θ (smudge green) appears superimposed onto the 3’–5’ exonuclease domain of Klenow fragment (grey, 1KLN[42]). The Klenow structure shows the path of DNA into the exonuclease active site (tan), which is blocked by a shifted loop in the pol θ model. Insert 3 (purple) is resolved in close proximity to the exonuclease-like subdomain, and near loops exo1 and exo2 (cyan). Insert 3 is sequentially close to polymerase motif 5 of the palm (red), thus providing a structural linkage between the exonuclease-like subdomain and polymerase active site. (b) A close-up view of loops exo1 and exo2 (cyan) is provided, with the location of S1977 indicated. S1977P is homologous to the chaos1 mouse allele of pol θ. The current model indicates potential for hydrogen bond (black dashes) from S1977 to a backbone carbonyl of loop exo1.

Insert 3 (S2503-F2534) terminates with two consecutive glycine residues that lead into a 6-residue β-strand of the palm subdomain, directly prior to polymerase motif 5 (HDELLY) (Fig. 3a). Motif 5 contains two of the strictly conserved carboxylate residues, D2540 and E2541. A helical segment of insert 3 appears at the interface between the N-terminal and the palm subdomains of pol θ, lying at the base of the cleft present in the N-terminal subdomain. In E. coli pol I Klenow fragment, this cleft provides a path for DNA to migrate into the exonuclease proofreading active site[42]. In pol θ, a small loop (G2022-E2039) in the exonuclease-like domain is shifted that would block DNA from taking a trajectory identical to that previously observed in Klenow fragment (Fig. 3a).

The homologous chaos1 variant (S1977P) catalyzes TLS

A genetic screen for genomic stability factors in mice implicated pol θ in DSB repair[43,44]. Mouse cells carrying the chaos1 allele of POLQ have elevated spontaneous levels of micronuclei, and increased micronuclei following treatment with ionizing radiation or mitomycin C[18,29,44]. The chaos1 allele was subsequently verified to encode a serine to proline missense mutation (S1977P in human pol θ) in the N-terminal exonuclease-like subdomain. Partial synthetic lethality results when the chaos1 allele is introduced into an ATM knockout background, allowing only 10 % of mice to survive past the neonatal period[44]. The human pol θ model illustrates that S1977 caps the C-terminal end of a hydrophobic helix in the vestigial exonuclease-like subdomain. In the current model, S1997 could provide a hydrogen bond to the backbone carbonyl of D1897 of loop exo1 (Fig. 3b). We generated the S1977P mutant to characterize the biochemical characteristics of the homologous human chaos1 variant. Assays aimed to evaluate TLS (Fig. 4) and single-stranded primer extension (data not shown) failed to reveal a dramatic biochemical phenotype, however. These findings support the suggestion that cellular levels of the pol θ protein are depleted in the chaos1 mice due to poor in vivo expression or stability[18].
Fig. 4

The primer extension assay demonstrates that pol θ R2254V cannot perform the bypass step of TLS, even when all nucleotides are provided (500 µM each). Pol θ and the human chaos1 allele homolog (pol θ S1977P) bypass the AP site to a full-length product, with terminal transferase activity extending the primer by one additional nucleotide. Klenow fragment exo- is able to bypass the AP site under these reaction conditions (125 nM enzyme + 250 nM primer–template), without terminal transferase activity.

Pol θ R2254V fails to bypass AP sites

We designed the R2254V variant to evaluate how the conserved basic residue of insertion loop 2 contributes to pol θ’s activity on single-stranded DNA oligonucleotides, and bypass of AP sites or Tg lesions. Family A DNA polymerases from bacteria conserve a hydrophobic amino acid (valine or isoleucine) at the equivalent position of R2254 (Fig. 2a), and the R2254V variant therefore mimics these bacterial enzymes. Pol θ R2254V, although active on double-stranded DNA, failed to bypass AP sites or Tg (Figs. 4–6), and was marginally hindered during extension of unannealed single-stranded DNA oligonucleotides, especially when provided with only pyrimidine nucleotides (Fig. 7a,b). The salt bridge from R2254 to the primer 3’-terminal phosphate appears essential in compensating for interactions missing from the templating strand, due to DNA lesions or distorted base pairing.
Fig. 6

Bands from the primer extension assays shown in Fig. 5 were quantified in order to plot the average extension by pol θ thumb subdomain variants opposite an AP site, Tg, and an undamaged template.

Fig. 7

(a) Primer extension assays utilizing a single-stranded substrate show that pol θ variants K2181A, R2202A, R2254A and R2254V are all able to catalyze extension of a single-stranded oligonucleotide after 10 min in the indicated sequence context. (b) A primer extension assay in an alternate sequence context shows that variants of R2202 and R2254 produce shorter products when forced to utilize pyrimidines. In this sequence context, K2202A, R2254A, and R2254V appear unable to utilize dTTP and show reduced incorporation of dCTP compared to the parental enzyme.

The pol θ structure revealed other unique upstream contacts to the primer DNA strand mediated by the specialized pol θ thumb subdomain (Fig. 2a). R2202 inserts its guanidinium moiety between the n-2 and n-3 primer strand phosphates, and is therefore poised to make two contacts to the DNA backbone. An additional contact is possible from K2181 of the thumb subdomain to the primer n-5 phosphate (Fig. 2c). Including the contacts from R2254 to the n-1 phosphate, the pol θ thumb subdomain appears specialized to provide salt bridges to the primer strand phosphate backbone in excess of those conserved by all other family A polymerases, including pol ν (Fig. 2a). Like pol ν and bacterial pol I, pol θ retains arginine residues at positions 2201 and 2315 (631 and 690 in E. coli pol I), which provide absolutely conserved interactions to the n-4 and n-1 phosphate, respectively. Generating alanine substitutions at K2181, R2202, or R2254 in pol θ, for the purpose of evaluating the contribution of the specialized thumb subdomain during bypass of an AP site or Tg, revealed that the greatest TLS defect associates with a loss of contacts closest to the extreme primer terminus (Fig. 5). Although the extension step of TLS opposite an AP site challenged all variants, the mutations made at R2254 (to alanine or valine) afforded the most dramatically deficient TLS phenotypes, followed closely by R2202A. K2181A reduced bypass of the AP site compared to the wild type enzyme, although not to the same extent as other variants (Fig. 6). When provided with a substrate placing Tg in templating position, the same pattern emerged, albeit R2202A possessed a lesser defect, closer to that of K2181A. Even though pol ν harbors a lysine (K584) residue in insertion 2, homologous to R2254 in pol θ, R2202 and K2182 appear in neither pol ν nor bacterial homologs (Fig. 2a). These differences could explain why only pol θ grasps the primer tightly enough to bypass AP sites and extend certain minimally annealed primer–templates[45]. Pol ν is less adept at bypassing blocking lesions, and has only been shown to bypass 5S-Tg effectively[13]. Likewise, pol ν does not extend single-stranded oligonucleotides in vitro, a reaction readily catalyzed by pol θ[10]. Considering that pol θ retained substantial ability to extend single-stranded substrates in the context of individual mutations at R2254, R2202 or K2181 in vitro (Fig. 7a), the transiently templated nature of this reaction is emphasized[18], and pol θ must therefore employ a different set of amino acid side chains to manipulate the single-stranded substrate as its own template for primer extension.
Fig. 5

(a) Primer extension assays compare bypass of a templating AP site by pol θ thumb subdomain variants K2181A, R2202A, R2254A, and R2254V. (b) The variant polymerases are also challenged during bypass of a Tg lesion in otherwise identical primer extension assays. (c) Nearly full activity for all variant polymerases is observed in primer extension assays with an undamaged template.

Pol θ crystals reveal two modes of dimerization

The pol θ ASU in the Ca2+ crystal form contains 4 protein-DNA complexes, assembled as a dimer of 2-fold dimers (Fig. 8a). Two dissimilar types of contacts mediated packing of the DNA ends distal from the active site: The 3’ ends of each template DNA strand provide a dGMP overhang, two of which stack against conserved tryptophan residues (W1907) of adjacent protein molecules. The remaining template 3’ ends appear in close proximity, forming inter-DNA contacts outside the molecular footprint of pol θ.
Fig. 8

The NCS 2-fold axes in pol θ crystals are visualized and compared. (a) The ASU of the pol θ THF–ddATP model contains four protein-DNA complexes. Assembling the ASU requires two skew NCS 2-fold axes (dotted lines). (b) The NCS 2-fold relating chains C and D passes near insert 3 of the palm (pink hues) subdomain, adjacent to the N-terminal (grey shades) subdomain (c) The 5’-template DNA propagated NCS 2-fold relates chains A and B, passing near the fingers (blue hues) and thumb (green hues) subdomains. (d) Crystal packing in the dCMP–ddGTP crystal form is depicted, showing that the two molecules per ASU pack differently from the four molecules of the THF–ddATP crystal form (described in panel a). The identical NCS 2-fold axes are observed, but parallel to the 21 screw along cell edge c. The dimer at the right of the panel is generated by a crystallographic translation of the unit cell (green rectangular prism) along the b cell axis. The dimer at the bottom left is generated by the 21 screw along cell axis c (vertical axis), which in the context of the 5’-template DNA propagated NCS 2-fold axis gives rise to the NCS translation.

Three perpendicular 2-fold screw axes generate the space group P212121. However, two alternate pure 2-fold axes assemble the ASU of the Ca2+ crystal form, which exist as NCS operators. The first NCS 2-fold axis passes near insert 3 and the N-terminal subdomain (Fig. 8b). The second NCS 2-fold axis passes adjacent to the 5’-template DNA, upstream of the polymerase active site, and near the putative location of insertion loop 1 (Fig. 8c). The ASU in the current Ca2+ model therefore presents two potentially biologically relevant assemblies, where the 5’-template DNA 2-fold symmetry axis relates protein chains A and B, and the 2-fold symmetry axis of insert 3 relates molecules C and D. Interestingly, the pseudosymmetry observed in the Mg2+ crystal form was a consequence of similar configurations within the ASU, where the identical NCS 2-fold dimeric relationships about the 5’-template DNA and insert 3 appeared once again. In this case, both NCS 2-fold axes appeared nearly parallel to the crystallographic 21 screw axis along c, thus generating the NCS translation (Fig. 8d).

Discussion

An alternative DNA end-joining pathway in eukaryotes absolutely requires the POLQ gene product, where the C-terminal polymerase activity is applied specifically in a role for which other DNA polymerases or helicases are unable to compensate[18-20]. The current study informs a mechanism by which the human pol θ polymerase domain maintains a tight hold on the primer DNA strand, by establishing interactions from positively charged residues K2181, R2202 and R2254 to the phosphate backbone. The crystal structures also establish a precedence for the pol θ protein to dimerize. Although a multimer was not observed in solution by gel filtration chromatography, resolving identical NCS 2-fold axes in two different crystal forms indicates multimerization of the pol θ polymerase domain is possible at sufficient concentration. Analysis of these potential dimer interfaces in PISA[46] attributes 12,400 and 13,100 Å2 of buried surface area for the 5’ DNA propagated and insert 3 NCS 2-fold interfaces (Fig. 8b and 8c), respectively, neglecting the flexible segments omitted from the models. In the context of alternative end-joining, pol θ multimerization might be essential for bringing two DNA ends together, or sequestering free DNA ends from classical nonhomologous end-joining factors, such as Ku[47]. Higher expression of POLQ correlates with decreased survival in breast cancer patients[48,49], and knockdown of the gene product in several malignant cell lines induces radiosensitivity[30]. Given that POLQ is non-essential in healthy cells[44], the potential to pharmacologically target unique features of the pol θ protein, such as the specialized thumb subdomain revealed by this study, is an appealing approach for adjuvant radiation cancer therapy[26]. Future experiments will expand our understanding of the mechanisms by which pol θ protects cancer cells from radiation or chemotherapeutics. The extent to which transient pol θ dimers might bridge DSBs, for example, must also be considered as POLQ is evaluated as a potential target for next generation cancer drugs.

Online Methods

Crystallization of recombinant human pol θ

The pol θ polymerase crystallization construct (residues 1792–2590) was expressed from the pSUMO3 vector in Rosetta2(DE3)pLysS cells (Novagen) by autoinduction, and purified to homogeneity utilizing a nickel-NTA resin (Thermo Scientific), a HiTrap Heparin column (GE Healthcare Life Sciences), and a Superdex 200 Increase GL gel filtration column (GE Healthcare Life Sciences)[11]. The selenomethionyl pol θ protein was also autoinduced, but with the Overnight Express System 2 kit (EMD Millipore) using 125 mg/L of L-selenomethionine (Acros Organics). The protein was concentrated to 5 mg/mL in a buffer of ammonium acetate (150 mM), KCl (150 mM), Tris-HCl buffer pH 8.0 (40 mM), TCEP (2.5 mM), and glycerol (1% v/v). Reacting pol θ (2.5 mg/mL) with a 13–18mer oligonucleotide (50 µM) containing a THF abasic site analog (Midland), with annealed primer=gcggctgtcattc and template =cgt[THF]gaatgacagccgcg, in the presence of ddATP (1 mM), sucrose monolaurate (300 µM) and spermine tetrahydrochloride (20 mM) prepared the sample for hanging drop vapor diffusion over a 1 mL reservoir containing PEG 2000mme (9.5% v/v), MPD (2% v/v), CaCl2 (50 mM), KCl (200 mM), and Tris-HCl buffer pH 8.5 (100 mM), by mixing 0.6 µL of the reservoir solution with an equal part reaction solution. Narrow, long crystals of approximate final dimensions 400 × 60 × 60 µm3 grew over the next 3 days. Cryoprotection was achieved by adding 1 µL of a solution containing PEG 2000mme (25% v/v), MPD (25% v/v), CaCl2 (20 mM), spermine tetrahydrochloride (10 mM) and KCl (260 mM) and nucleotide (1 mM) directly to the drop, prior to flash cooling into liquid nitrogen. The alternative Mg2+ crystal form was discovered in an attempt to obtain better diffracting pol θ crystals. An additional 27 residues were cleaved from the N-terminus of the crystallization construct, leaving residues 1819–2590. The DNA sequence was also altered to allow for the formation of a Watson-Crick base pair in the active site and eliminating the 3’ template dGMP overhang. Two consecutive dCMP were introduced in the template to allow for enzymatic chain termination of the primer strand by incubation with ddGMP, yielding the primer=gcggctgtcatt and template=cgtccaatgacagccgc. Crystals were obtained from similar conditions as before, except that the reaction mixture was supplemented with 1 mM MgCl2, and ddGTP replaced the ddATP. The PEG 2000mme in the reservoir was increased to 12%, and the MPD to 10%. Although these crystals grew to similar length, they were wider (80 µm) but thinner (20 µm). Increasing the precipitant slightly to 12.5% allowed for crystallization similarly of the selenomethionyl variant of the Δ1818 construct in complex with the same primer–template, although these crystals were approximately half the size of the native Mg2+ crystals. Both native and selenomethionyl forms were cryoprotected by the prior procedure, except the cryoprotection solution was supplemented with MgCl2 instead of CaCl2.

Data collection, structure solution, and refinement

Pol θ crystals of the Ca2+ form diffracted at the APS synchrotron (beamline 23-ID-D) past 3.9 Å at 12 kEV, and complete data sets were collected from these radiation-sensitive primitive orthorhombic (P212121) crystals at 80 K by merging approximately 20 image segments, as the 20 µm beam was translated along the length of the crystal. Due to anisotropy of diffraction, multiple crystals were merged in order to obtain a complete data set with substantial redundancy, to be used in structure solution and refinement. An identical strategy was utilized to collect native and derivative datasets at Se-peak (12.67 kEV) for the Mg2+ crystal forms. Structure solution of the Ca2+ crystal form was accomplished by molecular replacement, utilizing the Taq DNA polymerase ternary complex 1QSY[32] to search for the 4 protein/DNA complexes per ASU. Long segments of secondary structure were rigid body fit into a reduced bias “prime and switch” map[53] with NCS averaging. Rebuilding in Coot[54] and refinement in Phenix[55] allowed most of the DNA to be built around the active site, although uncertainty remained concerning insertion elements, and in assigning sequence to the trace of the N-terminal vestigial exonuclease-like subdomain. The Mg2+ crystal form diffracted to similar resolution as the Ca2+ form, but was plagued by a strong off-origin native Patterson peak consistent with an NCS translation of (0.1, −0.5, 0.5) fractional units, which was verified by molecular replacement in the space group P21221, utilizing the Ca2+ structure as a search model. The identical NCS 2-fold dimeric configurations were resolved, as observed of crystals grown in Ca2+, although in this case a single dimer constituted the ASU. The NCS 2-folds occurred parallel to the crystallographic 21 screw operator along c, which gave rise to pseudosymmetry and potential twinning (Fig. 5d). Refinement against potential twin laws in all related monoclinic cells failed to improve refinement statistics, however, and twinning was ruled out, despite the presence of suspect intensity distribution. Moreover, the electron density maps were of lesser quality. Nevertheless, phases obtained by refining the rigid-body solution of the Mg2+ crystal form in Phenix[33] proved useful in generating cross-crystal density-modified NCS-averaged maps, which provided high quality electron density for the placing side chains of the final model, and helped resolve ambiguities in the tracing of the inserts and N-terminal exonuclease-like subdomain. Phases calculated from the transformed model also allowed visualization of anomalous peaks at 4.6 Å, due to selenomethionine substitution in the SeMet Mg2+ crystal form, and thereby provided invaluable verification of the backbone trace in the N-terminal domain and insertion element 2. Concluding refinement steps were conducted in Phenix[55], fitting protein domains and nucleic acid chains as rigid bodies. Torsion NCS, secondary structure, and Ramachandran restraints were enforced during individual xyz refinement, prior to TLS and group_ADP refinement, at which point the Rwork reached 28.3 % (Rfree= 31.2 %) for the Ca2+ model. Finally, the individual_ADP strategy was applied for 2 cycles of refinement to yield the completed model, for which statistics are reported. Ramachandran analysis places 98.6% of residues in the Ca2+ model in favored or allowed regions of the plot, leaving 1.4% outliers. The final Mg2+ model was obtained by trimming several residues from the base of insertion loop 1, and omitting 3 base-pairs of the duplex DNA extruding from the molecule in the 5’ primer direction, before similarly running refinement in Phenix. The completed Mg2+ model identifies 99.3% of residues in the favored or allowed region of the Ramachandran plot, with 0.7% outliers.

Variant production and biochemical analysis

Variant pol θ constructs were generated by site-directed mutagenesis with the QuikChange XL kit (Stratagene). Biochemical reactions were conducted by pre-incubating the primer–template (250 nM) with pol θ variants (125 nM) in Tris-HCl buffer pH 8.0 (20 mM), KCl (25 mM), and β-mercaptoethanol (1 mM), prior to adding deoxynucleotides (500 µM each or individually) and MgCl2 (10 mM). Aliquots were quenched at indicated time points by mixing in equal parts of a quench solution made of formamide (95 %), EDTA (20 mM), and trace bromophenol blue. Products were separated on a SequaGel UreaGel (National Diagnostics) denaturing sequencing gel (12 %), and visualized by excitation of a 5’ tetrachlorofluorescein tag on the primer strand at 532 nm setting on the PharosFX (BioRad) imaging device. Bands were quantified with the QuantityOne (BioRad) software and plotted in GNUplot via the function , where ρ is the normalized density of the band corresponding to the i extension product, to yield the average extension of the primer strand in fractional nucleotides.
  55 in total

1.  The structure of a high fidelity DNA polymerase bound to a mismatched nucleotide reveals an "ajar" intermediate conformation in the nucleotide selection mechanism.

Authors:  Eugene Y Wu; Lorena S Beese
Journal:  J Biol Chem       Date:  2011-03-19       Impact factor: 5.157

2.  Crystal structure of a bacteriophage T7 DNA replication complex at 2.2 A resolution.

Authors:  S Doublié; S Tabor; A M Long; C C Richardson; T Ellenberger
Journal:  Nature       Date:  1998-01-15       Impact factor: 49.962

3.  Phenotype-based identification of mouse chromosome instability mutants.

Authors:  Naoko Shima; Suzanne A Hartford; Ted Duffy; Lawriston A Wilson; Kerry J Schimenti; John C Schimenti
Journal:  Genetics       Date:  2003-03       Impact factor: 4.562

4.  Structural insight into processive human mitochondrial DNA synthesis and disease-related polymerase mutations.

Authors:  Young-Sam Lee; W Dexter Kennedy; Y Whitney Yin
Journal:  Cell       Date:  2009-10-16       Impact factor: 41.582

5.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

Review 6.  DNA polymerase POLQ and cellular defense against DNA damage.

Authors:  Matthew J Yousefzadeh; Richard D Wood
Journal:  DNA Repair (Amst)       Date:  2012-12-04

7.  DNA polymerase theta (POLQ) can extend from mismatches and from bases opposite a (6-4) photoproduct.

Authors:  Mineaki Seki; Richard D Wood
Journal:  DNA Repair (Amst)       Date:  2007-10-24

8.  Mechanism of microhomology-mediated end-joining promoted by human DNA polymerase θ.

Authors:  Tatiana Kent; Gurushankar Chandramouly; Shane Michael McDevitt; Ahmet Y Ozdemir; Richard T Pomerantz
Journal:  Nat Struct Mol Biol       Date:  2015-02-02       Impact factor: 15.369

9.  Nucleic acid polymerases use a general acid for nucleotidyl transfer.

Authors:  Christian Castro; Eric D Smidansky; Jamie J Arnold; Kenneth R Maksimchuk; Ibrahim Moustafa; Akira Uchida; Matthias Götte; William Konigsberg; Craig E Cameron
Journal:  Nat Struct Mol Biol       Date:  2009-01-18       Impact factor: 15.369

10.  Low-fidelity DNA synthesis by human DNA polymerase theta.

Authors:  Mercedes E Arana; Mineaki Seki; Richard D Wood; Igor B Rogozin; Thomas A Kunkel
Journal:  Nucleic Acids Res       Date:  2008-05-24       Impact factor: 16.971

View more
  56 in total

1.  Structures of human DNA polymerases ν and θ expose their end game.

Authors:  William A Beard; Samuel H Wilson
Journal:  Nat Struct Mol Biol       Date:  2015-04       Impact factor: 15.369

2.  Template and primer requirements for DNA Pol θ-mediated end joining.

Authors:  Peng He; Wei Yang
Journal:  Proc Natl Acad Sci U S A       Date:  2018-07-09       Impact factor: 11.205

Review 3.  Error-Prone Repair of DNA Double-Strand Breaks.

Authors:  Kasey Rodgers; Mitch McVey
Journal:  J Cell Physiol       Date:  2016-01       Impact factor: 6.384

4.  Expression and Structural Analyses of Human DNA Polymerase θ (POLQ).

Authors:  Andrew W Malaby; Sara K Martin; Richard D Wood; Sylvie Doublié
Journal:  Methods Enzymol       Date:  2017-05-24       Impact factor: 1.600

5.  Linking DNA polymerase theta structure and function in health and disease.

Authors:  Kelly Beagan; Mitch McVey
Journal:  Cell Mol Life Sci       Date:  2015-10-29       Impact factor: 9.261

6.  Genetic evidence for reconfiguration of DNA polymerase θ active site for error-free translesion synthesis in human cells.

Authors:  Jung-Hoon Yoon; Robert E Johnson; Louise Prakash; Satya Prakash
Journal:  J Biol Chem       Date:  2020-03-13       Impact factor: 5.157

Review 7.  Repair Pathway Choices and Consequences at the Double-Strand Break.

Authors:  Raphael Ceccaldi; Beatrice Rondinelli; Alan D D'Andrea
Journal:  Trends Cell Biol       Date:  2015-10-01       Impact factor: 20.808

8.  A polar filter in DNA polymerases prevents ribonucleotide incorporation.

Authors:  Mary K Johnson; Jithesh Kottur; Deepak T Nair
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

9.  Essential Roles for Polymerase θ-Mediated End Joining in the Repair of Chromosome Breaks.

Authors:  David W Wyatt; Wanjuan Feng; Michael P Conlin; Matthew J Yousefzadeh; Steven A Roberts; Piotr Mieczkowski; Richard D Wood; Gaorav P Gupta; Dale A Ramsden
Journal:  Mol Cell       Date:  2016-07-21       Impact factor: 17.970

Review 10.  Regulation of Single-Strand Annealing and its Role in Genome Maintenance.

Authors:  Ragini Bhargava; David O Onyango; Jeremy M Stark
Journal:  Trends Genet       Date:  2016-07-19       Impact factor: 11.639

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.