| Literature DB >> 18503084 |
Mercedes E Arana1, Mineaki Seki, Richard D Wood, Igor B Rogozin, Thomas A Kunkel.
Abstract
Human DNA polymerase theta (pol or POLQ) is a proofreading-deficient family A enzyme implicated in translesion synthesis (TLS) and perhaps in somatic hypermutation (SHM) of immunoglobulin genes. These proposed functions and kinetic studies imply that pol may synthesize DNA with low fidelity. Here, we show that when copying undamaged DNA, pol generates single base errors at rates 10- to more than 100-fold higher than for other family A members. Pol adds single nucleotides to homopolymeric runs at particularly high rates, exceeding 1% in certain sequence contexts, and generates single base substitutions at an average rate of 2.4 x 10(-3), comparable to inaccurate family Y human pol kappa (5.8 x 10(-3)) also implicated in TLS. Like pol kappa, pol is processive, implying that it may be tightly regulated to avoid deleterious mutagenesis. Pol also generates certain base substitutions at high rates within sequence contexts similar to those inferred to be copied by pol during SHM of immunoglobulin genes in mice. Thus, pol is an exception among family A polymerases, and its low fidelity is consistent with its proposed roles in TLS and SHM.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18503084 PMCID: PMC2441791 DOI: 10.1093/nar/gkn310
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Error Rate for pol θ
| Total changes | Error rate (×10−3) | |
|---|---|---|
| Frameshifts (−1) | 112 | 1.4 |
| Frameshifts (+1) | 270 | 3.3 |
| Base substitutions | 194 | 2.4 |
| Other | 29 | – |
| ± >1 bases | 16 | – |
aError rates are calculated from all observed changes (see Materials and Methods section). As mentioned in the text, mutant frequencies were calculated for two experiments. Experiment 1: mutant frequency was 27.0% (267 mutant plaques from a total of 997). Experiment 2: mutant frequency was 31.0% (873 mutant plaques from a total of 2835). Error rates are calculated from Experiment 1.
bError rates calculated for detectable changes are: frameshifts (−1) 0.9 × 10−3, frameshifts (+1) 1.9 × 10−3, base substitutions 1.4 × 10−3.
cThese errors are further discussed in the text and Table 2.
dAmong the 200 clones sequenced, several clones (13) contained deletions of two or more bases, while three clones contained insertions of two or more bases.
Pol θ errors involving more than one base
For each single event listed, the top sequence shows the original template sequence (Figure 1) including the five nucleotides on the 5′ and 3′ sides of the sequence that was changed. The bottom sequence shows the observed changes highlighted in red. For example, among 200 independent clones, 10 contained an insertion of a C at position 132–136 and a T at position 137–139.
Figure 3.Processive synthesis by Klenow exo− and pol θ. Reactions were performed as described in Materials and Methods section. (A) Representative phosphor image of the reaction products of processive DNA synthesis resolved on a 12% denaturing polyacrylamide gel. Lanes 2–5 represent primer extension products by Klenow exo− at 1, 3, 4 and 5 min. Lanes 7–11, primer extension products by pol θ at 2, 5, 10, 15 and 30 min. Lanes 1 and 6 represent negative control reactions. The numbers on the right indicate the positions along the lacZ gene. (B) Termination probability per nucleotide at each template position, 144 to 127. Numbers along the Y-axis indicate nucleotide positions along the lacZ gene. The termination probability is defined as the amount of product of a given length divided by the sum of products of that length plus all greater length products. The values plotted are the average of three values for pol θ (for time points 5, 10, and 15 min for pol θ:DNA at one ratio) and an average of three values for Klenow exo− (time points 3, 4 and 5 min at one ratio) and the error bars represent the standard deviation. Error rates for certain sequence contexts (CCCCC at positions 132–136 and TTT at positions 137–139) are shown for both pol θ and Klenow exo– (21).
Figure 1.Spectrum of errors generated by human pol θ. The 407 template nucleotides within the single-strand gap of the M13mp2 substrate are shown as five lines of the template sequence. Letters above the target sequence indicate base substitutions. Deletion of a base is depicted by an open triangle, whereas addition of a base is represented by a closed inverted triangle above the target sequence. ‘Red’ characters represent phenotypically detectable changes in the gap region while ‘black’ characters represent phenotypically undetectable changes found in association with detectable changes. ‘Gray’ arrows indicate hot spots of template A or T base substitutions (−36, +19, +113 and +131) and base substitution hot spots (+131 and +136) +1 represents the first transcribed nucleotide of the lacZα-complementation region.
Figure 2.Pol θ indel error rates as a function of homopolymeric run length. Error rates are the number of all observed single-base deletions and single base-additions (Table 1 and Figure 1) divided by the total number of template nucleotides present in runs of the indicated length indicated in the figure among 200 sequenced lacZ clones generated by pol θ.
Base substitution error rates for pol θ
| Base | Mutation From → To | Mispair Template · dNMP | All observed | Error rate (×10−4) |
|---|---|---|---|---|
| A | A → G | A · dCMP | 11 | 5.6 |
| A → T | A · dAMP | 9 | 4.5 | |
| A → C | A · dGMP | 7 | 3.5 | |
| T | T → C | T · dGMP | 77 | 42.0 |
| T → A | T · dTMP | 24 | 13.0 | |
| T → G | T · dCMP | 4 | 2.2 | |
| G | G → A | G · dTMP | 16 | 8.4 |
| G → C | G · dGMP | 4 | 2.1 | |
| G → T | G · dAMP | 5 | 2.6 | |
| C | C → T | C · dAMP | 35 | 14.0 |
| C → G | C · dCMP | 0 | ≤ 0.4 | |
| C → A | C · dTMP | 2 | 0.8 |
aBase substitution error rates were calculated using results from Experiment 1 (Table 1). All data are from the forward mutation assay. Error rates for all observed individual base substitutions were calculated as previously described (20).
Mutations in different mutable motifs
| Increase in mutations | Pol θ hot spot | ||||||
|---|---|---|---|---|---|---|---|
| Motifs | XP-V | Mouse η−/− | Pol θ | Mouse η−/−θ | GC | CA | CA |
| 0.3 (0.80) | 1.0 (0.45) | NO | YES | YES | |||
| A | 0.5 (0.22) | 0.8 (0.71) | YES | YES | NO | ||
aNumber of mutations in mutable motifs was calculated for the underlined bases (mutable sites). Values listed represent the fold increase in mutations at mutable sites compared with mutations at other sites. Bold italicized numbers represent a significant increase in mutations at mutable motifs (PW Wrandom 0.05), as revealed by using a Monte Carlo procedure (60).
bMutation spectra from Mayorov et al. (31).
cCombined spectra from Martomo et al. and Delbos et al. (61,62).
dMutation spectra from Masuda et al. (16).
eMutable motifs were compared with the distribution of base substitution error rates made by pol theta, three hotspots on the pol theta error spectrum (Figure 1) significantly correlated with the AA and ADK motifs.
Figure 4.Pol θ error rates compared to other DNA polymerases. (A) Single-base deletions (B) Single-base additions (C) Base substitutions. Family Y: hPol κ and hPol η error rates are taken from (20). Family X: hPol β and hPol λ error rates are taken from (57). Family B: hPol α error rates are unpublished results from Kokoska, R.J. and Kunkel, T.A., yPol δ (Exo–) error rates are taken from (58), yPol ɛ (Exo–) error rates are taken from (59) and yPol ζ error rates are taken from (50). Family A: hPol θ and hPol ν data are from this study. For hPol ν, at neutral pH, a mutation frequency of 2.6% was obtained (64 mutant plaques from a total of 6432). Of these, 22 clones were sequenced and as with the previous study (24), these clones show a preference for G to A base substitutions. hPol γ (Exo–) (+p140 and p55) error rates are from (23) and Kf (Exo−) error rates are taken from (21).