Literature DB >> 29905846

5-Formylcytosine mediated DNA-protein cross-links block DNA replication and induce mutations in human cells.

Shaofei Ji1, Iwen Fu2, Spandana Naldiga3, Hongzhao Shao1, Ashis K Basu3, Suse Broyde2, Natalia Y Tretyakova4,5.   

Abstract

5-Formylcytosine (5fC) is an epigenetic DNA modification introduced via TET protein-mediated oxidation of 5-methyl-dC. We recently reported that 5fC form reversible DNA-protein conjugates (DPCs) with histone proteins in living cells (Ji et al. (2017) Angew. Chem. Int. Ed., 56:14130-14134). We now examined the effects of 5fC mediated DPCs on DNA replication. Synthetic DNA duplexes containing site-specific DPCs between 5fC and lysine-containing proteins and peptides were subjected to primer extension experiments in the presence of human translesion synthesis DNA polymerases η and κ. We found that DPCs containing histones H2A or H4 completely inhibited DNA replication, but the replication block was removed when the proteins were subjected to proteolytic digestion. Cross-links to 11-mer or 31-mer peptides were bypassed by both polymerases in an error-prone manner, inducing targeted C→T transitions and -1 deletions. Similar types of mutations were observed when plasmids containing 5fC-peptide cross-links were replicated in human embryonic kidney (HEK) 293T cells. Molecular simulations of the 11-mer peptide-dC cross-links bound to human polymerases η and κ revealed that the peptide fits well on the DNA major groove side, and the modified dC forms a stable mismatch with incoming dATP via wobble base pairing in the polymerase active site.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29905846      PMCID: PMC6061883          DOI: 10.1093/nar/gky444

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Genomic DNA is associated with a range of architectural and regulatory proteins including histones, HMG proteins and protein factors involved in transcription, replication, and repair. These dynamic DNA–protein complexes can be converted to covalent DNA–protein cross-links (DPCs) when proteins are trapped on DNA strands upon exposure to α,β-unsaturated carbonyls, free radicals, heavy metals, and UV radiation (1–5). A wide range of proteins participate in DPC formation, including histones, DNA polymerases, DNA repair proteins and transcription factors (4,6,7). Because of their unusually large size and their ability to disrupt DNA–protein interactions, covalent DPCs can interfere with chromatin architecture, inhibit DNA replication, transcription and repair, and induce mutations and toxicity in human cells (1–4,8–10). Cellular DPCs have been proposed to contribute to human diseases including cancer, aging, and neurodegenerative disorders (11,12). We and others recently reported that 5-formylcytosine (5fC) residues of DNA form reversible DPCs with Lys side chains of histone proteins (Figure 1A) (13–16). 5fC bases are endogenous DNA epigenetic marks generated via oxidation of 5-methylcytosine (5mC) by ten eleven translocation dioxygenases (TET) (17–19). Although the global genomic levels of 5fC are relatively low (0.02–0.002% of total cytosines), they are found in all mammalian tissues (20) and are recognized by specific protein readers (21,22). 5fC are thought to play a role in epigenetic regulation due to their presence in regulatory elements of mammalian genes (18) and their association with gene expression levels (21,23). The recent reports of 5fC-mediated cross-links to histone proteins in vitro (16) and in human cells (1.20 ± 0.07 × 10−4 % of all dCs) (15) have raised some important questions such as: Do reversible 5fC-histone DPCs regulate gene expression? And Can 5fC-histone DPCs be tolerated by the DNA replication machinery?
Figure 1.

(A) The formation of DNA–protein and DNA–peptide cross-links at 5-formylcytosine of DNA. The aldehyde group of 5fC readily forms reversible Schiff base conjugates with the Arg or Lys side chains of proteins and polypeptides, which can be stabilized by reduction with NaCNBH3 or glutathione (GSH). (B) Representative MALDI MS spectrum of the conjugate between 5fC (X) containing DNA 23-mer 5′-AGG GTT TTC CXA GTC ACG ACG TT-3′ and 11-mer polypeptide RPKPQQFFGLM-CONH2 following NaCNBH3 reduction. (C) Denaturing PAGE analysis of the cross-links between 5fC (X) containing DNA 23-mer 5′-AGG GTT TTC CXA GTC ACG ACG TT-3′ and histone H2A. The reaction mixtures were heated prior to loading on gel, and samples in lanes 3 and 4 were reduced with NaCNBH3 or GSH, respectively, to stabilize DPCs.

(A) The formation of DNA–protein and DNA–peptide cross-links at 5-formylcytosine of DNA. The aldehyde group of 5fC readily forms reversible Schiff base conjugates with the Arg or Lys side chains of proteins and polypeptides, which can be stabilized by reduction with NaCNBH3 or glutathione (GSH). (B) Representative MALDI MS spectrum of the conjugate between 5fC (X) containing DNA 23-mer 5′-AGG GTT TTC CXA GTC ACG ACG TT-3′ and 11-mer polypeptide RPKPQQFFGLM-CONH2 following NaCNBH3 reduction. (C) Denaturing PAGE analysis of the cross-links between 5fC (X) containing DNA 23-mer 5′-AGG GTT TTC CXA GTC ACG ACG TT-3′ and histone H2A. The reaction mixtures were heated prior to loading on gel, and samples in lanes 3 and 4 were reduced with NaCNBH3 or GSH, respectively, to stabilize DPCs. A recent report by the Balasubramanian group suggests that Schiff base complexes between histone residues and 5fC in the genome influence gene expression levels by changing nucleosomal organization and establishing distinct regulatory regions (bioRxiv doi: 10.1101/224444). However, to our knowledge, there are no previous studies that address the effects of 5fC-histone conjugates on DNA replication. Our previous studies with other types of DPCs have established a key role of translesion synthesis (TLS) polymerases such as human Pol κ and η in catalyzing DNA replication past DNA–peptide conjugates (24–26). Unlike replicative Pols which have tight active sites and undergo a conformational change upon binding correct dNTP, Pol κ and η have open sites that allow them to bypass bulky DNA lesions. These specialized polymerases are recruited to stalled replication forks and traverse the damaged sites, affording the cell additional time to repair the damage before the replicase returns to complete genome duplication (27,28). In the present work, we evaluated the ability of human TLS polymerases κ and η to catalyze DNA synthesis past 5fC-mediated conjugates to histone H2A, histone H4 and shorter polypeptides representing possible proteolytic degradation products of larger DPCs. These two TLS polymerases were selected based on our previous studies demonstrating their ability to catalyze polymerase bypass past other DNA-polypeptide cross-links (25,26,29). For example, we have shown that hPol η and κ knockdown in human cells influences replication bypass fidelity of DPCs conjugated at the C7 position of 7-deaza-dG (24). In addition, our preliminary cellular experiments revealed that hPol η and κ knockdown strongly affects the replication fidelity of 5fC-mediated DPCs in human cells (A. Basu, unpublished observations). We employed a combination of gel electrophoresis and mass spectrometry based techniques to characterize the efficiency and the fidelity of polymerase bypass past 5fC-polypeptide lesions. Further, molecular modeling and molecular dynamics simulations were conducted to evaluate the structural effects of such endogenous DPCs on the ternary complex of hPol η and κ with DNA template–primer complex and the incoming dNTPs. Finally, cellular replication of 5fC-peptide containing plasmids was examined in human embryonic kidney (HEK) 293T cells.

MATERIALS AND METHODS

Full details of the experimental and modeling methods are given in the supporting information

Synthesis and characterization of 5fC-containing oligonucleotide

5fC-containing 23-mer and 17-mer oligodeoxynucleotides (ODNs, 5′-AGG GTT TTC CXA GTC ACG ACG TT-3′ and 5′-TTC CXA GTC ACG ACG TT-3′, where X = 5fC, A and B in Table 1) were prepared using solid phase synthesis on an ABI 394 DNA synthesizer (Applied Biosystem, Foster City, CA, USA). 5-Formyl-dC-III-CE phosphoramidite was purchased from Glen Research (Sterling, VA, USA). Synthetic ODNs were deprotected in 30% ammonium hydroxide at room temperature for 17 h and then in 80% acetic acid at 20°C for 6 h. 5fC-containing ODNs were purified by semi-preparative HPLC, desalted by NAP-10 columns (GE Healthcare, Pittsburgh, PA, USA), and characterized by mass spectrometry (Table 1). Unmodified DNA oligodeoxynucleotides were purchased from IDT (Coralville, IA).
Table 1.

Nucleobase sequence and mass spectrometry characterization of synthetic DNA strands

Expected mass (Da)Observed mass (Da)
A5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X = 5fC*7058.67059.1
B5′-TTC CXA GTC ACG ACG TT-3′, where X = 5fC5149.35149.4
C5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X contains 5fC cross-link to the K residue of RPKPQQFFGLM-CONH2 (see Figure 1A for structure)8391.28415.0 [M+Na]+
D5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X = 5fC cross-link to YGGFMTSEKSQTPLVTLFKNAIIKNAYKKGE10507.010532.0 [M+Na]+
E5′-TTC CXA GTC ACG ACG TT-3′, where X contains 5fC cross-link to the K residue of RPKPQQFFGLM-CONH26480.96504.8 [M+Na]+
F5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X contains 5fC cross-link to the K residue of histone H418 284N/A
G5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X contains 5fC cross-link to the K residue of histone H2A21 038N/A

*DNA 23-mers A, C, D, F, G are complementary to M13mp2 (6254–6276). B and E is a truncated version of the same sequence. Substance P (11-mer, RPKPQQFFGLM-CONH2) and β-endorphin (31-mer, YGGFMTSEKSQTPLVTLFKNAII KNAYKKGE) were selected as model polypeptides based on our earlier work (15).

Nucleobase sequence and mass spectrometry characterization of synthetic DNA strands *DNA 23-mers A, C, D, F, G are complementary to M13mp2 (6254–6276). B and E is a truncated version of the same sequence. Substance P (11-mer, RPKPQQFFGLM-CONH2) and β-endorphin (31-mer, YGGFMTSEKSQTPLVTLFKNAII KNAYKKGE) were selected as model polypeptides based on our earlier work (15).

Synthesis, purification and characterization of DNA–histone and DNA–peptide cross-links

DNA–protein or DNA–peptide cross-links were synthesized and characterized as previously described (15). Briefly, to form DNA–histone cross-links, DNA 23-mers containing site-specific 5fC (A in Table 1, 300 pmol) were incubated with equimolar amounts of recombinant histones H2A or H4 (New England Biolabs, Beverly, MA, USA) in 16 μl sodium phosphate buffer (4.5 mM, pH 7.4) for 3 h at 37°C. These conjugates were either used directly or subjected to NaCNBH3 reduction as described below. To convert transient Schiff base conjugates to stable DPCs amenable to gel purification, structural characterization, and kinetic analyses, samples were treated with NaCNBH3 (4 μl of 100 mM solution) at 37°C overnight (Figure 1A). The reaction mixtures were heated at 80°C for 10 min and separated by 4–12% denaturing SDS PAGE. Gel-purified DNA–histone conjugates were desalted on Micro Bio-spin columns (Bio-Rad, Hercules, CA, USA) and characterized by HPLC-ESI+-MS and MS2 (15). Stable DNA–peptide cross-links containing 11-mer peptide, RPKPQQFFGLM-CONH2 and 31-mer peptide, YGGFMTSEKSQ TPLVTL FKNAIIKNAYKKGE were prepared analogously, with the exception that 10–30 molar excess of synthetic peptides were used. The reaction mixtures were heated at 90°C for 15 min and purified by 20% (w/v) denaturing PAGE containing 7 M urea. Gel-purified DNA–peptide conjugates were desalted by solid phase extraction on Sep-Pak C18 cartridges (Waters Corporation, Milford, MA, USA) and characterized by MALDI-TOF-MS using 3-HPA matrix (see Figure 1B for representative spectra). DNA–protein and DNA–peptide conjugates were further purified by denaturing PAGE as needed to achieve >98% purity.

Preparation of template–primer complexes for primer extension assays

Synthetic DNA 12-mers (500 pmol, 5′-AAC GTC GTG ACT-3′) were radiolabelled by incubating with γ-32P ATP (1 μl, PerkinElmer Life Sciences, Boston, MA) and T4 PNK (20 units) in 20 μl 1 × T4 PNK buffer (New England Biolabs, Beverly, MA, USA) at 37°C for 1 h. The enzyme was deactivated by heating at 65°C for 10 min, and excess γ-32P ATP was removed using Microspin G25 columns (GE Healthcare, Pittsburgh, PA, USA). Radiolabeled 12-mer primers (30 pmol) were mixed with 1.5 eq. of 23-mer DNA template strands containing site-specific DNA–protein or DNA–peptide cross-links (5′-AGG GTT TTC CXA GTC ACG ACG TT-3′, where X is adducted position) in 10 mM TrisHCl buffer (pH 8.0) and 50 mM NaCl. The mixtures were heated at 90°C for 10 min and allowed to slowly cool down to room temperature overnight to generate primer–template complexes for primer extension assays. To digest the protein component of DPCs, primer–template complexes containing histone H4-5fC cross-link (2 pmol) were incubated with proteinase K (2.4 units) at 37°C for 48 h in the presence of 1× T4 PNK buffer (New England Biolabs, Beverly, MA, USA). The reaction mixtures were heated at 80°C for 2 h to deactivate the enzyme and then allowed to slowly cool down to room temperature overnight.

Primer extension assays using human TLS DNA polymerases

32P-labeled primer–template duplexes containing unmodified dC, 5-formyl-dC, DNA–peptide or DNA–protein crosslinks, or proteinase K digested DPCs (1 pmol) were incubated with human DNA polymerase η or κ (2 pmol) in the presence of 50 mM TrisHCl (pH 7.5), 5 mM DTT, 5 mM MgCl2, 50 mM NaCl, 100 μg/mL BSA and 10% glycerol (v/v) (total volume, 30 μl) at 37°C. The polymerization was initiated by the addition of all four dNTPs at 1 mM final concentration. Aliquots of the reaction mixtures (4 μl) were taken at pre-selected time points (0, 5, 30, 90, 180 min) and quenched with a gel loading buffer (20 mM EDTA in 95 % formamide containing 0.05 % bromophenol blue and xylene cyanol). The extension products were loaded onto 20 % denaturing PAGE containing 7 M urea, run at 80 W for 2.5 h in 1× TBE buffer, and visualized using Typhoon FLA 7000 phosphorimager (GH Healthcare, Pittsburgh, PA, USA).

Single nucleotide incorporation assays

32P-labeled primer–template complexes containing unmodified dC or 5fC cross-links to 11-mer peptide or 31-mer peptide (1.5 pmol) were incubated with human DNA polymerases (0.5 pmol of hPol η or 2 pmol of hPol κ) in the presence of 50 mM TrisHCl (pH 7.5), 5 mM DTT, 5 mM MgCl2, 50 mM NaCl, 100 μg/ml BSA and 10% glycerol (v/v) at 37°C. Individual dNTPs (50 μM final concentration) were added to initiate the polymerization in a final volume of 20 μl. Aliquots (4 μl) were quenched with gel loading buffer after 0, 5, 15 or 30 min, and the extension products were resolved by 20% (w/v) denaturing PAGE containing 7 M urea and visualized using Typhoon FLA 7000 system.

Steady-state kinetic analyses

Steady-state kinetics for the incorporation of individual dNTPs opposite unmodified dC or 11-mer peptide cross-links (RPKPQQFFGLM-CONH2) was examined by performing single nucleotide insertion assays in the presence of human DNA polymerases η or κ and increasing concentrations of specific dNTPs. primer–template duplexes (30 nM) were incubated with 0.4–1.0 nM hPol η or 1.2 nM hPol κ in the presence of individual dNTPs (5, 10, 25, 50, 100, 150, 250, 500 μM) for specified time periods (0–30 min). Primer extension products were visualized with a Typhoon FLA 7000 system and quantified by volume analysis using the ImageQuant TL 8.0 software (GE Healthcare). Steady-state kinetic parameters were calculated by nonlinear regression analysis using one-site hyperbolic fits in Prism 4.0 (Graphpad Software, La Jolla, CA, USA).

Sequencing and quantification of primer extension products using HPLC-ESI−-MS and HPLC-ESI−-MS2

Biotinylated primers (Biotin-5′-(T)8 AAC GTC GUG ACT-3′, purchased from IDT) were annealed to 17-mer oligonucleotide containing either unmodified dC (negative control) or 11-mer peptide–DNA cross-links (5′-TTC CXA GTC ACG ACG TT-3′, where X = dC or fC-Lys cross-link to the 11-mer peptide RPKPQQFFGLM-CONH2). The resulting template–primer complexes (200 pmol) were incubated with hPol κ (50 pmol) or hPol κ (70 pmol) at 37°C for 6 h in the presence of 50 mM TrisHCl (pH 7.5), 5 mM DTT, 5 mM MgCl2, 50 mM NaCl, 100 μg/ml BSA, 10% glycerol (v/v) and 1 mM of each dNTPs in a total volume of 90 μl. Following the addition of 20 mM sodium phosphate buffer (pH 7.0) containing 150 mM NaCl (400 μl), biotinylated primer extension products were captured on streptavidin–sepharose high performance beads (200 μl, GE Healthcare). To release truncated primer extension products, streptavidin beads were washed with water (3 × 400 μl) and then incubated at 37°C for 4 h with uracil-DNA glycosylase (UDG, 20 units) in the presence of 50 mM TrisHCl (pH 7.5), 1 mM EDTA and 1 mM DTT in a final volume of 500 μl. After removing the supernatant, the beads were washed with water (3 × 400 μl), and UDG-induced abasic sites were cleaved with hot piperidine (250 mM, 95°C for 1 h). The supernatants were dried under vacuum and reconstituted in 25 μl of water for HPLC–ESI−-MS analysis. HPLC–ESI-MS/MS sequencing and quantification of primer extension products was performed using a Dionex UltiMate 3000 HPLC coupled to a LTQ Orbitrap Velos mass spectrometer (Thermo Scientific, Watham, MA, USA) as described previously (25,26). A Zorbax 300SB-C18 column (150 × 0.5 mm, 5 μm, Agilent Technologies, Inc., Wilmington, CA, USA) was eluted at a flow rate of 15.0 μl/min using 15 mM ammonium acetate in water (A) and acetonitrile (B). Solvent composition was linearly increased from 2 to 20% B in 25 min. Mass spectrometry analyses were performed at a resolution of 60 000 and a scan range of m/z 300–2000. Relative quantification of various primer extension products was performed by comparing the corresponding HPLC–ESI-MS peak areas in extracted ion chromatograms. MS/MS fragmentation patterns were compared with the expected CID fragmentation from Mongo Oligo mass calculator version 2.06 available online (The RNA Institute, College of Arts and Sciences, State University of New York at Albany).

Construction and characterization of a plasmid containing a single DNA–peptide cross-link and its replication in human embryonic kidney (HEK) 293T cells

The 23-mer oligonucleotide (A in Table 1) containing NaCNBH3 stabilized cross-link to the 11-mer peptide (RPKPQQFFGLM-CONH2) was enzymatically ligated to a gapped single-stranded pMS2 plasmid carrying neomycin and ampicillin resistance genes, following methods reported earlier (24). A control plasmid containing native C at position X was constructed analogously. HEK 293T cells were grown to ∼90% confluency and transfected with 50 ng of the engineered plasmid constructs using Lipofectamine (Invitrogen, Carlsbad, CA, USA). Subsequently, the cells were allowed to grow at 37°C in 5% CO2 for 2 days. Plasmid DNA was isolated, purified, and used to transform Escherichia coli cells. The transformants were analyzed by oligonucleotide hybridization followed by DNA sequence analysis, as reported earlier (24).

Molecular dynamics simulations

Initial models

Our initial models of human polymerase κ (hPol κ) in complex with unmodified DNA were based on our earlier work by Lior-Hoffmann et al. (30), which employed the PDB (31) entry 2OH2 crystal structure of human pol κ ternary complex (32). The initial model of human polymerase η (hPol η) in complex with unmodified DNA was based on the ternary crystal structure PDB entry 3MR2 (33). In this hPol η structure, three residues in the palm domain (residues 155−157) and two residues in the little finger domain (residues 411−412) were not resolved and thus were modeled into the polymerase. We wished to study ternary complexes containing DNA–peptide cross-links at the templating base, human pol κ and η, and incoming dGTP or dATP nucleotides. To construct the DNA–peptide conjugates of interest, we first replaced the template base in our initial models with a C. This cytosine base was then modified at its C5 atom by conjugation to the side chain of the single lysine residue via a methylene linker. We then added three amino acid residues to this lysine to create the four amino acid peptide adduct NH2-RPKP-CONH2, and this peptide was energy minimized. Subsequently, we added one additional amino acid residue at a time to the C terminus, to create the 11-mer peptide RPKPQQFFGLM-CONH2, with minimization at each step; the C terminus of the growing peptide was directed toward the 3′-end of the template strand to place it in the major groove, where the cytosine C5 atom is located in B-DNA. For the Watson-Crick partner, we remodeled the incoming dNTP to dGTP. These final models were named 11mer-dGTP in both pol η and pol κ. To study the mismatch of A with the modified C, we replaced the dGTP with N1-protonated dATP and aligned it with the templating C to form a C–A+ wobble pair with two hydrogen bonds in pol η and pol κ; these two models are termed 11mer-dATP(η) and 11mer-dATP(κ). In addition, four analogous models without 11-mer peptide were constructed as controls, named dC-dGTP(κ), dC-dATP(κ), dC-dGTP(η) and dC-dATP(η). Thus, a total of eight polymerase models were prepared for the full MD simulations studies. Exploratory studies were also conducted for DPC models with incoming unprotonated dATP in pol η and pol κ.

Simulation methods

Molecular dynamics simulations were performed for each of the eight polymerase models. Following minimization and equilibration, we conducted ∼400 ns unrestrained production MD simulations using the AMBER14 package (34) with force field ff14SB (35,36) for proteins and incorporated modifications for DNA (37–39). We used the Joung-Cheatham model (40) for the Na+ and Cl− ions, and the TIP3P model (41) for water. For non-standard residues, the dGTP, dATP, N1-protonated dATP and DNA–protein cross-link lesion, parameters were assigned according to the GAFF (42) and the AMBERff14SB (35,36) force fields using the antechamber module of AMBER 14 (34). Partial charges for dGTP and dATP were taken from the literature (43–45); partial charges for the DNA–protein cross-link lesion and the N1-protonated dATP were determined using the restrained electrostatic potential (RESP) fit procedure at the HF/6-31G(d) level of theory using Gaussian 09 (46). Details on partial charge calculations and MD simulation protocol are given in the Methods Section in Supplementary Data. Molecular topology and coordinate files for the initial polymerase models were constructed using the tleap module of AmberTools14 (34). Sufficient numbers of Na+ ions were added to neutralize the system, and then additional Na+ and Cl− ions were added to bring the salt concentration close to the physiological value (0.15 M). The systems were solvated in the rectangular TIP3P water box with at least a 10 Å buffer, resulting in ∼70 Na+ ions, 60 Cl− ions and ∼21 000 water molecules. Initial structures were modeled using Discovery Studio software v2.5.5 (Accelrys), and all molecular images and movies were generated with PyMOL (The PyMOL Molecular Graphic System, version 1.3x Schrodinger, LLC). Post-processing of all simulations was carried out using the CPPTRAJ (47) module of AMBER14 (34) and Curves+ (48).

RESULTS

Bypass of DNA–protein and DNA–peptide cross-links by human DNA polymerases in vitro

To induce DPC formation, 5fC containing DNA strands (A and B in Table 1) were incubated with recombinant human histones H2A or H4 or model polypeptides (RPKPQQFFGLM-CONH2, YGGFMTSEKSQTPLVTLFKN AIIKNAYKKGE) The resulting DNA–protein/polypeptide conjugates were either used directly or stabilized by reduction in the presence of NaCNBH3 (Figure 1). Reversible Schiff base DPCs were used directly, while reductively stabilized DPCs (D–G in Table 1) were purified by denaturing PAGE and characterized by ESI-MS or MALDI mass spectrometry (Table 1) (15). To investigate the effects of C5-dC conjugated DNA–protein and DNA–peptide cross-links on DNA replication, primer–template complexes containing unmodified dC, 5fC, DNA–peptide, or DNA–protein cross-links (A, C, D, F, and G in Table 1) were subjected to in vitro replication in the presence of human DNA polymerases η or κ (enzyme: substrate = 2:1), and the primer extension products were analyzed by denaturing PAGE (Figures 2 and 3). These experiments revealed that the presence of reversible 5fC-histone cross-links dramatically reduces the amount of full length primer extension products in reactions catalyzed by human lesion bypass polymerases κ and η (Figure 2). However, kinetic parameters for polymerase bypass could not be established due to the reversible nature of Schiff base histone-DNA linkages (calculated half-life, 1.8 h) (15). Because such imino conjugates cannot be isolated in pure form and can spontaneously dissociate under physiological conditions, the template strands contain a mixture 5fC and 5fC-histone cross-links (15).
Figure 2.

Primer extension assays for hPol η and hPol κ catalyzed bypass of reversible DNA–histone cross-links generated via a Schiff base formation between histone proteins and 5fC in DNA. 32P-labeled 12-mer primer was annealed with 23-mer template containing 5-formyl-dC (A), or 5-formyl-dC conjugated to histone H2A via Schiff base formation (B). Polymerase reactions were initiated by addition of a mixture of hPol η or κ and a mixture of dNTP, quenched at pre-selected time points, and loaded onto 20% denaturing PAGE. Percent bypass was plotted over time (C).

Figure 3.

Primer extension assays for replication bypass of reductively stabilized DNA–protein or DNA–peptide cross-links by hPol η (top panel) and hPol κ (bottom panel). 32P-labeled 12-mer primer was annealed with 23-mer template containing unmodified dC (A), 5-formyl-dC (B), or C5-dC cross-links to 11-mer peptide RPKPQQFFGLM-CONH2 (C) 31-mer peptide YGGFMTSEKSQTPLVTLFKNAIIKNAYKKGE (D), full size histone H2A (E), histone H4 (F), or proteinase K digested histone H4 (G). Polymerase reactions were initiated by addition of DNA polymerases and a mixture of dNTP and quenched at pre-selected time points prior to loading onto 20% denaturing PAGE.

Primer extension assays for hPol η and hPol κ catalyzed bypass of reversible DNA–histone cross-links generated via a Schiff base formation between histone proteins and 5fC in DNA. 32P-labeled 12-mer primer was annealed with 23-mer template containing 5-formyl-dC (A), or 5-formyl-dC conjugated to histone H2A via Schiff base formation (B). Polymerase reactions were initiated by addition of a mixture of hPol η or κ and a mixture of dNTP, quenched at pre-selected time points, and loaded onto 20% denaturing PAGE. Percent bypass was plotted over time (C). Primer extension assays for replication bypass of reductively stabilized DNA–protein or DNA–peptide cross-links by hPol η (top panel) and hPol κ (bottom panel). 32P-labeled 12-mer primer was annealed with 23-mer template containing unmodified dC (A), 5-formyl-dC (B), or C5-dC cross-links to 11-mer peptide RPKPQQFFGLM-CONH2 (C) 31-mer peptide YGGFMTSEKSQTPLVTLFKNAIIKNAYKKGE (D), full size histone H2A (E), histone H4 (F), or proteinase K digested histone H4 (G). Polymerase reactions were initiated by addition of DNA polymerases and a mixture of dNTP and quenched at pre-selected time points prior to loading onto 20% denaturing PAGE. To allow for further mechanistic and kinetic studies, primer extension experiments were repeated using gel purified DPC templates that were stabilized by NaCNBH3 reduction to form irreversible amino linkages (Figure 1). We found that reductively stabilized C5-dC DPC lesions to full size histone proteins completely inhibited DNA replication: no full length products were observed for histone H2A and H4 DPCs (Figure 3E and F). However, Pol κ and Pol η were able to extend the primer opposite smaller cross-links containing 11-mer peptide (RPKPQQFFGLM-CONH2 and a 31-mer peptide (YGGFMTSEKSQTPLVTL FKNAIIKNAYKKGE) (Figure 3C and D). When the protein component of histone H4 DPC was digested to single amino acids using proteinase K, the ability of polymerase η to bypass the lesion was restored (Figure 3G). To determine whether TLS polymerases Pol κ and Pol η replicate 5fC-peptide conjugates in an error free or error prone manner, single nucleotide incorporation assays with individual dNTPs were conducted (see example in Figure 4). PAGE analysis of the extension products has revealed that while hPol η inserted either one of the four dNTPs opposite 11-mer peptide-dC conjugates, pol κ predominantly incorporated an incorrect base, dA (Figure 4). Similar results were obtained for substrates containing a larger cross-link to 31-mer peptide, although incorporation efficiency was reduced (Supplementary Figure S1). These results provided initial evidence that 5fC mediated DNA–peptide lesions interfere with fidelity of TLS DNA polymerases.
Figure 4.

Single nucleotide insertion opposite unmodified dC (control) and 5fC-11 mer peptide cross-links (X = RPKPQQFFGLM-CONH2). template–primer complexes (top) were incubated with hPol η (top panel) or hPol κ (bottom panel) in the presence of 50 μM individual dNTP and then quenched at pre-selected time points.

Single nucleotide insertion opposite unmodified dC (control) and 5fC-11 mer peptide cross-links (X = RPKPQQFFGLM-CONH2). template–primer complexes (top) were incubated with hPol η (top panel) or hPol κ (bottom panel) in the presence of 50 μM individual dNTP and then quenched at pre-selected time points. To determine the catalytic efficiency for incorporation of G, A, C, and T opposite 5fC-peptide conjugates, steady state kinetics experiments were conducted using the same primer-substrate complexes (Figure 3) and hPol η or κ in the presence of individual dNTPs (5, 10, 25, 50, 100, 150, 250, 500 μM). The reaction mixtures were quenched at pre-selected time points (0–30 min). To ensure steady state conditions, 30–80 molar excess of DNA substrate over polymerases was employed, and time points were selected in such a way that < 35% of extension products were observed (see Supplementary Figure S2 for representative gel). Kinetic parameters (kcat, Km,Table 2) were calculated by plotting reaction velocities against the concentration of dNTPs using Michaelis–Menten equation (49).
Table 2.

Steady-state kinetics parameters for single nucleotide incorporation opposite unmodified dC or 11-mer peptide (RPKPQQFFGLM-CONH2) conjugated to C5 position of cytosine by hPol η or κ

PolymeraseTemplateIncoming nucleotide k cat (min–1) K m (μM) k cat/Km (μM–1 min–1) f
hPol ηdCdATP1.54 ± 0.42354 ± 1740.00430.013
dGTP2.50 ± 0.307.5 ± 3.30.331
dCTP0.52 ± 0.11232 ± 980.00220.007
dTTP2.38 ± 0.90427 ± 2760.00560.017
11-mer peptidedATP2.82 ± 0.3259 ± 220.0481.92
dGTP1.50 ± 0.1561 ± 140.0251
dCTP1.20 ± 0.13179 ± 450.00660.26
dTTP1.12 ± 0.08108 ± 210.010.4
hPol κdCdGTP0.075 ± 0.0066.3 ± 2.70.0121
11-mer peptidedATP0.068 ± 0.00880 ± 310.00081

*f (misinsertion frequency) = (kcat/Km)incorrect dNTP/(kcat/Km)correct dNTP.

Steady-state kinetics parameters for single nucleotide incorporation opposite unmodified dC or 11-mer peptide (RPKPQQFFGLM-CONH2) conjugated to C5 position of cytosine by hPol η or κ *f (misinsertion frequency) = (kcat/Km)incorrect dNTP/(kcat/Km)correct dNTP. The kcat/Km for incorporation of correct base (G) opposite unmodified dC and 5dC-11-mer peptide conjugate by hPol η were 0.33 and 0.025 μM−1 min−1, respectively, indicating that the presence of 11-mer peptide conjugate strongly interfered with accurate DNA replication. Furthermore, the specificity constants for insertion of incorrect bases (A, C or T) opposite dC-11-mer peptide conjugate were 2–10 times higher than those observed for unmodified dC, suggesting that this lesion can be mutagenic. Overall, the preference order for nucleotide insertion opposite 5fC-11mer peptide DPC was A > G > T >C. hPol η preferentially incorporated A (incorrect base) opposite the peptide adduct, rather than the correct base (G), with misinsertion frequency (f) of 1.92 (Table 2). In reactions catalyzed by hPol κ, high fidelity was observed when replicating native DNA, incorporating only correct bases (G) opposite C (Table 2). However, only the incorrect base (A) was inserted opposite dC-11-mer peptide conjugate, although the catalytic efficiency was 30- to 100-fold lower as compared to hPol η (Table 2). To gain additional insight into the effects of dC-11-mer peptide cross-links on DNA replication, primer extension products were sequenced by HPLC-ESI-MS/MS (Figure 5A) (25,26,50). In brief, biotinylated primers (Biotin-5′-(T)8 AAC GTC GUG ACT-3′) were annealed to 17-mer oligonucleotides (5′-TTC CXA GTC ACG ACG TT-3′) containing unmodified dC or dC-11-mer peptide crosslinks at position X. The resulting primer–template complexes were incubated with hPol η or κ in the presence of all four dNTPs, and the extension products were captured on streptavidin beads (Figure 5A). The extended primers were site-specifically cleaved with uracil DNA glycosylase/hot piperidine to release 9-mer oligonucleotide products that contain the region of interest and are amenable for sequencing by HPLC–ESI−-MS/MS (Figure 5A). HPLC–ESI−-FTMS analyses of the reaction mixtures using Orbitrap Velos mass spectrometer at R = 60 000 and mass accuracy <10 ppm allowed for detection and relative quantification of primer extension products from peak areas in extracted ion chromatograms (Supplementary Figure S3), while their identities were confirmed by HPLC–ESI−-MS/MS (Figure 5B–D). In control experiments with unmodified DNA, both hPol η and κ preferentially generated error-free primer extension products (5′-pGACTGGGAA-3′, [M–3H]3− = 954.5, Table 3). In contrast, replication bypass of dC-11-mer peptide conjugates by hPol η led to large numbers of C → T mutations (20.1 %) and (-1) deletions (12.4 %). Primer extension in the presence of hPol κ resulted in C →T transitions (13.9 %) and deletions 78.6 %, with only 7.5 % of error-free products (Table 3). No other replication products were detected. Overall, these results are in agreement with our steady-state kinetics data (Table 2), indicating that C5-cytosine conjugated DNA–peptide conjugates induce targeted C →T mutations. In addition, significant numbers of (–1) deletions were detected upon replication of DPC containing template (Table 3).
Figure 5.

HPLC–ESI-MS/MS sequencing of primer extension products via streptavidin capture- nanoHPLC–ESI-MS/MS (A) and MS/MS spectra of primer extension products 5′-pGAC TGG GAA-3′ (B, error free), 5′-pGAC TAG GAA-3′ (C, C → T transition), and 5′-pGAC T_G GAA-3′ (D, –1 deletion).

Table 3.

Sequencing and quantification of primer extension products by hPol η or κ using high resolution HPLC-ESI-MS/MS on Orbitrap Velos

Biotin-5′-(T)8-AA CGT CGU GAC T-3′ 3′-TT GCA GCA CTG AXC CTT-5′
PolymeraseTemplateExtension productPercent productBase opposite XComment
hPol ηdCGACTGGGAA95.6Gerror free
GACTAGGAA1.9AC to T transition
GACT_GGAA2.5_deletion
11-mer peptideGACTGGGAA67.5Gerror free
GACTAGGAA20.1AC to T transition
GACT_GGAA12.4_deletion
hPol κdCGACTGGGAA97.3Gerror free
GACT_GGAA2.7_deletion
11-mer peptideGACTGGGAA7.5Gerror free
GACTAGGAA13.9AC to T transition
GACT_GGAA78.6_deletion
HPLC–ESI-MS/MS sequencing of primer extension products via streptavidin capture- nanoHPLC–ESI-MS/MS (A) and MS/MS spectra of primer extension products 5′-pGAC TGG GAA-3′ (B, error free), 5′-pGAC TAG GAA-3′ (C, C → T transition), and 5′-pGAC T_G GAA-3′ (D, –1 deletion). Sequencing and quantification of primer extension products by hPol η or κ using high resolution HPLC-ESI-MS/MS on Orbitrap Velos

Replication of 5fC DNA–peptide cross-links in human cells

To determine whether 5fC mediated DNA–peptide cross-links can induce mutations in living cells, single stranded pMS2 plasmid containing site-specific 5fC DPC to an 11-mer peptide was constructed using the previously reported strategy (Figure 6A) (51,52). In brief, 23-mer oligonucleotide containing reductively stabilized 11-mer peptide to 5fC (C in Table 1) was enzymatically ligated into a gapped pMS2 plasmid, and the plasmids bearing site-specific DPC lesions were purified by agarose gel. Upon replication of DPC containing plasmid construct in HEK293T cells, we found that the number of progeny was comparable to that from a control plasmid, suggesting that the 5fC-peptide cross-links did not block DNA replication in human cells. However, sequencing analyses revealed that 11–16% of the progeny plasmids contained mutants, which included both targeted and semi-targeted mutations (Figure 6B). Consistent with our in vitro results (Figure 4, Tables 2 and 3), the major type of mutation observed was C → T transitions, which occurred at a frequency of 6.7%. In addition, we noted a low level of targeted deletions (< 2%) and C → G transversions (0.7%). Semi-targeted mutations 3′ and 5′ to the lesion site were observed at 7% frequency, in addition to 3–6 base deletions at 1.7% frequency. Overall, our results of adduct replication in human cells are consistent with the in vitro data (Tables 2 and 3), indicating that both hPol η and κ can insert an incorrect base (A) opposite 5fC mediated DNA–peptide conjugates, leading to C → T mutations.
Figure 6.

(A) Construction of DPC-containing pMS2 plasmid and its replication in HEK 293T cells. (B)Types and frequencies of mutations induced by the DNA–peptide cross-link in HEK293T cells. The data represent the mean and standard deviation of three independent experiments. Targeted C → T, C → G, C deletion, 3–6 base deletions as well as a variety of semi-targeted mutations near the lesion sites.

(A) Construction of DPC-containing pMS2 plasmid and its replication in HEK 293T cells. (B)Types and frequencies of mutations induced by the DNA–peptide cross-link in HEK293T cells. The data represent the mean and standard deviation of three independent experiments. Targeted C → T, C → G, C deletion, 3–6 base deletions as well as a variety of semi-targeted mutations near the lesion sites.

Molecular dynamics

In order to gain structural understanding of the mutagenesis observed for 5fC-11mer peptide cross-links (Tables 2 and 3, Figure 6B), 400 ns molecular dynamics simulations were carried out. To help explain the preferential misinsertion of A opposite 5fC-peptide lesions, we created a molecular model of the 11-mer peptide (RPKPQQFFGLM-CONH2) cross-linked to the cytosine template base in the human DNA polymerase η ternary complex with incoming dATP nucleotide (Figure 1B and Supplementary Figure S4). Our investigation has focused on the C–A+ mismatch structure, in which the N1 atom in the adenine of the dATP incoming nucleotide is protonated (Figure 7A). This wobble base pairing scheme affords two hydrogen bonds, namely C(N3)-A+(N6H2) and C(O2)-A+(N1H) with glycosidic bonds anti in both C and A+ (Figure 7A). Exploratory studies had indicated that a C–A mismatch structure without the N1 atom protonated and hence containing only one hydrogen bond, C(O2)-A(HN6), was much less stable in MD simulations.
Figure 7.

(A) The most representative structure from the 400 ns MD simulation of the pol η-DNA ternary complex, containing the 11-mer peptide cross-linked to the templating C opposite to the N1 protonated incoming dATP. Inset: (left), a zoomed-in view of the 11-mer peptide cross-linked to the templating C opposite the incoming dATP; (right), hydrogen bonding scheme of the C–A+ wobble pair. The protonated hydrogen at N1 in adenine is shown in red. A rotating view is given in the Supplementary Movie (Movie S1). (B) The most representative structure from the 400 ns MD simulation of the pol κ–DNA ternary complex, containing the 11-mer peptide cross-linked to the templating C opposite to the N1 protonated dATP. Inset: a zoomed-in view. A rotating view is given in the Supplementary Movie (Movie S2). In both structures, the complex is displayed in cartoon diagram except for the dATP and the cross-link-containing template C, which are in stick, and Mg2+ ions which are shown as yellow balls. The 11-mer peptide is shown in red surface. The two hydrogen bonds between the template base and incoming nucleotide are shown with dotted lines.

(A) The most representative structure from the 400 ns MD simulation of the pol η-DNA ternary complex, containing the 11-mer peptide cross-linked to the templating C opposite to the N1 protonated incoming dATP. Inset: (left), a zoomed-in view of the 11-mer peptide cross-linked to the templating C opposite the incoming dATP; (right), hydrogen bonding scheme of the C–A+ wobble pair. The protonated hydrogen at N1 in adenine is shown in red. A rotating view is given in the Supplementary Movie (Movie S1). (B) The most representative structure from the 400 ns MD simulation of the pol κ–DNA ternary complex, containing the 11-mer peptide cross-linked to the templating C opposite to the N1 protonated dATP. Inset: a zoomed-in view. A rotating view is given in the Supplementary Movie (Movie S2). In both structures, the complex is displayed in cartoon diagram except for the dATP and the cross-link-containing template C, which are in stick, and Mg2+ ions which are shown as yellow balls. The 11-mer peptide is shown in red surface. The two hydrogen bonds between the template base and incoming nucleotide are shown with dotted lines. Our simulations have revealed a stable (Supplementary Figure S5), nearly undistorted 11-mer peptide cross-linked to C, which fits well on the DNA major groove side and forms a C–A+ mismatched base-pair (Supplementary Figure S6 and Supplementary Table S1). Intriguingly, our single nucleotide incorporation data (Table 2) shows that the insertion of dA opposite the C5-dC cross-linked to the 11-mer peptide is more favorable than dA incorporation opposite unmodified C. Our simulations provide a possible structural explanation of this finding: the C–A+ mismatched base-pair is much more distorted, mainly through buckle, in the case of unmodified C. These structures suggest that the 11-mer peptide anchors the conjugated cytosine templating base to be ideally positioned for the formation of hydrogen bonds between the C and the A+ in the wobble pair. This anchoring is revealed by the inversion of the buckle (i.e. the peptide causes the buckle to invert direction and sign) as shown in Supplementary Figure S6 and Supplementary Table S1. Our experimental data (Table 2) show that nucleotide insertion efficiency in hPol κ is much lower than by hPol η for both control and modified substrates. Furthermore, only mismatched base (A) is inserted opposite the peptide cross-link. To explain this data, we carried out MD simulations in pol κ with N1-protonated dATP, as in the pol η case, opposite to C5-dC cross-linked to 11-mer peptide (Figure 7B), which fits comfortably on the DNA major groove side. Our simulations revealed a stable (Supplementary Figure S5), modestly distorted C–A+ mismatched base-pair (Supplementary Table S1). The C–A+ mismatch is quite well formed, with two hydrogen bonds having occupancies >94 %.

DISCUSSION

Because of their endogenous formation in human cells (15), it is of critical importance to establish the effects of 5fC mediated DNA–histone protein conjugates on DNA replication. Our data presented herein (Figures 2 and 3) indicate that the presence of reductively stabilized histone protein-5fC conjugates completely blocks human lesion bypass polymerases η and κ. These two polymerases were selected for the present study because of their ability to catalyze replication past DNA-polypeptide cross-links (24,25,29) and other bulky DNA adducts (28,51,53), as well as our preliminary data showing that hPol η and κ knockdown in human cells influences replication fidelity (Naldiga and Basu, unpublished observations). Specifically, we previously reported that model DPCs generated by Huisgen cycloaddition reaction between alkyne-containing DNA (at the C5 position of dT) and azide-functionalized protein completely blocked DNA replication (29). Although reversible histone protein-5fC Schiff base lesions also blocked DNA replication in vitro (Figure 2), some polymerase bypass was observed, probably due to transient protein dissociation from DNA. Therefore, endogenous formation of DNA–histone conjugates at 5fC marks is anticipated to interfere with DNA replication, unless a separate cellular mechanism facilitates their bypass and/or removal. Previous studies have revealed that the histone octamer can be released from the parental DNA by the action of histone chaperone FACT (facilitates chromatin transcription) which collaborates with MCM helicase (54,55). The loss of the histone octamer from the parental DNA ahead of the replication fork is accompanied by the dissociation of H3/H4 from H2A/H2B. FACT is a two subunit complex containing SPT16 and SSRP1 (structure specific recognition protein) capable of binding to both H2A/H2B and H3/H4 (56). Future studies will establish whether the same protein chaperones can dissociate reversibly bound histones from DNA in order to prevent mutagenesis and to allow for replication fork progression. Alternatively, the protein component of DPC may be proteolytically degraded by specialized proteases such as Wss1/Spartan to form the corresponding DNA-small peptide conjugates (57,58). Recent reports indicate that Spartan can be recruited to stalled replication forks at the site of DPCs (59). The resulting DNA–peptide cross-links may be repaired by nucleotide excision pathway or be bypassed by translesion synthesis polymerases (9,60). We found that 5fC conjugates to 11-mer and 31-mer peptides can be readily bypassed by DNA polymerases η and κ (Figure 3C and D), although with a lower efficiency as compared with unmodified dC (Table 2). As compared to similar lesions containing a peptide cross-link to 7-deaza-dG or the N2 position of dG (25,26,53), 5fC-peptide lesions represent a weaker block to DNA replication, allowing for their bypass. Although lesion bypass by TLS polymerases avoids the deleterious consequence of stalled replication forks, this damage-tolerance mechanism may be error-prone, and cell survival can be associated with an increased risk of mutagenesis and carcinogenesis (51,61). We found that the presence of reductively stabilized 5fC-peptide lesions caused targeted C → T transitions and deletion mutations both in vitro (Figure 4, Tables 2 and 3) and in human cells (Figure 6B). The mutation frequency was lower in living cells, which is not surprising since human cells are proficient in DNA repair and contain many DNA polymerases that can cooperate with each other using ‘polymerase switching’ (62). Interestingly, in vivo mutagenesis studies also revealed both 3′ and 5′ semi-targeted mutations (Figure 6B), probably a result of peptide interfering with nucleotide incorporation by DNA polymerases. Semi-targeted mutations were also noted in our previous study with DNA–peptide cross-links conjugated to the N7 position of guanine (24). It is not surprising that the results of in vitro and in vivo replication experiments for DNA–peptide adducts differ. In our previous studies, we found that other bulky DNA lesions that block DNA polymerases in vitro only marginally reduce translesion synthesis (TLS) in cells. For example, the C8-dG adduct formed by the carcinogen 3-nitrobenzanthrone was a strong replication block of hpol η, κ, ι and Rev1 in vitro (63), but when replicated in HEK 293T cells, allowed for 80% lesion bypass (64). Likewise, when we investigated the replicative properties of another DNA–peptide crosslink (DpC) in which the peptide was covalently attached to 7-deazaguanine, TLS in HEK293T cells was 76 ± 3% (24) as compared to significant blockage of both hpol η and κ by the same lesion in vitro (26). In a cellular system, multiple DNA polymerases can cooperate with each other to allow for facile bypass of a lesion that blocks single DNA polymerases in vitro. Molecular modeling and molecular dynamics simulations of template–primer complexes revealed that modified C forms a stable mismatch with dATP in the active sites of polymerases η and κ, providing a plausible structural origin for C → T mutations. The peptide is stably positioned on the DNA major groove side when cross-linked to the templating C in both cases. In our model of hPol η ternary complex, the 11-mer peptide anchors the templating conjugated C in the spacious active site, so as to optimize the hydrogen bonding geometry when the peptide is present (Figure 7 and Supplementary Figure S6). The observed preference for incorporation of dATP rather than the correct base (dGTP) opposite C that is cross-linked to 11-mer peptide by hPol η can be explained by the weakened dC-dGTP base pair, so that only one full and two half occupancy hydrogen bonds can be formed (Supplementary Figure S7). Our observation that hPol η is more efficient at bypassing 5fC DNA–peptide cross-links as compared with hPol κ is in agreement with previous experiments performed with other DPCs and structurally related lesions (25,26,65). Structurally, both human (33) and yeast (66) pol η have a large active site cleft with the fingers domain elevated, so that two nucleotides can be accommodated at the same time, notably the two covalently linked thymines in the cis-syn cyclobutane pyrimidine dimer (CPD). In contrast, pol κ has a constricted active site cleft with the fingers lowered onto the templating base, so that an additional nucleotide cannot be accommodated (32) (Supplementary Figure S8A). Furthermore, pol κ possesses the unique L-shaped N-clasp at its N-terminus, composed of short (αN1) and long (αN2) α helices, with the αN2 helical axis diagonally crossing the DNA duplex. With this N-clasp, pol κ encircles the DNA by holding the catalytic core (fingers, palm and thumb subdomains) and little finger together on the major groove side; therefore, the major groove side is occluded (Supplementary Figure S8B). On the other hand, pol κ contains a gap between the catalytic core and little finger on the DNA minor groove side that pol η lacks (Supplementary Figure S8C). The absence of this opening in pol η allows the newly synthesized duplex to retain a stable B-form, ‘splinted’ by the ungapped polymerase (33), even in the presence of a CPD lesion. These structural differences between pol η and pol κ may help explain our experimental data on a molecular level. Overall, the lower nucleotide insertion efficiency of pol κ relative to pol η (Table 2) can be rationalized by its more constricted active site region and the presence of the major groove-occluding N-clasp in pol κ, which present steric obstacles in nucleotide insertion and translocation. In addition to C →T transitions, significant numbers of –1 deletion products were detected by LC–MS-based strategy (Table 3). Deletion mutations were a significant feature in the primer extension experiments for both pol η (12.4%) and pol κ (78.6%) in the presence of the peptide adduct (Table 3). The structure of pol η can provide an insight into this observation. With an active site region that can accommodate the two nucleotides of the CPD lesion (Supplementary Figure S9A) (33), the occurrence of a deletion in the primer extension data can be explained by a mechanism in which the adducted 3′-nucleotide is paired with the incoming dNTP, while the 5′ neighboring base on the templating strand is skipped. This is analogous to the Type II crystal structure of Dpo4 (61), where two nucleotides occupied the spacious Dpo4 active site and the incoming dNTP is aligned with the 5′-templating base, while its 3′-neighbor is skipped (Supplementary Figure S9B). This structure provides a molecular view of a -1 deletion, and has been previously observed for 1, N2-ϵ-guanine (50) and an abasic site (61) in the active site of Dpo4. While no such -1 deletion crystal structure is available for pol η, a similar mechanism is plausible for pol η, in view of the expanded active site similarity. In contrast, pol κ has a more constricted active site and thus must use a different mechanism to induce -1 deletions in the presence of a peptide lesion. For instance, the adducted nucleotide may assume a ‘slipped’ or extrahelical conformation (50,67). Alternatively, slippage may occur at the subsequent nucleotide insertion opposite the adduct (50,67). For example, pol κ has been shown to use a template-slippage deletion mechanism (68). A single bulged out base that is skipped would be responsible for the deletion, but the exact location of the bulge is unknown. For example, in E. coli DNA pol II which does not have an expanded active site region, multiple looped out structures have been noted (69). The generation of one base or two base deletion mutations by TLS polymerases is well documented, especially in bypassing bulky DNA adducts (50). Our primer extension data for pol κ in the presence of the peptide adduct also revealed significant incorporation of the normal incoming dGTP (Table 3). Our simulations with ternary complex-containing dGTP and the 11-mer peptide in pol κ showed a well-aligned C-dGTP Watson–Crick pair, whose extension could explain this observation (Supplementary Figure S6 and Supplementary Table S1). Our modeling studies of C–A mismatch mutations as well as our above discussion of slippage mechanisms consider template-directed nucleotide incorporation; moreover, our work reveals that template-directed dATP incorporation via wobble pairing is feasible with the bulky peptide cross-linked to the templating C. However, it is also possible that some C–A mismatches or deletion events occur via a non-instructive, untemplated mechanism, governed by the ‘A’ rule; this rule determined that ‘damaged nucleotides produce ‘non-instructive’ sites at which normal base pairing was not possible, and at which As were incorporated due to inherent polymerase preference' (70,71). This mechanism is a consideration in the bypass of bulky lesions. Together with our earlier findings (26,29), these results indicate that the cross-linking site with DNA has a strong impact on the biological outcomes resulting from DNA–peptide lesions. C5-Thymidine peptide conjugates examined in our earlier work (25,29) induced high numbers of frameshift mutations and base substitutions, while the TLS bypass of a peptide conjugated to the C7 position of 7-deaza-dG was extremely inefficient, but essentially error-free (26). The C5-dC adducts examined here are less blocking and more mutagenic than C5-dT and 7-deaza-dG lesions examined in these earlier studies, inducing large numbers of C to T transitions (Figure 4 and Table 2). Similarly, Minko et al. have shown that peptide cross-linked to the N position of dG completely blocked DNA replication, while the same peptide at the N position of dA was bypassed, introducing small amounts of mutations (53,72). However, every full size DNA–protein conjugate examined completely blocked DNA replication as shown in (25,26,29) and in Figures 2 and 3E, F. Taken together, our data suggest that reversible conjugation between 5fC in DNA and histone proteins has a profound effect on DNA replication catalyzed by lesion bypass polymerases Pol κ and Pol η. Future studies are needed to establish whether such endogenous DPC contribute to C to T mutations frequently observed at CpG islands within the human genome (73), to test their effects on additional DNA polymerases, and to identify cellular mechanism and protein factors that prevent polymerase stalling of these super-bulky lesions observed at epigenetic marks of DNA.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online. Click here for additional data file.
  69 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Repair and biochemical effects of DNA-protein crosslinks.

Authors:  Hiroshi Ide; Mahmoud I Shoulkamy; Toshiaki Nakano; Mayumi Miyamoto-Matsubara; Amir M H Salem
Journal:  Mutat Res       Date:  2010-12-24       Impact factor: 2.433

3.  Comparison of multiple Amber force fields and development of improved protein backbone parameters.

Authors:  Viktor Hornak; Robert Abel; Asim Okur; Bentley Strockbine; Adrian Roitberg; Carlos Simmerling
Journal:  Proteins       Date:  2006-11-15

4.  DNA adduct bypass polymerization by Sulfolobus solfataricus DNA polymerase Dpo4: analysis and crystal structures of multiple base pair substitution and frameshift products with the adduct 1,N2-ethenoguanine.

Authors:  Hong Zang; Angela K Goodenough; Jeong-Yun Choi; Adriana Irimia; Lioudmila V Loukachevitch; Ivan D Kozekov; Karen C Angel; Carmelo J Rizzo; Martin Egli; F Peter Guengerich
Journal:  J Biol Chem       Date:  2005-06-17       Impact factor: 5.157

5.  Identification of mammalian proteins cross-linked to DNA by ionizing radiation.

Authors:  Sharon Barker; Michael Weinfeld; Jing Zheng; Liang Li; David Murray
Journal:  J Biol Chem       Date:  2005-08-09       Impact factor: 5.157

Review 6.  Chaperoning histones during DNA replication and repair.

Authors:  Monica Ransom; Briana K Dennehey; Jessica K Tyler
Journal:  Cell       Date:  2010-01-22       Impact factor: 41.582

7.  5-Formylcytosine Yields DNA-Protein Cross-Links in Nucleosome Core Particles.

Authors:  Fengchao Li; Yingqian Zhang; Jing Bai; Marc M Greenberg; Zhen Xi; Chuanzheng Zhou
Journal:  J Am Chem Soc       Date:  2017-07-25       Impact factor: 15.419

8.  DNA-reactive protein monoepoxides induce cell death and mutagenesis in mammalian cells.

Authors:  Natalia Y Tretyakova; Erin D Michaelson-Richie; Teshome B Gherezghiher; Jamie Kurtz; Xun Ming; Susith Wickramaratne; Melissa Campion; Sreenivas Kanugula; Anthony E Pegg; Colin Campbell
Journal:  Biochemistry       Date:  2013-04-24       Impact factor: 3.162

9.  Conformational analysis of nucleic acids revisited: Curves+.

Authors:  R Lavery; M Moakher; J H Maddocks; D Petkeviciute; K Zakrzewska
Journal:  Nucleic Acids Res       Date:  2009-07-22       Impact factor: 16.971

10.  Mutational specificity of gamma-radiation-induced guanine-thymine and thymine-guanine intrastrand cross-links in mammalian cells and translesion synthesis past the guanine-thymine lesion by human DNA polymerase eta.

Authors:  Laureen C Colis; Paromita Raychaudhury; Ashis K Basu
Journal:  Biochemistry       Date:  2008-07-11       Impact factor: 3.162

View more
  12 in total

1.  Error-prone replication of a 5-formylcytosine-mediated DNA-peptide cross-link in human cells.

Authors:  Spandana Naldiga; Shaofei Ji; Jenna Thomforde; Claudia M Nicolae; Marietta Lee; Zhongtao Zhang; George-Lucian Moldovan; Natalia Y Tretyakova; Ashis K Basu
Journal:  J Biol Chem       Date:  2019-05-28       Impact factor: 5.157

Review 2.  Protein Interactions at Oxidized 5-Methylcytosine Bases.

Authors:  Gerd P Pfeifer; Piroska E Szabó; Jikui Song
Journal:  J Mol Biol       Date:  2019-08-08       Impact factor: 5.469

3.  Transcriptional Bypass of DNA-Protein and DNA-Peptide Conjugates by T7 RNA Polymerase.

Authors:  Shaofei Ji; Jenna Thomforde; Colette Rogers; Iwen Fu; Suse Broyde; Natalia Y Tretyakova
Journal:  ACS Chem Biol       Date:  2019-10-21       Impact factor: 5.100

4.  5-Formylcytosine-induced DNA-peptide cross-links reduce transcription efficiency, but do not cause transcription errors in human cells.

Authors:  Shaofei Ji; Daeyoon Park; Konstantin Kropachev; Marina Kolbanovskiy; Iwen Fu; Suse Broyde; Maram Essawy; Nicholas E Geacintov; Natalia Y Tretyakova
Journal:  J Biol Chem       Date:  2019-10-09       Impact factor: 5.157

5.  Site-Specific 5-Formyl Cytosine Mediated DNA-Histone Cross-Links: Synthesis and Polymerase Bypass by Human DNA Polymerase η.

Authors:  Suresh S Pujari; Mingxuan Wu; Jenna Thomforde; Zhipeng A Wang; Christopher Chao; Noelle M Olson; Luke Erber; William C K Pomerantz; Philip Cole; Natalia Y Tretyakova
Journal:  Angew Chem Int Ed Engl       Date:  2021-11-16       Impact factor: 15.336

6.  Reactivity of the Major Product of C5'-Oxidative DNA Damage in Nucleosome Core Particles.

Authors:  Anup Rana; Kun Yang; Marc M Greenberg
Journal:  Chembiochem       Date:  2019-01-10       Impact factor: 3.164

7.  Participation of Histones in DNA Damage and Repair within Nucleosome Core Particles: Mechanism and Applications.

Authors:  Mengtian Ren; Marc M Greenberg; Chuanzheng Zhou
Journal:  Acc Chem Res       Date:  2022-03-10       Impact factor: 22.384

8.  Translesion Synthesis Past 5-Formylcytosine-Mediated DNA-Peptide Cross-Links by hPolη Is Dependent on the Local DNA Sequence.

Authors:  Jenna Thomforde; Iwen Fu; Freddys Rodriguez; Suresh S Pujari; Suse Broyde; Natalia Tretyakova
Journal:  Biochemistry       Date:  2021-06-03       Impact factor: 3.321

9.  A programmable DNA roadblock system using dCas9 and multivalent target sites.

Authors:  Emily K Matozel; Stephen Parziale; Allen C Price
Journal:  PLoS One       Date:  2022-05-06       Impact factor: 3.752

10.  Squaramate-Modified Nucleotides and DNA for Specific Cross-Linking with Lysine-Containing Peptides and Proteins.

Authors:  Ivana Ivancová; Radek Pohl; Martin Hubálek; Michal Hocek
Journal:  Angew Chem Int Ed Engl       Date:  2019-08-13       Impact factor: 15.336

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.