| Literature DB >> 23303792 |
Paolo Rossi1, Christopher M Barbieri, James M Aramini, Elisabetta Bini, Hsiau-Wei Lee, Haleema Janjua, Rong Xiao, Thomas B Acton, Gaetano T Montelione.
Abstract
Single-stranded DNA (ssDNA) binding proteins are important in basal metabolic pathways for gene transcription, recombination, DNA repair and replication in all domains of life. Their main cellular role is to stabilize melted duplex DNA and protect genomic DNA from degradation. We have uncovered the molecular function of protein domain family domain of unknown function DUF2128 (PF09901) as a novel ssDNA binding domain. This bacterial domain strongly associates into a dimer and presents a highly positively charged surface that is consistent with its function in non-specific ssDNA binding. Lactococcus lactis YdbC is a representative of DUF2128. The solution NMR structures of the 20 kDa apo-YdbC dimer and YdbC:dT(19)G(1) complex were determined. The ssDNA-binding energetics to YdbC were characterized by isothermal titration calorimetry. YdbC shows comparable nanomolar affinities for pyrimidine and mixed oligonucleotides, and the affinity is sufficiently strong to disrupt duplex DNA. In addition, YdbC binds with lower affinity to ssRNA, making it a versatile nucleic acid-binding domain. The DUF2128 family is related to the eukaryotic nuclear protein positive cofactor 4 (PC4) family and to the PUR family both by fold similarity and molecular function.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23303792 PMCID: PMC3575825 DOI: 10.1093/nar/gks1348
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Solution NMR structure of L. lactis apo-YdbC shown in the identical top-view orientation. (A) Stereoview of dimeric YdbC with labelled secondary structure elements and amino termini. (B) ConSurf (29,30) amino acid conservation mapped onto the lowest energy NMR structure. Highly conserved residues are labelled on the protein backbone of a single protomer. (C) Solvent exposed electrostatic potential (32) mapped onto the surface of apo-YdbC. Only the ssDNA-binding epitope is shown for clarity.
Summary of NMR Structural Statistics for apo-YdbC and YdbC:dT19G1 ensembles
| Data Type | apo | holo |
|---|---|---|
| Completeness of resonance assignments | ||
| Backbone (%) | 97 | 91 |
| Side-chain (%) | 93 | 93 |
| Aromatic (%) | 100 | 100 |
| Stereospecific methyl (%) | 100 | 100 |
| Conformationally restricting restraints | ||
| NOE restraints | ||
| Total | 3142 | 2258 |
| Intra-residue ( | 673 | 362 |
| Sequential (| | 819 | 562 |
| Medium range (1 < | | 480 | 288 |
| Long range (| | 1170 | 1046 |
| NOE restraints/residue | 42 | 14 |
| Interchain protein/protein NOEs | 244 | 184 |
| Interchain protein/ssDNA NOEs | 254 | |
| Dihedral angle restraints | 330 | 446 |
| Hydrogen bond restraints | 128 | 120 |
| NH RDC restraints (polyethylene glycol (PEG)+phage) | 222 | |
| Number of restraints/residue (total/long range) | 48/16.6 | 17.4/6.5 |
| Residual constraint violations | ||
| Average distance restraint violations/structure | ||
| 0.1–0.2 Å | 19.8 | 30.9 |
| 0.2–0.5 Å | 3.6 | 11.7 |
| >0.5 Å | 0.0 | 0.3 |
| Average RMS of distance violation/restraint (Å) | 0.02 | 0.03 |
| Maximum distance violation (Å) | 0.45 | 0.61 |
| Average RMS dihedral angle violations/structure | ||
| >1°–10° | 18.1 | 40.1 |
| >10° | 0.4 | 1.35 |
| Average RMS dihedral angle violation/restraint | 1.0 | 1.1 |
| Maximum dihedral angle violation (°) | 11.0 | 20.5 |
| Model quality | ||
| RMSD from average coordinates (Å) | ||
| All backbone atoms (ordered/all) | 0.6/1.7 | 1.0/1.4 |
| All heavy atoms (ordered/all) | 0.9/2.3 | 0.4/0.4 |
| RMSD bond lengths (Å) | 0.018 | 0.004 |
| RMSD bond angles (°) | 1.3 | 0.7 |
| Molprobity Ramachandran plot | ||
| Most favoured regions (%) | 95.7 | 90.9 |
| Additionally allowed regions (%) | 4.2 | 9.1 |
| Disallowed regions (%) | 0.1 | 0.0 |
| Global quality scores (Raw/ | ||
| Procheck G-factor (ϕ,ψ) | −0.47/−1.53 | −0.55/−1.85 |
| Procheck G-factor (all dihedrals) | −0.18/−1.06 | −0.43/−2.54 |
| Verify3D | 0.38/−1.28 | 0.39/−1.12 |
| ProsaII | 0.40/−1.03 | 0.53/−0.50 |
| MolProbity clashscore | 14.52/−0.97 | 21.6/−2.18 |
| RPF scores | ||
| Recall/Precison | 86.8/89.1 | |
| F measure/DP score | 87.9/71.8 | |
| Residual Dipolar Couplings (RDC) Scores | ||
| Q-factor (PEG/phage) | 0.20/0.18 | |
| R (PEG/phage) | 0.97/0.98 |
aStructural statistics were computed for the ensembles of 20 deposited structures (PDB ID: 2ltd and 2ltt) using PSVS (23).
bComputed for residues 1–74. Resonances that were not included were exchangeable protons (N-terminal NH3+, Lys NH3+, Arg NH2, Cys SH, Ser/Thr/Tyr OH) and Pro N, C-terminal carbonyl, side-chain carbonyl and non-protonated aromatic carbons.
cAverage distance constraints were calculated using the sum of r−6.
dOrdered residue ranges [S(ϕ) + S(ψ) > 1.8]:3–74 (chain A), 3–74 (chain B). Secondary structure elements APO: 7–19 (β1, β1′), 22–32 (β2, β2′), 37–44 (β3, β3′), 51–57 (β4, β4′), 59–72 (α1, α1′). Secondary structure elements HOLO: 7–17 (β1, β1′), 24–32 (β2, β2′), 36–44 (β3, β3′), 55–57 (β4, β4′), 59–72 (α1, α1′).
eRPF scores (38) reflecting the goodness-of-fit of the final ensemble of structures (including disordered residues) to the NOESY data and resonance assignments.
fResidual dipolar coupling quality scores (24).
Figure 2.(A) Structure-based sequence alignment (26,27) of L. lactis YdbC (DUF2128; PF09901), H. sapiens PC4 (PF02229) and B. burgdorferi PUR-α (DUF3276; PF11680). (Top) Sequence alignment rendered by ESPript (42) using default parameters for residue similarity calculations, where boxed residues represent identical (red box, white character) and similar (red character) amino acid conservation. (Bottom) Sequence alignment rendered using ConSurf (29,30) where residue conservation across individual protein domain families range from highly conserved (magenta) to variable (cyan). (B) Comparison of the solution NMR structure of L. lactis YdbC with crystal structures structurally similar apo-forms of dimeric ssDNA-binding proteins, H. sapiens PC4 (PDB ID: 1pcf) (43) and B. burgdorferi PUR-α (PDB ID: 3nm7) (8).
Figure 3.NMR characterization of poly-dT binding to L. lactis YdbC. (A) CSPs (Δδcomp) histogram. The bottom panel shows colour-coded residues defined according to the magnitude of the deviation from the mean CSP (green dotted line); yellow dotted line: mean + 1σ; red dotted line: mean + 2σ. The CSPs are mapped onto the apo-YdbC structure in tube representation. (B) {1H}-15N heteronuclear NOE difference (ΔhetNOE) between ssDNA-bound and apo-YdbC. The histogram (bottom panel) shows colour-coded residues defined according to magnitude of the deviation from the mean ΔhetNOE (cyan dotted line); purple dotted line: mean + 1σ; magenta dotted line: mean + 2σ. The ΔhetNOEs are mapped onto the apo-YdbC structure in tube representation with the same colouring scheme.
Figure 4.Solution NMR structure of YdbC:dT19G1 complex. (A) Cartoon stereoview with labelled β4 dimer interface element and structured ssDNA segments and their termini. (B and C) Top and side view of complex with labelled and coloured dT bases (T1–T7). For visual clarity, one side has been greyed out. (D) Detailed view of each dT:protein interaction sites for dT1–dT7. Residues showing hydrophobic interactions <5 Å have been included. Dashed lines represent H-bond interactions within typical range (2.7–3.1 Å). Base–base stacking between dT4 and dT5 was found; protein aromatic to base stacking was present between Trp23 and dT2 and Trp32 and dT5.
Figure 5.ssDNA-binding profiles for YdbC at 25° C and 150 mM of NaCl. (Top) Thermal power versus time with legend added for clarity. ITC thermograms for the injection of 220 µM YdbC into 8-µM solutions of d(AC)10 (green), dA20 (blue), dT19G1 (black) and dC20 (red). Each heat burst curve corresponds to the injection of 2 µl of a solution of YdbC into a solution of the ssDNA oligo. (Bottom) Injection heat versus YdbC/ssDNA ratio. The thermograms in the top panel were integrated to create the binding isotherms with the same colour-coding as in the top panel. The binding isotherms were fit (solid lines) with models for one [d(AC)10 and dA20] or two (dT19G1 and dC20) sets of binding sites. Top and bottom panels use identical colour-coding.
ITC-derived parameters for the binding of YdbC to selected 20mer oligonucleotides
| Oligonucleotide | Binding site | Δ | Δ | Δ | ||
|---|---|---|---|---|---|---|
| dT19G1 | 1 | (1.6 ± 0.6) × 10−8 | −10.6 ± 0.3 | −10.1 ± 0.1 | 1.8 ± 1.3 | 2.2 ± 0.1 |
| 2 | (7.7 ± 3.0) × 10−6 | −7.0 ± 0.3 | −4.3 ± 0.6 | 9.0 ± 3.0 | 2 fixed | |
| dC20 | 1 | (1.1 ± 0.6) × 10−8 | −10.8 ± 0.4 | −7.8 ± 0.1 | 10.2 ± 1.7 | 2.2 ± 0.1 |
| 2 | (1.7 ± 0.9) × 10−6 | −7.9 ± 0.4 | −1.6 ± 0.2 | −21.0 ± 2.0 | 2 fixed | |
| dA20 | 1 | (1.1 ± 0.6) × 10−5 | −6.8 ± 0.5 | −1.8 ± 0.6 | 16.6 ± 3.6 | 2.1 ± 0.5 |
| d(A–C)10 | 1 | (3.9 ± 0.5) × 10−8 | −10.1 ± 0.1 | −10.3 ± 0.1 | −0.6 ± 0.6 | 2.0 ± 0.1 |
The ITC profiles shown in Figure 5 were fit with models for either one [dA20 and d(A−C)10] or two (dT19G1 and dC20) independent sets of binding sites. All parameters were allowed to float during the fitting routines except for values of n for site 2 in dT19G1 and dC20, which were manually varied to yield the best fit (as reflected by minimization of χ2). The indicated uncertainties in the fitted values reflect the standard deviation of the experimental data from the fitted curves. Values for ΔG and ΔS were calculated using the standard formalisms containing the maximum errors as carried through the equations.
Figure 6.Neighbour-joining tree of YdbC homologues compared with sequences within the PC4 and Pur-α families. Sequence accession (GenBank ID) numbers are in parenthesis. Sequences and DUF2128 in bold to highlight significance to this study. The PC4 homologue of Desulfobacca acetoxidans was used as the outgroup. Bootstrap values >50 are shown. Bar indicates 0.1 substitutions per amino acid position.