| Literature DB >> 34723130 |
Siyu Chen1,2, James P Lees-Miller3, Yuan He1,2, Susan P Lees-Miller3,4.
Abstract
DNA-dependent protein kinase catalytic subunit DNA-PKcs/PRKDC is the largest serine/threonine protein kinase of the phosphatidyl inositol 3-kinase-like protein kinase (PIKK) family and is the most highly expressed PIKK in human cells. With its DNA-binding partner Ku70/80, DNA-PKcs is required for regulated and efficient repair of ionizing radiation-induced DNA double-strand breaks via the non-homologous end joining (NHEJ) pathway. Loss of DNA-PKcs or other NHEJ factors leads to radiation sensitivity and unrepaired DNA double-strand breaks (DSBs), as well as defects in V(D)J recombination and immune defects. In this review, we highlight the contributions of the late Dr. Carl W. Anderson to the discovery and early characterization of DNA-PK. We furthermore build upon his foundational work to provide recent insights into the structure of NHEJ synaptic complexes, an evolutionarily conserved and functionally important YRPD motif, and the role of DNA-PKcs and its phosphorylation in NHEJ. The combined results identify DNA-PKcs as a master regulator that is activated by its detection of two double-strand DNA ends for a cascade of phosphorylation events that provide specificity and efficiency in assembling the synaptic complex for NHEJ.Entities:
Keywords: Cryo-EM; DNA-PKcs; Non-homologous end joining; Phosphorylation; Structural biology
Year: 2021 PMID: 34723130 PMCID: PMC8549938 DOI: 10.1007/s42764-021-00047-w
Source DB: PubMed Journal: Genome Instab Dis ISSN: 2524-7662
Fig. 1Structure of DNA-PKcs. A Schematic of DNA-PKcs showing the N-HEAT domain (residues 1–891, light blue), the M-HEAT domain (residues 892–2800, light green), the FAT domain (residues 2801–3579, purple), the kinase domain (3580–4099, yellow) and the FATC domain (4100–4128, orange). Also shown is the position of the conserved forehead domain (892–1289, bright green) and the NUC194 domain (residues 1815–2202, dark green), the YRPD motif (residues 2772–2784, dark blue), the FRB domain (residues 3582–3675, bright blue) and the positions of the PQR, ABCDE, S3205 and T3950 phosphorylation sites (red triangles). B Two views of DNA-PKcs (from PDB 7LT3) (Chen et al., 2021a), rotated by 90°, colored as in A. The YRPD motif is shown in dark blue, most clearly visible in the side view, indicated by an arrow. See Lees-Miller et al., (2020) for conservation of amino acids in the forehead and NUC194 domains in DNA-PKcs from eukaryotes
Fig. 2Model for NHEJ. A IR induces DSBs, often with damaged ends that contain non-ligatable end groups (stars). B A Ku heterodimer (composed of Ku70, orange; Ku80, red) binds to each end of the DSB and C recruits DNA-PKcs (shown with head /kinase-FAT-FATC domain in pink and circular cradle in green) to form a synaptic complex. D The Ku complex also recruits a single homodimer of XLF (dark blue) that interacts via its head domain with two XRCC4 homodimers (light blue). The coiled-coil domain of XRCC4 interacts with tandem BRCT domains in the C-terminal region of DNA ligase IV (black). This forms a long-range synaptic complex as reported in Chen et al. (2021a). E Autophosphorylation of DNA-PKcs at ABCDE and possibly other sites, likely in trans leads to a conformational change that causes release of the DSB ends by DNA-PKcs. F autophosphorylated DNA-PKcs dissociates from the complex providing access to processing enzymes such as PNKP, Artemis and DNA polymerases mu and lambda which remove non-ligatable end groups and fill in gaps (G). H and I each DNA ligase IV is tethered to an XRCC4 homodimer through its C-terminal BRCT domains while the N-terminal catalytic domain is attached via a flexible linker region. This allows the catalytic domains to access the DSB ends while Ligase IV remains tethered to the synaptic complex. Each DNA Ligase IV repairs a single-strand break so that the two breaks are repaired sequentially. J The break is repaired restoring genome integrity. G shows additional NHEJ factors such as PAXX, APLF, WRN, Cyren/MRI and long non-coding RNA which may also be involved. The precise order of recruitment and dissociation of the NHEJ proteins is unknown
Fig. 3Model for long-range synaptic complex. The long-range synaptic complex (PDB: 7LT3) (Chen et al., 2021a) was generated using ChimeraX 1.1.1 (Pettersen et al., 2021). The N-HEAT domain is shown in light blue, the M-HEAT in green and the kinase domain in red. The FAT domain is in pink and the FATC in tan. dsDNA is shown in dark orange. The Ku heterodimer, XLF and XRCC4-Lig4 are shown in grey. The structure on the right is rotated by 90° to reveal the relative positions of the kinase domains (red) and circular cradles (green) in the opposing DNA-PKcs molecules. The location of the ATM-dependent phosphorylation site S3205 is shown in yellow
Fig. 4Cartoon showing regulatory sequences located in DNA-PKcs between 2500 and 2800. A Schematic showing the region of DNA-PKcs between amino acids 2500 and 2800 that regulates the synaptic interface. The dimerization loop (residues 2569–2585), the YRPD-interacting motif (2586–2604), the DNA End Binding (DEB) helix (2736–2767) with DEB-appended (DEB-A, residues 2724–2730) and a short linker (2731–2735), and the YRPD motif (2722–2784) are from Chen et al. (2021a). DEB-A, the linker and 37 amino acid DEB form a basic surface that interacts directly with the ends of the DSB (Chen et al., 2021a). B Close up view of interaction interface between two DNA-PKcs molecules, labeled DNA-PKcs-A and DNA-PKcs-B, showing the forehead loop (residues 896–903, green), the dimerization loop (2569–2585, purple), the YRPD-I (residues 2586–2604, yellow), the YRPD (2774–2784, red) and the DEB-A, linker DEB (2724–2767, blue). The position of the DEB is maintained through interaction of the YRPD and the YRPD-I, shown as yellow and red beta sheets respectively, the dimerization loop and the forehead loop. The disordered loop containing the ABCDE autophosphorylation sites (residues 2605–2721) is absent from the structures and is not shown
Fig. 5Interaction of the C-terminal region of Ku80 (residues 725–731) with the NUC194 domain of DNA-PKcs. The amino acid sequence VDDLLDM at the extreme C-terminus of Ku80 interacts directly with DNA-PKcs (Falck et al., 2005). In the long-range complex (Chen et al., 2021a), this sequence forms an alpha-helix (purple) that interacts with the NUC194 domain (residues 1815–2202) of DNA-PKcs. Conserved residues within the NUC194 domain of vertebrate model organisms are shown in magenta and variable amino acids are shown in teal
Fig. 6Conservation of the C-terminal region of Ku80 in vertebrates and multiple invertebrates and plants that also contain DNA-PKcs, but not in Dikarya yeast, C. elegans and flowering plants that do not. Multiple protein sequence alignment of Ku80 was carried out using Clustal Omega and conserved amino acids were colored manually using the following criteria: acidic amino acids (D, E) are highlighted in magenta; basic amino acids (R, K, H) are highlighted in blue; small polar amino acids (T, S, G, C) are in green; aliphatic amino acids (A, L, M, I, V) are in yellow; large polar amino acids (Q, N) are in red; aromatic amino acids (F, W, Y) are in red with yellow highlight; and proline (P) is in grey. Conservation of small amino acids (A, S, T, C, V, G) is illustrated by yellow letters with dark green highlight. In bold are the amino acids present in the structure of the long-range complex (DVDDLLDM) that interact with DNA-PKcs. Organisms and accession numbers for the sequences shown are as follows: PLASMODIOPHORE (Club root pathogen), Plasmodiophora brassicae, CEO94558.1; HONDAEA (Protist) Hondaea fermentalgiana, BG27299.1; APHANOMYCESa (Crayfish plague mold) Aphanomyces astaci, XP_009833210.1; STYLONYCHIA (Ciliate) Stylonychia lemnae, CDW74323.1; NAEGLERIA (Amoeba/flagellate), Naegleria gruberi strain, XP_002672413.1; BARLEY, Hordeum vulgare, AEO86624.1; ARABIDOPSIS, Arabidopsis thaliana, NP_564520.1; SOYBEAN, Glycine max, XP_003524779.1; GREEN_ALGAE, Klebsormidium nitens, GAQ85406.1; LIVERWORT, Marchantia polymorpha, PTQ30121.1; DICTYOSTELIUM, Dictyostelium purpureum, XP_637846.1; S_CEREVISIAE, Saccharomyces cerevisiae, NP_013824.1; S_POMBE, Schizosaccharomyces pombe, Q9HGM8.1; NEUROSPORA, (Bread mold), Neurospora crassa, AFM68948.1; NEOCALLIMASTIX (Anaerobic gut fungi), Neocallimastix californiae, ORY73184.1; SPIZELLOMYCES (Chytridomycota fungus), Spizellomyces punctatus, KND04299.1; MONOSIGA, (Choanoflagellate), Monosiga brevicollis, XP_001750285.1; CAPSASPORA (Amoeboid protist), Capsaspora kowczarzaki, KJE95979.1; SPONGE, Amphimedon queenslandica, XP_019851059.1; HYDRA, Hydra vulgaris, XP_012555181.1; SOFT_CORAL, Dendronephthya gigantea, XP_028404243.1; FLATWORM, Macrostomum lignano, PAA63650.1; C_ELEGANS, Caenorhabditis elegans, CAA83623; TRICHINELLA, Parasitic roundworm, Trichinella pseudospiralis, KRZ25527.1; ROTIFER, Brachionus plicatilis, RNA39402.1; SILKWORM, Bombyx mori, XP_037875756.1; THRIP, Frankliniella occidentalis, XP_026281764.1; DROSOPHILA, Drosophila melanogaster, NP_609767.2; ROBBERFLY, Eutolmus rufibarbis, SRR5185497; STABLEFLY, Stomoxys calcitrans, XP_013115123.1; MOSQUITO, Aedes aegypti, XP_001657128.2; TERMITE, Cryptotermes secundus, XP_023718673.1; DAPHNIA, Daphnia magna, XP_032789667.1; TICK, Ixodes scapularis, XP_002405506.2; PRIAPULA, Penis worm, Priapulus caudatus, XP_014668438.1; MILLIPEDE, Craspedosoma sp., GERS01021842.1; CENTIPEDE, Himantarium gabrielis, GCIL01016305.1; CAPITELLA, Annelid worm, Capitella teleta, ELU08335.1; LINGULA, Brachiopod, Lingula anatine, XP_013403161.1; LIMPET, Lottia gigantea, XP_009064003.1; SEA URCHIN, Strongylocentrotus purpuratus, XP_788472.3; STARFISH, Acanthaster planci, XP_022103565.1; LANCELET, Branchiostoma belcheri, XP_019618779.1; HEMICHORDATE, Acorn worm, Saccoglossus kowalevskii, XP_006822840.1; LAMPREY, Petromyzon marinus, xp_032818808; SHARK, Prionace glauca, GFYY0116112.1; DANIO, Danio rerio, NP_001017360.2; GAR, Lepisosteus oculatus, XP_015214714.1; XENOPUS, Xenopus laevis, BAA76954.1; PLATYPUS, Ornithorhynchus anatinus, XP_028928834.1; TASMANIAN_DEVIL, Sarcophilus harrisii, XP_031813944.1; OSTRICH, Struthio camelus australis; XP_009666109.1; SNAKE, Pseudonaja textilis, XP_026563791.1. The names of organisms that do not contain DNA-PKcs are shown in red font. All others contain DNA-PKcs. See (Lees-Miller et al., 2020) for details. The human sequence is shown at the top and bottom in yellow highlight for comparison