| Literature DB >> 34657170 |
Charles G Bailey1,2,3, Shailendra Gupta4,5, Cynthia Metierre1,2, Punkaja M S Amarasekera1,2, Patrick O'Young1,2, Wunna Kyaw1,2, Tatyana Laletin1,2, Habib Francis1,2,3, Crystal Semaan1,2, Mehdi Sharifi Tabar1,2,3, Krishna P Singh4, Charles G Mullighan6, Olaf Wolkenhauer4,5,7, Ulf Schmitz2,3,8, John E J Rasko9,10,11.
Abstract
CCCTC-binding factor (CTCF) plays fundamental roles in transcriptional regulation and chromatin architecture maintenance. CTCF is also a tumour suppressor frequently mutated in cancer, however, the structural and functional impact of mutations have not been examined. We performed molecular and structural characterisation of five cancer-specific CTCF missense zinc finger (ZF) mutations occurring within key intra- and inter-ZF residues. Functional characterisation of CTCF ZF mutations revealed a complete (L309P, R339W, R377H) or intermediate (R339Q) abrogation as well as an enhancement (G420D) of the anti-proliferative effects of CTCF. DNA binding at select sites was disrupted and transcriptional regulatory activities abrogated. Molecular docking and molecular dynamics confirmed that mutations in residues specifically contacting DNA bases or backbone exhibited loss of DNA binding. However, R339Q and G420D were stabilised by the formation of new primary DNA bonds, contributing to gain-of-function. Our data confirm that a spectrum of loss-, change- and gain-of-function impacts on CTCF zinc fingers are observed in cell growth regulation and gene regulatory activities. Hence, diverse cellular phenotypes of mutant CTCF are clearly explained by examining structure-function relationships.Entities:
Keywords: CTCF; Cancer; Gain-of-function; Loss-of-function; Molecular docking; Molecular dynamics; Somatic mutation; Zinc finger
Mesh:
Substances:
Year: 2021 PMID: 34657170 PMCID: PMC8629902 DOI: 10.1007/s00018-021-03946-z
Source DB: PubMed Journal: Cell Mol Life Sci ISSN: 1420-682X Impact factor: 9.261
Fig. 1Distribution and impact of CTCF somatic mutations in cancer. A The landscape of somatic mutations (above) and SNPs (below) occurring in CTCF: the distribution and frequency within the coding region is shown, recurrent somatic mutations (occurring 10 times) are labelled. For a curated list of non-redundant CTCF mutations from cancer genome sequencing studies (TCGA, COSMIC) and published studies see Supplementary Table 1. CTCF mutation type (B); and tissue distribution (C) are shown; n = total number of mutations. D Analysis of cancer-related somatic missense variants and missense SNPs occurring in each domain of CTCF (N=N-terminus; Z=ZF domain; C=C-terminus). The expected occurrence was calculated from the total number with the proportion of missense variants expected in each domain if they were evenly distributed. The observed/expected ratio confirms if there is a de-enrichment (< 1.0) or an enrichment (> 1.0) of non-synonymous changes. E Frequency of somatic missense mutations occurring in specific ZFs of CTCF, the mean for all ZFs is shown (dotted line). F Sequence logo of all 11 aligned CTCF ZFs; numbers (− 6 to + 6) indicate co-ordinates within the DNA-binding portion of the ZF. Similar amino acids are coloured: black—hydrophobic (G, A, V, I, L, P, W, F, M); green—polar (S, T, Y, C); purple—polar amide (Q, N); blue—basic (K, R, H); and red—acidic (D, E). The height of each amino acid residue is proportional to its observed frequency. The overall height of each letter ‘stack’ is proportional to the sequence conservation, shown in bits. G Frequency of missense somatic mutations at each ZF position; the mean for all ZFs is shown (dotted line). Data represent the mean ± SD with statistical analysis performed using the Chi-square test (*p < 0.05; **p < 0.01; ****p < 0.0001)
Fig. 2Functional impacts of CTCF ZF missense mutations. A Published and unpublished missense CTCF mutations (red circles) occurring in acute lymphoblastic leukaemia (L309P, R339Q, R337H, G420D—highlighted in black) superimposed on a C2H2 ZF structure: C = cysteine, H = histidine, Zn = Zn2+ ion; R339W (underlined) is a previously characterised change-of-function mutation used as a control. The Polyphen score for each mutation is indicated. Numbers (− 6 to + 6) indicate co-ordinates within the DNA-binding portion of the ZF; residues directly contacting DNA at positions − 1, + 2, + 3 and + 6 are indicated (white ring). B Western blot of WT and mutant CTCF expression in transduced K562 cells; anti-HA antibody detects ectopic CTCF, CTCF antibody detects total CTCF; GAPDH is a loading control; size markers indicate MW in kDa. C Immunofluorescence of HA-tagged WT and mutant CTCF in K562 cells using anti-HA antibody, scale bar = 5 μm. D, E Functional assays of CTCF mutants in K562 cells including: D MTT proliferation; and E colony forming assay in Methocult. Data represent the mean ± s.e.m for 3 experiments each performed in triplicate. Statistical analysis was performed using a one-way ANOVA with Tukey’s multiple comparisons test for pairwise comparisons between control, WT and mutant (ns = not significant; *p < 0.05; **p < 0.01; ****p < 0.0001)
Fig. 3CTCF ZF mutants reduce DNA binding and transcriptional regulation. A CTCF DNA binding assay performed with in vitro transcribed and translated CTCF protein (WT and mutants) and a biotinylated dsDNA probe representing the core CTCF binding site. Eluted and input samples were probed for CTCF protein by Western blot; numbers at the bottom indicate band densitometric values after normalisation to input. B, C Control (eGFP alone), CTCF WT- or mutant-containing lentivector plasmids were transfected into HEK293T cells for 48 h. B Representative Western blots (of 3 replicates) indicating ectopic (HA-tagged) CTCF, total CTCF and GAPDH loading control after transfection of HEK293T cells. C GFP mean fluorescence intensity (MFI) detected after 48 h and normalised to eGFP empty vector control set as 1.0. Data represent the mean ± s.e.m for 3 experiments each performed in triplicate except for the Western blots which are only single replicates. Statistical analysis was performed using a one-way ANOVA with Tukey’s multiple comparisons test for pairwise comparisons between control, WT and mutant (ns = not significant; **p < 0.01; ***p < 0.001; ****p < 0.0001)
Fig. 4Differential DNA binding exhibited by ZF-mutant CTCF. ChIP-PCR of HA-tagged WT and mutant CTCF expressed in K562 cells; L = 100 bp ladder, input is total genomic DNA before ChIP, Control = eGFP empty vector. Diverse CTCF sites were examined: including A archetypal CTCF sites; B the C-MYC locus; C enhancers; D insulators; and E TAD boundaries. Where relevant, the genomic distance from the TSS is indicated in brackets. The SLC7A1 androgen response element (ARE) was used a negative control for CTCF binding. See Supplementary Table 4 listing references for known CTCF sites and their chromosomal locations
Fig. 5Homology modelling of CTCF ZF mutations. A CTCF ZF residues impacted by somatic mutation are depicted on the crystal structure model of ZFs 2–7 in association with DNA. Zinc molecules are shown as grey spheres. B Net change in bonds in individual CTCF ZFs following specific ZF mutations. Data shows the mean ± SD. C–G Overlay images of the normal (WT, grey) and mutant (blue) residues superimposed on the CTCF crystal structure: C L309P (L grey, P dark blue); D R339Q (R grey, Q cyan); E R339W (R grey, W green); F R377H (R grey, H magenta); G G420D (G grey, D red). Dashed lines indicate hydrogen bond pairing: old (grey) and new (green). DNA bases and their position relative to the 5′ end of the CTCF consensus are shown
Consequences of CTCF ZF mutations on DNA binding and protein folding
| Number of ZF/DNA bonds predicted from molecular docking of WT and mutant CTCF ZF | |||||||
|---|---|---|---|---|---|---|---|
| WT | L309P | R339Q | R339W | R377H | G420D | ||
| Core CTCF site | (ZF2) | (ZF3) | (ZF3) | (ZF4/5) | (ZF6) | ||
| ZF2 | 6 | 16 | 9 | 13 | 11 | 15 | |
| ZF3 | A/GC/GT/C | 9 | 17 | 8 | 4 | 13 | 15 |
| ZF4 | GGC | 15 | 15 | 15 | 18 | 14 | 19 |
| ZF5 | GG/AN | 24 | 22 | 27 | 21 | 21 | 17 |
| ZF6 | NNA | 17 | 20 | 12 | 13 | 14 | 10 |
| ZF7 | NCA | 14 | 11 | 17 | 13 | 16 | 17 |
| Total | 85 | 101 | 88 | 82 | 89 | 93 | |
| Net change | + 16 | + 3 | − 3 | + 4 | + 8 | ||
| Mutation Energy (ΔΔ | 12.05 | 6.87 | 5.00 | 5.64 | 1.91 | ||
| Effect | Destabilising | Destabilising | Destabilising | Destabilising | Destabilising | ||
Using molecular dynamics simulations of WT and mutant CTCF, the number of bonds formed between CTCF ZF residues and the CTCF binding site was predicted for each ZF. The following bond categories were collated: hydrogen bond; electrostatic and hydrophobic. Using molecular docking, the effect of mutation on the CTCF ZF structure folding energy was measured by the change in minimum free energy (ΔΔG) for WT and mutant ZF structures in the DNA-bound state. ΔΔG values: > 0.5 kcal/mol are destabilising; − 0.5 to 0.5 kcal/mol are neutral; < − 0.5 kcal/mol are stabilising. The nucleotide triplet that each ZF binds in the 15 bp core CTCF site used for this model (5′-NCANNAGG/AGGCA/GC/GT/C-3′) [45] is shown. The ZF containing each respective mutation is shown in brackets
Fig. 6Impact of mutation on CTCF ZF domain conformational stability revealed by MD simulations. A, B Root-mean-square deviation (RMSD) measurements were calculated from the position differences of backbone atoms in the native (WT) and various mutant conformations. A Trajectories of all 5 mutants and WT CTCF are displayed over a 10 ns time span, measured at 2 ps intervals. B Violin plots of all RMSD measurements (0.000–10.000 ns, 5001 in total). In each plot, the solid black line indicates median and dashed coloured lines indicate quartiles. C Root-mean-square fluctuation (RMSF) measurements were obtained for all residues (n = 173) at each time point for the WT and mutant structures. In each plot, the mean ± SD is shown. In B, C the Wilcoxon matched-pairs signed rank test was applied to all paired measurements (ns = not significant; *p < 0.05; **p < 0.01; ****p < 0.0001). D The net change in distance (in Å) of the centre of mass (centroid) of the associated ZF domain from DNA compared to WT CTCF