| Literature DB >> 32060375 |
Artem Bonchuk1,2, Sofia Kamalyan2,3, Sofia Mariasina4,5, Konstantin Boyko6, Vladimir Popov6,7, Oksana Maksimenko8, Pavel Georgiev9.
Abstract
<span class="Gene">CTCF is the main architectural protein found in most of the examined bilaterian organisms. The cluster of the C2H2 zinc-finger domains involved in recognition of long DNA-binding motif is only part of the protein that is evolutionarily conserved, while the N-terminal domain (NTD) has different sequences. Here, we performed biophysical characterization of <span class="Gene">CTCF NTDs from various species representing all major phylogenetic clades of higher metazoans. With the exception of Drosophilides, the N-terminal domains of CTCFs show an unstructured organization and absence of folded regions in vitro. In contrast, NTDs of Drosophila melanogaster and virilis CTCFs contain unstructured folded regions that form tetramers and dimers correspondingly in vitro. Unexpectedly, most NTDs are able to self-associate in the yeast two-hybrid and co-immunoprecipitation assays. These results suggest that NTDs of CTCFs might contribute to the organization of CTCF-mediated long-distance interactions and chromosomal architecture.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32060375 PMCID: PMC7021899 DOI: 10.1038/s41598-020-59459-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Biochemical and biophysical studies of dCTCF N-terminal domain. (a) Superdex S200 size-exclusion chromatography of Drosophila CTCF NTDs (without Thioredoxin). SEC measures Stokes radius (Rs) of particles in solution, which depends on the size of particles and their shape. Estimated Rs are calculated for globular and completely unfolded particles, which appear much heavier than could be expected for globular molecules with the same molecular weight[66]. Calculated Stokes radii (Rs, Å) for Drosophila CTCF NTDs are shown in brackets (globular monomer/globular dimer/globular tetramer/unfolded monomer). SEC showed that both NTDs have larger Rs than calculated for monomeric and even dimeric globular protein of that molecular weight. Elution volumes of proteins with known Rs are shown under histogram. (b) Cross-linking of Thioredoxin-tagged Drosophila CTCF NTDs using increasing concentrations of glutaraldehyde (GA). (c) The far-UV circular-dichroism spectra of D. melanogaster NTD at 20 °C reveals important characteristics of its possible secondary structure. The shape of obtained spectra shows the random coil conformation of D. melanogaster NTD. (d) Ab initio bead model of D. melanogaster CTCF N-terminal domain (1–163) (calculated from data obtained at 7.5 mg/ml) and D. virilis CTCF N-terminal domain (1–144) (calculated from data obtained at 5.0 mg/ml) obtained by DAMMIN shape reconstruction program based on SAXS data. SAXS provides precise information about the size of macromolecules in solution that is almost independent of their shape. Dmax—maximum dimension of the particles. (e) Kratky plot (I*s2 vs s) of SAXS data derived for Drosophila CTCF NTDs to assess the folding state of protein molecules[75]. Logarithmic curve suggests unfolded protein, whereas the bell-shaped curve indicates the presence of globular structure. Both Drosophila NTDs demonstrate such bell-shaped appearance (more obvious in case of dmCTCF-NTD), which strongly suggests that these proteins are folded at least partially.
Molecular weight of protein particles calculated from SAXS data using extrapolated I0 scattering intensity and protein standards of known Mw as described[71].
| CTCF-NTD | MW of the monomer, kDa | Estimated MW in solution, kDa |
|---|---|---|
| 18.7 | 71.0–83.0 | |
| 16.3 | 29.0–35.0 | |
| 25.1 | 33.0–41.0 | |
| 26.1 | 25.0–29.0 | |
| 19.9 | 27.5–33.5 | |
| 35.5 | 49.0–54.0 | |
| 44.2 | 52.0–62.0 | |
| 31.6 | 37.0–47.0 | |
| 31.4 | 45.0–51.0 |
Scattering parameters for the N-terminal domains of CTCF from various species are shown in Table S1.
Figure 2(a) Summary of multiple sequence alignment of CTCF proteins used in this study. The similarity is a score of how similar each amino acid or groups of amino acids are across the whole alignment. The absolute complexity is the average of the pairwise alignment scores using the substitution matrix chosen in the alignment setup. Overall domain structure of CTCF proteins is shown below. (b) Positions of species selected for this study on the phylogenetic tree of metazoans (adapted from[38]). (c) Superdex S200 size-exclusion chromatography of CTCF NTDs (without Thioredoxin). Elution volumes of proteins with known Rs values (Å) are shown. Calculated Rs values for NTDs are shown in brackets (globular/unfolded monomer). (d) Cross-linking of Thioredoxin-tagged NTDs using increasing concentrations of glutaraldehyde (GA). (e) Kratky plot (I*s2 vs s) of SAXS data derived for CTCF NTDs. The bell-shaped curve suggests that polypeptide is folded, whereas the logarithmic shape is a sign of random coil conformation.
Figure 3Analysis of CTCF proteins N-terminal dimerization in vivo. Summarized results for NTDs CTCF interactions in the yeast two-hybrid assay are presented at the left. NTD CTCFs were fused to the N- and C-termini of the GAL4 DNA-binding and activating domains. The resulted plasmids were tested for interaction. The results of Y2H analysis are shown in columns. The “+” and “−” signs indicate an interaction or the absence of interaction, respectively. The GAL4 activating and GAL4 DNA-binding domains alone were used as negative controls. SA – self-activation variant of NTD CTCF. Co-immunoprecipitations of NTDs CTCF fused with 3 × FLAG or 3 × HA were performed in S2 cells. Drosophila S2 cells were cotransfected with 3 × FLAG- and 3 × HA-fused plasmids. Antibodies against 3 × FLAG or 3 × HA were used for immunoprecipitation of protein extracts (IP). Nonspecific IgG antibodies were used as a negative control. The presence of HA- or FLAG-tagged proteins were studied by Western blotting. ‘Input’ refers to samples of the initial protein extract; ‘output’ refers to the supernatant after the removal of immunoprecipitate (IP). Specific IP signal with partner protein is indicated with an asterisk. Detailed results are presented in Table S2.