One of the major functions of DNA methylation is the repression of transposable elements, such as the long-interspersed nuclear element 1 (L1). The underlying mechanism(s), however, are unclear. Here, we addressed how retrotransposon activation and mobilization are regulated by methyl-cytosine modifying ten-eleven-translocation (Tet) proteins and how this is modulated by methyl-CpG binding domain (MBD) proteins. We show that Tet1 activates both, endogenous and engineered L1 retrotransposons. Furthermore, we found that Mecp2 and Mbd2 repress Tet1-mediated activation of L1 by preventing 5hmC formation at the L1 promoter. Finally, we demonstrate that the methyl-CpG binding domain, as well as the adjacent non-sequence specific DNA binding domain of Mecp2 are each sufficient to mediate repression of Tet1-induced L1 mobilization. Our study reveals a mechanism how L1 elements get activated in the absence of Mecp2 and suggests that Tet1 may contribute to Mecp2/Mbd2-deficiency phenotypes, such as the Rett syndrome. We propose that the balance between methylation "reader" and "eraser/writer" controls L1 retrotransposition.
One of the major functions of DNA methylation is the repression of transposable elements, such as the long-interspersed nuclear element 1 (L1). The underlying mechanism(s), however, are unclear. Here, we addressed how retrotransposon activation and mobilization are regulated by methyl-cytosine modifying ten-eleven-translocation (Tet) proteins and how this is modulated by methyl-CpG binding domain (MBD) proteins. We show that Tet1 activates both, endogenous and engineered L1 retrotransposons. Furthermore, we found that Mecp2 and Mbd2 repress Tet1-mediated activation of L1 by preventing 5hmC formation at the L1 promoter. Finally, we demonstrate that the methyl-CpG binding domain, as well as the adjacent non-sequence specific DNA binding domain of Mecp2 are each sufficient to mediate repression of Tet1-induced L1 mobilization. Our study reveals a mechanism how L1 elements get activated in the absence of Mecp2 and suggests that Tet1 may contribute to Mecp2/Mbd2-deficiency phenotypes, such as the Rett syndrome. We propose that the balance between methylation "reader" and "eraser/writer" controls L1 retrotransposition.
Entities:
Keywords:
5-hydroxymethylcytosine; DNA methylation; Rett syndrome; genome stability; repetitive elements
In humans, 17% of nuclear DNA consists of long interspersed nuclear element 1 (LINE1 or L1). The majority of L1s, however, are retrotransposition defective (RD-L1) due to 5′ truncations, internal rearrangements or mutations. Only 80–100 copies of the half a million human L1s are retrotransposition competent (RC-L1). Full length L1 has a total length of about 6 kilobase pairs and contains a 5′ untranslated region (5′UTR) with promoter activity in both, sense and antisense directions, 3 open reading frames (ORFs) and a 3′UTR that ends in an AATAAA polyadenylation signal. ORF1 encodes for a p40 protein with RNA-binding and chaperone activities, whereas ORF2 encodes for a protein of 150 kDa in size with endonuclease and reverse transcriptase activities. In contrast to ORF1 and ORF2, ORF0 is primate-specific and lies downstream the 5′UTR in antisense direction. It has 2 splice donor sites that can react with splice acceptors of downstream genomic sequences to generate fusion proteins. During the L1 retrotransposition procedure, ORF1 and ORF2 proteins bind to their own RNA in the cytosol to form a ribonucleoprotein particle (RNP), which facilitates the re-import of L1 RNA to the nucleus. The majority of genomic L1 integrations follow a mechanism termed target primed reverse transcription (TPRT), which involves endonuclease and reverse transcriptase activity of ORF2p. However, endonuclease independent L1 integration is also observed in non-homologous end joining (NHEJ) and p53 double deficient cells. Previous studies using engineered L1 elements showed that retrotransposition occurs not only in brain cells like neural progenitor cells of rat hippocampus and human fetal brain, but also in non-brain cells such as embryonic stem cells (ESC). Moreover, transgenic mice, harboring a fluorescently tagged humanL1 under the control of its endogenous promoter showed only detectable L1 retrotransposition activity in mouse germ cells and brain. Furthermore, methylation level of L1 elements differed in brain and skin, indicating that L1s are differently regulated in tissues and cell types. Altogether, these studies demonstrate that retrotransposition of L1 elements can occur in embryonic and importantly, also in somatic cells and correlates with the L1 promoter methylation status.Although host cells have multiple mechanisms to restrict L1 retrotransposition, sporadic insertions of a small number of RC-L1s accompanied by large chromosomal rearrangements can occur that lead to genomic instability. Insertions of L1 sequences into protein coding regions of the genome can decrease RNA levels by inhibiting transcriptional elongation. In addition to its ability to propagate itself, 3′ transductions of L1 occur in germ and cancer cells, whereby unique sequences downstream of L1 elements can also be retrotransposed, if transcription continues beyond the L1 sequence. Moreover, other studies showed that L1 expression leads to high levels of double strand breaks, as evidenced by the formation of γH2AX foci and the recruitment of repair proteins involved in the L1 retrotransposition process.One of the most important mechanisms repressing L1 retrotransposition depends on DNA methylation. Methylation of cytosines has been shown to recruit 5-methylcytosine (5mC) binding domain (MBD) proteins. Mecp2, the founding member of the MBD protein family, was subsequently shown to modulate L1 retrotransposition in a DNA methylation dependent manner. While the transcriptional repression domain (TRD) of Mecp2 was shown to be sufficient for repressing L1 retrotransposition in a reporter assay system, the mechanism(s) of L1 repression mediated by Mecp2 remain unknown. Mecp2 and Mbd2, another family member of the MBD protein family, specifically bind to methylated CpG dinucleotides in DNA. Hypomethylation of the L1 promoter is associated with overexpression of L1 transcripts, suggesting that a decrease in DNA methylation might play a role in L1 retrotransposition. Ten-eleven-translocation (Tet) proteins convert 5-methylcytosine to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) in an iterative iron- and oxoglutarate dependent oxidation reaction. These further modifications of methylated cytosines are proposed to be one of the long sought mechanisms leading to loss of DNA methylation.
Tet1 and Tet2 depletion in ESCs has been shown to cause loss of 5hmC in the 5′ region of L1, but a connection to L1 regulation is lacking.In this study, we investigate whether and how Mbd and Tet proteins affect L1 expression and mobilization in human cells. We detected increased transcription and transposition of human endogenous L1 in the presence of Tet1 proteins and showed that activation of L1 transposition depends on the catalytic activity of Tet1. By the use of an L1 retrotransposition reporter assay, we additionally showed that Tet1 proteins activate retrotransposition of engineered L1. Moreover, we found that Mbd2, Mecp2, as well as its methylcytosine binding and transcriptional repression subdomains counteract Tet1-mediated reactivation of L1 retrotransposons.
Results and discussion
Tet1 activates retrotransposition of endogenous L1
Three steps are involved in L1 retrotransposition, comprising loss of DNA methylation in the L1 5′UTR, L1 transcription and L1 transposition (Fig 1A). 5hmC, which is produced by Tet proteins, is thought to be an intermediate modification during loss of DNA methylation, as well as a stable epigenetic mark, which tunes a large number of CpG dinucleotides located at poised enhancers and actively transcribed regions. To test whether L1 retrotransposition is re-activated by Tet-mediated 5mC to 5hmC conversion, we transfected HEK-EBNA cells with the catalytically active domain of Tet1 fused to mcherry (mcherry-Tet1CD), which was previously shown to be sufficient to induce genome-wide hydroxymethylation in vivo. As control, mcherry-Tet1CDmut, which lacked catalytic activity due to 2 mutations in the Fe(II) binding sites (H1652Y, D1654A) was used. Since loss of DNA methylation in the L1 5′UTR is the first step that makes L1 retrotransposition possible, we performed GluMs-qPCR (DNA glucosylation, MspI digestion and quantitative PCR based 5hmC detection, described in the methods) to determine the methylation state of the L1 promoter. L1 has 20 mapped CpG sites within its 5′UTR. Methylation of only a subset of CpG dinucleotides, such as nucleotide position 482, were shown to correlate inversely with retrotransposition activity of L1 elements. Therefore, we chose CpG dinucleotide at position 482 to quantify the methylation status of the L1 promoter (Fig 1B) before and after transfection. In untransfected cells, we observed using GluMs-qPCR that around 60% of all CpG dinucleotides at position 482 were methylated upon Tet1CD expression, however, 5hmC levels at position 482 were increased compared with Tet1CDmut and mock transfected cells (Fig 1C), indicating that Tet1 oxidizes 5mC in the L1 promoter.
Figure 1.
Tet1 reactivates retrotransposition of endogenous L1. (A) Schematic overview of L1 retrotransposition. (B) Experimental rationale of human L1 retrotransposition detection. (C) Relative 5hmC levels in L1 5′UTR (n = 3, *p < 0.05, independent 2-sample student's t-test), (D) relative L1 transcription levels (n ≥ 4; ns = non significant; ***p < 0.001, independent 2-sample student's t-test) and (E) relative L1 ORF2 DNA content was checked 48 hours after Tet1CD, Tet1CDmut and mock transfection (n = 4, *p < 0.05, independent 2-sample student's t-test). (F) Scheme summarizing the effect of Tet1 on the 3 steps of L1 retrotransposition. Black and red circles indicate 5mC and 5hmC nucleotides, respectively. Bars represent the mean + standard deviation (SD). (G) Experimental rationale of mouse L1 retrotransposition detection. (H) Boxplot of the log2-fold changes of the triple Tet-knockout mouse embryonic stem cells relative to wild-type (V6.5) for all genes and all L1 elements. Negative values indicate a downregulation in the knockout relative to the wild-type, positive values an upregulation. Significant elements are marked in color. The red line is at zero, i.e., the expected value if expression were identical in the wild type and mutant. (I) Relative mouse L1 ORF2 content. Bar represent mean+ SD. (n = 3, *p < 0.05, independent 2-sample student's t-test).
Tet1 reactivates retrotransposition of endogenous L1. (A) Schematic overview of L1 retrotransposition. (B) Experimental rationale of humanL1 retrotransposition detection. (C) Relative 5hmC levels in L1 5′UTR (n = 3, *p < 0.05, independent 2-sample student's t-test), (D) relative L1 transcription levels (n ≥ 4; ns = non significant; ***p < 0.001, independent 2-sample student's t-test) and (E) relative L1ORF2 DNA content was checked 48 hours after Tet1CD, Tet1CDmut and mock transfection (n = 4, *p < 0.05, independent 2-sample student's t-test). (F) Scheme summarizing the effect of Tet1 on the 3 steps of L1 retrotransposition. Black and red circles indicate 5mC and 5hmC nucleotides, respectively. Bars represent the mean + standard deviation (SD). (G) Experimental rationale of mouseL1 retrotransposition detection. (H) Boxplot of the log2-fold changes of the triple Tet-knockout mouse embryonic stem cells relative to wild-type (V6.5) for all genes and all L1 elements. Negative values indicate a downregulation in the knockout relative to the wild-type, positive values an upregulation. Significant elements are marked in color. The red line is at zero, i.e., the expected value if expression were identical in the wild type and mutant. (I) Relative mouseL1ORF2 content. Bar represent mean+ SD. (n = 3, *p < 0.05, independent 2-sample student's t-test).Consequently, we aimed to determine, whether Tet1-mediated 5hmC formation of the L1 promoter leads to reactivation of L1 transcription. Since RNA transcript levels might be affected by several exogenous factors during cell sorting, cells cotransfected with mcherry-Tet1CD and EGFP, as well as mcherry-Tet1CDmut and EGFP were analyzed by flow cytometry and only cell populations with comparable expression levels between samples were chosen for RNA preparation (Fig S1). Reverse transcription quantitative real-time PCR (RT qPCR) showed that besides Tet1CD, ectopic expression of Tet1CDmut leads to increased L1 transcription when compared with mock transfected cells (Fig 1D), indicating that L1 transcription does not depend on the catalytic activity of Tet1 proteins. Previous studies showed that besides DNA methylation, histone acetylation and chromatin structure affect L1 retrotransposition. In that regard, we found chromatin decondensation caused by Tet proteins in an oxygenase-independent manner (Fig S2), which might contribute to the reactivation of L1 transcription.Using high content imaging, we next tested, whether the observed increase in L1 transcription results in elevated L1ORF1p levels. Therefore, HEK cells ectopically expressing mcherry-Tet1CD, mcherry-Tet1CDmut or mcherry were fixed and immunostained for L1ORF1p. While the catalytically active Tet1 variant increased mean protein levels of L1ORF1 by ∼60%, ORF1p levels were elevated by ∼20% in Tet1CDmut expressing cells (Fig S3). Accordingly, L1ORF1p formation requires, at least in part, the catalytic activity of Tet proteins.Based on the knowledge that L1 elements mobilize via an RNA intermediate using a “copy and paste” mechanism and our observation that Tet1 proteins increase L1 transcription, we next analyzed L1 copy numbers in genomic DNA (gDNA) of Tet1CD and Tet1CDmut overexpressing cells, respectively. Since most of the newly inserted L1 elements are 5′ truncated, we used primers specific for the 5′UTR as a constant normalization control and ORF2 primers as an indication of newly inserted L1 elements to detect de novo transposition events. Compared to Tet1CDmut and mock transfected cells, genomic L1ORF2 content was increased upon Tet1CD overexpression (Fig 1E), indicating that L1 transposition depends on the oxygenase activity of Tet1. As we observe that catalytically active and inactive Tet1 proteins generate similar L1 mRNA levels, but have different L1 transposition efficiencies, we suggest that the Tet 1 catalytic activity may enhance L1 transposition through a yet unknown mechanism. We found that the global Tet1-mediated 5hmC increase leads to accumulation of γH2AX (Fig S4). As L1 retrotransposition has been shown to be proportional to the number of γH2AX foci, we suggest that Tet1 induced formation of γH2AX might enhance L1 integration. Since, on the other hand, the catalytically inactive Tet1 does not produce 5hmC and, accordingly, γH2AX was not increased (Fig S4), L1 transposition was thus not elevated. In summary, our data demonstrate that Tet1 proteins induce loss of L1 promoter methylation and further activate L1 transcription and transposition in humanHEK cells (Fig 1F).As ESCs depleted of Tet1 and Tet2 showed loss of 5hmC in the 5′ region of L1, we further tested, whether loss of 5hmC affects L1 transcription and transposition. To this end, we performed RNA-seq analysis and determined L1 copy numbers using RNA and DNA from Tet1/Tet2/Tet3 triple knockout ESCs (Tet-TKO), as well as the corresponding wild type cells (v6.5 wt) (Fig 1G). Although global gene expression did not change, L1 expression was significantly reduced in Tet-TKO cells (Fig 1H), indicating that Tet proteins regulate L1 transcription. In agreement with this, L1 copy numbers were decreased in Tet-TKO cells, indicating that Tet proteins modulate L1 transposition.The activity of L1 is tissue and cell type dependent. Low and high activity of L1 retrotransposition was observed in fibroblasts and neural stem cells, respectively, indicating that host cells adopt a protection mechanism to prevent L1 retrotransposition. To test whether Tet proteins could activate L1 in both cells types, we analyzed L1 copy number in cells expressing either mcherry-Tet1CD or mcherry. As shown in Figure S5, genomic L1ORF2 content was increased in mouse neural stem cells, however, it was not changed in both mouse and human fibroblasts upon Tet1 expression. These results indicate that although Tet proteins can active L1 retrotransposition, cell type dependent protection mechanisms also play a role in preventing L1 activity.
Tet1 activates retrotransposition of engineered L1
To further validate Tet1 mediated L1 activation, we made use of a cell culture based retrotransposition assay, where the plasmid pLRE3-EGFP was episomally present in HEK-EBNA cells. The L1 cassette of the pLRE3-EGFP construct contains a full-length humanL1 element, with a sense oriented γ-globin intron, which interrupts the antisense EGFP cassette in its 3′UTR region. Therefore, EGFP positive cells arise only when the EGFP is transcribed from the L1 promoter, spliced, reverse transcribed and integrated into the genome (Fig S6A). Self-replicating, viral-based vectors are capable of long-term episomal persistence in mammalian cells, in particular in HEK-EBNA cells. Two components are needed for episomal maintenance, the latent origin of replication (oriP) present in the vector and the transactivator protein EBNA-1 stably expressed in HEK-EBNAtransgenic cells. The episomal plasmid contains a puromycin resistance gene to facilitate selection of cells containing the episome. HEK-EBNA cells were transfected with the pLRE3-EGFP construct and treated with 2 µg/mL puromycin 2 d later. From 3 d after transfection, EGFP expression was observed in a small number of cells, indicating that HEK-EBNA cells are suitable to detect engineered L1 retrotransposition from the episome into the genome. Since we observed only a small number of EGFP expressing cells, i.e., where transposition took place, we wanted to test whether the EGFP negative cells still contained the episomal plasmid pLRE3-EGFP. To this end, PCR was performed on whole cell lysates of sorted, EGFP negative cells using primers for EGFP, which amplify both, spliced (genomically integrated copy) and unspliced (episomal copy) EGFP versions. As shown in Figure S7, only unspliced EGFP was amplified from EGFP negative cells, indicating that pLRE3-EGFP is present in these cells. A spliced, genomically integrated EGFP cassette (CMV promoter+EGFP), however, could not be detected indicating that no retrotransposition took place in these cells. After 15 d of antibiotic selection, puromycin-resistant colonies containing few EGFP positive cells formed (Fig S6B). Antibiotic resistance of a large number of EGFP negative cells, however, indicated that the pLRE3-EGFP was episomally present, but its ability to retrotranspose was most likely silenced by DNA methylation (Fig S6B).To test whether Tet1 can activate L1 transposition from the silenced episome in these cells, we transfected the reporter cell line with mcherry-Tet1CD, mcherry-Tet1CDmut or mcherry, fixed them 24 and 48 hours post transfection and further quantified the EGFP positive cells by high content screening microscopy. The ratio of EGFP-positive cells in Tet1/mcherry-positive and -negative cells was used to show L1 retrotransposition events induced by Tet activity (Fig 2A). We observed an increase in the number of EGFP-positive cells upon overexpression of Tet1CD, compared with the overexpression of Tet1CDmut and mcherry alone at 48 hours (Fig 2C). 24 hours after transfection no increase of EGFP positive cells was detected in the presence of Tet1CD (Fig 2B). Consistent with the reporter assay using fixed cells, live-cell time lapse imaging analysis showed a large increase in the amount of EGFP positive cells after 40 hours with Tet1CD transfection, but not with Tet1CDmut and mcherry (Fig S8A, videos 1–3). This fits well the previously reported L1 de novo retrotransposition kinetics. To further validate the increase of L1 retrotransposition induced by Tet1CD, we used flow cytometry analysis. The number of mcherry positive (+) and negative (−), as well as EGFP positive (+) cells was counted by flow cytometry and the ratio between EGFP+/mcherry+ and EGFP+/mcherry- cells was calculated to show recent L1 integrations. The results showed an increase of EGFP positive cells in the presence of Tet1CD 48 hours after transfection (Fig S8B). These data indicate that the L1 promoter of the episomal plasmid was activated by Tet1CD ultimately leading to transposition into the host cell's genome and expression of the EGFP gene.
Figure 2.
Tet1 reactivates retrotransposition of engineered L1. (A) Experimental rationale. The ratio of EGFP-positive cells in Tet1/mcherry-positive and negative cells, was quantified to detect recent L1 retrotransposition events. (B-C) Relative increases of retrotransposition events (B) 24 hours and (C) 48 hours after Tet1CD, Tet1CDmut and mcherry transfection. At least 3 independent experiments were performed and more than 250,000 cells for each group were analyzed and the bar represents the mean + SD. Independent 2-sample student's t-test was performed between Tet1CD and Tet1CDmut or Tet1CD and mcherry transfected cells. Only significant differences were indicated on the plots as 2 asterisks (p < 0.005). (D) Representative images of the pLRE3-EGFP reporter cell line 48 hours after mcherry-Tet1CD/CDmut and mcherry transfection, respectively. Scale bar: 100 µm.
Tet1 reactivates retrotransposition of engineered L1. (A) Experimental rationale. The ratio of EGFP-positive cells in Tet1/mcherry-positive and negative cells, was quantified to detect recent L1 retrotransposition events. (B-C) Relative increases of retrotransposition events (B) 24 hours and (C) 48 hours after Tet1CD, Tet1CDmut and mcherry transfection. At least 3 independent experiments were performed and more than 250,000 cells for each group were analyzed and the bar represents the mean + SD. Independent 2-sample student's t-test was performed between Tet1CD and Tet1CDmut or Tet1CD and mcherry transfected cells. Only significant differences were indicated on the plots as 2 asterisks (p < 0.005). (D) Representative images of the pLRE3-EGFP reporter cell line 48 hours after mcherry-Tet1CD/CDmut and mcherry transfection, respectively. Scale bar: 100 µm.To further verify that the activation of EGFP resulted from L1 retrotransposition and not from silenced genomically integrated copies, we cultured the reporter cell line in the absence of puromycin. Four and 11 d later, the cells were transfected with mcherry-Tet1CD plasmids and 2 d after transfection the EGFP positive cell numbers were counted using flow cytometry. As shown in Figure S9, EGFP positive cell numbers were decreased in the absence of puromycin. Previous studies showed that the episomal plasmids are easily lost during cell generations. In the absence of puromycin, the cells without pLRE3-EGFP plasmid are able to survive, but due to the lack of the episome no transposition can take place and, hence, those cells do not express EGFP. These results indicate that the activation of EGFP expression upon Tet expression results from L1 retrotransposition from the episome into the genome rather than from genomically integrated EGFP cassette, since the latter would be activated by Tet1CD in the absence or presence of puromycin.Increase of L1 retrotransposition and the conversion of 5mC to 5hmC induced by Tet1 prompted us to test, whether the observed L1 reactivation arose from DNA methylation changes, so we made use of the cytidine analog 5-azacytidine (5-aza-C) to induce DNA hypomethylation. The stable cell line was treated with either 5 µM or 50 µM of 5-aza-C and 48 hours later the number of EGFP positive cells was quantified by flow cytometry analysis. Compared to untreated cells, the 5-aza-C treatment increased the number of EGFP positive cells in a dose dependent manner (Fig S8C), indicating that decreased DNA methylation is involved in the activation of L1. This effect is in line with results from non-transformed cells that were treated with 5-aza-C. Since 5mC oxidation products such as 5hmC are considered to be intermediates of DNA demethylation, we suggest that the activation of L1 by Tet1 is mainly dependent on its DNA demethylation activity potentially involving subsequent repair processes.
Mecp2 and Mbd2 repress endogenous human L1 retrotransposition
As Mecp2 has been shown to regulate L1 retrotransposition in a methylation dependent manner, we wanted to test whether Tet1 mediated activation of L1 can be counteracted by MBD proteins. To this end, we analyzed the effects of co-expressed Tet1CD, Tet1CDmut, Mecp2 and Tet1CD+Mecp2 on transcription of a luciferase reporter plasmid driven by the internal L1.3 promoter (nucleotides 1–909) in response to its methylation by HpaII methyltransferase (Fig S10A and S10B). While in the absence of any effector protein (Tet1CD/Tet1CDmut), methylation of the L1.3 promoter led to a weak transcriptional decrease, overexpression of Tet1CD/Tet1CDmut resulted in similar transcription rates for the methylated and unmethylated promoter (Fig S10C). Ectopic expression of Mecp2, in contrast, reduced transcription from the methylated promoter in the absence and presence of Tet1CD (Fig S10C), indicating that Mecp2 represses Tet1 mediated activation of L1 transcription.Next, we co-expressed Tet1 with Mbd2, Mecp2, or its subdomains MBD and IDTRD, respectively in HEK-EBNA cells and analyzed 5hmC levels at the endogenous L1 promoter by GluMS-qPCR. The results showed decreased 5hmC levels in the 5′UTR of L1 upon co-overexpression of MBDs together with Tet1CD (Fig 3A), indicating that MBDs blockTet mediated 5mC to 5hmC conversion in the 5′UTR of L1. The failure of Tet binding to DNA in the presence of Mbd2, Mecp2, as well as its subdomains MBD and IDTRD might thus be causative for the observed decrease of 5hmC at the L1 5′UTR. Previous studies showed activation of L1 retrotransposition in the absence of Mecp2, and our data indicate that this activation might be due to Tet induced 5hmC formation. In accordance with the decreased 5hmC levels, L1 mRNA levels and copy numbers were lower in cells co-expressing Tet1CD and MBDs (Fig 3B and Fig 3C), indicating that MBDs can prevent L1 transcription and transposition. In summary, the activation of L1 by Tet1 can be prevented by the action of MBD proteins.
Figure 3.
MBD proteins prevent Tet-mediated reactivation of endogenous L1. (A) Relative 5hmC in L1 5′UTR (n = 3, *p < 0.05, independent 2-sample student's t-test), (B) relative L1 transcription levels (n = 4; **p < 0.01, independent 2-sample student's t-test) and (C) relative L1 ORF2 DNA content were checked 48 hours after cotransfection with plasmids coding for Tet1CD- and MBD proteins (n = 3, *p < 0.05, independent 2-sample student's t-test). All of the 3 independent experiments for IDTRD and Mbd2 showed decreased, but variable L1 copy number when compared with Tet1CD, giving rise to an apparent non-significant difference. (D) Scheme illustrating the effect of Tet1- and MBD proteins on L1 retrotransposition. Black and red circles indicate 5mC and 5hmC nucleotides, respectively. Bars represent the mean + SD.
MBD proteins prevent Tet-mediated reactivation of endogenous L1. (A) Relative 5hmC in L1 5′UTR (n = 3, *p < 0.05, independent 2-sample student's t-test), (B) relative L1 transcription levels (n = 4; **p < 0.01, independent 2-sample student's t-test) and (C) relative L1ORF2 DNA content were checked 48 hours after cotransfection with plasmids coding for Tet1CD- and MBD proteins (n = 3, *p < 0.05, independent 2-sample student's t-test). All of the 3 independent experiments for IDTRD and Mbd2 showed decreased, but variable L1 copy number when compared with Tet1CD, giving rise to an apparent non-significant difference. (D) Scheme illustrating the effect of Tet1- and MBD proteins on L1 retrotransposition. Black and red circles indicate 5mC and 5hmC nucleotides, respectively. Bars represent the mean + SD.Here, we show that Tet1 mediated formation of 5hmC activates the L1 retrotransposition. Previous studies have shown that the activation of endogenous L1 transcription does not depend on Tet3 in mouse zygotes, but the potential effect of Tet3 on L1 transposition is still unknown. To further check the effect of Tets on L1 transposition, we took advantage of Tet1, Tet2 and Tet3 triple knockout ESCs, which do not express any Tets but their corresponding wild type cells express high levels of Tets, especially Tet1 and Tet2. We showed that the transcription and transposition of mouseL1 is significantly decreased in Tet1, Tet2 and Tet3 triple knockout mouse ESCs as compared with the corresponding wild type cells, indicating that Tets are involved in L1 retrotransposition activation. Since 5hmC is not only involved In loss of DNA methylation, but also a stable epigenetic mark in mouse embryonic stem cells, we propose that both, loss of DNA methylation and the 5hmC mark itself are involved in L1 retrotransposition activation.Rett syndrome (RTT), a postnatal occurring neurologic disorder with an incidence of ∼1 in 10,000 female births, is mostly caused by mutations in the MBD and IDTRD of Mecp2. The increased L1 retrotransposition events in RTT patients suggest that L1 activity is facilitated upon loss of Mecp2 function in human cells, but the mechanism is still unknown. Here, we show that L1 retrotransposition is activated by Tet1 and repressed by MBD proteins. In the absence of Mecp2, Tet proteins oxidize methylated DNA, which is usually bound by Mecp2, leading to L1 element activation. Although previous studies showed that Mecp2 binds to 5hmC enriched within active genes, we found, on the other hand, that the MBD of Mecp2 does not bind to 5hmC. Therefore, our data are not consistent with sustained inhibition of L1 activation through binding of Mecp2 to 5hmC. However, Mecp2 binds with high affinity to DNA and this could explain the results of Mellen et al.Previous studies showed that transgenic mice harboring either DNA binding-incompetent MBD or TRD of Mecp2 is sufficient to cause RTT. Here, we show that MBD as well as the IDTRD of Mecp2 are also each sufficient to mediate repression of L1 mobilization. In addition to Mecp2, we show that Mbd2 can also repress humanL1 retrotransposition, indicating that the repression of L1 is not Mecp2 specific. Besides 5mC specific binding domains, such as the MBD of Mecp2, the non sequence specific DNA binding domain IDTRD of Mecp2 can also repress the L1 retrotransposition. When compared with in vitro methylated L1 5′UTR using HpaII methyltransferase, human endogenous L1 5′UTR might provide more binding sites to Mbd2. Thus, 4 methylated CpG sites seem to be not enough for the repression of L1 retrotransposition by Mbd2, but it is enough for Mecp2 because of its sequence unspecific strong DNA binding ability.In summary, we show that humanL1 can be re-activated by Tet1 proteins and this might lead to decreased genome stability and activation of proto-oncogenes in cancer. Finally, our data indicates that the Tet1 mediated activation of L1 can be repressed by Mecp2 and Mbd2 revealing a role of Mecp2 and Mbd2 as guardians of genome stability by preventing retrotransposition.
Materials and methods
Plasmids, cell culture and transfection
Plasmids coding EGFP tagged MBDs (Mecp2: pc1121, MBD: pc0841, IDTRD: pc1852, Mbd2: pc2399) and mcherry-tagged catalytic active (mcherry-Tet1CD: aa 1367–2007, pc2547) and inactive (mcherry-Tet1CDmut: aa 1367–2007, H1652Y, D1654A, pc2815) domain of mouseTet1 were described in previous publications.The reporter plasmid pGL3-L1.3-Luc (pc3342) coding for a firefly luciferase under the control of the L1.3 promoter was a generous gift of G. Schumann (Paul Ehrlich Institute, Langen, Germany).HEK-EBNA cell line was purchased from Invitrogen (catalog #R620–07). Cells were cultured and transfected as described previously. For genomic DNA (gDNA) extraction, HEK-EBNA cells were transfected with mcherry-Tet1CD/CDmut and EGFP-MBDs and flow cytometry sorted (Biorad S3 sorter, Bio-Rad Laboratories) according to EGFP and mcherry expression levels. 488 and 561 nm excitation lasers and 525 ± 30 and 586 ± 25 nm emission filters are used for EGFP and cherry detection. For RNA preparation, HEK-EBNA cells were transfected with mcherry-Tet1CD/CDmut and EGFP-MBDs encoding plasmids, and the expression of mcherry and EGFP was analyzed by flow cytometry. Cells with similar mcherry-Tet1CD/CDmut expression were used for RNA preparation.C2C12mouse myoblast cell line were cultured using the conditions described previously.C2C12 cells were grown to 70% confluency on glass coverslips and transfected with EGFP-Tet1CD/CDmut and EGFP expression constructs 24 hours post seeding using Lipofectamine (Life Technologies) according to the manufacturer's instructions.Human AG01522D foreskin fibroblasts cells (obtained from Coriell Cell Repository) were cultured in DMEM medium supplemented with 15% FCS. The cells were transfected by electroporation as described previously.Mouse tail fibroblasts (MTF) cells were a gift from A. Bird (Wellcome Trust Center for Cell Biology, Edinburgh, UK) and were cultured using the conditions as described previously.Mouse neural stem cells were a gift from B. Hendrich (Cambridge Stem Cell Institute, Cambridge, UK) and cultured using the conditions as described previously.Human fibroblast cells (BJ-hTERT; ATCC CRL-4001) were cultured in DMEM medium supplemented with 15% FCS. The cells were transfected by electroporation as described previously.V6.5 wt and Tet-TKO mouse embryonic stem cells were a gift from R. Jaenisch (Whitehead Institute for Biomedical Research, Cambridge, USA) and were maintained under serum-free and feeder-free conditions on Geltrex-coated flasks in N2B27 (50% neurobasal medium (Life Technologies) and 50% DMEM/F12 (Life Technologies) containing 2 mM L-glutamine (Life Technologies), 0.1 mM β-mercaptoethanol, N2 supplement (Life Technologies), B27 serum-free supplement (Life Technologies), 100 U/ml Penicillin-Streptomycin, 1000 U/ml LIF and 2i (1 μM PD032591 and 3 μM CHIR99021 (Axon Medchem).
L1 retrotransposition reporter assay
HEK-EBNA cells were transfected with the pLRE3-EGFP plasmid (gift from J. V. Moran, U. Michigan Medical School, USA, pc3341). Two days after transfection, cells were cultured in DMEM medium containing 2 µg/mL puromycin (Invitrogen) for resistance selection. 18 d later, single colonies were picked and passaged in DMEM medium containing 2 µg/mL puromycin. For retrotransposition reporter assays, cells were cultured in DMEM medium without puromycin selection.For fixed-cell analysis, the reporter cell line was grown on glass coverslips and transfected with mcherry-Tet1CD/CDmut and only mcherry using PEI. 24 hours and 48 hours after transfection, cells were fixed with 3.7% formaldehyde (Carl Roth GmbH), DNA was counterstained with DAPI (Invitrogen) and cells were mounted in Vectashield antifade medium (Vector Laboratories, Burlingame, CA, USA). DAPI, EGFP and mcherry were imaged by high content microscopy with 20x long/0.45 NA objective (Operetta), a xenon fiber optic as light source, 360–400, 460–490 and 560–580 nm excitation- and 410–480, 500–550 and 590–640 emission filters, respectively. And the intensities were analyzed using the Harmony software (PerkinElmer, UK).For flow cytometry analysis of the aforementioned reporter cell line, cells either ectopically expressing mcherry-Tet1CD/CDmut or mcherry or treated with 5-azacytidine (5-aza-C, Sigma-Aldrich) were grown in 6-well plates. 48 hours later, cells were harvested and EGFP positive cells were counted by flow cytometry.
RNA-Seq library preparation and data analysis
Total RNA was isolated from mouse embryonic stem cells (V6.5) in biological quadruplicates using the nucleospin triprep kit from Macherey-Nagel. 50 ng RNA was reverse transcribed. cDNA was pre-amplified as described elsewhere. 1 ng of cDNA was used as input for tagmentation by the Nextera XT Sample Preparation Kit (Illumina), where a second amplification round was performed for 12 cycles. For each sample, 2.5 ng of final library was pooled. The library pool was sequenced 1 × 100 bases on a Illumina HiSeq1500. The average sequencing depth was 1.2 million reads per replicate.Sequencing reads were demultiplexed from the Nextera (i5 and i7) indices. Demultiplexed reads were mapped to the Mouse genome build mm10 using STAR version STAR 2.5.1 with the specific settings: –outFilterMultimapNmax 100 –outFilterMismatchNmax 4 –winAnchorMultimapNmax → 100. The junction annotation was taken from ensembl GRCm38.75 and the index was created as recommended using the option –sjdbOverhang → 99. The resulting bam-files were then processed using TEtranscript to obtain read count tables for transcripts and transposons, using the TE annotation as provided by the authors of TEtranscript (http://labshare.cshl.edu/shares/mhammelllab/www-data/TEToolkit/TE_GTF/mm10_rmsk_TE.gtf.gz). Normalization and differential expression analysis was done using DESeq2.
Quantitative PCR (q-PCR)
q-PCR was performed on a StepOnePlus Real-Time PCR System (Applied Biosystems) with Platinum SYBR Green qPCR SuperMix-UDG w/ROX (Invitrogen) according to the manufacturer's instruction. The program used for the amplification of all fragments consisted of 1) inactivation of UDG for 2 min at 50°C, 2) denaturation of DNA for 10 min at 98°C, 3) 40 cycles of PCR (98°C for 15 sec, 60°C for 1 min), followed by 4) dissociation (melting) curve analysis to confirm the specificity of the amplicon.
DNA glucosylation, MspI digestion and quantitative PCR based 5hmC and 5mC detection (GluMs-qPCR)
To detect 5hmC after Tet1 transfection, gDNA was extracted from HEK-EBNA cells as described previously. Concentration and purity of DNA was measured on a TECAN infinite M200 plate reader (Tecan Group Ltd.) by the absorbance at 260 nm and 280 nm. 1 µg of gDNA was treated with or without 0.18 µM of T4 phage β-glucosyltransferase (T4-BGT) in a final volume of 50 µl supplemented with 1x NEB cut smart buffer (NEB) and 1 mM of UDP-Glucose (Sigma-Aldrich) for 18 hours at 37°C. Then 0.5 µg of glucosylated or mock treated DNA was used for digestion with 100 units of MspI (NEB) at 37°C for 18 hours in a final volume of 20 µL, which was followed by treatment with 20 µg of proteinase K (PK, Carl Roth GmbH) for 30 min at 50°C. Following proteolysis, PK enzymatic activity was inactivated for 10 min at 98°C. The MspI-resistant fraction was amplified using qPCR with primers flanking the MspI site (F: 5'- ATCCCACACCTGGCTCAGAGGG -3′ and R: 5′- GTCAGGGGTCAGGGACCCACTT -3′). After qPCR, the relative amounts of 5hmC were analyzed as described previously. To detect 5mC at position 482 in L1 5′UTR before Tet1 transfection, the gDNA was treated with or without T4-BGT as described above. Then, the gDNA was further treated with MspI, HpaII (50 units, NEB) or mock for 18 hours at 37°C. The qPCR and data analysis were performed as above.
cDNA preparation and reverse transcription quantitative real-time PCR (RT qPCR)
Total RNA was isolated using the RNeasy Mini Kit (Qiagen) according to the manufacturer's instruction. To remove traces of genomic DNA, RNA was treated with RNase-free recombinant DNaseI (Macherey Nagel) for 30 min at 37°C and further purified with the Qiagen RNeasy Mini Kit. To assess the concentration and purity of RNA, the ratio of absorbance at 260 nm and 280 nm was measured on a TECAN infinite M200 plate reader. 500 ng of total RNA were used for cDNA synthesis using 200 units M-MuLV reverse transcriptase (NEB), 0.01 OD units random primer from the Prime-It II Random Primer Labeling Kit (Stratagene), 0.5 mM dNTPs (Carl Roth GmbH) and 40 units recombinant ribonuclease inhibitor RNaseOUT (Invitrogen) in a total reaction volume of 20 µL. Cycles were set to 5 min at 25°C, 90 min at 50°C and 15 min at 70°C. For qPCR, 0.5 ng of cDNA were used for each reaction. Primers for quantitative real-time PCR contained the following sequences: Gapdh F: 5′- CAT GAG AAG TAT GAC AAC AGC CT-3′, Gapdh R: 5′-AGT CCT TCC ACG ATA CCA AAG T-3′, hL1 5′ UTR F: 5′-GAA TGA TTT TGA CGA GCT GAG AGA A-3′, hL1 5′ UTR R: 5′-GTC CTC CCG TAG CTC AGA GTA ATT -3′. 5′ UTR expression level was normalized to Gapdh and calculated using the comparative CT method (ΔΔCT method).
L1 copy number analysis
To detect newly integrated L1ORF2 sequences in humanembryonic kidney or human fibroblast cells, qPCR was performed as described above. For each reaction, 80 pg of gDNA were used. Primers contained the following sequences: hL1ORF2 F: 5′- CAAACACCGCATATTCTCACTCA-3′, hL1ORF2 R: 5′-CTTCCTGTGTCCATGTGATCTCA-3′. Computational estimates using the UCSC genome browser in silico PCR function (genome: human, assembly: December 2013 (GRCh38/hg38), target: genome assembly) indicated that at least 2734 endogenous humanL1 elements could be detected using the hL1ORF2 primer set.As control we used primers for hL1 5′ UTR. This sequence is usually truncated during the course of retrotransposition. Accordingly, the number of hL1 5′ UTR sequences remains constant. Primers contained following sequences: hL1 5′ UTR F: 5′-ACAGCTTTGAAGAGAGCAGTGGTT-3′, hL1 5′ UTR R: 5′-AGTCTGCCCGTTCTCAGATCT-3′.To determine the content of L1ORF2 sequences in mouse embryonic stem cells (wild type and Tet1/Tet2/Tet3 TKO) and mouse fibroblast cells ectopically expressing mcherry or mcherry-Tet1CD, gDNA was isolated and qPCR was performed as described above. For each reaction, 80 pg of gDNA were used. Primers contained the following sequences: mL1 ORF2 F: 5′-CTGGCGAGGATGTGGAGAA-3′, mL1 ORF2 R: 5′-CCTGCAATCCCACCAACAAT-3′. Computational estimates using the UCSC genome browser in silico PCR function (genome: mouse, assembly: December 2011 (GRCm38/mm10), target: genome assembly) indicated that at least 1308 endogenous mouseL1 elements could be detected using the mL1 ORF2 primer set.As control we used non-mobile genomic repetitive sequences: 5SRNA F: 5′-ACGGCCATACCACCCTGAA-3′; 5SRNA R: 5′-GGTCTCCCATCCAAGTACTAACC-3′.To calculate the relative copy number of genomic L1ORF2, the comparative CT method (ΔΔCT method) was used. For HEK cells, as well as for mouse and human fibroblast cells, relative ORF2 content was further normalized Tet1CD expressing cells. For mouse ESCs, relative ORF2 content of wild type cells was further normalized to the mean relative ORF2 content of Tet-TKO cells.
Chromatin decondensation analysis
3D structured illumination microscopy images were acquired as described previously. To quantify the grade of chromatin decondensation, binary nuclear masks were generated. Therefore, images were processed using a 3D median filter. Filtered images of the DAPI channel were then thresholded using the basic algorithm. Next, all DAPI pixels below the threshold were set to 0 and all pixels above the threshold were set to 1. For further improvement of the nuclear masks, binary images were additionally processed using the “fill holes” and “watershed” algorithms. Finally, the standard deviation of all DAPI histograms was calculated automatically. To automate this procedure, a routine was written in the programming language python.
Immunofluorescence staining
Human AG522D fibroblasts cells were fixed for 10 minutes in 4% formaldehyde in PBS and permeabilized for 20 minutes with 0.5% Triton X-100. For detection of genomic 5hmC and γH2AX, the cells were further fixed with ice-cold methanol for 5 minutes. After RNaseA treatment (10 µg/mL) for 30 minutes at 37°C, cells were washed and blocked for 60 minutes in 0.2% fish skin gelatin (Sigma Aldrich) at 37°C. Then 5hmC and γH2AX were detected using a rabbit anti-5hmC (1:250, catalog number: 39769, Active Motif) and a mouse Anti-phospho-Histone H2A.X (1:400, catalog number: 05–636, Merck), antibody in conjunction with 25 U/mL DNaseI (Sigma Aldrich) for 70 minutes at 37°C. To stop DNaseI digestion, cells were washed 3 time with PBS containing 1mM EDTA and 0.01% Tween. Following incubation with secondary antibody mixture of AMCAdonkey anti-rabbit IgG (1:100, catalog number: 715–155–151, The Jackson Laboratory) and Alexa Fluor 488goat anti-mouse IgG (1:250, catalog number: A11029, Invitrogen) for 50 minutes at RT. After 3 time washing with PBS containing 0.01% Tween the cells were mounted in Vectashield Medium (Vector Labs) and imaged with high content screening microscopy with 20x long/0.45 NA objective (Operetta). The intensities were further calculated using the Harmony software (Operetta) and blotted by RStudio (https://www.rstudio.com).L1ORF1p proteins were detected in HEK using the polyclonal rabbit anti-L1ORF1p antibody as described before. As secondary antibody the AMCAdonkey anti-rabbit IgG was used (1:100, catalog number: 715–155–151, The Jackson Laboratory).
Luciferase reporter assay
Methylated pGL3-L1.3-Luc reporter plasmids were obtained by incubation with M.HpaII and controlled by digestion with HpaII and MspI. Cells were seeded in 6-well dishes at 7 × 105 cells/well. Three hours post seeding, cells were cotransfected with the unmethylated or methylated reporter plasmid pGL3-L1.3-Luc and effector constructs coding for Tet1CD/Tet1CDmut/ Mecp2/Tet1CD+Mecp2, respectively. Luciferase activity was determined using the “Luciferase Assay System” (Promega) as described by the manufacturer on a TECAN infinite M200 plate reader (Tecan Group Ltd.). To control for consistent transfection of methylated and unmethylated reporter plasmids, the fluorescent signal emanating from the effector proteins was quantified in parallel (GFP: excitation 475 nm, emission 520 nm; RFP: excitation 585 nm, emission 630 nm) and was used for normalization of the luciferase signal. In addition, fluorescent signals were used to control for homogeneous expression of Tet1CD in single (Tet1CD) and double (Tet1CD + Mecp2) transfected cells.
Authors: Tammy A Morrish; Nicolas Gilbert; Jeremy S Myers; Bethaney J Vincent; Thomas D Stamato; Guillermo E Taccioli; Mark A Batzer; John V Moran Journal: Nat Genet Date: 2002-05-13 Impact factor: 38.330
Authors: Ryan Lister; Eran A Mukamel; Joseph R Nery; Mark Urich; Clare A Puddifoot; Nicholas D Johnson; Jacinta Lucero; Yun Huang; Andrew J Dwork; Matthew D Schultz; Miao Yu; Julian Tonti-Filippini; Holger Heyn; Shijun Hu; Joseph C Wu; Anjana Rao; Manel Esteller; Chuan He; Fatemeh G Haghighi; Terrence J Sejnowski; M Margarita Behrens; Joseph R Ecker Journal: Science Date: 2013-07-04 Impact factor: 47.728
Authors: Carina Frauer; Thomas Hoffmann; Sebastian Bultmann; Valentina Casa; M Cristina Cardoso; Iris Antes; Heinrich Leonhardt Journal: PLoS One Date: 2011-06-22 Impact factor: 3.240
Authors: Annette Becker; Lena Allmann; Maria Hofstätter; Valentina Casà; Patrick Weber; Anne Lehmkuhl; Henry D Herce; M Cristina Cardoso Journal: PLoS One Date: 2013-01-15 Impact factor: 3.240
Authors: María Arroyo; Florian D Hastert; Andreas Zhadan; Florian Schelter; Susanne Zimbelmann; Cathia Rausch; Anne K Ludwig; Thomas Carell; M Cristina Cardoso Journal: Nat Commun Date: 2022-09-02 Impact factor: 17.694
Authors: Cathia Rausch; Peng Zhang; Corella S Casas-Delucchi; Julia L Daiß; Christoph Engel; Gideon Coster; Florian D Hastert; Patrick Weber; M Cristina Cardoso Journal: Nucleic Acids Res Date: 2021-12-16 Impact factor: 16.971