Literature DB >> 35562425

Establishment of H3K9-methylated heterochromatin and its functions in tissue differentiation and maintenance.

Jan Padeken1,2, Stephen P Methot1, Susan M Gasser3,4.   

Abstract

Heterochromatin is characterized by dimethylated or trimethylated histone H3 Lys9 (H3K9me2 or H3K9me3, respectively) and is found at transposable elements, satellite repeats and genes, where it ensures their transcriptional silencing. The histone methyltransferases (HMTs) that methylate H3K9 - in mammals Suppressor of variegation 3-9 homologue 1 (SUV39H1), SUV39H2, SET domain bifurcated 1 (SETDB1), SETDB2, G9A and G9A-like protein (GLP) - and the 'readers' of H3K9me2 or H3K9me3 are highly conserved and show considerable redundancy. Despite their redundancy, genetic ablation or mistargeting of an individual H3K9 methyltransferase can correlate with impaired cell differentiation, loss of tissue identity, premature aging and/or cancer. In this Review, we discuss recent advances in understanding the roles of the known H3K9-specific HMTs in ensuring transcriptional homeostasis during tissue differentiation in mammals. We examine the effects of H3K9-methylation-dependent gene repression in haematopoiesis, muscle differentiation and neurogenesis in mammals, and compare them with mechanistic insights obtained from the study of model organisms, notably Caenorhabditis elegans and Drosophila melanogaster. In all these organisms, H3K9-specific HMTs have both unique and redundant roles that ensure the maintenance of tissue integrity by restricting the binding of transcription factors to lineage-specific promoters and enhancer elements.
© 2022. Springer Nature Limited.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35562425      PMCID: PMC9099300          DOI: 10.1038/s41580-022-00483-w

Source DB:  PubMed          Journal:  Nat Rev Mol Cell Biol        ISSN: 1471-0072            Impact factor:   113.915


Introduction

More than half of the vertebrate genome is packaged into condensed, transcriptionally repressed heterochromatin. The epigenetic hallmark of heterochromatin is dimethylated or trimethylated histone H3 Lys9 (H3K9me2 or H3K9me3, respectively). The DNA classically attributed to methylated H3K9 (H3K9me) heterochromatin comprises repetitive non-coding sequences such as pericentric satellites and subtelomeric repeats, which are silenced and clustered together, forming constitutive heterochromatin. However, recent work shows that this well-conserved histone modification serves a second, more dynamic function in the repression of tissue-specific genes. The six mammalian histone methyltransferases (HMTs) that target H3K9 have distinct modes of action and a complex interplay as they establish facultative heterochromatin, which controls lineage-specific gene repression during development in a tissue-specific manner. Early data suggesting that H3K9 methylation had a role in repressing genes and not only repetitive elements were based on the study of individual loci (for example, IFNB1 (ref.[1]), Magea genes[2] and Bmi1 (ref.[3])). H3K9me2 catalysed by the mammalian HMT G9A was especially shown to regulate transcription in these pioneering studies. Obtaining a genome-wide view was hindered by the difficulty of mapping H3K9 methylation accurately[4] and by the fact that the six HMTs that methylate H3K9 in mice and humans showed partial redundancy. The redundancy issue was resolved in part by studying H3K9 methylation in simpler organisms, such as Drosophila melanogaster, which has three enzymes, and Caenorhabditis elegans, which has only two. Remarkably, the deletion of both H3K9 methyltransferases in worms, which eliminated all detectable H3K9 methylation, only slightly impaired embryo to adult development[5-7]. In this Review, we focus on the best characterized mammalian H3K9 methyltransferases — Suppressor of variegation 3–9 homologue 1 (SUV39H1), SUV39H2, SET domain bifurcated 1 (SETDB1), SETDB2, G9A (also known as EHMT2) and G9A-like protein (GLP; also known as EHMT1)[8] — and on their related enzymes in D. melanogaster and C. elegans (Fig. 1a). We discuss their modes of action and roles in lineage-specific gene expression. Elegant work in fission yeast detailing the role of H3K9 methylation and its methyltransferase Clr4, which sets up self-propagating domains of heterochromatin at centromeres, telomeres and the mating type locus, has been covered elsewhere[9,10].
Fig. 1

The major H3K9 methyltransferases, cofactors and targets.

a | The structural domains of the major histone H3 Lys9 (H3K9) methyltransferases in Mus musculus, Drosophila melanogaster and Caenorhabditis elegans. The SET domain contains the catalytic site. The enzymes are grouped by structural relatedness. One outlier is C. elegans SET-25, which has a SUV39-like SET and pre-SET domain, but does not have the methylated-lysine-binding chromodomain like other SUV39 enzymes. The amino-terminal basic domain in SUV39H2 is found uniquely in the mouse, and not the human, enzyme. SET domain bifurcated 1 (SETDB1), Setdb1 and MET-2 are structurally closely related and also share conserved essential cofactors. Tudor domains, ankyrin repeats and methyl-CpG-binding domains (MBDs) have roles in targeting the histone methyltransferases (HMTs) to specific sites of action. b | The three families of H3K9 methyltransferases are shown with cofactors and interacting proteins that enable site-specific HMT recruitment. Suppressor of variegation 3–9 homologue 1 (SUV39H1) and SUV39H2 are often recruited to chromatin by non-coding RNA (ncRNA), or in worms by endogenous siRNA bound to Argonaut (Ago) proteins. SUV39 enzymes bind both to heterochromatin protein 1 (HP1) proteins and to dimethylated or trimethylated H3K9 (H3K9me2/3), which help to further recruit the HMT. SETDB1 class enzymes form a complex with activating transcription factor 7-interacting protein 1 (ATF7IP) (LIN-65 in worms) and ARLE14EP (adenosine 5′-diphosphate-ribosylation factor-like 14 (ARLE-14) in worms) or with chromatin assembly factor 1 (CAF1). With CAF1, SETDB1 monomethylates H3K9, whereas with ATF7IP, it binds either Krüppel-associated box (KRAB)-interacting protein 1 (KAP1) together with KRAB domain-containing zinc-finger proteins (KRAB-ZFPs) at endogenous retroviruses (ERVs) or is directly recruited by HP1 proteins to produce H3K9me2/3. In worms, MET-2 binds LIN-65 and ARLE-14. The enzymes G9A and G9A-like protein (GLP) bind a variety of transcription factors (TFs), together with histone deacetylases (HDACs), mSin3a and other chromatin modifiers. They also bind monomethylated H3K9 (H3K9me1) or H3K9me2 with their ankyrin repeats. The repeat elements targeted by these HMTs belong to three classes: centromeric or telomeric satellites (simple repeats), ERVs, which are long terminal repeat (LTR)-containing retrotransposons, or non-LTR-containing transposons such as long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). All HMTs are also recruited to genes and enhancer elements. CTBP, C-terminal-binding protein 1; ETn, early transposon; IAP, intracisternal A-particle; MERVL, murine endogenous retrovirus type L; MLV, murine leukaemia virus; NES, nuclear export signal; NLS, nuclear localization signal; ORF, open reading frame; TSD, target site duplication; UTR, untranslated region; WIZ, widely interspaced zinc-finger-containing protein.

The major H3K9 methyltransferases, cofactors and targets.

a | The structural domains of the major histone H3 Lys9 (H3K9) methyltransferases in Mus musculus, Drosophila melanogaster and Caenorhabditis elegans. The SET domain contains the catalytic site. The enzymes are grouped by structural relatedness. One outlier is C. elegans SET-25, which has a SUV39-like SET and pre-SET domain, but does not have the methylated-lysine-binding chromodomain like other SUV39 enzymes. The amino-terminal basic domain in SUV39H2 is found uniquely in the mouse, and not the human, enzyme. SET domain bifurcated 1 (SETDB1), Setdb1 and MET-2 are structurally closely related and also share conserved essential cofactors. Tudor domains, ankyrin repeats and methyl-CpG-binding domains (MBDs) have roles in targeting the histone methyltransferases (HMTs) to specific sites of action. b | The three families of H3K9 methyltransferases are shown with cofactors and interacting proteins that enable site-specific HMT recruitment. Suppressor of variegation 3–9 homologue 1 (SUV39H1) and SUV39H2 are often recruited to chromatin by non-coding RNA (ncRNA), or in worms by endogenous siRNA bound to Argonaut (Ago) proteins. SUV39 enzymes bind both to heterochromatin protein 1 (HP1) proteins and to dimethylated or trimethylated H3K9 (H3K9me2/3), which help to further recruit the HMT. SETDB1 class enzymes form a complex with activating transcription factor 7-interacting protein 1 (ATF7IP) (LIN-65 in worms) and ARLE14EP (adenosine 5′-diphosphate-ribosylation factor-like 14 (ARLE-14) in worms) or with chromatin assembly factor 1 (CAF1). With CAF1, SETDB1 monomethylates H3K9, whereas with ATF7IP, it binds either Krüppel-associated box (KRAB)-interacting protein 1 (KAP1) together with KRAB domain-containing zinc-finger proteins (KRAB-ZFPs) at endogenous retroviruses (ERVs) or is directly recruited by HP1 proteins to produce H3K9me2/3. In worms, MET-2 binds LIN-65 and ARLE-14. The enzymes G9A and G9A-like protein (GLP) bind a variety of transcription factors (TFs), together with histone deacetylases (HDACs), mSin3a and other chromatin modifiers. They also bind monomethylated H3K9 (H3K9me1) or H3K9me2 with their ankyrin repeats. The repeat elements targeted by these HMTs belong to three classes: centromeric or telomeric satellites (simple repeats), ERVs, which are long terminal repeat (LTR)-containing retrotransposons, or non-LTR-containing transposons such as long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). All HMTs are also recruited to genes and enhancer elements. CTBP, C-terminal-binding protein 1; ETn, early transposon; IAP, intracisternal A-particle; MERVL, murine endogenous retrovirus type L; MLV, murine leukaemia virus; NES, nuclear export signal; NLS, nuclear localization signal; ORF, open reading frame; TSD, target site duplication; UTR, untranslated region; WIZ, widely interspaced zinc-finger-containing protein.

Features and functions of H3K9 methyltransferases

The six mammalian H3K9 methyltransferases fall into three main families, with two variants in each. The fact that each family has homologues in simpler multicellular organisms, on the basis of homology within and beyond the catalytic SET domain, argues for conserved modes of recruitment and secondary activities. Genetic approaches that ablate H3K9 methyltransferases confirm that the mechanisms that regulate each HMT family are distinct, although the HMTs silence partially overlapping sets of target loci.

The ‘Suppressor of variegation’ HMT family: SUV39H1, SUV39H2, Su(var)3–9 and SET-25

One of the best studied H3K9 methyltransferases is the fruitfly enzyme Suppressor of variegation 3–9 (Su(var)3–9), which was identified genetically as a suppressor of position effect variegation (PEV)[11]. PEV is a genetic assay used in screens, which provided much of our basic insight into individual heterochromatin proteins and how they function[12] (Box 1). Following up on genetic assays that implicated Su(var)3–9 in the establishment and spreading of centromeric heterochromatin in flies, it was shown that its two mouse homologues, SUV39H1 and SUV39H2 (SUV39H1/H2)[13], as well as Su(var)3–9 itself, function as H3K9-specific HMTs[11]. Su(var)3–9 is a founding member of the SET domain-containing superfamily of methyltransferases (Fig. 1a). Su(var)3–9 and its homologues are characterized by a central SET domain flanked by conserved pre-SET and post-SET domains. These Cys-rich modules bind the three zinc ions necessary for the catalytic activity of the central SET domain, which was originally named for a larger conserved region present in the D. melanogaster proteins Su(var)3–9, Enhancer of zeste and Trithorax[14]. Although SUV39H1 can methylate unmodified H3K9 in vitro[13], it was shown to bind preferentially to H3K9me1, which may serve as its recruitment signal at centromeric repeats[15]. In vivo, although SUV39H1/H2 are together responsible for most H3K9me2 and H3K9me3 at pericentric heterochromatin and interspersed satellite repeats, they are not essential for H3K9me1 (refs[16-19]) (Fig. 1b). Thus, it is widely assumed, albeit not proven, that in organisms H3K9me1 is a prerequisite for SUV39H1/H2 activity. Interestingly, the catalytic activities of SUV39H1/H2 are reinforced by their binding to H3K9me2 and H3K9me3 (ref.[20]) through a chromodomain located in their amino termini[21,22], suggesting a role for these HMTs in the propagation of existing H3K9 methylation. The chromodomain is conserved in the D. melanogaster Su(var)3–9 and the fission yeast Clr4, but is missing in a related C. elegans homologue, SET-25 (Fig. 1a). Activity of both Clr4 and SUV39H2 also appears to be regulated by automethylation within an inhibitory loop downstream of the SET domain[23]. Besides binding H3K9me2 and H3K9me3 directly, SUV39H1/H2 interact with members of the heterochromatin protein 1 (HP1) family, which are characterized by an amino-terminal chromodomain specific for H3K9me2 and H3K9me3, and a dimer-promoting chromoshadow domain in their carboxyl terminus. These domains are linked by a flexible hinge region that is thought to be involved in binding RNA[24]. The HP1 proteins function as adaptors that recruit other proteins to heterochromatin and mediate repression. In mammals the binding of HP1α or HP1β to existing H3K9me2 and H3K9me3 helps recruit SUV39H1/H2, providing a mechanism that promotes the spread of H3K9 methylation from sites of initial HP1 recruitment[25,26], reminiscent of mechanisms described in fission yeast[9,10] (Fig. 1b). Neither the direct binding nor the indirect binding of an HMT to existing H3K9me2 and H3K9me3 explains, however, the de novo establishment of heterochromatin at target sequences, a capacity clearly demonstrated for the C. elegans HMT SET-25 (ref.[27]). SET-25 appears to be recruited to its target regions at least in part through Argonaute proteins and endogenous siRNAs to nucleate repression[28-30] (Fig. 1b). Interestingly, recent work revealed that, in addition to binding H3K9me2 and H3K9me3, the chromodomain of SUV39H1 recognizes RNA transcribed from major satellite repeats[31,32]. A similar activity was assigned to the first 81 amino acids of mouse SUV39H2, a domain that binds single-stranded major satellite repeat RNA, ensuring retention of the enzyme on mitotic chromosomes[33]. The RNA-mediated recruitment of SUV39H1/H2 provides an elegant mechanism for nucleation of heterochromatin precisely at the sites where repetitive sequences are aberrantly transcribed. Moreover, in two-cell-stage mouse embryos, a burst of transcription from the major satellite repeats is required for the organization of heterochromatin into chromocentres[34-36] (Box 2; Fig. 2).
Fig. 2

Roles of H3K9 methylation in long-range chromatin interactions and nuclear organization.

An interphase nucleus in a differentiated vertebrate tissue is shown with heterochromatin (blue) sequestered against the nucleolus and against the nuclear lamina.The separation of active and inactive chromatin domains can manifest itself as lamin-associated domains (LADs)[207] or as A and B compartments[208] (Box 2). B compartments correspond to gene-inactive chromatin (heterochromatin) and largely include LADs. A compartments correspond to gene-active chromatin, which is further organized into topologically associating domains (TADs; left), which enable enhancer–promoter interaction that stimulates gene expression (not shown). In differentiated tissues, methylated histone H3 Lys9 (H3K9me) blocks TAD formation by preventing the binding of CCCTC-binding factor (CTCF) and cohesin complexes, which delineate TAD borders. On the right are shown several mechanisms for heterochromatin anchorage at the nuclear periphery. In Caenorhabditis elegans, the nuclear envelope-associated protein CEC-4 recognizes all forms of H3K9me (monomethylated H3K9 (H3K9me1), dimethylated H3K9 (H3K9me2) and trimethylated H3K9 (H3K9me3)) through its chromodomain, with affinity roughly equal to that of heterochromatin protein 1 (HP1)[209]. CEC-4 contributes directly to heterochromatin anchoring in worm embryos[209–211], but upon tissue differentiation, as in mammals, additional anchoring pathways are induced, which in worms are independent of H3K9me[212]. Anchoring mechanisms present in differentiated mammalian cells include a histone methylation-dependent pathway with at least two potential anchors: proline-rich protein 14 (PRR14; potentially a functional homologue of CEC-4) is a non-transmembrane protein that anchors heterochromatin through H3K9me and HP1, and lamin B receptor (LBR), which is a transmembrane factor that binds both HP1α and dimethylated histone H4 Lys20 (H4K20me2) through its Tudor domain[213–216]. Another mechanism is lamin A dependent[214] and likely involves its interaction with RB and/or with transcription factors (TFs), such as SREBP. Emerin and lamina-associated polypeptide 2, isoform-β (LAP2β) are additional lamin-associated factors that bind chromatin through the histone deacetylase 3 (HDAC3)–nuclear receptor co-repressor 1 (NCOR)–silencing mediator of retinoic acid and thyroid hormone receptor (SMRT; also known as NCOR2) complex and potentially through tissue-specific transcription factors[215,216] (Box 2). H. sapiens, Homo sapiens.

Roles of H3K9 methylation in long-range chromatin interactions and nuclear organization.

An interphase nucleus in a differentiated vertebrate tissue is shown with heterochromatin (blue) sequestered against the nucleolus and against the nuclear lamina.The separation of active and inactive chromatin domains can manifest itself as lamin-associated domains (LADs)[207] or as A and B compartments[208] (Box 2). B compartments correspond to gene-inactive chromatin (heterochromatin) and largely include LADs. A compartments correspond to gene-active chromatin, which is further organized into topologically associating domains (TADs; left), which enable enhancer–promoter interaction that stimulates gene expression (not shown). In differentiated tissues, methylated histone H3 Lys9 (H3K9me) blocks TAD formation by preventing the binding of CCCTC-binding factor (CTCF) and cohesin complexes, which delineate TAD borders. On the right are shown several mechanisms for heterochromatin anchorage at the nuclear periphery. In Caenorhabditis elegans, the nuclear envelope-associated protein CEC-4 recognizes all forms of H3K9me (monomethylated H3K9 (H3K9me1), dimethylated H3K9 (H3K9me2) and trimethylated H3K9 (H3K9me3)) through its chromodomain, with affinity roughly equal to that of heterochromatin protein 1 (HP1)[209]. CEC-4 contributes directly to heterochromatin anchoring in worm embryos[209-211], but upon tissue differentiation, as in mammals, additional anchoring pathways are induced, which in worms are independent of H3K9me[212]. Anchoring mechanisms present in differentiated mammalian cells include a histone methylation-dependent pathway with at least two potential anchors: proline-rich protein 14 (PRR14; potentially a functional homologue of CEC-4) is a non-transmembrane protein that anchors heterochromatin through H3K9me and HP1, and lamin B receptor (LBR), which is a transmembrane factor that binds both HP1α and dimethylated histone H4 Lys20 (H4K20me2) through its Tudor domain[213-216]. Another mechanism is lamin A dependent[214] and likely involves its interaction with RB and/or with transcription factors (TFs), such as SREBP. Emerin and lamina-associated polypeptide 2, isoform-β (LAP2β) are additional lamin-associated factors that bind chromatin through the histone deacetylase 3 (HDAC3)–nuclear receptor co-repressor 1 (NCOR)–silencing mediator of retinoic acid and thyroid hormone receptor (SMRT; also known as NCOR2) complex and potentially through tissue-specific transcription factors[215,216] (Box 2). H. sapiens, Homo sapiens. In addition to RNA transcribed from major satellite repeats, the long non-coding RNA Oct4P4, which is transcribed from an Oct4 pseudogene, has been shown to form a complex with SUV39H1 and to direct H3K9me3 to the promoter of the functional Oct4 gene, ensuring its transcriptional silencing[37]. This important example demonstrates that RNA-mediated SUV39H1 recruitment can occur in trans. Similarly, the long non-coding RNA telomeric repeat-containing RNA (TERRA) associates with SUV39H1 to promote an accumulation of H3K9me3 at damaged telomeres and prevent end-to-end chromosomal fusions[38], potentially acting in trans. The molecular determinants of specificity in the interaction of SUV39H1/H2 with RNA are unclear, yet these results argue that RNA-mediated recruitment of SUV39H1/H2 may be a general mechanism for context-dependent establishment of an H3K9me-marked domain (see ref.[10] for siRNA-mediated heterochromatin establishment in Schizosaccharomyces pombe). Determinants of specificity of HMT recruitment are of particular interest given the diverse set of targets that depend on SUV39H1/H2 for repression[18,19,39]. Namely, in addition to pericentric heterochromatin, SUV39H1/H2 are recruited to class II endogenous retroviruses (ERVs), particularly intracisternal A-particles (IAPs; such as IAPEz and IAPEy) and early transposons (especially MMETn and ETnERV), and to intact long interspersed nuclear elements (LINEs) such as LINE1 in mouse embryonic stem cells (ESCs)[19] (Fig. 1b). In mouse ESCs, the loss of SUV39H1/H2 led to a loss H3K9me3 and upregulation of transcripts from intact LINE1 elements[18,19]. By contrast, IAP elements responded to the deletion of both Suv39h1 and Suv39h2 by losing their flanking H3K9me3, yet they retained H3K9me3 over the repeat element itself[19] and remained silent, likely owing to the recruitment and activity of another H3K9 methyltransferase, for example SETDB1 (ref.[19]). Similar redundancy among H3K9 methyltransferases was observed at telomeres: although the deletion of both Suv39h1 and Suv39h2 resulted in abnormally long telomeres with lower levels of H3K9me2 and H3K9me3, the levels of H3K9me1 increased[39], as did the recruitment of SETDB1, thereby ensuring that H3K9me3 was not completely lost[40]. Such redundancy, found both within and across the H3K9 methyltransferase families, makes it difficult to use loss-of-function genetic screens to identify recruitment factors. Whereas satellite repeats and LINE1 transposons are the best studied targets of SUV39H1/H2-mediated repression, these HMTs silence other important targets. For example, SUV39H1 and its homologues are essential for the repression of non-transcribed ribosomal DNA (rDNA) repeats in flies[41], worms[7] and mice[42]. In flies, the loss of Su(var)3–9 results not only in elevated levels of ribosomal RNA transcripts but also in chromosomal instability at the rDNA repeat loci, a phenotype also observed in ago2 mutants[41], which lose Argonaute-2. In contrast to the constitutive silencing of satellite repeats, mouse SUV39H1 at rDNA repeats is regulated by the energy status of the cell, potentially by the NAD+/NADH ratio and the histone deacetylase (HDAC) SIRT1 (ref.[42]). SIRT1 is important for the deacetylation of histones H3 and H4, to allow their subsequent methylation[42], but it also directly regulates SUV39H1 through the deacetylation of Lys266 in its SET domain. Lys266 deacetylation increases SUV39H1 activity[43] and prevents its polyubiquitylation on Lys87 and subsequent degradation, which occur under oxidative stress[44]. SUV39H1 activity is further regulated by the HDAC SIRT6, which promotes monoubiquitylation of Cys222, Cys226 and Cys232 in the pre-SET domain by the E3 ubiquitin ligase SKP2 (ref.[45]). These mechanisms help regulate SUV39H1/H2 activity at repeat sequences, yet they may also be relevant in the repression of tissue-specific genes, as discussed later. The classical position effect variegation (PEV) assay is based on an inversion in the Drosophila melanogaster X chromosome, which brings the white gene within ~25 kb of a domain of pericentric heterochromatin, shifting the border between heterochromatin and euchromatin towards the telomere. The inversion leads to the silencing of white and loss of the red eye colour in flies (fly genes are named for the mutant phenotypes)[12]. With use of the PEV assay, it was observed that sequences organized into heterochromatin had a tendency to be silenced in a semi-stable manner, generating phenotypic variegation. Similar variegated phenotypes arise when genes are inserted into subtelomeric chromatin in yeast[221]. In the case of the translocated white gene, PEV manifests itself as a random pattern of eye facets coloured white, instead of red. Various research groups have used PEV-reporter strains to screen for mutant genes that encode proteins that either modulate or are essential for heterochromatic gene repression, with the histone H3 Lys9 (H3K9) methyltransferase gene Su(var)3–9 being one of the most prominent hits, along with Su(var)2–5, which encodes heterochromatin protein 1 (Hp1), a protein that binds methylated H3K9 (ref.[12]). Euchromatin and heterochromatin are spatially segregated in the interphase nucleus. The identification of A and B compartments within the genome stems from sequence interaction data generated by Hi-C[208] or ARC-C[222], and reflects the observation that interactions between loci on a megabase scale are favoured within each compartment rather than between them. The A compartment comprises largely open chromatin, and the B compartment is generally closed (Fig. 2). The former is further subdivided into smaller, looped topologically associating domains (TADs)[222-224], within which sequences also preferentially interact to regulate transcription. TAD looping and insulation is mediated primarily by cohesin complexes[225] and the insulator protein CTCF in mammals[226,227]. Finally, juxtaposed to the nuclear envelope, lamin-associated domains (LADs) can be identified by Dam-ID[207]. LADs are enriched in heterochromatin in mammals and worms[228-231], and dimethylated histone H3 Lys9 (H3K9me2) loss compromises their lamina anchoring in worm embryos[6]. H3K9me2 is enriched at LADs, but is not exclusively found in LADs[229], which significantly overlap with the B compartment[222,232]. H3K9me2 levels are regulated by different histone methyltransferases (HMTs) in A and B compartments: in embryonic stem cells (ESCs) and differentiated mammalian cells, A-compartment H3K9me2 levels depend on SET domain bifurcated 1 (SETDB1) and especially on G9A and/or G9A-like protein (GLP), whereas in B compartments, they depend on all five main HMTs (G9A, GLP, SETDB1, Suppressor of variegation 3–9 homologue 1 (SUV39H1) and SUV39H2)[233,234]. The separation of A and B compartments is less pronounced in undifferentiated ESCs, and SETDB1 ablation in ESCs resulted in changes in H3K9 methylation in both compartments[233,235]. G9A-deficient and SETDB1-deficient cells show aberrant CTCF and cohesin binding, resulting in the formation of additional chromatin contacts and diminished TAD insulation[107,236]. The loss of widely interspaced zinc-finger-containing protein (WIZ), a G9A cofactor, decreased CTCF and cohesin colocalization and chromatin looping and destabilized G9A[237], implicating HMTs and H3K9me2 in establishment and maintenance of the inactive compartment. Connecting heterochromatin to the nuclear envelope is well characterized in Caenorhabditis elegans, where a nuclear envelope-associated chromodomain protein, CEC-4, binds equally to monomethylated H3K9, H3K9me2 and trimethylated H3K9 (H3K9me3) peptides[209-211,216] (Fig. 2). CEC-4 chromodomain integrity is required for the correct positioning of methylated H3K9 (H3K9me)-marked domains in worm embryos, but not for transcription repression, which depends instead on ‘readers’, namely heterochromatin protein 1 (HP1) homologues and the protein LIN-61 (refs[82,209]). H3K9me, but not perinuclear anchoring, was shown to contribute to megabase-scale TAD organization on the worm X chromosome by the condensin dosage compensation complex (DCC)[238]. In met-2 (SETDB1 homologue) set-25 (Su(var)3–9 homologue) double mutants, DCC-dependent TAD boundaries were compromised, while DCC-independent boundaries were not[238], showing that multiple pathways segregate active and inactive chromatin domains[212]. Whereas ablation of the H3K9me2 HMT MET-2 in worm embryos largely abolished heterochromatin anchoring[6,60,210], loss of the mouse HMT G9A had a more nuanced effect on nuclear organization. Similarly, in G9A-deficient mouse ESCs or G9A inhibitor-treated hepatocytes, only a subset of nuclear envelope-bound genes were derepressed[234,239], likely reflecting redundancy between G9A, SETDB1, SUV39H1 and SUV39H2 (ref.[240]) and redundancy between anchors[212,214] (Fig. 2). In mammals, multiple factors contribute to heterochromatin anchoring. First, PRR14 is a bifunctional heterochromatin anchor with separable HP1 and lamin interaction domains[241,242]. It is phosphorylated and dissociates from chromatin at mitosis and reassociates in early G1 phase to anchor H3K9-methylated chromatin[243]. Second, lamin itself contributes to the spatial segregation of active TADs from inactive LADs in mammalian cells[244], Drosophila melanogaster S2 cells[245] and C. elegans[246,247]. Third, vertebrate lamin A and the lamin B receptor contribute with various degrees to heterochromatin tethering in differentiated-cell nuclei[214,215], along with other tissue-specific pathways found in many species[214]. In cultured D. melanogaster cells, lamin ablation led to chromatin repositioning from the nuclear envelope, an accumulation of active TADs (A type) and TAD decompaction[245]. This phenotype also occurred upon lamin disruption in mammals[244]. Finally, lamin point mutations can both disrupt and enhance perinuclear sequestration of genes in differentiated tissues[181,214,248,249]. Although H3K9me has a direct role in nuclear positioning of chromatin and in gene repression, positioning at the nuclear periphery is not sufficient to repress transcription[6,83,250-252]. Nonetheless, in a promoter-dependent manner, perinuclear localization can dampen expression[215]. For instance, histone deacetylase 3 (HDAC3) associates with the nuclear lamina and with the SMRT–NCOR1 complex, which interacts with various tissue-specific transcription factors[215,253]. These interactions appear to antagonize H3K27 acetylation, which drives both transcription activation and relocation of nuclear envelope genes in differentiated cells[212,214]. Importantly, the association of HP1 with H3K9me is not sufficient for nuclear envelope anchoring: heterochromatin centromeric satellite repeats are found clustered in nucleus-internal chromocentres[254] in both mouse embryonic fibroblasts and D. melanogaster cells. Although deletion of both SUV39H1 and SUV39H2 resulted in H3K9me3 loss from chromocentres, these compartments remained structurally intact and the chromatin remained compacted[17,255]. The integrity of chromocentres was partially lost in SUV39H1–SUV39H2–SETDB1 triple mutants and was extensively lost upon the ablation of all six mammalian HMTs[8], underscoring their functional redundancy.

SETDB1 and MET-2

SETDB1, like SUV39H1, is an abundant SET domain-containing H3K9 methyltransferase. SETDB1 is characterized by an unusual bifurcated SET domain, which contains a 347 amino acid insertion in the middle of an otherwise well conserved and catalytically active SET domain as well as characteristic pre-SET and post-SET domains[46,47] (Fig. 1a). In contrast to SUV39H1/H2, however, SETDB1 can methylate unmodified H3K9 to generate H3K9me1, H3K9me2 and H3K9me3 both in vitro and in vivo[48]. Its catalytic activity is regulated by several modifications at or near the SET domain[49-52]. These modifications include monoubiquitylation of Lys867 in the SET domain insertion, which has been shown to increase SETDB1 processivity in mammals[49,50] and to enhance protein stability and nuclear localization in D. melanogaster[52]. In addition, sumoylation of the insertion sequence may regulate SETDB1 recruitment to chromatin[51]. Su(var)2-10, the fly homologue of the SUMO E3 ligase PIAS, is essential to mediate SETDB1-dependent and H3K9me3-dependent repression by PIWI-interacting RNAs in the female germ line[53]. Collectively, these observations suggest that the regulation of SETDB1 function through ubiquitylation and sumoylation is conserved. Interestingly, the H3K9me reader HP1 and the transcription regulator Krüppel-associated box (KRAB)-interacting protein 1 (KAP1; also known as TIF1β and TRIM28), which recruits SETDB1, are also regulated by sumoylation[54,55]. Monoubiquitylation of SETDB1 has been suggested to be promoted by the conserved cofactor of SETDB1, activating transcription factor 7-interacting protein 1 (ATF7IP)[56,57]. Consistently, mammalian ATF7IP and its fly homologue (Windei) and worm homologue (LIN-65) are essential for robust SETDB1-mediated H3K9 methylation[58]. As well as regulating SETDB1 activity, ATF7IP (like Windei and LIN-65) is necessary for nuclear localization and chromatin association of SETDB1 (refs[52,56,57,59,60]). Loss of ATF7IP further coincides with enhanced, but incomplete, degradation of SETDB1, a phenomenon also observed for the C. elegans homologue MET-2 upon loss of LIN-65 (refs[56,57,60]). It is likely that these various roles are interdependent, as ATF7IP and LIN-65 binding also correlates with the stable presence of SETDB1 and MET-2, respectively, in heterochromatin foci or condensates[60]. An integral element of SETDB1 control appears to be its nuclear depletion, which has been observed in differentiating muscle cells in mice[61], upon heat stress in C. elegans[60] and when the protein is overexpressed in fusion with GFP (compare ref.[6] with ref.[60]). Indeed, the SETDB1 amino terminus contains both nuclear localization and nuclear export signals[62], and the ATF7IP-binding site[52,57,59] (Fig. 1a). The fluctuation of nuclear levels of SETDB1 in response to environmental stress suggests that active nuclear uptake or export may be a physiological regulator of SETDB1 function[60,61,63]. Indeed, MET-2 shifts from cytoplasm to the nucleus during early development, coincident with the bulk accumulation of H3K9me2 (ref.[59]). In addition to the SET domain, SETDB1 contains a double Tudor domain[64] that has been implicated in its recruitment to histones that are bivalent for H3K9me1 and H3K4 acetylation[65]. Although the bivalency is proposed to recruit SETDB1 to specific loci, it is unclear whether or where the bivalency is relevant in vivo. SETDB1 further contains a putative methyl-CpG-binding domain[47,48] that is conserved across mammals, flies and C. elegans (Fig. 1a). The methyl-CpG-binding domain is proposed to help target SETDB1 to regions of DNA methylation, yet it likely serves other functions given that D. melanogaster and C. elegans lack DNA methylation (Box 3). In worms, adenosine 5′-diphosphate-ribosylation factor-like 14 (ARLE-14) potentially helps to recruit MET-2 to gene promoters[59,60,63] (Fig. 1b). SETDB1 has been extensively studied in mouse ESCs, where its loss affects both cell viability and differentiation potential[66]. In these cells the best characterized targets of SETDB1 are class I and class II long terminal repeat (LTR)-containing retroviruses, such as murine leukaemia virus elements and IAPs[67-75] (Fig. 1b). Both of these ERV families are evolutionarily young with intact repeat copies that were shown to have the potential to be active in mice[76]. SETDB1 is recruited to these elements through its interaction with KAP1 (ref.[48]), which functions as a scaffold that bridges between KRAB domain-containing zinc-finger proteins (KRAB-ZFPs) and various heterochromatin proteins. KRAB-ZFPs bind to ERV LTRs to ensure the sequence-specific establishment of H3K9me3 domains[77]. The co-evolution of genetically unstable ERVs and their cognate KRAB-ZFPs has been proposed to drive the enormous expansion of the latter in mammalian genomes[78]. Although ERV silencing is the best described function of SETDB1, several other repetitive elements, as well as genes, are SETDB1 targets (Fig. 1b). At telomeric repeats, SETDB1-mediated H3K9me3 drives recombination-based alternative telomere lengthening, whereas SUV39H1/H2 appear to suppress recombination[40]. At pericentric heterochromatin, SETDB1 is part of the machinery involved in silencing both major and minor satellite repeats[79], possibly by providing H3K9me1 for SUV39-mediated trimethylation[8,80]. This mode of action, in which two HMTs sequentially methylate H3K9, was also shown in C. elegans for MET-2 and SET-25. MET-2 deposits H3K9me2, which is converted to H3K9me3 by SET-25 at full-length transposons and tissue-specific genes[7,81,82]. In worms, the ablation of set-25 did not derepress many genes or satellite repeats, and derepressed only a subset of transposons, because MET-2-mediated H3K9 dimethylation was sufficient (and necessary) to maintain transcriptional repression[27,81,83], likely in cooperation with a range of other heterochromatin proteins[82]. Partial redundancy between SUV39H1 and/or SUV39H2 and SETDB1 was also observed in the silencing of the mammalian LINE1 retrotransposons. Recruitment of SETDB1 to these non-LTR transposons dependent not on KAP1 (ref.[84]) but rather on its interaction with the human silencing hub (HUSH) complex, which is composed of transgene activation suppressor protein (TASOR), M-phase phosphoprotein 8 (MPP8) and periphilin[85]. SETDB1 recruitment appears to require interaction between MPP8 and methylated ATF7IP[86], while TASOR is the central adaptor connecting MPP8 with periphilin 1 and MORC2, which is an ATPase involved in heterochromatin compaction[87,88]. In addition, TASOR interacts with RNA-processing proteins and/or regulators of RNA polymerase II activity[89]. There are two mechanisms that ensure the recruitment of HUSH and SETDB1 to their target sites. MPP8 can bind H3K9me3 through its chromodomain[90], which would in principle promote SETDB1 spreading through a reader–writer mechanism, much like HP1 and SUV39H1/H2 (refs[25,26,91]). This, however, has been contested by another study that argues that HUSH–SETDB1 cannot promote the spread of heterochromatin on its own, even though it can methylate nucleosomes at LINE1 elements[19]. A non-exclusive alternative would be that SETDB1 recruitment is mediated by the ability of periphilin 1 to bind large, intronless RNA derived from endogenous retroelements (for example, LINE1) and pseudogenes, and exogenous RNA transcribed from plasmids or viruses[92]. This capacity would allow a transcription-dependent, but sequence-independent, establishment of H3K9me3 domains[92]. During DNA replication, SETDB1 is also found in complex with other chromatin-targeting factors, namely the histone chaperone chromatin assembly factor 1 (CAF1) and HP1α[80,93] (Fig. 1b). Interestingly, formation of the CAF1–HP1α–SETDB1 complex is mutually exclusive with formation of the ATF7IP–SETDB1 complex[80], and whereas CAF1–HP1α–SETDB1 mediates H3K9 monomethylation at pericentric heterochromatin, ATF7IP–SETDB1 drives most SETDB1-mediated H3K9 trimethylation. SETDB1 is ubiquitously expressed throughout mammalian development, yet most work aimed at understanding SETDB1 function and recruitment is performed in HeLa cells or undifferentiated ESCss. These are inadequate models for explaining the diverse roles of SETDB1 in cell type-specific gene repression during tissue differentiation, especially when compared to C. elegans. In mammals, but not in Drosophila melanogaster, Caenorhabditis elegans or fission yeast, repeat elements and genes can be repressed through DNA methylation of cytosine to 5-methylcytosine (reviewed in ref.[256]). DNA methylation has been shown to depend on individual histone H3 Lys9 (H3K9) methyltransferases to various degrees. Loss of G9A in mouse embryonic stem cells leads to reduced levels of DNA methylation at retrotransposons, satellite repeats, CpG-rich promoters and imprinting control regions[110,257,258]. However, H3K9 methyltransferase redundancy ensures that most repetitive elements retain H3K9 trimethylation and remain silent, despite the loss of DNA methylation[110]. Loss of SET domain bifurcated 1 (SETDB1) or its cofactor Krüppel-associated box (KRAB)-interacting protein 1 (KAP1) results in loss of de novo DNA methylation at long terminal repeat retrotransposons and telomeres in early embryos[40,259], and deletion of both Suppressor of variegation 3–9 homologue 1 (SUV39H1) and SUV39H2 leads to displacement of part of the DNA methyltransferase machinery from pericentric heterochromatin[158,260]. In mouse embryonic stem cells that lack G9A and G9A-like protein (GLP), DNA methylation is lost from imprinting control regions[257], including from the Prader–Willi syndrome imprinting centre[258], thereby generating a parent-of-origin disease. The general consensus is that H3K9 methylation precedes DNA methylation.

SETDB2

SETDB2 is a closely related paralogue of SETDB1 that also has H3K9 methylation activity (Fig. 1a). It is one of the six mouse HMTs that must be ablated to generate cells lacking all detectable H3K9me[8], yet its catalytic activity appears to be restricted to the generation of H3K9me3 from H3K9me1 or H3K9me2 (ref.[94]). In zebrafish, the loss of Setdb2 specifically affects H3K9me3 and disrupts left–right asymmetry during development[95]. In mammals, several recent studies have linked SETDB2 to the silencing of pro-inflammatory chemokine and cytokine genes during influenza virus[96] or severe acute respiratory syndrome coronavirus 2 (ref.[97]) infections. Other studies found low SETDB2 levels in individuals with diabetes with pro-inflammatory phenotypes[98]. In general, however, SETDB2 is an understudied H3K9 methyltransferase.

G9A, GLP and SET-25

The two SET domain-containing HMTs G9A and GLP define a third class of H3K9-specific HMT complexes (Fig. 1). The two enzymes are both ubiquitously expressed and share 44.5% sequence identity (76.5% similarity). Whereas each can form functional homodimers and heterodimers in vitro, only the G9A–GLP heterodimer has been successfully isolated from mouse ESCs[99], and loss or inhibition of either HMT results in major loss of H3K9me2 in mouse ESCs[99,100]. Nonetheless, G9A and GLP appear to have non-overlapping functions in muscle development and terminal differentiation[101]. Like SUV39H1 and SETDB1, G9A and GLP also contain characteristic pre-SET and post-SET domains, even though their SET domains can catalyse only H3K9 monomethylation and dimethylation. This is because the hydroxyl group of Tyr1067 in the G9A SET domain (Tyr1124 in GLP) restricts the orientation of the dimethylamine of H3K9me2, thereby preventing transfer of a third methyl group[102,103]. To date it is unclear whether H3K9me1 and H3K9me2 catalysed by G9A and GLP serve as substrates for other HMTs, although most genomes do contain large domains bearing H3K9me2 only. These domains could arise from unique recruitment of G9A and GLP, from the activity of specific H3K9me1 and/or H3K9me2 readers that block subsequent trimethylation by other HMTs or by the localized action of specific lysine demethylases[104,105] that prevent H3K9me3 accumulation. On the basis of early immunofluorescence analysis of H3K9me2 and/or H3K9me3 in G9A and GLP mutants, both HMTs were initially considered to be ‘euchromatic H3K9 methyltransferases’ (that is, specifically involved in the regulation of genes)[2,99,106]. However, we now know that also class III ERVs, such as murine endogenous retrovirus type L (specifically MT2), are marked by H3K9me2 and repressed in a G9A-dependent manner[107,108]. In addition, G9A was uniquely required for de novo establishment, but not for maintenance, of transcriptional silencing and DNA methylation of integrated vectors of Moloney murine leukaemia virus[109]. Interestingly, in mouse ESCs lacking G9A, a number of IAP and class I MusD retroelements lose H3K9me2, but maintain H3K9me3, suggesting redundancy between G9A and SETDB1 (Fig. 1b). By contrast, a C. elegans set-25 null mutant exhibited loss of H3K9me3, but a gain of MET-2-mediated H3K9me2. Such compensation illustrates the importance of analysing both dimethylation and trimethylation H3K9 states both in the wild type and in the relevant mutants[81,110,111]. Further evidence for a broad function of G9A and GLP in transposon silencing comes from analyses in mouse testis, where G9A is both necessary and sufficient to silence LINE1a and IAP elements in the absence of a functional PIWI-interacting RNA pathway (Mili−/−; also known as Piwil2−/−) [112]. Confusingly, IAP silencing is also dependent on SETDB1 in testis[113], suggesting either that individual copies of these transposons are silenced by different HMTs in some tissues or that the two HMTs are interdependent in testis, but not in mouse ESCs[108]. Finally, similarly to SUV39H1 mutants, the inhibition of G9A resulted in the derepression of rDNA repeats, in nucleolar fragmentation and in accumulation of nucleolar R-loops[114]. Like SUV39H1/H2, G9A and GLP can directly bind H3K9me, albeit not through a chromodomain, but through seven or eight ankyrin repeats, respectively, that bind H3K9me1 and H3K9me2 (ref.[115]). In addition, G9A and GLP bind the HP1 proteins[99,116] through an interaction regulated by automethylation of G9A and GLP at Lys165 (ref.[116]). This automethylation falls within a motif (ARKT) that resembles the histone H3K9 context (ARKS). Interestingly, the same motif is found methylated in the SETDB1 cofactor ATF7IP[86], suggesting that regulation through methylation may be conserved among multiple HMTs. In addition to interactions that recruit G9A to pre-existing H3K9me1 and H3K9me2, G9A was shown to bind a range of sequence-specific transcription factors in different cell types. In CD8+ T cells and U2OS cells, G9A binds the transcription factor PRDM1 (refs[1,117]), whose multiple roles include repression of the interferon-β gene response to regulate T cell development following viral infection[117]. In HeLa cells, G9A and GLP are part of a complex containing chromodomain Y-like protein, HDAC1 or HDAC2 and the transcriptional repressor REST[116,118], whereas in liver cells, G9A represses important developmental genes (for example, FGF21 and CYP7A1) through interactions with the transcription factors E4BP4 (also known as NFIL3)[119,120] and small heterodimer partner (SHP)[121,122]. SHP is an unusual orphan nuclear receptor that lacks a DNA-binding domain, which is involved in the repression of nuclear-receptor targets by recruiting G9A and the chromatin remodeller mSin3a complex[121,122] (Fig. 1b). The interaction of G9A and GLP with HDACs or with known HDAC adapter proteins such as mSin3a is found in multiple tissues. Indeed, the best known cofactor of G9A and GLP is the transcription factor widely interspaced zinc-finger-containing protein (WIZ), which bridges between the HMTs and the co-repressor CTBP[123]. CTBP associates with HDAC1, HDAC2 and HDAC3 (ref.[124]), which, in addition to deacetylating histones, may have a role in the anchoring of inactive genes to the nuclear lamina (Box 2; Fig. 2). Overall, the mechanisms through which H3K9 methyltransferases work together to establish heterochromatin are only partly understood. Nor is it clear in which context they establish H3K9 methylation de novo as opposed to the maintenance of an existing methylation state that antagonizes acetylation. The spatial separation of euchromatin from heterochromatin may itself have a role in the maintenance of silent states (Box 2; Fig. 2). Clearly, understanding both specific HMT targeting and domain segregation remains an important goal for deciphering how H3K9me-mediated silencing influences organismal development.

HMTs during and after differentiation

Whereas much of the cellular H3K9me2 and H3K9me3 marks repetitive or ‘constitutive’ heterochromatin, detailed analyses of tissue-specific H3K9 methylation patterns and transcriptomics in both wild type and HMT mutant organisms implicate this modification in the silencing of tissue-specific genes. Among the different physiological roles proposed for H3K9me2 and H3K9me3, dominant is the maintenance of cell identity by preventing inappropriate expression of tissue-specific genes[125]. Compelling support for this role came from the discovery that H3K9me itself is a barrier to the reprogramming of differentiated cells into so-called induced pluripotent stem cells, presumably by partially blocking the binding to DNA of the pluripotency transcription factors OCT4, SOX2, Krüppel-like factor 4 (KLF4) and MYC[126,127]. Similarly, genomic regions resistant to gene-expression reprogramming following somatic nuclear transfer are marked by H3K9me3, and both the depletion of SUV39H1/H2 and the overexpression of H3K9me demethylases enhance the efficiency of somatic cell nuclear reprogramming[128,129]. Conversely, ectopic expression of the oncogenic mutation of H3 Lys9 to Met (H3K9M), which inhibits Lys9 methylation even in wild type histone H3, impairs ESC differentiation by allowing the continued expression of OCT4, SOX2 and another pluripotency factor, NANOG[130]. Despite these deficiencies, H3K9M-expressing mice survived for up to 1 year, although they experienced expansion of multipotent progenitors, aberrant lymphopoiesis and thrombocytosis, and, in some animals, aggressive T cell leukaemia or lymphomas[130]. Collectively, these results argue strongly for a role for H3K9 methylation in stabilizing differentiated cell type identity and countering oncogenic transformation[131]. In C. elegans, where H3K9me is not essential for mitosis, the complete loss of H3K9 methylation (that is, in the met-2 set-25 double mutant) did not prevent development from embryos to adults, despite widespread misexpression of both repetitive elements and tissue-specific genes[5-7,82]. Nonetheless, the H3K9me-deficient worms displayed stochastic developmental delay and reduced fertility[7,132] and became highly dependent on factors involved in DNA repair and small-RNA pathways for survival[27,81,82,133]. In contrast to worms, mice with knockout of G9a (also known as Ehmt2) or Glp (also known as Ehmt1) were early embryonic lethal (approximately embryonic day 9.5 to embryonic day 12.5) and showed gross morphological defects across all developing cell lineages[2]. Maternally contributed G9A was able to silence a subset of four cell stage-specific genes[134], and ensure proper chromosome segregation[135], but was presumably insufficient to allow prolonged development[134]. Deletion of Setdb1 resulted in similar embryonic lethal phenotypes at 3.5–5.5 days after coitus, blastocysts in vivo showed sub-Mendelian survival[66] and ESC survival in vitro was compromised[66]. By contrast, knockout of Suv39h1 or Suv39h2 showed no superficial developmental defects, although double-null mice were also born at sub-Mendelian ratios and were characterized by prenatal lethality linked to genome instability[16]. The developmental defects that stem from the loss of H3K9 methylation in vertebrates can be aggravated by the mis-segregation of chromosomes, which arises from altered pericentromeric heterochromatin[16,136]. In addition, H3K9 methyltransferases are known to have a few non-histone targets (for example, G9A methylates oestrogen receptor-α[137] and ligase 1 (ref.[138]), SETDB1 and SETDB2 methylate the HIV-1 protein Tat[139], and SUV39H2 methylates RAG2 and the lysine methyltransferases SET8 and DOT1L[140]). Loss of methylation of these non-histone targets could compromise viability; thus, phenotypes driven by the loss of HMTs may not exclusively reflect the absence of H3K9 methylation. Given their ubiquitous expression in tissues[141-143], including in testis[22,141,142], oocytes and zygotes[144], it is not surprising that vertebrate organisms lacking H3K9 methyltransferases are embryonic lethal. To study the roles of individual HMTs in differentiated cells, use of conditional tissue-specific knockouts or conditional degradation through tissue-specific degrons is required. Given the complexity of producing organisms with HMT-deficient tissues, work in tissues was mainly based on loss of SUV39H1 alone, despite evidence for its redundancy with SUV39H2 (ref.[111]) and the role of the latter in keratinocyte gene repression[145]. Similarly, largely on the basis of the interdependency between G9A and GLP in mouse ESCs, primarily G9A mutants have been analysed in tissues, despite evidence that the two HMTs have non-overlapping target genes in differentiating myoblasts[101]. Nonetheless, studies of haematopoiesis, muscle development and neuronal development have provided ample evidence that specific H3K9me methyltransferases have key roles in the formation and maintenance of differentiated tissues. We discuss these studies in the following sections, and include C. elegans and D. melanogaster results for additional mechanistic insights.

Haematopoiesis

Haematopoietic cells, and especially the lymphoid cell lineages, are among the best characterized models of lineage-dependent differentiation (reviewed in refs[146-148]). Haematopoiesis is thus an interesting model for the analysis of mutants with pleiotropic effects, such as deletions of epigenetic modifiers. The adult lymphoid system originates from haematopoietic stem cells (HSCs) located in the bone marrow[146] (Fig. 3). In HSCs, SETDB1 is required for stem cell maintenanace[149] by repressing genes associated with other, non-haematopoietic cell lineages[150]. Loss of H3K9me3 owing to deletion of Setdb1 in the mouse, however, was restricted to a fraction of repressed genes and transposable elements[150]. Among the SETDB1-modified genes, only a subset was actually transcribed in the Setdb1−/− mice. Transposable elements in SETDB1-deficient HSCs also showed only a marginal loss of H3K9me3, even at the class I and class II retrotransposons that in mouse ESCs require SETDB1 for repression. Reductions in H3K9me3 levels were insufficient to trigger loss of DNA methylation at transposons[150]. Nevertheless, mice with HSC-specific Setdb1 deficiency rapidly lost HSCs and progenitor cells from the bone marrow[150].
Fig. 3

Roles of H3K9 methyltransferases in T cell lineage determination.

The well-characterized differentiation steps from haematopoietic stem cells (HSCs) into terminally differentiated T helper cells (T helper 1 (TH1) cells, TH2 cells, T follicular helper (TFH) cells and TH17 cells) are shown. The effects of deletion of SUV39H1 and/or SUV39H2, SETDB1 or G9A are listed. Although not shown, histone H3 Lys9 (H3K9) methyltransferases also have a role in the maturation of B cell progenitors (pro-B cells). In this case, the loss of SETDB1 leads to derepression of both specific endogenous retroviruses (ERVs)[69,217] and non-lineage-specific genes, including genes specific to innate immunity[69]. The selection of genes and repeats that are derepressed likely depends on the variable presence of lineage-specific transcription factors, as described in Caenorhabditis elegans[83] (Box 4). CLP, common lymphoid progenitor; DN, double negative (CD4−CD8−); DP, double positive (CD4+CD8+); ETP, early T cell progenitor; LMPP, lymphoid primed multipotent progenitor; Treg cell, regulatory T cell.

Roles of H3K9 methyltransferases in T cell lineage determination.

The well-characterized differentiation steps from haematopoietic stem cells (HSCs) into terminally differentiated T helper cells (T helper 1 (TH1) cells, TH2 cells, T follicular helper (TFH) cells and TH17 cells) are shown. The effects of deletion of SUV39H1 and/or SUV39H2, SETDB1 or G9A are listed. Although not shown, histone H3 Lys9 (H3K9) methyltransferases also have a role in the maturation of B cell progenitors (pro-B cells). In this case, the loss of SETDB1 leads to derepression of both specific endogenous retroviruses (ERVs)[69,217] and non-lineage-specific genes, including genes specific to innate immunity[69]. The selection of genes and repeats that are derepressed likely depends on the variable presence of lineage-specific transcription factors, as described in Caenorhabditis elegans[83] (Box 4). CLP, common lymphoid progenitor; DN, double negative (CD4−CD8−); DP, double positive (CD4+CD8+); ETP, early T cell progenitor; LMPP, lymphoid primed multipotent progenitor; Treg cell, regulatory T cell. Knockout of both Suv39h1 and Suv39h2, but not of either of them, resulted in reduction of the HSC pool and a nuanced loss of differentiation. Surviving HSCs were characterized by an increase in the number of nucleoli, disrupted nuclear lamina and the appearance of phosphorylated histone H2AX (γ-H2AX) foci, which mark DNA damage[151]. There was also a loss of 4′,6-diamidino-2-phenylindole (chromatin)-dense regions overlapping with HP1α[151]. Interestingly, these phenotypes are similarly associated with HSCs isolated from older individuals[150], and appear during cellular aging[152]. Indeed, HSCs isolated from humans older than 70 years showed an ∼50% reduction in SUV39H1 expression compared with humans younger than 35 years[153]. A similar reduction in the levels of both SUV39H1 and H3K9me3 was observed in HSCs from old mice[151,153]. Thus, in the haematopoietic lineage, the loss of SUV39H1/SUV39H2 generates aging-related phenotypes. By contrast, the deletion of G9A in HSCs more closely resembled the loss of SETDB1, leading to the transcription of genes from a variety of non-haematopoietic tissues, such as lung, liver, saliva and brain[154]. It is unclear whether the similarity in the sets of genes derepressed upon loss of G9A or SETDB1 represents an interdependency among the enzymes, or the fact that they have overlapping targets (Fig. 3). During both B cell differentiation and T cell differentiation, the ablation of SUV39H1, SETDB1 or G9A individually did not block differentiation per se, but each HMT was required to stabilize well-defined differentiation steps, as illustrated in Fig. 3 for T cells. In wild type mice, naive CD4+ T helper (TH) cells differentiate into two main cell types, TH1 and TH2 cells, and the ratio between the two cell types is an important determinant of the immune response[155]. However, isolated naive CD4+ T cells deficient in Suv39h1 or lacking the H3K9me2 and H3K9me3 reader HP1α were unable to stably differentiate into CD4+ TH2 cells. These unstable Suv39h1-deficient TH2 cells were characterized by derepression of TH1-specific genes, which correlated with reduced H3K9me3 levels[156]. Consequently, TH2 cell to TH1 cell transdifferentiation increased and caused a skewed TH1 cell response in a mouse model of TH2 cell-driven allergic asthma, thereby decreasing the associated lung pathology[156]. Interestingly, Suv39h1-deficient cytotoxic CD8+ T lymphocytes also showed a reduced antigen-specific CD8+ T cell response due to defects in the silencing of T cell ‘stemness’ memory genes[157]. Thus, even in closely related cell lineages, the loss of H3K9me2 and/or H3K9me3 affects different target genes, and the aberrantly expressed cohort of genes is strongly cell type specific. Given the phenotypes of SUV39H1 deficiency in late differentiation, it was somewhat surprising to find that SETDB1 was also essential for the stable, terminal differentiation of CD4+ T cells into TH1/TH2 CD4+ T cells[70]. SETDB1 deletion in naive CD4+ T cells resulted in hyperactivated TH1 cells and an unstable commitment to the TH2 cell lineage (Fig. 3). In contrast to SUV39H1, SETDB1 is not thought to directly control promoter activity of a majority of the misexpressed TH1-specific genes. Instead, gene derepression correlated with the loss of silencing of nearby ERV elements, which then functioned as aberrant enhancers or enhancer control elements for at least some of the induced genes[70]. This mechanism raised two interesting questions. First, why are TH1-specific genes so sensitive to the loss of H3K9me despite the likely redundancy between H3K9 methyltransferases? Second, are ERV-linked control elements part of the normal gene regulation process during wild type haematopoiesis? Deletion of SETDB1 at an earlier stage (that is, in CD4−CD8− thymocytes) resulted in a partial block of the production of both CD4+ T cells and CD8+ T cells. The residual naive CD4+ SETDB1−/− T cells showed misexpression of genes typically expressed during meiosis or in other haematopoietic cell lineages, and this was independent of ERV misexpression[72]. Thus, depending on the cell type, SETDB1 may also mediate gene repression directly, not through transposable-element repression. Mice with a HSC-specific G9a deletion had no discernible defects in the generation of T cells[158,159]. However, similarly to Suv39h1-knockout and Setdb1-knockout mice, these mice failed to develop a protective TH2 cell response following T cell activation[160]. Instead, these activated G9a−/− T cells lost H3K9me2 and gained the expression of genes specific to TH17 cells[158,160], a TH cell lineage with important functions in modulating the immune response[161]. Interestingly, the presence of H3K9me2 and/or H3K9me3 at the interleukin-17A (IL-17A) promoter is triggered by the transcription factor OX40 (ref.[160]), which is a T cell co-stimulatory molecule in the tumour necrosis factor receptor superfamily and which is highly expressed in activated T cells[162]. This example illustrates how a specific transcription factor can regulate the heterochromatin state of cell type-specific genes without affecting H3K9me levels overall. However, given the large diversity of transcription factors that interact with HMTs, the rules that govern the specificity of their interaction with HMTs remain unknown. One of the early phenotypes in haematopoiesis found in Suv39h1-knockout mice was the increased risk of late-onset B cell lymphomas, which resemble non-Hodgkin lymphoma in humans[16]. The exact molecular mechanism driving tumorigenesis in SUV39H1 mutants is not understood, yet one of the central B cell fate-determining steps — V(D)J recombination of antibody light and heavy chains — is controlled by H3K9 methylation. In non-B cells, the complete immunoglobulin heavy chain variable region (Vh) locus is marked by H3K9me[31], and the forced methylation of H3K9 by artificially targeting G9A in cultured B cells is sufficient to prevent V(D)J recombination[163]. Interestingly, the transcription factor PAX5, which is a master regulator of B cell development, is required for the removal of H3K9me2 during B cell differentiation[31]. Thus, not only are H3K9me2 and H3K9me3 domains dynamic during development but chromatin modifiers are able to trigger localized loss or formation of heterochromatin. Whereas individual knockout of Suv39h1, G9a or Setdb1 was not sufficient to compromise H3K9me at the Vh locus, Suv39h1 deficiency in B cells nonetheless reduced isotype-specific IgA switching in mice[69,164,165]. Bulk H3K9me2 levels increase during the course of haematopoiesis[154], even though KDM3B, the dual-purpose arginine demethylase (targeting dimethylated histone H4 Arg3) and lysine demethylase (targeting H3K9me2), is also needed for proper gene activation during the maturation and differentiation of HSCs[166,167]. This finding suggests that H3K9me2 and/or H3K9me3 is important both for keeping genes silent until a cell-fate decision occurs and for de novo repression of genes that were active in earlier stages. Similarly, studies in C. elegans and mice confirmed that H3K9me2 and H3K9me3 are gained and lost at different subsets of tissue-specific genes during development[83,111]. Analysis of C. elegans muscles following loss of MET-2, which deposits H3K9me2, showed that the resulting cell type-specific pattern of gene derepression was driven by a specific set of transcription factors that have developmental stage-specific and lineage-specific expression patterns[83] (Box 4). Importantly, without transcription factor expression, the drop in H3K9 methylation level was not sufficient for gene derepression in either muscle or hypoderm[83] (Fig. 4).
Fig. 4

Transcription factors control tissue-specific transcription derepression upon H3K9me loss.

A major question is whether histone H3 Lys9 (H3K9) methylation contributes to gene repression in differentiated tissues by blocking transcription factor (TF) binding to chromatin[218]. A recent study addressed this question in Caenorhabditis elegans[83] by comparing the effects of loss of the SET domain bifurcated 1 (SETDB1)-like histone methyltransferase MET-2 on gene expression in embryos and in isolated muscle and hypoderm tissues. As depicted schematically, the genes upregulated by MET-2 loss constitute only a fraction (12% in embryos and 13% in muscle) of the genes marked by trimethylated H3K9 (H3K9me3), and the subset of genes affected is both tissue specific and developmental stage specific. In muscle and hypoderm, H3K9 methylation by MET-2 represses tissue-specific genes expressed in other cell lineages, including, but not restricted to, germline-specific genes[83,219,220]. In addition, MET-2 and SET-25 redundantly repress a large number of enhancer elements in muscle. Methylated H3K9 (H3K9me) heterochromatin ensures transcription inactivity by restricting the binding of a defined set of transcription factors at promoters and enhancers, which are either tissue specific — for example, HLH25, HLH-28 and HLH-29 in muscle and EFL-1 in hypoderm — or more widely expressed, for example CEH-48 (ref.[83]). As measured by assay for transposase-accessible chromatin with high-throughput sequencing (ATAC–seq), increased accessibility upon loss of H3K9me is neither sufficient nor necessary to drive transcription, but a subset of β-helix–turn–helix and bZip-containing factors that can recruit chromatin remodelling complexes such as SWI/SNF or the acetyltransferase complex CBP–p300 drive transcription at the genes that become both accessible and upregulated[83]. These results suggest that H3K9 methylation confers tissue-specific gene expression by restricting transcription factor access. The met-2 mutation compromises both muscle ultrastructure and worm mobility.

Transcription factors control tissue-specific transcription derepression upon H3K9me loss.

A major question is whether histone H3 Lys9 (H3K9) methylation contributes to gene repression in differentiated tissues by blocking transcription factor (TF) binding to chromatin[218]. A recent study addressed this question in Caenorhabditis elegans[83] by comparing the effects of loss of the SET domain bifurcated 1 (SETDB1)-like histone methyltransferase MET-2 on gene expression in embryos and in isolated muscle and hypoderm tissues. As depicted schematically, the genes upregulated by MET-2 loss constitute only a fraction (12% in embryos and 13% in muscle) of the genes marked by trimethylated H3K9 (H3K9me3), and the subset of genes affected is both tissue specific and developmental stage specific. In muscle and hypoderm, H3K9 methylation by MET-2 represses tissue-specific genes expressed in other cell lineages, including, but not restricted to, germline-specific genes[83,219,220]. In addition, MET-2 and SET-25 redundantly repress a large number of enhancer elements in muscle. Methylated H3K9 (H3K9me) heterochromatin ensures transcription inactivity by restricting the binding of a defined set of transcription factors at promoters and enhancers, which are either tissue specific — for example, HLH25, HLH-28 and HLH-29 in muscle and EFL-1 in hypoderm — or more widely expressed, for example CEH-48 (ref.[83]). As measured by assay for transposase-accessible chromatin with high-throughput sequencing (ATAC–seq), increased accessibility upon loss of H3K9me is neither sufficient nor necessary to drive transcription, but a subset of β-helix–turn–helix and bZip-containing factors that can recruit chromatin remodelling complexes such as SWI/SNF or the acetyltransferase complex CBP–p300 drive transcription at the genes that become both accessible and upregulated[83]. These results suggest that H3K9 methylation confers tissue-specific gene expression by restricting transcription factor access. The met-2 mutation compromises both muscle ultrastructure and worm mobility. The cell-type specificity of gene derepression upon loss of an H3K9 methyltransferase in mammals is not lymphoid specific. The loss of SETDB1 triggers a hyperactive pro-inflammatory response in macrophages at the promoter of pro-inflammatory cytokines, such as IL-6 (ref.[168]). In these cells, H3K9me3 normally restricts the recruitment of NF-κB, a central regulator of the inflammatory response in the immune system[168], which also plays critical roles in the development, survival and activation of B cells[169]. While it is unclear why the effect of H3K9me3 on the NF-κB pathway is specific to the macrophage lineage, such observations suggest that stress-induced signalling pathways regulate H3K9me in a cell type-specific manner. Histone H3 Lys9 (H3K9) dimethylation and trimethylation anticorrelate with transcription and chromatin accessibility[17,125,238,255,261,262]. However, how H3K9 methylation prevents transcription is not well understood. There are theoretically three ways in which methylated H3K9 (H3K9me) and chromatin compaction could suppress transcription: first, they could limit access of RNA polymerase complexes; second, they could restrict transcript elongation or the stability of mRNAs; or third, they could block the access or function of activating transcription factors. The first two scenarios predict that in the absence of dimethylated H3K9 (H3K9me2) and/or trimethylated H3K9 (H3K9me3), all heterochromatin sequences become broadly transcribed. If true, one would have to attribute the derepression of cell type-specific gene sets in individual H3K9 methyltransferase mutants to target-gene redundancy among them. However, a recent study analysing the transcriptomes of different tissues of Caenorhabditis elegans lacking detectable H3K9me2 and H3K9me3 argues that only a small subset of heterochromatin genes become aberrantly transcribed (upregulated) upon loss of H3K9me, even though a very large number of loci become accessible according to the assay for transposase-accessible chromatin with high-throughput sequencing (ATAC–seq) analysis[83]. The subset of genes upregulated by H3K9me loss is highly cell type specific[83]. A comparison of the derepressed genes showed that a specific set of transcription factors is required for gene derepression in the absence of H3K9me2 and H3K9me3 (ref.[83]) (Fig. 4). In wild-type cells, heterochromatin restricts their binding, thereby ensuring the maintenance of tissue identity, potentially through heterochromatin protein 1 (HP1)-like readers[263]. Similarly, the loss of a large number of heterochromatin-associated proteins in human fibroblasts had only a limited effect on gene and repeat derepression[206]. By contrast, overexpression of transcription factors linked to cell type reprogramming resulted in more widespread and more pronounced transcription activation when heterochromatin was challenged[206]. Interestingly, the most pronounced effect on gene expression was observed not following depletion of individual H3K9 methyltransferases but following depletion of Enhancer of rudimentary homologue (ERH), a factor that appears to regulate global levels of H3K9me3 in the cells studied. The exact mechanism by which ERH enhances H3K9me3 levels is unclear, yet once again it seems to involve a complex redundancy among H3K9 methyltransferases. Taken together, these studies illustrate that H3K9me2 and/or H3K9me3 prevents transcription by restricting transcription factor accessibility. The loss of H3K9me creates the potential for gene activation, which ultimately depends on the expression pattern of specific sets of transcription factors[83,264]. The need for sequence-specific transcription factors has also been observed for the reactivation of retroelements during early human embryogenesis[265] and in human fibroblasts[206]. Although these observations explain why the upregulation of H3K9me-modified genes is tissue specific, it does not explain why this particular set of transcription factors are able to activate genes upon loss of H3K9me, whereas others cannot.

Muscle differentiation and maintenance

The differentiation of muscle cells is a multistep process, driven initially by basic helix–loop–helix transcription factors such as myoblast determination protein 1 (MYOD1) in mammals[170,171]. Differentiation into muscle begins when myoblast precursors exit the cell cycle, rendering them unresponsive to mitogenic signals[172]. During this transition, S-phase genes acquire H3K9me3 and become permanently silenced[172,173]. Evidence that H3K9me might be an important epigenetic regulator of cell cycle exit was provided by the depletion of SUV39H1/H2, which resulted in aberrant transcription of a subset of S-phase-specific genes in differentiating (postmitotic) cells, but not in cycling muscle precursor cells[173]. The derepressed S-phase genes were largely regulated by the RB–E2F pathway, the central regulator of G1/S phase transition and cell cycle exit. Whereas the E2F-type transcription factors function as activators, when E2F is bound by RB, the complex mediates transcriptional repression[174]. Silencing through RB has been attributed to interactions with HDAC1 (refs[175,176]) and with SUV39H1 (refs[177-179]). Interestingly, the deposition of H3K9me3 and silencing of a subset of E2F targets correlates with their spatial clustering with pericentric heterochromatin in muscle cells[180]. Related to these results is a study of a C. elegans model of Emery–Dreyfuss muscular dystrophy, which arises from the introduction of a laminopathic mutant in otherwise wild-type worms (lmn-1Y59C or LMNAY45C in mammals). In this gain-of-function mutant, a large number of loci were aberrantly associated with the nuclear lamina in differentiated muscle in C. elegans, and the relevant promoters were statistically enriched for E2F binding sites. As expected, they became hyper-repressed[181]. Moreover, the association of these loci with the nuclear periphery was sensitive to loss of the H3K9me reader CEC-4 (ref.[181]), confirming that the lmn-1Y59C mutation led to a gain in H3K9me. Surprisingly, the loss of H3K9me2 by ablation of MET-2 in otherwise wild-type C. elegans muscle upregulated a range of genes not necessarily enriched for E2F binding sites[83]. In contrast to the role of SUV39H1, G9A promotes myoblast proliferation and inhibits cell cycle exit during myogenic differentiation[182]. Loss of G9a resulted in the upregulation of genes that were either expressed in non-muscle lineages (for example, genes of immune cells) or regulated cell cycle exit. Interestingly, these genes were not E2F targets, but instead were regulated by MYOD1[182]. Thus, although SUV39H1 and G9A both methylate H3K9, HMT loss can have opposite effects in differentiating cells. This divergence in phenotype contrasts with the highly similar effects that the same HMT mutants have on late T cell differentiation. Obviously, before any clinical application of H3K9 methylation inhibition, the specific pathways and genes targeted by a given HMT in the relevant tissue must be understood. A direct comparison between SUV39H1, G9A and SETDB1 in myogenesis is complicated by the intriguing role of SETDB1 in adult muscle regeneration. In adult mice, SETDB1 (ref.[61]), but not G9A[183], is required for the expansion of activated muscle stem cells (satellite cells), which are needed for muscle regeneration[61]. Again, contrasting with the role of SETDB1 in haematopoiesis, depletion of SETDB1 in adult muscle stem cells resulted in the premature expression of genes that would normally be expressed in terminally differentiated muscle and not the misexpression of non-muscle genes. Many of the upregulated loci were direct targets of SETDB1, as SETDB1 and H3K9me3 were both found at the relevant promoters. An analysis of SETDB1 and SUV39H1 function in wild type muscle development showed that upon muscle stem cell activation, WNT3A signalling triggers the export of SETDB1 from the nucleus by exportin 1 (ref.[61]). The cytosolic accumulation of SETDB1 correlated with loss of chromatin-bound SETDB1 and partial activation of its target genes[61]. This phenotype resembles the loss of the SETDB1 cofactor ATF7IP[57] or the response of MET-2, the SETDB1 homologue in C. elegans, following exposure to heat stress[60,63]. In both cases, the HMT was depleted from the nucleus and there was a global reduction in the level of H3K9me, leading to selective gene upregulation. By contrast, ischaemic or oxidative stress in rat myocytes was shown to trigger a rapid transcriptional upregulation of SUV39H1, and a coordinated downregulation of the acetylated H3K9 deacetylase SIRT6 (ref.[184]). Thus, in addition to the control of differentiation-induced patterns of gene expression by H3K9me, stress can illicit global responses by altering the level of H3K9me. The effects of stress on H3K9me levels may be relevant in rat neonatal ventricular myocytes, where an exposure to oxidative stress resulted in an increased likelihood of myocardial infarction[184]. Suv39h1-knockout mice and Suv39h2-knockout mice were both protected from myocardial infarction and showed a reduced accumulation of reactive oxygen species[184]. Currently it is unclear how stress-induced overexpression of SUV39H1 affects gene transcription and whether this transient state leads to lasting changes in heterochromatin. Nonetheless, it is an intriguing possibility that specific tissues undergo a long-term adaptation to stress through changes in their epigenetic landscape[44,60,63,185,186].

Neurogenesis

Neurogenesis is a complex process that results in an enormous diversity of cell types, ranging from oligodendrocytes to different types of astrocytes, and a vast number of specialized neurons. In addition to the process of differentiation, the nervous system provides interesting models for postmitotic adaptation, otherwise known as neuronal plasticity. Various studies have shown that H3K9me-mediated silencing contributes to neuronal plasticity, and to early stages of neurogenesis. As demonstrated in other tissues, all the major H3K9 methyltransferases are required to prevent non-lineage gene expression during neuronal development. Loss of SETDB1 in the neuronal lineage impairs differentiation of neurons and increases the chance of neuronal precursor cells developing into astrocytes[187]. In C. elegans the SETDB1 homologue MET-2 is required to restrict neuronal subtype-specific gene expression during terminal neuronal differentiation[188]. Setdb1-deficient mouse neurons are characterized by the upregulation of genes specific to glial cells and of germline-specific genes. Similarly to what has been observed in SETDB1-deficient CD4+ T cells, derepression in a minority of cases was mediated by a chimeric transcript originating at a nearby derepressed ERV[187]. Similarly to SETDB1, G9A and its catalytic activity are required for neuronal differentiation in cell culture models[189] and for the integrity of differentiated neurons in the postnatal brain. Postnatal depletion of GLP or G9A in neurons led to the derepression of non-neuron and neuronal progenitor genes, and ultimately resulted in behavioural phenotypes[190]. The most striking evidence for a role for H3K9me2, H3K9me3 and G9A in the maintenance of cell-fate choice, however, comes from the olfactory system. Olfactory sensory neurons are characterized by the monogenic and monoallelic expression of a specific olfactory receptor in each neuron, chosen from more than 1,000 available receptor genes in the genome[191]. The transcriptionally silent receptor genes are marked by H3K9me2 in olfactory epithelial stem cells and acquire H3K9me3 upon differentiation[192]. Interestingly, loss of G9a, which itself performs only H3K9 dimethylation, was sufficient to prevent H3K9 trimethylation at the olfactory receptor gene cluster, thereby interfering with normal monogenic receptor expression[193]. Although the enzyme or enzymes responsible for H3K9me3 at these genes are not known, the stage-specific, sequential methylation suggests an interplay between G9A and either SUV39H1/H2 or SETDB1/2 (ref.[193]). Evidence for an important role of SUV39H2 in neurogenesis, and/or neuron function, comes from a recent description of a rare loss-of-function mutation (A211S) in the pre-SET domain of SUV39H2 in an individual with autism spectrum disorder (ASD)[194]. There is also a general reduction of SUV39H1/H2 levels in neuronal regions of individuals with ASD[194]. Subsequent molecular analysis showed that SUV39H2 is essential for the repression of a cluster of protocadherin-β genes in the developing brain of mice, and its deletion triggered ASD-relevant symptoms[194]. In contrast to other tissues, the expression of the main H3K9 methyltransferases in neurons is subject to extensive regulation. SETDB1 is highly expressed in neuronal precursor cells, but its expression decreases as cells exit the cell cycle and progress through terminal differentiation. SETDB1 overexpression in differentiated neurons has been shown to alter several aspects of neuronal physiology, causing both cognitive impairment[195] and antidepressant-like effects[196]. SETDB1 overexpression can be observed in humans with Huntington disease, and the pharmacological inhibition of SETDB1 ameliorated symptoms in a mouse model of the disease[197]. Non-pathogenic huntingtin protein has been proposed to sequester and thus inactivate the ATF7IP–SETDB1 complex[198]. Accordingly, depletion of huntingtin in human ESCs or expression of mutant huntingtin in induced pluripotent stem cells derived from individuals with Huntington disease led to an increase in H3K9me3 levels, which suggests that a feedback loop from huntingtin to SETDB1 may control genes involved in neuronal differentiation[198]. In the mouse brain, G9A is expressed during the embryonic and synaptogenic periods, but its expression is reduced in later development[199]. Interestingly, during neuronal differentiation, G9A undergoes alternative splicing to include its E10 exon, which leads to increased nuclear localization of G9A and elevated H3K9me2 levels[189], which promotes neuronal polarization[200]. In addition, G9A expression and H3K9me2 levels are increased following nerve injury in the dorsal root ganglion. The increase in promoter H3K9me2 levels correlates with a downregulation of the central K+ channel and hypersensitivity to pain[201,202], which can be suppressed by G9A inhibition[201,202]. A similar increase in G9A expression and H3K9me2 levels has been observed during neonatal ethanol-induced neurodegeneration, with G9A upregulation contributing to neuronal apoptosis[199]. Conversely, the downregulation of G9A and a reduction of H3K9me2 levels has been observed in the brain of mice repeatedly exposed to cocaine[203]. Experimental downregulation of G9A correlates with increased neuronal plasticity and expression of genes that regulate dendritic plasticity[203,204]. Interestingly, haploinsufficiency of GLP, but not of G9a, is linked to Kleefstra syndrome, which is a rare genetic disease characterized by intellectual disability, autistic-like features, childhood hypotonia and facial dysmorphisms[205]. Considering this wide array of HMT mutant effects, it is likely that the modulation of HMT levels will be therapeutically helpful for neuronal diseases.

Conclusions and future directions

Taken together, the studies discussed herein demonstrate that H3K9me2-mediated and H3K9me3-mediated silencing is an important epigenetic regulator of gene expression throughout development, affecting cell type-specific genes. Surprisingly, this silencing function does not seem to be essential for cell or organismal viability, but rather it ensures the robustness of cell-fate decisions and the integrity of differentiated tissues. The limited and highly cell type-specific transcriptional changes observed in H3K9 methyltransferase mutants suggest that the loss of this heterochromatin mark is often not sufficient to induce transcription. Indeed, recent experiments in C. elegans show that most of the repressed genes remain transcriptionally silent in differentiated cells that have lost all H3K9me, and that a specific cell type-dependent set of transcription factors is necessary to drive transcription activation of genes that had been embedded in heterochromatin[83] (Box 4; Fig. 4). Similarly, depletion of a large number of heterochromatin-associated proteins in human cell culture permits the aberrant binding of transcription factors to repressed non-lineage genes, but not an extensive, nonspecific activation of genes and repeats embedded in heterochromatin[206]. Understanding the relevant molecular features of these transcription factors and what additional changes to chromatin allow active transcription of previously H3K9me-repressed genes will be essential if we are to understand the role of H3K9 methylation in development. The partial redundancy observed among the HMTs in mammals makes it necessary to compare single and double or even triple mutants of H3K9 methyltransferases directly to reveal the full picture. This complexity poses major challenges for experimental design, and encourages further study of heterochromatin in model organisms that have fewer H3K9-methylating enzymes. The involvement of H3K9 methyltransferases in various stress responses, their misexpression and aberrant recruitment in a range of human cancers, and the overall loss of H3K9me during aging highlight the medical relevance of deeply understanding the role of heterochromatin in differentiated cells.
  260 in total

1.  Reversal of H3K9me2 by a small-molecule inhibitor for the G9a histone methyltransferase.

Authors:  Stefan Kubicek; Roderick J O'Sullivan; E Michael August; Eugene R Hickey; Qiang Zhang; Miguel L Teodoro; Stephen Rea; Karl Mechtler; Jennifer A Kowalski; Carol Ann Homon; Terence A Kelly; Thomas Jenuwein
Journal:  Mol Cell       Date:  2007-02-09       Impact factor: 17.970

2.  Step-wise methylation of histone H3K9 positions heterochromatin at the nuclear periphery.

Authors:  Benjamin D Towbin; Cristina González-Aguilera; Ragna Sack; Dimos Gaidatzis; Véronique Kalck; Peter Meister; Peter Askjaer; Susan M Gasser
Journal:  Cell       Date:  2012-08-31       Impact factor: 41.582

3.  Histone H3K9 methylation is dispensable for Caenorhabditis elegans development but suppresses RNA:DNA hybrid-associated repeat instability.

Authors:  Peter Zeller; Jan Padeken; Robin van Schendel; Veronique Kalck; Marcel Tijsterman; Susan M Gasser
Journal:  Nat Genet       Date:  2016-09-26       Impact factor: 38.330

4.  Genomic and Proteomic Resolution of Heterochromatin and Its Restriction of Alternate Fate Genes.

Authors:  Justin S Becker; Ryan L McCarthy; Simone Sidoli; Greg Donahue; Kelsey E Kaeding; Zhiying He; Shu Lin; Benjamin A Garcia; Kenneth S Zaret
Journal:  Mol Cell       Date:  2017-12-21       Impact factor: 17.970

Review 5.  Ten principles of heterochromatin formation and function.

Authors:  Robin C Allshire; Hiten D Madhani
Journal:  Nat Rev Mol Cell Biol       Date:  2017-12-13       Impact factor: 94.444

6.  G9a histone methyltransferase plays a dominant role in euchromatic histone H3 lysine 9 methylation and is essential for early embryogenesis.

Authors:  Makoto Tachibana; Kenji Sugimoto; Masami Nozaki; Jun Ueda; Tsutomu Ohta; Misao Ohki; Mikiko Fukuda; Naoki Takeda; Hiroyuki Niida; Hiroyuki Kato; Yoichi Shinkai
Journal:  Genes Dev       Date:  2002-07-15       Impact factor: 11.361

7.  PRDI-BF1 recruits the histone H3 methyltransferase G9a in transcriptional silencing.

Authors:  Ildikó Gyory; Jian Wu; György Fejér; Edward Seto; Kenneth L Wright
Journal:  Nat Immunol       Date:  2004-02-22       Impact factor: 25.606

Review 8.  RNA-mediated epigenetic regulation of gene expression.

Authors:  Daniel Holoch; Danesh Moazed
Journal:  Nat Rev Genet       Date:  2015-01-02       Impact factor: 53.242

9.  Defining heterochromatin in C. elegans through genome-wide analysis of the heterochromatin protein 1 homolog HPL-2.

Authors:  Jacob M Garrigues; Simone Sidoli; Benjamin A Garcia; Susan Strome
Journal:  Genome Res       Date:  2014-12-02       Impact factor: 9.043

10.  Complete loss of H3K9 methylation dissolves mouse heterochromatin organization.

Authors:  Thomas Montavon; Nicholas Shukeir; Galina Erikson; Bettina Engist; Megumi Onishi-Seebacher; Devon Ryan; Yaarub Musa; Gerhard Mittler; Alexandra Graff Meyer; Christel Genoud; Thomas Jenuwein
Journal:  Nat Commun       Date:  2021-07-16       Impact factor: 14.919

View more
  2 in total

Review 1.  Role of Epigenetic Mechanisms in Chronic Pain.

Authors:  Daniela Mauceri
Journal:  Cells       Date:  2022-08-22       Impact factor: 7.666

Review 2.  Molecular Pathways and Genomic Landscape of Glioblastoma Stem Cells: Opportunities for Targeted Therapy.

Authors:  Andrew M Hersh; Hallie Gaitsch; Safwan Alomari; Daniel Lubelski; Betty M Tyler
Journal:  Cancers (Basel)       Date:  2022-07-31       Impact factor: 6.575

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.