Literature DB >> 31187595

TET2 missense variants in human neoplasia. A proposal of structural and functional classification.

Elena Bussaglia1, Rosa Antón2, Josep F Nomdedéu1, Pablo Fuentes-Prior2.   

Abstract

BACKGROUND: The human TET2 gene plays a pivotal role in the epigenetic regulation of normal and malignant hematopoiesis. Somatic TET2 mutations have been repeatedly identified in age-related clonal hematopoiesis and in myeloid neoplasms ranging from acute myeloid leukemia (AML) to myeloproliferative neoplasms. However, there have been no attempts to systematically explore the structural and functional consequences of the hundreds of TET2 missense variants reported to date.
METHODS: We have sequenced the TET2 gene in 189 Spanish AML patients using Sanger sequencing and NGS protocols. Next, we performed a thorough bioinformatics analysis of TET2 protein and of the expected impact of all reported TET2 missense variants on protein structure and function, exploiting available structure-and-function information as well as 3D structure prediction tools.
RESULTS: We have identified 38 TET2 allelic variants in the studied patients, including two frequent SNPs: p.G355D (10 cases) and p.I1762V (28 cases). Four of the detected mutations are reported here for the first time: c.122C>T (p.P41L), c.4535C>G (p.A1512G), c.4760A>G (p.D1587G), and c.5087A>T (p.Y1696F). We predict a complex multidomain architecture for the noncatalytic regions of TET2, and in particular the presence of well-conserved α+β globular domains immediately preceding and following the actual catalytic unit. Further, we provide a rigorous interpretation of over 430 missense SNVs that affect the TET2 catalytic domain, and we hypothesize explanations for ~700 additional variants found within the regulatory regions of the protein. Finally, we propose a systematic classification of all missense mutants and SNPs reported to date into three major categories (severe, moderate, and mild), based on their predicted structural and functional impact.
CONCLUSIONS: The proposed classification of missense TET2 variants would help to assess their clinical impact on human neoplasia and may guide future structure-and-function investigations of TET family members.
© 2019 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

Entities:  

Keywords:  5-methylcytosine; TET2; classification of mutations; epigenetic regulation; neoplasia

Mesh:

Substances:

Year:  2019        PMID: 31187595      PMCID: PMC6625141          DOI: 10.1002/mgg3.772

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Methylation at the C5 position of cytosine bases to generate 5‐methylcytosine (5mC) is one of the major epigenetic modifications of mammalian genomes, with a profound impact on chromatic structure and gene expression (Feng, Jacobsen, & Reik, 2010; Smith & Meissner, 2013). Members of the Ten‐Eleven‐Translocation (TET) subfamily of Fe(II)‐ and 2‐oxoglutarate (2‐OG)‐dependent dioxygenases (EC 1.14.11.n2) help to revert this modification by iteratively oxidizing 5mC bases first to 5‐hydroxymethylcytosine (5hmC) (Ito et al., 2010; Tahiliani et al., 2009), and then to 5‐formylcytosine (5fC) and 5‐carboxy‐cytosine (5caC) (Ito et al., 2011). Since both 5fC and 5caC can be removed through the base excision‐repair pathway (Cortellino et al., 2011; He et al., 2011; Ito et al., 2010,2011; Ko et al., 2010; Rampal et al., 2014), TET enzymes catalyze the first step in active DNA demethylation, and therefore play an important role in transcription regulation. In addition, 5hmC bases function as stable, independent epigenetic marks, which have been shown to accumulate for example at sites of DNA damage and to promote genome stability (Kafer et al., 2016). On the other hand, TET2 has been recently shown to oxidize specific mRNAs to promote infection‐induced myelopoiesis (Shen et al., 2018). TET2 has been shown to possess important functions that are independent of its enzymatic activity. For instance, TET2 involvement in the resolution of inflammation depends upon the ability of the enzyme to recruit the histone deacetylase, HDAC2, to specifically repress IL‐6 transcription (Zhang et al., 2015). Further, both catalytic and noncatalytic activities of TET2 are required for mast cell differentiation and proliferation (Montagner et al., 2016). Finally, it has been recently reported that the enzyme regulates age‐related regenerative decline in the aging hippocampus, with important implications for neurogenic rejuvenation (Gontier et al., 2018). (For authoritative reviews on the function of TET proteins see e.g. refs. [Pastor, Aravind, & Rao, 2013; Rasmussen & Helin, 2016]). TET2 is expressed at particularly high levels in hematopoietic cells, and is critically relevant for normal hematopoiesis. Tet2 deletion in mice models causes an enlargement of the myeloid immature compartment raising the risk of acquiring additional transforming mutations (Li et al., 2011; Moran‐Crusio et al., 2011; Quivoron et al., 2011). Not surprisingly, numerous TET2 mutations have been described in a wide range of human myeloid malignancies ranging from acute myeloid leukemia (AML) to myeloproliferative neoplasms (MPN) (see e.g. [Abdel‐Wahab et al., 2009; Delhommeau et al., 2009; Langemeijer et al., 2009]). TET2 mutations are also common in some subgroups of mature T‐cell lymphomas (Quivoron et al., 2011; Zang et al., 2017). In addition to hematological and lymphoid neoplasms, TET2 variants have been more recently reported in nearly all cancer types, most notably colorectal, lung, and skin carcinomas. (For a complete list of articles describing TET2 mutations see Table S2; see ref. (Ko et al., 2015) for a recent review of TET proteins and their role in hematological cancers). TET2 mutation causes an expansion of immature hematopoietic precursors with a bias toward myeloid differentiation (Arends et al., 2018; Buscarlet et al., 2018). TET2 alterations have also been described in age‐related clonal hematopoiesis (ARCH; [Jaiswal et al., 2014]), as firstly demonstrated in elderly women (Busque et al., 2012). TET2 mutations may also be detected in healthy women years before the development of leukemia (Desai et al., 2018). Importantly, somatic TET2 mutations in normal elderly individuals with CH raise the mortality attributed to vascular events (Jaiswal et al., 2017), most likely by promoting an exacerbated atherosclerosis (Fuster et al., 2017). In this regard, TET2‐deficiency results in an increased pro‐inflammatory phenotype in murine macrophages, which may help to promote atherosclerosis (Abegunde, Buckstein, Wells, & Rauh, 2018; Cull, Snetsinger, Buckstein, Wells, & Rauh, 2017; Cull, Snetsinger, & Rauh, 2016). CH also alters cardiac repair by inflammatory pathways (Sano et al., 2018). Also in line with the inflammatory role of TET2, patients who underwent allogeneic bone marrow transplantation from donors with CH had an increased risk of chronic graft‐versus‐host disease (Frick et al., 2019). These findings are in line with TET2‐mediated repression of inflammatory mediators such as IL‐6, as mentioned above (Zhang et al., 2015). (For a recent review on CH and its relevance for hematopoietic malignancies, see [Bowman, Busque, & Levine, 2018]). The TET2 gene maps to chromosome 4q24 contains 11 exons, and has an open reading frame of 6009 nt. The encoded protein contains 2002 amino acid residues (isoform A, NM 001127208), with the actual catalytic unit located in the C‐terminal part of the protein (Figure 1a). The three‐dimensional (3D) structure of this TET2 region and the mechanism of 5mC oxidation have been recently characterized by X‐ray crystallography (Hu et al., 2013, 2015). The structural work revealed that the catalytic unit is comprised of two tandem cysteine‐rich modules, followed by the actual catalytic domain (Figure 1b,c). This domain, termed double‐stranded β‐helix (DSBH) for its major structural feature, contains a large, poorly conserved insertion between approximate positions Cys1464 and Glu1841. (All numbers correspond to the human enzyme). Thus, the structurally characterized fragment is an artificial construct in which residues Ser1481Asn1843 are replaced by a 15‐residues‐long Gly/Ser linker, and which therefore corresponds to only ≈22% of the full‐length protein. On the other hand, there is no structural information available for the long N‐terminal region of the protein, Met1Pro1131. Interestingly, a second transcript of the TET2 gene encodes for a protein that is essentially comprised of these N‐terminal residues, suggesting that it plays an important functional role(s) (isoform B, NM 017628).
Figure 1

Domain organization and distribution of missense variants in human TET2. (a) The domain architecture of the TET2 protein is schematically represented on top of the figure. Posttranslational modification (PTM) sites were taken from public databases, or predicted, either by similarity with the mouse enzyme (Bauer et al., 2015) or using software given in Methods. Notice that the long polypeptide stretch preceding the actual catalytic unit is predicted to contain several putative, relatively well‐conserved globular modules, separated by more variable PTM‐rich linkers. Single‐nucleotide variations reported in the human TET2 gene are given below the scheme, with the following code: missense mutations are represented by triangles, colored red if identified in hematopoietic and lymphoid neoplasms or black, if reported in solid tumors. Validated SNPs are given as circles, which are colored red and black if they have been in addition reported in cancer patients, or left empty otherwise. (b and c) Two approximately perpendicular views of the three‐dimensional structure of TET2 catalytic unit. Major structural domains are shown as ribbons, colored green (Cys‐N module), light green (Cys‐C), magenta and light pink (N‐ and C‐terminal halves of the DSBH domain, respectively). Zn2+ and Fe2+ ions are represented as gray and orange spheres, respectively; the side chains of their coordinating residues are also shown. The bound DNA oligonucleotide is shown with all its non‐hydrogen atoms, color‐coded (carbon, light pink; nitrogen, blue; oxygen, red; and phosphor, orange). Note the complex interdomain contacts, not only between the N‐ and C‐terminal halves of the catalytic domain, but also between the Zn2+‐binding domains with each other and with the DSHB module. In panel c, note the long insertion between residues Thr1463 and Gly1842 (C‐terminal end of N‐terminal half and N‐terminal start of the C‐terminal half, respectively). DSBH, domain, termed double‐stranded β‐helix

Domain organization and distribution of missense variants in human TET2. (a) The domain architecture of the TET2 protein is schematically represented on top of the figure. Posttranslational modification (PTM) sites were taken from public databases, or predicted, either by similarity with the mouse enzyme (Bauer et al., 2015) or using software given in Methods. Notice that the long polypeptide stretch preceding the actual catalytic unit is predicted to contain several putative, relatively well‐conserved globular modules, separated by more variable PTM‐rich linkers. Single‐nucleotide variations reported in the human TET2 gene are given below the scheme, with the following code: missense mutations are represented by triangles, colored red if identified in hematopoietic and lymphoid neoplasms or black, if reported in solid tumors. Validated SNPs are given as circles, which are colored red and black if they have been in addition reported in cancer patients, or left empty otherwise. (b and c) Two approximately perpendicular views of the three‐dimensional structure of TET2 catalytic unit. Major structural domains are shown as ribbons, colored green (Cys‐N module), light green (Cys‐C), magenta and light pink (N‐ and C‐terminal halves of the DSBH domain, respectively). Zn2+ and Fe2+ ions are represented as gray and orange spheres, respectively; the side chains of their coordinating residues are also shown. The bound DNA oligonucleotide is shown with all its non‐hydrogen atoms, color‐coded (carbon, light pink; nitrogen, blue; oxygen, red; and phosphor, orange). Note the complex interdomain contacts, not only between the N‐ and C‐terminal halves of the catalytic domain, but also between the Zn2+‐binding domains with each other and with the DSHB module. In panel c, note the long insertion between residues Thr1463 and Gly1842 (C‐terminal end of N‐terminal half and N‐terminal start of the C‐terminal half, respectively). DSBH, domain, termed double‐stranded β‐helix As mentioned above, somatic TET2 mutations have been repeatedly found in myeloid neoplasia. The pathogenic attribution of TET2 variants to hematologic neoplasms relies on the earlier description in other cases, functional studies and/or the absence of the newly described variant in the DNA from nonneoplastic samples. However, the identification of TET2 mutations in ARCH and the presence of allele variants of TET2 in nonhematologic neoplasms may complicate interpretation of new variants. In silico prediction of the functional consequences of TET2 variants could be clinically useful in case these criteria are not met. However, this requires a deep knowledge of both the wild‐type protein and representative bona fide neoplastic mutants. Rigorous pathogenicity prediction of SNVs is limited to the structurally characterized TET2 catalytic unit, but has not been systematically attempted to date. Here we report the identification and analysis of TET2 SNVs identified in 189 Spanish AML patients. Spurred by the lack of structural information on the noncatalytic regions of the enzyme, we have combined bioinformatics results and available functional information into a unified model of TET2 overall architecture. We present a rigorous interpretation of over 430 missense variants that affect the TET2 catalytic domain, and provide likely explanations for ~700 additional variants found in the regulatory regions of the protein. Finally, we present a proposal of classification of missense TET2 SNVs that takes into account these structural and functional data.

METHODS

Patients

Samples from patients with AML, diagnosed according to standard criteria (Beer et al., 2010), were analyzed for TET2 mutations using Sanger sequencing and NGS protocols as reported elsewhere (Nomdedéu et al., 2012, 2017).

DNA extraction and sequencing

Genomic DNA was extracted from bone marrow aspirates or from peripheral blood of healthy donors and patients using standard methods, and analyzed for mutations in NPM1, FLT3, CEBPA, and MLL genes using well‐established protocols (Munoz et al., 2003, 2001, 2003; Nomdedéu et al., 2012). For mutational analysis of the TET2 gene, DNA was amplified using primers spanning the entire coding region and PCR conditions reported elsewhere (Delhommeau et al., 2009). PCR products were purified with ExoSAP‐IT (Mouradov et al., 2014), and sequenced using the BigDyeTM Terminator Cycle Sequencing kit. Sequence analysis was performed on an ABI PRISM‐3100 Genetic Analyzer. In 20 cases, a targeted NGS approach that allowed a mean coverage of 88% of the coding TET2 sequence was also used, and variations were confirmed by Sanger sequencing (Nomdedéu et al., 2017).

Bioinformatics analysis

Missense mutations were retrieved from the COSMIC database (http://cancer.sanger.ac.uk/cosmic/gene/analysis?ln=TET2) or extracted from published work. Validated SNPs were taken from the NCBI database (http://www.ncbi.nlm.nih.gov/snp). For the identification of disordered regions, protein sequences were submitted to the DisMeta meta server (http://www-nmr.cabm.rutgers.edu/bioinformatics/disorder/; [Huang, Acton, & Montelione, 2014]) and in addition to servers PrDOS (http://prdos.hgc.jp/; [Ishida & Kinoshita, 2007]) and DICHOT (http://idp1.force.cs.is.nagoya-u.ac.jp/dichot/; [Fukuchi, Hosoda, Homma, Gojobori, & Nishikawa, 2011]). Local sequence motifs were predicted with ELM (http://elm.eu.org/; [Dinkel et al., 2016]) PROSITE (http://prosite.expasy.org/; [Sigrist et al., 2012]) and DLocalMotif (http://bioinf.scmb.uq.edu.au:8080/dlocalmotif/; [Mehdi, Sehgal, Kobe, Bailey, & Bodén, 2013]). Caspase cleavage sites were predicted with CaspDB (http://caspdb.sanfordburnham.org/; [Kumar, van Raam, Salvesen, & Cieplak, 2014]) and calpain cleavage sites with CaMPDB (http://calpain.org/predict.rb?cls=substrate; [duVerle, Ono, Sorimachi, & Mamitsuka, 2011]). Experimentally determined posttranslational modifications were taken from PhosphoSitePlus (http://www.phosphosite.org/; [Hornbeck et al., 2014]) or from cited references (Bauer et al., 2015; Nakagawa et al., 2015). Secondary structure was predicted with JPred4 (http://www.compbio.dundee.ac.uk/jpred/; [Drozdetskiy, Cole, Procter, & Barton, 2015]) and PSIPRED (http://bioinf.cs.ucl.ac.uk/psipred/; [Buchan, Minneci, Nugent, Bryson, & Jones, 2013]) assisted also by the results of YASPIN (http://www.ibi.vu.nl/programs/yaspinwww/; [Lin, Simossis, Taylor, & Heringa, 2005]) and of β‐hairpins predicted with βhairPred (http://www.imtech.res.in/raghava/bhairpred/; [Kaur & Raghava, 2003]). The impact of selected SNVs was assessed through metaservers PredictSNP (http://loschmidt.chemi.muni.cz/predictsnp/; [Bendl et al., 2014]) and Meta‐SNP (http://snps.biofold.org/meta-snp; [Capriotti, Altman, & Bromberg, 2013]). Threading and modeling were performed with RaptorX (http://raptorx.uchicago.edu/; [Källberg et al., 2012]) or Phyre2 (http://www.sbg.bio.ic.ac.uk/phyre2/; [Kelley, Mezulis, Yates, Wass, & Sternberg, 2015]), and models were finally refined with ModRefiner (https://zhanglab.ccmb.med.umich.edu/ModRefiner/; [Xu & Zhang, 2011]). The impact of point variants on protein structure and function was assessed with PROVEAN (http://provean.jcvi.org/, ref. [Choi, Sims, Murphy, Miller, & Chan, 2012]), PolyPhen‐2 (http://genetics.bwh.harvard.edu/pph2/index.shtml, ref. [Adzhubei et al., 2010]), and CUPSAT (http://cupsat.tu-bs.de/; ref. [Parthiban, Gromiha, & Schomburg, 2006]). Structure figures were prepared with PyMol (www.pymol.org).

RESULTS AND DISCUSSION

Novel TET2 mutations in Spanish neoplasia patients

Thirty‐eight TET2 allelic variants were identified in 189 studied AML patients, including two frequent SNPs: p.G355D (10 cases) and p.I1762V (28 cases) (Table S1). Four of the detected mutations are reported here for the first time: c.122C>T (p.P41L), c.4535C>G (p.A1512G), c.4760A>G (p.D1587G), and c.5087A>T (p.Y1696F) (Figure S1). Further, in 100 patients, we excluded the presence of additional variations affecting residues that bind Zn2+ ions in TET2 catalytic unit. Finally, none of the following missense mutations or SNPs was detected in 100 healthy controls: p.Y867H, p.T1270A, p.R1359P, p.A1512G, and p.P1723S.

Predicted domain organization of TET2 protein

A growing number of missense mutations and validated SNPs in TET2 (summarized in Table S2 and Table S3, respectively) affect the large N‐terminal, uncharacterized region of the protein (Met1Pro1131) or the long loop inserted within the catalytic unit (Cys1464Asn1843). Although both stretches are commonly described as intrinsically disordered regions (IDRs), it seems likely that such large, relatively well‐conserved polypeptides are relevant for TET2 structure and function. This prompted us to integrate the results of several bioinformatics tools into a working model of full‐length TET2 architecture (Figure 1a). Most notably, the results of our analyses strongly suggest that TET2 contains several globular domains in addition to the catalytic unit, which might adopt definite 3D structures upon binding for example to cognate modules in specific coregulators. These domains are linked by IDRs that are targets of various posttranslational modifications (PTMs; Figure 1a), and which appear to be functionally relevant. For instance, residue Lys110 has been recently reported to be acetylated by p300/CBP, which in turn results in DNMT1 binding, enhanced protein stability, and higher catalytic activity in vivo (Zhang et al., 2017). (See Figure 2a for a close‐up of p300 catalytic region, highlighting pockets that accommodate the substrate lysine residue, as well as surrounding residues). These findings explain the negative impact of mutant p.Lys110Arg, identified in a patient with myelodysplastic syndrome (Papaemmanuil et al., 2013). Alternatively, mutations that interfere with the deacetylation of AcLys100 by HDAC1/2 might result in a hyperactive TET2 enzyme. (For a model of the Leu107Lys113 TET2 peptide bound to HDAC1, see Figure 2b). Further, a well‐conserved bipartite nuclear localization signal is predicted between residues Gly93 and Asp115, and a 3D model of the putative complex between this TET2 stretch and the nuclear transport receptor, importin‐α, is given in Figure 2c. A sequence alignment around this NLS is given in Figure S2a and details of the putative TET2—importin‐α interaction interface are presented in Figures S2b–e. Alternatively, the Gly93Asp115 peptide might interact with the TPR‐repeat domain of another known TET2 interactor, O‐linked GlcNAc transferase (Chen, Chen, Bian, Fujiki, & Yu, 2013; Deplus et al., 2013; Vella et al., 2013), which shares structural similarities and a common binding mode with importin‐α (Jinek et al., 2004). These interactions might be regulated by phosphorylation of residue Ser99 by AMP‐activated kinase, which has been recently shown to stabilize TET2 protein. This PTM thus links hyperglycemia and diabetes to cancer through modification of the 5hmC landscape (Wu et al., 2018). We also note that the highly variable Asp297Thr395 linker is a likely major target of cysteine proteinases that regulate TET2 activity: caspases (Ko et al., 2013) and calpains (Wang & Zhang, 2014). Removal of the N‐terminal region and concomitant loss of binding sites for the important interactors mentioned above might explain, at least partly, the regulatory role of these proteinases.
Figure 2

Relevance of the non‐catalytic regions of TET2 protein for enzyme function. Focus is on experimentally verified or predicted roles of residues within the Gly93‐Gln116 TET2 stretch. (a) Close‐up of human p300 catalytic region, highlighting pockets that could accommodate the substrate lysine residue (Lys100; ref. [Zhang et al., 2017]) in TET2, as well as surrounding residues. p300 secondary structure elements are colored pink, and the CoA cofactor is shown with all its nonhydrogen atoms to the bottom‐left, color‐coded (carbon, green; oxygen, red; nitrogen, blue; phosphor, orange and sulfur, yellow). Well‐ordered PEG molecules bound close to the active site in the reported crystal structure (PDB code 4PZR, [Maksimoska, Segura‐Peña, Cole, & Marmorstein, 2014]) are also shown color‐coded (carbon, orange; oxygen, red). Molecule PEG1 overlays with the incoming lysine residue to be acetylated, while PEG2 was proposed to mimic a substrate residue with a small side chain, ideally glycine, flanking the substrate lysine. In TET2, however, Lys110 is flanked by residues with bulkier side chains, and the path of the TET2 peptide in the actual p300·TET2 complex might differ from that with canonical substrates, and/or it would require some rearrangements of residues surrounding the p300 active site for binding in a productive conformation. On the other hand, the shallow negatively charged pocket on p300 termed “P2 site” (top right in this panel) might accommodate Lys113; this interaction is critical for effective substrate binding (Liu et al., 2008). (b) 3D model of TET2 peptide Leu107‐Lys113 with acetylated Lys110 bound to HDAC1. The model has been generated according to the crystal structure of human HDAC1 bound to a substrate mimic based on the sequence of the histone H4 tail (5ICN, [Watson et al., 2016]). HDAC1 secondary structure elements are colored purple. Only side chains of HDAC1 residues that coordinate the catalytic Zn2+ ion are shown. TET2 peptide is green, with all side chains shown color‐coded. Note that the side chain of Lys113 is in the position to engage in electrostatic interactions with the bound inositol hexaphosphate (I6P) molecule, a physiologically relevant activator of HDACs (Watson et al., 2016). (c) 3D model of the Gly93‐Gln116 polypeptide of human TET2 bound to the nuclear importer, importin‐α (IMP‐α). The model has been developed based on the structure of human IMP‐α bound to CPB80 (Dias, Wilson, Rojas, Ambrosio, & Cerione, 2009). The IMP‐α helices are colored lime green, and the TET2 fragment in shown with all is non‐hydrogen atoms, color‐coded (carbon, orange; oxygen, red and nitrogen, blue). Note that the TET2 polypeptide runs in an extended, approximately antiparallel direction to the IMP‐α superhelix

Relevance of the non‐catalytic regions of TET2 protein for enzyme function. Focus is on experimentally verified or predicted roles of residues within the Gly93Gln116 TET2 stretch. (a) Close‐up of human p300 catalytic region, highlighting pockets that could accommodate the substrate lysine residue (Lys100; ref. [Zhang et al., 2017]) in TET2, as well as surrounding residues. p300 secondary structure elements are colored pink, and the CoA cofactor is shown with all its nonhydrogen atoms to the bottom‐left, color‐coded (carbon, green; oxygen, red; nitrogen, blue; phosphor, orange and sulfur, yellow). Well‐ordered PEG molecules bound close to the active site in the reported crystal structure (PDB code 4PZR, [Maksimoska, Segura‐Peña, Cole, & Marmorstein, 2014]) are also shown color‐coded (carbon, orange; oxygen, red). Molecule PEG1 overlays with the incoming lysine residue to be acetylated, while PEG2 was proposed to mimic a substrate residue with a small side chain, ideally glycine, flanking the substrate lysine. In TET2, however, Lys110 is flanked by residues with bulkier side chains, and the path of the TET2 peptide in the actual p300·TET2 complex might differ from that with canonical substrates, and/or it would require some rearrangements of residues surrounding the p300 active site for binding in a productive conformation. On the other hand, the shallow negatively charged pocket on p300 termed “P2 site” (top right in this panel) might accommodate Lys113; this interaction is critical for effective substrate binding (Liu et al., 2008). (b) 3D model of TET2 peptide Leu107Lys113 with acetylated Lys110 bound to HDAC1. The model has been generated according to the crystal structure of human HDAC1 bound to a substrate mimic based on the sequence of the histone H4 tail (5ICN, [Watson et al., 2016]). HDAC1 secondary structure elements are colored purple. Only side chains of HDAC1 residues that coordinate the catalytic Zn2+ ion are shown. TET2 peptide is green, with all side chains shown color‐coded. Note that the side chain of Lys113 is in the position to engage in electrostatic interactions with the bound inositol hexaphosphate (I6P) molecule, a physiologically relevant activator of HDACs (Watson et al., 2016). (c) 3D model of the Gly93Gln116 polypeptide of human TET2 bound to the nuclear importer, importin‐α (IMP‐α). The model has been developed based on the structure of human IMP‐α bound to CPB80 (Dias, Wilson, Rojas, Ambrosio, & Cerione, 2009). The IMP‐α helices are colored lime green, and the TET2 fragment in shown with all is non‐hydrogen atoms, color‐coded (carbon, orange; oxygen, red and nitrogen, blue). Note that the TET2 polypeptide runs in an extended, approximately antiparallel direction to the IMP‐α superhelix Furthermore, threading algorithms could identify accurate templates for some of the previously unforeseen globular domains, in particular for the Thr396Gln574 and Gln866Lys1044 stretches. (The low, previously unappreciated sequence similarity between these two stretches is shown in Figure S3, 3D models for these regions are given in Figure S4a,b, and model quality is illustrated in Figure S4c,d, respectively). Perhaps more relevant for TET2 enzymatic activity, the sequences immediately preceding and following the catalytic unit (residues Ala1045Pro1131 and Lys1924Ile2002 in human TET2, respectively) are highly conserved from reptiles to humans and have previously unrecognized counterparts in TET3 (Figure S5a,b), pointing to important conserved functions. The results of secondary structure prediction algorithms suggest that these regions might also fold into small α+β domains. Of note, we have been able to recombinantly express most of these regions in E. coli at high yields. The recombinant proteins are soluble in physiological conditions and could be straightforwardly purified without signs of degradation, indicating the presence of well‐folded globular domains (manuscript in preparation).

Genotype–phenotype correlation: a proposed classification of TET2 missense mutations

In Table S2 and Table S3, we present a brief analysis of all missense mutants and validated SNPs of human TET2 reported to date, respectively. In each case, sequence conservation, impact on experimentally confirmed or predicted PTMs and likely structural implications are considered. Considering the available results of structure and function studies (Hu et al., 2013, 2015) and bioinformatics data summarized in these tables and Figure 1a, we propose a hypothetical but testable classification of the pathogenicity of missense SNVs identified in the human TET2 gene into three major categories (Table 1).
Table 1

A proposed classification of TET2 missense mutations

Mutant classMutation impactExamples
IaImpairment of enzyme activity, but overall protein structure essentially conserved. (Replacement of Fe2+/2‐OG ligands, or major steric clashes with these side chains)p.R1261C/H/L/G, p.S1284F, p.H1881R/Q/N, p.R1896M/S/G, p.S1898F
IbSeverely compromised 3D structure and/or stability of the catalytic unit. (Introduction of polar or charged residues in the hydrophobic core, disruption of major internal H‐bonded networks, replacement of Zn2+ ligands)p.R1176G/T, p.L1229R, p.C1289Y/F, p.L1322Q, p.L1329Q/R, p.R1359C/H/P, p.H1380Y/R/N/P, p.V1864E
IIaCreation of small cavities in protein core, destabilization of major secondary structure elements and/or of partially buried H‐bonded networksp.E1207Q/D, p.L1231P, p.W1233G, p.Y1245S/C, p.E1879Q, p.I1897S, p.M1907K
IIbImpairment of the 3D structure of noncatalytic domainsp.A215V, p.V238F, p.A241V, p.P463L, p.L1065F, p.K1094I, p.T1985I
IIcImpaired recognition and processing of substrate DNAp.W1291R, p.K1299E/N, p.R1302G, p.S1303R/N, p.H1904R/Q, p.K1905E
IIdImpaired docking of other TET2 domains and/or of interacting proteins (e.g., IDAX, importin‐α, OGT)p.R96C, p.E1206K, p.E12434G, p.D1314G, p.E1320A
IIeAltered posttranslational modification of strictly conserved residues with functional implicationsp.K110R, p.K1339R
IIIaConservative mutations of buried residues within the catalytic unit, with only minimal effect on the protein stabilityp.I1175V, p.I1195V, p.V1199I, p.L1229I, p.F1287L, p.L1332F, p.L1360M, p.A1919V
IIIbConservative mutations of exposed, not strictly conserved residues within the catalytic unitp.S1204C, p.K1243R, p.L1312V, p.E1357Q, p.E1405Q
IIIcMutations of nonconserved residues within the noncatalytic regionsp.P23S, p.K53E, p.S334F, p.S585L, p.A727T, p.S1497P
A proposed classification of TET2 missense mutations

Severe mutations

Among them, we distinguish between (a) exchanges that impair binding of the essential cofactors, 2‐OG and/or Fe2+ (Figure 3a) and (b) mutations that would have an important impact on the global 3D structure of the TET2 catalytic unit (e.g., by disrupting major internal H‐bonded networks, Figure 3b, or by introducing polar/charged residues in the densely packed hydrophobic core, Figure 3c). It is conceivable that type Ib mutations would have a more severe impact on TET2 protein function.
Figure 3

Representative examples of mutations affecting the structure and function of human TET2 catalytic unit. (a) Close‐up of TET2 catalytic site. Shown in the picture are only cofactors Fe2+ (orange sphere) and 2‐OG (color‐coded: carbon, pink; nitrogen, red; oxygen, blue), as well as the side chains of residues directly involved in their coordination (color‐coded as the 2‐OG cofactor, but with carbon atoms green). Note that several TET2 missense mutations affect these residues (e.g., p.Arg1261Cys/His/Leu/Gly, p.His1881Arg/Glu/Asn; class Ia mutants). (b and c), representative examples of proposed type Ib variants. (b) Mutations likely to disrupt the overall 3D structure of the catalytic unit. Shown is a close‐up around residue Arg1359. Note the intricate network of strong H‐bonds centered on its side chain, which directly donates H‐bonds to the carbonyl oxygen from Met1907 (C‐terminal half of the DSBH domain), but is also connected through water molecules to the carbonyls of Gln1348 and Arg1366 (N‐terminal half). Note also that the residue immediately preceding Arg1359 is a Zn2+ ligand. Therefore, mutants p.Arg1359Cys/His/Pro would most likely result in the collapse of the whole 3D structure. (c) Representative example of a mutation introducing a polar or charged residue in the densely packed hydrophobic core. Replacement of Val1864 by a glutamate, as in p.Val1864Glu, would result in major clashes of the mutant Glu1864 carboxylate with the side chains of aliphatic residues shown in the picture and/or with main chain carbonyl oxygen atoms of these and/or other residues. (d) Close‐up around residue Ile1897. Only side chains of residues that make at least one vdW contact with one of the atoms of the Ile1897 side chain are shown, color‐coded according to the domain they belong to. Note that replacement of the aliphatic side chain by a shorter, polar serine, although disfavored, would not be expected to cause major structural rearrangements, as the mutant Ser1897 side chain would not clash with any of the surrounding core residues (class IIa mutation). Note in addition the somehow polar environment created by the nearby Thr1249 side chain. (e) Close‐up of the substrate DNA‐binding site; the side chains of some of the residues mutated in cancer patients are shown with all of their nonhydrogen atoms. Replacement of single residues, such as in mutants p.Arg1302Gly or p.Lys1905Glu, is likely to affect the rate of DNA oxidation, but would not be expected to completely abolish processing of the 5mC residue (class IIc variants). (f) Close‐up showing exposed, well‐conserved residues that are mutated in some cancer patients (e.g., p.Leu1312Val, p.Asp1314Gly, p.Glu1320Ala). Their relative proximity to the bound DNA oligonucleotide suggests that they might form a binding site for a TET2 cofactor such as IDAX/CXXC4, which recruits TET2 to DNA (Ko et al., 2013) (type IId mutants). (g) Close‐up around Ser1189‐Arg1201 TET2 stretch. Only a few side chains of interacting residues are shown for simplicity. Some TET2 missense variants that affect residues within this sequence are likely to be fully tolerated without any important rearrangements of TET2 protein structure (e.g., p.Ile1195Val, p.Val1199Ile; class IIIa). (h) Close‐up showing not conserved, exposed TET2 residues. Missense variants of these residues that introduce physicochemically related residues, commonly found in TET2 from other species (e.g., p.Ser1204Cys, p.Lys1243Arg, p.Glu1405Gln) are unlikely to have any impact on TET2 function (class III variants). DSBH, domain, termed double‐stranded β‐helix

Representative examples of mutations affecting the structure and function of human TET2 catalytic unit. (a) Close‐up of TET2 catalytic site. Shown in the picture are only cofactors Fe2+ (orange sphere) and 2‐OG (color‐coded: carbon, pink; nitrogen, red; oxygen, blue), as well as the side chains of residues directly involved in their coordination (color‐coded as the 2‐OG cofactor, but with carbon atoms green). Note that several TET2 missense mutations affect these residues (e.g., p.Arg1261Cys/His/Leu/Gly, p.His1881Arg/Glu/Asn; class Ia mutants). (b and c), representative examples of proposed type Ib variants. (b) Mutations likely to disrupt the overall 3D structure of the catalytic unit. Shown is a close‐up around residue Arg1359. Note the intricate network of strong H‐bonds centered on its side chain, which directly donates H‐bonds to the carbonyl oxygen from Met1907 (C‐terminal half of the DSBH domain), but is also connected through water molecules to the carbonyls of Gln1348 and Arg1366 (N‐terminal half). Note also that the residue immediately preceding Arg1359 is a Zn2+ ligand. Therefore, mutants p.Arg1359Cys/His/Pro would most likely result in the collapse of the whole 3D structure. (c) Representative example of a mutation introducing a polar or charged residue in the densely packed hydrophobic core. Replacement of Val1864 by a glutamate, as in p.Val1864Glu, would result in major clashes of the mutant Glu1864 carboxylate with the side chains of aliphatic residues shown in the picture and/or with main chain carbonyl oxygen atoms of these and/or other residues. (d) Close‐up around residue Ile1897. Only side chains of residues that make at least one vdW contact with one of the atoms of the Ile1897 side chain are shown, color‐coded according to the domain they belong to. Note that replacement of the aliphatic side chain by a shorter, polar serine, although disfavored, would not be expected to cause major structural rearrangements, as the mutant Ser1897 side chain would not clash with any of the surrounding core residues (class IIa mutation). Note in addition the somehow polar environment created by the nearby Thr1249 side chain. (e) Close‐up of the substrate DNA‐binding site; the side chains of some of the residues mutated in cancer patients are shown with all of their nonhydrogen atoms. Replacement of single residues, such as in mutants p.Arg1302Gly or p.Lys1905Glu, is likely to affect the rate of DNA oxidation, but would not be expected to completely abolish processing of the 5mC residue (class IIc variants). (f) Close‐up showing exposed, well‐conserved residues that are mutated in some cancer patients (e.g., p.Leu1312Val, p.Asp1314Gly, p.Glu1320Ala). Their relative proximity to the bound DNA oligonucleotide suggests that they might form a binding site for a TET2 cofactor such as IDAX/CXXC4, which recruits TET2 to DNA (Ko et al., 2013) (type IId mutants). (g) Close‐up around Ser1189Arg1201 TET2 stretch. Only a few side chains of interacting residues are shown for simplicity. Some TET2 missense variants that affect residues within this sequence are likely to be fully tolerated without any important rearrangements of TET2 protein structure (e.g., p.Ile1195Val, p.Val1199Ile; class IIIa). (h) Close‐up showing not conserved, exposed TET2 residues. Missense variants of these residues that introduce physicochemically related residues, commonly found in TET2 from other species (e.g., p.Ser1204Cys, p.Lys1243Arg, p.Glu1405Gln) are unlikely to have any impact on TET2 function (class III variants). DSBH, domain, termed double‐stranded β‐helix

Intermediate mutations

This category comprises (a) replacements of core residues of the catalytic unit with a limited impact on its structure and/or stability (Figure 3d), as well as (b) mutations that would disrupt the 3D structure of regulatory domains. Also included in this class are nonconservative exchanges of strictly conserved exposed residues that might weaken TET2 interactions with either (c) substrate DNA (Figure 3e), (d) with other protein domains or with other proteins (Figure 3f), or that (e) would interfere with conserved PTMs.

Mutations with mild to negligible effects

This category includes (a) conservative exchanges of buried residues within the catalytic unit, which would thus be fully tolerated (Figure 3g), (b) mutations that either directly or indirectly affect nonconserved PTMs (Figure 3h), and (c) other replacements of nonconserved residues within the noncatalytic modules of the enzyme. We compared our structure‐ and function based classification with tools routinely used to predict the impact of missense variants, PROVEAN and PolyPhen, and also with an algorithm that predicts changes in structural stability upon point mutations, CUPSAT. The results of these analyses for all SNVs that map to the TET2 catalytic unit are presented in the Table S4. As can be seen, type I mutations, for instance, are usually found among the most deleterious by PROVEAN (values much lower than the threshold between “neutral” and “deleterious” replacements, −2.50). Type Ia/Ib variants are also commonly predicted as “probably damaging” by PolyPhen. There are, however, a large number of variants that are also predicted as deleterious/probably damaging, for which careful inspection of the deposited 3D structures suggests at most a limited impact on the structure and/or function of the enzyme. On the other hand, some replacements of residues that are part of internal, complex H‐bonded networks (e.g., p.Arg1161Ser, p.Arg1176Gly/Thr) or that coordinate Zn2+ ions (e.g., Cys1221Arg, Cys1289Ser/Phe) are predicted as “stabilizing” by CUPSAT, even though loss of these interactions in the mutant proteins are quite likely to compromise overall folding and stability. In conclusion, neither simple analysis of residue conservation nor “blind” consideration of structural features allow for a detailed classification of TET2 variants, and we propose that our scheme should be used instead for predicting the likely impact of point mutations in the TET2 gene. Of note, the mutation pattern along the TET2 sequence is strikingly different between solid and hematologic neoplasms (Figure 1a). However, it remains to be investigated whether TET2 variants that belong to different categories are unevenly distributed in myeloid neoplasia, solid tumors and tumor‐infiltrating lymphocytes, and whether they would have different effects on atherogenesis (Bowman et al., 2018).

Clinical significance of TET2 mutations

Previous attempts to use the TET2 gene as a predictor of clinical variables including disease‐free survival (DFS) and overall survival (OS) have led to contradictory results. For instance, the presence of TET2 mutated was reported as a neutral (Bejar et al., 2011) or even favorable prognostic biomarker in MDS (Kosmider, Gelsi‐Boyer, Cheok, et al., 2009), and in chronic myelomonocytic leukemia (CMML) (Grossmann et al., 2011; Kohlmann et al., 2010). Further, TET2 mutation status did not influence OS or DFS in a cohort of cytogenetically normal AML patients younger than 60 years (Damm et al., 2014), and TET2 gene status was not significantly correlated with DFS or OS in de novo AML (Nibourel et al., 2010). In addition, in a large cohort of younger adult patients with AML, TET2 variants did not impact the response to induction therapy and clinical outcome (Gaidzik et al., 2012). Finally, in a more recent study of an even larger cohort of 1,540 AML patients, TET2 mutations were much more frequently associated to “favorable risk” or “intermediate risk” than to “adverse risk” categories according to the European LeukemiaNet (ELN) classification. Of note, TET2 mutations were quite rarely found in isolation in AML patients, indicating that they are not sufficient for overt leukemia (Papaemmanuil et al., 2016). Also along these lines, the presence of mutant TET2 did not affect survival or leukemic transformation in patients with polycythemia vera or primary myelofibrosis (Tefferi, 2010; Tefferi et al., 2009), and TET2‐mutated MPN patients were not cytogenetically different from their TET2‐wild‐type counterparts (Hussein et al., 2010). Finally, the results of a recent meta‐analysis suggest that TET2 mutations may not impact prognosis on OS of patients with MDS (Guo, Zhang, Zou, Fan, & Lyu, 2017). In striking contrast, TET2 mutations have been associated with decreased OS in de novo AML patients with intermediate‐risk cytogenetics compared to TET2‐wild‐type patients (Abdel‐Wahab et al., 2009), and other authors have corroborated this association between TET2 mutations and poor prognosis in AML (Aslanyan et al., 2014; Metzeler et al., 2011; Ohgami et al., 2015). Further, TET2 gene mutation has been reported to negatively impact patient survival in CMML (Kosmider, Gelsi‐Boyer, Ciudad, et al., 2009) as well as in the most common peripheral T‐cell lymphomas (Lemonnier et al., 2012). Furthermore, the nonepigenetic effects of TET2 variants may also be of some relevance in hematologic neoplasms, as it has been recently shown for other TET family members (Khoueiry et al., 2017). Notwithstanding other differences in the molecular pathologies of various hematological neoplasms, we consider that the apparently contradictory results listed above may be due, in part, to studying patient cohorts in which TET2 mutations have largely different structural/functional consequences. Therefore, establishment of a scoring system and patient stratification according to our proposed classification might be helpful in future studies aimed at establishing the clinical relevance of TET2 variants. In particular, we might expect a significant correlation between the presence of severe TET2 mutations (classes Ia/Ib) and a poor disease outcome. Furthermore, these patients could benefit the most from treatment with hypomethylating drugs, as suggested in recent studies (Bejar et al., 2014). In this regard, a much better response to the DNA hypomethylating agent, azacitidine has been reported in AML and MDS patients carrying TET2 mutations than in wild‐type patients (82% vs. 45% response rate, including hematological improvement; [Itzykson et al., 2011]). The understanding of the structural and functional consequences of TET2 mutations may also be useful for the optimal design of advanced immunotherapeutic approaches. It has been recently reported the cure of a patient with chronic lymphocytic leukemia carrying the class IIa p.Glu1879Gln variant in one TET2 allele, after disruption of the second TET2 allele in its T cells through lentiviral integration (Fraietta et al., 2018). Altogether, consideration of the structure and function impact of TET2 SNVs might have important implications for predicting the prognosis of leukemia patients and their stratification in future clinical trials.

CONFLICT OF INTEREST

None of the authors declare any conflict of interest.

AUTHORS CONTRIBUTIONS

E.B. performed experiments and analyzed results. R.A. and P.F.‐P. performed bioinformatics analyses. J.N. and P.F.‐P. planned experiments, discussed the results, and wrote the paper. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  102 in total

1.  Prediction of beta-turns in proteins from multiple alignment using neural network.

Authors:  Harpreet Kaur; Gajendra Pal Singh Raghava
Journal:  Protein Sci       Date:  2003-03       Impact factor: 6.725

2.  Interleukin-3 receptor alpha chain (CD123) is widely expressed in hematologic malignancies.

Authors:  L Muñoz; J F Nomdedéu; O López; M J Carnicer; M Bellido; A Aventín; S Brunet; J Sierra
Journal:  Haematologica       Date:  2001-12       Impact factor: 9.941

3.  The structural basis of protein acetylation by the p300/CBP transcriptional coactivator.

Authors:  Xin Liu; Ling Wang; Kehao Zhao; Paul R Thompson; Yousang Hwang; Ronen Marmorstein; Philip A Cole
Journal:  Nature       Date:  2008-02-14       Impact factor: 49.962

4.  Frequent TET2 mutations in systemic mastocytosis: clinical, KITD816V and FIP1L1-PDGFRA correlates.

Authors:  A Tefferi; R L Levine; K-H Lim; O Abdel-Wahab; T L Lasho; J Patel; C M Finke; A Mullally; C-Y Li; A Pardanani; D G Gilliland
Journal:  Leukemia       Date:  2009-03-05       Impact factor: 11.528

5.  The superhelical TPR-repeat domain of O-linked GlcNAc transferase exhibits structural similarities to importin alpha.

Authors:  Martin Jínek; Jan Rehwinkel; Brooke D Lazarus; Elisa Izaurralde; John A Hanover; Elena Conti
Journal:  Nat Struct Mol Biol       Date:  2004-09-12       Impact factor: 15.369

6.  Acute myeloid leukemia with MLL rearrangements: clinicobiological features, prognostic impact and value of flow cytometry in the detection of residual leukemic cells.

Authors:  L Muñoz; J F Nomdedéu; N Villamor; R Guardia; D Colomer; J M Ribera; J P Torres; J J Berlanga; C Fernández; A Llorente; M P Queipo de Llano; J M Sánchez; S Brunet; J Sierra
Journal:  Leukemia       Date:  2003-01       Impact factor: 11.528

7.  Immunophenotypic findings in acute myeloid leukemia with FLT3 internal tandem duplication.

Authors:  Luz Muñoz; Ana Aventín; Neus Villamor; Jordi Juncà; Gemma Acebedo; Alicia Domingo; María Rozman; J Pio Torres; Mar Tormo; Josep F Nomdedéu
Journal:  Haematologica       Date:  2003-06       Impact factor: 9.941

8.  A simple and fast secondary structure prediction method using hidden neural networks.

Authors:  Kuang Lin; Victor A Simossis; Willam R Taylor; Jaap Heringa
Journal:  Bioinformatics       Date:  2004-09-17       Impact factor: 6.937

9.  CUPSAT: prediction of protein stability upon point mutations.

Authors:  Vijaya Parthiban; M Michael Gromiha; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

10.  PrDOS: prediction of disordered protein regions from amino acid sequence.

Authors:  Takashi Ishida; Kengo Kinoshita
Journal:  Nucleic Acids Res       Date:  2007-06-12       Impact factor: 16.971

View more
  1 in total

1.  TERT and TET2 Genetic Variants Affect Leukocyte Telomere Length and Clinical Outcome in Coronary Artery Disease Patients-A Possible Link to Clonal Hematopoiesis.

Authors:  Trine B Opstad; Svein Solheim; Alf-Åge R Pettersen; Are A Kalstad; Harald Arnesen; Ingebjørg Seljeflot
Journal:  Biomedicines       Date:  2022-08-19
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.