Literature DB >> 30052061

Gene body profiles of 5-hydroxymethylcytosine: potential origin, function and use as a cancer biomarker.

Gerd P Pfeifer1, Piroska E Szabó1.   

Abstract

Entities:  

Keywords:  5-hydroxymethylcytosine; DNA methylation; TET enzymes; cancer biomarkers; gene bodies; transcription

Mesh:

Substances:

Year:  2018        PMID: 30052061      PMCID: PMC6190241          DOI: 10.2217/epi-2018-0066

Source DB:  PubMed          Journal:  Epigenomics        ISSN: 1750-192X            Impact factor:   4.778


× No keyword cloud information.
The DNA base 5-hydroxymethylcytosine (5hmC) strongly accumulates along transcribed sequences (gene bodies) of tissue-specific genes. We discuss the potential origin and function of 5hmC in gene bodies and describe how genomic 5hmC patterns may be useful in cancer diagnosis. Since 5-hydroxymethylcytosine (5hmC) was first described as a normally occurring DNA base in mammals in 2009 [1,2], many studies have investigated the mechanisms of its formation, established genomic profiles of 5hmC distribution in different tissues or cell lines and generated hypotheses to address its biological function. This DNA base is produced by enzymatic oxidation of 5-methylcytosine (5mC) at CpG dinucleotide sequences by a small family of three mammalian 5mC oxidases, the TET proteins, which use α-ketoglutarate, oxygen and Fe++ as cofactors [2,3]. The TET proteins have the ability to oxidize the initially formed 5hmC base further and sequentially produce 5-formylcytosine and 5-carboxylcytosine. The latter two bases can be removed from DNA by base excision repair initiated by thymine DNA glycosylase  [4]. The completed pathway works as a mechanism to achieve active, replication-independent DNA demethylation leading to loss of methylation at CpG sites. However, given the considerable abundance of 5hmC in specific cell types (for example, it can reach a level of about 1% of all DNA cytosine bases in human brain-derived neurons), there has been the assumption that 5hmC has its own biological meaning and may be recognized by specific reader proteins [5]. Yet, these reader proteins have remained elusive with no clearly demonstrated roles in vivo and the function of 5hmC has remained enigmatic altogether.

5hmC genomic distribution

Methods to map the distribution of 5hmC along the genome include antibody-based immunoprecipitation methods (hMeDIP) followed by array or sequence analysis, pull-down approaches after specific derivatization of the hydroxymethyl groups by transfer of modified glucose residues and a few single base resolution approaches [6]. These profiling studies have shown that 5hmC is generally depleted in gene-poor areas of mammalian genomes but is enriched in transcribed sequences (gene bodies) and is also found close to intergenic and intragenic enhancer elements and at the edges of CpG-rich promoter regions [7]. ChIP-sequencing studies have shown that the 5hmC mark co-occurs with RNA polymerase II and the histone modifications H3K79me3 and H3K36me3, all markers of active transcription units, but is strongly depleted from regions associated with inactive chromatin such as those marked by the polycomb complex that produces H3K27me3. Of note, 5mC also rarely co-occurs with polycomb marks. As may be concluded from this data, 5hmC is clearly a gene-centered mark that correlates with active genes. At enhancer elements, 5hmC is most likely deposited by the TET2 5mC oxidase as shown by Tet2-specific or Tet1/2/3 triple knockout experiments [8,9]. Both TET1 and TET3 contain a CXXC domain, a zinc finger module capable of binding to unmethylated CpG-rich DNA sequences. Therefore, it is thought that at least those TET1 and TET3 isoforms that contain this domain are targeted to CpG islands, where they may function primarily by oxidizing 5mC bases that have been created there by erroneous de novo DNA methylation events [3,10]. The envisioned task of these CpG island-targeted TET proteins is to keep CpG islands free of DNA methylation.

Tissue-specificity of 5hmC gene body profiles

Within genes, 5hmC is depleted near the transcription start site and near the end of the transcription units but reaches much higher levels along the transcribed sequences (Figure 1). Many studies have shown that 5hmC in gene bodies positively correlates with gene expression levels [7]. Global and gene-specific levels of 5hmC are indicators of cellular state and tissue type [11]. Genes that are expressed at substantial levels in specific tissues or cell types will often carry higher levels of 5hmC when comparing with other tissues in which the same genes are not expressed but these differences also depend on the global levels of 5hmC in different cells. The observation that tissue-specifically expressed genes are marked by 5hmC has been made for example in several neuronal cell types, cardiomyocytes, various epithelial cells and immune cells. On the other hand, housekeeping genes, even though they may be expressed at high levels, contain 5hmC only at low frequencies. These genes include mitochondrial and ribosomal proteins, which despite being highly transcribed carry very low levels of 5hmC in their gene bodies [12]. It is unclear whether levels of 5hmC change within genes that undergo short-term induction, for example during the DNA damage response.

Schematic representation of a gene body profile of 5-hydroxymethylcytosine (5hmC).

The levels of 5hmC are very low near transcription start sites, increase throughout the gene bodies and then decline near the end of the transcription units (TES). This hypothetical gene is expressed in tissue A (red line) but not in tissue B (green line) with both tissues having similar global levels of 5hmC. A housekeeping gene expressed in all tissues is shown for comparison (blue line).

Schematic representation of a gene body profile of 5-hydroxymethylcytosine (5hmC).

The levels of 5hmC are very low near transcription start sites, increase throughout the gene bodies and then decline near the end of the transcription units (TES). This hypothetical gene is expressed in tissue A (red line) but not in tissue B (green line) with both tissues having similar global levels of 5hmC. A housekeeping gene expressed in all tissues is shown for comparison (blue line).

How are tissue-specific genes marked by 5hmC & what is the role of this DNA base within these genes?

TET proteins perform key roles in cell lineage differentiation processes [13]. Since 5hmC deposition seems to accompany upregulation of cell type-specific genes during the differentiation of cells from stem or progenitor cells, the question arises how this process occurs mechanistically. The simple explanation that TET proteins are constant companions of RNA polymerase II elongation complexes seems unlikely since housekeeping genes are poorly occupied by 5hmC. Since the genes most highly decorated by 5hmC, are tissue-specific genes, its occurrence in these genes must somehow be linked to the tissue specificity of their expression. Tissue specificity is usually determined by transcription factors that localize to promoter and/or enhancer regions of the genes that characterize a cell type. Therefore it is conceivable that TET proteins are recruited by tissue-specific transcription factors and, through unknown mechanisms, are then ‘handed over’ to the transcription elongation complexes to oxidize 5mC along gene bodies. In this context, it is of interest that TET2 has been identified as a binding partner of AF9, a component of transcriptional elongation complexes [14]. There are so far only relatively few examples where TET recruitment has been demonstrated in the form of direct physical interaction between TET proteins and a transcription factor. They include pluripotency factors, lineage-specific pioneer factors and other important regulators. Such interactions have been shown, for example for FOXA1 and TET1 [15], NANOG and TET1 [16], FOXO3A and TET2 [17], WT1 and TET2 and TET3 [18] and the master hematopoietic transcription factor RUNX1 and TET2 and TET3 [19]. Pioneer factors operate to open inaccessible chromatin during lineage commitment and cell differentiation processes. However, pioneer factors are highly specialized and there are many other cell type-specific transcription factors that determine the full repertoire of tissue-specific gene expression often following the initial activity of pioneer factors. Given the fact that mammalian organisms have in the order of 200 different cell types and hence at least that many transcription factor combinations that determine cellular identity, it is difficult to imagine how only three TET proteins would be able to interact with so many transcription factors in a specific manner. The question is if differences may exist in chromatin structure, promoter composition and/or cis-regulatory elements of tissue-specific genes compared with housekeeping genes that may specify TET occupancy, perhaps in a combinatorial manner. Curiously, Drosophila and other organisms that have very low levels of or completely lack 5mC and 5hmC also have tissue-specific genes that function in the absence of 5hmC-dependent (and 5mC-dependent) gene control mechanisms. This raises the question as to whether 5hmC is needed for high fidelity gene control only in long-living creatures. One other unresolved issue is if and how 5hmC in gene bodies increases transcript levels. Genetic ablation of the DNA methyltransferase Dnmt3b, which causes depletion of 5mC and consequentially 5hmC in gene bodies, leads to increased transcriptional noise due to intragenic aberrant transcription initiation [20]. Simultaneous inactivation of the Tet1 and Tet3 genes also resulted in loss of transcriptional fidelity in early embryos [21]. It is possible that 5hmC is even more potent than 5mC in suppressing inappropriate transcription initiation within gene bodies. However, formal proof for this concept as well as an understanding of the mechanisms of this regulation are so far lacking.

Low levels of 5hmC in cancer

The 5hmC base is strongly depleted in almost all types of human cancer in comparison with the normal tissues in which the tumors emerge [22]. It is still unclear how this depletion arises and what the consequences are with regards to tumorigenesis. However, it is well known that malignant tumors lose many features of normal tissue/cell architecture and become de-differentiated. When viewing 5hmC as a marker of cell- or tissue-specific genes, its loss in cancer cells would appear as a logical event. However, it remains unknown if loss of 5hmC in gene bodies is a cause or a consequence of malignant transformation and whether other factors contribute to its loss [23].

5hmC profiles as a cancer marker in liquid biopsies

The global loss of 5hmC in tumors may be accompanied by gain of this mark in certain cancer-specific genes. Therefore, characterization of 5hmC profiles has emerged as an attractive practical approach for tumor classification and tumor detection. Interestingly, these altered patterns of 5hmC are not only detectable in tumor-normal tissue pairs obtained from tumor resections or biopsies, but can be identified in circulating free DNA from cancer patients. For example, 5hmC-based liquid biomarkers distinguishing healthy controls from individuals with cancer were highly predictive for colorectal and gastric cancers [24] and for lung cancer, hepatocellular carcinoma and pancreatic cancer [25]. Although tumor-specifically methylated DNA has previously been used as a tumor marker for blood-based biopsies, most sodium bisulfite-based DNA methylation detection methods further degrade the already highly fragmented DNA in blood/serum samples. However, the selective labeling approach used for 5hmC profiling does not include bisulfite treatment and may therefore have advantages over previously used epigenetic assays to detect cancer in serum samples. These promising approaches will need to be further optimized and applied to larger clinical studies.
  24 in total

1.  Tet-mediated formation of 5-carboxylcytosine and its excision by TDG in mammalian DNA.

Authors:  Yu-Fei He; Bin-Zhong Li; Zheng Li; Peng Liu; Yang Wang; Qingyu Tang; Jianping Ding; Yingying Jia; Zhangcheng Chen; Lin Li; Yan Sun; Xiuxue Li; Qing Dai; Chun-Xiao Song; Kangling Zhang; Chuan He; Guo-Liang Xu
Journal:  Science       Date:  2011-08-04       Impact factor: 47.728

2.  RUNX1 regulates site specificity of DNA demethylation by recruitment of DNA demethylation machineries in hematopoietic cells.

Authors:  Takahiro Suzuki; Yuri Shimizu; Erina Furuhata; Shiori Maeda; Mami Kishima; Hajime Nishimura; Saaya Enomoto; Yoshihide Hayashizaki; Harukazu Suzuki
Journal:  Blood Adv       Date:  2017-09-06

Review 3.  TET-mediated active DNA demethylation: mechanism, function and beyond.

Authors:  Xiaoji Wu; Yi Zhang
Journal:  Nat Rev Genet       Date:  2017-05-30       Impact factor: 53.242

4.  Simultaneous deletion of the methylcytosine oxidases Tet1 and Tet3 increases transcriptome variability in early embryogenesis.

Authors:  Jinsuk Kang; Matthias Lienhard; William A Pastor; Ashu Chawla; Mark Novotny; Ageliki Tsagaratou; Roger S Lasken; Elizabeth C Thompson; M Azim Surani; Sergei B Koralov; Sundeep Kalantry; Lukas Chavez; Anjana Rao
Journal:  Proc Natl Acad Sci U S A       Date:  2015-07-21       Impact factor: 11.205

Review 5.  Genomic distribution and possible functions of DNA hydroxymethylation in the brain.

Authors:  Lu Wen; Fuchou Tang
Journal:  Genomics       Date:  2014-09-07       Impact factor: 5.736

6.  Correlated 5-Hydroxymethylcytosine (5hmC) and Gene Expression Profiles Underpin Gene and Organ-Specific Epigenetic Regulation in Adult Mouse Brain and Liver.

Authors:  I-Hsuan Lin; Yi-Fan Chen; Ming-Ta Hsu
Journal:  PLoS One       Date:  2017-01-26       Impact factor: 3.240

7.  Ten-eleven translocation 2 interacts with forkhead box O3 and regulates adult neurogenesis.

Authors:  Xuekun Li; Bing Yao; Li Chen; Yunhee Kang; Yujing Li; Ying Cheng; Liping Li; Li Lin; Zhiqin Wang; Mengli Wang; Feng Pan; Qing Dai; Wei Zhang; Hao Wu; Qiang Shu; Zhaohui Qin; Chuan He; Mingjiang Xu; Peng Jin
Journal:  Nat Commun       Date:  2017-06-29       Impact factor: 14.919

Review 8.  5-hydroxymethylcytosine profiling as an indicator of cellular state.

Authors:  Alexander Laird; John P Thomson; David J Harrison; Richard R Meehan
Journal:  Epigenomics       Date:  2013-12       Impact factor: 4.778

9.  Role of Tet proteins in enhancer activity and telomere elongation.

Authors:  Falong Lu; Yuting Liu; Lan Jiang; Shinpei Yamaguchi; Yi Zhang
Journal:  Genes Dev       Date:  2014-09-15       Impact factor: 11.361

10.  AF9 promotes hESC neural differentiation through recruiting TET2 to neurodevelopmental gene loci for methylcytosine hydroxylation.

Authors:  Yunbo Qiao; Xiongjun Wang; Ran Wang; Yuanyuan Li; Fang Yu; Xianfa Yang; Lu Song; Guoliang Xu; Y Eugene Chin; Naihe Jing
Journal:  Cell Discov       Date:  2015-07-28       Impact factor: 10.849

View more
  3 in total

Review 1.  Integration of Epigenetic Mechanisms into Non-Genotoxic Carcinogenicity Hazard Assessment: Focus on DNA Methylation and Histone Modifications.

Authors:  Daniel Desaulniers; Paule Vasseur; Abigail Jacobs; M Cecilia Aguila; Norman Ertych; Miriam N Jacobs
Journal:  Int J Mol Sci       Date:  2021-10-11       Impact factor: 5.923

Review 2.  Aging of Vascular System Is a Complex Process: The Cornerstone Mechanisms.

Authors:  Anastasia V Poznyak; Nikolay K Sadykhov; Andrey G Kartuesov; Evgeny E Borisov; Vasily N Sukhorukov; Alexander N Orekhov
Journal:  Int J Mol Sci       Date:  2022-06-22       Impact factor: 6.208

3.  Genome-wide 5-hydroxymethylcytosine (5hmC) emerges at early stage of in vitro differentiation of a putative hepatocyte progenitor.

Authors:  Jesús Rafael Rodríguez-Aguilera; Szilvia Ecsedi; Chloe Goldsmith; Marie-Pierre Cros; Mariana Domínguez-López; Nuria Guerrero-Celis; Rebeca Pérez-Cabeza de Vaca; Isabelle Chemin; Félix Recillas-Targa; Victoria Chagoya de Sánchez; Héctor Hernández-Vargas
Journal:  Sci Rep       Date:  2020-05-08       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.