Literature DB >> 30723849

Bisulfite-free and base-resolution analysis of 5-methylcytidine and 5-hydroxymethylcytidine in RNA with peroxotungstate.

Fang Yuan1, Ying Bi2, Paulina Siejka-Zielinska2, Ying-Lin Zhou3, Xin-Xiang Zhang3, Chun-Xiao Song2.   

Abstract

5-Methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC), two of the best-studied DNA modifications, play crucial roles in normal development and disease in mammals. Although 5-methylcytidine (m5C) and 5-hydroxymethylcytidine (hm5C) have also been identified in RNA, their distribution and biological function in RNA remain largely unexplored, due to the lack of suitable sequencing methods. Here, we report a base-resolution sequencing method for hm5C in RNA. We applied the selective oxidation of hm5C to trihydroxylated-thymine (thT) mediated by peroxotungstate. thT was subsequently converted to T during cDNA synthesis using a thermostable group II intron reverse transcriptase (TGIRT). Base-resolution analysis of the hm5C sites in RNA was performed using Sanger sequencing. Furthermore, in combination with the TET enzyme oxidation of m5C to hm5C in RNA, we expand the use of peroxotungstate oxidation to detect m5C in RNA at base-resolution. By using this method, we confirmed three known m5C sites in human tRNA, demonstrating the applicability of our method in analyzing real RNA samples.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30723849      PMCID: PMC6984333          DOI: 10.1039/c9cc00274j

Source DB:  PubMed          Journal:  Chem Commun (Camb)        ISSN: 1359-7345            Impact factor:   6.222


Epitranscriptome, which refers to the multitude of RNA chemical modifications, has vital roles in post-transcriptional gene regulation.1–3 5-Methylcytidine (m5C) and 5-hydroxymethylcytidine (hm5C) are two of the major RNA modifications in eukaryotic cells, however, our understanding of them is still in its infancy. M5C is abundant in noncoding RNA, and has the ability to stabilize tRNA secondary structure,4–6 but the knowledge about its distribution and function in mRNA are still very limited due to the inconsistent results obtained from the current sequencing methods.7–9 Hm5C has been shown to be enriched in Drosophila melanogaster mRNA, increase mRNA translation and play a central role in Drosophila brain development.10 Hm5C also exists in mammalian RNA, albeit at low levels,11 and the TET proteins that oxidize 5-methylcytosine (5mC) to 5-hydroxymethylcytosine (5hmC) in DNA can also do so in RNA.10,12 However, the distribution and regulation roles of hm5C in the mammalian transcriptome remain unknown due to the lack of sensitive and robust sequencing methods. The most common way to sequence m5C in RNA is to adopt bisulfite sequencing, which is widely used to sequence 5mC in DNA. Bisulfite treatment deaminates unmethylated cytosine to uracil in single-strand RNA, while leaving m5C unconverted. Therefore, bisulfite sequencing provides base-resolution information of m5C. Using bisulfite sequencing, widespread m5C sites were identified in both coding and non-coding RNAs.7,8 However, bisulfite treatment employs sequential thermal acidic and alkaline conditions that severely damage the RNA. Further analysis also revealed potential false positives from RNA bisulfite sequencing due to incomplete conversion of unmethylated cytosine in the double-stranded RNA regions and other modifications resistant to bisulfite treatment.13–15 Other methods to sequence m5C in RNA are immunoprecipitation-based that use m5C-specific antibodies or methyltransferases to pull down m5C-containing RNA.16–18 These methods, however, do not have base-resolution and lose the quantitative levels of m5C. Mapping hm5C in RNA is even more challenging. To date, there is no base-resolution sequencing method for hm5C. The only reported method is the antibody-based immunoprecipitation approach.10 This method has been applied to the Drosophila transcriptome, but has yet to be successful in the mammalian transcriptome. Clearly, new RNA-friendly and high-resolution sequencing methods are highly desirable to further study the elusive distribution, localization and biological roles of these two modifications in RNA. Here, we report bisulfite-free and base-resolution sequencing methods for hm5C and m5C based on peroxotungstate oxidation. Peroxotungstate oxidation was first developed by the Okamoto group19,20 for selective oxidation of 5hmC in DNA to trihydroxylated-thymine (thT). thT is a thymine derivative, and will induce C-to-T transition in DNA after PCR. However, the peroxotungstate oxidation reaction requires single-strand DNA. The reaction is strongly inhibited in double-strand DNA with a conversion rate of less than 10%, which severely limits its application.20 Although this reaction is not suitable for DNA samples, we hypothesized that it could be ideal to detect hm5C in RNA, which is mostly single-stand (Fig. 1a). We termed this approach peroxotungstate oxidation sequencing (WO-Seq).
Fig. 1

Peroxotungstate reaction on hm5C-containing RNA. (a) Illustration of the peroxotungstate reaction and workflow of WO-Seq. Hm5C-containing RNA is specifically oxidized by peroxotungstate, and then reverse transcribed by thermostable group II intron reverse transcriptase (TGIRT). The oxidation product of hm5C (thT) is converted to T during cDNA synthesis, thus can be used for base-resolution sequencing of hm5C in RNA. (b) HPLC-MS/MS results of the hydrolysed product of synthesized hm5C-containing RNA1 before and after the peroxotungstate reaction. Peaks of adenosine (rA), guanosine (rG) and hm5C are labelled in the figure. (c) MALDI-MS characterization of an hm5C-containing RNA fragment of RNA1 treated with peroxotungstate. Calculated m/z is shown in black, observed m/z is shown in red.

We started with optimizing the oxidation conditions of the peroxotungstate against in vitro-transcribed hm5C-containing RNA1. MALDI-TOF MS and HPLC-MS/MS were used to monitor the reaction rate. After two rounds of 4 hours incubation at 60 °C, the hm5C peak in HPLC-MS/MS was undetectable (Fig. 1b), and the MALDI peak of RNA fragments containing one hm5C changed from m/z = 4204.4 to m/z = 4238.8. This is consistent with the calculated m/z change from hm5C-containing RNA to thT-containing RNA (Fig. 1c). Sensitivity of the peroxotungstate treatment for hm5C was also tested (Fig. S1, ESI†). Samples of different combination of hm5C modified RNA and unmodified RNA were treated by peroxotungstate, and then analysed by HPLC-MS/MS. The conversion rates of hm5C were similar in all samples, indicating that the peroxotungstate treatment is suitable for real biological samples which has low hm5C content. Next, we investigated the potential of the hm5C-to-T transition during cDNA synthesis using the peroxotungstate-oxidized RNA template. We designed and synthesized a 73mer RNA that contained three hm5C sites (RNA2). To enable us to monitor the efficiency of the hm5C-to-T conversion, one hm5C was positioned so that, upon successful hm5C-to-T conversion, a TaqαI restriction enzyme recognition site in the resulting RT-PCR product was destroyed (Fig. 2a). Since thT is not a natural occurring base, we first sought to investigate its behavior during cDNA synthesis. Several commercially available reverse transcriptases were tested on this RNA template. Interestingly, only the thermostable group II intron reverse transcriptase (TGIRT)21,22 could read though all reacted hm5C sites, while Superscript III and Bst 3.0 DNA polymerase induced truncations at the reacted hm5C sites (Fig. 2b). The reaction conditions of TGIRT were further optimized to get the best reverse transcription efficiency, and after subsequent PCR, the DNA products were digested with TaqαI. As shown in Fig. 2c, 67% of the PCR products from the oxidized hm5C-containing RNA2 sample stayed intact after the TaqαI treatment, indicating loss of restriction enzyme cut site and the successful base change induced by the peroxotungstate-oxidized RNA during cDNA synthesis.
Fig. 2

Restriction enzyme digestion assay showed effective base change during cDNA synthesis using the peroxotungstate-oxidized RNA template. (a) Illustration of the restriction enzyme digestion assay for the investigation of the base change mediated by peroxotungstate. X represent T or A or G, while Y is the complementary base of it. (b) Reverse transcription products of hm5C-containing RNA2 before and after peroxotungstate treatment using different reverse transcriptases. Hm5C-containing RNA2 has three hm5C sites at position 26, 36 and 47. The full length is 73 mer. (c) RT-PCR product of the 73-mer model RNA2 containing a TaqαI cut site. Samples without peroxotungstate treatment and control normal cytidine (rC) containing RNA2 treated with peroxotungstate were cleaved completely. About 67% of the reacted hm5C-RNA amplified product stayed intact, indicating the loss of the restriction enzyme cut site and the successful base change.

We then performed Sanger sequencing of the PCR product from the oxidized RNA samples (Fig. 3). At each hm5C site, a new peak of thymine signal appears, confirming the base change is indeed C-to-T. In order to accurately quantify the C-to-T conversion rate, the PCR product was cloned and sequenced individually. A 62.1% conversion rate was observed from a total of 66 hm5C sites sequenced (Fig. S2, ESI†), consistent with the restriction enzyme digestion result. As a control, PCR products of peroxotungstate-treated normal cytosine (rC)-containing RNA2 and m5C-containing RNA2 were also cloned and sequenced (Fig. S3, ESI†). Results showed that both rC and m5C sites did not change after the treatment, indicating an excellent selectivity of peroxotungstate oxidation on hm5C. Notably, peroxotungstate oxidation is a mild reaction, which showed less damage on RNA compared with bisulfite reaction (Fig. S4, ESI†).
Fig. 3

Electropherograms of Sanger sequencing results before and after WO-Seq. The conversion of C-to-T happened at each hm5C site.

After demonstrating WO-Seq for hm5C sequencing, we next sought to expand its use for m5C sequencing in RNA. In DNA, 5hmC is generated by the oxidation of 5mC mediated by the TET enzyme. Recently, the mammalian TET enzyme was reported to have the ability of oxidizing m5C to hm5C in RNA.12 We tested Naeglaria Tet-like oxygenase (NgTET1)23 and showed it can also oxidize m5C to hm5C on m5C-containing RNA1 by both MALDI-MS and HPLC-MS/MS (Fig. S5, ESI†). Based on this, we further aimed to combine the peroxotungstate oxidation with NgTET1 oxidation to detect m5C in a procedure we termed TET-Assisted WO-Seq (TAWO-Seq) (Fig. 4a). The results of both oxidation reactions were verified by HPLC-MS/MS (Fig. S6, ESI†). Restriction enzyme digestion assays and Sanger sequencing were performed (Fig. 4b and c). Sanger sequencing results showed the C-to-T transition at each m5C site. The m5C-to-T conversion rate was 50% estimated by restriction enzyme analysis, lower than that of hm5C, due to incomplete m5C to hm5C oxidation by NgTET1 (Fig. S6, ESI†). We also cloned and sequenced individual PCR product for the m5C sample. As shown in Fig. S7 (ESI†), 33.3% of the total m5C sites were successfully detected. Commercially available mouse Tet1 (mTet1) was also tested for the TAWO-Seq, which gives similar results with NgTET1 (Fig. S8a, ESI†). We further demonstrated that β-glucosyltransferase (βGT) can label hm5C with glucose and thereby protect it from peroxotungstate oxidation (Fig. S9, ESI†). Combining βGT protection with TAWO-Seq could therefore enable it to detect m5C specifically.
Fig. 4

The combination of NgTET1 oxidation and peroxotungstate reaction in detecting m5C in RNA in TAWO-Seq. (a) Illustration of TAWO-Seq strategy for the identification of m5C in RNA at single-nucleotide resolution. (b) Restriction enzyme digestion assay of (+) and (–) NgTET1-assisted peroxotungstate-treated samples. About 50% of the m5C sites were detected. (c) Sanger-sequencing results before and after TAWO-Seq.

To further demonstrate the utility of TAWO-Seq on real RNA sample, we applied it to the endogenous tRNAAsp(GUC) in 293T cells. The tRNAAsp(GUC) contains three known m5C sites at structural positions 38, 47 and 48 (Fig. S10a, ESI†).24–26 Both NgTET1 and mTet1 were used to oxidize the tRNA and the products were then treated with peroxotungstate. The RT-PCR product of treated tRNAAsp(GUC) was cloned and sequenced. As shown in Fig. S10b and c (ESI†), 35.2% of the m5C sites were successfully detected with NgTET1 assisted WO-Seq, and 37.5% of the m5C sites were detected using mTet1 assisted WO-Seq, which demonstrated the applicability of TAWO-Seq to real RNA samples. Among the three m5C sites in tRNAAsp(GUC), we found that m5C at position 48 has the highest C-to-T conversion rate (68.4% by NgTET1 assisted WO-Seq, 77.8% by mTet1 assisted WO-Seq). According to the tRNAAsp(GUC) structure, this site is in a double-stranded CpG context, which is an ideal substrate of TET enzyme.23 It is likely that the different m5C-to-T conversion rates of three m5C sites are caused by the sequence preference of the TET proteins. In conclusion, we have described WO-Seq as an RNA friendly, chemical oxidation-based, base-resolution method to sequence hm5C in RNA. We demonstrate the specific hm5C-to-T transition using peroxotungstate to oxidize the RNA followed by cDNA synthesis with the TGIRT enzyme, and Sanger sequencing results have proved the base-resolution sequencing ability of this method. We further demonstrate the ability of TAWO-Seq to detect m5C by combining WO-Seq with the prior NgTET1 or mTet1 oxidation of m5C to hm5C. The successful detection of m5C sites in human tRNA demonstrates our method is applicable for real RNA samples. Both WO-Seq and TAWO-Seq could potentially solve the false positive issue of bisulfite sequencing since they directly detect modified cytosine without affecting unmodified cytosine. Further improvement of both methods to increase the conversion rate and apply to mRNA samples using next-generation sequencing technology are underway in the lab. We believe this method could be highly useful for the identification of unexplored m5C/hm5C distribution and function in the transcriptome. We would like to acknowledge P. Spingardi, G. Berridge and B. Kessler for helping with the HPLC-MS/MS; C. He for the mTet1; F. Howe for editing the manuscript. This work was supported by the Ludwig Institute for Cancer Research. Work in the C.-X. Song lab is also supported by Cancer Research UK (C63763/A26394 and C63763/A27122), NIHR Oxford Biomedical Research Centre, and Conrad N. Hilton Foundation. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health. F. Yuan and Y. Bi are supported by China Scholarship Council.

Conflicts of interest

There are no conflicts to declare. Click here for additional data file.
  25 in total

Review 1.  The emerging epitranscriptomics of long noncoding RNAs.

Authors:  Andrew Shafik; Ulrike Schumann; Maurits Evers; Tennille Sibbritt; Thomas Preiss
Journal:  Biochim Biophys Acta       Date:  2015-11-02

2.  5-Hydroxymethylcytosine-selective oxidation with peroxotungstate.

Authors:  Akimitsu Okamoto; Kaori Sugizaki; Akiko Nakamura; Hiroyuki Yanagisawa; Shuji Ikeda
Journal:  Chem Commun (Camb)       Date:  2011-09-15       Impact factor: 6.222

3.  Base-Resolution Analysis of 5-Hydroxymethylcytosine by One-Pot Bisulfite-Free Chemical Conversion with Peroxotungstate.

Authors:  Gosuke Hayashi; Kenta Koyama; Hidefumi Shiota; Asuka Kamio; Takayoshi Umeda; Genta Nagae; Hiroyuki Aburatani; Akimitsu Okamoto
Journal:  J Am Chem Soc       Date:  2016-10-24       Impact factor: 15.419

4.  5-Methylcytidine is required for cooperative binding of Mg2+ and a conformational transition at the anticodon stem-loop of yeast phenylalanine tRNA.

Authors:  Y Chen; H Sierzputowska-Gracz; R Guenther; K Everett; P F Agris
Journal:  Biochemistry       Date:  1993-09-28       Impact factor: 3.162

5.  Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing.

Authors:  Sabine Mohr; Eman Ghanem; Whitney Smith; Dennis Sheeter; Yidan Qin; Olga King; Damon Polioudakis; Vishwanath R Iyer; Scott Hunicke-Smith; Sajani Swamy; Scott Kuersten; Alan M Lambowitz
Journal:  RNA       Date:  2013-05-22       Impact factor: 4.942

6.  Transcriptome-wide mapping of 5-methylcytidine RNA modifications in bacteria, archaea, and yeast reveals m5C within archaeal mRNAs.

Authors:  Sarit Edelheit; Schraga Schwartz; Maxwell R Mumbach; Omri Wurtzel; Rotem Sorek
Journal:  PLoS Genet       Date:  2013-06-27       Impact factor: 5.917

7.  5-methylcytosine promotes mRNA export - NSUN2 as the methyltransferase and ALYREF as an m5C reader.

Authors:  Xin Yang; Ying Yang; Bao-Fa Sun; Yu-Sheng Chen; Jia-Wei Xu; Wei-Yi Lai; Ang Li; Xing Wang; Devi Prasad Bhattarai; Wen Xiao; Hui-Ying Sun; Qin Zhu; Hai-Li Ma; Samir Adhikari; Min Sun; Ya-Juan Hao; Bing Zhang; Chun-Min Huang; Niu Huang; Gui-Bin Jiang; Yong-Liang Zhao; Hai-Lin Wang; Ying-Pu Sun; Yun-Gui Yang
Journal:  Cell Res       Date:  2017-04-18       Impact factor: 25.617

8.  Detection of expanded RNA repeats using thermostable group II intron reverse transcriptase.

Authors:  Samuel T Carrell; Zhenzhi Tang; Sabine Mohr; Alan M Lambowitz; Charles A Thornton
Journal:  Nucleic Acids Res       Date:  2018-01-09       Impact factor: 16.971

9.  The birth of the Epitranscriptome: deciphering the function of RNA modifications.

Authors:  Yogesh Saletore; Kate Meyer; Jonas Korlach; Igor D Vilfan; Samie Jaffrey; Christopher E Mason
Journal:  Genome Biol       Date:  2012-10-31       Impact factor: 13.583

10.  NSun2-mediated cytosine-5 methylation of vault noncoding RNA determines its processing into regulatory small RNAs.

Authors:  Shobbir Hussain; Abdulrahim A Sajini; Sandra Blanco; Sabine Dietmann; Patrick Lombard; Yoichiro Sugimoto; Maike Paramor; Joseph G Gleeson; Duncan T Odom; Jernej Ule; Michaela Frye
Journal:  Cell Rep       Date:  2013-07-18       Impact factor: 9.423

View more
  12 in total

Review 1.  A molecular-level perspective on the frequency, distribution, and consequences of messenger RNA modifications.

Authors:  Joshua D Jones; Jeremy Monroe; Kristin S Koutmou
Journal:  Wiley Interdiscip Rev RNA       Date:  2020-01-21       Impact factor: 9.957

Review 2.  The Methylation Game: Epigenetic and Epitranscriptomic Dynamics of 5-Methylcytosine.

Authors:  Adele Alagia; Monika Gullerova
Journal:  Front Cell Dev Biol       Date:  2022-06-03

3.  Chemical Method to Sequence 5-Formylcytosine on RNA.

Authors:  Ang Li; Xuemeng Sun; A Emilia Arguello; Ralph E Kleiner
Journal:  ACS Chem Biol       Date:  2022-02-25       Impact factor: 4.634

4.  m5CRegpred: Epitranscriptome Target Prediction of 5-Methylcytosine (m5C) Regulators Based on Sequencing Features.

Authors:  Zhizhou He; Jing Xu; Haoran Shi; Shuxiang Wu
Journal:  Genes (Basel)       Date:  2022-04-12       Impact factor: 4.141

Review 5.  Mapping the epigenetic modifications of DNA and RNA.

Authors:  Lin-Yong Zhao; Jinghui Song; Yibin Liu; Chun-Xiao Song; Chengqi Yi
Journal:  Protein Cell       Date:  2020-05-22       Impact factor: 14.870

Review 6.  Analysis of RNA Modifications by Second- and Third-Generation Deep Sequencing: 2020 Update.

Authors:  Yuri Motorin; Virginie Marchand
Journal:  Genes (Basel)       Date:  2021-02-16       Impact factor: 4.096

Review 7.  Deciphering Epitranscriptome: Modification of mRNA Bases Provides a New Perspective for Post-transcriptional Regulation of Gene Expression.

Authors:  Suresh Kumar; Trilochan Mohapatra
Journal:  Front Cell Dev Biol       Date:  2021-03-16

8.  m5C-Atlas: a comprehensive database for decoding and annotating the 5-methylcytosine (m5C) epitranscriptome.

Authors:  Jiongming Ma; Bowen Song; Zhen Wei; Daiyun Huang; Yuxin Zhang; Jionglong Su; João Pedro de Magalhães; Daniel J Rigden; Jia Meng; Kunqi Chen
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

Review 9.  The role of RNA m5C modification in cancer metastasis.

Authors:  Qiaofeng Zhang; Furong Liu; Wei Chen; Hongrui Miao; Huifang Liang; Zhibin Liao; Zhanguo Zhang; Bixiang Zhang
Journal:  Int J Biol Sci       Date:  2021-08-02       Impact factor: 6.580

10.  iRNA5hmC: The First Predictor to Identify RNA 5-Hydroxymethylcytosine Modifications Using Machine Learning.

Authors:  Yuan Liu; Dasheng Chen; Ran Su; Wei Chen; Leyi Wei
Journal:  Front Bioeng Biotechnol       Date:  2020-03-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.