Literature DB >> 27079978

Base pairing and structural insights into the 5-formylcytosine in RNA duplex.

Rui Wang1, Zhipu Luo2, Kaizhang He3, Michael O Delaney3, Doris Chen1, Jia Sheng4.   

Abstract

5-Formylcytidine (f(5)C), a previously discovered natural nucleotide in the mitochondrial tRNA of many species including human, has been recently detected as the oxidative product of 5-methylcytidine (m(5)C) through 5-hydroxymethylcytidine (hm(5)C) in total RNA of mammalian cells. The discovery indicated that these cytosine derivatives in RNA might also play important epigenetic roles similar as in DNA, which has been intensively investigated in the past few years. In this paper, we studied the base pairing specificity of f(5)C in different RNA duplex contexts. We found that the 5-formyl group could increase duplex thermal stability and enhance base pairing specificity. We present three high-resolution crystal structures of an octamer RNA duplex [5'-GUA(f(5)C)GUAC-3']2 that have been solved under three crystallization conditions with different buffers and pH values. Our results showed that the 5-formyl group is located in the same plane as the cytosine base and forms an intra-residue hydrogen bond with the amino group in the N4 position. In addition, this modification increases the base stacking between the f(5)C and the neighboring bases while not causing significant global and local structure perturbations. This work provides insights into the effects of 5-formylcytosine on RNA duplex.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27079978      PMCID: PMC4889945          DOI: 10.1093/nar/gkw235

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

RNA is involved in numerous biochemical and cellular processes as a carrier of genetic information, an adapter in protein synthesis, a structural scaffold in subcellular organelles and a functional catalyst and regulator of biochemical reactions (1–5). Over 140 naturally occurring chemical modifications have been identified in mRNA, rRNA, tRNA and other non-coding RNAs (6). Many of these modifications have been shown to play critical roles in maintaining and diversifying RNA structures and functions. Although it had been assumed that these modifications were stable and static, there is evidence that some of these post-transcriptional RNA modifications, similar to those DNA and protein epigenetic markers, can be very dynamic with unexpected regulatory and/or signaling functions. For example, N6-methyladenosine (m6A), the most common internal modification in eukaryotic mRNAs and some long non-coding RNAs, appears to be involved in epigenetic inheritance. m6A can undergo oxidative demethylation catalyzed by dioxygenases including FTO, AlkBH1, AlkBH3 and AlkBH5 (7–11). This methylation-demethylation process is dynamic and may be important in RNA structural switches, protein recognition, mRNA stability, miRNA processing, protein translation and stem cell differentiation (12,13). Another potential ‘RNA epigenetic’ marker is the 5-methylcytosine (m5C), which was first discovered over a half century ago in cellular RNAs across all three domains of life (14–21). The structures and epigenetic functions of m5C and oxidative analogues 5-hydroxymethyl-, 5-formyl- and 5-carboxyl-cytosines (hm5C, f5C and ca5C, respectively) in DNA have been extensively studied in the past few years (22,23). In comparison, their corresponding investigations in RNA contexts are left far behind. It is known that the 5-methylation of cytosine in long non-coding RNAs affects the binding of chromatin-modifying complexes and is thought to regulate cellular epigenetic status (20). It was also demonstrated that the m5C is subject to in vivo oxidation to generate hm5C and f5C in total RNA from all domains of life and in polyA-enriched RNA fractions from mammalian cells (24). In addition, the in vitro formation of hm5C from m5C can be mediated by TET proteins (25), the 10-11 translocation family of enzymes that are the major players in the m5C oxidative metabolism in DNA. Very recently, the transcriptome-wide distribution of hm5C has been investigated in Drosophila and the RNA hydroxymethylation was found to favor mRNA translation (26). All these discoveries support the hypothesis that the transformative m5C could serve as an RNA epigenetic marker. Therefore, studying the structures of its oxidative intermediates (Scheme 1) in different RNA contexts will help elucidate the biological roles of these naturally occurring modifications in RNA functions and regulation.
Scheme 1.

Proposed oxidative metabolism of m5C (1) through hm5C (2), f5C (3) and ca5C (4). The hydroxylation of m5C can be mediated by TET enzymes, which very likely could perform the following oxidation steps, similar to their functions in DNA.

Proposed oxidative metabolism of m5C (1) through hm5C (2), f5C (3) and ca5C (4). The hydroxylation of m5C can be mediated by TET enzymes, which very likely could perform the following oxidation steps, similar to their functions in DNA. In the tRNAs of many species, including humans, f5C is observed at the wobble position of the anticodon loop of mitochondrial methionine tRNA (27–31). This modification increases the dynamics of loop residues and affords the single tRNAMet the ability to decode both AUG and AUA in translational initiation and elongation sites of mRNA (32–34). Although the crystal structure of f5C within the codon-anticodon complex in ribosomal A-site has been studied (35), a clear view of its local geometry remains elusive. In addition, the base recognition abilities and structural features of f5C in RNA duplexes are also unknown. In this paper, we studied the base paring specificity of f5C in different RNA duplex contexts and found that the 5-formyl modification could increase duplex stability relative to unmodified C and enhanced the discrimination for G over the other bases. We also present three high-resolution crystal structures of an octamer RNA duplex [5′-GUA(f5C)GUAC-3′]2 containing two consecutive f5C:G pairs that have been solved in three different conditions. We found that the 5-formyl group does not cause significant local or global structural perturbations. The formyl group is located in the same plane as the cytosine base and forms an intra-residue hydrogen bond with the amino group at N4 position, increasing the base stacking between the f5C and the neighboring bases. In addition, the f5C interacts with neighboring residues through bridging water molecules.

MATERIALS AND METHODS

Synthesis and purification of f5C-containing RNA oligonucleotides

f5C-modified RNA sequences were synthesized on a 1-μmol scale using a MerMade 12 synthesizer through ACE chemistry procedures (36). The f5C 2′-ACE-phosphoramidite (0.067 M in anhydrous acetonitrile), synthesized with the previously published method (32), was coupled to the growing polyribonucleotide chain with a coupling time of 3.5 min using 5-ethylthio-1H-tetrazole (0.5 M in anhydrous acetonitrile) as the activator. Once the synthesis of the polyribonucleotide chain was completed, the phosphate protecting groups were removed from the immobilized polyribonucleotide by treatment with disodium 2-carbamoyl-2-cyanoethylene-1,1-dithiolate-trihydrate in DMF for 15 min. The support was washed with water (5 ml), and the column was flushed with argon gas for 5 min to dry the support. The support was then transferred to a 2-ml Eppendorf tube, and the polyribonucleotide was cleaved from the support and the exocyclic amine protecting groups were removed by treatment with 1:3 (v/v) tert-butylamine:water for 6 h at 55°C. The sample was cooled to room temperature, filtered and lyophilized to obtain the crude polyribonucleotide. The 2′-ACE group was removed with acetate/TEMED using the protocol recommended by Dharmacon. The oligonucleotides were desalted by ethanol precipitation and then purified by ion-exchange HPLC over a PA-100 column (Dionex). The oligonucleotides were eluted with a linear gradient of 0–35% buffer B (buffer A was pure water, and buffer B was 2 M ammonium acetate, pH 7.1) over 20 min at a flow rate of 1 ml/min. Collected fractions were lyophilized, desalted with Waters Sep-Pac C18 columns and re-concentrated. All the samples were verified by ESI-MS (mass spectra and sequences are shown in Supplementary Table S1 and Figures S1–3).

Thermal denaturation and CD experiments

Solutions of the duplex RNAs (0.5 μM) were prepared by dissolving the purified oligonucleotides in 10 mM sodium phosphate (pH 6.5) with 100 mM NaCl. The solutions were heated to 85°C for 3 min, then cooled slowly to room temperature, and stored at 4°C for overnight. Thermal denaturation was performed in a Cary 300 UV-Visible Spectrophotometer equipped with a temperature controller. The temperature reported is the block temperature. Each curve was acquired at 260 nm by heating and cooling from 5 to 85°C four times at a rate of 0.5°C/min. Experiments were repeated at least four times. Circular Dichroism (CD) studies were carried out in the same buffer utilizing a Jasco-815 CD spectrometer at room temperature in a quartz cell with a 10-mm path length. CD spectra were collected from 380 to 200 nm and with a scanning speed of 100 nm/min. The bandwidth was 1.0 nm, and the digital integration time was 1.0 s. All CD spectra were baseline-corrected for signal contributions due to the buffer.

Crystallization and diffraction data collection

RNA samples (0.5 mM duplex) were heated to 80°C for 3 min, cooled slowly to room temperature and placed at 4°C overnight before crystallization. Nucleic Acid Mini Screen Kits (Hampton Research), Natrix (Hampton Research) and Nuc-Pro-HTS (Jena Bioscience) were used to screen crystallization conditions at both 4 and 20°C using the hanging drop method. Perfluoropolyether was used as cryoprotectant for crystal mounting. Data was collected under a liquid nitrogen stream at −174°C. All diffraction data were collected at beam lines SER-CAT 22-BM at Advance Photon Source, Argonne National Laboratory, USA. A number of crystals that grew under different conditions were scanned, and the data were collected at a wavelength of 1.0 Å. All data were processed using HKL2000 and DENZO/SCALEPACK (37).

Structure determination and refinement

All of the three RNA structures presented here were solved by molecular replacement with PHASER using PDB structure 197D (the deoxy version of the same sequence) as the search model, followed by refinement using REFMAC. The refinement protocol included simulated annealing, positional refinement, restrained B-factor refinement and bulk solvent correction. The stereo-chemical topology and geometrical restraint parameters of DNA/RNA were applied (38). The topologies and parameters for 5-formylcytosine were constructed using Jligand (39). After several cycles of refinement, a number of highly ordered waters and metal ions were added. Anisotropic refinement was applied for high resolution data sets with space group P21 and P32. TLS refinement was carried out for dataset processed in C2 space group. Cross-validation (40) with a 5% test set was monitored during the refinement. The σA-weighted maps (41) of the (2m|Fo|-D|Fc|) and the difference (m|Fo|-D|Fc|) density maps were computed and used throughout the model building.

Quantum mechanical calculations

Quantum mechanical calculations of the 1-methyl-5-formyl cytosine were performed with the Gaussian09 package (42). Geometry optimization was carried out with ‘tight’ convergence criteria, which required that the maximum and average forces on an atom in the final iteration diminished to less than 1.5 × 10−5 and 1 × 10−5 Hartree/Bohr, respectively, and that the maximum and average displacements in the last two iterations were less than 6 × 10−5 and 4 × 10−5 Bohr, respectively. All density functional theory (DFT) calculations were performed using an ‘ultra-fine’ numerical integration grid, including 99 radial shells and 590 angular points per shell. DFT and second-order Møller-Plesset perturbation theory (MP2) were employed in conjunction with the 6–31G(d,p) basis set. For DFT calculations, the B3LYP (43–45) function was used. To account for solvation effects, all structures were fully optimized with the polarizable continuum model (PCM) (46,47). These calculations were designated as B3LYP/6-31G(d,p)/PCM and MP2/6-31G(d,p)/PCM, respectively. Harmonic vibrational frequency calculations at the same level verified that each structure was a minimum with zero imaginary frequency. Single point energy calculations were performed at the geometries optimized at the B3LYP/6-31G(d,p)/PCM level. The B3LYP was combined with the PCM in conjunction with the more complete 6–311++G(2d,2p) basis set. The zero point energy and thermal corrections obtained from the B3LYP/6-31G(d,p)/PCM level of theory were then added to the electronic energies to obtain free energies.

RESULTS AND DISCUSSION

Thermodynamic stabilities and spectroscopic properties of f5C-containing RNA duplexes

We synthesized three sets of RNA oligonucleotides to investigate the thermodynamic stability and base pairing specificity of 5-formylcytosine-containing duplexes. As shown in Table 1 and Supplementary Figure S4A–E, duplexes containing the f5C-G pair have significantly higher melting temperatures (Tm) than the native counterparts. In the 7-mer duplex-1 system, the 5-formyl modification increased the Tm by 4.8°C relative to the duplex with only native Watson–Crick pairs (entry 2 versus 7). Consistent with this result, the Tm of the f5C-modified 12-mer duplex-2 was 4.1°C higher than the unmodified duplex (entry 12 versus 17). In the context of a self-complementary 8-mer (duplex-3), two consecutive f5C-G pairs resulted in a 7.3°C increase in Tm relative to the unmodified duplex (entry 21 versus 22). Although a previous study found that the f5C in the anticodon loop of tRNAMet decreases the thermal stability of the codon-anticodon interaction due to the decreased base stacking relative to the unmodified tRNA (32), our results indicate that the f5C modification might enhance the base stacking in RNA duplex context since the 5-formyl group is not directly involved in the hydrogen bonding interactions. This result is also consistent with a previous study of DNA duplexes modified with f5C, in which a 5-formyl-2′-deoxycytosine-modified 25-mer duplex had a Tm 1°C higher than that of the unmodified duplex (48).
Table 1.

Melting temperatures of native and f5C-modified RNA duplexes

EntrySequencesBase pair T m (°C)aΔTm (°C)b
Duplex-1 1 I: 5′-UAGCUCC-3′
2 I + 3′-AUCGAGG-5′C:G38.1
3 I + 3′-AUCUAGG-5′C:U23.9−14.2
4 I + 3′-AUCCAGG-5′C:C18.1−20.0
5 I + 3′-AUCAAGG-5′C:A16.2−21.9
6 II: 5′-UAGf5CUCC-3′
7 II + 3′-AUCGAGG-5′f5C:G42.9 +4.8
8 II + 3′-AUCUAGG-5′f5C:U24.5−13.6
9 II + 3′-AUCCAGG-5′f5C:C18.6−19.5
10 II + 3′-AUCAAGG-5′f5C:A19.6−18.5
Duplex-2 11 III: 5′-GGACUCCUGUAG -3′
12 III + 3′-CCUGAGGACAUC -3′C:G63.5
13 III + 3′-CCUGAUGACAUC -3′C:U46.9−17.6
14 III + 3′-CCUGACGACAUC -3′C:C47.3−17.2
15 III + 3′-CCUGAAGACAUC -3′C:A48.7−15.8
16 IV: 5′-GGACUf5CCUGUAG -3′
17 IV + 3′-CCUGAGGACAUC -3′f5C:G67.6 +4.1
18 IV + 3′-CCUGAUGACAUC -3′f5C:U49.8−13.7
19 IV + 3′-CCUGACGACAUC -3′f5C:C49.1−14.4
20 IV + 3′-CCUGAAGACAUC -3′f5C:A52.0−11.5
Duplex-3 21 V: (5′-GUACGUAC-3′)2C:G35.3
22 VI: (5′-GUAf5CGUAC-3′)2f5C:G42.6 +7.3

a T ms were measured in 10 mM sodium phosphate (pH 7.0) containing 100 mM NaCl. Tm values reported are the averages of four measurements.

bΔTm values are relative to the duplexes with only Watson–Crick pairs.

a T ms were measured in 10 mM sodium phosphate (pH 7.0) containing 100 mM NaCl. Tm values reported are the averages of four measurements. bΔTm values are relative to the duplexes with only Watson–Crick pairs. We next evaluated the mismatched pairing with f5C. The Tms of f5C-U and f5C-C mismatch-containing duplex-1 sequences were similar to those of the native mismatched duplexes (entry 3 versus 8 and 4 versus 9). The f5C-A mismatched duplex was significantly more stable than the duplex containing the C-A mismatch (entry 5 versus 10). In the duplex-2 system, f5C stabilized all the three mismatched duplexes relative to duplexes with mismatches with C (entry 13 versus 18, 14 versus 19, and 15 versus 20). These data support our hypothesis that the f5C has more favorable base stacking than cytosine, balancing the lack of optimal hydrogen bonding of these mispairs. Interestingly, the f5C-G pairs have overall better discrimination against all the other mispairs when f5C-G is compared to other mispairs. For example, the Tm of the f5C-G-containing duplex-2 is 23.3°C higher than that of the f5C-A-containing duplex (entry 7 versus 10), whereas the difference between Tms of the C-G- and C-A-containing duplexes is 21.9°C (entry 2 versus 5). This demonstrates higher RNA base pairing specificity in the presence of 5-formyl group. We also compared the circular dichroism spectra of RNA duplexes containing C-G and f5C-G pairs. As shown in Supplementary Figure S4F and G, unmodified duplex-1 and duplex-2 show characteristic peaks of the A-form helical conformation in solution: a strong positive peak around 270 nm and a weak negative peak around 240 nm, typical peaks of RNA duplexes with mostly GC pairs (49,50). The f5C-modified 12-mer duplex has a spectrum similarly to that of the native duplex (Supplementary Figure S4G), whereas the presence of f5C in the 7-mer duplex-1 down-shifts the positive peak by about 10 nm and eliminated the negative peak at 240 nm (Supplementary Figure S4F). Both duplex and hairpin structures are present in the solution of self-complementary 8-mer duplex-3 (51), and a down-field shift of the strong positive peak was observed in the spectrum of the modified oligonucleotide compared to the unmodified one (Supplementary Figure S4H). The CD spectra of nucleic acids are affected by many factors including sequence, base stacking and overall conformation; given this, our data indicate that the 5-formyl modification does not cause gross perturbations in folding but does have some impact on double helix parameters. This is in agreement with conclusions from a previous study of f5C (48) and the m5C, hm5C modified DNA duplexes (52).

Overall crystal structures of f5C RNA duplex

To investigate the structural features of f5C RNA duplexes and to explore the mechanisms of the enhancement of duplex stability due to the 5-formyl group, we obtained three crystal structures for the f5C-containing self-complementary duplex-3, a purine-pyrimidine alternating octamer [5′-GUA(f5C)GUAC-3′]2, under three different buffer and pH conditions. The detailed crystallization conditions and data collection and structure refinement statistics are summarized in Table 2. All the structures were solved by molecular replacement using a DNA duplex with the same sequence (PDB ID: 197D) as the search model (53). In HEPES buffer at pH 7, the RNA molecules crystallized in the space group of P21. In Tris buffer at pH 7.5, the space group was P32, and in sodium cacodylate at pH 6.0, the space group was C2. To determine whether the 5-formyl group is involved in molecular packing, we investigated the helix-helix interactions in each asymmetric unit. As shown in Figure 1A and C, the duplexes in the C2 and P21 space groups are stacked in head-to-tail fashion with the G1:C8 pair stacked on the C8:G1 pair (Figure 1B), forming essentially continuous helices that pack side-by-side (Figure 1A). In the P32 structure, there are three duplexes in each asymmetric unit that pack along three helical axes at an angle of ∼100° (Figure 1D), forming a ‘bowl’ shaped structure. Further analysis indicates that although the 5-formyl group is not directly involved in the duplex–duplex interactions, the 2′-hydroxyl of the f5C residue hydrogen bonds with a phosphate oxygen in the A3 residue of one adjacent duplex (Figure 1E).
Table 2.

X-ray data and structure refinement statistics of the f5C-containing self-complementary duplex-3 crystals grown under three different conditions

Condition 1 (PDB ID: 5HNJ)Condition 2 (PDB ID:5HN2)Condition 3 (PDB ID:5HNQ)
Crystallization50 mM Na•HEPES (pH 7.0), 50 mM MgSO4, 1.6 M Li2SO410% MPD, 50 mM Tris•HCl (pH 7.5), 50 mM NH4Ac, 10 mM MgCl210% MPD, 40 mM sodium cacodylate(pH 6.0), 12 mM spermine tetrahydrochloride, 80 mM NaCl, 12 mM KCl, 20 mM MgCl2
Scaling
Space group P21 P32 C2
Unit cell parameters, Å30.7 × 45.9 × 45.442.3 × 42.3 × 58.6138.9 × 44.3 × 50.6
Unit cell parameters, degrees90, 100.9, 9090, 90, 12090, 102.8, 90
Resolution range, Å (last shell)30–1.24 (1.28–1.24)30–1.5 (1.55–1.5)50–2.4 (2.49–2.4)
Unique reflections (last shell)35 304 (3511)18 577 (1829)12 285 (1225)
Completeness, %100 (100)98 (96.7)99.6 (99.2)
Rmerge, %4.6 (35.5)5.8 (36.5)4.9 (33.0)
<I/σ(I)>35.4 (2.1)35.9 (2.1)22.3 (2.0)
Redundancy7.2 (5.7)5.3 (5.3)3.7 (3.8)
Refinement
Molecules per asymmetric unit3 duplexes3 duplexes6 duplexes and 1 single strand
Resolution range, Å25.2–1.2422.9–1.528.7–2.4
Number of reflections33 51517 55111 304
Completeness, %99.998.199.7
Rwork, %13.412.322.1
Rfree, %15.514.026.1
Bond length rmsd, Å0.0200.0220.016
Bond angle rmsd, degrees2.22.52.4
Overall B-factor with water, Å218.622.2103.5
Figure 1.

Molecular packing patterns of f5C-RNA in asymmetric unit cells. (A) Side view of duplex stacking in one asymmetric unit with C2 space group. (B) Zoom in view of the duplex terminal junction showing head-to-tail stacking of two terminal C8:G1 pairs; this occurs in both C2 and P21 space group asymmetric units. (C) Side view of duplex stacking in one asymmetric unit with P21 space group. (D) Top view of duplex stacking in one asymmetric unit with P32 space group. (E) Zoom in view of interaction between by 2′ hydroxyl of f5C and the phosphate oxygen of A3 in adjacent duplexes.

Molecular packing patterns of f5C-RNA in asymmetric unit cells. (A) Side view of duplex stacking in one asymmetric unit with C2 space group. (B) Zoom in view of the duplex terminal junction showing head-to-tail stacking of two terminal C8:G1 pairs; this occurs in both C2 and P21 space group asymmetric units. (C) Side view of duplex stacking in one asymmetric unit with P21 space group. (D) Top view of duplex stacking in one asymmetric unit with P32 space group. (E) Zoom in view of interaction between by 2′ hydroxyl of f5C and the phosphate oxygen of A3 in adjacent duplexes. In the whole unit cell level, the molecular packing in different space groups looks very similar (Supplementary Figure S5). Investigating the intermolecular interactions between duplexes shows uniform hydrogen-bonds networks. In C2 space group, there are very few intermolecular interactions observed at medium resolution (2.4 Å). For higher resolution (1.5 Å) data in P32 space group, very strong H-bond networks were formed by OP1 of f5C4 and O2′ and O3′ of A3 mediated by two water molecules which located around the 3-fold axis of the cell (Supplementary Figure S6A), which can be assigned into three parallel panels (Supplementary Figure S6B). Notably, it is possible that the top water molecule in Supplementary Figure S6B can be assigned to NH4+ ion (54) since we used ammonium acetate in the crystallization buffer and it is technically very difficult to differentiate the two residues based on the density map. The terminal O2′ and O3′ of chain A, C and E can also form H-bond networks by direct interaction or by water-mediated interactions (Supplementary Figure S6C and D). In addition, eight water molecules were also observed to form in line mediating H-bond networks between chain A, B, E and F (Supplementary Figure S7). At much higher resolution (1.24 Å) in P21 space group, the terminal O2′ and O3′ in chain A, C and E can form very strong H-bond networks without any water molecules (Supplementary Figure S6E). These different inter-strand H-bond networks might have resulted in different space groups and further confirm that the 5-formyl group is not directly interacting with other residues.

Conformation of f5C in the RNA duplex

The root mean square deviations (r.m.s.d) between the three structures are less than 0.4 Å (Figure 2A), and the conformations of the f5C:G pairs from the three structures are almost identically superimposable (Figure 2B). This implies that the f5C has a very similar conformation over a pH range from 6.0 to 7.5. The density map of the f5C clearly shows that the 5-formyl group is located in the same plane as the cytosine base, with the bond of the carbonyl group parallel to the C4-N4 bond (Figure 2C); this effectively expands the conjugation system of cytosine ring. This structural feature is different from the geometry of f5C observed in tRNA codon-anticodon interactions (35), in which the carbonyl group turns to its 5′-phosphate oxygen and forms a dihedral angle of ∼60° with the cytosine plane (Figure 2D), although the density map of 5-formyl group in this structure is not clear due to the low resolution of the overall complex structure. The different 5-formyl conformations might also be attributed to the presence of ribosomal binding, which results in different structure context than duplex. Notably, the short distance between the formyl oxygen and the N4 atom (2.7 Å) indicates a strong hydrogen bonding interaction that might stabilize the observed f5C conformation.
Figure 2.

Overview and comparison of the structures obtained under different crystallization conditions. (A and B) Superimposed (A) duplexes and (B) f5C-G pairs from P32 (red), P21 (blue) and C2 (green) space groups. (C) Electron density map of f5C residue with the oxygen atom pointing up in the same plane as the cytosine (2Fo-Fc map with σ 1.0). (D) Comparison of f5C residue in P32 (green) and the one in the tRNA codon–anticodon interaction (magenta; PDB ID: 4GKK). The two spheres represent the 5-formyl oxygen atoms. (E and F) Superimposed (E) duplexes and (F) f5C-G pairs from P32 space group structure (red) and unmodified duplex structure.

Overview and comparison of the structures obtained under different crystallization conditions. (A and B) Superimposed (A) duplexes and (B) f5C-G pairs from P32 (red), P21 (blue) and C2 (green) space groups. (C) Electron density map of f5C residue with the oxygen atom pointing up in the same plane as the cytosine (2Fo-Fc map with σ 1.0). (D) Comparison of f5C residue in P32 (green) and the one in the tRNA codon–anticodon interaction (magenta; PDB ID: 4GKK). The two spheres represent the 5-formyl oxygen atoms. (E and F) Superimposed (E) duplexes and (F) f5C-G pairs from P32 space group structure (red) and unmodified duplex structure. Next, we calculated energies of the 5-formylcytosine base using both DFT and MP2 quantum mechanics methods. Both bond lengths (C5-C7 and C7-O7) and angles (C5-C7-O7) from the minimized 1-methyl-5-formylcytosine structures are consistent with those in the crystal structures (Supplementary Figure S8). Our calculations also indicate that the intramolecular hydrogen bonding contributes to the conformational stability of the f5C residue. As shown in Supplementary Figure S9, although the 5-formyl group is still coplanar with the cytosine ring after rotating for 180°, which stands for another relatively stable f5C conformation without the internal hydrogen bonding, this rotated conformation results in the overall free energy increase of 4.7 kcal/mol compared to the present one in the crystal structures. We then compared the f5C-containing duplex structures to an ideal RNA structure model with the same sequence that was generated using the software Coot (55). The duplexes align well with an r.m.s.d of about 1.2 Å; major differences are localized in the two terminal base pairs (Figure 2E). The comparison of C-G pairs in the f5C-containing duplex with the ideal duplex showed slightly different extents of base-pairing buckle with almost identical hydrogen bonding strengths (Figure 2F). In addition, we also compared our f5C-containing duplex structure to the crystal structure of a DNA with the same sequence (PDB ID: 197D) and another RNA structure with similar sequence (5′-(GUAUAUAC)2-3′; PDB ID: 246D). All duplexes and local C-G pairs have very similar structures (Supplementary Figure S10). The further comparison between our ribo-f5C residue and the deoxyribo-f5C from previously published DNA structures (PDB ID: 1VE8, 4QC7 and 4QKK) (56,57) also showed very high structural isomorphism with similar internal hydrogen bonding strength (Supplementary Figure S11), although the sugar puckers are different in A-form RNA and B-form DNA. We further compared the geometric parameters of all the base pairs and base-pair steps in the f5C-containing duplex with the ones in the ideal RNA duplex using 3DNA software tools (58). The average values of these parameters are generally quite similar in the modified and native counterparts (Supplementary Tables S2 and 3). Figure 3 depicts the most characteristic and significant structural changes induced by the 5-formyl group. There were changes in buckle, opening, shift and tip, although it is possible that these parameters may also be partially impacted by the crystallization conditions. The f5C-G pair shows approximately a 5° positive buckle and a 2° negative opening (Figure 3A and B), different as the other pairs in both duplexes. The f5C-G pair involved steps show the most negative shifts of 0.2–0.3 Å and the most positive tip values (Figure 3C and D). Adjacent pairs compensate for these local effects resulting in similar average parameters for the modified and native duplexes. As the hydrogen bonding strength of f5C-G pair is very similar to that of the native C-G pair, the integrated local structural changes caused by the 5-formyl group together with the expanded planar system of f5C most likely increase the base-pair stacking interactions, resulting in enhanced thermal stability of the f5C-containing duplex. Indeed the f5C-involved steps have larger overlap areas compared to those in the unmodified duplex, especially in the region where the 5-formyl group lies on the top of the 5′ neighboring A3 residue (Table 3 and Supplementary Figure S12). As a result, the total overlap area of all base steps in f5C-containing duplex is 5 Å2 more than that in the unmodified duplex.
Figure 3.

Local base pair and base-pair step parameters (A) buckle (°), (B) opening (°), (C) shift (Å) and (D) tip (Å) for f5C-containing (triangle) and native (circle) duplexes. The f5C involved base pairs are labeled in red.

Table 3.

Overlap of base pair steps in native and f5C-containing duplexes

EntryStepsTotal overlap area (Å2)a
Nativef5C duplex
1GU/AC9.6710.03
2UA/UA2.082.04
3AC/GU9.6812.17
4CG/CG4.584.11
5GU/AC9.6812.30
6UA/UA2.091.79
7AC/GU9.6810.26
Overall47.4652.7

aThe total overlap area includes both intra-strand and inter-strand overlap within the four bases of each base pair step.

Local base pair and base-pair step parameters (A) buckle (°), (B) opening (°), (C) shift (Å) and (D) tip (Å) for f5C-containing (triangle) and native (circle) duplexes. The f5C involved base pairs are labeled in red. aThe total overlap area includes both intra-strand and inter-strand overlap within the four bases of each base pair step.

Hydration of f5C in major groove

The formyl group is located in the major groove where it may affect hydration pattern and duplex hydrophobicity of RNAs, which influence their biochemical properties and enzyme recognition processes. Therefore, we investigated the hydration pattern along A3-f5C4-G5 and their interactions with waters in the major groove within the typical hydrogen bonding range of 2.8–3.4 Å. The locations of these waters in the structure of P32 space group are shown in Figure 4. A water molecule (W1) bridges from the 5-formyl oxygen to the phosphate backbone of the A3 residue. The additional water molecules, W2 and W3, that links the N7 of A3 further expand the interactions of f5C with the neighboring residues. There are two more water molecules (W4 and W5) that connect the N4 of f5C and the O6 of G5′, its pairing partner on the opposite strand. In addition, f5C also has interactions with its 3′ neighboring G5 residue through the networking of W4-W8-W9-W10, which are further bound to the two oxygen atoms in the phosphate backbone. Very similar hydration pattern in the P21 structure (Supplementary Figure S13) under different crystallization conditions is also observed, indicating this water-network is highly conserved and not pH-dependent. In addition, the f5C in DNA duplex (PDB ID: 4QKK) also shows a similar water-bridging network as showed in Supplementary Figure S14. These hydrogen bonds might pre-organize the conformation of the 5-formyl group and RNA backbone, rigidifying the single stranded RNA and reducing the entropy cost during the duplex formation, therefore resulting in the higher stability of the f5C-modified duplex.
Figure 4.

Local hydration pattern of A3-f5C4-G5 in the major groove of the duplex structure with the space group P32. The hydrogen bonds with water molecules within the range of 2.8–3.4 Å are represented by the dashed lines.

Local hydration pattern of A3-f5C4-G5 in the major groove of the duplex structure with the space group P32. The hydrogen bonds with water molecules within the range of 2.8–3.4 Å are represented by the dashed lines. The solvation energies of the f5C-modified and native RNA duplexes were calculated using the ‘solvate’ module of the SEQMOL program. The f5C-containing RNA duplex had lower solvation energy and ∼200 Å2 less accessible surface area than the native duplex (Supplementary Table S4). This indicates that the 5-formyl group actually reduced hydration of the duplex probably by disrupting the water molecules in the narrow and deep major groove of A-form RNA duplex (59), despite the water molecules observed bound to the f5C residue. The decreased solvation might also contribute the duplex stability, as it has been shown to do for RNA duplexes modified with 2-thiouridine and with 2-thiocytosine (60,61).

CONCLUSION

In this work, we have studied the thermodynamic stability and base pairing specificity of f5C in different RNA duplexes. We found that relative to the natural cytosine, the 5-formyl modification increased duplex stability and enhanced the base pairing discrimination for C:G pair relative to other mispairs. Analysis of three high-resolution crystal structures of an f5C-modified octamer RNA duplex [5′-GUA(f5C)GUAC-3′]2 obtained under different conditions indicated that the 5-formyl group does not cause significant overall or local structural perturbations. The formyl group is located in the same plane as cytosine and forms a pH-independent hydrogen bond with the N4 position, expanding the conjugation system of the ring and increasing the base stacking between the f5C and the neighboring bases. Although the 5-formyl group reduced the overall duplex solvation energy, the f5C interacts with its neighboring residues through bridging water molecules, which might further contribute to the duplex thermal stability and the biochemical roles of f5C in RNA. We speculate that the duplex-stabilizing water-mediated interactions between the 5-formyl group and neighboring residues together with the conformation of 5-formyl group influences its potential epigenetic roles, which are based on the recognition of f5C residue by enzymes that oxidize this modified nucleotide to ca5C. The identification of the working enzymes, the crystal structure of a ca5C-containing RNA duplex, and energy required to catalyze the reaction from f5C to ca5C in the duplex context are currently under investigation.

ACCESSION NUMBERS

The three RNA structures have been deposited in Protein Data Bank (www.rcsb.org) with the PDB IDs: 5HNJ (P21 duplex structure), 5HN2 (P33 duplex structure) and 5HNQ (C2 duplex structure). Click here for additional data file.
  56 in total

1.  Role of unsatisfied hydrogen bond acceptors in RNA energetics and specificity.

Authors:  Nathan A Siegfried; Ryszard Kierzek; Philip C Bevilacqua
Journal:  J Am Chem Soc       Date:  2010-04-21       Impact factor: 15.419

Review 2.  Comparative enzymology and structural biology of RNA self-cleavage.

Authors:  Martha J Fedor
Journal:  Annu Rev Biophys       Date:  2009       Impact factor: 12.981

3.  Thermodynamics of RNA-RNA duplexes with 2- or 4-thiouridines: implications for antisense design and targeting a group I intron.

Authors:  S M Testa; M D Disney; D H Turner; R Kierzek
Journal:  Biochemistry       Date:  1999-12-14       Impact factor: 3.162

4.  RNA oligonucleotide synthesis via 5'-silyl-2'-orthoester chemistry.

Authors:  Stephanie A Hartsel; David E Kitchen; Stephen A Scaringe; William S Marshall
Journal:  Methods Mol Biol       Date:  2005

5.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

6.  Sequence specific thermodynamic and structural properties for DNA.RNA duplexes.

Authors:  L Ratmeyer; R Vinayak; Y Y Zhong; G Zon; W D Wilson
Journal:  Biochemistry       Date:  1994-05-03       Impact factor: 3.162

7.  RNA biochemistry. Transcriptome-wide distribution and function of RNA hydroxymethylcytosine.

Authors:  Benjamin Delatte; Fei Wang; Long Vo Ngoc; Evelyne Collignon; Elise Bonvin; Rachel Deplus; Emilie Calonne; Bouchra Hassabi; Pascale Putmans; Stephan Awe; Collin Wetzel; Judith Kreher; Romuald Soin; Catherine Creppe; Patrick A Limbach; Cyril Gueydan; Véronique Kruys; Alexander Brehm; Svetlana Minakhina; Matthieu Defrance; Ruth Steward; François Fuks
Journal:  Science       Date:  2016-01-15       Impact factor: 47.728

Review 8.  5-methylcytosine in RNA: detection, enzymatic formation and biological functions.

Authors:  Yuri Motorin; Frank Lyko; Mark Helm
Journal:  Nucleic Acids Res       Date:  2009-12-08       Impact factor: 16.971

9.  Crystal structures of B-DNA dodecamer containing the epigenetic modifications 5-hydroxymethylcytosine or 5-methylcytosine.

Authors:  Daniel Renciuk; Olivier Blacque; Michaela Vorlickova; Bernhard Spingler
Journal:  Nucleic Acids Res       Date:  2013-08-20       Impact factor: 16.971

10.  Long non-coding RNAs as targets for cytosine methylation.

Authors:  Thomas Amort; Marie F Soulière; Alexandra Wille; Xi-Yu Jia; Heidi Fiegl; Hildegard Wörle; Ronald Micura; Alexandra Lusser
Journal:  RNA Biol       Date:  2013-04-01       Impact factor: 4.652

View more
  10 in total

1.  Oxidized Derivatives of 5-Methylcytosine Alter the Stability and Dehybridization Dynamics of Duplex DNA.

Authors:  Paul J Sanstead; Brennan Ashwood; Qing Dai; Chuan He; Andrei Tokmakoff
Journal:  J Phys Chem B       Date:  2020-02-05       Impact factor: 2.991

2.  5-Formylcytosine does not change the global structure of DNA.

Authors:  Jack S Hardwick; Denis Ptchelkine; Afaf H El-Sagheer; Ian Tear; Daniel Singleton; Simon E V Phillips; Andrew N Lane; Tom Brown
Journal:  Nat Struct Mol Biol       Date:  2017-05-15       Impact factor: 15.369

Review 3.  A molecular-level perspective on the frequency, distribution, and consequences of messenger RNA modifications.

Authors:  Joshua D Jones; Jeremy Monroe; Kristin S Koutmou
Journal:  Wiley Interdiscip Rev RNA       Date:  2020-01-21       Impact factor: 9.957

Review 4.  The Importance of Being Modified: The Role of RNA Modifications in Translational Fidelity.

Authors:  Paul F Agris; Amithi Narendran; Kathryn Sarachan; Ville Y P Väre; Emily Eruysal
Journal:  Enzymes       Date:  2017-04-22

5.  Identification and Quantification of Modified Nucleosides in Saccharomyces cerevisiae mRNAs.

Authors:  Mehmet Tardu; Joshua D Jones; Robert T Kennedy; Qishan Lin; Kristin S Koutmou
Journal:  ACS Chem Biol       Date:  2019-06-25       Impact factor: 5.100

Review 6.  Naturally occurring modified ribonucleosides.

Authors:  Phillip J McCown; Agnieszka Ruszkowska; Charlotte N Kunkler; Kurtis Breger; Jacob P Hulewicz; Matthew C Wang; Noah A Springer; Jessica A Brown
Journal:  Wiley Interdiscip Rev RNA       Date:  2020-04-16       Impact factor: 9.349

Review 7.  Dealing with an Unconventional Genetic Code in  Mitochondria: The Biogenesis and Pathogenic  Defects of the 5-Formylcytosine Modification in  Mitochondrial tRNAMet.

Authors:  Lindsey Van Haute; Christopher A Powell; Michal Minczuk
Journal:  Biomolecules       Date:  2017-03-02

8.  Nucleobase carbonyl groups are poor Mg2+ inner-sphere binders but excellent monovalent ion binders-a critical PDB survey.

Authors:  Filip Leonarski; Luigi D'Ascenzo; Pascal Auffinger
Journal:  RNA       Date:  2018-11-08       Impact factor: 4.942

9.  Atranorin driven by nano materials SPION lead to ferroptosis of gastric cancer stem cells by weakening the mRNA 5-hydroxymethylcytidine modification of the Xc-/GPX4 axis and its expression.

Authors:  Zhentian Ni; Xiaoli Nie; Hairong Zhang; Lingquan Wang; Zixiang Geng; Xiling Du; Haiyang Qian; Wentao Liu; Te Liu
Journal:  Int J Med Sci       Date:  2022-09-25       Impact factor: 3.642

Review 10.  Challenges with Simulating Modified RNA: Insights into Role and Reciprocity of Experimental and Computational Approaches.

Authors:  Rebecca J D'Esposito; Christopher A Myers; Alan A Chen; Sweta Vangaveti
Journal:  Genes (Basel)       Date:  2022-03-18       Impact factor: 4.141

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.