Literature DB >> 35638108

The Dihydrouridine landscape from tRNA to mRNA: a perspective on synthesis, structural impact and function.

Olivier Finet¹, Carlo Yague-Sanz¹, Florian Marchand¹, Damien Hermand¹.

Abstract

The universal dihydrouridine (D) epitranscriptomic mark results from a reduction of uridine by the Dus family of NADPH-dependent reductases and is typically found within the eponym D-loop of tRNAs. Despite its apparent simplicity, D is structurally unique, with the potential to deeply affect the RNA backbone and many, if not all, RNA-connected processes. The first landscape of its occupancy within the tRNAome was reported 20 years ago. Its potential biological significance was highlighted by observations ranging from a strong bias in its ecological distribution to the predictive nature of Dus enzymes overexpression for worse cancer patient outcomes. The exquisite specificity of the Dus enzymes revealed by a structure-function analyses and accumulating clues that the D distribution may expand beyond tRNAs recently led to the development of new high-resolution mapping methods, including Rho-seq that established the presence of D within mRNAs and led to the demonstration of its critical physiological relevance.

Entities: Chemical

Keywords: Dihydrouridine; Dus; RNA modification; epitranscriptome

Mesh：

Substances：

Year: 2022 PMID： 35638108 PMCID： PMC9176250 DOI： 10.1080/15476286.2022.2078094

Source DB: PubMed Journal: RNA Biol ISSN： 1547-6286 Impact factor: 4.766

The RNA backbone is a succession of covalently bound ribonucleotides whose nucleobase, ribose or 5’-extremity can be modified. To date, there are more than 150 known RNA chemical modifications, spanning the three domains of life and viruses. RNA modifying enzymes can work as (cofactor-dependent) stand-alone proteins, be part of a protein complex that is required for modification or are guided by small nucleolar RNAs (snoRNAs). tRNAs are the most heavily modified RNA species with up to 25% (in eukaryotes) and 15% (in prokaryotes) of their ribonucleotides being modified [1]. It is estimated that one-fifth of all known tRNA modifications is spread across all domains of life [2]. Ribosomal RNAs are also widely modified yet to a lesser extent with up to 2% of modified positions [3]. The abundance of rRNAs and tRNAs facilitated their study and the modification status of their building blocks. However, other RNA species have been known to carry post-transcriptional modifications (PTMs) for decades. Besides the well-characterized eukaryotic mRNA 5’-cap, internal modifications are found in coding RNAs, such as the highly abundant m6A, which was detected in mRNAs more than forty years ago [4]. In Eukarya, snoRNAs and snRNAs carry 5’-end and internal modifications but have a narrower range of PTMs [5]. Over the past decade, a still-increasing set of modifications including m6A, m6Am, m5C, hm5C, ψ, m1A or 2’-O-Me was mapped at transcriptome-scale, which has been largely discussed elsewhere, for example [6,7].

Dihydrouridine is structurally unique

The dihydrouridine (D, sometimes DHU) RNA modification is a modified pyrimidine nucleoside whose corresponding nucleobase is 5,6-dihydrouracil. D is synthesized from uridine (U) by hydrogenation [8] (Fig. 1A). Reduction of the uridine C5-C6 bond generates a saturated nonplanar and nonaromatic nucleobase that is a landmark of dihydrouridine (Fig. 1B). Although the chemical synthesis of dihydrouracil was already reported in 1896, its first detection in a biological sample dates back from 1952 when it was isolated from the beef spleen [9]. Using in vitro approaches, dihydrouridine monophosphate (dihydro-UMP) was shown to be efficiently introduced into RNA molecules, but whether D was a genuine component of cellular RNA was still to be determined [10,11]. D was then reported as a naturally occurring component of yeast tRNAAla and included in the first published structure of a ribonucleic acid [12,13] (Fig. 1C). Simultaneously to the publication of the tRNA sequence, Visser and colleagues were already discussing the possibility that ‘non-random distribution of hydrogenated pyrimidine [dihydrouridine] may be explained more readily by a process of enzymatic hydrogenation at the polynucleotide level’ ([11], p. 297).

Figure 1.

Biochemistry of dihydrouridine.

Biochemistry of dihydrouridine. In the next decades, optical studies along with X-ray and NMR crystallographic analyses elucidated the biochemical properties of dihydrouridine. The structures of the nucleobase, the nucleotide, of D-containing tRNAs and D-containing oligoribonucleotides led to the conclusions that; (I) the carbon 6 of the nucleobase (C6) is out of the plane after the C5-C6 double bond reduction, (II) the deviation from the planar nature of the pyrimidine results in the loss of the stacking ability with neighboring nucleobases, (III) the C2’-endo conformation is adopted by the ribose moiety whereas canonical ribonucleotides prefers the C3’-endo conformation (Fig. 1D) and (IV) the C2’-endo pucker is propagated to the 5’-nucleotide (references and comments in Table 1). Therefore, the complete destacking of the bases and the unusual adoption of C2’-endo ribose pucker make dihydrouridine a unique modification [14]. The structural properties of dihydrouridine include the potential destabilization of the RNA structure (by promoting C2’-endo conformation) and molecular flexibility (by spanning the sugar-phosphate backbone, Fig. 1E). In parallel, the D crystal structure revealed the adoption of gauche-trans or trans-gauche conformations around the C4’-C5’ ribose bond, rather than the common gauche-gauche rotamer (Fig. 1F, references in Table 1). In addition, the dihydrouracil nucleobase was shown to be in anti orientation in respect to the ribose moiety (i.e. with C2 = O pointing away from the sugar) (Movie 1, references in Table 1). More recently, the contribution of D in folding of the D loop was investigated by NMR, which revealed that the absence of D results in the stable stem-loop hairpin to adopt several undefined interconverting conformations in solution [15].

Table 1.

Seminal studies on optical and structural properties of dihydro-uracil/-uridine

Structural and optical studies	Comments	References
dihydrouracil	C5-C6 and N1 out of the base planeexpectation for Watson-Crick interaction impairment	[140]
dihydrouridine	deviation from planarityanti orientationC2’-endo conformation	[143]
	anti orientationfavoured gauche-trans and trans-gauche conformations	[149]
	deviation from planarityanti orientationpreference for C3’-exo and C2’-endo conformationspreference for gauche-gauche conformation	[85]
	deviation from planarityanti orientationC2’-endo conformationfavoured gauche-trans and trans-gauche conformations	[142]
	C6 out of the base planeno base stacking and anti orientationC2’-endo conformationfavoured gauche-trans and trans-gauche conformations	[141]
dinucleoside phosphate:DpA, ApD, GpD	DpA stackingApD and GpD low stacking	[137]
yeast or E. coli tRNAs	formation of a D-loop with increased hydrophobicityD provides extra flexibilityinteraction of D-loop and TψC-loop	[135,138,139,144–146]
trinucleoside phosphate: Dp(acp³U)pA	C2’-endo conformation and upstream propagation	150
trinucleoside phosphate:ApDpA	C6 out of the base planeno base stackingenhanced C2’-endo conformation at low temperature (5°C) and downstream propagation	[134,, 136]

The table is divided into three columns; (I) studied chemical entities, (II) main conclusions relative to the nucleobase (green writing) or to the ribose (blue writing) and (III) references. C (carbon), N (nitrogen), A (adenosine), D (dihydrouridine), G (guanosine), p (5’-3’ phosphodiester bond), acp3U (3-(3-amino-3-carboxypropyl) uridine).

Seminal studies on optical and structural properties of dihydro-uracil/-uridine The table is divided into three columns; (I) studied chemical entities, (II) main conclusions relative to the nucleobase (green writing) or to the ribose (blue writing) and (III) references. C (carbon), N (nitrogen), A (adenosine), D (dihydrouridine), G (guanosine), p (5’-3’ phosphodiester bond), acp3U (3-(3-amino-3-carboxypropyl) uridine). To sum up, D is the sole known non-aromatic modified nucleotide, a feature that promotes noncanonical ribose conformation and hinders proper base stacking. These features have potential deep consequences on many if not all aspects of processes implicating RNA. Indeed, the RNA ‘structurome’ affects splicing, translation or stability [16,17]. In the context of the epitranscriptomic landscape and its readers, it is worth mentioning that the C2′-endo conformation promoted by D is bound by RNA recognition motifs (RRM) and that D may therefore enforce the binding of the large repertoire of RNA binding proteins [18].

Detection of dihydrouridine

Since the discovery of D, several methods to detect this modified nucleotide were developed based on its physicochemical properties. Their recent adaptations to high-throughput sequencing are discussed in a separate section.

Sodium borohydride and alkaline treatments

Distinguishing the RNA modification from its canonical nucleotide is challenging. Chemical treatments affecting D are summarized in Table 2. The dihydrouridine undergoes ring opening upon sodium borohydride (NaBH4) or alkaline (OH−) treatments, resulting in the formation of an ureido-group (NH2CONH) linked to an alcohol or a carboxylic acid, respectively (Fig. 2) [19-21]. The accumulation of ureido-groups can be quantified by a colorimetric assay [22]. The ribosylureidopropanol (D + NaBH4) can be used for labelling of RNA with a fluorescent dye or for the cleavage of ureidopropanol upon acid conditions (H+) [23-30]. The ribosylureidopropionic acid (D + OH−) is used as a semi-quantitative tool following the breakdown between ribose and ureidopropionic acid, this latter being decomposed to β-alanine that serves as a substrate for a colorimetric assay with ninhydrin [31,32]. The D-ring disruption upon OH− condition was also shown to generate an RT (reverse transcription) termination assessed by primer extension [33]. Both treatments, NaBH4 or OH− result in an abasic site (nucleobase-free ribose) that leads to the cleavage of the RNA chain with aniline treatment [34,35]. These chemical reactivities of D have been recently exploited to develop transcriptome-wide mapping of D occupancy (see below).

Table 2.

Chemical reactions and techniques specifically applicable to D

Principles	tRNA	Comments	References
alkaline hydrolysis	/	sodium hydroxide → cleavage at the N3-C4 linkage of dihydrouracil	[19]
β-alanine detection	Y	sodium hydroxide → D-ring opening → partial β-alanine formation→ ninhydrin colorimectric assay	[32]
sodium borohydride reduction	Y	sodium borohydride → cleavage at the N3-C4 linkage of dihydrouridine	[20]
hydrochloric acid hydrolysis	B, Y, M	sodium borohydride → D-ring opening → hydrochloric acid→ cleavage of the glycosodic bond	[24,and b; 26]
loss of absorbance	Y	loss of absorbance at 265 nm upon mild sodium hydroxide treatment	[132]
cleavage at D position	Y	sodium borohydride → D-ring opening → aniline treatment→ RNA chain cleavage	[34]
ureido-group detection	B, Y, M	sodium hydroxide → D-ring opening → solution neutralization→ colorimetric assay after iron chloride addition (550 nm)	[22]
replacement of D by proflavine or EtBr	Y	sodium borohydride → D-ring opening → incubation with the dye	[29]
microarray	Y	differential fluorescent labeling of tRNA from WT and dus strains→ annealing on a microchip → quantification of A-U vs A-D interaction	151
primer extension	B, Y	sodium hydroxide → D-ring opening→ assessment of RT termination by primer extension	[23,33]
fluorescent labeling with rhodamine 110	B	in vitro tRNA → in vitro dihydrouridylation → sodium borohydride→ incubation with the dye	[23]
replacement of D by Cy3 or 5	B, Y	sodium borohydride → D-ring opening → incubation with the dye	[27]
benzoyhydrazide addition	B	sodium borohydride → THU formation → incubation with benzohydrazide	[42]
predictive modelling	B, Y	jackknife-based test to predict modified sites	[113,115]
nanopore sequencing	B	RNA through a nanopore scale → specific ionic signature	[116]

The table is divided into four columns; (I) list of principles, (II) phylogenetic origin of the studied tRNA; B (bacterial), Y (yeast), M (mammalian), (III) general comments on the principle. EtBr (ethidium bromide), N (nitrogen), C (carbon), RT (reverse transcription), WT (wild type), dus (dihydrouridine synthase gene), A-U (adenine-uridine Watson-Crick interaction), A-D (adenine-dihydrouridine Watson-Crick interaction), Cy3/5 (cyanines 3 or 5), THU (tetrahydrouridine) and (IV) references. Only the seminal works are cited although most of the techniques were implemented in other studies. Chromatographic applications are not listed here.

Figure 2.

Molecular fate of D upon sodium borohydride (NaBH4) or alkaline (OH−) treatments.

Molecular fate of D upon sodium borohydride (NaBH4) or alkaline (OH−) treatments. Chemical reactions and techniques specifically applicable to D The table is divided into four columns; (I) list of principles, (II) phylogenetic origin of the studied tRNA; B (bacterial), Y (yeast), M (mammalian), (III) general comments on the principle. EtBr (ethidium bromide), N (nitrogen), C (carbon), RT (reverse transcription), WT (wild type), dus (dihydrouridine synthase gene), A-U (adenine-uridine Watson-Crick interaction), A-D (adenine-dihydrouridine Watson-Crick interaction), Cy3/5 (cyanines 3 or 5), THU (tetrahydrouridine) and (IV) references. Only the seminal works are cited although most of the techniques were implemented in other studies. Chromatographic applications are not listed here. Thin-layer chromatography was first used to spot 3H-labelled D on a cellulose membrane after oxidation with an inorganic salt and reduction with borohydride [36]. In reverse-phase high performance liquid chromatography (HPLC), D is the earliest eluting nucleoside, sometimes barely distinguished from contaminants with short retention times. However, its detection is possible at 254 nm [37] and detectability is improved by lowering the wavelength at 210 or 230 nm [38]. Accurate determination of D is readily obtained by liquid chromatography coupled to mass spectrometry (LC-MS) [37].

Effects on double strand formation

The comparison of binding properties of polyU- vs polyD-nucleotides with polyA revealed a reduced ability of polyD to interact with polyA although the reduction of uridine occurs at Hoogsteen edge (C5 = C6) of the nucleobase whereas the hydrogen bonds of the Watson-Crick interactions are formed from positions 3 and 4 (N3-C4 = O) [39]. The decreased binding capacity is nevertheless too weak to induce a clear RT termination in a primer extension context [40]. Nevertheless, Phizicky and colleagues took advantage of the decreased D-A binding to implement a microarray-based technology to monitor the presence of D at specific tRNA positions [33,41].

Labeling with a fluorescent dye

As stated above, the treatment of D by sodium borohydride is a prerequisite for the subsequent incorporation of fluorescent molecules, and more specifically of NH2-dyes (e.g. proflavine, rhodamine, cyanine hydrazide). Fig. 2 shows that the D-ring opening forms ribosylureidopropanol in a NaBH4-dependent manner. However, this commonly accepted N3-C4 cleavage has been recently challenged [42]. Cooperman and colleagues induced tRNA reduction with sodium borohydride and performed labelling with NH2-containing fluorophores nucleophiles. By combining TLC (thin layer chromatography) and mass spectrometry, they detected tetrahydrouridine (THU) instead of the expected ribosylureidopropanol. Based on their results, they proposed that the dihydrouridine C4 carbonyl group is reduced by the H− donor NaBH4 to THU. Upon the addition of an NH2-dye in acid conditions, a nucleophilic substitution occurs on the C4 hydroxyl group by formation of a Schiff base-bearing intermediate called tetrahydrocytidine (THC) and consecutive fluorophore binding. Fig. 3 summarizes both mechanisms in the context of the addition of the rhodamine 110 fluorophore.

Figure 3.

Proposed mechanism of D rhodamine labeling following sodium borohydride (NaBH4) reduction.

Seminal landscape of the distribution of Dihydrouridine

Based on 602 tRNA sequences from viral, prokaryotic and eukaryotic species, D is the second most prevalent tRNA modification (925 counts) after pseudouridine (1,164 counts) [43]. The dihydrouridylated positions include the canonical D16, D17, D20, D20a, D20b and D47 (Fig. 4A) and the rare non-canonical D14, D17a, D21 and D48. Among the most frequent modified positions are D20 and D16 both positioned in the eponym D-loop that has a pivotal role in the establishment of secondary and tertiary structures of tRNAs [15,44]. The biochemical specificities of D play a role in the cloverleaf-related tRNA secondary structure and in the L-shaped tRNA tertiary structure that is achieved through D- and T-loops interaction (reviewed in [45]). Although the most important residues for the kissing D/T loops are not dihydrouridines, a compilation of crystal structures highlighted a set of base pairing events where D is involved through various types of interactions (cis or trans interactions between Watson-Crick, Hoogsteen and sugar edges) [46,47].

Figure 4.

The biology of dihydrouridine.

Dihydrouridine in eukaryotes

All eukaryotic tRNA species have been described with at least one dihydrouridine, with the exception of tRNAselenocysteine. Particularly, the 18 described cytoplasmic tRNAiMet only have D16 but no D17, D20, D20a, or D20b. In mitochondrial tRNAs (mt-tRNAs), all six canonical D are found, except D20b [43]. Importantly, there are dihydrouridylated mt-tRNAs that are encoded by the mitochondrial genome; at least three mammalian mt-tRNAs are known to have a D20 and accordingly, the putative cognate enzyme was shown to localize in human and murine mitochondria [48,49]. It has been shown that a set of mammalian mt-tRNAs have a truncated cloverleaf structure by lacking the D-loop [50,51]. Surprisingly, these tRNAs seem to adopt a functional tertiary structure by establishing unique interactions in a Mg2+-dependent manner [52,53]. Phizicky and colleagues identified the first dihydrouridine synthase Dus1 in 2002 in S. cerevisiae [54] based on a dihydrouridylation assay using a collection of GST-ORF fusion proteins. Homology searches highlighted three related enzymes encoded in the yeast genome. The specific modification of tRNAPhe by Dus1 and tRNALeu by Dus2 supported that Dus are substrate-specific enzymes, which was confirmed and expanded in an elegant microarray-based experiment relying on the D-dependent alteration of strands interaction. The resulting tRNA D landscape was U16 and U17 targeted by Dus1, U20 by Dus2, U20a and U20b by Dus4 and the extra D-loop U47 by Dus3 (Fig. 4A). The budding yeast quadruple dus mutant was shown to be viable and to lack any detectable dihydrouridine [33]. Early on, the D landscape was suspected to expand beyond tRNAs as D was found associated with plant and mammalian histone-bound RNAs [55,56] and detected on an enzymatic digest of rat U5 snRNA [57]. However, this was not further analyzed until recently (see below).

Dihydrouridine in prokaryotes

All canonical D residues (D16, D17, D20, D20a, D20b) are found in Bacteria. D47 is a scarce modification with a unique occurence described so far (tRNAMet of B. subtilis), although the position 47 in bacterial tRNAs is a U in almost 90% of the 134 known sequences. D20b is also very uncommon and found on a cyanobacterial tRNAGlu [43]. A peculiarity of the bacterial dihydrouridine landscape is the presence of a unique D on the 23S rRNA in Gram-negative (E. coli) and -positive (M. hominis, C. sporogenes) bacteria [58-60]. In E. coli, D is located at position 2449, a residue located in the highly conserved 23S rRNA central loop of domain V [59-61]. Remarkably, this region is part of the peptidyltransferase center that is also one of the sites of interaction with antibiotics targeting ribosomal activity. However, D2449 is dispensable in E. coli [62]. The Gram-positive C. acetobutylicum is the only known organism to have a D on its 16S rRNA. The modification occurs on position 1211 or 1212 but its function is still unknown [63]. Another type of non-coding RNA known to be dihydrouridylated is the Y RNA (a non-coding RNA involved in RNA degradation) from the γ-proteobacterium S. typhimurium. It has a DusA-dependent D and folds in a tRNA-like manner [64]. The de Crécy-Lagard laboratory implemented an in silico comparative genomic screen in order to find bacterial dus genes. Because no D was ever detected in P. furiosus, they assumed that the genome of this organism should not contain any dus gene, in contrast to other microorganisms such as E. coli or S. cerevisiae. By doing so, they found ortholog genes absent in the D-free P. furiosus species but present in other D-containing species [65]. This approach led to the identification of three E. coli Dus enzymes referred to as DusA, DusB and DusC that have non-redundant activities on tRNAs (Fig. 4B). E. coli DusB and DusC are mono-specific proteins that target U17 and U16, respectively while DusA substrates include U20 and U20a [66]. Importantly, this conclusion is not valid for the whole bacterial world as the DusB protein from Mycoplasma capricolum was shown to modify U17, U20 and U20a, making it the only known Dus enzyme to modify three different tRNA sites [67].

First insights into the biological relevance of Dihydrouridine

Implications in human health and development

The isolated tRNAPhe from malignant human tissues was reported to contain more dihydrouridines [68]. D is also present in urine samples and more significantly abundant in the urine of lymphoma patients [69,70]. This is in line with the idea that tumour tissues undergo high turnover of tRNAs that can be quantified and used as a noninvasive biomarker for diagnosis and treatment of cancer [38,71,72]. More generally, D can be considered as a metabolic modulator for a large set of pathological conditions; D is upregulated in the serum of patients with the major form of oral cancer in the world (oral squamous cell carcinoma) [73] and downregulated in the serum of mice bearing breast cancer [74]. D is associated with lethal prostate cancer [75]. At the molecular level, the human DUS2 protein (hDUS2) was shown to act as an inhibitory factor of the interferon-induced protein kinase PKR – whose kinase activity is enhanced in melanomas and colorectal cancers [76]. Similarly, the anti-cancer ginsenoside compound was shown to repress the expression of hDUS2 in human colorectal cancers cells [77]. To date, the most comprehensive study linking D with cancer was provided by Nakamura and colleagues [78]. hDUS2 showed a 3-fold overexpression in non-small cell lung carcinomas (NSCLCs) compared to healthy samples. The hDUS2 protein followed the same pattern, localized at endoplasmic reticulum and harbored a C-terminal double-stranded RNA binding motif (DSRM or dsRBD). In addition, an interaction between hDUS2 and the glutamyl-prolyl tRNA synthetase EPRS was reported. Phenotypically, the suppression of the tumor cell growth was observed after siRNA-dependent hDUS2 depletion and the NSCLCs patients with high levels of hDUS2 showed worse prognostics. The subsequent model was that overexpression of hDUS2 led to the hypermodification of tRNAs and consecutive increase of conformational flexibility. Because hDUS2 interacted with EPRS, they hypothesized that the tRNAs were more promptly charged in NSCLCs, which globally resulted in a more efficient translation although this remains to be established. In addition, increased D level was associated with worse outcomes in several cancers [79,80]. hDUS3 was also shown to be an inhibitor of the regenerative ability of the central nervous system [81]. Strikingly, the dihydrouridine was more abundantly detected during neural development in human embryonic stem cells [82].

Implications in prokaryotic growth

The quantitative detection of dihydrouridine led to the conclusion that psychrophilic bacteria have up to 70% more D on tRNAs than their mesophilic counterparts, which is in contradiction with the general observation that tRNAs of psychrophiles tend to be hypomodified [83]. The psychrophilic organisms that grow between 0°C and 20°C – with an optimum at 15°C – have the necessity to cope with low environmental temperatures, unlike the mesophiles bacteria that live above 20°C. It has been established by NMR that low temperatures tend to stabilize the C2’-endo conformation of the dihydrouridine ribose moiety [84,85]. The accumulation of D in psychrophilic prokaryotes could therefore constitute an evolutive adaptation to allow these organisms to maintain the conserved L-shaped conformation of tRNA, despite a growth at very low temperatures that otherwise could be detrimental for tRNA structure and function. More generally, the set of modifications present on a tRNA depends on environmental cues, such as the temperature. In agreement with this idea, the in vitro synthesis of dihydrouridine in the hyperthermophilic bacterium T. thermophilus is possible on an unmodified tRNAPhe at 60°C but not at 80°C where the tRNA substrate needs to carry other modified nucleosides [86]. At the transcriptional level, the gene coding for the mesophilic Clostridium botulinum DusB homolog is downregulated during a heat shock stress at 45°C. In line with the above principles, the bacterium would require less D at high temperatures and would therefore decrease the expression of its cognate enzyme [87]. DusC is also differentially regulated in response to the growth temperature in the thermophilic Bacillus manusensis bacterium [88]. An interesting case is found in the Archaea Methanococcoides burtonii that has a thermal niche around 2°C and an in vitro optimal growth at 23°C. It was noticed that despite its low percentage of modified tRNA nucleotides – only 2%, which is one of the lowest in the living world – this psychrotolerant archaeon has, on average, more than one D residue per tRNA [89]. The fact that an archaeon living at low temperatures and having a limited tRNA epitranscriptome possesses D residues is another clue that D impacts the flexibility of tRNAs. Finally, D was also shown to be dramatically affected when the Lactobacillus agilis gastrointestinal bacterium is grown on an alternative energy source, underlying again the dynamic regulation of this modification [90].

Structure-function analyses of the Dus enzymes

Dus are flavin-dependent enzymes that function similarly to the dihydroorotate and dihydropyrimidine dehydrogenases. Based on kinetics and structural data, an FMN- and NADP-dependent enzymatic mechanism has been proposed [91-93]. The reductive half-reaction is initiated with NADPH binding to the Dus enzyme. The NADPH transfers a hydride to the Dus-bound flavin prosthetic group (FMN to FMNH−), which reduces the enzyme. The dissociation of NADP results in a free reduced enzyme that binds the tRNA. In the oxidative half reaction, the nucleophilic C6 of uridine is reduced. The second hydride transfer to C5 likely occurs through the oxidation of a highly conserved Dus cysteine residue (Cys) that argues in favour of an evolutionary conserved mechanism (Fig. 4C) [94,95]. Interestingly, the reduced Dus enzyme has a very slow oxidative half-reaction when an in vitro transcribed tRNA is used as a substrate, contrary to a purified tRNA (from a dus- mutant). This strongly suggests that a tRNA already bearing modifications is the genuine substrate and that an ordering of modifications may exist in vivo [92]. To date, six Dus enzymes structures have been published (Fig. 4D). The seminal crystallographic structure of an unknown FMN-binding protein in T. maritima revealed an oxidoreductase enzyme with two domains; an N-terminal TIM barrel and a C-terminal helical domain [96]. Later, this enzyme was referred as a dihydrouridine synthase. The T. thermophilus DusA crystal highlighted the same general structure [93]. Moreover, the FMN cofactor (flavin mononucleotide) was captured in a positively charged groove at the center of the N-terminal domain corresponding to the catalytic site. DusA-tRNAPhe complex revealed that DusA interacts with the D-stem loop, the anticodon stem loop and the T-stem loop of tRNA and that the D-loop but not the D-stem is strongly distorted when DusA is bound. The third published bacterial Dus structure was DusC from E. coli that also displays a two-domain conformation with an N-terminal catalytic domain and a C-terminal RNA binding domain [97,98]. The structural similarities between T. thermophilus DusA and E. coli DusC led to the hypothesis that they share the same catalytic mechanism. Remarkably, notable structural dissimilarities were discovered by comparing the bacterial DusA (targeting tRNA-U20 and 20a) and DusC (targeting tRNA-U16). Both enzymes adopted the same general fold – while having different substrate specificities – but bound and recognized the tRNA in different orientations. The tRNA binding differed by a 160° rotation that resulted in the proper integration of the targeted uridine in the catalytic pocket. This trademark way of catalyzing a reaction is unique in RNA enzymology and is achieved through specific binding signatures. According to its target, each Dus enzyme has a cluster of amino acids – that is phylogenetically conserved in Bacteria – that defines the docking of tRNA to allow the reduction of a specific uridine [97]. The missing DusB structure was provided by Hamdane and colleagues [66]. Even though the crystal was incomplete, it was concluded that E. coli DusB adopted the same overall structure with an N-terminal TIM barrel fold carrying the catalytic function and a C-terminal helical domain. The tRNA docking in DusB was similar to the one of DusC, which makes sense since DusB and C modify neighboring nucleotides (17 and 16). However, a major difference between DusB and C relied on the positioning of the nucleobase into the catalytic center. Reversed polar and nonpolar amino acids in the catalytic pocket of DusB led to the 180 degrees rotation of the nucleobase that is targeted for reduction. In conclusion, diversification of bacterial Dus specificities was made possible through two astonishing strategies; nucleobase rotation or tRNA docking rotation (Fig. 5). More recently, the DusA structure from P. aeruginosa was resolved by combining in vitro and in silico methods [99].

Figure 5.

Molecular strategies of bacterial Dus enzymes for substrate specificities.

Molecular strategies of bacterial Dus enzymes for substrate specificities. The only available eukaryotic Dus structure is the human DUS2. hDUS2 is particularly important because it is associated with pulmonary carcinogenesis and, unlike other Dus enzymes, harbors a dsRBD. This domain turns out to be conserved in animals (mammals, amphibia, flatworms, nematodes, insects) [94]. The sequence similarity is quite low between hDUS2 and bacterial Dus or even yeast Dus2, suggesting a potential novel strategy for the substrate recognition [100]. The analogy between bacterial Dus and hDUS2 includes an N-terminal catalytic domain folded in a TIM barrel, an interaction of the catalytic domain with FMN, a high sequence conservation of the residues required in the active site – including the cysteine as an H− donor and the presence of a central helical domain. The C-terminal dsRBD domain was shown to be necessary but not sufficient for D20 synthesis on yeast tRNA extracted from a ∆dus2 strain. Furthermore, this domain was suggested to serve as primary tRNA binding site, before the canonical helical domain [101]. Using a dsRBD for tRNA recognition was never reported before, making hDUS2 the only known tRNA modifying enzyme to adopt this strategy. This unexpected feature was recently clarified by showing that the helical domain of hDus2 was less electropositive than its yeast orthologue partly because of the absence of two lysine residues, which played a role in the emergence of a new tRNA binding mode [102,103].

New high-resolution methods to decipher the transcriptome-wide distribution of Dihydrouridine

As indicated above, NaBH4 or OH− treatments result in an abasic site (nucleobase-free ribose) leading to the cleavage of the RNA chain with aniline treatment [34,35]. This led to the development of AlkAniline-Seq, a method relying on the chemistry-based enrichment of sequencing libraries with fragments containing certain modifications including D [104,105]. While AlkAniline-Seq responded to the presence of D, the signal strength was considerably lower than for other marks including m7G. This is likely due to the incomplete formation of abasic sites at D position following limited alkaline hydrolysis. On a completely different basis, RNA-mediated activity-based protein profiling (RNABPP) relies on metabolic RNA labelling, mRNA interactome capture and quantitative proteomics. In this approach, a 5-halopyrimidine (typically 5-FUrd, 5-fluorouridine for DUS3L) is incorporated in RNA and form stable RNA-protein adducts with the modifying enzyme, allowing RNA interactome capture [106]. These experiments confirmed that U46-48 in the tRNA variable loop is the major substrate of DUS3L. Notably, the approach also revealed DUS3L crosslinking peaks on non-tRNA substrates including mature mRNAs [106]. These data are reminiscent of a previous work reporting that in cardiomyocytes, the human DUS3 homolog was shown to interact with polyA+ RNAs, raising the possibility of mRNAs hDUS3-specific modification in this specialized cell type [107]. We recently introduced Rho-seq to globally detect the presence of D residues on RNA [108]. In a two-step protocol, total RNA is first incubated with NaBH4 and then covalently bound by the rhodamine fluorophore (Rho). From there, the Rho-seq protocol unfolds as follows; (I) RNA extraction from WT and ∆4 strains (lacking the four dus genes), (II) Rho+ and mock-treatments, (III) ribodepletion, (IV) cDNA synthesis and library preparation, (V) strand-specific deep-sequencing, (VI) data analysis by implementation of a multifactorial analysis. The detailed protocol is available in a separate publication [109]. Rho-seq provided the first transcriptome-wide Dus1 to 4-specific repertoire of the distribution of D and highlighted that D is an integral component of yeast and human mRNAs, though at low level. These data also provided a framework for the previous findings that DUS1 and DUS3 cross-link to mRNAs in both yeast and human [110,111]. The absence of detected D-sites on the bacterial coding transcriptome suggests that mRNA dihydrouridylation may be a eukaryotic-specific mechanism although this requires additional support [108]. A set of in silico tools were developed to predict the presence of D on various RNA types. These computational predictions need however to consider a larger set of proven D-containing sequences to build more objective tools [112-115]. Finally, the detection of modified nucleotides with the Oxford Nanopore Technologies is still in its infancy but further developments will likely confirm the seminal detection of D on E. coli tRNAs through the nanoscale pore [116].

Is dihydrouridine an adaptative mark regulating translation?

Similarly to other mRNA modifications, the detected D-sites could be relevant in a specific physiological context. Two previously reported cases are illustrated in the literature: induced pseudouridylation upon heat shock and widespread methylation during the meiotic programme [117,118]. We found that the deletion of dus3 in fission yeast specifically affects meiotic chromosome segregation and the translation of a set of proteins including tubulins while the mitotic cell cycle is barely affected [108]. Ribosome profiling and proteomics revealed that a dihydrouridylated mRNA can be translated but that the modification affects this process, which may result in modulation of the translational speed and ribosome stalling [108]. In the late sixties, several teams investigated the in vitro coding properties of D-containing oligoribonucleotides. Despite the technical boundaries encountered at that time, they concluded that the presence of D resulted in the loss of coding ability. Rottman and Cerutti showed that a ribopolymer carrying 4.2% of D (and 95.8% of U) lost up to 60% of Phe(UUU) residues incorporation into the protein [119]. Another study highlighted the complete loss of ability for GUD, GDU and GDD trinucleotides to code for Val (GUU) [120]. The conclusion was the same for the dihydrouridylation of the AUG (Met) codon [121,122]. These were the first data supporting that D-containing mRNAs could be translationally repressed. The mRNAs encoding several subunits of the CCT (chaperonin-containing tailless complex polypeptide 1) complex, which is required for the folding of newly synthesized tubulin and actin proteins [123], contain D. A possibility to investigate is that the D modification modulates the speed of translation to allow co-translational folding. Interestingly, Dus2 was shown to help in the detoxification of the amyloid-β peptides which form aggregates in the Alzheimer’s disease [124]. As stated above, the alterations of the expression level of some Dus enzymes and the D landscape in cancers will be the topic of additional investigations to understand if translation is affected and to test if this is mediated by modified mRNAs and/or tRNAs.

Perspectives

It took almost 40 years from the first detection of dihydrouridine within tRNAs to the discovery of Dus enzymes, and 20 more years to expand the D landscape to the coding RNA world. Recent advances open up several interesting research directions. A peculiarity of E. coli is the 23S-D2449 deposition is independent of the dusA-B-C genes [108]. To our knowledge, this is the first experimental evidence suggesting that D could be deposited by a non-canonical dihydrouridine synthase. In Bacteria, uracil can be reduced to dihydrouracil by the dihydropyrimidine dehydrogenase complex formed by PreA and PreT [125]. Noticeably, the dihydropyrimidine dehydrogenase is of clinical importance in human because deficiency in this enzyme (DPYD in H. sapiens) leads to a severe sensitivity to the administration of 5-fluorouracil, an agent used widely to treat cancer [126]. The catalytic activity of the DPYD enzyme is reversible, suggesting the immediate possibility that Dus enzymes may also function backward as D eraser to restore uracil from D in some circumstances. A proof of concept supporting this possibility was already reported [92]. It will also be of interest to explore the D landscape in plants where it is predicted to be present on mRNAs as well [127]. Interestingly, D could be a developmentally regulated modification in A. thaliana where the DUS orthologs are expressed at low level in rosette leaves and apex tissues, unlike other RNA modifying enzyme genes [128]. Deciphering the synthesis of a carboxypropylated dihydrouridine (acp3D) from D in T. brucei is another important perspective to unravel the biology of this complex modification [129]. Another interesting possibility discussed above is the capacity for D to alter the recognition of RNA by interacting proteins. While the possibility of D readers will be investigated by classical unbiased approaches, a pilot study predicted 20 RNA binding-proteins (RBPs) to interact with D [114]. Interestingly, these RBPs are enriched for the alternative mRNA splicing process, including the SRSF9, SFPQ and ESRP2 splicing factors. The abundant, simple yet fascinating D modification is only about to reveal its importance. Click here for additional data file.

145 in total

Review 1. Identification of modified residues in RNAs by reverse transcription-based methods.

Authors: Yuri Motorin; Sébastien Muller; Isabelle Behm-Ansmant; Christiane Branlant
Journal: Methods Enzymol Date: 2007 Impact factor: 1.600

2. The nuclear 5S RNAs from chicken, rat and man. U5 RNAs are encoded by multiple genes.

Authors: A Krol; H Gallinaro; E Lazar; M Jacob; C Branlant
Journal: Nucleic Acids Res Date: 1981-02-25 Impact factor: 16.971

3. Identification of the tRNA-dihydrouridine synthase family.

Authors: Anthony C Bishop; Jimin Xu; Reid C Johnson; Paul Schimmel; Valérie de Crécy-Lagard
Journal: J Biol Chem Date: 2002-04-30 Impact factor: 5.157

4. Fluorescent labeling of tRNA dihydrouridine residues: Mechanism and distribution.

Authors: Jaskiran Kaur; Monika Raj; Barry S Cooperman
Journal: RNA Date: 2011-05-31 Impact factor: 4.942

5. 3-(3-amino-3-carboxypropyl)-5,6-dihydrouridine is one of two novel post-transcriptional modifications in tRNALys(UUU) from Trypanosoma brucei.

Authors: Jesper S Krog; Yaiza Español; Anders M B Giessing; Agnieszka Dziergowska; Andrzej Malkiewicz; Lluís Ribas de Pouplana; Finn Kirpekar
Journal: FEBS J Date: 2011-10-28 Impact factor: 5.542

6. An atlas of RNA base pairs involving modified nucleobases with optimal geometries and accurate energies.

Authors: Mohit Chawla; Romina Oliva; Janusz M Bujnicki; Luigi Cavallo
Journal: Nucleic Acids Res Date: 2015-06-27 Impact factor: 16.971

7. 15N-labeled tRNA. Identification of dihydrouridine in Escherichia coli tRNAfMet, tRNALys, and tRNAPhe by 1H-15N two-dimensional NMR.

Authors: D R Davis; R H Griffey; Z Yamaizumi; S Nishimura; C D Poulter
Journal: J Biol Chem Date: 1986-03-15 Impact factor: 5.157

Review 8. tRNA Modifications: Impact on Structure and Thermal Adaptation.

Authors: Christian Lorenz; Christina E Lünse; Mario Mörl
Journal: Biomolecules Date: 2017-04-04

9. MODOMICS: a database of RNA modification pathways. 2021 update.

Authors: Pietro Boccaletto; Filip Stefaniak; Angana Ray; Andrea Cappannini; Sunandan Mukherjee; Elżbieta Purta; Małgorzata Kurkowska; Niloofar Shirvanizadeh; Eliana Destefanis; Paula Groza; Gülben Avşar; Antonia Romitelli; Pınar Pir; Erik Dassi; Silvestro G Conticello; Francesca Aguilo; Janusz M Bujnicki
Journal: Nucleic Acids Res Date: 2022-01-07 Impact factor: 16.971

10. Accurate identification of RNA D modification using multiple features.

Authors: Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal: RNA Biol Date: 2021-03-17 Impact factor: 4.652