Literature DB >> 27937735

Deazaguanine derivatives, examples of crosstalk between RNA and DNA modification pathways.

Geoffrey Hutinet1, Manal A Swarjo2, Valérie de Crécy-Lagard1.   

Abstract

Seven-deazapurine modifications were thought to be highly specific of tRNAs, but have now been discovered in DNA of phages and of phylogenetically diverse bacteria, illustrating the plasticity of these modification pathways. The intermediate 7-cyano-7-deazaguanine (preQ0) is a shared precursor in the pathways leading to the insetion of 7-deazapurine derivatives in both tRNA and DNA. It is also used as an intermediate in the synthesis of secondary metabolites such as toyocamacin. The presence of 7-deazapurine in DNA is proposed to be a protection mechanism against endonucleases. This makes preQ0 a metabolite of underappreaciated but central importance.

Entities:  

Keywords:  DNA; Deazaguanine modifications; tRNA

Mesh:

Substances:

Year:  2016        PMID: 27937735      PMCID: PMC5699537          DOI: 10.1080/15476286.2016.1265200

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Introduction

DNA and RNA, the two cellular information polymers are made of very similar building blocks: nucleotides composed of a nitrogenous base, a five carbon sugar and a phosphate group. The seemingly small differences between the sugars (ribose in RNA and 2′-deoxyribose in DNA) and one of the four bases (thymine (T) in DNA and uracil (U) in RNA) do however dramatically change the properties of the two molecules. DNA is more stable as a double stranded helix, and has been recruited as the core matrix of genetic information. RNA is generally single stranded and more flexible, harboring different shapes and functions in the cell. Historically, the fields of DNA and RNA research had been kept quite separate with the two communities rarely interacting, but this has recently changed with 1) the popularity of the RNA world hypothesis, and the realization that DNA is a modified form of RNA; 2) the growing interest in RNA modifications that has revealed how similar many of the RNA and DNA modification machineries are (see reference [1] for a detailed discussion). While the first nucleoside modifications were found in genomic DNA, with the discovery of 5-methyl-deoxycytosine (m5dC or 5mC) in Mycobacteria in 1925, they are more diverse and chemically complex in RNA. Around twenty modifications have been found to date in DNA. Most of the known genomic DNA modifications are methylations, although more complex modifications do exist, particularly in phages. In contrast, more than one hundred modifications have been found in RNA, and some of these modifications are very complex, e.g., wybutosine which requires five enzymes for its synthesis. Modifications are mainly found in stable RNAs such as transfer-RNAs (tRNAs) and ribosomal-RNA (rRNAs), and a few are also found in messenger-RNA (mRNAs). The molecular and biochemical characterization of RNA modifying enzymes has revealed multiple cases of broad specificity for various RNA substrates. For example, some pseudouridine synthases have been shown to modify both mRNA and tRNA substrates, whereas methylases of the RlmD family methylate rRNAs in some organisms and tRNAs in others. More recently, examples of crosstalks have been observed between the RNA and DNA modification machineries. It is the case in the eukaryotic DNMT family of methyltransferases that is subdivided in three groups: DNMT1, 2 and 3. DNMT1 and DNMT3 methylate the carbon 5 of cytosines in DNA CpG sequences, playing key roles in epigenetic regulation. DNMT2, despite its high sequence similarity to DNMT1 and 3, modifies specific tRNAs, e.g., tRNAAspGUC, at the position C38. Crosstalks also exist in the APOBEC family of proteins that deaminate cytosines into uridines. Although the first proteins discovered in the APOBEC family were mRNA modifying proteins, almost all members of the APOBEC family also modify single stranded DNA. Similarly, some members of the ADAT family, first discovered as a family of deaminases responsible for conversion of adenosine to inosine in tRNAs, deaminate cytosine into uridine in DNA. Finally, the AlkB proteins are a family of oxidative dealkylases that are broadly distributed among sequenced organisms. The founding members of the AlkB family were initially identified as proteins involved in the repair of damaged DNA and RNA molecules (see ref [14] for review). It was later shown that members of this family, such as the mammalian ALKBH8, are responsible for the hydroxylation of mcm5U in tRNAs. In addition, members of the FTO subgroup were found to demethylate mRNAs, adding regulatory functions to the AlkB family. In retrospect, these examples of crosstalk are not surprising, as deamination and methylation reactions were already known for both RNA and DNA. The novelty mainly derives from the switch in specificity within an enzyme superfamily from a RNA to a DNA substrate. However, the recent discovery that DNA contains 7-deazaguanine derivatives, modifications that were believed to exist exclusively in tRNA, is a novel and unexpected example of crosstalk between the RNA and DNA modification machineries, which is the focus of this review.

Queuosine and archaeosine function in tRNA

Queuosine (Q) and archaeosine (G+) are long-known 7-deazaguanine derivatives that were thought to exist exclusively in tRNA molecules. Q is found in many eukaryotes and bacteria at the wobble position of tRNAs harboring G34U35N36 anticodon sequences. These tRNAs are responsible for the insertion of Asn, Asp, His and Tyr amino acids in proteins. A role for Q in decoding efficiency was demonstrated over 30 years ago and recent work by the Farabaugh laboratory has also shown that Q affects decoding accuracy in both directions depending on the codon. However, Q is not required for growth under most tested conditions and its physiological importance remains elusive, particularly as it was repeatedly lost in the course of evolution. In bacteria, the absence of Q does not seem to be critical in exponential growth but affects growth under stress conditions in a diverse set of organisms and is required for the synthesis of a virulence factor in Shigella. In mammals, the queuine base is acquired from the diet or microflora and the absence of both Q and tyrosine is lethal. A recent study in flies led to a model in which Q plays a key role in avoiding miscoding. Q is also a positive determinant for the activity of the tRNA methyltransferase DNMT2. G+ is found at position 15 of most archaeal tRNAs that harbor a G at that position and was recently found at position 13 in Thermoplasma acidophilum tRNAs. Theoretical studies predict that G+15 plays a structural role, stabilizing the L-shaped tRNA structure through strengthening the G15-C48 interaction. G+ is not essential in Haloferax volcanii nor in Thermococcus kodakarensis, but its absence leads to a cold sensitivity phenotype in extreme halophiles. Both the Q and G+ modifications require complex pathways for their synthesis and are derived from the 7-cyano-7-deazaguanine intermediate preQ0.

PreQ0 synthesis in bacteria and archaea

PreQ0 is synthesized de novo from GTP in many bacteria and archaea by a series of complex reactions catalyzed by four enzymes (Fig. 1, purple arrows). The first step of the preQ0 pathway, the formation of dihydroneopterin triphosphate (H2NTP), is not a dedicated step but is shared with the tetrahydrofolate (THF) and biopterin (BH4) pathways. It is catalyzed by GTP cyclohydrolase I (EC 3.5.4.16). Most organisms use a Zn2+-dependent GTP cyclohydrolase I (FolE), a member of the Tunnel-fold (T-fold) structural superfamily. However, in a third of sequenced bacteria and in most archaea, FolE is replaced by FolE2, another T-fold superfamily member that utilizes other metals. The first dedicated step of the preQ0 pathway is the conversion of H2NTP to 6-carboxy-5,6,7,8-tetrahydropterin (CPH4) by CPH4 synthase (EC 4.1.2.50, QueD). QueD is a member the COG0720 family that also contains close homologs involved in THF and BH4 synthesis, revealing a catalytic promiscuity that make these genes difficult to annotate by sequence similarity alone. CPH4 is then converted to 5-carboxydeazaguanine (CDG) by CDG synthase (EC 4.3.99.3, QueE), a SAM dependent iron-sulfur cluster protein. 7-Cyano-7-deazaguanine synthase (EC 6.3.4.20, QueC), then catalyzes the formation of preQ0 from CDG through the recently discovered intermediate 7-amido-7-deazaguanine (ADG) in two ATP dependent reactions.
Figure 1.

Deazaguanine derivative synthesis pathways. GTP is the preQ0 precursor in both bacteria and archaea (purple arrows). In most bacteria, four more enzymatic steps lead to the insertion of Q in tRNAs at position 34 (red arrows). In a few organisms, preQ0 can be transformed to secondary metabolites such as toyacamycin or sangivamycin antibiotics (red arrows, toy genes). In eukaryotes, queuine is salvaged (green circle) and directly transferred to tRNAs (green arrows). Bacteria salvage preQ1 (red circle), and both bacteria and archaea salvage preQ0 (purple circle). In archaea, preQ0 is transferred to position 15 of tRNA before being modified to G+ (blue arrows). PreQ0 and ADG have been found in bacterial DNA (dashed red arrows) and G+ in phage DNA (dashed yellow arrow). All dashed arrows represent uncharacterized reactions. All molecule abbreviations and protein names are described in the text.

Deazaguanine derivative synthesis pathways. GTP is the preQ0 precursor in both bacteria and archaea (purple arrows). In most bacteria, four more enzymatic steps lead to the insertion of Q in tRNAs at position 34 (red arrows). In a few organisms, preQ0 can be transformed to secondary metabolites such as toyacamycin or sangivamycin antibiotics (red arrows, toy genes). In eukaryotes, queuine is salvaged (green circle) and directly transferred to tRNAs (green arrows). Bacteria salvage preQ1 (red circle), and both bacteria and archaea salvage preQ0 (purple circle). In archaea, preQ0 is transferred to position 15 of tRNA before being modified to G+ (blue arrows). PreQ0 and ADG have been found in bacterial DNA (dashed red arrows) and G+ in phage DNA (dashed yellow arrow). All dashed arrows represent uncharacterized reactions. All molecule abbreviations and protein names are described in the text. Crystal structures have been determined for all preQ0 synthesis enzymes. FolE, FolE2 and QueD are members of the T-fold structural superfamily of multimeric pterin and purine binding proteins. T-fold enzymes are classified into two structural subfamilies: a unimodular subfamily, composed of proteins that are formed from subunits possessing a single T-fold domain, and a bimodular subfamily, composed of proteins formed from subunits possessing tandem T-fold domains. FolE and QueD are, respectively, homodecameric and homohexameric unimodeular T-fold enzymes.; whereas FolE2 is a homotetrameric bimodular enzyme In all three proteins, the active sites lie at the interfaces between monomeric subunits. QueC exists as a homodimer, and its monomeric subunit is constituted of an N-terminal domain possessing the Rossman fold architecture characteristic of many nucleotide binding proteins, and a helical zinc-binding C-terminal domain, with the active site predicted to be at the interface between the two domains. QueE is a homodimer built around a modified AdoMet radical fold. Except for QueC, the structural details of substrate and cofactor recognition have been elucidated for all the dedicated preQ0 synthesis enzymes.

Q synthesis in bacteria

Three more steps are required to synthesize Q in bacteria (Fig. 1, red arrows). The preQ0 precursor is first reduced to 7-aminomethyl-7-deazaguanine (preQ1) by the NADPH-dependent 7-cyano-7-deazaguanine reductase (EC 1.7.1.13) enzyme QueF. QueF is closely related to FolE in primary structure, and the two families can be distinguished at the sequence level by a QueF-specific motif involved in NADPH binding and by the FolE-specific zinc binding residues. QueF is also a T-fold enzyme and, like GTP cyclohydrolase I, is represented by the unimodular and bimodular T-fold subfamilies. In unimodular QueF, a homodecameric enzyme, catalysis occurs at the intersubunit interfaces; whereas in bimodular QueF, a homodimer, catalysis occurs at the intrasubunit interface between the two T-fold modules, not at the intersubunit interface as in the bimodular FolE2. The entire pathway to preQ1 is independent of tRNA. tRNA comes into play when the bacterial tRNA-guanine transglycosylase (EC 2.4.2.29, bTGT) enzyme removes the G base at position 34 from the target tRNAs and replaces the base with preQ1 (Fig. 1). Extensive biochemical and structural studies on bTGT have shown that the enzyme is a homodimer that binds one tRNA molecule with its U33G34U35 sequence as the recognition element. The monomer consists of an irregular (β/α)8 TIM barrel domain that harbors specific loop insertions that guarantee proper tRNA recognition, and a tightly attached C-terminal zinc-binding domain involved in dimerization (Fig. 2). A reaction mechanism has been proposed in which G34 of tRNA first binds in the purine recognition pocket, constituted of the residues D102, S103, D156, Q203 and G230 (Zymomonas mobilis TGT protein numbering, Fig. 2). Nucleophilic attack by an active-site aspartate side chain (D280 in Z. mobilis TGT) on the ribosyl C1′ of G34 detaches the guanine base from the RNA, and results in formation of a covalent intermediate between TGT and RNA. A subsequent conformational change in the active site facilitates the release of the detached guanine base from the pocket and the binding of preQ1 in the same pocket with L231 and A232 as specificity determinants. Deprotonation of N9 of the incoming preQ1 by another active-site aspartate (D102) allows nucleophilic attack by N9 on ribose C1′ to form the preQ1-tRNA product.
Figure 2.

Catalytic and substrate recognition residues in different TGT subgroups. Partial alignments of Zymomonas mobilis (Zm) bTGT (PDB 1WKF), Escherichia coli (Ec) bTGT (RefSeq AMK98755), Homo sapiens (Hm) QTRT1 (RefSeq AAH15350), Homo sapiens QTRTD1 (RefSeq EAW79613), Pyrococcus horikoshii (Ph) arcTGT (PDB 1IQ8), Salmonella enterica serovar Montevideo (Sm) bDpdA (Genbank EFY12575), and E. coli phage 9g pDpdA (RefSeq YP_009032326). The conserved residue for substrate recognition, zinc binding, and catalytic activity are shown in bold and numbered. The arcTGT exhibits three supplementary C-terminal domains (C1, C2 and PUA), represented in black on the archaea line.

Catalytic and substrate recognition residues in different TGT subgroups. Partial alignments of Zymomonas mobilis (Zm) bTGT (PDB 1WKF), Escherichia coli (Ec) bTGT (RefSeq AMK98755), Homo sapiens (Hm) QTRT1 (RefSeq AAH15350), Homo sapiens QTRTD1 (RefSeq EAW79613), Pyrococcus horikoshii (Ph) arcTGT (PDB 1IQ8), Salmonella enterica serovar Montevideo (Sm) bDpdA (Genbank EFY12575), and E. coli phage 9g pDpdA (RefSeq YP_009032326). The conserved residue for substrate recognition, zinc binding, and catalytic activity are shown in bold and numbered. The arcTGT exhibits three supplementary C-terminal domains (C1, C2 and PUA), represented in black on the archaea line. Two subsequent reactions are required to synthesize the final Q product. First, S-adenosylmethionine tRNA ribosyltransferase-isomerase (EC 2.4.99.17, QueA) catalyzes the addition of an epoxycyclopentandiol ring to preQ1 to form the epoxyqueuosine (oQ). QueA uses the ribose moiety from S-adenosyl-methionine (SAM), and transfers it to the ammonium moiety of preQ1. Finally, the epoxyqueuosine reductase (EC 1.17.99.6) reduces oQ into Q60. In E. coli, this reaction is catalyzed by QueG, a B12-dependent enzyme with iron-sulfur clusters. Because QueG is missing in many organisms that harbor Q in tRNA (Zallot and de Crécy-Lagard, unpublished), it is expected that non-orthologous enzymes catalyzing the same reaction are yet to be discovered in these organisms.

G+ synthesis in archaea

ArcTGT is the archaeal bTGT homolog that inserts preQ0 at position 15 (or 13 in specific organisms) in the dihydrouridine arm (D-arm) of target tRNAs (Fig. 1, blue arrows). Like bTGT, arcTGT functions as a homodimer. The monomer is constituted of an N-terminal catalytic domain, and three arcTGT-specific C-terminal domains C1, C2 and C3/PUA (PseudoUridine synthase and Archaeosine transglycosylase domain). The catalytic domain is built around an (α/β)8 TIM barrel subdomain and a zinc-binding subdomain, making it very similar in structure to the bTGT monomer (Fig. 2). This catalytic domain also mediates the homodimerization of arcTGT through the zinc-binding sites, similar to bTGT homodimerization. The C1 domain provides an additional homodimerization interface, while the C2 and C3/PUA domains provide a scaffold for tRNA binding. However, unlike the bTGT homodimer which binds one tRNA molecule, the arcTGT homodimer binds two tRNA molecules such that each tRNA interacts with both protein subunits. One subunit provides binding capacity through its C-terminal domains, while the other subunit performs the reaction. ArcTGT distorts the tRNA structure from the canonical L-shaped form to the λ-form in which the D-arm is single stranded and protruded, allowing G15 to be accessible for transglycosidation. The enzyme presumably positions the exposed G15 in the active site by “counting” the nucleotides from G1 to G15 in the λ-form. Although the substrate binding pocket and catalytic residues are conserved between the bacterial and archaeal enzymes, the replacement of L231/A232 in bTGTs by V197/V198 in arcTGTs (Fig. 2) allows for the change of substrate specificity from preQ1 to preQ056. Once preQ0 has been inserted into the target archaeal tRNA, one last amidotransferase step is required to produce the G+ modification. Surprisingly, a great diversity of amidotranferase enzymes have been recruited to catalyze this reaction (Fig. 1). The first to be discovered was archaeosine synthase (EC 2.6.1.97, ArcS). ArcS catalyzes the conversion of preQ0–tRNA to G+-tRNA in an ATP independent manner and may use glutamine, asparagine as well as free ammonia as ammonium donors in vitro. ArcS is evolutionarily and structurally related to arcTGT. It has retained the three tRNA binding domains found in arcTGT but has acquired an insertion of four α-helices and 3 β-strands in its C1 domain that confer a Rossman fold architecture to the domain. This domain extension contains a conserved PCX3KPYX2SX2H motif proposed to be involved in catalytic activity. Like the TGT family, ArcS proteins form homodimers but are predicted to bind the L-form, not the λ-form tRNA. Many archaea that synthesize G+ lack ArcS homologs and, in those organisms, two non-orthologous enzymes were found to be responsible for the formation of G+ 65. The first is QueF-Like (QueF-L) which converts preQ0 base of preQ0-modified tRNA to G+, unlike the bacterial preQ1 synthase QueF which acts on the free preQ0 base as substrate. QueF-Like is also a homodecameric T-fold enzyme with a similar structure to that of QueF, except that its pentameric subunits are tightly wound, resulting in elimination of the QueF-specific NADPH binding site. The second enzyme that replaces ArcS in some archaea is Gat-QueC, a homolog of QueC fused to a glutamine amidotransferase class-II domain (GATase). This enzyme is yet to be biochemically characterized, and it remains unclear whether it acts before or after integration of preQ0 in tRNA and, consequently, whether the corresponding arcTGT integrates the G+ base or the preQ0 base in the tRNAs of these organisms.

Q salvage in eukaryotes

Most eukaryotes harbor Q in their tRNAs but do not produce the modification de novo. Instead, they salvage the queuine base (q) from their diet or microflora (see reference [67] for recent review) using transporters that are yet to be discovered. Members of the DUF219 family have recently been implicated in q salvage in eukaryotes. Structural modeling suggests a possible ribonucleotide hydrolase function for this family, but its exact role is yet to be determined. The eukaryotic TGTs (eTGT) insert the q base directly in tRNAs (Fig. 1, green arrow). eTGTs are heterodimers composed of a catalytic subunit (queuine tRNA-ribosyltransferase catalytic subunit 1 or QTRT1) and an accessory subunit that lacks the active site Asp residue (queuine tRNA-ribosyltransferase accessory subunit 2 or QTRT2, previously called QRTD1). Key differences between the eukaryotic and prokaryotic TGTs are in the substrate binding pocket where Val233 is replaced by a glycine and Cys158 is replaced by a valine. These changes enlarge the binding pocket and allow the binding of a larger substrate (q versus preQ0 or preQ1).

Seven-deazapurines salvage in bacteria

bTGT is the signature gene of the Q pathway. All organisms that harbor a bTGT homolog can harbor Q in tRNA and no organism lacking bTGT has ever been found to contain Q in tRNA (de Crécy-Lagard, unpublished). However, many organisms with bTGT homologs lack preQ0 synthesis enzymes and/or QueF. In these cases, the preQ0 or preQ1 precursor must be salvaged. preQ0 is certainly available in natural environments as an intermediate of the Q and G+ pathway as described above, but also as a secondary metabolite or secondary metabolite precursor. It also may be recycled from tRNA degradation products, because the free G+ base spontaneously deamidates to preQ0. Specific preQ1/preQ0 transporters of the ECF family (QueT and QrtT) have been predicted to be involved in 7-deazapurine salvage but have never been experimentally validated as such, and their sparse distribution suggests that other transporter families exist. We recently reported that the COG1738/YhhQ family is a preQ0-specific transporter found in E. coli and is widespread in bacteria. Of note, some bacteria such as Chlamydia trachomatis not only lack the precursor pathway but also QueA homologs. Queuine should therefore be salvaged in these bacteria. In summary, while the de novo Q pathway is well characterized, many open questions remain regarding the possibility of Q salvage in bacteria.

Discovery of dADG and dpreQ0 in bacterial DNA

Comparative genomic analyses led to the observation that two copies of the tgt gene are present in specific bacteria. One copy encodes the canonical bTGT, while the other copy encodes a larger, more divergent protein that exhibited common structural features with both arcTGT and bTGT enzymes. Careful sequence analysis showed that the catalytic residues and substrate binding pocket of arcTGT (e.g., D95, D249, V217, and V218 in P. horikoshii arcTGT numbers) are conserved in this protein (Fig. 2), suggesting that it may bind preQ0. In addition, the tRNA-binding C3/PUA domain characteristic of arcTGT is missing from this protein, and the zinc binding motif is present in a supplementary domain (Fig. 2). Physical clustering analyses strongly linked this tgt paralog to DNA metabolism genes, leading to the prediction that this enzyme might be involved in inserting preQ0 or a derivative in DNA. This prediction was tested by analyzing genomic DNA, extracted from different bacteria that contained copies of the tgt paralog, by mass-spectrometry, specifically searching for the deoxyribose forms of the Q pathway intermediates. Deoxy-preQ0 (dpreQ0) was detected as predicted but seemed to be a minor product. The main deazapurine modification found in the DNA of these organisms was dADG, the deoxyribose form of the preQ0 precursor ADG. The discovery of these modifications led to renaming these tgt paralogs as dpdA, for DeoxyPurine in DNA.

Discovery of dG+ in bacteriophage 9g DNA

Many bacteriophages (or phages) encode TGT-like proteins (Fig. 3). Early studies of the Streptococcus Dp-1 phage, and the Mycobacterium phage Rosebush had led to the proposal that these genes are involved in the synthesis of Q in tRNA, interfering with or enhancing the translation of the host or phage proteins. Another possible function was proposed by the Letarov group, who suggested that Q could be incorporated in the DNA of Escherichia coli phage 9g, causing its resistance to cleavage by several endonucleases. Consistent with this proposal is the observation that most phage TGT-like proteins are closer in sequence to DpdA than to the canonical TGT enzymes. Although the catalytic core and substrate recognition residues of arcTGT are conserved in this protein (Fig. 2), two out of the four residues constituting the zinc binding sites are not conserved (Fig. 2).
Figure 3.

Genome neighborhoods of dpdA in bacteria and phages. The dpd and the queuosine related genes are defined in the text, general family memberships were noted for the other genes. The host of each phage is indicated between parenthesis aside of each phage name. Accession numbers for the corresponding DpdA proteins are: WP_001542917.1 for Salmonella enterica subsp. enterica serovar Montevideo str., WP_015211588 for Microcoleus sp PCC 7113, ABS05840 for Kineococcus radiotolerans SRS30216, YP_009032326 for Escherichia coli phage 9g, YP_004306895 for Streptococcus phage Dp-1, NP_817763 for Mycobacterium phage Rosebush, and YP_655116 for Mycobacterium phage Orion.

Genome neighborhoods of dpdA in bacteria and phages. The dpd and the queuosine related genes are defined in the text, general family memberships were noted for the other genes. The host of each phage is indicated between parenthesis aside of each phage name. Accession numbers for the corresponding DpdA proteins are: WP_001542917.1 for Salmonella enterica subsp. enterica serovar Montevideo str., WP_015211588 for Microcoleus sp PCC 7113, ABS05840 for Kineococcus radiotolerans SRS30216, YP_009032326 for Escherichia coli phage 9g, YP_004306895 for Streptococcus phage Dp-1, NP_817763 for Mycobacterium phage Rosebush, and YP_655116 for Mycobacterium phage Orion. Physical clustering analysis of the 9g phage genome revealed that the dpdA gene clustered with a gat-queC homolog, suggesting that G+ might be inserted in this phage. This prediction was validated by mass-spectrometry and it was shown that 25% of the dG residues were modified to dG+ in the 9g DNA, a first occurrence of the G+ base outside the archaeal kingdom.

The Dpd cluster and horizontal gene transfer

Similarly to tgt being the signature gene for Q in tRNA, dpdA is the signature gene for the presence of deazapurine derivatives in DNA. dpdA is part of a genomic island of about 20 kb in size (Fig. 3). Synteny analyses of closely related strains as well as co-distribution analysis of cluster genes with dpdA helped define the island boundaries. Deletion of the whole dpd cluster in Salmonella enterica serovar Montevideo was shown to abolish the insertion of ADG and preQ0 in DNA. In the S. Montevideo genomes, the cluster is inserted at the leuX locus, an hypervariable region that is a known landing site for pathogenicity virulence and defense genes (Fig. 4). 70% of the available S. Montevideo sequences in the SEED database (SEED viewer version 2.081) contain the dpd cluster. This data, in combination with the phylogenetic analysis of the DpdA family that carries many deviations from the species tree and the wide and sporadic distribution of the dpd cluster around the bacterial tree, makes a strong case for the great mobility of this cluster and its propagation by horizontal gene transfer.
Figure 4.

Schematic representation of the variable region between leuV and yeeN genes in different Salmonella genomes. The genes are colored by functional groups as indicated on the figure.

Schematic representation of the variable region between leuV and yeeN genes in different Salmonella genomes. The genes are colored by functional groups as indicated on the figure. Three genes, dpdB, dpdC and dpdD, are nearly always associated with dpdA in the bacterial dpd clusters; while the dpdEFJK genes, all related to DNA metabolism, have a more variable distribution (Fig. 3). One or more preQ0 synthesis genes are also present in some clusters (Fig. 3). Based on the sequence similarity with TGT, DpdA is the most promising candidate for the enzyme that inserts preQ0 and/or ADG in DNA. DpdB is found in 92% of the genomes that harbor dpdA. Sequence similarity analyses showed that DpdB is a member of the ParB nuclease superfamily, more specifically related to the DndB subgroup of proteins. DndB is part of the dnd cluster that introduces a phosphorothioate modification in the DNA backbone. DndB is not part of the core modification pathway but is a DNA binding protein with a regulatory role, suggesting that DpdB might bind DNA and could be involved in target recognition. dpdC was found in 88% and DpdD in 90% of the genome containing dpdA. Both encoded proteins were similar to proteins of unknown function, DUF328 for DpdC and DUF3225 for a small portion of dpdD, making any functional prediction difficult at this stage.

Function of deazapurines in DNA

Up to 25% of the G bases in 9g phage DNA are converted to G+. The modification event in the bacterial genomes is more rare because only 0.01% of the G bases were observed to be modified to ADG/preQ0 in the DNA analyzed. 9g DNA is also resistant to cleavage by a variety of endonucleases, hence dG+ might have an anti-restriction function allowing the phage to escape host restriction. S. Montevideo DNA on the other hand is not resistant to cleavage by restriction enzymes. However, at least one gene in the dpd cluster is involved in cutting DNA that lacks the deazapurine modification based on transformation efficiency assays, suggesting that the cluster carries a restriction/modification system that protects the bacteria from foreign DNA. The identity of the restriction enzyme remains an open question. Candidates includes genes like dpdD or dpdC that are of unknown function; or some of the other Dpd proteins, like DpdE that is part of the SNF-II superfamily that contains endonucleases from type I and III modification/restriction systems.

Predicted diversity of deazapurine modification of phages

G+ is present in E. coli phage 9g DNA, and the presence of very similar gene clusters, including the gat-queC gene, suggests that G+ incorporation should also be the case for E. coli phages JenK1, and JenP1/2. Analysis of the genomic context of DpdA-like encoding genes in all sequenced phages suggests that other deazapurine derivatives might be found in phages. Some phages, e.g., Mycobacteriophage Rosebush, encode all enzymes of the preQ0 biosynthesis pathway (from FolE to QueC, Fig. 3), and are predicted to insert preQ0 or ADG in their DNA. Other phages like Streptococcus phage Dp-1 also encode QueF, and could therefore insert preQ1. Finally, some phages like Mycobacteriophage Orion only harbor a dpdA-like gene raising the possibility of salvage of Q precursors from the host.

Conclusion

It has become quite clear recently that 7-cyano-7-deazaguanine, or preQ0, is a key metabolite that is recruited in different pathways and is used in a variety of ways. PreQ0 can be the final product, as recently described in the Streptomyces, but its most frequent fate (in around 70% of sequenced bacteria and nearly all archaea) is to be inserted in tRNA where it is further modified to Q or G+ (Fig. 1). The preQ0 base is also used as precursor of secondary metabolites, such as toyocamycin and sangivamycin in several Actinomycetes, or can be inserted in DNA in organisms that harbor the dnd genomic island. It is still not clear yet if ADG is incorporated directly in DNA or if it is synthesized after preQ0 incorporation. Some organisms will use the preQ0 precursor in multiple pathways like S. Montevideo that harbors both Q in tRNA and ADG in DNA, hence the preQ0 synthesis cluster is sometimes duplicated (Fig. 3). Finding the preQ0 transporter gene yhhQ imbedded in certain dpd clusters also suggests that salvaged preQ0 could be inserted in DNA. The insertion of preQ0 in a nucleic acid polymer required the presence of a transglycosylase of the TGT family that must exchange the guanine base with the 7-deazaguanine derivative. The plasticity of the TGT family was well established as small variations in sequences allowed the different orthologs to favor preQ0, preQ1 or q substrates. It had also been shown that the bTGT could modify mRNA in vitro and artificially modify DNA when thymines were replaced by deoxyuracils. The discovery of the role of the TGT paralog DndA in modifying DNA has shown that this can happen in vivo and opens many questions on the identity of potential sequence specific endonuclease inhibited by the insertion of dADG, on the role of deazapurine in protecting DNA from restriction, and on how the normal replication/transcription machineries can deal with these modifications.
  83 in total

1.  A novel class of modular transporters for vitamins in prokaryotes.

Authors:  Dmitry A Rodionov; Peter Hebbeln; Aymerick Eudes; Josy ter Beek; Irina A Rodionova; Guus B Erkens; Dirk J Slotboom; Mikhail S Gelfand; Andrei L Osterman; Andrew D Hanson; Thomas Eitinger
Journal:  J Bacteriol       Date:  2008-10-17       Impact factor: 3.490

Review 2.  Queuosine modification of tRNA: its divergent role in cellular machinery.

Authors:  Manjula Vinayak; Chandramani Pathak
Journal:  Biosci Rep       Date:  2009-11-23       Impact factor: 3.840

Review 3.  Structure and function of mammalian DNA methyltransferases.

Authors:  Renata Zofia Jurkowska; Tomasz Piotr Jurkowski; Albert Jeltsch
Journal:  Chembiochem       Date:  2010-11-29       Impact factor: 3.164

4.  Discovery and characterization of an amidinotransferase involved in the modification of archaeal tRNA.

Authors:  Gabriela Phillips; Vimbai M Chikwana; Adrienne Maxwell; Basma El-Yacoubi; Manal A Swairjo; Dirk Iwata-Reuyl; Valérie de Crécy-Lagard
Journal:  J Biol Chem       Date:  2010-02-03       Impact factor: 5.157

Review 5.  Biosynthesis and Function of Modified Bases in Bacteria and Their Viruses.

Authors:  Peter Weigele; Elisabeth A Raleigh
Journal:  Chem Rev       Date:  2016-06-20       Impact factor: 60.622

6.  Identification of four genes necessary for biosynthesis of the modified nucleoside queuosine.

Authors:  John S Reader; David Metzgar; Paul Schimmel; Valérie de Crécy-Lagard
Journal:  J Biol Chem       Date:  2003-12-02       Impact factor: 5.157

7.  Queuosine modification of the wobble base in tRNAHis influences 'in vivo' decoding properties.

Authors:  F Meier; B Suter; H Grosjean; G Keith; E Kubli
Journal:  EMBO J       Date:  1985-03       Impact factor: 11.598

8.  Towards a systems approach in the genetic analysis of archaea: Accelerating mutant construction and phenotypic analysis in Haloferax volcanii.

Authors:  Ian K Blaby; Gabriela Phillips; Crysten E Blaby-Haas; Kevin S Gulig; Basma El Yacoubi; Valérie de Crécy-Lagard
Journal:  Archaea       Date:  2010-12-23       Impact factor: 3.273

9.  Transcriptome-wide mapping of pseudouridines: pseudouridine synthases modify specific mRNAs in S. cerevisiae.

Authors:  Alexander F Lovejoy; Daniel P Riordan; Patrick O Brown
Journal:  PLoS One       Date:  2014-10-29       Impact factor: 3.240

10.  Computational identification of novel biochemical systems involved in oxidation, glycosylation and other complex modifications of bases in DNA.

Authors:  Lakshminarayan M Iyer; Dapeng Zhang; A Maxwell Burroughs; L Aravind
Journal:  Nucleic Acids Res       Date:  2013-06-28       Impact factor: 16.971

View more
  17 in total

1.  Epoxyqueuosine Reductase QueH in the Biosynthetic Pathway to tRNA Queuosine Is a Unique Metalloenzyme.

Authors:  Qiang Li; Rémi Zallot; Brian S MacTavish; Alvaro Montoya; Daniel J Payan; You Hu; John A Gerlt; Alexander Angerhofer; Valérie de Crécy-Lagard; Steven D Bruner
Journal:  Biochemistry       Date:  2021-10-15       Impact factor: 3.162

2.  CASP13 target classification into tertiary structure prediction categories.

Authors:  Lisa N Kinch; Andriy Kryshtafovych; Bohdan Monastyrskyy; Nick V Grishin
Journal:  Proteins       Date:  2019-07-24

3.  Discovery of novel bacterial queuine salvage enzymes and pathways in human pathogens.

Authors:  Yifeng Yuan; Rémi Zallot; Tyler L Grove; Daniel J Payan; Isabelle Martin-Verstraete; Sara Šepić; Seetharamsingh Balamkundu; Ramesh Neelakandan; Vinod K Gadi; Chuan-Fa Liu; Manal A Swairjo; Peter C Dedon; Steven C Almo; John A Gerlt; Valérie de Crécy-Lagard
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-03       Impact factor: 12.779

4.  Editorial: RNA modifications - what to read first?

Authors:  Mark Helm
Journal:  RNA Biol       Date:  2017-09-02       Impact factor: 4.652

5.  Identification of a Novel Epoxyqueuosine Reductase Family by Comparative Genomics.

Authors:  Rémi Zallot; Robert Ross; Wei-Hung Chen; Steven D Bruner; Patrick A Limbach; Valérie de Crécy-Lagard
Journal:  ACS Chem Biol       Date:  2017-02-08       Impact factor: 5.100

6.  The Escherichia coli COG1738 Member YhhQ Is Involved in 7-Cyanodeazaguanine (preQ₀) Transport.

Authors:  Rémi Zallot; Yifeng Yuan; Valérie de Crécy-Lagard
Journal:  Biomolecules       Date:  2017-02-08

7.  Mycobacteriophage CRB2 defines a new subcluster in mycobacteriophage classification.

Authors:  Cristian Alejandro Suarez; Jorgelina Judith Franceschelli; Héctor Ricardo Morbidoni
Journal:  PLoS One       Date:  2019-02-27       Impact factor: 3.240

8.  Complete Genome Sequence of Escherichia coli Siphophage BRET.

Authors:  Solange Ngazoa-Kakou; Yuyu Shao; Geneviève M Rousseau; Audrey A Addablah; Denise M Tremblay; Geoffrey Hutinet; Nicolas Lemire; Pier-Luc Plante; Jacques Corbeil; Aristide Koudou; Benjamin K Soro; David N Coulibaly; Serge Aoussi; Mireille Dosso; Sylvain Moineau
Journal:  Microbiol Resour Announc       Date:  2019-01-31

9.  Pantoea Bacteriophage vB_PagS_Vid5: A Low-Temperature Siphovirus That Harbors a Cluster of Genes Involved in the Biosynthesis of Archaeosine.

Authors:  Eugenijus Šimoliūnas; Monika Šimoliūnienė; Laura Kaliniene; Aurelija Zajančkauskaitė; Martynas Skapas; Rolandas Meškys; Algirdas Kaupinis; Mindaugas Valius; Lidija Truncaitė
Journal:  Viruses       Date:  2018-10-25       Impact factor: 5.048

10.  Superior cellular activities of azido- over amino-functionalized ligands for engineered preQ1 riboswitches in E.coli.

Authors:  Eva Neuner; Marina Frener; Alexandra Lusser; Ronald Micura
Journal:  RNA Biol       Date:  2018-10-26       Impact factor: 4.652

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.