Literature DB >> 24646681

A highly conserved family of domains related to the DNA-glycosylase fold helps predict multiple novel pathways for RNA modifications.

Abstract

A protein family including mammalian NEMF, Drosophila caliban, yeast Tae2, and bacterial FpbA-like proteins was first defined over a decade ago and found to be universally distributed across the three domains/superkingdoms of life. Since its initial characterization, this family of proteins has been tantalizingly linked to a wide range of biochemical functions. Tapping the enormous wealth of genome information that has accumulated since the initial characterization of these proteins, we perform a detailed computational analysis of the family, identifying multiple conserved domains. Domains identified include an enzymatic domain related to the formamidopyrimidine (Fpg), MutM, and Nei/EndoVIII family of DNA glycosylases, a novel, predicted RNA-binding domain, and a domain potentially mediating protein-protein interactions. Through this characterization, we predict that the DNA glycosylase-like domain catalytically operates on double-stranded RNA, as part of a hitherto unknown base modification mechanism that probably targets rRNAs. At least in archaea, and possibly eukaryotes, this pathway might additionally include the AMMECR1 family of proteins. The predicted RNA-binding domain associated with this family is also observed in distinct architectural contexts in other proteins across phylogenetically diverse prokaryotes. Here it is predicted to play a key role in a new pathway for tRNA 4-thiouridylation along with TusA-like sulfur transfer proteins.

Entities: Chemical Disease Gene Species

Keywords: DNA glycosylase; FbpA; IscS; NEMF; Tae2; TusA; base modification; caliban; fibronectin-binding; tRNA 4-thiouridylation

Mesh：

Substances：
RNA
DNA Glycosylases

Year: 2014 PMID： 24646681 PMCID： PMC4075521 DOI： 10.4161/rna.28302

Source DB: PubMed Journal: RNA Biol ISSN： 1547-6286 Impact factor: 4.652

Introduction

Over 10 y ago, a gene family conserved across all three superkingdoms of life was identified and determined to contain two tandem copies of the nucleic acid-binding helix-hairpin-helix (HhH) domain.- Based on its phyletic distribution, this family was traced back to the last universal common ancestor (LUCA) of life. The HhH domain pair and its phyletic pattern suggested a general functional role for the family in a nucleic acid-related role in universally conserved pathways: either RNA-metabolism in the context of translation or DNA repair or recombination. Establishment of a relationship between the HhH-domain pair in these proteins to the one found in ribosomal proteins of the S13/S18 family supported the former function in particular. Since the initial characterization of this family, which includes the Tae2 protein from Saccharomyces cerevisiae, the Caliban (Clbn) protein from Drosophila melanogaster, the mammalian NEMF proteins, and the so-called fibronectin-binding (FbpA-like) proteins from bacteria, several studies have resulted in attribution of a wide range of functional roles for these proteins. These include fibronectin binding in certain pathogenic bacteria,- a core component of the ribosome-associating, co-translational degradation complex RQC in yeast,, regulation of the DNA-damage response in Drosophila, and mediation of nuclear export in Drosophila and human. Given the disparate nature of these findings, we decided to revisit this gene family using state-of-the-art techniques in sequence analysis and comparative genomics while tapping the wealth of new information that has accumulated in the years since its initial characterization. Here we identify and characterize the distinct globular domains conserved across all members of the gene family in addition to the HhH domain pair. One of these domains is predicted to be an enzymatic domain related to the bifunctional DNA glycosylase/endonuclease domain involved in Base Excision Repair (BER), commonly referred to as the Formamidopyrimidine, MutM, and Nei/EndoVIII DNA glycosylase (FMN-DG; also referred to in the literature as Fpg/Nei, Fapy DNA glycosylase, glycosylase/AP-lyase, or Endonuclease VIII) domain.- We identify shared and distinct features of the active site of these two related domains, implying both similarities and differences in their catalytic mechanisms. Another domain in this gene family is predicted to be a novel RNA-binding domain, with a potential role in a variant of the tRNA 4-thiouridylation pathway present in a subset of prokaryotes. Based on these observations and additional genome contextual evidence, we propose that the fundamental functional role of this ancient gene family is related to processing/modification of double-stranded RNA, perhaps rRNA.

Results

Delineation of the NEMF/FbpA/Caliban/Tae2gene family and its core architectures

To comprehensively characterize this gene family, we collected all related sequences using known members as seeds to initiate sequence profile searches against the non-redundant (nr) protein database at the National Center for Biotechnology Information. Given the presence of a large coiled-coil domain in the gene family, we applied the low complexity seg filter to these searches to avoid inclusion of genes with spurious similarity. Membership of proteins displaying relationships with borderline significance was confirmed by initiating reverse searches. Sequences obtained were then aligned and potential globular regions shared across the gene family were identified by inspection of these alignments after mapping the location of the known HhH domains and the coiled-coil regions onto the alignments (see Methods, Supplemental Material). Orthologs of the gene family across all three superkingdoms of life were identified, including the NEMF, bacterial FbpA-like proteins, Caliban, and Tae2; accordingly, we termed this family NFACT. Representatives of the family were found across all major archaeal lineages including the euryarchaeota, crenarchaeota, korarchaeota, and thaumarchaeota. The NFACT family is also found across most major bacterial lineages, although it is notably absent in the α-, β-, and γ-proteobacterial lineages (despite being present in δ- and ε-proteobacteria) and actinobacteria. In eukaryotes, the NFACT family is again present in all major lineages including the diplomonads, parabasalids, heteroloboseans, kinetoplastids, chromoalveolates, apicomplexa, and the crown-group eukaryotes encompassing the plant, ameobozoan, animal, and fungal lineages (with a notable absence in the basidiomycete fungi). Taken as a whole, despite losses in certain terminal lineages, this phyletic spread unquestionably points to presence of the NFACT family in the LUCA (see Supplemental Material for complete sequence and phyletic distribution). The conserved core of the NFACT family found across all members is formed by four domains interrupted by the coiled-coil region (Fig. 1; Supplemental Material): from N terminus to C terminus these entail an uncharacterized N-terminal domain, the two HhH domains, the coiled-coil region, and a domain currently annotated as DUF814 (Domain of Unknown Function 814) in Pfam. The first three domains from the N-terminus are currently incorrectly annotated as a single domain in Pfam: the FbpA domain. We propose renaming the N-terminal domain the NFACT-N (for NEMF, FbpA, Caliban, Tae2, N-terminal) domain and separating it from the downstream HhH domains (Fig. 1). In archaea and eukaryotes, an additional C-terminal domain annotated in Pfam as DUF3441 is present, clearly establishing the archaeal version of the family as the one inherited by eukaryotes. This core (NFACT-N+HhH+HhH+coiled-coil+DUF814[+DUF3441]) has proven resistant to domain accretion during evolution, although a few limited elaborations are observed in eukaryotes. In plants, Entamoeba, and Giardia, a zinc knuckle domain (ZnK) insertion is present between the DUF814 and DUF3441 domains. The most parsimonious explanation for this unusual phyletic distribution is independent, secondary acquisition of the ZnK in these three distant lineages, a scenario supported by the lack of specific sequence similarity across the different ZnKs. While the ZnK is not universally present in chlorophyte alga, its presence in Ostreococcus suggests that it was acquired early in the evolution of Viridiplantae. One additional potential domain fusion of note is the C-terminal fusion to four copies of the RNA-binding RRM domain in the roundworm Loa loa (Fig. 1).

Figure 1. Domain architectures and conserved gene neighborhoods relating to NFACT proteins. Domains architectures are depicted by adjoining polygonal shapes labeled with individual domain names, size of domains are not drawn precisely to scale. Individual genes within conserved gene neighborhoods are depicted with boxed arrows with the arrowhead pointing toward the 3′ end of the gene. Each architecture/neighborhood is labeled with gene name, GenBank gene identifier (gi) number, and organism name separated by semicolons. Architectures/neighborhoods relating to the NFACT gene family are boxed in purple; those relating specifically to independent contexts of the NFACT-R domain are boxed in orange. The coiled-coil region characteristic of the NFACT gene family is depicted as a light green circle in architectures and a light green box in gene neighborhoods. Abbreviations: CC, coiled-coil; HhH, helix-hairpin-helix; ZnK, zinc knuckle; ZnR, zinc ribbon.

NFACT-N domain

Iterative profile searches initiated with NFACT-N domain sequences and their downstream HhH domains against the nr database recovered significant matches extending along the length of the DNA glycosylase (FMN-DG) domain and its characteristic C-terminal HhH domain pair. For example, a sequence from the H. sapiens NEMF protein recovered the FMN-DG and HhH domains from the fungus Togninia minima (gi: 500258929, iteration: 4, e-value: 2e−4) and the acidobacterium Terriglobus roseus (gi: 390956237, iteration: 4, e-value: 7e−4) in PSI-BLAST. Reciprocal searches confirmed these relationships: a search initiated with the same sequence from Terriglobus yielded the NFACT-N from Methanocella arvoryzae (gi: 147920849, PSI-BLAST iteration: 1, e-value: 4e−3). As an independent means of confirmation, a HMM profile constructed from the multiple sequence alignment of the NFACT-N and HhH domains was searched against a database of HMMs constructed for individual PDB entries using the HHpred program. In addition to detecting PDB: 3doa (structure of a N-terminal NFACT fragment), this search again revealed a significant relationship between the NFACT-N and HhH domains and cognate domains in FMN-DG; for example, significant matches are retrieved for FMN-DGs from Arabidopsis thaliana (PDB: 3twl, P value: 1e−5, probability: 95.9%) and Geobacillus stearothermophilus (PDB: 3u6p, P value: 6.2e−5, probability: 94.0%). The FMN-DG family is well-distributed across bacteria, absent in archaea, and only present in scattered eukaryotic lineages. The family can be divided into two distinct subfamilies, the Fpg/MutM-like subfamily found in bacteria, plants, and fungi and the eukaryotic Nei subfamily primarily observed in animals., Previous analysis indicated that eukaryotic versions of the Fpg/MutM-like subfamily likely emerged via horizontal gene transfer (HGT) from bacteria relatively early in eukaryotic evolution, while the Nei subfamily emerged later following HGT from a bacterial source to the stem of the animal lineage. Our analysis identified Nei homologs in the early-branching eukaryote diplomonad Giardia, suggesting the possibility that Nei also might have been acquired earlier in eukaryotic evolution (AMB, LA, personal observations). Thus, the phyletic patterns of NFACT-N point to an origin in the LUCA, whereas the FMN-DGs appear to have emerged first in bacteria. This suggests that the latter are likely to have been derived from the former early in bacterial evolution. To better understand the relationship between NFACT-N and FMN-DNA glycosylases, we constructed a structure-guided super-alignment, first aligning known FMN-DG structures with the 3DOA structure and then adding further NFACT-N/HhH sequences (see Materials and Methods). At this point it became evident that despite the clear homology between the domains, the shared core scaffold had undergone a multi-step structural reorganization via duplication during divergence (Fig. 2). Both domains feature a core containing eight β-strands and two α-helices leading into the dyad of HhH motifs. The NFACT-N domain, which was inferred to represent the ancestral condition, has two repeats of a basic structural element, each containing an α-helix leading into a 4-stranded β-meander, yielding a sandwich-like fold with the two stacking β-sheets from each repeat oriented at a roughly 30 degree angle to the other (Fig. 2A). In FMN-DG, this structure, including the orientation of the stacking β-sheets, is retained but the connectivity between the helix/meander units has been substantially altered (Fig. 2). The most parsimonious explanation for the “re-wiring” of the connectivity in FMN-DG entails the following steps, in some ways reminiscent of the recently elucidated steps underlying the derivation of the FYVE domain from the canonical binuclear treble clef domain core (Fig. 2B): (1) duplication of one of the repeats in the original 2-repeat structure yielding a 3-repeat intermediate. (2) Given the packing of the sheets against each other to form a sandwich, the 3-repeat state leads to an inherently unstable condition with competition between alternative repeats to reconstitute the original sandwich. (3) This instability was resolved by natural selection through partial loss of a subset of the elements to reconstitute the original two-sheet sandwich. This reconstitution, rather than proceeding via the loss of a complete superfluous repeat, resulted from complementary, partial loss of elements from repeat 1 and 3, while retaining the overall sandwich structure intact. Using this understanding as a guide, an alignment of the spatially equivalent strands and helices across both domains was constructed. This revealed extensive sequence conservation between the aligned individual elements of both domains, strongly supporting the above proposed scenario (Fig. 3).

Figure 3. Multiple sequence alignment of NFACT-N+HhH domains with selected FMN-DG+HhH domain sequences. FMN-DG sequences from solved crystal structures are at the top of the alignment followed by the NFACT-N sequences. Secondary structure elements are depicted as follows: extended loop regions are represented by black lines, β-strands represented by orange arrows, and α-helices represented by purple cylinders. The transition from the core enzymatic domain to the HhH domains is labeled with a black arrow above the alignment. Individual sequences are labeled to the left with gene name, organism name, and gi number separated by an underscore. Gene names are replaced by PDB identifiers where appropriate. Numbers to the left and right of the alignment correspond to amino acid position within the protein encoding the domain. Insert regions are excised and replaced with numbers indicating the length in amino acids of the insert. Due to the “re-wiring” between the core enzymatic domains (Fig. 2), FMN-DG sequences are not presented in linear order; “breaks” in this order are marked at appropriate positions with “x.” Coloring is based on the consensus line at the bottom of the alignment: h, hydrophobic (shaded in yellow); s, small (shaded in green); l, aliphatic (shaded in yellow); -, negatively charged (shaded in purple); p, polar (shaded in blue); +, positively charged (shaded in purple); a, aromatic (shaded in yellow); b, big (shaded in gray); u, tiny (shaded in green); c, charged (shaded in purple). Columns corresponding to active site residue positions are shaded in red, colored in white, and marked at the top with “*.” Columns corresponding to positions involved in either direct or indirect substrate recognition are shaded in brown, colored in white, and marked with “^.” The column corresponding to the conserved glutamate/histidine residue in the HhH domains is shaded in red, colored in yellow, and marked with “&.” The column corresponding to the conserved lysine/arginine residue specific to NFACT-N is marked with a “%.” Organism abbreviations as follows: Aboo, Aciduliprofundum boonei; Alai, Acholeplasma laidlawii; Aory, Aspergillus oryzae; Aper, Aeropyrum pernix; Atha, Arabidopsis thaliana; Bcer, Bacillus cereus; Bthu, Bacillus thuringiensis; CCal, Candidatus Caldiarchaeum; CChl, Candidatus Chloracidobacterium; CKor, Candidatus Korarchaeum; Cele, Caenorhabditis elegans; Cint, Ciona intestinalis; Cowc, Capsaspora owczarzaki; Cpas, Clostridium pasteurianum; Cpha, Chlorobium phaeobacteroides; Csym, Cenarchaeum symbiosum; Dpul, Daphnia pulex; Drer, Danio rerio; Dtur, Dictyoglomus turgidum; Ecol, Escherichia coli; Efae, Enterococcus faecalis; Ehis, Entamoeba histolytica; Fnec, Fusobacterium necrophorum; Glam, Giardia lamblia; Gsp., Geobacillus sp.; Gste, Geobacillus stearothermophilus; Hmag, Hydra magnipapillata; Hsap, Homo sapiens; Hter, Halorubrum terrestre; Hvol, Haloferax volcanii; Klac, Kluyveromyces lactis; Ldel, Lactobacillus delbrueckii; Llac, Lactococcus lactis; Lmon, Listeria monocytogenes; Mbre, Monosiga brevicollis; Mmus, Mus musculus; Mpsy, Methanolobus psychrophilus; Myel, Metallosphaera yellowstonensis; Nfis, Neosartorya fischeri; Ngar, Natrinema gari; Ngru, Naegleria gruberi; Pmar, Prochlorococcus marinus; Psp., Pyrococcus sp.; Ptet, Paramecium tetraurelia; Rnor, Rattus norvegicus; Saci, Sulfolobus acidocaldarius; Saur, Staphylococcus aureus; Scer, Saccharomyces cerevisiae; Sequ, Streptococcus equi; Skow, Saccoglossus kowalevskii; Spur, Strongylocentrotus purpuratus; Tbru, Trypanosoma brucei; Tcru, Trypanosoma cruzi; Tlie, Thermovirga lienii; Tori, Theileria orientalis; Tsp., Thermococcus sp.; Tsp., Thermotoga sp.; Uarc, uncultured archaeon.

Figure 2. Structural relationship between NFACT-N and FMN-DG domains. (A) Cartoon renderings of solved crystal structures (top) for the NFACT-N (left) and FMN-DG domains (right) accompanied by corresponding topology diagrams (bottom). Residues conserved across the domains are rendered as ball-and-stick in the cartoons; active site/enzymatic residues are colored in red, residues with a likely direct or indirect role in substrate recognition are colored in blue, and the conserved asparagine/histidine found in the HhH domains is colored in yellow. The PDB identifier is provided to the right of both cartoons. The labeling scheme provided below each diagram reflects the spatial/evolutionary conservation of each element as evident from the solved crystal structures and as referred to in the text. (B) Stepwise scenario for the emergence of the FMN-DG domain from the ancestral NFACT-N domain. Domains are depicted as arrays of secondary structure elements to show the wiring between elements. Figure 3. Multiple sequence alignment of NFACT-N+HhH domains with selected FMN-DG+HhH domain sequences. FMN-DG sequences from solved crystal structures are at the top of the alignment followed by the NFACT-N sequences. Secondary structure elements are depicted as follows: extended loop regions are represented by black lines, β-strands represented by orange arrows, and α-helices represented by purple cylinders. The transition from the core enzymatic domain to the HhH domains is labeled with a black arrow above the alignment. Individual sequences are labeled to the left with gene name, organism name, and gi number separated by an underscore. Gene names are replaced by PDB identifiers where appropriate. Numbers to the left and right of the alignment correspond to amino acid position within the protein encoding the domain. Insert regions are excised and replaced with numbers indicating the length in amino acids of the insert. Due to the “re-wiring” between the core enzymatic domains (Fig. 2), FMN-DG sequences are not presented in linear order; “breaks” in this order are marked at appropriate positions with “x.” Coloring is based on the consensus line at the bottom of the alignment: h, hydrophobic (shaded in yellow); s, small (shaded in green); l, aliphatic (shaded in yellow); -, negatively charged (shaded in purple); p, polar (shaded in blue); +, positively charged (shaded in purple); a, aromatic (shaded in yellow); b, big (shaded in gray); u, tiny (shaded in green); c, charged (shaded in purple). Columns corresponding to active site residue positions are shaded in red, colored in white, and marked at the top with “*.” Columns corresponding to positions involved in either direct or indirect substrate recognition are shaded in brown, colored in white, and marked with “^.” The column corresponding to the conserved glutamate/histidine residue in the HhH domains is shaded in red, colored in yellow, and marked with “&.” The column corresponding to the conserved lysine/arginine residue specific to NFACT-N is marked with a “%.” Organism abbreviations as follows: Aboo, Aciduliprofundum boonei; Alai, Acholeplasma laidlawii; Aory, Aspergillus oryzae; Aper, Aeropyrum pernix; Atha, Arabidopsis thaliana; Bcer, Bacillus cereus; Bthu, Bacillus thuringiensis; CCal, Candidatus Caldiarchaeum; CChl, Candidatus Chloracidobacterium; CKor, Candidatus Korarchaeum; Cele, Caenorhabditis elegans; Cint, Ciona intestinalis; Cowc, Capsaspora owczarzaki; Cpas, Clostridium pasteurianum; Cpha, Chlorobium phaeobacteroides; Csym, Cenarchaeum symbiosum; Dpul, Daphnia pulex; Drer, Danio rerio; Dtur, Dictyoglomus turgidum; Ecol, Escherichia coli; Efae, Enterococcus faecalis; Ehis, Entamoeba histolytica; Fnec, Fusobacterium necrophorum; Glam, Giardia lamblia; Gsp., Geobacillus sp.; Gste, Geobacillus stearothermophilus; Hmag, Hydra magnipapillata; Hsap, Homo sapiens; Hter, Halorubrum terrestre; Hvol, Haloferax volcanii; Klac, Kluyveromyces lactis; Ldel, Lactobacillus delbrueckii; Llac, Lactococcus lactis; Lmon, Listeria monocytogenes; Mbre, Monosiga brevicollis; Mmus, Mus musculus; Mpsy, Methanolobus psychrophilus; Myel, Metallosphaera yellowstonensis; Nfis, Neosartorya fischeri; Ngar, Natrinema gari; Ngru, Naegleria gruberi; Pmar, Prochlorococcus marinus; Psp., Pyrococcus sp.; Ptet, Paramecium tetraurelia; Rnor, Rattus norvegicus; Saci, Sulfolobus acidocaldarius; Saur, Staphylococcus aureus; Scer, Saccharomyces cerevisiae; Sequ, Streptococcus equi; Skow, Saccoglossus kowalevskii; Spur, Strongylocentrotus purpuratus; Tbru, Trypanosoma brucei; Tcru, Trypanosoma cruzi; Tlie, Thermovirga lienii; Tori, Theileria orientalis; Tsp., Thermococcus sp.; Tsp., Thermotoga sp.; Uarc, uncultured archaeon. The FMN-DG enzymes have been studied in the context of removal of DNA bases damaged through oxidation, e.g., 8-oxoguanine (8OG). FMN-DG is distinct from other types of DNA glycosylases in that it also catalyzes the next step in DNA repair, the introduction of an endonucleolytic break at the site of base removal. The catalytic process of the FMN-DG domain includes, in order of steps: (1) recognition of oxidized bases, (2) flipping of the oxidized “base” from the double helix to position the base in the active site pocket, (3) removal of the base, and (4) induction of an endo-nucleolytic break in the DNA backbone following opening of the deoxyribose ring associated with the excised base. Despite extensive research on these activities, the basis for steps 1 and 2 remain relatively poorly understood as the implicated residues appear to vary considerably even among closely related members of the same subfamily of FMN-DGs, a phenomenon probably related to the observed promiscuity in oxidized base substrates. Steps 3 and 4 draw on a set of core residues clustered in the active site pocket, including an absolutely conserved proline residue, one or two conserved glutamate residues located in distinct spatial positions near the N terminus of H1, and a lysine residue found in the loop between strand-1’ and strand-2’ (Figs. 2A, 3, and 4A). Consensus based on experimental studies currently holds that the proline residue forms a Schiff-base intermediate with the C1’ atom of the damaged base and that at least one of the two conserved glutamate residues is involved in initiating the base removal step.- The role of the lysine has been debated but appears to also be involved in the base removal step while possibly also being a key player in the backbone cleavage reaction., In comparison, the NFACT-N domain harbors an absolutely conserved arginine positionally equivalent to the lysine. Helix-1 in NFACT-N also contains a well-conserved aspartate found in one or the other of the two positions where conserved glutamates are found in the FMN-DG domain (Figs. 2A and 3). Strikingly, NFACT-N does not contain any absolutely conserved prolines, either at the N terminus or elsewhere in the domain (Fig. 3). NFACT-N also features an additional absolutely conserved aspartate in the predicted active site pocket, one residue downstream of the aforementioned arginine, which could compensate for the second aspartate seen in the N terminus of H1 in the cognate FMN-DG domain (Figs. 2A and 3). Two additional well-conserved residues outside of the active site pocket with conserved cognates in FMN-DG include a glutamate found near the C terminus of strand-3′ and a well-conserved asparagine found at the N terminus of strand-4’. The glutamate residue appears equivalent to a highly conserved histidine residue in FMN-DG, which plays a role in substrate nucleic acid binding; however, in NFACT-N, this residue appears to form a conserved salt bridge with a positively charged residue in the linker region found between the NFACT-N domain and the HhH domains. The conserved NFACT-N asparagine appears positionally equivalent to a polar residue typically taking the form of a serine or threonine in FMN-DG domains that contributes to nucleic acid recognition (Figs. 2A and 3); in NFACT-N, this residue might help position a well-conserved NFACT-N-specific arginine/lysine, which points out from the predicted catalytic core and could be involved in nucleic acid recognition (Fig. 3).

Figure 4. Known and predicted reactions. (A) Base removal and ring-opening steps catalyzed by FMN-DNA glycosylases (top) and predicted analogous steps catalyzed by NFACT-N during a potential base-exchange reaction (bottom). The introduced free base in the NFACT-N reaction is labeled with a red “M,” indicating the base is potentially modified in some way despite being shown here as a uridine. (B) 2-thiouridylation, canonical 4-thiouridylation, and the novel, predicted 4-thiouridylation sulfur relay pathways. Potential intermediate step in the novel 4-thiouridylation pathway involving transfer to a conserved cysteine on IscU is shown in dotted lines, reflecting uncertainty in whether this step is present in all organisms containing the ThiI+NFACT-R fusion or whether it is restricted to a subset of them. Crosslinking experiments capturing FMN-DG active site intermediates clearly indicate that proline-mediated Schiff base formation occurs following base excision and deoxyribose ring-opening but prior to nicking of the phosphodiester backbone (Fig. 4A). One of the two conserved H1 aspartates is consistently implicated in base removal upstream of the Schiff base formation, although which aspartate is involved may depend on the substrate. Conflicting views on the lysine indicate a role in either base-removal upstream of Schiff base formation or in initiating the DNA cleavage reaction. The lack of a conserved proline or any compensatory amine-bearing residue in the cognate NFACT-N active site unequivocally establishes NFACT-N as incapable of forming the Schiff base intermediate and thus is unlikely to be involved in DNA cleavage as catalyzed by FMN-DG. However, like FMN-DG, NFACT-N displays an arginine equivalent to lysine in the former and two acidic active site residues. Hence, based on the above-outlined spatial position- and residue- conservation between the domains, we predict NFACT-N potentially catalyzes base-removal as observed in FMN-DG (Fig. 4A, see below). These observations, combined with the knowledge that the FMN-DG domains were derived from the ancestral NFACT-N domain, also potentially assists in distinguishing between proposed roles for the lysine in the FMN-DG reaction mechanism. As the conserved proline, Schiff base formation, and nuclease activity are unique to FMN-DG, they must necessarily be catalytic innovations secondary to the roles of the ancestral aspartate/glutamate and arginine/lysine. Thus, a role for the lysine/arginine in base removal during glycosylase activity is likely to be the ancestral role (Fig. 4A), although we cannot rule out that the lysine has secondarily acquired an additional role in DNA cleavage in FMN-DG. This role for the lysine is also consistent with mutational studies finding base removal, as opposed to DNA cleavage, to be most affected by lysine substitution. One additional residue conserved in both NFACT-N- and FMN-DG-fused HhH domains is noteworthy: a well-conserved glutamate residue (sometimes replaced by histidine in NFACT-N) found in the first helix of the second HhH domain, which mediates a backbone contact with a distinctive, conserved loop structure immediately C-terminal to the HhH domains in NFACT-N and C-terminal to an inserted zinc ribbon domain found downstream of the HhH domains in FMN-DG. This glutamate-backbone contact positions the HhH domains for interaction with the nucleic acid substrate on the side opposite to the NFACT-N and FMN-DG active sites (Figs. 2A and 3).Thus, this structural constraint appears to have maintained similar active site clefts for the two families with the HhH motifs forming a nucleic acid binding “cap” in both cases to accommodate a double-stranded substrate.

DUF814 domain

Searches with individual DUF814 sequences as seeds failed to recover any remote relationships with other known domains. A multiple sequence alignment constructed for the domain indicates the DUF814 domain consists of an α/β structure with at least seven β-strands and three α-helices. DUF814 domains of the NFACT gene family contain a well-conserved DxxxH motif with the two conserved residues at either end of the third predicted strand in addition to a conserved downstream serine residue found in the second helix of this domain (Supplemental Material). These domains are present in two additional contexts: (1) N-terminal fusion to a PP-loop domain in a diverse range of bacteria including firmicutes, nitrosporae, fusobacteria, synergistetes, spirochetes, delta-, epsilon-, and some gamma-proteobacteria, aquificae, dictyoglomi, planctomycetes as well as few thaumarchaeota and (2) as a solo domain C-terminally fused to a small coiled-coil region present across all eukaryotic lineages, typified by the CCDC25 protein in humans and the Jlp2 protein in Saccharomyces cerevisiae (Supplemental Material). An alignment constructed of only members of the PP-loop-fused DUF814 family revealed the same core secondary structure but lacked the conserved residues found in the NFACT family, instead featuring two nearly-absolutely conserved arginines N-terminal to the first strand and the first helix, respectively (Supplemental Material). The PP-loop domain belongs to the ThiI-like PP-loop family, which catalyzes 4-thiouridylation at nucleotide 8 of bacterial and archaeal tRNA. ThiI-like domains were previously thought to be universally fused to an N-terminal RNA-binding THUMP domain with versions from many bacteria and a few archaea additionally fused C-terminally to a Rhodanese (RHOD) domain containing an absolutely conserved cysteine residue. Bacteria and archaea lacking the C-terminal RHOD domain likely interact with a stand-alone RHOD domain during thiouridylation. Canonical ThiI-mediated 4-thiouridylation of tRNA begins with mobilization of sulfur from free cysteine via the IscS desulfurase, followed by transfer of the sulfur to the conserved cysteine residue found in the RHOD domain (Fig. 4B). In parallel, the THUMP domain binds and positions the tRNA and the PP-loop domain activates the oxo-group of the target uridine using ATP to form an adenylated intermediate., Sulfotransfer to the uridine then proceeds through either direct attack by the RHOD-bound persulfide sulfur on the adenylated intermediate or by a free sulfide generated after attack by an additional sulfur provided by the conserved cysteine residue internal to the core PP-loop domain (Fig. 4B). Two striking observations regarding the ThiI+DUF814 proteins were immediately apparent: (1) the ThiI+DUF814 protein shows an almost entirely mutually exclusive phyletic distribution pattern with respect to the THUMP-containing ThiI-like enzymes (Supplemental Material) and (2) while the ThiI+DUF814 proteins lack both THUMP and RHOD fusions, they retain the conserved residues required for ATP utilization/adenylation found across all PP-loop domains in addition to conserving the internal cysteine residue and CxxC motif characteristic and specific to the ThiI-like PP-loop domains (Supplemental Material). It is also worth noting the variable flexible loop region covering the ThiI active site appears to be more elaborate in the ThiI-DUF814 fusion proteins than canonical ThiI-like domains and houses several unique, strongly conserved motifs (GRxRxxQ and TxxE and a glutamine) (Supplemental Material). In Francisella philomiragia, the ThiI+DUF814 protein is additionally fused to a TusA/SirA-like (TusA) domain at the C terminus (Fig. 1). Inspection of the gene neighborhoods surrounding the ThiI+DUF814 fusion proteins revealed further association with a TusA domain, the only conserved gene neighborhood association found across phylogenetically diverse bacteria. Within these neighborhoods, a subset of bacteria including several aquificae and a single γ-proteobacterium additionally associates with an IscS-like desulfurase with aquificae further containing an IscU/NifU protein (Fig. 1; Supplemental Material). While this is the first reported linkage between ThiI-like PP-loop domains and TusA domains, TusA domains combine with a distinct PP-loop family, the IscS desulfurases, and several additional domains to catalyze tRNA 2-thiouridylation (Fig. 4B). In this pathway, IscS-abstracted sulfur is first transferred to a conserved cysteine in TusA, initiating a complex sulfur transfer pathway, which continues until the PP-loop ATPase domain incorporates the sulfur into the base via adenylation (Fig. 4B). Alignment of the TusA proteins associating with ThiI+DUF814 revealed retention of the canonical conserved cysteine essential for sulfur transfer (Supplemental Material). The above web of contextual information presents several reasons strongly supporting the ThiI+DUF814 proteins being part of a distinct pathway catalyzing 4-thiouridylation in a subset of prokaryotes: (1) mutual exclusivity in phyletic distributions between ThiI+DUF814 and other ThiI-like enzymes indicates functional equivalence of the two (Supplemental Material). (2) The high degree of sequence similarity between the ThiI enzymes regardless of domain fusions and the specific conservation of critical residues necessary for ThiI’s role in 4-thiouridylation. (3) The presence of the TusA domain, which can compensate for the absence of the RHOD domain and act as a sulfur acceptor prior to transfer to the tRNA. This proposed 4-thiouridylation pathway is predicted to proceed as follows (Fig. 4B): (1) Analogous to its role in 2-thiouridylation of tRNA, the TusA domain likely accepts sulfur abstracted from the free cysteine pool in the cell by the IscS-like desulfurase domain;, the sulfur relay could also include transfer through IscU/NifU based on its scattered presence in gene neighborhoods (Fig. 1), potentially representing the first known involvement of IscU/NifU in tRNA thiolation as opposed to its standard role in FeS cluster biogenesis. (2) TusA then functions similar to the RHOD domain in standard 4-thiouridylation pathways by interacting with ThiI and positioning the donor sulfur near the active site of ThiI. (3) The DUF814 domain binds the tRNA substrate in lieu of the THUMP domain, positioning it for ThiI-catalyzed adenylation of the target uridine and ultimately sulfur transfer. Given these observations and previous connections between the NFACT gene family and nucleic acid-binding, we can also predict DUF814 acts as an RNA-binding domain in the NFACT gene family; henceforth, we refer to this domain as the NFACT RNA-binding (NFACT-R) domain. Solo NFACT-R domains prototyped by eukaryotic CCDC25/Jlp2-like proteins have not been previously experimentally characterized; however, functional coupling networks detect strong associations with several genes encoding diverse RNA-binding domains in humans, consistent with a role for the domain in RNA-binding. CCDC25 shares interactions with proteins which also interact with NFACT proteins, although the two have yet to be directly linked. The CCDC25/Jlp2-like family of NFACT-R domains exhibits strong conservation in eukaryotes suggesting purifying selection, a feature of RNA-associated domains functioning in core biological processes like base modification and translation. The close relationship between the solo NFACT-R domain and the version in the NFACT proteins along with its pan-eukaryotic distribution suggests that it likely emerged through partial duplication from a NFACT precursor prior to the Last Eukaryotic Common Ancestor (LECA). Hence, we predict that it is likely to function in an RNA-binding role in a core cellular function, perhaps even in the same complex as the NFACT proteins (see below).

DUF3441 domain

In the Pfam database, DUF3441 is presented as an eukaryote-specific domain. However, we recovered divergent yet clearly homologous versions at the C-terminus of archaeal NFACT proteins. For example, a PSI-BLAST search with the C-terminal domain from the archaeon Aeropyrum pernix recovered the entirety of the eukaryotic NFACT DUF3441 domain from the annelid Capitella teleta (gi: 443707183, iteration: 4, e-value: 2e−03). Given its presence as a core NFACT domain, we rename this domain the NFACT-C domain (NFACT C-terminal domain). Secondary structure predictions based on a multiple sequence alignment suggest that it adopts an α/β fold. In contrast to other domains in NFACT proteins, there is little absolute conservation of residues outside of a strongly-conserved PG motif (Supplemental Material); however, several positions in the domain are retained as different polar or charged residues. This pattern suggests NFACT-C could mediate protein–protein contacts within a larger complex rather than playing a catalytic role.

Contextual analysis of the NFACT gene family

We then investigated the NFACT family itself for potential conserved gene neighborhoods and functional interaction networks, as this information can provide insight into function of uncharacterized domains., Gene neighborhoods extracted for bacterial NFACT genes did not recover any conserved associations; however, archaeal NFACT genes formed a conserved gene neighborhood across all archaeal lineages, including the euryarchaeota, crenarchaeota, nanoarchaeota, thaumarchaeota, and the caldiarchaeota with two genes coding for: (1) An active Mut7-C RNase domain of the PIN nuclease fold with its accompanying C-terminal Zn-ribbon domain, and (2) the AMMECR1 domain containing the RAGNYA fold (Supplemental Material). The conserved connection to the Mut7-C RNase is again strongly suggestive of an RNA-related role for NFACT. Additionally, AMMECR1 has previously been linked to involvement in an as-yet-uncharacterized RNA base modification, potentially entailing the transfer of a modifying group onto an RNA base via a conserved cysteine. Notably, most AMMECR1 domains found in this conserved neighborhood retains all previously predicted enzymatic residues (except the crenarchaeon Vulcanisaeta where the predicted catalytic cysteine residue is replaced by a serine; see Supplemental Material). Functional interaction networks constructed primarily by co-expression patterns and protein–protein interaction data were also suggestive of an RNA-related role for the NFACT family. The yeast Tae2 protein is most strongly linked to a cluster of proteins involved in ribosomal subunit biogenesis. Consistent with this, recent studies identified it as a part of the large (60S) ribosomal subunit interacting Ribosomal Quality Control Complex (RQC)., Cognate NEMF proteins from mammals were predominantly linked to proteins harboring diverse RNA-binding domains, many with demonstrated roles in splicing and rRNA biogenesis.

Discussion

Emerging picture of NFACT as a potential RNA-modifying enzyme

Multiple independent lines of evidence support a RNA-related role for NFACT proteins: (1) the previously determined evolutionary relationship between the NFACT HhH domains and those observed in the S13/S18 ribosomal proteins, (2) the evolutionary relationship between the NFACT-N domain and the FMN-DG catalytic domain suggesting NFACT might function as an enzyme operating on bases in double stranded nucleic acids, (3) conserved operonic associations in archaea with the AMMECR1 and the Mut7-C RNase domain pointing in the direction of RNA processing and modification, and (4) experimental evidence from yeast, and functional network associations suggesting one or more rRNAs as probable substrates. In terms of RNA-modification reactions, parallels can be drawn between the glycosylase reaction catalyzed by FMN-DG and base-modification mechanisms catalyzed by structurally unrelated RNA-modifying enzymes: the pseudouridine synthases catalyzing formation of pseudouridine and the tRNA-guanine transglycosylases (TGTs) catalyzing the “base-swapping” mechanism, which inserts the preQ0 base precursor of archaeosine in archaea and queuosine in bacteria and eukaryotes.- Members of the pseudouridine synthase fold, which includes the TruB, RluA, RsuA, and TruA-like families,, similar to FMN-DG require an initial base-flipping step to position a base in the active site prior to modification. Pseudouridine synthases utilize an absolutely conserved aspartate as the primary catalytic residue during the uridine “base rotation” reaction resulting in formation of pseudouridine in tRNA substrates., Several families of pseudouridine synthases additionally contain a well-conserved lysine or arginine residue in the active site, which forms a salt bridge with the catalytic aspartate. Despite similarities, the aspartate in FMN-DG plays only a transient role in the catalytic mechanism, sharply contrasting the centrality of the aspartate in the pseudouridine synthase mechanism, which is thought to directly form a Michael adduct with a carbon in the pyrimidine ring of the uridine as a reaction intermediate., Thus, it seems unlikely NFACT would follow the mechanism template established in pseudouridine synthases. TGTs, members of the TIM-barrel fold, also require an absolutely conserved aspartate residue when catalyzing the swapping of a tRNA guanine base for the preQ0 precursor base. In contrast to the pseudouridine synthase mechanism and closer to the FMN-DNA glycosylase mechanism, this aspartate is involved in the initial step of the reaction facilitating the removal of the base from the sugar backbone. A further parallel with FMN-DG is observed in the base-exchange step wherein N9 of the imidazole ring of the preQ0 base directly attacks the C1’ of the “base-less” sugar, resulting in attachment of the preQ0 to the tRNA sugar backbone (Fig. 4A). This step strikingly resembles the attack by the nitrogen atom belonging to the conserved FMN-DG proline residue on the C1’ of the DNA substrate backbone following base removal. It is possible that the secondary emergence of the proline and the Schiff base observed in FMN-DG activity displaced a nitrogen-based attack from an incoming base similar to the elucidated TGT mechanism in the ancestral NFACT-N domain (Fig. 4A). This reasoning leads to a proposal that the NFACT proteins could possibly catalyze a base-exchange reaction in which a regular RNA base is initially removed and replaced by a new base, similar to the action of the TGTs. While more speculative in nature, it is possible to obtain certain clues regarding the nature of the potential target of the predicted NFACT-N catalytic domain: if NFACT-N were to act similar to FMN-DG, then among the endogenous bases in RNA uridine or cytidine contain spatially comparable carbonyl groups to FMN-DG substrates. In addition to formation of pseudouridine and several pseudouridine derivatives, a range of rRNA base modifications have been previously characterized and include various forms of methylated, hydroxylated, and acetylated bases., However, while pseudouridine and deazaguanine (e.g., preQ0) formation shares some kinship with base-exchange reactions and at least some modified rRNA bases do exist as free bases in the cell, most well-studied base modifications typically proceed via direct enzymatic attachment of a chemical group on an existing base and not through a base-exchange reaction. Additionally, it is not clear that NFACT catalyzes a terminal reaction on a base: a potential exchanged base could, similar to preQ0 attachment via TGT, be followed by further modifications catalyzed by distinct enzymes. Another possibility is that NFACT could act as an rRNA repair enzyme, replacing damaged bases with normal bases. Oxidation of RNA bases, particularly in conjunction with cell death, is currently an area of emerging research interest;, given the connections between NFACT-N and FMN-DG, this possibility could warrant additional investigation. This proposed role for NFACT in rRNA base-modification potentially unifies some of the disparate experimental findings on these proteins. The yeast NFACT protein Tae2 appears to activate the transcription factor Hsf1 in response to translational stress detected by the RQC, ultimately leading to degradation of peptides derived from mRNAs lacking a stop codon. It is conceivable that this proceeds via an effect of the proposed modification on translation fidelity consistent with the observed role of certain rRNA modifications, such as pseudouridylation, on ribosomal stability and translation fidelity. Translation of Internal Ribosomal Entry Sites (IRES)-bearing mRNAs, which play a role in responding to stress conditions across eukaryotes, also depends on rRNA base modifications like pseudouridylation., Hence, it is conceivable that the proposed modification mediated by NFACT proteins might also intersect with IRES-mediated translation under stress. The Drosophila NFACT protein caliban has been proposed to be part of a network including p53, caspase, and Hid proteins during DNA damage-induced apoptosis., Here again the modification could play a role as part of a stress response ensuing from DNA damage, as many key oncogenes and apoptosis factors are translated via IRES. Certain studies have suggested that members of the NFACT family are virulence factors of pathogenic bacteria involved in mediating adhesion and invasion in light of its capacity to bind fibronectin/fibrinogen.-,- These studies have also sought to demonstrate its presence in the extracellular space around some of these bacteria.,, From an evolutionary perspective, however, the nearly universal phyletic distribution pattern of NFACT is indicative of a broadly conserved fundamental functional role. Hence, any role restricted to a subset of pathogenic species in the bacterial superkingdom is unlikely to represent an ancestral role. Building on this, the strong evolutionary conservation observed across the NFACT gene family is inconsistent with typical bacterial virulence factors involved in invasion, which are under strong selective pressure to diverge rapidly in response to evolving host factors. NFACT’s notable presence in the extracellular space is puzzling, particularly since groups studying it as FbpA and our own computational analyses failed to identify any means of transport to the cell surface either coded as a signal within NFACT (i.e., a signal anchor) or via other transport factors. One possible explanation for this, not explored in the current literature, is that the extracellular NFACT proteins are byproducts of cell lysis during biofilm formation; consistent with this, these proteins have been identified in extracellular space occupied by biofilm with no apparent direct attachment to the cell membrane.,, This would also be consistent with recent results questioning various aspects of potential direct NFACT roles in mediating adhesion and invasion during infection.,- Thus, the proposed extracellular fibronectin/fibrinogen-binding role is unlikely to be a general one for this family and might merely reflect promiscuous interactions mediated by the extensive coiled-coil regions and facilitated by biofilm formation. However, we cannot rule out the possibility that in certain pathogenic bacteria NFACT might contribute specifically to biofilm formation or adhesion.

General conclusions

Through synthesis of sequence, genome, functional interaction, and structural data, we propose a potential role for the NFACT proteins conserved across the three superkingdoms of life in RNA-base modification in the context of translation. We present evidence that this function is mediated by the catalytic activity of the NFACT-N domain, shown here to be related to DNA glycosylases. We also predict an RNA-binding role for the NFACT-R (formerly DUF814) domain. In a diverse subset of bacteria lacking the canonical 4-thiouridylation pathway, NFACT-R is also predicted to contribute to a novel variant of the tRNA 4-thiouridylation pathway. This pathway appears to combine the ThiI domain observed in the canonical tRNA 4-thiouridylation pathway, the TusA domain hitherto known only from the tRNA 2-thiouridylation pathway, and the NFACT-R domain. This observation points to a novel variation in the ancient thiolated base biosynthesis pathways in bacteria and shows how they have repeatedly drawn components from a common pool of players involved in synthesis of sulfur-containing metabolites.,

Materials and Methods

Iterative sequence profile searches were performed using the web implementation of the PSI-BLAST program (with the following listed parameters different from default: -num_descriptions 20000, -evalue 20, -comp_based_stats 1, -pseudocount 30, -psi-blast_threshold 0.01) and web version 1.5 of the JACKHMMER (http://hmmer.janelia.org/search/jackhmmer) program run with default parameters against the non-redundant (NR) protein database at the National Center for Biotechnology Information (NCBI). Multiple sequence alignments were built by the Kalign2 and MUSCLE programs with default parameters, followed by manual adjustments on the basis of profile–profile and structural alignments. Similarity-based clustering for both classification and culling of nearly identical sequences was performed using the BLASTCLUST program with empirically determined length and score threshold parameters (ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.html). The web-based implementation of the HHpred program based on the HHsuite-2.0.15 software package with default parameters was used for profile–profile comparisons, searching against the PDB70 and pfamA_27.0 pre-configured databases. Structure similarity searches were performed using the DaliLite v. 3 program. Secondary structures were predicted using the JPred 3 program with default parameters. For previously known domains, the Pfam database release 27.0 was used as a guide and augmented by addition of newly detected divergent members. Structural visualization and manipulations were performed using the Open-Source PyMOL 1.5.0.3 (http://www.pymol.org) program. Funcoup3.0 was used to analyze contextual information based in interaction and expression data. Gene neighborhoods were determined using either the PTT file (downloadable from the NCBI ftp site) or the GenBank file in the case of whole genome shot gun sequences. The protein sequences of all neighbors were clustered using the BLASTCLUST program (ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.html) to identify related sequences in gene neighborhoods. Each cluster of homologous proteins were then assigned an annotation based on the domain architecture or conserved shared domain. Neighborhoods were further refined by ensuring that genes are unidirectional on the same strand of DNA and shared a putative common promoter to be counted as a single operon. If they were on opposite strands they were examined for potential bidirectional promoter sharing patterns. In-house Perl scripts were used to automate this analysis of genome context.

89 in total

Review 1. Eukaryotic endonuclease VIII-like proteins: new components of the base excision DNA repair system.

Authors: I R Grin; D O Zharkov
Journal: Biochemistry (Mosc) Date: 2011-01 Impact factor: 2.487

2. Role of fibronectin-binding protein A in Clostridium difficile intestinal colonization.

Authors: Amira Barketi-Klai; Sandra Hoys; Sylvie Lambert-Bordes; Anne Collignon; Imad Kansau
Journal: J Med Microbiol Date: 2011-02-24 Impact factor: 2.472

Review 3. Oxidative damage to RNA: mechanisms, consequences, and diseases.

Authors: Qiongman Kong; Chien-Liang Glenn Lin
Journal: Cell Mol Life Sci Date: 2010-02-11 Impact factor: 9.261

4. Functional diversification of the RING finger and other binuclear treble clef domains in prokaryotes and the early evolution of the ubiquitin system.

Authors: A Maxwell Burroughs; Lakshminarayan M Iyer; L Aravind
Journal: Mol Biosyst Date: 2011-05-06

Review 5. Fibronectin: a multidomain host adhesin targeted by bacterial fibronectin-binding proteins.

Authors: Brian Henderson; Sean Nair; Jaqueline Pallas; Mark A Williams
Journal: FEMS Microbiol Rev Date: 2011-01 Impact factor: 16.408

6. Manganese binds to Clostridium difficile Fbp68 and is essential for fibronectin binding.

Authors: Yi-Pin Lin; Chih-Jung Kuo; Xhelil Koleci; Sean P McDonough; Yung-Fu Chang
Journal: J Biol Chem Date: 2010-11-09 Impact factor: 5.157

7. rRNA pseudouridylation defects affect ribosomal ligand binding and translational fidelity from yeast to human cells.

Authors: Karen Jack; Cristian Bellodi; Dori M Landry; Rachel O Niederer; Arturas Meskauskas; Sharmishtha Musalgaonkar; Noam Kopmar; Olya Krasnykh; Alison M Dean; Sunnie R Thompson; Davide Ruggero; Jonathan D Dinman
Journal: Mol Cell Date: 2011-11-18 Impact factor: 17.970

8. Structural basis for Fe-S cluster assembly and tRNA thiolation mediated by IscS protein-protein interactions.

Authors: Rong Shi; Ariane Proteau; Magda Villarroya; Ismaïl Moukadiri; Linhua Zhang; Jean-François Trempe; Allan Matte; M Eugenia Armengod; Miroslaw Cygler
Journal: PLoS Biol Date: 2010-04-13 Impact factor: 8.029

9. Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

Authors: Takuro Nunoura; Yoshihiro Takaki; Jungo Kakuta; Shinro Nishi; Junichi Sugahara; Hiromi Kazama; Gab-Joo Chee; Masahira Hattori; Akio Kanai; Haruyuki Atomi; Ken Takai; Hideto Takami
Journal: Nucleic Acids Res Date: 2010-12-15 Impact factor: 16.971

10. The RNA Modification Database, RNAMDB: 2011 update.

Authors: William A Cantara; Pamela F Crain; Jef Rozenski; James A McCloskey; Kimberly A Harris; Xiaonong Zhang; Franck A P Vendeix; Daniele Fabris; Paul F Agris
Journal: Nucleic Acids Res Date: 2010-11-10 Impact factor: 16.971

12 in total

Review 1. Resilience of biochemical activity in protein domains in the face of structural divergence.

Authors: Dapeng Zhang; Lakshminarayan M Iyer; A Maxwell Burroughs; L Aravind
Journal: Curr Opin Struct Biol Date: 2014-06-19 Impact factor: 6.809

2. Structural basis for translational surveillance by the large ribosomal subunit-associated protein quality control complex.

Authors: Dmitry Lyumkis; Dario Oliveira dos Passos; Erich B Tahara; Kristofor Webb; Eric J Bennett; Staal Vinterbo; Clinton S Potter; Bridget Carragher; Claudio A P Joazeiro
Journal: Proc Natl Acad Sci U S A Date: 2014-10-27 Impact factor: 11.205

3. Oxidative opening of the aromatic ring: Tracing the natural history of a large superfamily of dioxygenase domains and their relatives.

Authors: A Maxwell Burroughs; Margaret E Glasner; Kevin P Barry; Erika A Taylor; L Aravind
Journal: J Biol Chem Date: 2019-05-15 Impact factor: 5.157

4. Alanine Tails Signal Proteolysis in Bacterial Ribosome-Associated Quality Control.

Authors: Iryna Lytvynenko; Helge Paternoga; Anna Thrun; Annika Balke; Tina A Müller; Christina H Chiang; Katja Nagler; George Tsaprailis; Simon Anders; Ilka Bischofs; Julie A Maupin-Furlow; Christian M T Spahn; Claudio A P Joazeiro
Journal: Cell Date: 2019-05-30 Impact factor: 41.582

Review 5. Detection and Degradation of Stalled Nascent Chains via Ribosome-Associated Quality Control.

Authors: Cole S Sitron; Onn Brandman
Journal: Annu Rev Biochem Date: 2020-06-20 Impact factor: 23.643

6. Protein synthesis. Rqc2p and 60S ribosomal subunits mediate mRNA-independent elongation of nascent chains.

Authors: Peter S Shen; Joseph Park; Yidan Qin; Xueming Li; Krishna Parsawar; Matthew H Larson; James Cox; Yifan Cheng; Alan M Lambowitz; Jonathan S Weissman; Onn Brandman; Adam Frost
Journal: Science Date: 2015-01-02 Impact factor: 47.728

Review 7. Ribosome-associated quality-control mechanisms from bacteria to humans.

Authors: Sebastian Filbeck; Federico Cerullo; Stefan Pfeffer; Claudio A P Joazeiro
Journal: Mol Cell Date: 2022-04-21 Impact factor: 19.328

8. Convergence of mammalian RQC and C-end rule proteolytic pathways via alanine tailing.

Authors: Anna Thrun; Aitor Garzia; Yu Kigoshi-Tansho; Pratik R Patil; Charles S Umbaugh; Teresa Dallinger; Jia Liu; Sylvia Kreger; Annarita Patrizi; Gregory A Cox; Thomas Tuschl; Claudio A P Joazeiro
Journal: Mol Cell Date: 2021-04-27 Impact factor: 17.970

9. The eukaryotic translation initiation regulator CDC123 defines a divergent clade of ATP-grasp enzymes with a predicted role in novel protein modifications.

Authors: A Maxwell Burroughs; Dapeng Zhang; L Aravind
Journal: Biol Direct Date: 2015-05-15 Impact factor: 4.540

10. Structure and assembly pathway of the ribosome quality control complex.

Authors: Sichen Shao; Alan Brown; Balaji Santhanam; Ramanujan S Hegde
Journal: Mol Cell Date: 2015-01-08 Impact factor: 17.970