Literature DB >> 17576679

Functional specialization of domains tandemly duplicated within 16S rRNA methyltransferase RsmC.

S Sunita1, Elzbieta Purta, Malgorzata Durawa, Karolina L Tkaczuk, J Swaathi, Janusz M Bujnicki, J Sivaraman.   

Abstract

RNA methyltransferases (MTases) are important players in the biogenesis and regulation of the ribosome, the cellular machine for protein synthesis. n class="Chemical">RsmC is a MTase that catalyzes the transfer of a methyl group from S-adenosyl-l-methionine (SAM) to G1207 of 16S rRNA. Mutations of G1207 have dominant lethal phenotypes in Escherichia coli, underscoring the significance of this modified nucleotide for ribosome function. Here we report the crystal structure of E. coli RsmC refined to 2.1 A resolution, which reveals two homologous domains tandemly duplicated within a single polypeptide. We characterized the function of the individual domains and identified key residues involved in binding of rRNA and SAM, and in catalysis. We also discovered that one of the domains is important for the folding of the other. Domain duplication and subfunctionalization by complementary degeneration of redundant functions (in particular substrate binding versus catalysis) has been reported for many enzymes, including those involved in RNA metabolism. Thus, RsmC can be regarded as a model system for functional streamlining of domains accompanied by the development of dependencies concerning folding and stability.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17576679      PMCID: PMC1934991          DOI: 10.1093/nar/gkm411

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

n class="Chemical">S-adenosyl-l-methionine (SAM)-dependent methyltransferases (MTases) transfer the methyl group from SAM to carbon, oxygen or nitrogen atoms of the target. Substrates for SAM-dependent MTases include DNA, RNA, proteins, polysaccharides, lipids or small molecules, implying their key importance in most biological processes (1,2) Methylation of DNA has crucial roles in DNA damage repair, regulation of expression and embryonic development (3). In prokaryotes, DNA MTases are part of restriction–modification systems, which protect the cells from viral invasion (4). Protein methylation is a post-translational process that typically occurs on arginine or lysine residues and is found in prokaryotic and eukaryotic signal transduction pathways, and a role in intracellular signaling has been identified (5). Recently, numerous RNA MTases have been discovered and their function studied (6). RNA plays a central role in the flow of biological information. Ribosomal RNA undergoes modifications by a number of enzymes during the maturation of ribosome. To date, over 100 different types of nucleotide modifications have been identified, out of which about one-third are present in rRNA (7). Of all spn class="Species">ecies, rRNA modifications are the best characterized in Escherichia coli (7). There are three basic types of post-transcriptional modifications found in rRNA, namely base methylation, ribose methylation and pseudouridylation. Base methylation is the simplest and most common type of modification found in rRNA of prokaryotes. It occurs at the final stages of ribosome maturation and the rRNA sequences in which it occurs are highly conserved. Ribose methylation, occurring at the 2′ hydroxyl position on the sugar backbone, is more common in eukaryotes, and is less frequent in bacteria (8). rRNA MTases play a crucial role in the assembly, maturation and regulation of the protein synthetic cellular machinery (9). In spite of the wealth of literature on rRNA methylation, the structural information currently available for RNA MTases is insufficient to elucidate their mn class="Species">echanism of action. The structure of an rRNA MTase in complex with the substrate RNA is available only for the ternary complex of E. coli 23S rRNA m5U MTase RlmD (previously called RumA) with the ribosomal substrate and SAH (S-adenosyl-l-homocysteine) (10). RNA MTases are thus relatively less well-understood compared to DNA MTases, whose structure–function relationships are well established (4). As a continuation of our efforts to understand the structure and function of rRNA modifying enzymes, we have undertaken structure determination and functional analysis of n class="Species">E. coli RsmC that specifically methylates the N2 atom of G1207 in 16S rRNA. The modified residue m2G1207 occurs in a region of the rRNA that is involved in the recognition of peptide chain termination codons. In vivo, transversion mutants of G1207, namely C1207 and U1207, were shown to have dominant lethal phenotypes (11). Here we report the crystal structure of n class="Chemical">RsmC from E. coli refined at 2.1 Å resolution. RsmC is the first structurally characterized MTase, which exhibits the phenomenon reported earlier for many enzymes, including those involved in RNA modification: presence of duplicated, mutually homologous domains, which preserved the ancestral 3D fold, but accumulated divergent mutations in different regions, leading to the complementary loss of conserved motifs and selective retention of different aspects of function present in the ancestral non-duplicated enzyme. Thus, we combined computational and experimental analyses to identify the key amino acids involved in different functions and to assign the roles to the two domains of RsmC.

MATERIALS AND METHODS

Recombinant DNA techniques

The n class="Chemical">rsmC gene, cloned into pCA24N vector with a non-cleavable N-terminal His6 tag and corresponding strain with the knocked-out rsmC gene, were obtained as a gift from the ASKA recloned library [NBRP (NIG, Japan): E. coli]. Site-directed mutagenesis of the rsmC gene was performed by a PCR-based technique according to the QuikChange site-directed mutagenesis strategy (Stratagene) following the manufacturer's instructions. NTD- and CTD-RsmC variants were constructed by recloning single domains into the pET28 vector by removing single domains in the PCR reaction. The mutant genes were sequenced and found to contain only the desired mutations.

Expression and purification

For the native protein, plasmid DNA carrying n class="Chemical">rsmC gene was transformed into E. coli BL21 (DE3) and grown in 1 l of LB media at 37°C till the OD600 reached 0.5–0.6. Induction of the culture was then carried out with 100 µM IPTG after cooling it down to room temperature. The cells were continuously grown overnight at 25°C in a shaking flask at 180 rpm. The next day, cells were harvested by centrifugation (9000 g for 30 min, 4°C) and pelleted. The cell pellet was first washed with pre-binding buffer (10 mM Na-Hepes pH 7.9, 0.17 M NaCl), and resuspended in 20 ml of binding buffer (20 mM Na–Hepes pH 7.9, 0.5 M NaCl), 5 mM imidazole (pH 7.0), 5%(v/v) glycerol, 10 mM BME, 0.5% (v/v) Triton-X-100 and 1 tablet of Complete™ EDTA-free protease inhibitor cocktail (Roche diagnostics). The buffer conditions were slight modifications to the ones mentioned in an earlier work describing the purification, cloning and characterization of RsmC (33). For the selenomethionine (SelMet) substituted RsmC, the cells were grown in Le-Master medium (36), using the DL41 strain of E. coli (methionine auxotroph). The purification of n class="Chemical">RsmC was carried out at room temperature. Both native and SelMet RsmC were purified using the same two-step protocol: DEAE sepharose (Amersham biosciences) column followed by Ni-NTA beads (Qiagen) purification. After binding the protein to the Ni-NTA resin for 30–40 min, the beads were washed with binding buffer (without Triton-X100). The protein was then eluted with 20 mM Na-Hepes pH 7.9, 0.5 M NaCl, 5 mM BME, 0.5 M imidazole, 5%(v/v) glycerol. Furthermore, RsmC was passed through a Superdex-200 gel filtration column using an AKTA-FPLC UPC-900 system (Amersham Biosciences). The gel filtration buffer was the same as the final protein storage buffer: 20 mM HEPES pH 7.9, 0.5 M NaCl, 5 mM BME, 10 mM MgCl2 and 5%(v/v) glycerol. The protein eluted as a monomer (∼40 kDa). The peak fractions were pooled together and concentrated to 4.5 mg/ml by ultra filtration, using a Centriprep centrifugal filter device from Millipore, with a molecular weight cut-off of 10 kDa.

Purification and refolding of C-RsmC from inclusion bodies

Inclusion bodies were colln class="Species">ected from the cell extract by centrifugation at 20 000 rpm and resuspended in buffer B (10 mM Tris, 50 mM NaCl, 10 mM imidazole) supplemented with 6 M urea. Dissolved pellet was then centrifuged, followed by addition of buffer B and Ni-NTA resin was equilibrated with buffer B. After 1 h incubation, Ni-NTA resin with the bound protein was washed three times with buffer B. The deletion mutant protein RsmC-CTD was eluted with elution buffer (10 mM Tris, 50 mM NaCl, 10 mM imidazole) supplemented with 6 M urea. Refolding of the purified RsmC-CTD was achieved by sequential dialysis with reducing urea concentrations from 6 M to 4 M, 2 M, 1 M, 0 M against RF buffer (100 mM Tris pH 8.8 400 mM l-arginine, 10% glycerol, 0.5% TritonX-100, 1 mM EDTA,1 mM DTT). The dialysis buffer was exchanged every 24 hours. The composition of the dialysis buffer (suitable for the subsequent ITC analyses) was 20 mM Na-Hepes pH 7.0, 300 mM NaCl, 5%(v/v) glycerol, 10 mM MgCl2, 10 mM BME.

In vitro methylation assay

30S ribosomal subunits were isolated as described previously (37). Quantitation of subunits was determined by an class="Species">bsorbance at 260 nm (1 A260 unit is equivalent to 34.5 pmol of 30S ribosomes). In vitro methylation reactions were carried out using 2 µg pure RsmC protein or its variants, 6 µM [methyl-14C]-SAM (52.8 mCi/mmol, NEN) and 3 µM 30S RNA ribosome subunit isolated from the rsmC_knockout (K.O.) strain in the total volume 60 µl of the buffer (50 mM PIPES [piperazine-N,N’-bis(2-ethanesulfonic acid)]-Na (pH 7.0), 4 mM MgCl2). After 60 min incubation at 37°C, methylation was stopped by heating the reaction mixture to 70°C for 10 min. The RNA was precipitated with 10% TCA onto Whatman GF/C filter disks. The disks were washed twice with 5% TCA, once with 5 ml ethanol and air-dried. The filter-bound radioactivity was determined by liquid scintillation counting.

MALDI-TOF analysis

The native and n class="Chemical">SelMet substituted RsmC was further analyzed for the incorporation of selenium on a Voyager-STR MALDI-TOF mass spectrometer (Applied Biosystems) by comparing the experimentally measured molecular weight of the native protein with that of the SelMet protein.

Dynamic light scattering (DLS)

DLS measurements were performed at room temperature on a DynaPro (Protein Solutions) DLS instrument. The homogeneity of the protein n class="Chemical">samples was monitored during the various stages of concentration in order to avoid aggregation. The percentage of polydispersity was below 16% and the SOS error was less than 10 for all protein samples at various concentrations.

Isothermal titration calorimetry (ITC)

n class="Chemical">SAM was procured from MP biomedicals. For the titration experiments, the protein (both native and variants), was extensively dialyzed against a 500-fold excess volume of the buffer containing 20 mM Na-Hepes pH 7.0, 5%(v/v) glycerol, 10 mM MgCl2, 0.3 M NaCl, 10 mM BME, for ∼14 h. SAM solutions were prepared by weight, in the same dialysis buffer. The ITC experiments were carried out using VP-ITC calorimeter (Microcal, LLC) at 20°C using 0.02–0.06 mM of the protein in the sample cell and 1–2 mM of SAM in the injector. All samples were thoroughly degassed and then centrifuged to get rid of precipitates. Injection volumes of 4–5 µl per injection were used for the different experiments and for every experiment, the heat of dilution for each ligand was measured and subtracted from the calorimetric titration experimental runs for the protein. Consecutive injections were separated by at least 4 min to allow the peak to return to the baseline. The ITC data was analyzed using a single site fitting model using Origin 7.0 (OriginLab Corp.) software.

Crystallization and data collection

n class="Chemical">RsmC was crystallized using the hanging drop vapor diffusion method. Initial crystals were obtained from a Jena Biosciences (Jena, Germany) screen and further optimized. The best crystals were obtained when a volume of 1 µl of reservoir solution containing 25% (w/v) PEG MME 5000, 0.1 M TrisHCl pH 8.5, 0.2 M ammonium sulfate was mixed with 1 µl of protein (Hanging drop). Diffraction quality crystals formed in 3 days, with the smallest dimension measuring ∼0.14 mm. RsmC crystals belonged to the space group C2 with one molecule in the asymmetric unit. The cell parameters were a = 123.94, b = 51.50, c = 73.33, ß = 121.52. The Matthew's co-efficient was 2.49 Å3/Da and the solvent content 50.7% (38). The crystals were dirn class="Species">ectly taken from the drop, and flash cooled in a N2 cold stream at 100°K. The native crystals diffracted up to 2.5 Å resolution using an R-axis 1V++ image plate detector mounted on a RU-H3RHB rotating anode generator (Rigaku Corp., Tokyo, Japan). Synchrotron data were collected at beam lines X12C and X29, NSLS, Brookhaven National Laboratory for the SelMet protein. A complete SAD dataset was collected (Table 1) using Quantum 4-CCD detector (Area Detector Systems Corp., Poway, CA, USA) to 2.1 Å resolution. Data were processed and scaled using the program HKL2000 (39).
Table 1.

Crystallographic data and refinement statistics

Data setPeakHigh resolution
Cell parameters and Space groupa = 123.94, b = 51.50, c = 73.33, β = 121.52a = 123.94, b = 51.50, c = 73.33, β = 121.52
C2C2
Data collection
Resolution range (Å)20–2.450–2.04
Wavelength (Å)0.97920.9792
Observed reflections > 1σ110250170781
Unique reflections1518746172
Completeness (%)96.993.5
Overall (I/σI)12.915.2
Rsyma (%)7.97.4
Refinement and quality
Resolution range (Å)15–2.1
Rworkb (no. of reflections)0.204 (37909)
Rfreec (no. of reflections)0.256 (3223)
RMSD bond lengths (Å)0.01
RMSD bond angles(°)1.78
Average B-factorsd2)
    Main chain25.03
    Side chain27.68
B-rmsd main chain (Å2)1.33
B-rmsd side chain (Å2)2.25
Ramachandran plot
Most favored regions (%)88.6
Additional allowed regions (%)10
Generously allowed regions (%)1.3
Disallowed regions (%)0

aRsym = Σ|Ii − 〈I〉|/|Ii| where Ii is the intensity of the ith measurement, and 〈I〉 is the mean intensity for that reflection.

bRwork = |Fobs−Fcalc|/|Fobs| where Fcalc and Fobs are the calculated and observed structure factor amplitudes, respectively.

cRfree = as for Rwork, but for 8% of the total reflections chosen at random and omitted from refinement.

dIndividual B-factor refinements were calculated.

Crystallographic data and refinement statistics aRsym = Σ|Ii − 〈I〉|/|Ii| where Ii is the intensity of the ith measurement, and 〈I〉 is the mean intensity for that reflpan class="Species">ection. bRwork = |Fon class="Species">bs−Fcalc|/|Fobs| where Fcalc and Fobs are the calculated and observed structure factor amplitudes, respectively. cRfree = as for Rwork, but for 8% of the total reflpan class="Species">ections chosen at random and omitted from refinement. dIndividual B-factor refinements were calculated.

Structure solution and refinement

Of the expn class="Species">ected seven selenium sites in the asymmetric unit, five were located by the program SOLVE (40). The N-terminal, as well as the C-terminal methionine, was disordered. The initial phases were further improved by density modification using Sharp (v 3.0.15) (41) that improved the overall figure of merit (FOM) to 0.73. The ARP/wARP (42) built ∼65% of the molecule. The remaining parts of the model were built manually using the program O (43). Further cycles of model building alternating with refinement using the program CNS (44) resulted in the final model, with an R-factor of 0.21 (Rfree = 0.26) to 2.1 Å resolution with no sigma cutoff used during refinement. The final model comprises 334 residues (Ala3-Met336) and 231 water molecules. The N-terminal His-tag and the linker residues were not visible in the electron density map. PROCHECK (13) analysis shows no residues in the disallowed regions of the Ramachandran plot. A simulated annealing Fo–Fc omit map of the putative SAM-binding site of RsmC is shown (Figure 1c).
Figure 1.

Ribbon diagram showing the domain duplication in the RsmC structure. (a) Full-length protein. The N-terminal domain (putative RNA-binding domain: residues 3–150) is depicted in red and the C-terminal domain (SAM-binding domain: residues 179–336) in blue. The N- and C-termini are labeled. (b) Superposition of the N-terminal and C-terminal domains (blue and red, respectively) in stereo. These figures were prepared using the programs MOLSCRIPT (48) and Raster3D (49). (c) Simulated annealing Fo–Fc omit map in the putative SAM-binding site of RsmC. The key residue Asp202 and all atoms within 3.5 Å of Asp202 were omitted prior to refinement and map calculation. The map is contoured at a level of 3.0σ. This figure was prepared using PyMol (www.pymol.org).

Ribbon diagram showing the domain duplication in the n class="Chemical">RsmC structure. (a) Full-length protein. The N-terminal domain (putative RNA-binding domain: residues 3–150) is depicted in red and the C-terminal domain (SAM-binding domain: residues 179–336) in blue. The N- and C-termini are labeled. (b) Superposition of the N-terminal and C-terminal domains (blue and red, respectively) in stereo. These figures were prepared using the programs MOLSCRIPT (48) and Raster3D (49). (c) Simulated annealing Fo–Fc omit map in the putative SAM-binding site of RsmC. The key residue Asp202 and all atoms within 3.5 Å of Asp202 were omitted prior to refinement and map calculation. The map is contoured at a level of 3.0σ. This figure was prepared using PyMol (www.pymol.org).

Bioinformatics analyses

Sequence searches were carried out with PSI-BLAST (45), and multiple sequence alignment was constructed with MUSCLE (46). Sequence conservation was calculated from the sequence alignment and mapped onto the protein structure using ConSurf (47). Structure manipulations and modeling was carried out with SwissPDBViewer and PyMol. Structure database searches and superpositions were done with DALI (16).

Protein Data Bank accession code

Coordinates and structure factors have been deposited with RCSB Protein Data Bank with code 2PJD.

RESULTS

Overall structure

The structure of n class="Chemical">RsmC from E. coli was solved by the single-wavelength anomalous dispersion (SAD) (12) method from synchrotron data using SelMet-labeled protein and was refined to a final R-factor of 0.21 (Rfree = 0.26%) at 2.1 Å resolution. The asymmetric unit contains one RsmC molecule comprising 334 residues from Ala3 to Met336 and a total of 232 water molecules. Neither the N-terminal His-tag nor the C-terminal residues Thr337-Gly343 had interpretable density and were not modeled. The RsmC molecules eluted as a monomer from the gel filtration column. This was consistent with observations in the dynamic light scattering experiments as well as the analysis of intermolecular contacts in the crystal (data not shown). Analysis of the Ramachandran plot using the program PROCHECK (13) showed 88.6% of all residues within the most favored regions and no residues in the disallowed regions. The crystallographic statistics are given in Table 1. The structure of the full-length n class="Chemical">RsmC with overall dimensions of ∼35 × 40 × 60 Å reveals the presence of two homologous domains of a mixed α/β fold, characteristic for SAM-dependent MTases (Figure 1 ribbon diagram). The existence of intramolecular homology in RsmC has been earlier predicted by bioinformatics analysis (14). The N-terminal domain (NTD) consists of seven β-strands and five α-helices and the C-terminal domain (CTD) has nine β-strands and six α-helices. The NPPF (N269-F271) tetrapeptide motif, which is conserved in m2G MTases (15) is located in a loop between β4 and α5 of the CTD (Figure 2). This motif is absent from the NTD.
Figure 2.

Structure-based sequence alignment of two domains of RsmC, RlmG, together with their closest homolog MJ0882 (1dus). The superposition of RsmC-NTD, RsmC-CTD and MJ0882 was performed with O program (43). For RsmC and RlmG families three representative members are shown: E. coli (Ec), P. aeruginosa (Pa) and V. cholerae (Vc). For MJ0882, homologs are from B. subtilis (Bs) and T. maritima (Tm). Residues that are conserved within families are highlighted. For EcRsmC, the sequence ruler and the secondary structural elements are shown in the upper panel. Common motifs (conservation at the 3D level) are indicated below the alignment and conserved regions of functional importance (RNA-binding in the NTD and SAM-binding and catalysis in the CTD) are boxed.

Structure-based sequence alignment of two domains of n class="Chemical">RsmC, RlmG, together with their closest homolog MJ0882 (1dus). The superposition of RsmC-NTD, RsmC-CTD and MJ0882 was performed with O program (43). For RsmC and RlmG families three representative members are shown: E. coli (Ec), P. aeruginosa (Pa) and V. cholerae (Vc). For MJ0882, homologs are from B. subtilis (Bs) and T. maritima (Tm). Residues that are conserved within families are highlighted. For EcRsmC, the sequence ruler and the secondary structural elements are shown in the upper panel. Common motifs (conservation at the 3D level) are indicated below the alignment and conserved regions of functional importance (RNA-binding in the NTD and SAM-binding and catalysis in the CTD) are boxed. The DALI search (16) shows that there is no structure in the PDB with global similarity to the entire n class="Chemical">RsmC. However, the isolated NTD and CTD show expected similarity to SAM-dependant MTases from the RFM superfamily (17) as well as to each other. In particular, the NTD shows higher similarity to the CTD than to any other structure: RMSD 2.4 Å for 135 Cα atoms, DALI Z-score of 13.1. As predicted by bioinformatics analyses (14), among other proteins of known structure, MJ0882, a putative MTase from Methanococcus jannaschii (PDB code 1dus) is the closest homolog of both NTD and CTD: it superimposes onto the NTD with 2.5 Å RMSD over 138 Cα atoms, DALI Z-score of 13.1 and onto the CTD with RMSD 2.0 Å over 173 Cα atoms, DALI Z-score of 23.3. Other MTases from the large RFM superfamily show significant, but lower structural similarity (data not shown). Although the structures of the NTD and CTD of n class="Chemical">RsmC are highly similar to each other, the structure-based sequence alignment of the two domains indicates that there is only 12% amino acid identity between them (Figure 2). Clearly noticeable is the preservation of a non-polar character of the residues forming the ß-sheet core of both domains and the lack of conservation of residues at the surface. These features suggest that both domains of RsmC originated by intragenic tandem duplication from a primitive single-domain ancestor similar to MJ0882, and that they accumulated divergent mutations that made them dissimilar on the surface, while preserving the structural scaffold. It is important to note that the NTD appears to have accumulated more sequence and structural changes than the CTD with respect to MJ0882: while the CTD exhibits 22.7% amino acid sequence identity to MJ0882, the NTD shows 11.4% identity both to the CTD and to MJ0882 (see also the aforementioned DALI Z-scores, 23.3 versus 13.1). Although the sequence analysis of n class="Chemical">RsmC had been reported (14), thus far no high-resolution structure was available to provide a 3D framework for sequence-function considerations. Both domains of RsmC are members of the RFM superfamily of MTases, which is characterized by the presence of a series of motifs conserved at the structural level, and typically also at the sequence level (17). Motifs I, II and III form a SAM-binding pocket, while motifs X and IV usually form the ‘floor’ and the ‘roof’ of the catalytic site and may be important for the methyl group donor SAM and substrate binding, positioning them in optimal orientation for the methyl group transfer to occur. Motif VI often participates in the formation of the active site from the substrate side, motifs V and VII are typically important for the structural stability and motif VIII can participate in substrate binding. On the sequence level, motif I is strongly conserved among nearly all members of the RFM superfamily and typically assumes the pattern similar to (D/E)XGXGXG. Motif IV typically contains the key substrate-binding and/or catalytic residues and assumes very different sequence patterns in MTase families acting on different molecules. In MTases acting on exocyclic amino groups of nucleic acid bases (those yielding m6A, m4C and m2G modifications), the typical pattern of conservation is (N/D/S)PP(Y/F/W/H) (15). To identify the potential functionally important sites in both domains of n class="Chemical">RsmC, we calculated the sequence conservation in the RsmC family and mapped it onto the protein surface. This analysis reveals two conserved patches: the larger one lining up a deep pocket in the CTD formed by motifs: X, I, II, III, IV and VI, and the smaller one on the exposed protuberance of the NTD formed by motifs VII and VIII. Importantly, the conservation is asymmetric across the domains—neither the NTD pocket nor the CTD protuberance shows any significant conservation (Figure 3A). On the other hand, mapping of the electrostatic potential on the surface of RsmC reveals that the protein is almost uniformly negatively charged with the exception of a small positive patch on the conserved NTD protuberance (Figure 3B). We carried out analogous analyses for a comparative model of RlmG (YgjO), a MTase closely related to RsmC and also exhibiting two domains, but specific for m2G modification at the G1835 another position of 23S rRNA (18). The distribution of conservation in the RlmG family is similar to that in the RsmC family, with high conservation in the CTD pocket and on the NTD protuberance (Figure 2 and Supplementary Figure 1). Interestingly, while the CTD pocket is conserved between RsmC and RlmG, the NTD protuberance is not, i.e. motifs VII and VIII in both families exhibit different conserved amino acids (Figure 2). RlmG is also negatively charged, with positive patches on both NTD and CTD protuberances (Supplementary Figure 2). Conservation of the pocket with motifs I and IV suggest that the CTD of RsmC and RlmG is important for binding of the SAM cofactor and the catalysis of the methyl transfer reaction. On the other hand, a positively charged protuberance that shows differential conservation in MTase families of different specificity is likely to be important for the recognition and binding of their different rRNA substrates. This prediction is further supported by bioinformatics methods for prediction of RNA-binding sites RNABindR (19) and BindN (20) that identify region 130–145 (encompassing motif VIII in the NTD) as a likely RNA-binding site (data not shown).
Figure 3.

(A) Amino acid sequence conservation in the RsmC family mapped onto the RsmC surface using ConSurf (from red: no conservation, to blue: identity). (B) Electrostatic potential mapped onto the RsmC surface (from red −5 kT to blue, +5 kT): showing the positively charged protuberance in the NTD and the dominantly negatively charged CTD (left and right, respectively) separated by a cleft. (C) Ribbon diagram of RsmC with residues studied by mutagenesis shown in white. The docked SAM molecule is shown in orange, and the docked substrate guanosine 1207 in cyan. Ligands were docked manually, to visualize the active site, in analogy to other MTase structures. (D) A detailed view of the predicted ligand-binding/active site pocket in RsmC. Atoms of docked ligands and predicted ligand-binding residues are colored using the following scheme: C, gray; O, red; N, blue; S (in SAM) and P (in guanosine), yellow. The figures were produced with SwissPDBViewer.

(A) Amino acid sequence conservation in the n class="Chemical">RsmC family mapped onto the RsmC surface using ConSurf (from red: no conservation, to blue: identity). (B) Electrostatic potential mapped onto the RsmC surface (from red −5 kT to blue, +5 kT): showing the positively charged protuberance in the NTD and the dominantly negatively charged CTD (left and right, respectively) separated by a cleft. (C) Ribbon diagram of RsmC with residues studied by mutagenesis shown in white. The docked SAM molecule is shown in orange, and the docked substrate guanosine 1207 in cyan. Ligands were docked manually, to visualize the active site, in analogy to other MTase structures. (D) A detailed view of the predicted ligand-binding/active site pocket in RsmC. Atoms of docked ligands and predicted ligand-binding residues are colored using the following scheme: C, gray; O, red; N, blue; S (in SAM) and P (in guanosine), yellow. The figures were produced with SwissPDBViewer.

Structure–function relationships in RsmC

To characterize the function of each domain of n class="Chemical">RsmC and to confirm the predicted role of individual residues, we designed and constructed two deletion mutants corresponding to the isolated NTD and CTD (amino acids 1–158 and 159–336, respectively), and a series of point mutants of conserved residues in the full-length RsmC that mapped to the predicted SAM-binding site, guanosine-binding/catalytic site and the RNA-binding site. For the potential RNA-binding site we constructed three double mutants in the neighboring positively charged residues (Figure 3C). The NTD as well as the point mutants expressed and purified easily using procedure optimized for the wild-type protein, while the isolated CTD turned out to be very difficult to purify in these conditions and only the purification and refolding from inclusion bodies enabled us to obtain sufficient amounts of the deletion mutant protein for further experiments. It is known that a maltose-binding protein (MBP) can act as a ‘passive chaperone’ to improve the solubility and promote the proper folding of their fusion partners (21). Thus, we constructed two variants of the RsmC CTD, fused to the MBP either in the N- or C-terminus of the isolated domain (i.e. MBP–CTD or CTD–MBP). We found that the MBP–CTD fusion protein purifies well, similar to the wild-type RsmC (NTD–CTD), while the CTD–MBP fusion protein purifies poorly, similar to the isolated CTD (data not shown). In the MBP–CTD fusion, the MBP domain physically replaces the NTD of the wt RsmC and has the opportunity to fold before the CTD, as it leaves the ribosome earlier. On the other hand, in the CTD–MBP fusion CTD leaves the ribosome first, and it is likely that it starts to fold before it has a chance to interact with the MBP domain. Our results suggest that the RsmC CTD has lost the ability to fold on its own and requires a pre-folded ‘intramolecular chaperone’ localized at its N-terminus, be it the NTD or another well-folded domain such as MBP. In order to characterize the function of individual residues, we carried out the functional, biochemical and biophysical characterization of the point mutants. The biochemical assay involving the in vitro methylation of ribosomes isolated from the n class="Chemical">rsmC▵ strain (see ‘Materials and methods’ section for details) revealed that all mutant proteins exhibit reduced activities compared to the wild-type RsmC (Figure 4). In particular, alanine substitution of residues predicted to be important for SAM binding showed the most severe loss of activity (D202A in motif I to 4% and D227A in motif II to 13%). The alanine substitution of the Asn residue in the predicted catalytic NPPF motif IV (N268A) has reduced the activity to 20% of the wild-type. On the other hand, substitutions of individual residues in the predicted RNA-binding site had relatively mild effects on the RsmC activity—their activity was typically reduced only to 30–50% of the wt enzyme (Figure 4). Double mutants exhibited further reduction of activity, e.g. K86S/K88S to 16%. These results are very similar to those obtained in the course of mutagenesis of the rRNA:m6A methyltransferase ErmC’ (22), where it was also impossible to obtain a mutant that would be completely inactive in vitro even with multiple substitutions in the predicted RNA-binding site.
Figure 4.

In vitro MTase activity of the mutant RsmC variants, measured on the 30S RNA ribosome subunits isolated from the rsmC K.O. strain. The activity is shown as the percentage of the wild-type MTase activity. Double and single substitutions in the presumed RNA-binding site are indicated in gray, substitutions in the SAM-binding site are indicated in black, and the substitution in the active site is indicated in white.

In vitro MTase activity of the mutant n class="Chemical">RsmC variants, measured on the 30S RNA ribosome subunits isolated from the rsmC K.O. strain. The activity is shown as the percentage of the wild-type MTase activity. Double and single substitutions in the presumed RNA-binding site are indicated in gray, substitutions in the SAM-binding site are indicated in black, and the substitution in the active site is indicated in white. The interactions between n class="Chemical">RsmC (and its mutant variants) and the methyl group donor SAM were studied by the Isothermal Titration Calorimetry. The thermodynamics of binding is given in Table 2. The mutants D202A and D227A in the potential SAM-binding site in the CTD showed complete inability to bind the cofactor (Figure 5), while the N268A mutant in the predicted catalytic motif NPPF that coordinates interactions between SAM and the target guanosine showed almost 5-fold reduction in the SAM-binding affinity (Supplementary Figure 1 and Table 2). On the other hand, mutants in the predicted RNA-binding site in the NTD could still bind SAM with wild-type-like affinities (Table 2) indicating that their reduced activity is not due to the compromised cofactor binding.
Table 2.

ITC data for titration of RsmC variants with SAM

Protein Binding to SAMKa (105/M)−ΔH (kcal/mol)−ΔS (cal/deg.mol)−ΔG (kcal/mol)Number of binding sites
Native.091.524.9.15.94
R138A.781.846.4.03.99
K133A.96.92.19.07.97
K133E.912.087.1.91
K133S.74.03−3.42.03.2
R138S, R139S.89.6−5.06.09.11
K86A.16.391.6.97.04
R13S.741.685.8.05.95
R115S, R119S.890.41.3.09.93
K86S, K88S.941.334.5.08.99
N268A.351.669.09.03
D202ANB (no binding)
D227ANB (no binding)
NTDNB (no binding)
MBP-CTDNB (no binding)

Ka, binding affinity.

ΔH, ΔS, ΔG—change in enthalpy, entropy and Gibb's free energy, respectively.

Figure 5.

ITC spectra for RsmC wild type and mutants. Baseline subtracted raw ITC data for injections of SAM (ligand) is indicated in the upper panels of each of the ITC profiles shown (for the wild-type as well as the variants of RsmC). The peaks normalized to the ligand/protein molar ratio were integrated as is shown in the bottom panels. The solid dots indicate the experimental data and the best fit to the experimental data were obtained from a non-linear least squares method of fitting using a one-site binding model depicted by a solid line. (a) Wild type (b) D202A (c) D227A.

ITC spn class="Species">ectra for RsmC wild type and mutants. Baseline subtracted raw ITC data for injections of SAM (ligand) is indicated in the upper panels of each of the ITC profiles shown (for the wild-type as well as the variants of RsmC). The peaks normalized to the ligand/protein molar ratio were integrated as is shown in the bottom panels. The solid dots indicate the experimental data and the best fit to the experimental data were obtained from a non-linear least squares method of fitting using a one-site binding model depicted by a solid line. (a) Wild type (b) D202A (c) D227A. ITC data for titration of pan class="Chemical">RsmC variants with n>n class="Chemical">SAM Ka, binding affinity. ΔH, ΔS, ΔG—change in enthalpy, entropy and Gibb's free energy, resppan class="Species">ectively. ITC experiments on the NTD of n class="Chemical">RsmC with SAM indicated that this domain is not capable of binding SAM (Supplementary Figure 1). We failed to obtain a preparation of the isolated CTD (with MBP cleaved off) that would be suitable for ITC measurements. Interestingly, the entire MBP–CTD fusion protein that could be purified, was unable to bind SAM, which indicates one of the three possibilities (i) the CTD is misfolded (despite the presence of MBP) or (ii) some portion of MBP blocks the access to the SAM-binding site on a properly folded CTD or (iii) SAM binding by RsmC requires the presence of both NTD and CTD. Our crystal-structure-based docking model suggests that the NTD does not make direct contacts with SAM. Thus, based on our analysis of the calorimetric studies on the wild type RsmC and point mutants in the CTD that are incapable of SAM binding, we propose that the NTD has evolved to be an essential intramolecular chaperone of the CTD that promotes the formation of the SAM-binding site. The presented data allows us to conclude that the CTD of n class="Chemical">RsmC is involved in SAM binding and catalysis of the N2-guanosine methylation reaction, while the NTD is important for the folding of CTD and contains residues that are important (but not essential) for the RNA MTase activity, not by direct involvement in cofactor binding, but most likely by RNA binding. We were unable to measure the binding of RsmC and its variants to the ribosome; however the analysis of protein structure and sequence conservation strongly suggests that the NTD is the principal substrate-recognition and binding module of the RsmC. Thus, despite the homology between NTD and CTD they appear to perform completely different and complementary roles.

DISCUSSION

Domain duplication and functional spn class="Species">ecialization is a common evolutionary process. The duplication of a gene encoding a primitive multifunctional protein yields two independent proteins or one protein with two similar domains, which may experience relaxation of functional constraints and increased rate of mutations [review: (23)]. A number of primitive homooligomeric enzymes have been reported to possess heterooligomeric counterparts with specialized subunits, the best known example being probably the proteasome [review: (24)]. Among enzymes involved in RNA metabolism, the most frequent specialization in enzymes composed of two or more homologous domains concerns substrate-binding, catalysis or structural stability, accompanied by the degeneration of ancestral activities. Examples include heterodimeric tRNA deaminases (25) and heterotetrameric tRNA:m1A58 MTases (26,27). Similar mechanisms have been postulated for other MTases, including the protein-modifying enzyme PRMT7 comprising two domains in the single polypeptide (28) and eukaryotic DNA MTases Dnmt3a/Dnmt3b/Dnmt3L, where the ‘degenerated’ Dnmt3L is a regulatory subunit in the heterodimeric complex with Dnmt3a or Dnmt3b (29,30). However, thus far no structural information existed to analyze this phenomenon in detail. The structure of n class="Chemical">RsmC provides the first atomic-level picture of an RNA-modification enzyme as well as of an MTase which comprises two domains apparently derived from a common ancestor, which underwent differential functional specialization. According to ITC measurements, RsmC binds only one SAM molecule, and mutational analyses clearly demonstrate that conserved residues in the CTD are responsible for SAM binding. The direct involvement of the NTD in rRNA binding remains to be established, nonetheless mutational analyses of residues in the conserved charged patch on the NTD surface, predicted to be involved in RNA binding by the RNABindR and BindN methods, give strong support for this prediction. Substitutions of these residues significantly affected the MTase activity, while they had no effect on the SAM-binding ability of the enzyme. Despite the conservation of the structural ‘MTase-like’ scaffold, two RFM domains of RsmC exhibit complementary pattern of sequence loss or conservation in motifs implicated in substrate-binding (NTD) versus cofactor-binding and catalysis (CTD). Not surprisingly, the isolated domains are unable to carry out the methylation reaction. Moreover, even when the two isolated domains of RsmC are mixed together, they fail to form a catalytically active complex, suggesting that cooperation between the domains requires physical linkage or begins already at the stage of protein synthesis. Indeed, we found that the CTD requires a well-folded N-terminal partner to fold correctly. It is also possible that the peptide linker between the NTD and the CTD plays a role in coordinating binding and catalysis. This specialization of complementary functions and resulting mutual dependence of domains (concerning both protein stability and enzymatic activity) are likely to be common to other ‘pseudodimeric’ MTases with partially degenerated motifs, such as the protein-arginine MTase PRMT7 and in RNA modification enzymes composed of several homologous domains. Rn class="Species">ecently, Dontsova and coworkers characterized experimentally three E. coli rRNA:m2G MTases: RlmL that modifies G2445 in 23S rRNA (31), RlmG that modifies G1835 in 23S rRNA (18) and RsmD that modifies G66 in 16S rRNA (32). They have demonstrated that RsmD is encoded by the YhhF open reading frame (ORF), and that the YgjO ORF encodes not the RsmD enzyme as previously believed (14,33), but RlmG. They have also determined the structure of YhhF/RsmD, which revealed a single catalytic domain (32). Based on these findings, Dontsova and coworkers proposed a hypothesis that E. coli rRNA:m2G MTases can be divided into two categories based on the domain structure and substrate specificity: MTases composed of multiple domains would recognize protein-free ribosomal RNA in vitro and most probably, unfolded early assembly intermediates in vivo, while MTases comprising only the catalytic domain would recognize only late assembly intermediates resembling the completed 30S particle and not the free RNA (34). They predicted that RlmG and RlmL (whose structures remain unknown) are composed of multiple domains, and that RsmC closely resembles RsmD in that it is composed only of a single domain (32). On the other hand, our results clearly show that RsmC is composed of two domains and is closely related to RlmG (YgjO) rather than RsmD (YhhF). Besides, RsmD has been shown to require the presence of proteins S7 and S19 with the 16S rRNA to be recognized by the enzyme (35). Thus, it appears that the relationship between structure and substrate specificity in rRNA:m2G MTases is more complex and cannot be inferred simply from the number of domains in different proteins.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR online.
  45 in total

1.  In silico analysis of the tRNA:m1A58 methyltransferase family: homology-based fold prediction and identification of new members from Eubacteria and Archaea.

Authors:  J M Bujnicki
Journal:  FEBS Lett       Date:  2001-10-26       Impact factor: 4.124

Review 2.  AdoMet-dependent methylation, DNA methyltransferases and base flipping.

Authors:  X Cheng; R J Roberts
Journal:  Nucleic Acids Res       Date:  2001-09-15       Impact factor: 16.971

3.  Automated protein model building combined with iterative structure refinement.

Authors:  A Perrakis; R Morris; V S Lamzin
Journal:  Nat Struct Biol       Date:  1999-05

4.  A comprehensive view on proteasomal sequences: implications for the evolution of the proteasome.

Authors:  Christoph Gille; Andrean Goede; Cord Schlöetelburg; Robert Preissner; Peter Michael Kloetzel; Ulf B Göbel; Cornelius Frömmel
Journal:  J Mol Biol       Date:  2003-03-07       Impact factor: 5.469

5.  Alanine-scanning mutagenesis of the predicted rRNA-binding domain of ErmC' redefines the substrate-binding site and suggests a model for protein-RNA interactions.

Authors:  Gordana Maravić; Janusz M Bujnicki; Marcin Feder; Sándor Pongor; Mirna Flögel
Journal:  Nucleic Acids Res       Date:  2003-08-15       Impact factor: 16.971

6.  ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information.

Authors:  Fabian Glaser; Tal Pupko; Inbal Paz; Rachel E Bell; Dalit Bechor-Shental; Eric Martz; Nir Ben-Tal
Journal:  Bioinformatics       Date:  2003-01       Impact factor: 6.937

7.  Phylogenomic analysis of 16S rRNA:(guanine-N2) methyltransferases suggests new family members and reveals highly conserved motifs and a domain structure similar to other nucleic acid amino-methyltransferases.

Authors:  J M Bujnicki
Journal:  FASEB J       Date:  2000-11       Impact factor: 5.191

8.  An adenosine deaminase that generates inosine at the wobble position of tRNAs.

Authors:  A P Gerber; W Keller
Journal:  Science       Date:  1999-11-05       Impact factor: 47.728

9.  Automated MAD and MIR structure solution.

Authors:  T C Terwilliger; J Berendzen
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1999-04

10.  RNA:(guanine-N2) methyltransferases RsmC/RsmD and their homologs revisited--bioinformatic analysis and prediction of the active site based on the uncharacterized Mj0882 protein structure.

Authors:  Janusz M Bujnicki; Leszek Rychlewski
Journal:  BMC Bioinformatics       Date:  2002-04-03       Impact factor: 3.169

View more
  17 in total

1.  Actin-binding protein ABP140 is a methyltransferase for 3-methylcytidine at position 32 of tRNAs in Saccharomyces cerevisiae.

Authors:  Akiko Noma; Sanghyun Yi; Takayuki Katoh; Yoshimi Takai; Takeo Suzuki; Tsutomu Suzuki
Journal:  RNA       Date:  2011-04-25       Impact factor: 4.942

2.  Identification and modeling of a phosphatase-like domain in a tRNA 2'-O-ribosyl phosphate transferase Rit1p.

Authors:  Anna Czerwoniec; Janusz M Bujnicki
Journal:  Cell Cycle       Date:  2011-10-15       Impact factor: 4.534

3.  Crystallization and preliminary X-ray crystallographic analysis of putative tRNA-modification enzymes from Pyrococcus furiosus and Thermus thermophilus.

Authors:  Marcus Fislage; Martine Roovers; Stefan Münnich; Louis Droogmans; Wim Versées
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2011-10-27

4.  Properties of small rRNA methyltransferase RsmD: mutational and kinetic study.

Authors:  Olga V Sergeeva; Irina V Prokhorova; Yerdos Ordabaev; Philipp O Tsvetkov; Petr V Sergiev; Alexey A Bogdanov; Alexander A Makarov; Olga A Dontsova
Journal:  RNA       Date:  2012-04-25       Impact factor: 4.942

5.  Structural insights into the function of 23S rRNA methyltransferase RlmG (m²G1835) from Escherichia coli.

Authors:  Heng Zhang; Zeng-Qiang Gao; Yong Wei; Wen-Jia Wang; Guang-Feng Liu; Eleonora V Shtykova; Jian-Hua Xu; Yu-Hui Dong
Journal:  RNA       Date:  2012-07-02       Impact factor: 4.942

6.  Crystal structure of the Thermus thermophilus 16 S rRNA methyltransferase RsmC in complex with cofactor and substrate guanosine.

Authors:  Hasan Demirci; Steven T Gregory; Albert E Dahlberg; Gerwald Jogl
Journal:  J Biol Chem       Date:  2008-07-30       Impact factor: 5.157

7.  The open reading frame TTC1157 of Thermus thermophilus HB27 encodes the methyltransferase forming N²-methylguanosine at position 6 in tRNA.

Authors:  Martine Roovers; Yamina Oudjama; Marcus Fislage; Janusz M Bujnicki; Wim Versées; Louis Droogmans
Journal:  RNA       Date:  2012-02-15       Impact factor: 4.942

8.  Multiple Motif Scanning to identify methyltransferases from the yeast proteome.

Authors:  Tanya C Petrossian; Steven G Clarke
Journal:  Mol Cell Proteomics       Date:  2009-04-07       Impact factor: 5.911

9.  Critical residues for cofactor binding and catalytic activity in the aminoglycoside resistance methyltransferase Sgm.

Authors:  Miloje Savic; Tatjana Ilic-Tomic; Rachel Macmaster; Branka Vasiljevic; Graeme L Conn
Journal:  J Bacteriol       Date:  2008-06-27       Impact factor: 3.490

10.  Trm13p, the tRNA:Xm4 modification enzyme from Saccharomyces cerevisiae is a member of the Rossmann-fold MTase superfamily: prediction of structure and active site.

Authors:  Karolina L Tkaczuk
Journal:  J Mol Model       Date:  2009-08-22       Impact factor: 1.810

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.