Nancy F Ramia1, Li Tang, Alexis I Cocozaki, Hong Li. 1. Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306, USA and Department of Chemistry and Biochemistry, Florida State University, Tallahassee, FL 32306, USA.
Abstract
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) offer an adaptive immune system that protects bacteria and archaea from nucleic acid invaders through an RNA-mediated nucleic acid cleavage mechanism. Our knowledge of nucleic acid cleavage mechanisms is limited to three examples of widely different ribonucleoprotein particles that target either DNA or RNA. Staphylococcus epidermidis belongs to the Type III-A CRISPR system and has been shown to interfere with invading DNA in vivo. The Type III-A CRISPR system is characterized by the presence of Csm1, a member of Cas10 family of proteins, that has a permuted histidine-aspartate domain and a nucleotidyl cyclase-like domain, both of which contain sequence features characteristic of nucleases. In this work, we show in vitro that a recombinant S. epidermidis Csm1 cleaves single-stranded DNA and RNA exonucleolytically in the 3'-5' direction. We further showed that both cleavage activities are divalent-metal-dependent and reside in the GGDD motif of the cyclase-like domain. Our data suggest that Csm1 may work in the context of an effector complex to degrade invading DNA and participate in CRISPR RNA maturation.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) offer an adaptive immune system that protects bacteria and archaea from nucleic acid invaders through an RNA-mediated nucleic acid cleavage mechanism. Our knowledge of nucleic acid cleavage mechanisms is limited to three examples of widely different ribonucleoprotein particles that target either DNA or RNA. Staphylococcus epidermidis belongs to the Type III-A CRISPR system and has been shown to interfere with invading DNA in vivo. The Type III-A CRISPR system is characterized by the presence of Csm1, a member of Cas10 family of proteins, that has a permuted histidine-aspartate domain and a nucleotidyl cyclase-like domain, both of which contain sequence features characteristic of nucleases. In this work, we show in vitro that a recombinant S. epidermidis Csm1 cleaves single-stranded DNA and RNA exonucleolytically in the 3'-5' direction. We further showed that both cleavage activities are divalent-metal-dependent and reside in the GGDD motif of the cyclase-like domain. Our data suggest that Csm1 may work in the context of an effector complex to degrade invading DNA and participate in CRISPR RNA maturation.
Bacteria and archaea are greatly outnumbered by bacteriophages and other mobile genetic elements such as plasmids (1–5). Accordingly, these organisms have developed different strategies to protect themselves against the infections (6). Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) represent a recently identified prokaryotic defense system that resembles RNA interference in eukaryotes (7). The CRISPR-based defense systems use small RNA, termed CRISPR (cr)RNA, to guide the destruction of foreign RNA or DNA (7–11). A CRISPR locus is typically composed of a leader sequence, several CRISPR-associated (cas) genes and an array of repeat sequences of 30–50 nucleotides separated by distinct spacers of similar sizes. The sequences of some spacers were found to match those from invading phages or plasmids (12–15), which offered the initial evidence for the RNA interference defense mechanism.There are three functional stages of the CRISPR-based immunity. These include integration of new DNA fragments from the invader, transcription and processing of crRNA and degradation of the invading DNA. The cas gene-encoded Cas proteins play critical roles in each of these functional stages. Cas proteins, other than Cas1 and Cas2 that are predicted to function in integration of new spacers, vary largely in sequence and motifs (16–18), which makes it difficult to predict and elucidate their functional roles. The CRISPR–Cas systems are classified into three major types (I, II and III) and their respective subtypes based on the presence and phylogeny of signature cas genes encoding crRNA processing and effector complex proteins (18). Both Type I and III systems use a single endoribonuclease to generate functional crRNA that comprise an 8-nt 5′ tag derived from the 3′ region of the repeat and a spacer sequence, and in some cases, an additional 3′ tail derived from the 5′ region of the following repeat (9,19–22). Each mature crRNA is loaded onto a multi-subunit effector complex made of Cas proteins that then cleaves RNA or DNA complementary to the crRNA (9,11,20,23–25). Different from Type I and III systems, Type II systems require trans-activating CRISPR RNA (tracrRNA) along with the host-encoded RNase III to generate crRNA and a single signature protein bound with tracrRNA and crRNA to degrade foreign DNA (26–28). The effector complexes studied so far differ dramatically in composition and the mode of nucleic acid cleavage. Examples of both Types I and II effector complexes have been shown to target invading DNA, although by very different mechanisms (24,27–29). In contrast, examples of the Type III effector complexes have been shown to target either RNA or DNA (10,11,25).Type I effector complexes are characterized by the presence of Cas3, a protein possessing the helicase (DExD/H) domain, and a ribonucleoprotein complex containing the Cas8 superfamily of proteins (17,18). While the Cas8 family of proteins is believed to play an important role in the effector complex assembly (30,31), Cas3 is the key player in DNA degradation (24). Cas3 often has a fused nuclease histidine–aspartate (HD) domain, but in some organisms, works with a separately encoded HD protein (7,16,18,32). Multiple reports have shown a metal-dependent nuclease activity in the HD domain of Cas3 or the isolated HD protein (33–35). The best studied Type I effector complex is the Cascade (for CRISPR-associated complex for antiviral defense) of Escherichia coli. Cascade consists of five Cas proteins and a crRNA and binds invading DNA complementary to the crRNA. Structural and biochemical studies have provided insights on the functional mechanism of the Cascade complex (9,24,36,37). Following base-pairing of crRNA with the complementary DNA strand, the non-complementary DNA strand is displaced, leading to formation of an R-loop (24,36,37). The R-loop formation recruits Cas3 (23,24) that uses its nuclease domain to nick the target DNA and its helicase domain to unwind DNA for further degradation of the negative strand (33,35,38).Type II effector complexes are characterized by the presence of a multi-domain protein Cas9 that possesses an N-terminal RuvC and an HNH nuclease domain located in the middle of the protein (7). Cas9 bound with crRNA and tracrRNA is sufficient for silencing of the invading DNA (27,28). Cas9 produces double-strand breaks by cleaving the DNA strand complementary to crRNA via its HNH domain, and the non-complementary strand via its Ruv-C-like domain (28), possibly also through the formation of an R-loop structure (27,28,39). Type II systems comprise the simplest CRISPR effector complexes with multiple potential applications for genetic engineering and DNA modeling (28,40,41).Type III effector complexes contain a protein of the Cas10 superfamily and a number of repeat-associated mysterious proteins (16–18). Cas10 also possesses the HD nuclease domain, although with a permuted arrangement of catalytic residues (7), and a nucleotidyl cyclase-like domain (17,42,43). Subtype III-B effector complex, called the Cmr complex, targets invading RNA for destruction (11,25). Active Cmr complex can be reconstituted in vitro from its protein subunits (Cmr1–Cmr7) and a crRNA (11,25,43,44). However, exhaustive search for the catalytic subunit has not yet led to a positive identification. Despite prominent features similar to those found in nucleases, neither the permuted HD domain nor the cyclase-like domain of the characteristic protein, Cas10 (Cmr2), is found to be directly involved in cleaving the crRNA-bound target RNA (43).Interestingly, another studied Type III system belonging to subtype III-A appears to target invading DNA (10,45). The CRISPR system of the clinically isolated Staphylococcus epidermidis strain RP62A contains Type III-A proteins Csm1–Csm6 (Figure 1) where the Cas10 protein, Csm1, has a similar domain arrangement as that in Cmr2. In S. epidermidis RP62A, a DNA plasmid containing a CRISPR target interrupted by a self-splicing intron sequence was found to be evaded by its CRISPR system, suggesting that the mRNA was not targeted (10). Like Cmr2, Csm1 contains both the permuted HD and the nucleotidyl cyclase-like domains (7,17) and has been hypothesized to be responsible for cleaving DNA (18). However, no biochemical characterization of this activity is available, and the mechanism of interference for the Type III-A is still poorly understood.
Figure 1.
Schematic organization of the CRISPR locus of Staphylococcus epidermidis RP62a and of Csm1 protein domains. (A) Repeats are represented as green boxes and spacers as colored blocks. The leader region is shown as a black box and the CRISPR-associated (cas) genes are shown as block arrows with their locus names according to the comprehensive microbial resource labeled on top and common names on the bottom. (B) The graphical representation of the Csm1 protein domains. The histidine–aspartate domain (HD) is shown in light blue, Zn fingers domain is in pink and the nucleotidyl cyclase-like domain is shown in orange.
Schematic organization of the CRISPR locus of Staphylococcus epidermidis RP62a and of Csm1 protein domains. (A) Repeats are represented as green boxes and spacers as colored blocks. The leader region is shown as a black box and the CRISPR-associated (cas) genes are shown as block arrows with their locus names according to the comprehensive microbial resource labeled on top and common names on the bottom. (B) The graphical representation of the Csm1 protein domains. The histidine–aspartate domain (HD) is shown in light blue, Zn fingers domain is in pink and the nucleotidyl cyclase-like domain is shown in orange.In this work, we report characterization of a nuclease activity of S. epidermidis Csm1 (SeCsm1). Our results show that SeCsm1 cleaves single-stranded (ss)DNA and RNA in a divalent metal-dependent fashion, and exhibits a 3′–5′ exonuclease activity. Mutational analysis suggests that the catalytic center resides in the cyclase-like rather than the HD domain, although the HD domain also plays a functional role in DNA cleavage by SeCsm1. Our results provide supporting evidence that the Type III-A effector is a DNA-targeting complex.
MATERIALS AND METHODS
Cloning and expression of Serp2461
Csm1 (SERP2461) was amplified from genomic DNA and cloned into two Gateway vectors (Invitrogen) to create pDest-SERP2461 and Pvp16-SERP2461 that express either a polyhistidine-tagged (His-SeCsm1) or a Maltose Binding Protein-fused protein (MBP-SeCsm1). The MBP-SeCsm1 contains a TEV cleavage site that allows removal of the MBP fusion tag (Figure 2A). To express either Csm1 protein, 2–6 l of Luria–Bertani medium was inoculated with a saturated overnight culture of E. coli cells (Rosetta DE3) transformed with the plasmid and grown at 37°C. Protein expression was induced by addition of 0.5 mM of isopropyl-β-thiogalactoside at OD 0.6–0.8. After growth for additional 16 h at 18°C, the cells were harvested by centrifugation.
Figure 2.
Purification of SeCsm1. (A) Schematic MBP-SeCsm1 chimera protein. (B) Sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis of purified wild-type MBP-SeCsm1 (WT), histidine–aspartate (HD) domain mutant (H14, D15 to asparagine, HDmut) and GGDD mutant (D586, D587 to asparagine, DDmut). (C) Silver-stained sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis of MBP removal from MBP-SeCsm1. The MBP tag was cleaved using TEV protease and the cleavage mixture was separated on a gel filtration column. Fraction numbers refer to those of the gel filtration elution. Fractions 17–19 were used for activity assays. The star denotes the tag-free SeCsm1 as confirmed by mass spectrometry analysis.
Purification of SeCsm1. (A) Schematic MBP-SeCsm1 chimera protein. (B) Sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis of purified wild-type MBP-SeCsm1 (WT), histidine–aspartate (HD) domain mutant (H14, D15 to asparagine, HDmut) and GGDD mutant (D586, D587 to asparagine, DDmut). (C) Silver-stained sodium dodecyl sulfate-polyacrylamide gel electrophoresis analysis of MBP removal from MBP-SeCsm1. The MBP tag was cleaved using TEV protease and the cleavage mixture was separated on a gel filtration column. Fraction numbers refer to those of the gel filtration elution. Fractions 17–19 were used for activity assays. The star denotes the tag-free SeCsm1 as confirmed by mass spectrometry analysis.
Purification of MBP-SeCsm1
While the pDest-SERP2461 clone produced only insoluble His-SeCsm1 protein, Pvp16-SERP2461 resulted in soluble MBP-SeCsm1. To purify the MBP-SeCsm1 protein, cells were lysed using a homogenizer in ice-cold buffer containing 25 mM sodium phosphate pH 7.4, 400 mM NaCl, 10% glycerol and 5 mM of dithiothreitol, and the protease inhibitor 0.1% v/v phenylmethylsulfonyl fluoride. The cell lysate was clarified by centrifugation at 22 000 rpm for 30 min at 4°C and the supernatant was loaded onto an amylose column pre-equilibrated with the lysis buffer. The column was then washed with the lysis buffer plus 1 mM maltose before elution with the lysis buffer containing additional 50 mM of maltose. The eluted protein was further purified by monoQ ion exchange and size-exclusion chromatography on a S200 16/60 Superdex column. The final protein was concentrated to 5 mg/ml and stored in a buffer containing 20 mM Tris pH 7.5, 400 mM NaCl, 10% glycerol and 5 mM dithiothreitol (Figure 2B)To remove the MBP fusion tag, the MBP-SeCsm1 sample was incubated with TEV protease overnight at 4°C. The reaction products were loaded on an analytical superose6 column (GE Healthcare) to separate the cleaved SeCsm1 from MBP, TEV and the uncleaved protein. Despite these efforts, homogenous SeCsm1 without MBP tag could not be completely separated from the uncleaved protein (Figure 2C). However, we confirmed the presence of the cleaved SeCsm1 protein by mass spectrometry analysis.To create SeCsm1 mutants, the Pvp16-SERP2461 plasmid was mutated using the QuikChange kit (Stratagene), and the mutant MBP-SeCsm1 and SeCsm1 proteins were purified in a way similar to that of the wild-type protein.
DNA and RNA cleavage assays
Single-stranded DNA and RNA substrates were purchased from Integrated DNA Technology (see Table 1 for sequence and length information). The DNA and RNA strands were 5′-radioactively-labeled with 32P using the T4 polynucleotide kinase (Invitrogen) following the procedure of the manufacturer. The 3′-end-labeling of RNA substrates was performed using T4 RNA ligase (New England Biolabs) and cytidine-3′,5′-bis-phosphate (Perkin Elmer). To generate double-stranded (ds) substrates used in the nuclease assays, the radioactively labeled DNA or RNA was incubated with the unlabeled complementary strand in a hybridization buffer containing 20 mM Tris pH 7.5 and 50 mM KCl. The mixture was incubated at 95°C for 15 min and then left to stand at room temperature. To form a bulge DNA substrate, DNA10 is annealed to DNA11, which creates a DNA duplex with 44 base mismatches in the middle, mimicking an R-loop formation.
Table 1.
DNA and RNA oligos used for enzyme activity assays
DNA and RNA oligos used for enzyme activity assaysTo perform DNA or RNA cleavage, radioactively labeled DNA and RNA substrates were incubated for 1 h at 24°C in the presence or absence of the protein in a cleavage buffer containing 20 mM HEPES pH 7.0, 100 mM KCl, 2 mM MgCl2 and (in the case of RNA substrates) 1 U of SUPERase-In Ribonuclease inhibitor (Applied Biosystems). The reaction was stopped by addition of the 2× formamide loading buffer. The mixture was then separated on a 15% denaturing polyacrylamide gel electrophoresis gel and the result was visualized using a phosphor screen (GE Healthcare).
RESULTS
Csm1 cleaves ssDNA and ssRNA
Csm1 nuclease activity was tested by adding purified MBP-SeCsm1 or a sample containing tag-free SeCsm1 protein to different substrates including ss- and dsDNA of different lengths, DNA–RNA hybrid, ssRNA and circular ssDNA (phage M13). No difference in cleavage activity was observed between MBP-SeCsm1 and SeCsm1. Thus, SeCsm1 is used for all subsequent discussions unless otherwise noted. In a cleavage buffer containing 20 mM HEPES (pH 7.0), 100 mM KCl, 2 mM MgCl2 and 1 U of SUPERase-In Ribonuclease inhibitor, SeCsm1 has a clear cleavage activity on ssDNA (Figure 3A) and ssRNA (Figure 3B). SeCsm1 cleaved ssDNA and ssRNA irrespective of the sequence and length.
Figure 3.
SeCsm1 cleaves single-stranded (ss)DNA and RNA. The sequences and length of the DNA and RNA substrates used are shown in Table 1. Lane 1 (labeled “Marker”) is an RNA marker in both panels. (A) The ssDNA cleavage activity of SeCsm1 and its metal dependence. SeCsm1 cleaves ssDNA in the presence of Mg2+ and Mn2+ but not other divalent cations tested. The metal-dependent cleavage activity is inhibited by the addition of ethylenediaminetetraacetic acid (EDTA). The concentration of the enzyme used in all reactions is 1 µM, that of each divalent metal is 2 mM and that of EDTA is 10 mM. (B) The ssRNA cleavage activity of SeCsm1 and its metal dependence. Cleavage of ssRNA by SeCsm1 also depends on Mg2+ and Mn2+ and is inhibited by EDTA. Same concentrations are used as in (A).
SeCsm1 cleaves single-stranded (ss)DNA and RNA. The sequences and length of the DNA and RNA substrates used are shown in Table 1. Lane 1 (labeled “Marker”) is an RNA marker in both panels. (A) The ssDNA cleavage activity of SeCsm1 and its metal dependence. SeCsm1 cleaves ssDNA in the presence of Mg2+ and Mn2+ but not other divalent cations tested. The metal-dependent cleavage activity is inhibited by the addition of ethylenediaminetetraacetic acid (EDTA). The concentration of the enzyme used in all reactions is 1 µM, that of each divalent metalis 2 mM and that of EDTA is 10 mM. (B) The ssRNA cleavage activity of SeCsm1 and its metal dependence. Cleavage of ssRNA by SeCsm1 also depends on Mg2+ and Mn2+ and is inhibited by EDTA. Same concentrations are used as in (A).
SeCsm1 requires divalent metals in cleaving nucleic acids
To identify divalent metals required for the cleavage activity of SeCsm1, we tested a panel of divalent metal ions that are known to be required by metal-dependent nucleases. Among the metals tested, only Mg2+ and Mn2+, but not Ni2+, Cu2+, Zn2+, Co2+,Ca2+ or Fe2+, supported the activity of SeCsm1. Furthermore, SeCsm1 appeared to be more active with Mg2+ than with Mn2+. As expected with metal-dependent nucleases, the cleavage reaction of SeCsm1 was completely inhibited by the metal-chelating agent ethylenediaminetetraacetic acid (Figure 3A and B).
SeCsm1 possesses a 3′-5′ exonuclease activity
Based on the observed pattern of cleavage with SeCsm1, we hypothesized that SeCsm1 is an exonuclease. To test our hypothesis, we generated dsDNA substrates containing an overhang of the radioactively labeled DNA on either the 5′- or the 3′-end (Figure 4). Cleavage with SeCsm1 of two different DNAs showed that the substrate with its 3′-end protected was not cleaved, whereas that with its 5′-end protected generated a major product corresponding to the fragment without its unprotected 3′ region (Figure 4A and B). The minor and shorter cleavage products observed for both DNA substrates (<10 nucleotides) likely were due to cleavage of unannealed DNA after continued incubation. This result shows that SeCsm1 is a DNA exonuclease in the 3′–5′ direction.
Figure 4.
SeCsm1 is a 3′–5′ exonuclease. DNA strands are annealed to form duplexes with either a 3′ (DNA3:DNA5, DNA7:DNA8) or 5′ overhang (DNA3:DNA6, DNA7:DNA9). The duplexes with a 3′ overhang, but not 5′ overhang, resulted in a major cleavage product consistent with the base-paired region (denoted with a star). Both types of duplexes exhibited shorter cleavage products that are believed to be due to cleavage of some unannealed ssDNA. The sequences and length of the DNA substrates used are shown in Table 1. RNA substrates have been labeled on either 5′- or 3′-ends, and tested for ssRNA cleavage in a time-dependent manner, or annealed to form duplexes (A) Cleavage of duplexes formed by DNA3, DNA5 and DNA6 (Table 1). (B) Cleavage of duplexes formed by DNA7, DNA8 and DNA9. (C) Time-course study of the cleavage of SeCsm1 on RNA1 labeled on the 5′- or 3′-end. (D) Cleavage results of SeCsm1 on RNA strands with either 3′ or 5′ overhangs and a bulged structure. The RNA2 strand has been labeled on either the 3′- or 5′-end. RNA oligos used to form these substrates are described in Table 1. The first lane is a 5′-labeled RNA marker.
SeCsm1 is a 3′–5′ exonuclease. DNA strands are annealed to form duplexes with either a 3′ (DNA3:DNA5, DNA7:DNA8) or 5′ overhang (DNA3:DNA6, DNA7:DNA9). The duplexes with a 3′ overhang, but not 5′ overhang, resulted in a major cleavage product consistent with the base-paired region (denoted with a star). Both types of duplexes exhibited shorter cleavage products that are believed to be due to cleavage of some unannealed ssDNA. The sequences and length of the DNA substrates used are shown in Table 1. RNA substrates have been labeled on either 5′- or 3′-ends, and tested for ssRNA cleavage in a time-dependent manner, or annealed to form duplexes (A) Cleavage of duplexes formed by DNA3, DNA5 and DNA6 (Table 1). (B) Cleavage of duplexes formed by DNA7, DNA8 and DNA9. (C) Time-course study of the cleavage of SeCsm1 on RNA1 labeled on the 5′- or 3′-end. (D) Cleavage results of SeCsm1 on RNA strands with either 3′ or 5′ overhangs and a bulged structure. The RNA2 strand has been labeled on either the 3′- or 5′-end. RNA oligos used to form these substrates are described in Table 1. The first lane is a 5′-labeled RNA marker.Cleavage of RNA by SeCsm1 also revealed the 3′–5′ exonuclease activity (Figure 4C). Short cleavage products were observed for the 5′- but not 3′-labeled substrate, although the 3′-labeled substrate was consumed by the enzyme in a time-dependent manner (Figure 4C). To further characterize the cleavage activity of SeCsm1 on RNA substrates, dsRNA substrates with 5′ or 3′ single-stranded overhangs, as well as bulged RNA duplexes labeled on either the 5′- or the 3′-end, were incubated with SeCsm1. The patterns of activity observed were more complex than that with DNA substrates and could not be simply explained by the 3′–5′ exonuclease activity. All three types of dsRNA showed cleavage products that were consistent with the 3′–5′ exonuclease and possibly an unknown endonuclease activity. The patterns, however, may also be explained by alternative secondary structures within the labeled strand (RNA2) that resulted in free 3′-end in all three cases. Whether SeCsm1 contains an endonuclease activity in addition to the 3′–5′ exonuclease activity requires a more extensive study with additional RNA substrates.Consistent with the 3′–5′ exonuclease single-stranded activity on DNA, SeCsm1 showed no activity on a dsDNA or a bulged DNA structure generated by annealing two strands of 100 and 110 bases containing a central mismatched region of 44 bases (Figure 5A and B). Furthermore, SeCsm1 did not cleave the circular ssDNA of the M13 phage or a double-stranded plasmid DNA (Supplementary materials, Supplementary Figure S1).
Figure 5.
SeCsm1 does not have a DNA endonuclease cleavage activity. Sequences of the DNA substrates used are shown in Table 1. The first lane is a 5′-labeled RNA marker. (A) SeCsm1 cleaves single-stranded DNA1 and DNA3 (lanes 3 and 5) but not the DNA1:DNA3 duplex (lanes 7 and 9). (B). SeCsm1 does not cleave the bulged duplex DNA mimicking an artificial R-loop. The DNA bulge is formed by annealing DNA 10 and DNA11, where either DNA10 or DNA11 is radiolabeled.
SeCsm1 does not have a DNA endonuclease cleavage activity. Sequences of the DNA substrates used are shown in Table 1. The first lane is a 5′-labeled RNA marker. (A) SeCsm1 cleaves single-stranded DNA1 and DNA3 (lanes 3 and 5) but not the DNA1:DNA3 duplex (lanes 7 and 9). (B). SeCsm1 does not cleave the bulged duplex DNA mimicking an artificial R-loop. The DNA bulge is formed by annealing DNA 10 and DNA11, where either DNA10 or DNA11 is radiolabeled.
The GGDD motif of the cyclase-like domain contains the active site
To locate the active site of SeCsm1 responsible for the DNA cleavage activity, we performed site-directed mutagenesis of residues potentially involved in catalysis within both the HD and the cyclase-like domains (Figure 6A). We mutated the signature residues, His14 and Asp15, of the HD domain to asparagine to create HDmut. These two residues likely participate in the coordination of metal ions as found in HD proteins (34,35). We also mutated the residues, Asp586 and Asp587, of the cyclase-like domain to asparagine to create DDmut. The corresponding aspartates are part of the GGDD motif in the homologous protein Cmr2 that binds two Ca2+ ions (43) (full-sequence alignment of SeCsm1 and Cmr2 present in Supplementary Figure S2). A similar motif in nucleotidyl cyclases containing GGDEF forms the catalytic center for the nucleotide cyclization reaction (46). The GGDD motif in Cmr2 is thus a degenerate GGDEF motif and has no known cyclization activity (43). All four mutated residues of SeCsm1, His14, Asp15, Asp586 and Asp587 are highly conserved in more than 100 Csm1 sequences examined. Based on the similar gel filtration elution patterns, we believe that these mutations did not disrupt folding of SeCsm1 (data not shown).
Figure 6.
The GGDD motif of the cyclase-like domain of SeCsm1 is involved in the nuclease activity. (A) Sequence features and representative structure of a known HD domain from Thermus thermophilus Cas3 (TTHB187) (left) and those of the cyclase-like domain from Pyrocuccus furiosus Cmr2 (Pf1129) (right). The bound metal ions in both structures are represented by orange spheres and residues coordinated with the metals are labeled. Note that the HD domain of SeCsm1 has a permuted catalytic residue arrangement as compared with that of Cas3, whereas its GGDD motif shares the same sequence pattern as the GGDD motif of Cmr2. The numbers in the sequence alignments represent the numbers of residues not shown. (B) Comparison of wild-type SeCsm1 (WT) and its mutants (DDmut and HDmut) in ssDNA cleavage activity. (C) Comparison of the wild-type SeCsm1 (WT) and its mutants (DDmut and HDmut) in ssRNA cleavage activity.
The GGDD motif of the cyclase-like domain of SeCsm1 is involved in the nuclease activity. (A) Sequence features and representative structure of a known HD domain from Thermus thermophilus Cas3 (TTHB187) (left) and those of the cyclase-like domain from Pyrocuccus furiosus Cmr2 (Pf1129) (right). The bound metal ions in both structures are represented by orange spheres and residues coordinated with the metals are labeled. Note that the HD domain of SeCsm1 has a permuted catalytic residue arrangement as compared with that of Cas3, whereas its GGDD motif shares the same sequence pattern as the GGDD motif of Cmr2. The numbers in the sequence alignments represent the numbers of residues not shown. (B) Comparison of wild-type SeCsm1 (WT) and its mutants (DDmut and HDmut) in ssDNA cleavage activity. (C) Comparison of the wild-type SeCsm1 (WT) and its mutants (DDmut and HDmut) in ssRNA cleavage activity.DNA cleavage assay with the wild-type and the mutants showed that while HDmut had a reduced activity, DDmut completely abolished ssDNA cleavage activity (Figure 6B). When tested on ssRNA substrate, HDmut generated a similar cleavage pattern as the wild-type SeCsm1, while DDmut resulted in less number of products (Figure 6C), suggesting that the GGDD motif is also an RNA cleavage center. The common product observed in both HDmut and DDmut cleavage is likely due to an alternative cleavage site. These results suggest that although the GGDD motif is directly responsible for the 3′–5′ exonuclease activity of SeCsm1 on ssDNA and ssRNA, the HD domain likely takes part in this process.
DISCUSSION
Staphylococcus epidermidis strain RP62a is one of the first organisms where interference with horizontal gene transfer was described and its CRISPR system was found to target DNA (10). Yet the molecular mechanism of how this organism destroys invading DNA remains uncharacterized. SeCsm1 is a hallmark protein of this organism that is believed to be involved in DNA interference (16,17) and crRNA processing (21). SeCsm1 is classified as a member of the Cas10 family of proteins that are characterized by a permuted HD domain and a nucleotidyl cyclase-like domain adopting a distinct RNA Recognition Motif fold (47). Although both domains likely have divalent metal-binding sites, their roles in nucleic acid cleavage remain undefined.We showed that SeCsm1 cleaves ssDNA and ssRNA. This result is in agreement with the previously predicted nuclease activity in SeCsm1 (7,17,18), and with the finding that SeCsm1 may be involved in the maturation of crRNA (21). More recently, Csm1 is found to form a complex termed Cas10/Csm with Csm2–5 and the crRNA comparable with Cascade and the Type III-B Cmr complexes (48). This complex is thought to mediate immunity against invading DNA and crRNA maturation in S. epidermidis. The possible nuclease(s), however, involved in silencing DNA and processing crRNA remain unknown. The CRISPR system of S. epidermidis is a ‘minimal CRISPR system’ and the nuclease activity of Csm1 could be used for generation of mature crRNA as well as silencing the invading DNA. The 3′–5′ DNA exonuclease activity of SeCsm1 may be used in the context of the intact effector complex to cleave the invading DNA after it has been nicked by another endonuclease.The discovery that the GGDD motif within the cyclase-like domain rather than the signature HD domain of SeCsm1 exhibits a nuclease activity is surprising to us. First, the GGDD motif of the Type III-B Cas10 protein, Cmr2, was found to have no endoribonuclease activity (43). Second, multiple HD proteins or HD domains have been found to possess ssDNA cleavage activity (33–35). The fact that SeCsm1 GGDD is mainly responsible for the observed DNA cleavage suggests that this motif may have an intrinsic nucleic acid cleavage activity and that this activity may depend on the context of the entire protein. Indeed, despite both being members of the Cas10 family, Csm1 and Cmr2 share a low sequence homology, suggesting differences in other regions of the proteins. To try to explain why the HD domain of SeCsm1 is not critical to its nuclease activity, we note that the HD domain of Csm1 has a permuted arrangement of HD … H … .D (Figure 6A), whereas a typical HD domain has the arrangement of H … .HD … .D (49).The nuclease activity of SeCsm1 observed is metal-ion-dependent and the enzyme is active in the presence of Mg2+ and Mn2+ but not Cu2+, Zn2+ or Ni2+. The similar dependence on divalent metals has been observed in other CRISPR nucleases that include the HD proteins (33–35), Cas1 (50), Cas2 (51,52) and Cas5d (31). We note that none of these previously known CRISPR nuclease proteins include the GGDD motif. Thus our discovery adds another important protein motif capable of cleaving nucleic acids.Although our results show that SeCsm1 cleaves DNA and RNA non-specifically, it is plausible that the required specificity for DNA and RNA is conferred by other Csm proteins and the bound crRNA of the Cas10/Csm complex. During DNA silencing, one or more Csm subunits (Csm2–Csm5) may be required to produce the first cut in the DNA, and the exonuclease activity of Csm1 further digests the DNA with free 3′-end. Alternatively, the Csm complex might have a surveillance function similar to Cascade, and when it encounters an invading DNA complementary to the crRNA, another protein is recruited to perform the actual degradation process. Further studies of the Csm proteins are required to distinguish these models and to uncover the mechanism of DNA silencing in S. epidermidis.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Funding for open access charge: National Institutes of Health R01 [GM099604 to H.L.].Conflict of interest statement. None declared.
Authors: John van der Oost; Matthijs M Jore; Edze R Westra; Magnus Lundgren; Stan J J Brouns Journal: Trends Biochem Sci Date: 2009-07-29 Impact factor: 13.807
Authors: Rita Przybilski; Corinna Richter; Tamzin Gristwood; James S Clulow; Reuben B Vercoe; Peter C Fineran Journal: RNA Biol Date: 2011-05-01 Impact factor: 4.652
Authors: Ki Hyun Nam; Charles Haitjema; Xueqi Liu; Fran Ding; Hongwei Wang; Matthew P DeLisa; Ailong Ke Journal: Structure Date: 2012-07-26 Impact factor: 5.006
Authors: Raymond H J Staals; Yifan Zhu; David W Taylor; Jack E Kornfeld; Kundan Sharma; Arjan Barendregt; Jasper J Koehorst; Marnix Vlot; Nirajan Neupane; Koen Varossieau; Keiko Sakamoto; Takehiro Suzuki; Naoshi Dohmae; Shigeyuki Yokoyama; Peter J Schaap; Henning Urlaub; Albert J R Heck; Eva Nogales; Jennifer A Doudna; Akeo Shinkai; John van der Oost Journal: Mol Cell Date: 2014-11-06 Impact factor: 17.970