| Literature DB >> 28180279 |
Laurie P Cooper1, Gareth A Roberts1, John H White1, Yvette A Luyten2, Edward K M Bower1, Richard D Morgan2, Richard J Roberts2, Jodi A Lindsay3, David T F Dryden4.
Abstract
Staphylococcus aureus displays a clonal population structure in which horizontal gene transfer between different lineages is extremely rare. This is due, in part, to the presence of a Type I DNA restriction-modification (RM) system given the generic name of Sau1, which maintains different patterns of methylation on specific target sequences on the genomes of different lineages. We have determined the target sequences recognized by the Sau1 Type I RM systems present in a wide range of the most prevalent S. aureus lineages and assigned the sequences recognized to particular target recognition domains within the RM enzymes. We used a range of biochemical assays on purified enzymes and single molecule real-time sequencing on genomic DNA to determine these target sequences and their patterns of methylation. Knowledge of the main target sequences for Sau1 will facilitate the synthesis of new vectors for transformation of the most prevalent lineages of this 'untransformable' bacterium.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28180279 PMCID: PMC5399793 DOI: 10.1093/nar/gkx067
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Staphylococcus aureus genomes showing the genes and the typical organization of target recognition domains (TRDs) in the HsdS DNA sequence specificity subunit. (A) Strain ED133 (CC133) has two hsdS; strain 11819–97 (CC80) has three hsdS (CC80) and strain JKD6159 (CC93) contains an extra Type I RM system from a different Type I RM family. From top to bottom: ED133, 11819–97, JKD6159. hsdR (red), hsdM (blue), hsdS (yellow). (B) The structural organization of the HsdS specificity subunit. The conserved regions (cr) are common to all S subunits within a family. The two TRDs (TRD1 and TRD2) define the target sequences recognized by the RM enzyme and can be swapped between S subunits of the same family to generate new specificities.
TRD targets shown from 5΄ to 3΄
|
|
Figure 2.Amino acid sequence and secondary structure alignment of two pairs of TRDs recognizing the same DNA target. The TRD sequences are highlighted in yellow. Consensus secondary structure shows ‘h’ for α helix and ‘e’ for β sheet. (A) TRDs B and P are examples of a first and a second TRD respectively recognizing 5΄-AGG-3΄. (B) TRDs U and c* are both examples of second TRDs with the same specificity, 5΄-GAY-3΄. The long predicted α helices at the start and the end of the sequences are the conserved helical spacer regions in the HsdS subunits while the sequence between these helices makes up the TRD.
The Sau1 RM systems with published recognition sequences
| Strain name and genome reference | Clonal Complex or Sequence Type | S subunit name in REBASE | Recognition sequence | TRDs assigned | Suggested generic name | Experimental method | Reference for target specificity and method |
|---|---|---|---|---|---|---|---|
| MW2 ( | CC1 | S.SauMW2I | CC | AF | CC1-1 | g, s, a | g ( |
| S.SauMW2II | CC | AG | CC1-2 (CC8-2) | g, s, a | a ( | ||
| N315 ( | CC5 | S.SauN315II | CC | AH | CC5-2 | g, s, a | s (CC8-1 and CC8-2 in strain NRS384 are from ref. |
| S.SauN315I |
| BD | CC5-1 (CC8-1) | g, s, a | |||
| MRSA252 ( | CC30 | S.SauMRSII | GW | CD | CC30-1 | g, s | s ( |
| S.SauMRSI | GG | JK | CC30-2 | s | s ( | ||
| JKD6159 ( | CC93 | S.SauJKDIII | GA | Not a | CC93-3 | s | s ( |
| S.SauJKDII | GGH | b*K | CC93-2 | s | Note the ambiguity in assigning CC93-1 and CC93-3 is clarified with strains ED133 and 32320 and from Table | ||
| S.SauJKDI | C | Ma* | CC93-1 | s | |||
| ED133 ( | CC133 | S.Sau133ORF451P | C | ME | CC133-1 | g | g ( |
| S.Sau133ORF1794P | GG | Jd* | CC133-2 | s | s (this work) | ||
| 32320 ( | CC133 | S.Sau32320ORFAP | C | ME | CC133-1 | g | g ( |
| S0385 ( | CC398 | S.SauSTORF499P |
| NE | CC398-1 | g, s | g ( |
Target sites are shown from 5΄ to 3΄ with the length of the non-specific spacer shown as a number. Underlined A or T indicates the site of adenine methylation on the top or bottom strands respectively. The experimental methods used are indicated as g = target obtained by DNA cleavage with a purified enzyme, s = target obtained by SMRT sequencing of E. coli ER2796 genomic DNA, a = target obtained by ATPase assay with a purified enzyme. Full details are given in the Supplementary Data. S.Sau133ORF1794P is characterized in this work but is included here as it is part of the RM system found in strain ED133. SauMRSI and SauMRSII characterized by Monk et al. and S.SauSTORF499P characterized by Chen et al. are also further characterized in this work.
The ‘artificial’ Sau1 systems containing novel pairings of TRDs
| ‘Artificial’ | |||||
|---|---|---|---|---|---|
| Recognition sequence | TRDs assigned | Experimental method | Recognition sequence | TRDs assigned | Experimental method |
|
| BE | a |
| Na* | s |
| GG | JE | g, s |
| Nc* | s |
|
| NI | g |
| Nd* | g, s |
|
| NK | g | GAR | RE | s |
|
| NL | g | CA | TE | s |
|
| NP | s | CNG | VE | s |
|
| NQ | g, s | TCT | XE | g, s |
|
| NS | s | G | ZE | a |
|
| NU | g, s | G | ZS | a |
|
| NW | g, s | GGH | b*E | s |
|
| NY | g, s | G | e*E | g, s |
Target sites are shown from 5΄ to 3΄ with the length of the non-specific spacer shown as a number. Underlined A or T indicates the site of adenine methylation on the top or bottom strands respectively. The experimental methods used are indicated as g = target obtained by DNA cleavage with a purified enzyme, s = target obtained by SMRT sequencing of E. coli ER2796 genomic DNA, a = target obtained by ATPase assay with a purified enzyme. Full details are given in the Supplementary Data.
The Sau1 RM systems investigated in this project
| Strain name and genome reference | Clonal Complex or Sequence Type | S subunit name in REBASE | Recognition sequence | TRDs assigned | Suggested generic name | Experimental method |
|---|---|---|---|---|---|---|
| CO1791 ( | CC97 | S.SauC01791ORFAP | CCAY-6-RTC | Ac* | CC97-1 | s |
| HO5096 ( | CC22 | S.Sau5096I |
| BI | CC22-1 | g, s |
| LGA251 ( | ST425 | S.Sau251I | GW | CE | ST425-1 | g, s* |
| S.Sau251ORF16900P | G | e*f* | ST425-2 | Not expressed, no signature with s*. | ||
| S.Sau251II | GA | Not a | Same as CC93-3 | s* | ||
| Isolate 3 ( | CC51 | S.SauL3ORFAP | GG | JP | CC51-1 | s |
| Isolate 3067 ( | CC45 | S.Sau347I | GW | CL | CC45-1 | g |
| Isolate 3150 ( | CC15 | S.SauL315ORFAP | CA | OE | CC15-1 | s |
| SA40 ( | CC59 | S.SauSA40ORF370P | GG | JQ | CC59-1 | a |
| CN1 ( | CC72 | S.SauCN1ORF415P | GAR | RQ | CC72-1 | a |
| S.SauCN1ORF1757P | GG | JS | CC72-2 | a | ||
| MSHR1132 ( | CC75 | S.Sau1132ORF3780P | CA | TU | CC75-1 | g |
| S.Sau1132ORF16570P | CNG | VW | CC75-2 | s | ||
| NCTC13435 NCBI Biosample identifier: | ST80 | S.Sau13435ORF394P | TCTA-?-TAG | XY | ST80-1 | Not expressed, no signature with s or s*. |
| SAMEA2479566 | S.Sau13435ORF1751P | GAC-6-TTYG | ZW | ST80-2 | a, s* | |
| S.Sau13435ORF2165P | TCTA-6-RTTC | Xf* | ST80-3 | s, s* | ||
| 32326 ( | CC873 | S.Sau32326ORFAP | G | e*D | CC873-1 | a |
Target sites are shown from 5΄ to 3΄ with the length of the non-specific spacer shown as a number. Underlined A or T indicates the site of adenine methylation on the top or bottom strands respectively. TRD pair e*f* in strain LGA251 was not cloned in E. coli while TRD pair XY was cloned. However, no target modification was observed using SMRT on genomic DNA from either E. coli or S. aureus for these TRD pairs. If the genes are translated, their target is inferred from other TRDs in this table although the spacer length remains undefined. The experimental methods used are indicated as g = target obtained by DNA cleavage with a purified enzyme, s = target obtained by SMRT sequencing of E. coli ER2796 genomic DNA, s* = target obtained by SMRT sequencing of S. aureus genomic DNA, a = target obtained by ATPase assay with a purified enzyme. Full details are given in the Supplementary Data.
Further TRD pairs found in sequenced strains of Staphylococcus aureus
| TRD pair | Example strain | Clonal complex or sequence type of example strain | REBASE name |
|---|---|---|---|
| AD | FDAARGOS_159 | ST5 | S.Sau159ORF12345P |
| AL | K12S0375 | ST692 | S.Sau375ORFDP |
| AU |
| S.SauFSA084ORF355P | |
| AW | FDA209P | ST464 | S.Sau209ORF1697P |
| BG | MRSN8611 | ST8 | S.Sau8611ORF11430P |
| BH | PLAC6019 | ST5 | S.Sau6019ORF851P |
| BU | SA-083 | ST101 | S.Sau083ORF9680P |
| BY |
| S.SarM260ORF2316P | |
| Bf* | SA-083 | ST101 | S.Sau083ORF1720P |
| JE | Tager 104 | ST49 | S.Sau104ORF1102P |
| JL | W56227 | ST45 | S.Sau56227ORF970P |
| JW | CIG290 | ST45 | S.SauCIG290ORF2408P |
| JW | APS211 | ST45 | S.SauAPS211ORF9230P |
| MW | FSA037 | ST1872 | S.SauFSA037ORF2487P |
| NQ | KPL1845 | ST96 | S.Sau1845ORF2596P |
| Of* | USA300-TCH959 | ST1159 | S.SauTCH959ORF2844P |
| Rf* | Tager 104 | ST49 | S.Sau104ORF2433P |
| TY | M21126 | ST2250 | S.Sau21126ORF1065P |
| XF | 21334 | ST109, CC9 | S.Sau21334ORF1353P |
| XF | RKI4 | ST27 | S.SauRKI4ORF1905P |
| XW | 103564 | ST80-PVL carrier | S.Sau103564ORF678P |
| ZY | D139 | ST145 | S.SauD139ORF2470P |
| b*W | ST20130941 | CC15 | S.Sau941ORF4310P |
| e*f* | SA-120 | ST425 | S.Sau120ORF4875P |
Every pair of TRD1 with TRD2 in Table 1 was used in a BLASTP sequence search to identify HsdS subunit sequences in publicly accessible databases. Examples of strains containing these TRD pairs are shown. ST and CC are from the PATRIC database (69) or derived using (73). Some TRD pairs are present in many strains while others are rare.
New TRD pairs associated with pairs shown in Tables 2, 3 and 4
| Subspecies 21343 Bioproject accession: PRJNA53699 |
| > S.Sau21343ORF2597P TRD NOVEL 1 + TRD K |
|
|
| Species KPL1845. Bioproject accession: PRJNA169473 |
| > S.Sau1845ORF1619P TRD NOVEL 2 + NOVEL 3 |
|
|
| > S.Sau1845ORF2199P TRD NOVEL 4 + TRD f* |
|
|
The new TRDs of unknown specificity are termed NOVEL 1, NOVEL 2, NOVEL 3 and NOVEL 4. TRD NOVEL 3 is a second TRD while the others are first TRDs in the HsdS amino acid sequence. Subspecies 21343 and species KPL1845 also contain S.SauNQ (S.Sau21343ORF1169P and S.Sau1845ORF2596P respectively).
Figure 3.General protection from endonuclease activity using M.EcoGII MTase to methylate all adenines. Plasmid without M.EcoGII treatment is digested (− lanes) but plasmid with M.EcoGII treatment is protected from digestion (+ lanes). Panel (A) uses Sau347I (CC45-1, TRDs C and L) restriction enzyme against plasmids E2, E5 and E10 described in (31). Panel (B) uses SauNY (TRDs N and Y) against plasmids E10, E11 and E12 described in (31). Panel (C) uses three different enzymes, SauN315I (CC5-1, TRDs B and D), SauN315II (CC5-2, TRDs A and H) and SauMRSII (CC30-1, TRDs C and D), against plasmid E10. In each panel EcoRI restriction enzyme was used as a control and markers (M) are in kb.