| Literature DB >> 25860355 |
Cary Pirone-Davies1, Maria Hoffmann1, Richard J Roberts2, Tim Muruvanda1, Ruth E Timme1, Errol Strain3, Yan Luo3, Justin Payne1, Khai Luong4, Yi Song4, Yu-Chih Tsai4, Matthew Boitano4, Tyson A Clark4, Jonas Korlach4, Peter S Evans1, Marc W Allard1.
Abstract
The methylation of DNA bases plays an important role in numerous biological processes including development, gene expression, and DNA replication. Salmonella is an important foodborne pathogen, and methylation in Salmonella is implicated in virulence. Using single molecule real-time (SMRT) DNA-sequencing, we sequenced and assembled the complete genomes of eleven Salmonella enterica isolates from nine different serovars, and analysed the whole-genome methylation patterns of each genome. We describe 16 distinct N6-methyladenine (m6A) methylated motifs, one N4-methylcytosine (m4C) motif, and one combined m6A-m4C motif. Eight of these motifs are novel, i.e., they have not been previously described. We also identified the methyltransferases (MTases) associated with 13 of the motifs. Some motifs are conserved across all Salmonella serovars tested, while others were found only in a subset of serovars. Eight of the nine serovars contained a unique methylated motif that was not found in any other serovar (most of these motifs were part of Type I restriction modification systems), indicating the high diversity of methylation patterns present in Salmonella.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25860355 PMCID: PMC4393132 DOI: 10.1371/journal.pone.0123639
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of Salmonella genomes sequenced in this study.
| Serovar | Chromosome size (bp) | Plasmid size (bp) | GenBank Accession (chromosome) | GenBank Accession (plasmid) | Phage | MTase on phage (specificity, if known) | MTase on plasmid (specificity, if known) |
|---|---|---|---|---|---|---|---|
|
| 4730612 | 78193 | CP006053.1 | CP006054.1 | Salmon_Fels_1_NC_010391 Gifsy_1_NC_010392 | _ | M.SbaUORF280P |
|
| 4737447 | NA | CP007534.1 | _ | Entero_ST64T_NC_004348 Gifsy_2_NC_010393 | _ | _ |
|
| 4706101 | NA | CP007531.1 | _ | Salmon_Fels_1_NC_010391 Gifsy_1_NC_010392 | M.SenAnaORF14155P | _ |
|
| 4977480 | 166,668 122,863 | CP006055.1 | CP006056.1 CP006057.1 | Gifsy_1_NC_010392 Salmon_vB_SemP_Emek_NC_018275 | _ |
|
|
| 4783943 | 110,363 37,679 | CP005390.2 | CP005389.2 CP005391.2 | Entero_P22_NC_002371 Gifsy_2_NC_010393 |
| M.Sen2069ORF23325P |
|
| 4783867 | 37692 | CP005995.1 | CP005994.1 | Entero_P22_NC_002371 Gifsy_2_NC_010393 |
| _ |
|
| 4634161 | 24,012 17,094 | CP004027.1 | CP004026.1 CP004028.1 | Gifsy_2_NC_010393 Salmon_RE_2010_NC_019488 Entero_PsP3_NC_005340 | M.SenJORF19790P (GATC) | _ |
|
| 4694375 | NA | CP007530.1 | _ | Salmon_vB_SosS_Oslo_NC_018279 Entero_Fels_2_NC_010463 |
| _ |
|
| 4679662 | 59369 | CP007528.1 | CP007529.1 | Salmon_RE_2010_NC_019488 Gifsy_2_NC_010393 |
| _ |
| S. Enteritidis CFSAN000111 | 4679081 | 39599 | CP007598.1 | CP007599.1 | Gifsy_2_NC_010393 Salmon_RE_2010_NC_019488 | M.Sen1427ORF7910P (GATC) | _ |
| S. Typhimurium CFSAN001921 | 4859931 | 3,609 4,675 221,009 | CP006048.1 | CP006052.1 CP006051.1 CP006050.1 | Salmon_ST64B_NC_004313 Gifsy_1_NC_010392 Gifsy_2_NC_010393 Entero_ST104_NC_005841 | _ | M.SenTFORF23885P (CAGCTG) M.SenTFORF24805P (CCNGG) |
Fig 1The methylomes of eleven Salmonella serovars.
Violet = MTase identified, light violet = MTase unknown, red = novel motif, orange = MTase present in genome, motif not observed. Light violet/red stripe = novel motif, MTase unknown. Roman numerals indicate MTase type (I – III). Note the majority of novel MTases are Type I systems.
Summary of motifs and methyltransferases found in each Salmonella genome.
| Serovar | Enzyme Assignment | Gene Locus_Tag (GenBank) | Type | Sub- Type | Motif Observed | Motif Uni-que | % Methyl-ated 5'-3'/3'-5' | Number Methyl- ated Motifs (5'-3' strand/ 3'-5' strand) | Number Motifs in Genome (5'-3' strand/ 3'-5' strand) |
|---|---|---|---|---|---|---|---|---|---|
|
| M.SbaUI | SEEB0189_17520 | III | beta | CAGm6AG | no | 97.7 | 5652 | 5787 |
| M.SbaUII | SEEB0189_19945 | II | beta | Cm6AGC | no | 88.2 | 1466 | 1662 | |
| M.SbaUIII | SEEB0189_19740 | I | gamma | CCGm6ANNNNNG | yes | 98.6/ 98.6 | 482/ 482 | 489/ 489 | |
| M.SbaUIV | SEEB0189_02925 | II | beta | A | no | 78.5 | 1093 | 1392 | |
| M.SbaUDam | SEEB0189_02450 | Orphan | _ | G6mA | no | 98.6/ 98.6 | 37148 | 37688 | |
|
| M.SenAboI | SEEA0014_11325 | III | beta | CAGm6AG | no | 96.8 | 5391 | 5569 |
| M.SenAboII | SEEA0014_03225 | II | beta | A | no | 38 | 283 | 744 | |
| M.SenAboIV | SEEA0014_08865 | I | gamma | GAm6ACNNNNNNN | yes | 94.9/ 93.5 | 410/ 404 | 432/ 432 | |
| M.SenAboDam | SEEA0014_03700 | Orphan | alpha | G6mA | no | 95.1/ 95.1 | 35607 | 37436 | |
| M1.SenAboIII | SEEA0014_08700 | I | gamma | G6mATGNNNNNG4mGC/ G4mCCNNNNNCATC | yes | 96.1/ 31.0 | 1260/ 406 | 1311/ 1311 | |
| M2.SenAboIII | SEEA0014_08705 | I | gamma | G6mATGNNNNNG4mGC/ G4mCCNNNNNCATC | yes | 96.1/ 31.0 | 1260/ 406 | 1311/ 1311 | |
|
| M.SenAnaI | SEEA1592_11695 | I | gamma | CCm6ANNNNNNNN | yes | 99.7/ 99.4 | 354/ 353 | 355/ 355 |
| M.SenAnaII | SEEA1592_09525 | III | beta | CAGm6AG | no | 100.0 | 5509 | 5511 | |
| M.SenAnaIII | SEEA1592_17520 | II | beta | A | no | 66.7/ 66.7 | 674 | 1010 | |
| M.SenAnaIV | SEEA1592_11855 | II | beta |
4mCCWWG | no | 83.3/ 83.3 | 1423 | 1708 | |
| M.SenAnaDam | SEEA1592_01330 | Orphan | _ | G6mA | no | 99.8/ 99.8 | 37140 | 37224 | |
|
| M.Sen2050I | CFSAN002050_08375 | III | beta | CAGm6AG | no | 95.1 | 6235 | 6558 |
| M.Sen2050II | CFSAN002050_23900 | II | beta | A | no | 45.1/ 45.1 | 510 | 1131 | |
| _ | _ | I | _ | GGm6ANNNNNNA | yes | 92.7/ 92.3 | 459/ 457 | 495/ 495 | |
| _ | _ | I | _ | TCm6ANNNNNGT | yes | 95.5/ 92.3 | 1248/ 1338 | 1352/ 1352 | |
|
| M.Sen2064I | CFSAN002064_15765 | I | gamma | Gm6AGNNNNNNR | no | 97.9/ 97.5 | 231/ 230 | 236/ 236 |
| M.Sen2064II | CFSAN002064_18310 | III | beta | CAGm6AG | no | 98.2 | 5587 | 5691 | |
| M.Sen2064III | CFSAN002064_10125 | II | beta | A | no | 42.4 | 319 | 752 | |
| _ | _ | II | _ | ACCm6ANCC | no | 99.4 | 2703 | 2719 | |
|
| M.Sen2069I | CFSAN002069_07060 | III | beta | CAGm6AG | no | 97.9 | 5816 | 5939 |
| M.Sen2069II | CFSAN002069_09575 | I | gamma | Gm6AGNNNNNNR | no | 97.5/ 97.9 | 238/ 239 | 244/ 244 | |
| M.Sen2069III | CFSAN002069_15235 | II | beta | A | no | 42.2/ 42.2 | 217 | 514 | |
| _ | _ | ACCm6ANCC | 99 | 2747 | 2774 | ||||
|
| M.SenJI | CFSAN001992_09405 | III | beta | CAGm6AG | no | 97.8 | 5410 | 5523 |
| M.SenJII | CFSAN001992_11490 | I | gamma | CCm6AYNNNNNR | yes | 98.1/ 97.7 | 474/ 472 | 483/ 483 | |
| M.SenJIII | CFSAN001992_16620 | II | beta | A | no | 58.8/ 58.8 | 803 | 1364 | |
| _ | _ | G6mA | no | 98.9/ 98.9 | 36330 | 36738 | |||
|
| M.Sen255I | Y007_00590 | III | beta | CAGm6AG | no | 99.4 | 5504 | 5535 |
| M.Sen255II | Y007_12075 | II | beta | A | no | 50.0/ 50.0 | 387 | 774 | |
| _ | _ | I | _ | CG6mAYNNNNNNNR | yes | 99.1/ 98.9 | 439/ 438 | 443/ 443 | |
| _ | _ | II | alpha | G6mA | no | 99.1/ 99.1 | 36866 | 37204 | |
| _ | _ | I | _ | GCm6ANNNNNNC | no | 98.6/ 99.5 | 554/ 559 | 562/ 562 | |
|
| M.Sen1427II | SEEE1427_7355 | I | gamma | CGm6ANNNNNN | no | 98.4/ 97.9 | 1721/ 1712 | 1749/ 1749 |
| _ | _ | II | G6mA | no | 98.8/ 98.8 | 36824 | 37256 | ||
| M.Sen1427I | SEEE1427_9465 | CAGm6AG | no | 99.2 | 5505 | 5549 | |||
| M.Sen1427III | A | no | 43 | 315 | 732 | ||||
| S. Enteriditis CFSAN000158 | M.Sen158I | SEEE0968_18850 | III | beta | CAGm6AG | no | 98.1 | 5490 | 5599 |
| M.Sen158II | SEEE0968_20955 | I | gamma | CGm6ANNNNNN | no | 99.0/ 98.0 | 1739/ 1722 | 1757/ 1757 | |
| M.Sen158III | SEEE0968_03950 | II | beta | A | no | 41.3/ 41.3 | 302 | 732 | |
| S. Typhimurium CFSAN001921 | M.SenTFI | CFSAN001921_15255 | III | beta | CAGm6AG | no | 89.3 | 5635 | 6308 |
| M.SenTFII | CFSAN001921_17800 | I | gamma | CRTm6AYNNNNNNCTC | no | 90.7/ 89.1 | 233/ 229 | 257/ 257 | |
| M.SenTFIII | CFSAN001921_00055 | II | beta | A | no | 60.9 | 630 | 1035 | |
| SenTFIV | CFSAN001921_17955 | II | alpha | GATC6mAG | no | 94.3 | 2841 | 3011 |
*A unique motif refers to one that has not been previously observed in any bacterial species.
Methyltransferases identified in the Salmonella serovars, but not assigned to a motif.
| Serovar | Enzyme Assignment | Type | SubType | Motif (if known) |
|---|---|---|---|---|
|
| M.SbaUORF19730P | I | gamma | - |
| M.SbaUORF280P | II | beta | - | |
|
| M.SenAboORF8720P | I | gamma | - |
|
| M.SenAnaDamP | Orphan | alpha | G6mA |
| M.SenAnaORF14155P | II | alpha | G6mA | |
|
| M.Sen2050DamP | Orphan | alpha | G6mA |
| M.Sen2050ORF235P | II | alpha | G6mA | |
| M.Sen2050ORF245P | II | gamma | - | |
| M.Sen2050ORF400P | II | gamma | - | |
| M.Sen2050ORF480P | II | beta | - | |
| M.Sen2050ORF4940P | I | gamma | - | |
| M.Sen2050ORF5885P | I | gamma | - | |
|
| M.Sen2064DamP | Orphan | alpha | G6mA |
| M.Sen2064ORF21380P | II | alpha | G6mA | |
| Sen2064ORF15615P | II | G,S | GATC6mAG | |
|
| M.Sen2069DamP | Orphan | alpha | G6mA |
| M.Sen2069ORF4005P | II | alpha | G6mA | |
| M.Sen2069ORF23325P | II | beta | - | |
| Sen2069ORF9735P | II | G,S | GATC6mAG | |
|
| M.SenJORF11520P | I | gamma | - |
| M.SenJDamP | orphan | alpha | G6mA | |
| M.SenJORF19790P | II | alpha | G6mA | |
| M.SenJORF20475P | II | alpha | G6mA | |
| M.SenJORF6415P | II | G6mA | ||
|
| M.Sen255DamP | Orphan | alpha | G6mA |
| M.Sen255ORF17075P | II | alpha | G6mA | |
| M.Sen255ORF20925P | I | gamma | - | |
| M.Sen255ORF5995P | I | gamma | - | |
|
| M.Sen1427DamP | Orphan | alpha | G6mA |
| M.Sen1427ORF7380P | I | gamma | - | |
| M.Sen1427ORF7910P | II | alpha | G6mA | |
| S. Enteritidis CFSAN000158 | M.Sen158DamP | Orphan | alpha | G6mA |
| M.Sen158ORFDP | II | alpha | G6mA | |
| M.Sen158ORF20930P | I | gamma | - | |
| S. Typhimurium CFSAN001921 | M.SenTFDamP | Orphan | alpha | G6mA |
| M.SenTFORF6885P | II | G6mA | ||
| M.SenTFORF23885P | II | beta | Cm6AGC | |
| M.SenTFORF24320P | II | - | ||
| M.SenTFORF3520P | III | beta | - |
am5C MTases not included.
Fig 2Diagram of the interpulse duration (IPD) ratio of three novel motifs identified in three Salmonella serovars.
Vertical axis = IPD ratio, horizontal axis = genome position. IPD ratio listed next to bar. A. Motif CCm6AN8 TGAG in S. Anatum. B. Motif RAm6ACN5 TGA in S. Cubana. C. Motif CGm6AYN6RTRTC in S. Montevideo.