| Literature DB >> 29263245 |
Mehul Jani1, Soham Sengupta1, Kelsey Hu2, Rajeev K Azad3,4.
Abstract
Staphylococcus aureus is a versatile pathogen that is capable of causing infections in both humans and animals. It can cause furuncles, septicaemia, pneumonia and endocarditis. Adaptation of S. aureus to the modern hospital environment has been facilitated, in part, by the horizontal acquisition of drug resistance genes, such as mecA gene that imparts resistance to methicillin. Horizontal acquisitions of islands of genes harbouring virulence and antibiotic resistance genes have made S. aureus resistant to commonly used antibiotics. To decipher genomic islands (GIs) in 22 hospital- and 9 community-associated methicillin-resistant S. aureus strains and classify a subset of GIs carrying virulence and resistance genes as pathogenicity and resistance islands respectively, we applied a host of methods for localizing genomic islands in prokaryotic genomes. Surprisingly, none of the frequently used GI prediction methods could perform well in delineating the resistance islands in the S. aureus genomes. Rather, a gene clustering procedure exploiting biases in codon usage for identifying horizontally transferred genes outperformed the current methods for GI detection, in particular in identifying the known islands in S. aureus including the SCCmec island that harbours the mecA resistance gene. The gene clustering approach also identified novel, as yet unreported islands, with many of these found to harbour virulence and/or resistance genes. These as yet unexplored islands may provide valuable information on the evolution of drug resistance in S. aureus.Entities:
Keywords: MRSA; Staphylococcus aureus; antibiotic resistance; genomic islands; pathogenicity
Mesh:
Substances:
Year: 2017 PMID: 29263245 PMCID: PMC5746543 DOI: 10.1098/rsob.170094
Source DB: PubMed Journal: Open Biol ISSN: 2046-2441 Impact factor: 6.411
Coordinates of known GIs in the genomes of the five MRSA strains and the corresponding JS-CB predicted GI coordinates. Coordinates of known GIs in MRSA strains [5,13,14] and coordinates of the corresponding JS-CB predicted GIs in the MRSA genomes are indicated in base pairs (bp). Δ, remnant GI [5].
| USA300 | MW2 | N315 | Mu50 | COL | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| island | known [ | predicted | known [ | predicted | known [ | predicted | known [ | predicted | known [ | predicted |
| SCC | 34 173–57 914 | 34 513–113 460 | 34 150–58 278 | 34 490–87 067 | 34 158–87 119 | 33 692–111 612 | 34 158–87 085 | 34 455–107 910 | 34 173–68 085 | 34 375–51 600 |
| ACME | 57 915–88 900 | 34 513–113 460 | — | — | — | — | — | — | — | — |
| νSaα | 441 501–473 470 | 441 501–479 066 | 416 307–452 099 | 416 307–440 048 | 436 162–466 813 | 436 159–467 952 | 461 919–491 326 | 477 065–492 473 | 465 424–489 723 | 451 796–476 971 |
| SaPI-5 | 881 835–895 807 | 843 347–898 530 | — | — | — | — | — | — | — | — |
| φSa2 | 1 545 912–1 592 050 | 1 557 369–1 606 675 | 1 529 123–1 575 042 | 1 540 580–1 589 663 | — | — | — | — | — | — |
| νSaβ | 1 924 777–1 959 376 | 1 930 682–1 936 843 | 1 890 800–1 922 552 | 1 895 586–1 901 747 | 1 854 608–1 881 615 | — | 1 932 523–1 961 464 | — | 1 902 466–1 938 731 | 1 909 272–1 914 665 |
| φSa3 | 2 084 658–2 127 720 | 2 096 778–2 138 923 | 2 046 205–2 088 820 | 2 058 798–2 100 838 | 2 005 321–2 049 591 | 2 018 761–2 072 272 | 2 083 238–2 126 304 | 2 095 697–2 148 702 | — | — |
| νSa4 | Δ2136710–2 139 851 | 2 096 778–2 138 923 | Δ2097809–2 100 950 | 2 058 798–2 100 838 | 2 056 679–2 072 358 | 2 018 761–2 072 272 | 2 133 235–2 148 912 | 2 095 697–2 148 702 | Δ2072899–2 076 041 | 2 063 728–2 075 113 |
| νSaγ | 1 149 668–1 169 542 | — | 1 133 469–1 153 549 | — | 1 132 235–1 153 775 | — | 1 208 629–1 230 219 | — | 1 173 206–1 193 358 | — |
| φSa1 | — | — | — | — | — | — | 917 453–962 005 | 916 042–981 803 | — | — |
| νSa3 | — | — | 839 358–853 808 | 827 901–856 739 | — | — | 868 373–882 872 | 868 462–886 899 | — | — |
| φCOL | — | — | — | — | — | — | — | — | 354 674–398 267 | 315 294–377 918 |
| νSa1 | — | — | — | — | — | — | — | — | 903 332–919 283 | 873 009–922 004 |
Assessment of GI prediction tools: Recall and precision in identifying known GIs in five MRSA strains are shown for GI prediction methods. Highest values of recall and precision among all methods are shown in red; recall N%: (number of known GIs with at least N% of nucleotides classified correctly)/(number of known GIs); precision N%: (number of known GIs with at least N% of nucleotides classified correctly)/(number of GIs predicted). JS-CB, Jensen–Shannon Codon-Bias; MSGIP, Mean Shift Genomic Island Predictor.
Figure 1.GIs predicted in five Staphylococcus aureus MRSA strains by different methods. From the outside inward: first seven circles show the GIs predicted by MSGIP (blank as no island identified), PredictBias, Alien_Hunter, GIHunter, Zisland Explorer, IslandViewer, SeqWord and JS-CB; known islands are shown in the eighth circle with arrows pointing to their names. Mosaic GIs predicted by JS-CB have their distinct segments shown in different shades of blue colour. Ninth circle represents GC skew in green and violet colour. The reference S. aureus strains used were (a) MW2, (b) USA300_FPR3757, (c) COL, (d) Mu50 and (e) N315. The figure was generated using CGviewer 3.0 [27].
Performance metrics of GI prediction tools: recall, precision, mean accuracy, F-measure, and performance coefficient of GI prediction methods averaged over the five MRSA strains are shown for different overlap cut-offs, with highest values among all methods shown in red; recall N%: (number of known GIs with at least N% of nucleotides classified correctly)/(number of known GIs); precision N%: (number of known GIs with at least N% of nucleotides classified correctly)/(number of GIs predicted). JS-CB, Jensen–Shannon Codon-Bias.
Comparison of JS-CB and GEMINI: Performance of JS-CB and GEMINI in identifying four verified islands in P. aeruginosa LESB58 strain (a) and SCCmec island in five MRSA strains (b) respectively was assessed at the default parameter setting and at two other parameter settings (see text). The known and predicted GI coordinates in the genomes are shown in base pairs (bp).
| ( | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| verified islands in | VI-1 | VI-2 | VI-3 | VI-4 | ||||||
| start | end | start | end | start | end | start | end | |||
| verified islands co-ordinates | 2 504 700 | 2 551 100 | 2 690 450 | 2 740 350 | 2 751 800 | 2 783 500 | 2 796 836 | 2 907 406 | ||
| JS-CB co-ordinates at default parameters (native cluster size 64.5%) | 2 506 736 | 2 560 126 | 2 690 501 | 2 786 349 | 2 690 501 | 2 786 349 | 2 798 946 | 2 929 279 | ||
| JS-CB co-ordinates at altered parameter settings (native cluster size 87%) | 2 525 630 | 2 560 126 | 2 708 796 | 2 737 803 | 2 756 461 | 2 786 349 | 2 858 302 | 2 906 455 | ||
Figure 2.Gene maps of: (a) N315_GI1: transposase gene imparting mobility to the island has been highlighted in red; (b) N315_GI7: the presence of a transposase gene in the GI is indicated; (c) Mu50_GI1: the presence of a gene encoding transposase of insertion element is indicated; and (d) Mu50_GI6: the presence of a transposase gene and genes involved in metabolism in the GI is indicated.
Figure 3.Gene map of: (a) N315_GI3: this GI possesses multiple copies of transposase genes (shown in red) and genes required for adhesion (i.e. virulence genes), namely fibrinogen-binding protein A and coagulase encoding genes (shown in blue); (b) N315_GI10: the presence of intercellular adhesin genes in the GI with a role in virulence is indicated; and (c) USA300_GI7: virulence genes (shown in blue) and a gene required for GI integration (shown in red) are indicated in this GI, suggesting a potential role in pathogenicity.