| Literature DB >> 23281931 |
Michal Wozniak1, Jerzy Tiuryn, Limsoon Wong.
Abstract
BACKGROUND: Drug resistance in bacterial pathogens is an increasing problem, which stimulates research. However, our understanding of drug resistance mechanisms remains incomplete. Fortunately, the fast-growing number of fully sequenced bacterial strains now enables us to develop new methods to identify mutations associated with drug resistance.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23281931 PMCID: PMC3521396 DOI: 10.1186/1471-2164-13-S7-S23
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Support and weighted support. A schematic example of classification of genetic variation profiles and computation of their supports. Point mutations 1 and 4 are essential, mutation 2 is conflict and mutation 3 is neutral. Light blue circles mark nodes which appear in the definition of weighted support. These are nodes the highest parental nodes (for the leaf nodes corresponding to drug-resistant strains), that their subtrees do not contain any drug-susceptible strains in leaves. The scores (a) support and (b) weighted support are assigned to these mutations. For this drug-resistance profile, the ratio αequals .
Rankings of known drug resistance determining genes
| Rankings before prediction | Rankings after prediction | ||||||
|---|---|---|---|---|---|---|---|
| gene id. | drug name | S | WS | OR | S | WS | OR |
| tet | Tetracycline | 54.5 | 2.5 | 43.7 | 1.5 | 1.5 | 1.5 |
| tetM | Tetracycline | 14.5 | 11.5 | 7.5 | 4 | 4 | 4 |
| mecA | Methicillin | 1 | 1 | 1 | 1 | 1 | 1 |
| mecA | Oxacillin | 3 | 4 | 2 | 1 | 2 | 1 |
| ermA1 | Clindamycin | 5.5 | 5.5 | 5.5 | 1 | 1 | 1 |
| ermC | Clindamycin | 907 | 471 | 907 | 414.5 | 11 | 191.5 |
| ermA1 | Erythromycin | 3 | 3 | 4 | 1 | 1 | 1 |
| ermC | Erythromycin | 1527 | 3994.5 | 1006.5 | 413.5 | 28 | 214.5 |
| aacA-aphD | Gentamicin | 72 | 34 | 34 | 1 | 1 | 1 |
| blaZ | Penicillin | 163 | 66 | 223 | 1.5 | 1 | 2.5 |
| mecA | Penicillin | 163 | 8 | 223 | 11 | 5 | 52 |
| Average ranking (excluding ermC): | 53.27 | 60.411 | 2.55 | 7.22 | |||
Rankings of the known drug resistance determinants obtained by employing three different methods to score gene gain/loss profiles: support (S), weighted support (WS) and odds ratio (OR). Since some of the gene gain/loss profiles are assigned with the same score, we calculate their rankings as the arithmetic mean of positions of the profiles with the same score on the list sorted according to the scores; thus some of the rankings are not round numbers. The rankings were computed before and after prediction of drug resistance, which is based on the presence of the drug resistance determinants. We excluded the gene ermC from the calculations of the rankings since none of the methods were able to pull it out into the top 100 before prediction.
Figure 2The collected dataset of phenotypes with predictions. The collected dataset of phenotypes put together with results of our drug resistance predictions based on the presence of known drug resistance determinants. Due to the high number of strains the table is split into two panels. Columns represent drugs, rows represent S. aureus strains included in the study in the order corresponding to the reconstructed phylogenetic tree of strains. Green, yellow and red cell colors represent susceptible, intermediate resistant and resistant phenotypes, respectively. Analogously, light green and light red cell colors represent predicted susceptible and resistant phenotypes, respectively. White cell color represents unknown (not determined by experiments or prediction) drug resistance phenotypes.
The top scored gene gain/loss profiles
| Gene identifier | NS | NWS | OR | p-value | Gene functional annotation |
|---|---|---|---|---|---|
| SAR1831(blaZ) | 0.84 | 0.81 | 37.15 | 1.15e-06 | beta-lactamase |
| SAR1829(blaI) | 0.84 | 0.74 | 37.15 | 5.24e-06 | transcriptional repressor |
| SAR1830(blaR1) | 0.82 | 0.73 | 31.27 | 7.09e-06 | beta-lactamase regulatory protein blar1 |
| SAR0056 | 0.63 | 0.71 | 12.13 | 1.03e-05 | conserved hypothetical protein |
| SAR0039(mecA) | 0.61 | 0.70 | 10.94 | 1.28e-05 | penicillin-binding protein pbp2a, methicillin resistance determinant mecA, transpeptidase |
| SAR0060(ccrA) | 0.61 | 0.63 | 10.94 | 4.40e-05 | resolvase, n-terminal domain protein |
| SAR0061(yycG) | 0.61 | 0.63 | 10.94 | 4.40e-05 | putative membrane protein |
| NWMN 0025 | 0.57 | 0.63 | 9.40 | 4.41e-05 | conserved domain protein |
| SAR0037(ugpQ) | 0.60 | 0.63 | 10.39 | 5.08e-05 | glycerophosphoryldiester phosphodiesterase |
| SAR0038(maoC) | 0.60 | 0.63 | 10.39 | 5.08e-05 | dehydratase |
| SAR0057 | 0.57 | 0.59 | 9.40 | 9.78e-05 | conserved hypothetical protein |
| Methicillin (NWS-threshold: 0.68) | |||||
| SAR0039(mecA) | 1.00 | 1.00 | 950.00 | 4.48e-20 | penicillin-binding protein pbp2a, methicillin resistance determinant mecA, transpeptidase |
| SAR0037(ugpQ) | 0.98 | 0.94 | 931.00 | 6.77e-15 | glycerophosphoryldiester phosphodiesterase |
| SAR0038(maoC) | 0.98 | 0.94 | 931.00 | 6.77e-15 | dehydratase |
| SAR0056 | 0.95 | 0.85 | 900.00 | 7.55e-12 | conserved hypothetical protein |
| SAR0036 | 0.64 | 0.80 | 33.78 | 5.77e-11 | putative membrane protein |
| SAR0057 | 0.85 | 0.75 | 162.00 | 6.47e-10 | conserved hypothetical protein |
| SAR0060(ccrA) | 0.91 | 0.73 | 432.00 | 1.40e-09 | resolvase, n-terminal domain protein |
| SAR0061(yycG) | 0.91 | 0.73 | 432.00 | 1.40e-09 | putative membrane protein |
| MW0028(ebpS) | 0.54 | 0.71 | 22.30 | 2.76e-09 | hmg-coa synthase |
| Tetracycline (NWS-threshold: 0.32) | |||||
| SAAV_b3(repC) | 0.54 | 0.64 | 27.69 | 5.70e-08 | plasmid replication protein |
| SATW20_00660(tet) | 0.54 | 0.64 | 27.69 | 5.70e-08 | tetracycline resistance protein |
| SATW20_00670(pre) | 0.50 | 0.50 | 24.00 | 3.51e-06 | plasmid recombination enzyme type 3 |
| SATW20_04620(tetM) | 0.46 | 0.37 | 20.80 | 7.54e-05 | tetracycline resistance protein tetM |
| SATW20_08990(virE) | 0.42 | 0.37 | 19.93 | 7.67e-05 | pathogenicity island protein |
| SATW20_09000 | 0.42 | 0.37 | 19.93 | 7.67e-05 | pathogenicity island protein |
| SATW20_09010(lipA) | 0.42 | 0.37 | 19.93 | 7.67e-05 | putative protein in superantigen-encoding pathogenicity islands |
| SATW20_04610(thiI) | 0.43 | 0.35 | 18.00 | 1.32e-04 | putative transcriptional regulator |
| MW0745(int) | 0.25 | 0.32 | 8.00 | 2.28e-04 | site-specific recombinase, phage integrase family |
| MW0747 | 0.25 | 0.32 | 8.00 | 2.28e-04 | DNA-binding helix-turn-helix protein |
| Erythromycin (NWS-threshold: 0.27) | |||||
| SAR0050(ermA1) | 0.80 | 0.58 | 76.00 | 1.36e-06 | rRNA adenine n-6-methyltransferase |
| CGSSa03 12660 | 0.47 | 0.44 | 17.19 | 2.98e-05 | conserved hypothetical protein |
| SAR0054(tnpA1) | 0.75 | 0.39 | 72.00 | 8.12e-05 | transposase for transposon |
| SAR1734 | 0.75 | 0.39 | 72.00 | 8.12e-05 | methylase |
| SAR1736(spc2) | 0.75 | 0.39 | 72.00 | 8.12e-05 | spectinomycin 9-o-adenylyltransferase |
| SaurJH9_1711(radC) | 0.72 | 0.38 | 62.00 | 8.83e-05 | predicted protein |
| SAUSA300_pUSA030006 | 0.20 | 0.35 | 4.75 | 1.65e-04 | replication and maintenance protein |
| SAR1737(tnpC2) | 0.72 | 0.34 | 62.00 | 1.89e-04 | Unknown |
| SAR1529 | 0.33 | 0.33 | 9.15 | 2.43e-04 | conserved hypothetical protein |
| SATW20_04860(recF_1) | 0.23 | 0.30 | 5.52 | 3.67e-04 | recombinational DNA repair ATPase |
| SAR1738(tnpB2) | 0.70 | 0.29 | 54.00 | 4.39e-04 | transposase B from transposon Tn554 |
| SauraJ_010100009720 | 0.23 | 0.27 | 5.52 | 6.60e-04 | conserved domain protein |
| Gentamicin (NWS-threshold: 0.83) | |||||
| SaurJH1_2806(aacA-aphD) | 0.83 | 0.90 | 150.00 | 9.38e-11 | bifunctional acetyltransferase/phosphotransferase GNAT family acetyltransferase |
| Ciprooxacin (NWS-threshold: 0.4) | |||||
| SATW20_04610(thiI) | 0.35 | 0.45 | 36.00 | 1.33e-07 | putative transcriptional regulator |
| SATW20_04650(cap8J) | 0.32 | 0.40 | 31.57 | 8.25e-07 | lipoprotein |
| SATW20_04670(capL) | 0.32 | 0.40 | 31.57 | 8.25e-07 | putative ATP/GTP-binding protein |
| SATW20_04780 | 0.32 | 0.40 | 31.57 | 8.25e-07 | conjugation related protein |
| SATW20_04800 | 0.32 | 0.40 | 31.57 | 8.25e-07 | replication initiation factor |
| SATW20_04810 | 0.32 | 0.40 | 31.57 | 8.25e-07 | DNA segregation ATPase FtsK/SpoIIIE |
| SATW20_04830 | 0.32 | 0.40 | 31.57 | 8.25e-07 | conjugative transposon protein |
Summarizing information for the top scored gene gain/loss profiles. The consequent columns refer to: gene identifier of the corresponding gene family; normalized support (NS); normalized weighted support (NWS); odds ratio (OR); p-value and the gene functional annotation. Thresholds for weighted support are provided in brackets for each drug. Colored gene gain/loss profiles are provided in the supplementary the (additional file 3). Complete results for all the drugs are provided in the supplementary Excel table (additional files 2).
The top scored point mutation profiles, only for essential mutations
| Gene identifier | desc. | NS | NWS | OR | p-value | Gene functional annotation |
|---|---|---|---|---|---|---|
| SAR0023(sasH) | G723D | 0.55 | 0.63 | 8.51 | 1.87e-05 | virulence-associated cell-wall-anchored protein; 5'-nucleotidase |
| SAR0023(sasH) | T725A | 0.54 | 0.62 | 8.11 | 2.23e-05 | virulence-associated cell-wall-anchored protein; 5'-nucleotidase |
| SAR0304 | V295I | 0.39 | 0.49 | 4.48 | 3.25e-04 | acid phosphatase |
| SAR2791 | V182M | 0.46 | 0.46 | 6.05 | 5.41e-04 | transcriptional regulator, Xre family |
| SAR2700 | N493KD | 0.52 | 0.45 | 7.72 | 6.16e-04 | ABC transporter permease protein |
| SAR0233(hmp) | Q333K | 0.44 | 0.44 | 5.48 | 7.21e-04 | avohemoprotein (nitric oxide dioxygenase) |
| SAR0318(sbnA) | N25HK | 0.44 | 0.43 | 5.48 | 8.36e-04 | alpha/beta family hydrolase |
| SAR2664 | V282AT | 0.44 | 0.43 | 5.48 | 8.36e-04 | probable monooxygenase |
| SAR2779 | S48G | 0.44 | 0.43 | 5.48 | 8.36e-04 | n-hydroxyarylamine o-acetyltransferase |
| SAR0318(sbnA) | T138IM | 0.43 | 0.43 | 5.21 | 8.36e-04 | alpha/beta family hydrolase |
| SAR0318(sbnA) | T139AQ | 0.43 | 0.43 | 5.21 | 8.36e-04 | alpha/beta family hydrolase |
| SAR0023(sasH) | A749TG | 0.41 | 0.43 | 4.96 | 8.44e-04 | virulence-associated cell-wall-anchored protein; 5'-nucleotidase |
| SAR0318(sbnA) | R130CG | 0.41 | 0.43 | 4.96 | 8.72e-04 | alpha/beta family hydrolase |
| SAR0322(folC) | H201YQE | 0.41 | 0.43 | 4.96 | 8.72e-04 | macro domain, possibly adp-ribose binding module |
| SAR0233(hmp) | K323ET | 0.40 | 0.42 | 4.71 | 9.08e-04 | avohemoprotein (hemoglobin-like protein) |
| SAR2750(icaC) | I21V | 0.40 | 0.42 | 4.71 | 9.46e-04 | polysaccharide intercellular adhesin (PIA) biosynthesis protein |
| SAR0233(hmp) | S309RN | 0.39 | 0.42 | 4.48 | 9.46e-04 | avohemoprotein (hemoglobin-like protein) |
| Methicillin (NWS-threshold: 0.25) | ||||||
| SAR0198(oppF) | T287IK | 0.10 | 0.29 | 2.11 | 1.41e-04 | putative glutathione transporter, ATP-binding component |
| SAR0420 | I72F | 0.10 | 0.29 | 2.11 | 1.41e-04 | membrane protein |
| SAR2508(sbi) | S219AT | 0.10 | 0.29 | 2.11 | 1.41e-04 | IgG-binding protein Sbi |
| SAR2508(sbi) | N222QK | 0.10 | 0.29 | 2.11 | 1.41e-04 | IgG-binding protein Sbi |
| SAR2508(sbi) | K224SDN | 0.10 | 0.29 | 2.11 | 1.41e-04 | IgG-binding protein Sbi |
| Tetracycline (NWS-threshold: 0.2) | ||||||
| SAR1840 | D291YS | 0.18 | 0.23 | 5.22 | 7.09e-04 | NAD(FAD)-utilizing dehydrogenases |
| SAR2336(rpsJ) | K57M | 0.29 | 0.23 | 9.60 | 7.32e-04 | SSU ribosomal protein S10P (S20E) |
| SAR0550(rpsL) | K113R | 0.36 | 0.20 | 13.33 | 1.14e-03 | SSU ribosomal protein S12P (S23E) |
| Erythromycin (NWS-threshold: 0.2) | ||||||
| SAR0576 | A68EV | 0.07 | 0.21 | 1.54 | 8.89e-04 | phosphoglycolate phosphatase |
| Gentamicin (NWS-threshold: 0.21) | ||||||
| SAR1840 | L289IW | 0.33 | 0.29 | 15.00 | 1.43e-03 | NAD(FAD)-utilizing dehydrogenases |
| SAR1840 | D291YS | 0.33 | 0.29 | 15.00 | 1.43e-03 | NAD(FAD)-utilizing dehydrogenases |
| SAR1840 | H327RF | 0.33 | 0.29 | 15.00 | 1.43e-03 | NAD(FAD)-utilizing dehydrogenases |
| SAR1167(ylmH) | K215N | 0.25 | 0.29 | 10.00 | 1.43e-03 | RNA-binding S4 domain-containing protein |
| SAR1167(ylmH) | R216V | 0.25 | 0.29 | 10.00 | 1.43e-03 | RNA-binding S4 domain-containing protein |
| SAR1167(ylmH) | V217L | 0.25 | 0.29 | 10.00 | 1.43e-03 | RNA-binding S4 domain-containing protein |
| SAR0547(rpoB) | D471YG | 0.17 | 0.21 | 6.00 | 4.61e-03 | DNA-directed RNA polymerase beta subunit |
| SAR1833(trmB) | T54IK | 0.17 | 0.21 | 6.00 | 4.61e-03 | tRNA (guanine46-n7-)-methyltransferase |
| Ciprooxacin (NWS-threshold: 0.12) | ||||||
| SAR1367(grlA) | S80YF | 1.00 | 1.00 | 2244.00 | 6.03e-30 | topoisomerase IV subunit a |
| SAR0006(gyrA) | S90AL | 0.94 | 0.88 | 1056.00 | 1.92e-18 | DNA gyrase subunit a |
| SAR2449(lytT) | V45I | 0.21 | 0.20 | 17.11 | 2.06e-04 | transcriptional regulator |
| SAR1840 | L289IW | 0.12 | 0.20 | 8.80 | 4.56e-04 | NAD(FAD)-utilizing dehydrogenases |
| SAR1793(thiI) | A92ET | 0.09 | 0.20 | 6.39 | 2.06e-04 | thiamine biosynthesis protein thiI |
| SAR2212(murA2) | A102T | 0.06 | 0.20 | 4.12 | 2.06e-04 | UDP-n-acetylglucosamine 1-carboxyvinyltransferase |
| SAR1367(grlA) | E84KG | 0.26 | 0.15 | 23.76 | 9.40e-04 | topoisomerase IV subunit a |
| SAR0235(pstG 1) | F401LV | 0.09 | 0.13 | 6.39 | 2.21e-03 | PTS system, maltose and glucose-specific IIC component |
| SAR0400(nfrA) | R194H | 0.09 | 0.13 | 6.39 | 2.21e-03 | nitroreductase family protein |
Summarizing information for the top scored point mutation profiles, only for essential mutations. The conflict mutatations were removed from the table for: tetracycline, erythromycin and gentamicin (for the rest of drugs there were no conflict mutations above the set thresholds). The consequent columns refer to: gene identifier of the corresponding gene family; corresponding position in the multiple alignment and changed amino acids; normalized support (NS); normalized weighted support (NWS); odds ratio (OR); p-value and the gene functional annotation. Thresholds for weighted support are provided in brackets for each drug. Colored gene gain/loss profiles are provided in the supplementary the (additional file 4). Complete results for all the drugs are provided in the supplementary Excel table (additional files 5).
Figure 3Genes related to methicillin resistance. Presence and relative genome coordinates of genes related to methicillin resistance (mecA, mecR1, mecI, ccrA, ccrB, ccrC), put together with the identified genes: ugpQ and maoC. The gene presence profiles are clustered with respect to the genes order. In this figure we include only these methicillin-resistant strains for which all the genes where located on the main genome and within the same sequence conting (in order to determine the relative positions).