| Literature DB >> 28197063 |
Jessica Marklevitz1, Laura K Harris1.
Abstract
Antibiotic resistance Staphylococcus aureus strains cause several life threatening infections. New drug treatment options are needed, but are slow to develop because 50% of the S. aureus genome is hypothetical. The goal of this is to aid in the annotation of the S. aureus NCTC 8325 genome by identifying hypothetical proteins related to the Major Facilitator Superfamily (MFS). The MFS is a broad protein group with members involved in drug efflux mechanisms causing resistance. To do this, sequences for three MFS proteins with x-ray crystal structures in E. coli were PSI-BLASTed against the S. aureus NCTC 8325 genome to identify homologs. Eleven identified hypothetical protein homologs underwent BLASTP against the non-redundant NCBI database to fit homologs specific to each hypothetical protein. ExPASy characterized the physiochemical features, CDD-BLAST and Pfam identified domains, and the SOSUI server defined transmembrane helices of each hypothetical protein. Based on size (300 - 700 amino acids), number of transmembrane helices (>7), CD06174 and MFS domains in CDD-BLAST and Pfam, respectively, and close relation to well-defined homologs, SAOUHSC_00058, SAOUHSC_00078, SAOUHSC_00952, SAOUHSC_02435, SAOUHSC_02752, and ABD31642.1 are members of the MFS. Further multiple-alignment and phylogeny analyses show SAOUHSC_00058 to be a quinolone resistance protein (NorB), SAOUHSC_00058 a siderophore biosynthesis protein (SbnD), SAOUHSC_00952 a glycolipid permease (LtaA), SAOUHSC_02435 a macrolide MFS transporter, SAOUHSC_02752 a chloramphenicol resistance (DHA1), and ABD31642.1 is a Bcr/CflA family drug resistance efflux transporter. These findings provide better annotation for the existing genome, and identify proteins related to antibiotic resistance in S. aureus NCTC 8325.Entities:
Keywords: Functional annotation; Hypothetical proteins; S.aureus; Superfamily
Year: 2016 PMID: 28197063 PMCID: PMC5290667 DOI: 10.6026/97320630012254
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Experimental Overview. Protein sequences for three E. coli MFS proteins underwent PSI-BLAST. Hypothetical protein homologs meeting five parameters are likely functional MFS and then compared to homologs of predicted function.
Homologs in S. aureus NCTC 8325 to GlpT from E. coli
| Protein | Accession | QC % | %ID |
| glycerol-3-phosphate transporter | WP_001010111.1 | 97 | 57 |
| antiporter | WP_001008722.1 | 96 | 32 |
| glucarate transporter | WP_000660709.1 | 90 | 22 |
| quinolone resistance protein NorB | WP_000381270.1 | 21 | 28 |
| multidrug efflux MFS transporter NorA | WP_001041272.1 | 15 | 27 |
| MFS transporter | WP_001154220.1 | 27 | 20 |
| probable glycolipid permease LtaA | Q2FZP8.2 | 27 | 20 |
| conserved hypothetical protein | ABD31816.1 | 7 | 27 |
| hypothetical protein SAOUHSC_00058 | YP_498663.1 | 56 | 23 |
| hypothetical protein SAOUHSC_00078 | YP_498678.1 | 50 | 22 |
| hypothetical protein SAOUHSC_00952 | YP_499505.1 | 41 | 20 |
| hypothetical protein SAOUHSC_02620 | WP_000436818.1 | 4 | 45 |
| QC, Query Cover; %ID, percentage of identity; MFS, major facilitator superfamily | |||
Homologs in S. aureus NCTC 8325 to LacY from E. coli
| Protein | Accession | QC % | %ID |
| multidrug efflux MFS transporter NorA | WP_001041272.1 | 45 | 28 |
| hydroxyethylthiazole kinase | WP_000610051.1 | 17 | 25 |
| siderophore biosynthesis protein SbnD | WP_000610051.1 | 17 | 25 |
| MFS transporter | WP_000610884.1 | 35 | 27 |
| glucarate transporter | WP_000660709.1 | 46 | 29 |
| proline/betaine transporter | WP_000347061.1 | 14 | 27 |
| nitrate transporter NarT | WP_000278558.1 | 17 | 32 |
| nickel ABC transporter permease | WP_000584765.1 | 19 | 34 |
| antibiotic MFS transporter | WP_000675401.1 | 11 | 29 |
| hypothetical protein SAOUHSC_02866 | YP_501322.1 | 12 | 31 |
| hypothetical protein SAOUHSC_02307 | YP_500786.1 | 23 | 23 |
| hypothetical protein SAOUHSC_02309 | WP_001287088.1 | 11 | 31 |
| QC, Query Cover; %ID, percentage of identity; MFS, major facilitator super-family | |||
Homologs in S. aureus NCTC 8325 to EmrD from E. coli
| Protein | Accession | QC % | %ID |
| Bcr/CflA family drug resistance efflux transporter | WP_000999131.1 | 62 | 25 |
| drug:proton antiporter | WP_000038961.1 | 42 | 27 |
| multidrug MFS transporter | WP_001820335.1 | 40 | 26 |
| nitrate transporter NarT | WP_000278558.1 | 52 | 24 |
| multidrug efflux MFS transporter NorA | WP_001041272.1 | 84 | 20 |
| chloramphenicol resistance protein DHA1 | WP_000026194.1 | 41 | 20 |
| quinolone resistance protein NorB | WP_001066546.1 | 34 | 23 |
| conserved hypothetical protein | ABD31642.1 | 62 | 25 |
| hypothetical protein SAOUHSC_02435 | YP_500904.1 | 34 | 22 |
| hypothetical protein SAOUHSC_02752 | YP_501211.1 | 51 | 20 |
| QC, Query Cover; %ID, percentage of identity; MFS, major facilitator super-family | |||
Homologs to hypothetical proteins
| Hypothetical Protein | Top Match |
| SAOUHSC_00058 | quinolone resistance protein NorB |
| SAOUHSC_00078 | siderophore biosynthesis protein SbnD |
| SAOUHSC_00952 | glycolipid permease LtaA |
| SAOUHSC_02307 | putative membrane spanning protein |
| SAOUHSC_02309 | membrane protein |
| SAOUHSC_02435 | macrolide MFS transporter |
| SAOUHSC_02620 | membrane protein |
| SAOUHSC_02752 | chloramphenicol resistance protein DHA1 |
| SAOUHSC_02866 | membrane protein |
| ABD31816.1 | membrane protein |
| ABD31642.1 | Bcr/CflA family drug resistance efflux transporter |
| MFS, major facilitator super-family | |
Physiochemical properties
| Protein | # AA | MW | pI | # neg | # pos | EC | II | AI | GRAVY |
| GlpT | 451 | 50204.2 | 8.69 | 28 | 33 | 115660 | 36.12 | 99.73 | 0.505 |
| LacY | 417 | 46457.1 | 9.02 | 17 | 24 | 54240 | 29.54 | 109.93 | 0.906 |
| EmrD | 375 | 39995.1 | 9.1 | 12 | 18 | 61800 | 37.9 | 121.23 | 0.942 |
| SAOUHSC_00058 | 462 | 48999.6 | 9.49 | 17 | 29 | 59610 | 22.37 | 128.74 | 0.948 |
| SAOUHSC_00078 | 418 | 44835.3 | 9.54 | 13 | 27 | 53570 | 46.13 | 120.65 | 0.786 |
| SAOUHSC_00952 | 402 | 45457.6 | 9.54 | 16 | 26 | 63830 | 36.9 | 128.28 | 0.936 |
| SAOUHSC_02307 | 163 | 19404.9 | 9.62 | 16 | 22 | 32430 | 48.51 | 123.8 | 0.134 |
| SAOUHSC_02309 | 159 | 18972.3 | 9.18 | 13 | 18 | 38850 | 36.78 | 108.43 | 0.14 |
| SAOUHSC_02435 | 397 | 44337.3 | 9.44 | 16 | 31 | 47830 | 24.38 | 127.68 | 0.868 |
| SAOUHSC_02620 | 215 | 24956.7 | 6.1 | 16 | 15 | 34630 | 56.72 | 111.91 | 0.573 |
| SAOUHSC_02752 | 375 | 40245.1 | 9.68 | 12 | 20 | 43430 | 29.41 | 132.51 | 0.918 |
| SAOUHSC_02866 | 822 | 90409.3 | 6.46 | 90 | 86 | 58790 | 31.15 | 107.93 | 0.159 |
| ABD31816.1 | 416 | 47645 | 9.19 | 31 | 43 | 39810 | 26.46 | 107.62 | 0.282 |
| ABD31642.1 | 312 | 34467.5 | 10.05 | 11 | 20 | 34950 | 28.19 | 125.61 | 0.897 |
Transmembrane Regions Identified by SOSUI
| Locus Tag | Number | Base-Pair Position |
| GlpT | 10 | 26 – 43, 102 – 124, 160 – 181, 187 – 208, 251 – 273, 290 – 312, 320 – 342, 350 – 372, 384 – 406, 414 – 435 |
| LacY | 12 | 11 – 33, 44 – 66, 75 – 97, 106 – 128, 144 – 166, 174 – 196, 215 – 237, 260 – 282, 288 – 310, 313 – 335, 346 – 368, 379 – 401 |
| EmrD | 10 | 4 – 26, 32 – 54, 69 – 91, 127 – 149, 153 – 175, 205 – 227, 235 – 256, 263 – 285, 291 – 313, 346 – 368 |
| SAOUHSC_00058 | 13 | 12 – 34, 45 – 67, 88 – 110, 134 – 156, 163 – 184, 198 – 220, 226 – 247, 262 – 284, 301 – 323, 330 – 349, 355 – 373, 403 – 425, 431 – 453 |
| SAOUHSC_00078 | 12 | 11 – 33, 45 – 67, 80 – 102, 104 – 126, 145 – 167, 171 – 192, 224 – 246, 256 – 277, 291 – 313, 317 – 339, 351 – 371, 376 – 398 |
| SAOUHSC_ 00952 | 11 | 15 – 37, 42 – 64, 85 – 107, 113 – 134, 144 – 166, 173 – 195, 218 – 240, 251 – 273, 280 – 302, 307 – 329, 374 – 396 |
| SAOUHSC_02307 | 2 | 17 – 39, 50 – 72 |
| SAOUHSC_02309 | 2 | 20 – 42, 48 – 70 |
| SAOUHSC_02435 | 11 | 5 – 27, 36 – 58, 66 – 88, 92 – 114, 133 – 155, 162 – 184, 212 – 234, 248 – 270, 286 – 308, 333 – 355, 361 – 382 |
| SAOUHSC_02620 | 4 | 38 – 60, 73 – 95, 108 – 130, 143 – 165 |
| SAOUHSC_02752 | 8 | 12 – 34, 53 – 75, 80 – 101, 110 – 132, 139 – 161, 193 – 215, 224 – 246, 260 – 282 |
| SAOUHSC_02866 | 12 | 10 – 32, 170 – 192, 198 – 220, 225 – 246, 267 – 289, 303 – 324, 348 – 370, 510 – 532, 538 – 560, 579 – 601, 624 – 646, 655 – 677 |
| ABD31816.1 | 6 | 75 – 97, 119 – 141, 143 – 165, 169 – 191, 195 – 216, 389 – 411 |
| ABD31642.1 | 9 | 13 – 35, 53 – 74, 80 – 102, 107 – 129, 138 – 160, 167 – 189, 221 – 243, 254 – 276, 285 – 307 |
CDD-BLAST domain data
| Hypothetical | Protein Domains |
| Hypothetical | Protein Domains |
| GlpT | cd06174 |
| PRK11273 | |
| LacY | cd06174 |
| pfam01306 | |
| EmrD | cd06174 |
| PRK11652 | |
| SAOUHSC_00058 | cd06174 |
| pfam07690 | |
| SAOUHSC_00078 | cd06174 |
| PRK09874 | |
| SAOUHSC_00952 | cd06174 |
| pfam07690 | |
| SAOUHSC_02307 | COG3402 |
| SAOUHSC_02309 | cl01348 |
| SAOUHSC_02435 | cd06174 |
| TIGR00900 | |
| SAOUHSC_02620 | COG3152 |
| SAOUHSC_02752 | cd06174 |
| COG2814 | |
| SAOUHSC_02866 | pfam03176 |
| cl21543 | |
| COG2409 | |
| ABD31816.1 | COG1289 |
| ABD31642.1 | cd06174 |
| cd06174 and pfam07690, Major Facilitator Superfamily; PRK11273, glycerol- 3-phosphate transporter; pfam01306, LacY proton/sugar symporter; PRK11652, multidrug resistance protein D; PRK09874, drug efflux system protein MdtG; COG3402, YdbS; cl01348, bPH_2 super family; TIGR00900, H+ Antiporter protein; COG3152, yhaH; COG2814, arabinose efflux permease; pfam03176 and cl21543, MMPL super family; COG2409, YdfJ; COG1289, YccC. | |
Pfam domain data
| Hypothetical | Protein Domains |
| GlpT | MFS_1 |
| LacY | LacY_symp |
| EmrD | MFS_1 |
| SAOUHSC_00058 | MFS_1 |
| SAOUHSC_00078 | MFS_1 |
| sugar_tr | |
| SAOUHSC_00952 | MFS_1 |
| SAOUHSC_02307 | bPH_2 |
| SAOUHSC_02309 | No Pfam-A matches |
| SAOUHSC_02435 | MFS_3 |
| SAOUHSC_02620 | DUF805 |
| SAOUHSC_02752 | MFS_1 |
| SAOUHSC_02866 | MMPL family |
| ABD31816.1 | No Pfam-A matches |
| ABD31642.1 | MFS_1 |
| MFS_1, Major Facilitator Superfamily; LacY_symp, LacY proton/sugar symporter; sugar_tr, sugar and other transporter; bPH_2, bacterial Pleckstrin homology domain; MFS_3, Transmembrane secretion effector; DUF805, Protein of unknown function; MMPL family, putative integral membrane proteins. | |
Figure 2Alignment of GlpT homologs aligned by PROMALS3D. Magenta names are representative sequences colored red to identify predicted alpha-helix secondary structures. The black names belonging to the same alignment group as the magenta name above it, indicating a strong relationship between the two. Consensus_aa, consensus amino acid sequence; Consensus_ss, consensus predicted secondary structures; h, consensus predicted secondary structure alpha-helix.
Figure 3Alignment of EmrD homologs aligned by PROMALS3D. Magenta names are representative sequences colored red to identify predicted alpha-helix secondary structures. The black names belonging to the same alignment group as the magenta name above it, indicating a strong relationship between the two. Consensus_aa, consensus amino acid sequence; Consensus_ss, consensus predicted secondary structures; h, consensus predicted secondary structure alpha-helix.
Figure 3Representative phylogenetic tree of proteins produced via PHYLIP package programs showing six hypothetical proteins belong evolutionarily to the major facilitator superfamily (MFS). SAOUHSC_00058, SAOUHSC_02435, SAOUHSC_02752 and ABD31642.1 are related to drug efflux proteins. SAOUHSC_00078 is closely related to a siderophore biosynthesis protein as SAOUHSC_00952 is confirmed to be a glycolipid permease.