| Literature DB >> 27314000 |
Joanna Lee1, Zara A Sands2, Philip C Biggin1.
Abstract
The Major Facilitator Superfamily (MFS) is one of the largest classes of secondary active transporters and is widely expressed in many domains of life. It is characterized by a common 12-transmembrane helix motif that allows the selective transport of a vast range of diverse substrates across the membrane. MFS transporters play a central role in many physiological processes and are increasingly recognized as potential drug targets. Despite intensive efforts, there are still only a handful of crystal structures and therefore homology modeling is likely to be a necessary process for providing models to interpret experiments for many years to come. However, the diversity of sequences and the multiple conformational states these proteins can exist in makes the process significantly more complicated, especially for sequences for which there is very little sequence identity to known templates. Inspired by the approach adopted many years ago for GPCRs, we have analyzed the large number of MFS sequences now available alongside the current structural information to propose a series of conserved contact points that can provide additional guidance for the homology modeling process. To enable cross-comparison across MFS models we also present a numbering scheme that can be used to provide a point of reference within each of the 12 transmembrane regions.Entities:
Keywords: LacY; alternating access; homology modeling; transmembrane; transport
Year: 2016 PMID: 27314000 PMCID: PMC4889909 DOI: 10.3389/fmolb.2016.00021
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Figure 1Secondary and tertiary structure of MFS proteins. (A) The topology of the 12 TM helices. (B) The MFS fold as viewed from the side of the membrane, and (C) rotated through 90 as exemplified by FucP. The conserved 12 TM α-helix fold is arranged into two 6 TMH bundles (domains 1 and 2) and a cavity exists (gray surface) at the interface between them that is accessible to either the cytoplasm or extracellular region depending on the conformational state.
The sequence identity (%) between 10 MFS proteins for which there is an X-ray crystal structure.
| LacY (1PV6) | ||||||||||
| FucP (3O7P) | 19 | |||||||||
| XylE (4GBY) | 22 | 22 | ||||||||
| GlpT (1PW4) | 21 | 19 | 23 | |||||||
| PepT (2XUT) | 23 | 24 | 19 | 24 | ||||||
| EmrD (3GFP) | 21 | 20 | 21 | |||||||
| YajR (3WDO) | 22 | 21 | 25 | 20 | 23 | 22 | ||||
| PiPT (4J05) | 22 | 21 | 24 | 19 | 22 | 23 | 22 | |||
| NarK (4JR9) | 21 | 22 | 22 | 23 | 23 | 24 | 23 | 21 | ||
| GLUT1 (4PYP) | 21 | 23 | 29 | 21 | 20 | 22 | 20 | 22 | 24 |
The percentage identity was computed from CLUSTAL Omega pairwise alignments and they range from 18 to 25.
The MFS proteins used for determining key residues that are conserved across the superfamily in this work.
| FucP | MFS_1 | Fucose/H+ symport | Into cell | ||
| GlpT | MFS_1 | Glycerol-3P/Pi antiport | Into cell | ||
| EmrD | DHA12 | Multidrug/H+ antiport | Out of cell | ||
| YajR | DHA12 | Multidrug/H+ antiport | Out of cell | ||
| LacY | lacy_symport | Galactose/H+ symport | Into cell | ||
| PepT | POT | Peptide/H+ symport | Into cell | ||
| YgbH | POT | Peptide/H+ symport | Into cell | ||
| GlcPse | Glucose/H+ symport | Into cell | |||
| gkPOT | POT | Peptide/H+ symport | Into cell | ||
| GLUT1 | SP | Glucose uniport | Into cell | ||
| PiPT | SP | Phosphate/H+ symport | Into cell | ||
| NRT1.1 | NRT1/PTR | Nitrite/H+ symport | Into cell | ||
| NarU | NNP | Nitrate/nitrite symport | Into cell | ||
| XylE | SP | Xylose/H+ symport | Into cell | ||
| MelB | Glycoside-pentoside-hexuronide: cation symporter | Melibose/Na+ symport | Into cell |
The cluster containing each protein sequence in UniRef50 was downloaded and aligned using MUSCLE (Edgar, 2004b).
Figure 2The aromatic side chain on TMH 7. (A) PepT (occluded), (B) GLUT1 (outward), and (C) LacY (inward; PDB IDs 2XUT, 2PYP, and 1PV6, respectively). In both proteins, the most conserved aromatic residue points into the central TM cavity between the two domains, but the absolute position with respect to the membrane is variable. For clarity, helices 8 and 10 are omitted from (A) and helices 2 and 11 are omitted from (B).
Figure 3Classification of contact types. Contacts can be classified as static (A) where there is apparent different between conformational states, as pivot points (B) where the contact remains static but the angle of the helices making the interaction changes between conformational state or mobile (C) where the position of the most conserved contact between helices appears to depend on conformational state.
Analysis of the position of the most conserved contacts between helices (summarized by a series of heat maps in .
| STNQ | 1–2 | Keeps periplasmic end of helices in contact regardless of state |
| STNQ | 2–11 | Keeps periplasmic end of helices in contact regardless of state |
| STNQ | 5–8 | In the center of the TM region, and state dependent |
| STNQ | 7–11 | In the center of the TM region, and state dependent |
| Small hydrophobic | 1–5 | Contacts are at a pivot point in the helices, with each conformation having a different tilt to bilayer normal |
| Small hydrophobic | 1–6 | The contact is in the middle of the helix and TMH 1 has different tilts for each state |
| Small hydrophobic | 2–4 | The contact is in the middle of the helices and small changes in tilt |
| Small hydrophobic | 2–11 | The contact is along length of helices in occluded and inward structure but only at cytoplasmic end in outward structure |
| Small hydrophobic | 3–4 | Mediates packing |
| Small hydrophobic | 3–6 | Mediates packing |
| Small hydrophobic | 5–8 | The contacts are in the center of TM region |
| Small hydrophobic | 7–11 | In cytoplasmic half of the helices, outward state TMH 7 bends away from TMH 11 at periplasmic end to occluded and inward |
| Small hydrophobic | 8–10 | The contacts are along the helices—mediates packing |
| Small hydrophobic | 9–10 | The contacts are along the helices—mediates packing |
| Small hydrophobic | 9–12 | The contacts are along the helices—mediates packing, all three states have straight helices |
| Large hydrophobic | 2–4 | The position of the contact is in the center of the TM region, and the helices rock around that central point |
| Large hydrophobic | 3–6 | At the periplasmic end of the TM region. Inward structure has a further contact at cytoplasmic end of the TM region |
| Large hydrophobic | 8–10 | The contacts are in the center of the TM region, with a small difference in TMH 10 tilt between the conformations |
| Glycine | 1–5 | The contact is in the center of the helices, and is a pivot point for the helices which have different tilts in each state |
| Glycine | 2–4 | The contact is in the center of the helices, not much difference in tilt implying a packing motif |
| Glycine | 9–10 | The contacts are along the length of the helices, indicating packing mediators |
The analysis depicts either static contact points (green) that do not change between different conformational states; contacts that move according to conformational state (blue), and contacts that constitute distinct pivot points (purple) around which the rest of the protein moves.
Figure 4Pivot point contact and variable contacts. An example of an equivalent pivot point contact between TMH 3 and TMH 6 for FucP (A) between L98 (helix 3) and V219 (helix 6) and LacY (B) between L84 (helix 3) and A177 (helix 6). An example mobile contact between TMHs 7 and 11 is similarly depicted for FucP between Q267 (helix 7) and T390 (helix 11) (C) and for LacY between Q241 (helix 7) and S366 (helix 11) (D). Helix 2 has been removed for clarity in (D).
Figure 5Contact map for polar residues (S, T, N, Q). The black boxes show the most conserved contacts, whilst the white boxes indicate the position of contacts that describe the degree of closure of the cavity at both the cytoplasmic (TMH 4–10) and extracellular (TMH 1–7) ends. Heat maps analysis for other residue groupings can be found in the Figures S2–S4.
Summary of proposed numbering scheme.
| 1 | Contact with TMH 5 (either polar, glycine or small hydrophobic) |
| 2 | Contact with TMH 4 (most conserved glycine) |
| 3 | Contact with TMH 6 (most conserved small hydrophobic) |
| 4 | Contact with TMH 2 (most conserved glycine) |
| 5 | Contact with TMH 1 (either polar, glycine or small hydrophobic) |
| 6 | Contact with TMH 3 (most conserved small hydrophobic) |
| 7 | Contact with TMH 11 (most conserved polar) |
| 8 | Contact with TMH 5 (most conserved polar or small hydrophobic) |
| 9 | Contact with TMH 10 (most conserved glycine or small hydrophobic if glycine not present in sequence) |
| 10 | Contact with TMH 9 (most conserved glycine) |
| 11 | Contact with TMH 7 (most conserved polar residue) |
| 12 | Contact with TMH 9 (most conserved small hydrophobic) |
In TMHs 4, 5, and 10 these correspond with the conserved glycine motifs present on those helices. In other TMHs, the residues are mostly in the same spatial location on the helix, but sometimes the position changes and thus they are easier to track via their conserved contacts rather than conservation alone.
The numbering in MFS proteins.
| XylE (4JA4) | 17–33 | 57–74 | 87–104 | 126–145 | 166–183 | 201–218 | 281–301 | 316–333 | 343–361 | 371–390 | 410–427 | 443–463 |
| G25 | G71 | A92 | G141 | G174 | A210 | S285 | T329 | G348 | G388 | Q415 | A456 | |
| PepT (2XUT) | 18–39 | 53–75 | 86–105 | 110–129 | 153–172 | 178–195 | 304–324 | 335–352 | 376–393 | 404–424 | 444–464 | 483–500 |
| A22 | G66 | L93 | G124 | G160 | F189 | T309 | A342 | G379 | G418 | S449 | A489 | |
| LacY (1PV6) | 10–35 | 41–64 | 75–96 | 103–128 | 143–163 | 167–187 | 222–248 | 257–278 | 289–309 | 312–332 | 352–373 | 380–399 |
| G13 | F55 | L84 | G111 | G147 | A177 | Q248 | T266 | G296 | G332 | S366 | L390 | |
| FucP (3O7Q) | 28–47 | 64–86 | 90–108 | 119–140 | 154–174 | 210–229 | 261–282 | 301–319 | 326–345 | 348–372 | 383–403 | 412–430 |
| N43 | G73 | L98 | G132 | G165 | V219 | Q267 | V306 | L328 | G372 | T390 | A419 | |
| GLUT1 (4PYP) | 14–36 | 64–91 | 94–112 | 119–140 | 157–176 | 187–206 | 275–295 | 306–326 | 335–354 | 366–387 | 402–426 | 431–449 |
| G27 | G79 | A103 | G134 | G167 | A197 | Q283 | I315 | G340 | G382 | N415 | L441 | |
| EmrD (2GFP) | 11–31 | 43–64 | 73–92 | 97–116 | 134–155 | 157–175 | 208–229 | 237–261 | 267–283 | 289–306 | 326–345 | 357–378 |
| Q21 | T55 | L77 | G109 | G140 | L169 | N210 | S246 | L279 | G295 | T334 | L374 | |
| GlcP (4LDS) | 7–31 | 41–64 | 74–92 | 95–115 | 134–154 | 159–178 | 244–266 | 275–295 | 305–325 | 335–355 | 376–395 | 400–418 |
| G20 | G54 | I83 | G106 | G143 | V168 | N256 | N287 | G310 | G349 | S388 | A408 | |
| GlpT (1PW4) | 32–51 | 65–84 | 93–110 | 121–139 | 159–178 | 190–208 | 255–277 | 292–311 | 322–340 | 349–369 | 386–405 | 416–435 |
| N47 | G77 | L100 | G131 | G168 | A197 | N262 | T306 | G325 | G363 | T388 | L431 | |
| MelB (4M64) | 11–29 | 46–63 | 79–98 | 107–129 | 147–169 | 178–196 | 233–252 | 272–291 | 295–312 | 332–347 | 368–385 | 415–432 |
| G23 | W54 | L91 | G117 | G156 | L186 | N244 | N279 | G301 | G337 | T373 | L420 | |
| NarU (4IU8) | 37–56 | 73–91 | 101–119 | 130–147 | 167–187 | 211–229 | 254–277 | 290–307 | 316–334 | 347–367 | 405–423 | 431–454 |
| L44 | G83 | L110 | G139 | G172 | V218 | S258 | S304 | N322 | G362 | S408 | V447 | |
| NRT1.1 (5A2N) | 38–57 | 70–91 | 100–119 | 147–166 | 193–213 | 218–237 | 342–364 | 381–399 | 421–439 | 462–481 | 500–520 | 542–561 |
| G50 | G88 | A110 | G161 | G200 | L232 | Q358 | S383 | G426 | G475 | S514 | L554 | |
| PiPT (4J05) | 39–56 | 74–92 | 106–123 | 133–151 | 173–195 | 209–227 | 313–333 | 358–377 | 385–404 | 415–436 | 452–469 | 481–499 |
| A46 | G84 | I113 | G143 | G180 | A222 | N333 | N361 | G392 | G429 | N470 | L482 | |
| YajR (3WDO) | 14–33 | 51–69 | 79–98 | 102–119 | 136–155 | 165–183 | 215–234 | 250–268 | 279–298 | 303–325 | 340–361 | 368–386 |
| L26 | G58 | L85 | G112 | G141 | I179 | N218 | V264 | G286 | G308 | S346 | A378 | |
| YgbH (4Q65) | 13–33 | 49–67 | 78–95 | 101–120 | 142–161 | 169–187 | 267–289 | 311–331 | 342–362 | 379–400 | 414–433 | 461–480 |
| G27 | G65 | A83 | G115 | G150 | L182 | Q285 | S315 | G351 | G388 | N427 | V470 | |
TM helices were taken as those predicted by the OPM database (Lomize et al., 2006). The x.0 residue of each helix is summarized in Table 4. The helix is numbered x.0, where x is the helix number and it is prefixed by the conserved residue, for example in TM helix 5, the most conserved residue is a glycine, therefore it is G5.0. Residues from this residue toward the cytoplasm are labeled as negative numbers and residues toward the periplasm are given positive numbers (for example TM helix 5 in XylE is FNQFAIIFQLLVYCVNY giving: …, I5.-2, F5.-1, G5.0, Q5.1, L5.2, …). The proposed numbering start points are indicated in bold for each helix from each protein.
Figure 6The numbering scheme using TMH 5 of XylE as an example. The most conserved site is labeled G5.0 and then increasing negative values are given toward the cytoplasm and increasing positive values toward the extracellular cavity.
Figure 7The position of the numbered residue on each helix. Each line of the heat maps corresponds to the helix in each crystal structure. The helix lengths are defined by those within the region of the bilayer given in the OPM database and helices are different lengths depending on the helix tilt and whether there are kinks. The helices are aligned according to the rotation of the helix such that numbered sites in the same structural position are aligned.