| Literature DB >> 24900999 |
Maria N Simakova1, Nikolai N Simakov2.
Abstract
Protein functions are specified by its three-dimensional structure, which is usually obtained by X-ray crystallography. Due to difficulty of handling membrane proteins experimentally to date the structure has only been determined for a very limited part of membrane proteins (<4%). Nevertheless, investigation of structure and functions of membrane proteins is important for medicine and pharmacology and, therefore, is of significant interest. Methods of computer modeling based on the data on the primary protein structure or the symbolic amino acid sequence have become an actual alternative to the experimental method of X-ray crystallography for investigating the structure of membrane proteins. Here we presented the results of the study of 35 transmembrane proteins, mainly GPCRs, using the novel method of cascade averaging of hydrophobicity function within the limits of a sliding window. The proposed method allowed revealing 139 transmembrane domains out of 140 (or 99.3%) identified by other methods. Also 236 transmembrane domain boundary positions out of 280 (or 84%) were predicted correctly by the proposed method with deviation from the predictions made by other methods that does not exceed the detection error of this method.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24900999 PMCID: PMC4034515 DOI: 10.1155/2014/921218
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Hydrophobicity scales H (i).
|
| Code | Abbreviation | Name |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | A | Ala | Alanine | 1.8 | 0 | 0 | 0.74 | 0.62 | 1.60 | −0.17 |
| 2 | C | Cys | Cysteine | 2.5 | 1 | 1 | 0.91 | 0.29 | 2.00 | 0.24 |
| 3 | D | Asp | Aspartic acid | −3.5 | 0 | −1 | 0.62 | −0.90 | −9.20 | −1.23 |
| 4 | E | Glu | Glutamic acid | −3.5 | 0 | −1 | 0.62 | −0.74 | −8.20 | −2.02 |
| 5 | F | Phe | Phenylalanine | 2.8 | 1 | 1 | 0.88 | 1.19 | 3.70 | 1.13 |
| 6 | G | Gly | Glycine | −0.4 | 0 | −1 | 0.72 | 0.48 | 1.00 | −0.01 |
| 7 | H | His | Histidine | −3.2 | 0 | 0 | 0.78 | −0.40 | −3.00 | −0.96 |
| 8 | I | Ile | Isoleucine | 4.5 | 1 | 1 | 0.88 | 1.38 | 3.10 | 0.31 |
| 9 | K | Lys | Lysine | −3.9 | 0 | −1 | 0.52 | −1.50 | −8.80 | −0.99 |
| 10 | L | Leu | Leucine | 3.8 | 1 | 1 | 0.85 | 1.06 | 2.80 | 0.56 |
| 11 | M | Met | Methionine | 1.9 | 1 | 1 | 0.85 | 0.64 | 3.40 | 0.23 |
| 12 | N | Asp | Asparagine | −3.5 | 0 | −1 | 0.63 | −0.78 | −4.80 | −1.23 |
| 13 | P | Pro | Proline | −1.6 | 0 | −1 | 0.64 | 0.12 | −0.20 | −0.45 |
| 14 | Q | Gln | Glutamine | −3.5 | 0 | −1 | 0.62 | −0.85 | −4.10 | −0.58 |
| 15 | R | Arg | Arginine | −4.5 | 0 | −1 | 0.64 | −2.53 | −12.3 | −0.81 |
| 16 | S | Ser | Serine | −0.8 | 0 | −1 | 0.66 | −0.18 | 0.60 | −0.13 |
| 17 | T | Thr | Threonine | −0.7 | 0 | −1 | 0.70 | −0.05 | 1.20 | −0.14 |
| 18 | V | Val | Valine | 4.2 | 1 | 1 | 0.86 | 1.08 | 2.60 | −0.07 |
| 19 | W | Trp | Tryptophan | −0.9 | 1 | 1 | 0.85 | 0.81 | 1.90 | 1.85 |
| 20 | Y | Tyr | Tyrosine | −1.3 | 0 | 0 | 0.76 | 0.26 | −0.70 | 0.94 |
Comparison of TMD boundaries calculated upon processing of hydrophobicity functions f (k) at n = 3, 4, 5 on H (i) (N = 3 and 5) scales for GPCRs with known data from [21].
| Protein name, code, length | Data source | Number and boundaries of transmembrane domains | ||||||
|---|---|---|---|---|---|---|---|---|
| Scale level | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| GLR_ | [ | 137–161 | 174–198 | 226–249 | 264–285 | 304–326 | 351–369 | 382–402 |
|
| 143–166 | 180–192 | 218–257 | 261–288 | 303–327 | 353–368 | 384–401 | |
|
| ||||||||
| CRFR1_ | [ | 112–142 | 179–203 | 219–247 | 255–282 | 299–324 | 336–360 | 368–397 |
|
| 116–146 | 178–204 | 217–247 | 255–280 | 302–325 | 344–362 | 370–397 | |
|
| ||||||||
| ADRB1_ | [ | 39–67 | 77–103 | 116–137 | 156–179 | 206–231 | 286–315 | 321–343 |
|
| 44–64 | 81–99 | 108–138 | 160–181 | 214–229 | 293–314 | 320–331 | |
|
| ||||||||
|
5HT1B_ | [ | 50–75 | 85–110 | 124–145 | 166–187 | 206–228 | 316–336 | 350–371 |
|
| 46–72 | 86–109 | 119–145 | 168–185 | 205–230 | 316–340 | 343–369 | |
|
| 45–73 | 85–110 | 118–145 | 168–185 | 205–229 | 316–340 | 344–370 | |
|
| ||||||||
| 5HT2B_ | [ | 57–79 | 91–113 | 130–151 | 172–192 | 217–239 | 325–345 | 361–382 |
|
| 54–81 | 89– | −151 | 173–194 | 215–243 | 325–352 | 356–381 | |
Figure 1Hydrophobicity functions f (k) for the protein P47871 in Table 2 after averaging at n = 2 and n = 4 on the scale H 5(i) in Table 1; dotted line shows the level u = const = 0.266.
Figure 2Hydrophobicity functions f (k) for the protein P34998 in Table 2 after averaging at n = 2 and n = 5 on the scale H 3(i) in Table 1; dotted line shows the level u = const = 〈f(k)〉 = −0.052.
Comparison of TMD boundaries calculated upon processing of hydrophobicity functions f (k) at n = 4, 5 on H (i) (N = 3, 5, 6) scales for GPCRs with known data from [21].
| Protein name, code, length | Data source | Number and boundaries of transmembrane domains | ||||||
|---|---|---|---|---|---|---|---|---|
| Scale level | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| S1PR1_ | [ | 47–71 | 79–107 | 122–140 | 160–185 | 202–222 | 256–277 | 294–314 |
|
| 48–69 | 83–107 | 122–142 | 160–195 | 199–223 | 255–281 | 293–310 | |
|
| ||||||||
| ACM2_ | [ | 23–45 | 60–80 | 98–119 | 140–162 | 185–207 | 389–409 | 424–443 |
|
| 21–48 | 60–85 | 90–122 | 142–167 | 192–208 | 389–415 | 422–429 | |
|
| ||||||||
| ACM3_ | [ | 67–90 | 104–124 | 142–163 | 184–206 | 229–251 | 492–512 | 527–546 |
|
| 62–92 | 105–128 | 137–161 | 187–208 | 221–249 | 492–515 | 526–541 | |
|
| ||||||||
| CXCR1_ | [ | 40–66 | 76–96 | 112–133 | 155–174 | 200–220 | 243–264 | 286–308 |
|
| 39–67 | 76–96 | 102–141 | 152–175 | 199–230 | 241–267 | 291–308 | |
|
| ||||||||
| CCR5_ | [ | 31–58 | 69–89 | 103–124 | 142–166 | 199–218 | 236–260 | 278–301 |
|
| 33–56 | 68–93 | 100–136 | 141–164 | 196–218 | 238–264 | 288–299 | |
|
| ||||||||
| HRH1_ | [ | 30–49 | 64–83 | 102–123 | 146–165 | 190–210 | 419–438 | 451–470 |
|
| 25–50 | 63–93 | 96–122 | 147–167 | 188–212 | 418–442 | 449–469 | |
|
| ||||||||
| OPRK_ | [ | 59–85 | 96–117 | 133–154 | 174–196 | 223–247 | 276–299 | 312–333 |
|
| 56–83 | 99–122 | 143–151 | 180–195 | 227–248 | 277–300 | 302–320 | |
|
| ||||||||
| OPRM_ | [ | 65–94 | 104–121 | 144–163 | 194–209 | 235–257 | 281–303 | 312–328 |
|
| 68–95 | 105–114 | 136–162 | 187–205 | 229–262 | 280–306 | 317–325 | |
|
| ||||||||
| OPRD_ | [ | 46–75 | 85–102 | 125–144 | 175–190 | 216–238 | 262–284 | 294–310 |
|
| 44–74 | 85–102 | 112–142 | 167–187 | 211–236 | 263–286 | 296–319 | |
|
| ||||||||
| OPRX_ | [ | 51–77 | 88–109 | 125–146 | 166–188 | 212–236 | 265–288 | 301–322 |
|
| 42–79 | 90–107 | 112–130 | 172–186 | 212–241 | 263–284 | 301–335 | |
|
| ||||||||
| NTR1_ | [ | 65–87 | 97–121 | 144–165 | 189–210 | 236–260 | 309–330 | 349–372 |
|
| 63–86 | 103–139 | 154–172 | 191–208 | 220–268 | 306–324 | 338–374 | |
|
| ||||||||
| PAR1_ | [ | 103–128 | 138–157 | 177–198 | 219–239 | 269–288 | 312–334 | 351–374 |
|
| 101–133 | 136–158 | 175–208 | 221–238 | 270–296 | 313–338 | 350–371 | |
|
| ||||||||
| O51E1_ | [ | 28–48 | 57–77 | 102–122 | 142–162 | 199–219 | 239–259 | 275–295 |
|
| 12–49 | 60–77 | 80–120 | 146–166 | 198–227 | 243–260 | 276–292 | |
|
| ||||||||
| SMO_ | [ | 234–254 | 263–283 | 315–335 | 359–379 | 403–423 | 452–472 | 525–545 |
|
| 236–251 | 264–283 | 313–340 | 362–380 | 403–425 | 451–473 | 519–545 | |
|
| ||||||||
| GP160_ | [ | 24–44 | 59–79 | 94–114 | 137–157 | 178–198 | 245–265 | 269–289 |
|
| 26–40 | 59–81 | 97–118 | 139–157 | 182–202 | 244–271 | 274–292 | |
|
| ||||||||
| HRH3_ | [ | 40–60 | 71–91 | 109–129 | 157–177 | 197–217 | 360–380 | 396–416 |
|
| 33–61 | 72–95 | 105–132 | 155–173 | 191–222 | 360–388 | 395–416 | |
|
| ||||||||
| HRH4_ | [ | 20–40 | 53–73 | 88–108 | 132–152 | 173–193 | 305–325 | 342–362 |
|
| 16–41 | 55–79 | 83–107 | 130–153 | 169–198 | 305–331 | 341–357 | |
|
| ||||||||
| RAI3_ | [ | 34–54 | 69–89 | 98–118 | 130–150 | 177–197 | 213–233 | 248–268 |
|
| 26–53 | 68–92 | 96–118 | 130–155 | 178–202 | 213–233 | 246–265 | |
|
| ||||||||
| VN1R1_ | [ | 57–77 | 85–105 | 133–153 | 170–190 | 227–247 | 275–295 | 304–324 |
|
| 53–77 | 90–103 | 122–145 | 165–188 | 222–245 | 274–301 | 306–338 | |
|
| ||||||||
| APJ_ | [ | 27–51 | 67–91 | 101–125 | 145–166 | 201–221 | 245–271 | 285–308 |
|
| 30–52 | 67–85 | 98–135 | 147–167 | 208–228 | 246– | −312 | |
Prediction of TMD boundaries calculated upon processing of hydrophobicity functions f (k) at n = 4, 5 on H (i) (N = 3, 4, 5) scales for GPCRs.
| Protein name, code, length | Scale level | Number and boundaries of hydrophobic regions, including TMDs | ||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
| A4D1U0_ |
| 7–28 | 45–70 | 82–102 | 127–147 | 173–194 | 222–240 | 253–274 |
|
| 7–29 | 46–71 | 77–103 | 124–144 | 179–194 | 222–237 | 258–275 | |
|
| ||||||||
| A5Z1T7_ |
| 7–27 | 43–57 | 75–100 | 121–146 | 185–210 | 225–240 | 263–274 |
|
| 7–26 | 41–64 | 75–97 | 123–144 | 185–209 | 225–238 | 264–274 | |
|
| ||||||||
| B5B0C2_ |
| 14–40 | 49–72 | 85–122 | 132–155 | 189–201 | 227–255 | 275–293 |
|
| 14–39 | 51–71 | 89– | −154 | 193–205 | 226–256 | 277–292 | |
|
| ||||||||
| M9TID6_ |
| 43–57 | 69–88 | 97–123 | 149–161 | 188–219 | 232–262 | 265–288 |
|
| 33–56 | 66–87 | 100–123 | 148–164 | 186–217 | 233– | −295 | |
|
| ||||||||
| Q76L88_ |
| 11–40 | 54–78 | 93–116 | 156–178 | 196–223 | 251–270 | |
|
| 13–37 | 55–78 | 90–117 | 153–174 | 198–225 | 248–282 | ||
Prediction of hydrophobic regions and TMDs calculated upon processing of hydrophobicity functions f (k) at n = 4, 5 on H (i) (N = 3, 4, 5) scales for α-helical membrane proteins.
| Protein name, code, length | Data source | Number and boundaries of hydrophobic regions, including TMDs | ||||||
|---|---|---|---|---|---|---|---|---|
| Scale level | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
|
SP2Q_ | [ | 22–42 | ||||||
|
| 20–47 | 70–94 | 107–124 | 130–175 | 207–229 | |||
|
| 16–48 | 70–94 | 109–121 | 132–174 | 197–225 | |||
|
| ||||||||
|
SP3AH_ | [ | 7–26 | ||||||
|
| 3–31 | 92–106 | 146–179 | 193–211 | ||||
|
| 3–30 | 95–113 | 146–179 | 193–211 | ||||
|
| ||||||||
| Q8TMG0_METAC |
| 7–20 | 49–67 | 76–93 | 130–162 | |||
|
| 0–22 | 45–62 | 77–91 | 127–163 | ||||
|
| ||||||||
|
HLYE_ | [ | 183–203 | ||||||
|
| 0–17 | 24–38 | 82–103 | 114–123 | 180–209 | 242–247 | 264–280 | |
|
| 5–26 | 32–40 | 81–102 | 115–123 | 179–208 | 242–253 | 267–275 | |
Prediction of TMDs calculated upon processing of hydrophobicity functions f (k) at n = 5 on the scale H 5(i) for the long α-helical membrane protein.
| Protein name, code, length | Data source | Number and boundaries of transmembrane domains | |||||
|---|---|---|---|---|---|---|---|
| Scale | 1 | 2 | 3 | 4 | 5 | 6 | |
| CAC1A_ | [ | 99–117 | 136–155 | 168–185 | 191–209 |
| 336–360 |
|
| 101–116 | 141–158 | 172–185 |
| 302–317 | 336–358 | |
| Scale | 7 | 8 | 9 | 10 | 11 | 12 | |
| [ | 488–506 | 522–541 | 550–568 | 579–597 |
| 690–714 | |
|
| 491–507 | 518–537 | 554–577 |
| 654–665 | 685–714 | |
| Scale | 13 | 14 | 15 | 16 | 17 | 18 | |
| [ | 1254–1272 | 1289–1308 | 1321–1339 | 1351–1369 |
| 1496–1520 | |
|
| 1255–1270 | 1293–1312 | 1323–1339 |
| 1456–1467 | 1497–1522 | |
| Scale | 19 | 20 | 21 | 22 | 23 | 24 | |
| [ | 1576–1604 | 1610–1629 | 1638–1656 | 1666–1684 |
| 1796–1820 | |
|
| 1575–1599 | 1607–1633 | 1641–1660 |
| — | 1794–1820 | |