| Literature DB >> 24112406 |
Jhih-Siang Lai1, Cheng-Wei Cheng, Allan Lo, Ting-Yi Sung, Wen-Lian Hsu.
Abstract
BACKGROUND: Since membrane protein structures are challenging to crystallize, computational approaches are essential for elucidating the sequence-to-structure relationships. Structural modeling of membrane proteins requires a multidimensional approach, and one critical geometric parameter is the rotational angle of transmembrane helices. Rotational angles of transmembrane helices are characterized by their folded structures and could be inferred by the hydrophobic moment; however, the folding mechanism of membrane proteins is not yet fully understood. The rotational angle of a transmembrane helix is related to the exposed surface of a transmembrane helix, since lipid exposure gives the degree of accessibility of each residue in lipid environment. To the best of our knowledge, there have been few advances in investigating whether an environment descriptor of lipid exposure could infer a geometric parameter of rotational angle.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24112406 PMCID: PMC3854514 DOI: 10.1186/1471-2105-14-304
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The MAAE of rotational angles determined by various approaches
| NACCESS (observed rASA) | ||
| ES | 72.85° | 68.35° |
| kPROT | 74.26° | 68.84° |
| ProperTM | 76.59° | 70.69° |
| TMLIP1H | 81.02° | 76.35° |
| TMLIP2H | 81.33° | 76.90° |
| TMLIP1C | 73.24° | 67.51° |
| TMLIP2C | 74.23° | 68.27° |
| MO | 71.47° | 69.43° |
| TMexpo (predicted rASA) |
Comparison of mean absolute angular errors (MAAE) of rotational angles obtained by different methods on the development set and the independent test.
Comparison of different methods for classifying exposed/buried residues on the independent test set without interface TM residues
| MCC | 0.35 | 0.44 | 0.32 | 0.29 | |
| Sensitivity | 64.24% | 75.60% | 73.53% | 67.25% | |
| Specificity | 69.15% | 67.99% | 58.03% | 61.80% | |
| Precision | 68.48% | 68.69% | 65.02% | 65.17% | |
| Accuracy | 67.27% | 71.66% | 66.01% | 64.61% |
The boldface indicates the best performance among various methods.
Comparison of different methods for classifying exposed/buried residues on 392 interface TM residues of the independent test set by rASA derived from both subunit structure and complete structure (in parentheses)
| MCC | 0.14 (0.22) | 0.12 (0.25) | 0.13 (0.17) | 0.08 (−0.06) | 0.19 (0.28) |
| Sensitivity | 73.11% (81.25%) | 56.29% (67.23%) | 66.17% (72.78%) | 67.72% (63.22%) | 66.10% (77.97%) |
| Specificity | 48.57% (38.50%) | 64.71% (57.59%) | 55.88% (43.98%) | 45.00% (30.77%) | 64.29% (49.32%) |
| Precision | 93.55% (55.91%) | 94.00% (59.50%) | 93.70% (55.04%) | 90.68% (46.61%) | 93.98% (55.42%) |
| Accuracy | 70.92% (59.44%) | 57.07% (62.23%) | 65.23% (57.95%) | 65.17% (46.63%) | 65.91% (62.12%) |
Comparison of different methods for classifying exposed/buried residues on 3,553 entire TM residues of the independent test set by rASA derived from both subunit structure and complete structure (in parentheses)
| MCC | 0.49 (0.48) | 0.33 (0.34) | 0.41 (0.41) | 0.30 (0.27) | 0.29 (0.29) |
| Sensitivity | 79.79% (81.29%) | 62.85% (64.55%) | 73.70% (73.70%) | 72.43% (72.36%) | 67.07% (68.18%) |
| Specificity | 68.70% (65.69%) | 70.19% (68.93%) | 67.71% (67.71%) | 57.64% (54.64%) | 61.86% (60.42%) |
| Precision | 75.56% (70.45%) | 71.53% (67.40%) | 72.18% (72.18%) | 68.43% (62.57%) | 68.45% (64.06%) |
| Accuracy | 74.78% (73.51%) | 66.20% (66.74%) | 70.90% (70.90%) | 65.91% (63.71%) | 64.74% (64.37%) |
Rotational angle prediction on the protein chains from Harrington and Ben-Tal’s work
| HWRAAGAATVLLVIVLLAGSYLAVLA | 199.55° | 203.38° | 1.46 | VSALLLGLLPAAYLYPG | 229.60° | 234.10° | 1.06 | ||||
| (30.86) | WGRCVAVVVMVAGITSFGLVTAALAT | 240.88° | 298.78° | 57.90° | 1.87 | (27.22) | AVDYSLAAALTLHGHWGL | 8.24° | 354.67° | 13.57° | 0.95 |
| IWLALGTALMGLGTLYFLVKGMG | 318.98° | 355.18° | 36.19° | 3.38 | | GLYVLSAITFTGLCYFNYYDV | 334.89° | 271.30° | 63.59° | 1.49 | |
| (42.28) | KFYAITTLVPAIAFTMYLSMLL | 262.69° | 313.32° | 50.63° | 2.74 | VAVVIGTAFTALVTKFTDSIITPLINRIG | 319.36° | 203.39° | 115.96° | 0.37 | |
| | WARYADWLFTTPLLLLDLALL | 118.84° | 44.89° | 73.95° | 1.60 | (113.71) | TIDLNVLLSAAINFFLIAFAVYFL | 105.75° | 354.30° | 111.46° | 1.06 |
| | GTILALVGADGIMIGTGLVGAL | 357.99° | 332.94° | 25.05° | 2.80 | VWALCFMGSLALLALVCTNRIQ | 285.04° | 280.66° | 2.32 | ||
| | RFVWWAISTAAMLYILYVLFFGF | 154.03° | 169.74° | 15.71° | 2.25 | (11.07) | AGLLGDIGGQMGLFIGASILTVL | 41.15° | 23.39° | 17.76° | 2.22 |
| | FKVLRNVTVVLWSAYPVVWLIG | 133.56° | 164.23° | 30.67° | 2.70 | WVVGMGIVMSLIVLAIVFGNVLVITAIA | 253.60° | 259.99° | 1.71 | ||
| | ETLLFMVLDVSAKVGFGLILLRS | 214.98° | 278.73° | 63.75° | 1.93 | (35.73) | YFITSLACADLVMGLAVVPFGAAHIL | 293.99° | 341.90° | 47.91° | 1.06 |
| LSFLKDFLAGGVAAAISKTAVAPIER | 24.46° | 46.33° | 21.87° | 1.59 | | WCEFWTSIDVLCVTASIETLCVIAV | 279.34° | 285.10° | 0.68 | ||
| (65.78) | NLANVIRYFPTQALNFAFKDKYKQIFL | 207.77° | 114.64° | 93.14° | 1.75 | | RVIILMVWIVSGLTSFLPIQMHWYR | 88.76° | 102.09° | 13.33° | 1.93 |
| | WRYFAGNLASGGAAGATSLCFVYPLDFART | 112.40° | 70.30° | 42.09° | 1.32 | | FTNQAYAIASSIVSFYVPLVIMVFVYS | 158.19° | 84.71° | 73.47° | 1.88 |
| | YQGFNVSVQGIIIYRAAYFGVYDTAKGMLP | 179.94° | 59.44° | 120.50° | 1.40 | | LGIIMGTFTLCWLPFFIVNIVHVIQ | 106.50° | 159.31° | 52.82° | 1.40 |
| | HIIVSWMIAQTVTAVAGLVSYPFDTVRR | 264.34° | 315.20° | 50.85° | 0.98 | | IRKEVYILLNWIGYVNSGFNPLIYC | 271.42° | 321.81° | 50.39° | 1.50 |
| | AWSNVLRGMGGAFVLVLYDEI | 172.16° | 105.90° | 66.25° | 1.87 | AAVTLLGVLLQAYF | 60.17° | 77.84° | 17.67° | 1.38 | |
| VELGVSYAALLSVIVVVVEYTMQL | 268.82° | 191.51° | 77.31° | 0.70 | (10.47) | SEYFPLFLATLWVAG | 96.55° | 80.38° | 16.17° | 0.77 | |
| (62.09) | LVRLYLVDLILVIILWADYAY | 167.76° | 133.29° | 34.46° | 1.71 | | AALCGLVYLFARLR | 191.79° | 197.96° | 2.39 | |
| | KKTLYEIPALVPAGLLALIE | 27.58° | 321.07° | 66.51° | 0.72 | | LYASARALWLLVALAAL | 116.59° | 118.45° | 2.57 | |
| | LVRLLRFLRILLIISRGSKFLSAIA | 233.80° | 303.87° | 70.07° | 0.62 | SLGIFIGICGIIGCGGNGIVIY | 276.78° | 317.73° | 40.94° | 3.01 | |
| RAIFAEFFATLFYVFFGLGAS | 308.42° | 335.32° | 26.90° | 1.44 | (35.30) | FIINLAFSDFTFSLVNGFPLMTI | 206.68° | 252.82° | 46.14° | 1.12 | |
| (33.01) | LQVALAFGLALATLVQAVGHIS | 59.48° | 65.56° | 1.17 | | VYGFIGGIFGFMSIMTMAMISI | 303.09° | 339.23° | 36.14° | 0.80 | |
| | LRAICYVVAQLLGAVAGAAVLYSV | 354.70° | 13.71° | 19.01° | 2.35 | | FIMIIFVWLWSVLWAIGPIF | 99.36° | 72.24° | 27.11° | 2.67 |
| | GQATIVEIFLTLQFVLCIFATY | 61.06° | 71.11° | 10.05° | 1.58 | | NILCMFILGFFGPILIIFFCYF | 270.54° | 293.89° | 23.35° | 2.49 |
| | GSVALAVGFSLTLGHLFGM | 342.15° | 109.29° | 127.14° | 1.21 | | SIVIVSQFLLSWSPYAVVAL | 127.57° | 154.75° | 27.18° | 2.12 |
| | WVYWVGPVIGAGLGSLLYDFLL | 49.77° | 58.65° | 1.58 | | QLPVMFAKASAIHNPMIYSV | 60.63° | 106.85° | 46.22° | 2.01 | |
| VLAMATATIFSGIGSAKGVG | 45.15° | 105.31° | 60.16° | 0.99 | YSINILAMLLVGFGFLMV | 232.00° | 229.10° | 0.63 | |||
| (41.83) | LPGTQGLYGFVIAFLIFI | 285.84° | 259.80° | 26.04° | 1.70 | (33.25) | ATTGTYLVVATGLPLYILL | 193.50° | 227.23° | 33.73° | 0.87 |
| | LGASLPIAFTGLFSGIAQ | 82.57° | 87.25° | 1.39 | | IYAEFAVATGLIAMGAVL | 221.13° | 199.82° | 21.31° | 0.05 | |
| | MVETYAILGFVISFLLVL | 7.10° | 290.64° | 76.46° | 1.13 | | FQYALLALFIVPVYLLNE | 11.39° | 35.81° | 24.42° | 1.15 |
| WQSATGLFLGLFMIGHMFFVST | 285.17° | 308.25° | 23.08° | 1.79 | | GSIAIHAFGAYFGLGVSIA | 208.92° | 309.21° | 100.29° | 0.73 | |
| (51.33) | IVVSFLAAFVFAVFIAHAFLAMR | 55.33° | 17.57° | 37.76° | 2.89 | | FSMLGSMVLWLFWPSFA | 284.14° | 287.41° | 1.16 | |
| | LWWIQAMTGFAMFFLGSVHLYIMMTQP | 188.56° | 222.27° | 33.71° | 1.48 | | VNTLLALCGATLATYFLSAL | 36.54° | 3.47° | 33.07° | 1.57 |
| | WMWPLYLVLLFAVELHGSVGLYRLAV | 192.12° | 322.45° | 130.33° | 1.38 | | VDMANAALAGGVAIGSVC | 138.00° | 55.95° | 82.05° | 0.24 |
| | RANLKKLKTLMSAFLIVLGLLTFGAYV | 185.58° | 153.82° | 31.76° | 3.30 | | VGAFVIGLLGGAISVVGF | 11.05° | 21.65° | 10.60° | 1.90 |
| HRGTGVALSLGVSLFSLAALLLP | 123.28° | 203.88° | 80.60° | 1.69 | | TCGVHNLHGLPGLLGGFSAIL | 112.57° | 156.18° | 43.61° | 0.92 | |
| (45.70) | LIYSAKFALVFPLSYHTWNGIR | 307.08° | 253.07° | 54.01° | 0.39 | | LTGIGITLALALIGGVIAGALIKLT | 103.20° | 92.65° | 10.55° | 2.58 |
| VVVLILTLLSSAAIASE | 74.04° | 71.55° | 2.05 |
The columns are protein chain, sequences of transmembrane helices, observed rotational angles from structures, predicted rotational angles by TMexpo, angular errors, and predicted rASA moment lengths ().
Figure 1Top view of the structure 3L1L shown by helical wheels, interhelical interactions and observed rASA of the TM residues. Each TM residue is shown by a color-coded gradient scale of its observed rASA, with dark red and dark blue representing rASA value 1.0 for extremely buried and 0 for extremely exposed, respectively. The red arrow inside each helical wheel indicates the lipid-facing direction, and the green clock-wise arrow starts at the first amino acid.
Figure 2Observed and predicted rASA and lipid-facing directions of the 12 TMHs in 3L1L represented by helical wheels. The angular error of 12 TMHs ranges from 0.49˚ to 153.48˚; the MAAE of 12 TMHs is 36.72˚.
Comparing Pearson correlation coefficients between contact-enriched set and reference set defined by different thresholds ( )
| 0.50 (4,447) | 0.59 (4,078) | |
| 0.33 (1,981) | 0.64 (6,544) | |
| 0.25 (671) | 0.65 (7,854) |
The first column stands for different thresholds, and the last two columns stand for Pearson correlation coefficients of transmembrane residues from the two different sets.
The list of all protein chains (PDB:Chain) included in the development set and the independent test set
| 1E7P:C | 1LNQ:A | 1YCE:A | 2 F93:A | 2PNO:A | 2WIT:A | 3B8C:A | 3EH4:A | 2XQ2:A | 3MP7:A |
| 1EYS:L | 1ORQ:C | 1YEW:C | 2GFP:A | 2Q67:A | 2WSW:A | 3CHX:A | 3EHB:B | 2XUT:A | 3NYM:A |
| 1EYS:M | 1P7B:A | 1YQ3:C | 2GIF:A | 2Q7R:A | 2YVX:A | 3CHX:B | 3G5U:A | 3KBC:A | 3O0R:B |
| 1FFT:A | 1PV6:A | 1YQ3:D | 2IUB:A | 2QJY:A | 2Z73:A | 3CIR:C | 3GIA:A | 3KCU:A | 3O7P:A |
| 1FFT:B | 1PW4:A | 1ZCD:A | 2JLO:A | 2R6G:F | 2ZD9:A | 3CIR:D | 3H90:A | 3KG2:A | 3OE6:A |
| 1FFT:C | 1QLE:C | 2A65:A | 2NMR:A | 2R6G:G | 2ZJS:Y | 3CN5:A | 3H9V:A | 3KJ6:A | 3ORG:A |
| 1FX8:A | 1S5L:B | 2AXT:A | 2NQ2:A | 2R9R:B | 2ZW3:A | 3D31:C | 3HD6:A | 3KP9:A | 3P4W:A |
| 1GZM:A | 1S5L:C | 2AXT:D | 2NR9:A | 2VL0:A | 2ZXE:A | 3DDL:A | 3IJ4:A | 3L1L:A | 3P5N:A |
| 1JB0:K | 1S5L:Z | 2BL2:A | 2OAR:A | 2VPZ:C | 3A7K:A | 3DHW:A | 3JYC:A | 3 M71:A | 3PJZ:A |
| 1JV6:A | 1XIO:A | 2C3E:A | 2OAU:A | 2WDV:C | 3B4R:A | 3E9J:C | 3 K07:A | 3MK7:A | |
| 1KPK:A | 1Y4Z:C | 2E75:B | 2ONJ:A | 2WDV:D | 3B5D:A | 3EFF:K | 3K3F:A | 3MK7:C | |
| 1KQG:C | 3MKT:A | ||||||||
Figure 3The workflow of TMexpo for predicting rotational angles from sequences.