| Literature DB >> 30443749 |
Elrashdy M Redwan1, Abdullah A AlJaddawi2, Vladimir N Uversky3,4,5.
Abstract
Infection by the Alkhurma virus (ALKV) leading to the Alkhurma hemorrhagic fever is a common thread in Saudi Arabia, with no efficient treatment or prevention available as of yet. Although the rational drug design traditionally uses information on known 3D structures of viral proteins, intrinsically disordered proteins (i.e., functional proteins that do not possess unique 3D structures), with their multitude of disorder-dependent functions, are crucial for the biology of viruses. Here, viruses utilize disordered regions in their invasion of the host organisms and in hijacking and repurposing of different host systems. Furthermore, the ability of viruses to efficiently adjust and accommodate to their hostile habitats is also intrinsic disorder-dependent. However, little is currently known on the level of penetrance and functional utilization of intrinsic disorder in the ALKV proteome. To fill this gap, we used here multiple computational tools to evaluate the abundance of intrinsic disorder in the ALKV genome polyprotein. We also analyzed the peculiarities of intrinsic disorder predisposition of the individual viral proteins, as well as human proteins known to be engaged in interaction with the ALKV proteins. Special attention was paid to finding a correlation between protein functionality and structural disorder. To the best of our knowledge, this work represents the first systematic study of the intrinsic disorder status of ALKV proteome and interactome.Entities:
Keywords: Alkhurma virus; Interactome; Intrinsically disordered protein; Partially folded conformation; Protein folding; Protein function; Protein structure; Protein–protein interactions; Proteome
Mesh:
Substances:
Year: 2018 PMID: 30443749 PMCID: PMC7079808 DOI: 10.1007/s00018-018-2968-8
Source DB: PubMed Journal: Cell Mol Life Sci ISSN: 1420-682X Impact factor: 9.261
Fig. 1Compositional profiling of the ALKV polyprotein and mature viral proteins. a Results for the order-promoting residues, and b represents data for the disorder-promoting residues. This analysis was conducted by the web-based computation tool, fractional compositional difference was computed using Composition Profiler (http://profiler.cs.ucr.edu) [51]. Since this tool generates fractional compositional difference calculated as (C − Corder)/Corder, where C is the content of a given amino acid in a query protein (set), and Corder is the corresponding value for the set of ordered proteins from PDB Select 25 (which is a subset of all proteins in PDB that share below 25% sequence identity) [84], the magnitudes of the bars indicate the fractional difference between the amino acid compositions, where positive (negative) value indicates enrichment (depletion) of the give amino acid type. The error bars indicate the standard deviations of the fractional differences based on the bootstrapping performed by the composition profiler. The proteins are color coded, and the corresponding keys are given in figure legends
Fig. 2Characterizing the intrinsic disorder predisposition of the ALKV genomic polyprotein (UniProt ID: Q91B85). a Disorder profile generated by a set of commonly used disorder predictors, such as PONDR® VLXT (black curve), PONDR® VL3 (red curve), PONDR® VSL2 (green curve), PONDR® FIT (pink curve), IUPred_short (yellow curve), and IUPred_long (blue curve). Light pink shadow around the PONDR® FIT curve represents error distribution for this predictor. b Mean disorder profile calculated by averaging of all predictor-specific per-residue disorder profiles. Light blue shadow around the disorder curve show distribution of standard deviations. Positions of individual viral proteins within the ALKV polyprotein are indicated by differently colored bars, and the corresponding keys are given in figure legend. In both plots, light gray bars show locations of cleavage sites. In these analyses, the predicted intrinsic disorder scores above 0.5 are considered to correspond to the disordered residues/regions, whereas residues and regions with the disorder scores between 0.2 and 0.5 are considered as flexible
Evaluating intrinsic disorder predisposition of proteins from the ALK proteome and interactome
| Protein | Function | Length | Mean disorder score | Mean PPID (%) | MoRFs | Number of ELMs | Number of ELM instances | NAPID |
|---|---|---|---|---|---|---|---|---|
| Proteome of the Alkhurma virus (ALKV) | ||||||||
| Q91B85, genome polyprotein | Genome polyprotein is a product of the ALKV genome translation that includes all ALKV proteins and is cleaved by viral and host peptidases to generate mature ALKV proteins | 3416 | 0.243 ± 0.153 | 5.59 | 2–27, 606–609, 637–640, 1669–1674, 1685–1693, 1699–1702 | 121 | 1286 | 37 |
| Capsid protein C | One of structural proteins that complexes with viral mRNA to form a nucleocapsid has multiple other functions | 117 | 0.472 ± 0.154 | 39.83 | 2–27 | 28 | 63 | |
| Protein prM | Pre-membrane protein (prM) is an intracellular precursor of membrane protein M that is a component of the mature virion. PrM plays a role in immune evasion | 164 | 0.301 ± 0.137 | 19.51 | Not detected | 39 | 81 | |
| Envelope protein E | Major envelope glycoprotein is responsible for virus entry and serves as a major target of neutralizing antibodies | 496 | 0.235 ± 0.104 | 1.81 | 325–328, 356–359, 369–371 | 51 | 169 | |
| Non-structural protein 1 (NS1) | NS1 plays a role in immune evasion, pathogenesis, and viral replication, being important for the formation of the replication complex | 353 | 0.287 ± 0.132 | 8.78 | Not detected | 58 | 142 | |
| Non-structural protein 2A (NS2A) | Component of the viral RNA replication complex | 230 | 0.149 ± 0.158 | 6.96 | Not detected | 43 | 96 | |
| Serine protease subunit NS2B | NS2B is a crucial cofactor for the serine protease function of NS3 | 131 | 0.222 ± 0.145 | 4.58 | Not detected | 25 | 40 | |
| Serine protease NS3 | NS3 is a mutidomain protein with three enzymatic activities, serine protease (which is required for the autocleavage of the ALKV polyprotein at dibasic sites: C-prM, NS2A-NS2B, NS2B-NS3, NS3-NS4A, NS4A-2K and NS4B-NS5), NTPase (which is required for the 5′-RNA cap formation) and RNA helicase (which binds and unwinds dsRNA) | 621 | 0.333 ± 0.161 | 16.59 | 130–132, 178–183, 194–202, 208–211, 319–321, 474–477 | 71 | 207 | |
| Non-structural protein 4A (NS4A) | NS4A serves as regulator of the ATPase activity of the NS3 helicase | 126 | 0.187 ± 0.161 | 4.76 | Not detected | 26 | 43 | |
| Non-structural protein 4B (NS4B) with peptide 2k | NS4B triggers the formation of ER-derived membrane vesicles where the viral replication takes place | 275 | 0.189 ± 0.117 | 2.18 | Not detected | 59 | 118 | |
| RNA-directed RNA polymerase NS5 | NS5 is a RNA-directed RNA polymerase that replicates the viral RNA genome, and performs the capping of viral genomes | 903 | 0.236 ± 0.122 | 2.66 | Not detected | 72 | 314 | |
| Human proteins interacting with proteins of the Alkhurma virus (ALKV) | ||||||||
| Q92804, TATA-binding protein-associated factor 2N (TAF15) | TAF15 is an RNA and ssDNA-binding protein involved in transcription initiation at specific promoters | 592 | 0.714 ± 0.154 | 89.19 | 1–22, 32–86, 96–222, 233–243, 248–259, 295–303, 313–328, 343–354, 390–401, 585–592 | 22 | 105 | 95 |
| Q9BV73, Centrosome-associated protein CEP250 (CEP250) | CEP250 is needed for the centriole–centriole cohesion during interphase of the cell cycle | 2442 | 0.637 ± 0.134 | 84.64 | 1–13, 317–332, 527–532, 656–683, 705–711, 829–834, 848–852, 873–893, 913–920, 983–1008, 1201–1218, 1309–1330, 1344–1350, 1608–1623, 171–1729, 1839–1864, 1944–1948, 1958–1990, 1996–2008, 2119–2144, 2220–2270, 2294–2298, 2310–2330, 2383–2442 | 56 | 230 | 244 |
| Q5QJE6, Deoxynucleotidyl transferase terminal-interacting protein 2 (DNTTIP2) | DNTTIP2 acts as a chromatin remodeling protein and regulates transcriptional activities of DNA nucleotidylexotransferase (DNTT) and ethylene-responsive transcription factor (ESR1). | 756 | 0.647 ± 0.172 | 78.04 | 1–43, 49–67, 71–76, 92–121, 138–159, 170–234, 243–286, 301–308, 318–361, 371–373, 403–414, 419–427, 438–450, 461–468, 486–492, 498–525, 529–548, 620–630, 644–646, 661–664, 747–756 | 68 | 379 | 34 |
| Q8IXK0, Polyhomeotic-like protein 2 (PHC2) | PCH2 serves as an important component of a Polycomb group (PcG) multiprotein PRC1-like complex that maintains the transcriptionally repressive state of many genes, acting via chromatin remodeling and histone modifications | 858 | 0.677 ± 0.221 | 77.51 | 1–12, 42–50, 58–63, 203–352, 362–553, 562–565, 589–590, 691–735, 739–758, 766–776, 780–785 | 62 | 313 | 107 |
| Q02833, Ras association domain-containing protein 7 (RASSF7) | RASSF7 is a promoter of the MAP2K7 phosphorylation that negatively regulates stress-induced JNK activation and apoptosis | 373 | 0.631 ± 0.229 | 77.48 | 131–157, 162–163, 171–179, 192–248, 259–262, 266–267, 271–284, 291–372 | 31 | 61 | 37 |
| Q9BV36, Protein melanophilin (MLPH) | MLPH links melanosome-bound RAB27A and the motor protein MYO5A, and is involved in melanosome transport | 600 | 0.626 ± 0.247 | 76.00 | 149–154, 167–186, 190–296, 304–312, 318–329, 366–525, 535–544, 551–600 | 63 | 222 | 13 |
| Q9NWQ4, G patch domain-containing protein 2-like (GPATCH2L or C14orf118) | No functional information is currently available for GPATCH2L or C14orf118 | 482 | 0.586 ± 0.158 | 70.75 | 1–13, 22–30, 87–112, 134–135, 247–249, 304–306, 354–356, 415–441, 445–459, 464–475 | 51 | 249 | 42 |
| Q8IVT2, Mitotic interactor and substrate of PLK1 (MISP) | MISP is involved in mitotic spindle orientation and mitotic progression | 679 | 0.611 ± 0.170 | 70.69 | 18–23, 40–112, 121–259, 265–288, 299–328, 333–339, 347–357, 367–480, 490–493, 541–543, 567–579, 591–599, 618–626, 635–641, 649–655, 667–679 | 58 | 298 | 60 |
| Q5VU43, Myomegalin (PDE4DIP) | Myomegalin is a centrosome and cis-Golgi localized protein that is required for the microtubule (MT) growth from the centrosome and Golgi apparatus (GA) | 2346 | 0.567 ± 0.151 | 70.38 | 122–130, 199–223, 235–246, 299–326, 437–442, 663–669, 679–795, 888–892, 936–1018, 1064–1074, 1153–1213, 1299–1303, 1308–1311, 1399–1417, 1560–1568, 1584–1614, 1624–1718, 1742–1769, 1779–1800, 1825–1847, 1886–1890, 1917–1921, 1932–1935, 2083–2170 | 79 | 567 | 171 |
| Q8N5G2, Macoilin (MACO1) | Macoilin has a multitude of neural functions and is related to various processes, such as locomotion and chemotaxis | 664 | 0.466 ± 0.260 | 63.86 | 217–220, 241–253, 266–310, 323–337, 363–399, 414–416, 432–447, 555–561, 644–646 | 57 | 224 | 27 |
| Q99750, MyoD family inhibitor (MDFI) | MDFI serves as an inhibitor of the transactivation activity of the Myod family of myogenic factors and represses myogenesis | 246 | 0.635 ± 0.251 | 62.20 | 1–85, 96–134 | 38 | 86 | 269 |
| Q96EA4, Protein Spindly (SPDL1 or Coiled-coil domain-containing protein 99 (CCDC99)) | Spindly targets dynein/dynactin to kinetochores in mitosis and can activate its motility | 605 | 0.544 ± 0.111 | 61.98 | 166–169, 478–479, 518–523, 543–547, 555–564, 590–595 | 46 | 118 | 37 |
| Q9H814, Phosphorylated adapter RNA export protein (PHAX or RNA U small nuclear RNA export adapter protein (RNUXA)) | PHAX is involved in the XPO1-mediated U snRNA export from the nucleus | 394 | 0.572 ± 0.188 | 60.91 | 1–40, 95–106, 110–119, 131–141, 154–160, 170–185, 197–210, 214–218, 341–343, 354–383 | 45 | 137 | 53 |
| Q96CN5, Leucine-rich repeat-containing protein 45 (LRRC45) | LRRC45 is involved in the formation of the fiber-like linker between two centrioles, required for centrosome cohesion | 670 | 0.502 ± 0.210 | 60.45 | 243–248, 262–264, 277–280, 357–360, 396–411, 433–450, 611–615, 668–670 | 26 | 50 | 12 |
| Q9H7J1, Protein phosphatase 1 regulatory subunit 3E (PPP1R3E) | PPP1R3E is a glycogen-targeting subunit for protein phosphatase 1 (PP1) | 279 | 0.569 ± 0.219 | 58.42 | 1–16, 22–23, 28–35, 43–89, 112–117 | 43 | 98 | 2 |
| Q68CP9, AT-rich interactive domain-containing protein 2 (ARID2) | ARID2 is involved in transcriptional activation and repression of select genes by chromatin remodeling | 1835 | 0.518 ± 0.265 | 57.44 | 1–8, 13–15, 623–670, 673–690, 729–734, 756–781, 807–809, 891–899, 940–951, 954–1024, 1045–1096, 1109–1111, 1132–1134, 1196–1201, 1210–1219, 1239–1245, 1250–1269, 1285–1331, 1341–1475, 1478–1505, 1520–1542, 1550–1604, 1614–1628, 1695–1704, 1727–1733 | 78 | 585 | 40 |
| P35579, Myosin-9 (MYH9) | Myosin-9 is a non-muscle cellular myosin that play a role in cytokinesis, cell shape, and specialized functions such as secretion and capping | 1960 | 0.487 ± 0.251 | 55.71 | 1020–1033, 1059–1064, 1089–1093, 1101–1104, 1116–1120, 1140–1159, 1172–1199, 1233–1237, 1302–1305, 1354–1378, 1477–1523, 1536–1549, 1555–1573, 1603–1629, 1637–1691, 1705–1718, 1746–1751, 1763–1772, 1847–1851, 1864–1875, 1895–1960 | 69 | 472 | 387 |
| Q7Z5R6, Amyloid beta A4 precursor protein-binding family B member 1-interacting protein (APBB1IP) | APBB1IP is required for the signal transduction from Ras activation to actin cytoskeletal remodeling | 666 | 0.553 ± 0.311 | 55.56 | 1–11, 22–47, 55–72, 92–98, 120–158, 445–451, 466–601, 608–666 | 52 | 198 | 22 |
| P08670, Vimentin (VIM) | Vimentin is a class-III intermediate filament found in different non-epithelial cells, e.g. mesenchymal cells. It is attached to the nucleus, ER, and mitochondria | 466 | 0.525 ± 0.162 | 54.29 | 2–19, 40–43, 344–347, 460–466 | 37 | 97 | 333 |
| Q14160, Protein scribble homolog (SCRIB) | SCRIB is an important scaffold protein regulating epithelial and neuronal morphogenesis via involvement in different aspects of the differentiation of polarized cells | 1630 | 0.558 ± 0.265 | 51.53 | 397–402, 413–440, 444–608, 610–711, 715–731, 754–763, 798–800, 822–832, 840–863, 929–932, 946–981, 1131–1138, 1225–1265, 1272–1389, 1397–1630 | 72 | 406 | 133 |
| O00327, Aryl hydrocarbon receptor nuclear translocator-like protein 1 (ARNTL) | ARNTL is a transcriptional activator that forms a core component of the circadian clock | 626 | 0.480 ± 0.282 | 50.63 | 1–34, 45–84, 453–606, 615–625 | 47 | 146 | 70 |
| O15066, Kinesin-like protein KIF3B (KIF3B) | KIF3B is involved in tethering the chromosomes to the spindle pole and in chromosome movement and serves as a microtubule-based anterograde translocator for membranous organelles | 747 | 0.516 ± 0.215 | 50.47 | 334–335, 351–354, 356–358, 425–442, 460–470, 474–481, 631–633, 641–650, 668–669, 673–709, 729–747 | 37 | 96 | 37 |
| P31942, Heterogeneous nuclear ribonucleoprotein H3 (HNRNPH3) | HNRNPH3 is an RNA-binding protein forming complex with heterogeneous nuclear RNA (hnRNA), plays a role in the splicing process, and is involved in early heat shock-induced splicing arrest | 346 | 0.489 ± 0.195 | 50.00 | 100–103, 107–113, 262–266 | 42 | 79 | 128 |
| O94983, Calmodulin-binding transcription activator 2 (CAMTA2) | CAMTA2 is a transcription activator that may act as tumor suppressor | 1202 | 0.503 ± 0.279 | 48.59 | 253–495, 512–521, 660–675, 818–935, 957–966. 999–1000, 1138–1164, 1186–1189, 1197–1200 | 73 | 427 | 12 |
| Q9Y6X2, E3 SUMO-protein ligase PIAS3 (PIAS3) | PIAS3 is an E3-type SUMO (small ubiquitin-like modifier) ligase | 628 | 0.435 ± 0.224 | 40.13 | 76–83, 88–95, 99–101, 107–124, 429–434, 444–464, 481–497, 591–616 | 66 | 232 | 82 |
| O14530, Thioredoxin domain-containing protein 9 (TXNDC9) | TXNDC9 negatively affects protein folding, including folding of actin or tubulin via decreasing the ATPase activity of chaperonin TCP1 complex | 226 | 0.391 ± 0.233 | 34.51 | Not detected | 19 | 26 | 79 |
| Q9NS15, Latent-transforming growth factor beta-binding protein 3 (LTBP3) | LTBP3 is related to the assembly, secretion, and targeting of the transforming growth factor beta-1 (TGFB1) to sites of its storage and/or activation | 1303 | 0.412 ± 0.225 | 30.24 | 158–269, 474–571, 1197–1209 | 56 | 217 | 15 |
| Q13105, Zinc finger and BTB domain-containing protein 17 (ZBTB17) | ZBTB17 is a transcription factor that, depending on its binding partners, may act as an activator or repressor and plays an important role at early stages of lymphocyte development | 803 | 0.433 ± 0.282 | 29.51 | 122–131, 139–297, 322–326, 774–803 | 39 | 122 | 50 |
| P52742, Zinc finger protein 135 (ZNF135) | ZNF135 may regulate transcription and control cell morphology and cytoskeletal organization | 658 | 0.426 ± 0.165 | 29.03 | 1–3, 169–178, 230–233, 258–261, 286–288, 398–400, 426–429, 482–485, 510–512, 538–541, 566–569, 622–628 | 37 | 72 | 7 |
| P55268, Laminin subunit beta-2 (LAMB2) | LAMB2 can bind to cells via a high affinity receptor and interact with other extracellular matrix components, thereby mediating the attachment, migration and organization of cells into tissues during embryonic development | 1798 | 0.351 ± 0.177 | 21.25 | 1–3, 1337–1397, 1452–1466, 1492–1498, 1527–1530, 1640–1642, 1651–1655 | 51 | 187 | 39 |
| P55287, Cadherin-11 (CDH11) | Cadherin-11 is a calcium-dependent cell adhesion protein that contributes to the sorting of heterogeneous cell types | 796 | 0.349 ± 0.143 | 16.58 | 124–150, 157–167, 256–261, 269–279, 284–286, 508–511, 672–679, 702–714, 719–743 | 36 | 78 | 8 |
| P19474, E3 ubiquitin-protein ligase TRIM21 (TRIM21) | TRIM21 is an E3 ubiquitin-protein ligase | 475 | 0.317 ± 0.173 | 17.47 | Not detected | 37 | 71 | 105 |
| P78504, Protein jagged-1 (JAG1) | Jagged-1 is involved in the mediation of Notch signaling, serving as a ligand for multiple Notch receptors | 1218 | 0.273 ± 0.175 | 11.08 | 1120–1184, 1194–1218 | 53 | 170 | 21 |
| P28799, Granulins (GRN) | Granulins have cytokine-like activity and play a role in inflammation, tissue remodeling, and wound repair | 593 | 0.286 ± 0.137 | 10.12 | 346–353 | 31 | 48 | 121 |
| Q9H270, Vacuolar protein sorting-associated protein 11 homolog (VPS11) | VPS11 is related to the vesicle-mediated protein trafficking to lysosomal compartments including the endocytic membrane transport and autophagic pathways | 941 | 0.241 ± 0.150 | 8.11 | 608–611 | 67 | 306 | 59 |
| Q9BPW8, Protein NipSnap homolog 1 (NIPSNAP1) | NIPSNAP1 serves as a regulator of the transient receptor potential vanilloid channel 6 (TRPV6), which is an epithelial Ca2+ channel that mediates Ca2+ uptake in various tissues | 284 | 0.246 ± 0.140 | 7.04 | Not detected | 39 | 78 | 79 |
| P06733, α-Enolase (ENO1) | α-Enolase is a multifunctional enzyme involved in glycolysis and various biological processes, such as growth control, hypoxia tolerance, and allergic responses | 434 | 0.243 ± 0.119 | 3.46 | Not detected | 32 | 66 | 213 |
| Q14192, Four and a half LIM domains protein 2 (FHL2) | FHL2 serves as a molecular transmitter that links various signaling pathways to transcriptional regulation | 279 | 0.162 ± 0.095 | 1.43 | Not detected | 34 | 63 | 194 |
Fig. 3Disorder propensities of the ALKV polyprotein regions containing cleavage sites between capsid protein C and ER anchor (a), ER anchor and protein prM (b), peptide pr and protein M (c), protein M and envelope protein E (d), proteins E and NS1 (e), NS1 and NS2A (f), NS2A and NS2B (g), NS2A and NS3 (h), NS3 and NS4A (i), protein NS4A and peptide 2k (j), peptide 2k and NS4B (k), and proteins NS4A and NS5 (l). Disorder propensities were evaluated by PONDR® VLXT (black curves), PONDR® VL3 (red curves), PONDR® VSL2 (green curves), PONDR® FIT (pink curves), IUPred_short (yellow curves), and IUPred_long (blue curves). Mean disorder predisposition was calculated by averaging of all predictor-specific per-residue disorder profiles (bold, dashed, dark cyan curves). Light pink shadow around the PONDR® FIT curve represents error distribution for this predictor, whereas light blue shadow around the mean disorder curve show distribution of standard deviations. Cleavage sites are shown as gray bars
Fig. 4Effect of amino acid polymorphism on the intrinsic disorder predisposition of the different variants of the ALKV polyprotein evaluated by PONDR® VSL2. Different plots show consecutive 500 residue-long fragments of the ALKV polyprotein to better illustrate local changes in the disorder predisposition in regions surrounding polymorphic sites. The proteins are color coded, and the corresponding keys are given in figure legends
Fig. 5Analysis of the evolutionary conservation of the intrinsic disorder predisposition in some flaviviral polyproteins, such as ALKV (UniProt ID: Q91B85, black curves), KFDV (UniProt ID: D7RF80, dashed red curves), ZIKV (strain Mr 766, UniProt ID: Q32ZE1, dashed green curves), and DENV (type 1, strain Singapore/S275/1990, UniProt ID: P33478, dashed yellow curves). Different plots show consecutive 870 residue-long fragments of the aligned polyproteins to better represent some peculiarities in the local distribution of their intrinsic disorder predispositions. Sequences of these polyproteins were aligned using Clustal Omega algorithm [52]. The gaps in sequence alignments are shown as gaps in the corresponding disorder profiles. The original results of multiple sequence alignment including aligned sequences, conservation matrix, and evolutionary tree are shown in Supplementary Materials. Localization of cleavage sites is indicated by gray bars
Fig. 6Intrinsic disorder predisposition of mature ALKV proteins: a Capsid protein C; b Protein prM; c Envelope protein E; d non-structural protein 1, NS1; e non-structural protein 2A, NS2A; f serine protease subunit NS2B; g serine protease NS3; h non-structural protein 4A, NS4A; i non-structural protein 4B, NS4B; j RNA-directed RNA polymerase NS5. Disorder propensities were evaluated by PONDR® VLXT (black curves), PONDR® VL3 (red curves), PONDR® VSL2 (green curves), PONDR® FIT (pink curves), IUPred_short (yellow curves), and IUPred_long (blue curves). Mean disorder predisposition was calculated by averaging of all predictor-specific per-residue disorder profiles (bold, dashed, dark cyan curves). Light pink shadow around the PONDR® FIT curve represents error distribution for this predictor, whereas light blue shadow around the mean disorder curve shows distribution of standard deviations
Fig. 7Intrinsic disorder predisposition of the members of ALKV interactome: A Q92804, TATA-binding protein-associated factor 2N (TAF15); B Q9BV73, Centrosome-associated protein CEP250 (CEP250); C Q5QJE6, Deoxynucleotidyl transferase terminal-interacting protein 2 (DNTTIP2); D Q8IXK0, Polyhomeotic-like protein 2 (PHC2); E Q02833, Ras association domain-containing protein 7 (RASSF7); F Q9BV36, Protein melanophilin (MLPH); G Q9NWQ4, G patch domain-containing protein 2-like (GPATCH2L or C14orf118); H Q8IVT2, Mitotic interactor and substrate of PLK1 (MISP); I Q5VU43, Myomegalin (PDE4DIP); J Q8N5G2, Macoilin (MACO1); K Q99750, MyoD family inhibitor (MDFI); L Q96EA4, Protein Spindly (SPDL1 or Coiled-coil domain-containing protein 99 (CCDC99)); M Q9H814, Phosphorylated adapter RNA export protein [PHAX or RNA U small nuclear RNA export adapter protein (RNUXA)]; N Q96CN5, Leucine-rich repeat-containing protein 45 (LRRC45); O Q9H7J1, Protein phosphatase 1 regulatory subunit 3E (PPP1R3E); P Q68CP9, AT-rich interactive domain-containing protein 2 (ARID2); Q P35579, Myosin-9 (MYH9); R Q7Z5R6, Amyloid beta A4 precursor protein-binding family B member 1-interacting protein (APBB1IP); S P08670, Vimentin (VIM); T Q14160, Protein scribble homolog (SCRIB); U O00327, Aryl hydrocarbon receptor nuclear translocator-like protein 1 (ARNTL); V O15066, Kinesin-like protein KIF3B (KIF3B); W P31942, Heterogeneous nuclear ribonucleoprotein H3 (HNRNPH3); X O94983, Calmodulin-binding transcription activator 2 (CAMTA2); Y Q9Y6X2, E3 SUMO-protein ligase PIAS3 (PIAS3); Z O14530, Thioredoxin domain-containing protein 9 (TXNDC9); a Q9NS15, Latent-transforming growth factor beta-binding protein 3 (LTBP3); b Q13105, Zinc finger and BTB domain-containing protein 17 (ZBTB17); c P52742, Zinc finger protein 135 (ZNF135); d P55268, Laminin subunit beta-2 (LAMB2); e P55287, Cadherin-11 (CDH11); f P19474, E3 ubiquitin-protein ligase TRIM21 (TRIM21); g P78504, Protein jagged-1 (JAG1); h P28799, Granulins (GRN); i Q9H270, Vacuolar protein sorting-associated protein 11 homolog (VPS11); j Q9BPW8, Protein NipSnap homolog 1 (NIPSNAP1); k P06733, α-Enolase (ENO1); l Q14192, Four and a half LIM domains protein 2 (FHL2). Each plot contains an area graph representing the distribution of mean disorder predisposition calculated for a given protein by averaging of the per-residue disorder profiles generated by PONDR® VLXT, PONDR® VL3, PONDR® VSL2, PONDR® FIT, IUPred_short, and IUPred_long. The corresponding disorder profiles generated for query proteins by these six per-residue disorder predictors are assembled in Supplementary Materials
Fig. 8Global disorder propensity of the members of ALKV proteome (a) and interactome (b) presented as dependence of the percentages of residues in these proteins predicted to be disordered by averaging the outputs of PONDR® VLXT, PONDR® VL3, PONDR® VSL2, PONDR® FIT, IUPred_short, and IUPred_long on their mean disorder scores
Fig. 9Evaluation of the interset interactivity of human proteins involved in interaction with the ALKV proteins using the APID web server (http://apid.dep.usal.es). This tool builds a PPI network between proteins included in a query list. Here, each node corresponds to a protein, and where PPIs are shown as corresponding edges, thickness of which reflects the reliability of a given interaction
Fig. 10Evaluating functional intrinsic disorder in six most disordered members of the ALKV interactome, a Q92804, TATA-binding protein-associated factor 2N (TAF15); b Q9BV73, Centrosome-associated protein CEP250 (CEP250); c Q5QJE6, Deoxynucleotidyl transferase terminal-interacting protein 2 (DNTTIP2); d Q8IXK0, Polyhomeotic-like protein 2 (PHC2); e Q02833, Ras association domain-containing protein 7 (RASSF7); f Q9BV36, Protein melanophilin (MLPH), using the D2P2 database (http://d2p2.pro/) [147]. Here, the green-and-white bar in the middle of each plot shows the predicted disorder agreement between nine predictors, with green parts corresponding to disordered regions by consensus. Yellow bar shows the location of the predicted disorder-based binding sites (molecular recognition features, MoRFs), whereas colored circles at the bottom of the plot show location of various PTMs: phosphorylation (red), acetylation (yellow), glycosylation (orange), ubiquitination (violet), nitrosylation (light blue), methylation (dark blue), and SUMOylation (green)
Fig. 11Comparison of structural coverage and intrinsic disorder propensity of human proteins involved in interaction with ALKV proteins. The structural coverage of a protein is the percent of its residues with known structure, whereas disorder propensity is the mean content of disordered residues evaluated for each of these proteins by averaging the outputs of the per-residue disorder predictors, such as PONDR® VLXT, PONDR® VL3, PONDR® VSL2, PONDR® FIT, IUPred_short, and IUPred_long. Dashed line represents the linear fit of data for proteins with known structure