Literature DB >> 30352491

PreSMo Target-Binding Signatures in Intrinsically Disordered Proteins.

Do-Hyoung Kim1, Kyou-Hoon Han1.   

Abstract

Intrinsically disordered proteins (IDPs) are highly unorthodox proteins that do not form three-dimensional structures under physiological conditions. The discovery of IDPs has destroyed the classical structure-function paradigm in protein science, 3-D structure = function, because IDPs even without well-folded 3-D structures are still capable of performing important biological functions and furthermore are associated with fatal diseases such as cancers, neurodegenerative diseases and viral pandemics. Pre-structured motifs (PreSMos) refer to transient local secondary structural elements present in the target-unbound state of IDPs. During the last two decades PreSMos have been steadily acknowledged as the critical determinants for target binding in dozens of IDPs. To date, the PreSMo concept provides the most convincing structural rationale explaining the IDP-target binding behavior at an atomic resolution. Here we present a brief developmental history of PreSMos and describe their common characteristics. We also provide a list of newly discovered PreSMos along with their functional relevance.

Entities:  

Keywords:  IDPs; IDR (Intrinsically Disordered Region); IUPs (Intrinsically Unfolded Proteins); NMR; PreSMos (Pre-Structured Motifs)

Mesh:

Substances:

Year:  2018        PMID: 30352491      PMCID: PMC6199570          DOI: 10.14348/molcells.2018.0192

Source DB:  PubMed          Journal:  Mol Cells        ISSN: 1016-8478            Impact factor:   5.034


INTRODUCTION

Intrinsically Disordered Proteins

The central dogma in protein science, established over the last half-century, states that “a well-folded 3-D structure is a prerequisite for protein function”. The 3-D structure in this statement refers to the one that is observed under near-physiological conditions, (i.e., ~ pH 7, ambient temperature, and aqueous buffer, etc.). Intrinsically unstructured/unfolded proteins (IUPs), now more commonly known as intrinsically disordered proteins (IDPs) (Dunker et al., 2013), are very peculiar proteins that do not form well-folded 3-D structures even under non-denaturing conditions. Naturally, IDPs are of great importance from a protein folding perspective. More intriguing are the observations that IDPs are functional or active without 3-D structures, for example, being involved in transcription (Lee et al., 2000; Sherr, 2004; Kim et al., 2017a; 2017b), translation (Fletcher and Wagner, 1998; Kim et al., 2015), cell cycle regulation (Pavletich, 1999), chaperoning (Hong et al., 2005), and membrane-binding (Atwal et al., 2007; Eliezer et al., 2001). The discovery of many, as much as half of the entire human proteome (Dunker et al., 2000), such highly unorthodox proteins has strongly suggested that the classical structure-function relationship of proteins needs to be reexamined. Cleary, the golden paradigm in structural biology, 3-D structure = protein function, is no longer valid. Several reviews dealing with general aspects of IDPs are available for further reading (Chavali et al., 2017; Dunker et al., 2013; Lee et al., 2012; Uversky and Dunker, 2010; Uversky, 2015). Not only because of a basic scientific point of view are our interests in IDPs keen but also because of the fact that these proteins are involved in many fatal diseases. For example, ~80% of human cancers are associated with IDPs (Galea et al., 2008) such as eIF4E-binding proteins (4EBPs) (Fletcher and Wagner, 1998; Kim et al., 2015), Bcl-XL (Xu et al., 2009], human glucocorticoid receptors (Kim et al., 2017b), E7 (Lee et al., 2016), hypoxia inducible factors (Semenza, 2003; Kim et al., 2009a) and p53 all of which are so-called “hybrid-type” IDPs where intrinsically disordered regions (IDRs) coexist with globular domains (Lee et al., 2000; Wells et al., 2008). The causative agents of mad cow disease or Creutzfeldt-Jakob disease (CJD) in humans are prions that are also IDPs where a C-terminal globular domain coexists with a long intrinsically disordered region (IDR) at the N-terminus encompassing ~120 amino acid residues (James et al., 1997; Liu et al., 1999). Alpha-synuclein (Eliezer et al., 2001) and tau (Bibow et al., 2011; Künze et al., 2012), implicated in PD (Parkinson’s diseases) and AD (Alzheimer’s disease) respectively, are also IDPs. Furthermore, several viral strains including the well-known AIDS-causing HIV-1 produce IDPs (Chi et al., 2007; Feuerstein et al., 2012; Kim et al., 2009b; Lee et al., 2016; Liang et al., 2007; Reingewertz et al., 2009; To et al., 2016). Clearly, there is an immediate and strong need to acquire very thorough knowledge not only on the normal functionality of IDPs but also on their pathologic connection to above diseases since it has become apparent that the classical globular protein based approach is unlikely to provide us with sufficient information that can be used for developing effective weaponry against IDP-associated diseases.

PreSMos: Pre-Structured Motifs, a Historical Perspective

The most obvious characteristic of IDPs is that they do not possess spatially-disposed active pockets, a fact that brings us to a simple but profound question of how then these long malleable stretches of amino acids (sometimes hundreds of amino acids) can bind their targets. Targets of IDPs are not just proteins, but can be nucleic acids (Thapar et al., 2004; To et al., 2016; Wells et al., 2008), lipids, metals, and small molecules (Follis et al., 2008; Metallo, 2010). Efforts were made recently to classify IDPs into several subfamilies (van der Lee et al., 2014). While intuitive, such a classification fails to provide detailed insights into how all these different subfamilies bind their targets. The well-cited expression “coupled folding and binding” (Dyson and Wright, 2002) is a useful term, but only as far as one tries to depict the rather easily-predictable topological change that IDPs need experience upon binding to their partners. This generic description therefore fails to provide any atomistic details associated with IDP-target binding that, if available, would be highly valuable for IDP-based drug design. As the axiom “the devil is in the details” dictates, the question one must answer is rather specific. It has been amply demonstrated that only certain segments or residues of IDPs/IDRs are involved in direct physical contact with target. Do we then have a clear answer on what specific features in these segments or residues make target binding possible? Why does mutating just a few (often just one) sparsely-disposed hydrophobic residues in acidic transactivation domains (TADs) drastically affect the transcriptional activity whereas mutating several of the abundant acidic residues has only a marginal effect on the activity? (Chang et al., 1995; Drysdale et al., 1995) An early investigation attempted to address this question by employing wild type GAL4 and its scrambled mutant with no transcriptional activity (Giniger and Ptashne, 1987) and concluded that the mutant was inactive because its helix-forming propensity was compromised. This study triggered a huge controversy over whether target-free acidic TADs should form an amphipathic helix as the specificity determinant for activity (Van Hoy et al., 1993). Direct and quantitative evidence that some sort of a secondary structural element, e.g., helix, is needed for transcriptional activity came from an NMR study on p53 TAD (Lee et al., 2000). The 73-residue long p53 TAD in its unbound form was found “unstructured” in a tertiary sense, yet contained a transient (~25% populated only) amphipathic helix whose residues formed a stable amphipathic helix when complexed with the N-terminal p53-binding domain (residues 3–109) of mdm2 (Kussie et al., 1996). This pioneering NMR study heralded the birth of the PreSMo concept. Subsequent NMR reports confirmed that pre-existing, pre-formed, or pre-ordered residual secondary structures, no matter what they may be called, do exist in unbound IDPs and are important for target binding (Lee et al., 2012). In the early days of IDP research, another line of thought prevailed advocating a notion of induced fit (IF), arguing that no pre-existing secondary structures were needed for target binding based upon the conclusion that IDPs are fully unstructured. A well-known example is the 4EBP1, a 118-residue translational inhibitor, which was reported to have “no regions of local order in the absence of eIF4E” (Fletcher et al., 1998). For the last two decades, this IDP has been known as the symbol of the completely unstructured (CU) nature of IDPs; however, a recent NMR study revealed that this IDP also contains a pre-structured helix which mediates its binding to eIF4E (Kim et al., 2015). Another well-known IDP is the kinase-inducible domain (KID) of CREB the NMR results on which supported the concept that IDPs must be in the CU state so that they must undergo “a coil -> helix folding transition” via IF (Radhakrishnan et al., 1997). It is unclear how the authors of this particular report reached the conclusion that “the population of helix in free pKID is extremely small.” when their NMR data indicated presence of two transient helices (one ~60% and the other ~10% populated). Another group which worked on the same system concluded that two helix PreSMos were present (Table 1; Hua et al., 1998; Lee et al., 2012).
Table 1

A list of MU-type IDPs/IDRs containing PreSMos

NameNumber of residuesP/RbLocation of PreSMo residuescPopulationd (%)Role/BindingReferences
FlgM97P60–7350±10σ28Daughdrill et al., 1997
83–9050±10
42–5020
KID60R119–129>50KIXRadhakrishnan et al., 1998
134–143~10Hua et al., 1998
GBD/CRIB in WASP W768R252–264~14Cdc42/RacRudolph et al., 1998
(201–268)
HIV-1 Nef56R14–22 : helix I18Geyer et al., 1999
(2–57)35–41 : helix II (Hα only)
Synaptobrevin-296R78–9145core complex formingHazzard et al., 1999
APPC47R20–2330X11Ramelot et al., 2000
(649–695)27–3520
37–45 (Hα only)30
p53 TAD73R18–26 : helix20Mdm2Lee et al., 2000
40–44 : turn I5RPA, TFEII
48–53 : turn II15
RPS4200P12–158rRNA, ribosomal proteinsSayers et al., 2000
30–33: β?23
α-Synuclein140P18–31~10amyloid-formingEliezer et al., 2001
Murrali et al., 2018
N-term. Tmod 192R24–35NAtropomyosinGreenfield et al., 2005
VP16 TAD79443–44725hTAFII31 PC4Jonker et al., 2005
(412–490)R469–48315
VP16 TAD79R424–433/442–446, 465–467/472–479 (Hα only)60/40hTAFII31 PC4Kim et al., 2009
(412–490)10/20
Dynein interm. chain40R223–228NAlight chainsBenison et al., 2006
(198–237)Benison et al., 2007
γ-Synuclein127P49–99~15Marsh et al., 2006
HMGA1107P3–9820 different proteinsBuchko et al., 2007
64–67
CFTR185Rα-helixinteraction between R region and NT-binding domain 1Baker et al., 2007
(654–838)654–668, 759–764, 766–776, 801–817>5
>5
β-strand
744–753>5
NS5A-D2 (HCV)93RL48-V5720-Liang et al., 2007
(250–342)L86-E96 (Hα only)25
preS1 of HBV119R32–36, 41–45~10hepatocyte receptor-bindingChi et al., 2007
11–18, 22–25, 37–40, 46–50. (Hα only)~10
~10
β-synuclein134PNA~20-Sung et al., 2007
Securin202P150–159 : helix45-Csizmok et al., 2008
113–127 (β)15
174–17820
C-XPCe126R818–843: helix~30Centri2Miron et al., 2008
(815–940)847–860: helix~30TFIIH
891–901: helixNA
908–915: helixNA
923–930: helixNA
MSP2237P14–2135-Zhang et al., 2008
140–15035
197–21120
DARPP-32118R22–2950PP1Dancheck et al., 2008
103–11425
I-2156R36–4230PP1Dancheck et al., 2008
(9–164)96–10648 (70)
127–15467 (90)
132–138>98
ENSA121P32–3640-Boettcher et al., 2008
48–5010Boettcher et al., 2007
65–7030
ODD/HIF-1α74R438–440~10-Kim et al., 2009
(404–477)467–477
Sml1104P4–14: helix~20RNR bindingZhao et al., 2000
(1–104)61–80: helix~70Dimer forming
Myb2525R295–309 : helix25~30KIXZor et al., 2002
(291–315)
N tail125R488–499 : helixNAphosphoprotein PBourhis et al., 2004
Measles virus nucleoprotein(401–525)
dSLBP92R28–45 : helixNAmRNAThapar et al., 2004
(17–108)50–57 : helixstem-loop
66–75 : helix
91–96 : helix
Tβ-443P5–16 : helixNACa ATPDomanski et al., 2004
(1–43)G-actin
N tail82R479–48436phosphoprotein PJensen et al., 2008
Sendai Virus nucleoprotein(443–524)476–48838
478–49211
Sic190R20–3020Cdc4Mittag et al., 2008
(1–90)63–68
c-Myc88R26–34 : helix40Bin-SH3 domainAndresen et al., 2012
(1–88)47–52 : helix2524–31(TRRAP binding)
20–23 : β-turn
ExsE88P42–51: helixNAExsCZheng et al., 2012
(1–88)61–65: helix
NS5A415R401–412 : helixNABin1-SH3Braeuning, 2013
HCV(33–447)427–445 : helix
NS5A179R205–221 : helix I38Bin1-SH3Feuerstein et al., 2012
HCV(191–369)251–266 : helix II38Solyom et al., 2015
292–306 : helix III51
4EBP2120P1–515~37eIF4ELukhele et al., 2013
(1–120)33–37
50–64
86–89
96–105
E740R8–13 : helixNAE2Noval et al., 2013
HPV(1–40)17–29 : helix
33–38 : PPII
4EBP170R56–63 : helix20eIF4EKim et al., 2015
(49–118)
Myb3232R290–310 : helix~70KIXArai et al., 2015
(284–315)
E746R7–14 : helix10E2Lee et al., 2016
HPV(1–46)20–26 : helix20
CBP-ID4207R1852–1875: helix~60-Piai et al., 2016
(1851–2057)1951–1978: helix
HIV-1 Tat121P27–32: helix~20Fab’To et al., 2016
(1–121)a41–59: helix~30P-TEFb
70–81: β sheet~25TAR-cyclin T1
93–99: β sheet~25
105–112: β sheet~10
SUSP4100R263–291 : helix~30mdm2Kim et al., 2017
(201–300)265–270 : helix~10
281–291 : helix
hGRtau1c64R185–202: helix20~30TAZ2Kim et al., 2017
(181–244)206–225: helix
232–244: helix
Huntingtin Httex1 25Q95P18–42: helixNACytotoxicNewcombe et al., 2018
(1–95)Membrane binding
Aggregation

The numbering includes a 20-residue N-terminal tag.

An IDP (P) versus an IDR (R).

Residue numbers are taken from the original report.

Population of PreSMos are read from the mid-point of the SSP scores that are calculated from chemical shifts in BMRB or literature. Shown in bold are the populations described in the original report. When the populations described in the original report without SSP scores differed significantly from the calculated SSP scores, the SSP scores are provided in parenthesis.

NA = not available.

Determined by SAXS.

While the conceptual development on PreSMos has been somewhat delayed due to previous misconceptions that IDPs were completely unstructured, the presence of local residual secondary structures in isolated IDPs has been increasingly detected by many NMR investigations including a few critical NMR reports published at the turn of the century. The first key report found that p53 TAD has local structural elements (a helix and two turns) in the unbound state, as described above (Lee et al., 2000). The second report made by Ramelot et al. demonstrated that the cytoplasmic tail of the amyloid precursor protein forms a transient structure and such a pre-ordered structure is important for its binding to cytosolic factors (Ramelot et al., 2000). Sayers et al. also reported that structural preordering important for target binding was detected in the N-terminal region of ribosomal protein S4 (Sayers et al., 2000). Zhao et al. reported local structural elements in the overall loosely folded Sml1 (Zhao et al., 2000). Zitzewitz et al. published an article in 2000 with a title of “Preformed secondary structure drives the association reaction of GCN4-p1, a model coiled-coil system” (Zitzewitz et al., 2000). Another report by Bienkiewicz et al. described the functional consequences of pre-organized helical structure in the intrinsically disordered cell-cycle inhibitor p27 (Kip1) (Bienkiewicz et al., 2002). All these early NMR studies contributed to the foundation of the PreSMo concept, the idea that IDPs are not completely unstructured, but mostly unstructured (MU), and contain PreSMos. Following these NMR reports, bioinformatics studies proposed similar concepts such as PSE (Pre-formed Structural Element) (Fuxreiter et al., 2004), MoRF (Molecular Recognition Element) (Mohan et al., 2006; Oldfield et al., 2005), or primary contact sites a few years later. All these results, NMR experimental or predicted, point in unison to the idea that IDPs possess local secondary structural elements that are “hot spots” for target-binding. In 2012 we published the first comprehensive review on PreSMos (Lee et al., 2012) because no explicit articles on the subject were available, despite the fact that PreSMos (whatever they may be called) have been recognized for more than a decade as very important (perhaps the most significant) features explaining IDP-target binding on a per-residue basis. Several additional pieces of evidence have recently been published, demonstrating the functional significance of PreSMos (Kim et al., 2017b; Iešmantavičius et al., 2014; Mohan et al., 2014; Salamanova et al., 2018). In the first review, we presented 27 IDPs/IDRs containing PreSMos which constitute ~56% of all IDPs characterized by then. Most critically, we introduced the term pre-structured motifs (PreSMos) in order to unambiguously point out the importance of the pre-structured nature of target-binding segments in free IDPs and to provide a convenient term that can replace various names “transient, nascent, residual, minimally-structured, non-negligible, pre-existing, pre-formed, or pre-ordered secondary structures”. These terms were used mainly by NMR structural biologists who did not hasten to generalize the concept with a particular name realizing that PreSMos had only been observed in a handful of IDPs until 2005. This review is a follow-up to our 2012 review. Because we have found 20 more PreSMos since our first review here we provide an updated list of PreSMos and a brief description on their functional significance; however, we acknowledge that the list may still be incomplete. In addition, we describe differences between the PreSMos that are detected experimentally and the terms derived from bioinformatics predictions. With this review we now have 47 IDPs/IDRs containing PreSMos, strongly suggesting that PreSMos are general signatures in most IDPs.

DISCUSSION

Definition of a PreSMo

The definition of a PreSMo was given in our 2012 review (Lee et al., 2012); PreSMos are NMR-detected transient secondary structural elements within long (minimally 40 residues) and functionally-active IDRs of IDPs. We underline the fact that PreSMos are the experimentally observable entitites in NMR analyses or other atomic-resolution experiments no matter how minimally it might be pre-populated; it is a measured quantity, not predicted notions. This contrasts with MoRF (Mohan et al., 2006), which is a theoretical concept derived from the target-bound conformations of short segments (peptides) of IDRs (Fig. 1). IDPs exist as an ensemble of many different conformers separated by small energy differences. A conformer with a PreSMo would be one in the ensemble that is populated to an NMR-detectable degree. The lowest population of a PreSMo-containing conformer observed to date is ~10% (Lee et al., 2012).
Fig. 1

PreSMo vs. MoRF. A schematic diagram of the main differences between a PreSMo and a MoRF

A PreSMo is observed mostly by NMR experiments in the target-free state of IDPs. Since free-state IDPs exist as an ensemble of many conformers separated by small energy differences, structural superposition among different conformers along the backbone atoms is not possible. Nevertheless, a structural zsuperposition along a PreSMo is possible as shown in the left panel for the helix PreSMo of 4EBP1 (Kim et al., 2015). A PreSMo may become a MoRF upon target binding as illustrated for this helix PreSMo in 4EBP1 which becomes an α-MoRF when bound to eIF4E.

Table 1 is an updated list of PreSMos found in 47 IDPs/IDRs. The total number of IDPs studied in detail by NMR (with an exception of C-XPC studied by SAXS) is 70 even though the number of reports are more than 70 reports because some IDPs were investigated more than once. Notably, several IDPs (4EBP1, HIV-1 Tat, VP16 TAD, securin, and p21Waf1/Cipl/Sdil) that were originally reported as CU types with no PreSMo turned out to be MU types in later studies. For convenience, we added the 20 newly-identified PreSMos (starting from Myb25) at the end of Table 1, including a few PreSMos that were actually reported before 2012, but were not included in our 2102 review. Although the number of investigated IDPs is small compared to the possible number of IDPs/IDRs predicted by bioinformatics (thousands or more) it is sufficient to provide an overview on PreSMos. In 2012, the number of IDPs/IDRs with PreSMos was 27 (out of 48 studied) it is now 48 out of 70; the proportion of MU type IDPs/IDRs increased from 56% to 69%. The proportion is likely to increase if more IDPs/IDRs are characterized. One immediate feature noted in Table 1 is that in most cases we essentially study IDRs rather than IDPs (only 15 are IDPs), although we speak of IDPs. Note that all IDPs/IDRs in Table 1 are composed of more than 40 residues except for Myb25/Myb32. IDPs by definition consist of a minimal 40 residues and are distinct from the short flexible linkers and loops typically composed of fewer than 20 residues. The other feature shown in Table 1 is that most PreSMos are helices even though some are turns, β-strands and poly-proline type II helices. A high percentage of helices is also noted in MoRFs where α-MoRFs are the majority (Mohan et al., 2006; Oldfield et al., 2005). NMR is the main tool that enables quantitative definition of a PreSMo (Chi et al., 2007; Eliezer et al., 2001; Kim et al., 2009a; 2009b; 2015; 2017b; Lee et al., 2000; 2012; 2016; Liu et al., 1999; Xu et al., 2009). The beauty of NMR technique is that the presence of a PreSMo is reflected in several independent NMR parameters. In the early days, one needed to provide all of these NMR parameters (chemical shifts, inter-proton NOEs, J-couplings, T1 and T2 relaxation times, heteronuclear NOEs, temperature coefficients of backbone amide protons, etc.) to prove the existence of a PreSMo (Lee et al., 2000), whereas it usually is sufficient in recent years to just provide SSP (secondary structure propensity) scores (Marsh et al., 2006) as the concept of PreSMos has become more and more widely accepted. The SSP scores derived from CSIs (chemical shift indices) reveal an actual percentile value of a PreSMo population whereas CSIs can only indicate whether or not a PreSMo is present. A very important feature of a PreSMo is that it is never 100% populated. On the average, they are ~30% pre-populated, i.e., transient (Lee et al., 2012). This transient nature of PreSMos probably is the main cause that made several NMR investigators fail to detect them in the early days (Fletcher and Wagner, 1998; O’Hare and Williams, 1992; Radhakrishnan et al., 1997).

PreSMo vs. MoRF

The most common bioinformatics term used interchangeably with PreSMos is MoRFs (Mohan et al., 2006). For example, the mdm2-binding helix PreSMo detected by NMR in free p53 TAD is reported as an α-MoRF, a MoRF seen as an alpha helix in the target-bound state (Oldfield et al., 2005). Although there are a few more (out of more than a hundred) MoRFs that overlap with PreSMos fundamental differences exist between MoRFs and PreSMos. By definition MoRFs were identified in the x-ray structures of complexes between target proteins and short fragments of IDPs/IDRs that were predicted to be disordered by bioinformatics disorder prediction algorithms. The concept of the MoRF implicitly acknowledges the idea that the structured, bound-conformation is induced only upon target binding which is based on the early-day idea that IDPs have no pre-structured secondary structures. On the other hand, the definition of a PreSMo is not associated with the target-bound structure at all. In this regard, stating that a MoRF is found by NMR experiments is inaccurate (Bourhis et al., 2004) since one cannot tell if a MoRF would exist within an isolated IDP. One has to obtain a complex structure between a target and a PreSMo/MoRF in order to conclude that the putative MoRF (which is actually a PreSMo) is indeed a MoRF. Thus, a helix PreSMo may become an α-MoRF, but the opposite may not necessarily be true. With PreSMos we get the realistic percentage of the pre-structuredness whereas MoRFs do not provide such information. The term PreSMo was introduced as late as in 2012, but we underline that the PreSMos mentioned here refer to all the pre-existing or pre-formed residual secondary structures detected by NMR years before the term MoRF was introduced. It will be interesting to see how many of MoRFs may indeed coincide with PreSMos. One has to use a MoRF fragment, or preferably a longer IDR that encompasses such a MoRF fragment, to answer this question. An active pocket is a property of a globular protein that exists before binding to its target. In this regard, PreSMos qualify as the “active sites”, albeit not pockets, of IDPs since they are present before target binding. The same cannot be said for MoRFs. In Fig. 1, we show a conceptual scheme depicting what we have just described.

Characteristics of PreSMos

PreSMos are the “active sites” of IDPs

As is evident from Table 1 the PreSMos are the target-binding hot spots already present in free IDPs/IDRs; PreSMos are primed in a conformation similar to the target-bound conformation. Such pre-structuring is certainly advantageous for avoiding an entropic penalty that has to be paid when malleable IDPs/IDRs bind globular targets. Recent mutation studies demonstrated that the degree of pre-population of PreSMos is subtly controlled for efficient target binding (Borcherds et al., 2014; Iešmantavičius et al., 2014; Kim et al., 2017b; Salamanova et al., 2018). In many globular proteins a single mutation in the active site completely nullifies protein function by disabling the binding of ligands. PreSMos are often found in tandem within sufficiently long transcription factor IDPs/IDRs separated by ~30 residues (Chi et al., 2005). One PreSMo may be a high-affinity binding site to a target whereas the other is a low-affinity site to the same target. A synergistic effect of multiple PreSMos for efficient target binding has been discussed previously (Lee et al., 2000).

Shape complementarity in IDPs

Since it was believed that any secondary structure in IDPs should be induced only upon target binding many implicitly concluded that IDPs would totally lie outside of the classical structure-function paradigm, not obeying the rules established by structural biology such as shape complementarity. However, PreSMos reveal to us that IDPs abide by the shape complementarity extremely well via binding to targets (see Fig. 3 in Lee et al., 2012). In other words, when the secondary structural aspects for IDP-target binding are considered IDPs are not unorthodox at all. The genuine novelty of IDPs is the absence of 3-D structures only, not the absence of secondary structures. Structure (or PreSMos) does dictate function in the case of IDPs.

Practical tips for NMR detection of PreSMos

The NMR spectral quality of hybrid-type IDPs is often not good enough for a full resonance assignment since a globular domain and an IDR will tumble around in different time scales. Consequently, a reductionist approach of using an IDR instead of a whole IDP is often necessary. One precaution when using such an approach is that one should use a sufficiently long region, not a short fragment since PreSMos may exist in the outside of the region covered by a short peptide (Botuyan et al., 1997; Uesugi et al., 1997). A longer IDR often contains a more populated PreSMo due to a tertiary effect that stabilizes the transient secondary structures, as was demonstrated in the case of p53 TAD and its short helical peptide (Botuyan et al., 1997; Lee et al., 2000). Another case demonstrating the significance of using a fragment of appropriate length is Myb 25/Myb32 (Table 1; Arai et al., 2015). The populations of a helix PreSMo in Myb25 and in Myb32 are ~30% and ~70%, respectively, demonstrating that having just 7 more residues in Myb32 drastically increases the PreSMo population by ~40%. Using bioinformatics disorder prediction programs may keep one from choosing an inappropriate IDR for NMR experiments. The inappropriate choice of an IDR for NMR investigation might be another reason why some NMR studies failed to detect PreSMos.

CONCLUSION & PERSPECTIVE

Because IDPs are relatively a new field several new (sometimes rather vague) terms and expressions were introduced in order to describe novel concepts or phenomena associated with IDPs (van der Lee et al., 2014). Aside from bioinformatics terms (PSEs, MoRFs) other numerous expressions basically with the same meaning as PreSMos were proposed such as “only partly structured” (Zor et al., 2002), “small islands of secondary structures” (Laptenko and Prives, 2006), “weakly structured” (Chumakov, 2007), “limited structure” (Lavery and McEwan, 2008), “minimal ordering of short linear motifs” (Mittag et al., 2008), “residual secondary structural elements” (Kim et al., 2009b), “transient order” (Feuerstein et al., 2012), “transiently ordered regions”, “localized structurally ordered regions” (Zheng et al., 2012), and dynamic local structure (Lum et al., 2012) just to name a few. Being flooded with so many terms that are intended to denote PreSMos is not unique for PreSMos. For example, it took more than a decade for the IDP research community to come up with a more or less consensus term for IDPs in 2013 (Dunker et al., 2013). Yet overly creative names not precisely in line with the classical concepts and terms in structural biology or protein science created a certain degree of confusion that led to a situation where the importance of IDPs was not duly appreciated for some time (Uversky and Dunker, 2010). Here, we present again an easy-to-use term of PreSMos to designate what has been described by several generic names realizing that the existence and functional significance of PreSMos will be appreciated more and more (now in ~70% of IDPs). Most importantly, the statement that IDPs would adopt structure only upon target binding is misleading because it implies that IDPs are structureless down to the level of secondary structures. On the contrary, target binding only tightens (some structural induction) a PreSMo into a more stable conformation, but does not let a random-coil turn into a structure. In hindsight, the presence of PreSMos is in excellent agreement with the observations that a protein cannot exist in a fully random-coil state; denatured globular proteins are not random coils (Baldwin and Zimm, 2000; Bernadó et al., 2005; Neri et al., 1992). Approximately 20 years have passed since IDPs emerged in protein science and structural biology communities. With more than ~5,000 papers on the subject no one would deny that IDPs have brought a critical paradigm shift to protein research, undoubtedly requiring that biochemistry textbooks be revised to include IDPs. There has been a tendency to put excessive emphasis on the disordered nature per se of IDPs with subsequent attempts trying to relate it to function due to an early-day misconception. For example, some reports on PreSMos were interpreted simply as evidence for disorder itself rather than as evidence for the existence of PreSMos (Cheng et al., 2006; Midic et al., 2009; Radivojac et al., 2007). It is important for the protein science community to learn a non-traditional view on proteins and their structures in two aspects. First, it is now well-known fact that long regions (40 residues and up) of proteins can be intrinsically disordered beyond the level of short disordered loops (Dunker et al., 2000). Proteins exist as dynamic conformational ensembles, not as snap-short entities that the PDB structures (both x-ray and NMR) have depicted for a long time. Second, in the absence of a well-defined 3D structure, the minimal residual secondary structures embedded into the flexible long IDR play key roles in target binding and govern the function of IDPs. Even in globular proteins, an important role of tertiary structure is to place the interacting (or active) secondary structures in a proper orientation relative to target proteins. A discussion of PreSMos naturally brings us to the question of whether the mechanism of IDP-target binding follows IF (induced fit) or CS (conformational selection). In the case of KID-KIX binding IF (Sugase et al., 2007) was shown to be dominant whereas in the N-tail of viral nucleoproteins CS appeared prevalent (Jensen et al., 2008). In recent years, it is believed that these two mechanisms would work in concert; CS at the start of binding and IF at the final stage of binding (tightening). The existence of PreSMos itself is not an evidence for CS and one need to use a kinetics approach in order to determine if faster binding (kon increased) can be achieved with more pre-structuring of the PreSMo segments. Future works employing PreSMo mutants should provide a more concreate answer on this aspect. No matter whether PreSMos are pre-structured or not, i.e., even if a PreSMo may become unstructured and re-structured for binding as one may envision in the IF model (To et al., 2016) it still does not change the fact that the fragment forming a PreSMo per se is important for target binding. It is possible that PreSMos are also important for aggregation via oligomerization (Atwal et al., 2007; Eliezer et al., 2001). Both oligomerization and IDP-target binding are protein-protein interactions; the former is homogenous IDP-IDP self-binding while the latter is heterogeneous binding. Even though the PreSMo concept is broadly (~70%) applicable we do not expect that it should be applicable to all IDPs since there are IDPs/IDRs that are composed of simple dipeptide repeats (Lee et al., 2016). The PreSMo concept is also unlikely to be applicable to highly charged polyvalent IDPs which maintain unfolded topology even after target binding (Borgia et al., 2018). Due to strong attractive electrostatic interactions these IDPs have a very high affinity (pM) towards each other, unlike MU-type IDPs that bind their targets via PreSMos typically with μM affinities. However, it is noteworthy that even polyglutamine and polyproline were shown to form α-helical and PPII helix type secondary structures, respectively (Mukrasch et al., 2009; Newcombe et al., 2018). Recent reports showed that IDP studies may lead to the development of new pharmaceuticals. For example, some PreSMo-antagonists against target proteins could serve as anti-cancer compounds (Kim et al., 2017a) and certain small molecule inhibitors can directly inhibit IDPs themselves (Follis et al., 2008; Metallo, 2010).
  107 in total

1.  Transient structure of the amyloid precursor protein cytoplasmic tail indicates preordering of structure for binding to cytosolic factors.

Authors:  T A Ramelot; L N Gentile; L K Nicholson
Journal:  Biochemistry       Date:  2000-03-14       Impact factor: 3.162

2.  Preformed structural elements feature in partner recognition by intrinsically unstructured proteins.

Authors:  Monika Fuxreiter; István Simon; Peter Friedrich; Peter Tompa
Journal:  J Mol Biol       Date:  2004-05-14       Impact factor: 5.469

Review 3.  Transcriptional regulation by p53: one protein, many possibilities.

Authors:  O Laptenko; C Prives
Journal:  Cell Death Differ       Date:  2006-06       Impact factor: 15.828

4.  Solution structure of the KIX domain of CBP bound to the transactivation domain of CREB: a model for activator:coactivator interactions.

Authors:  I Radhakrishnan; G C Pérez-Alvarado; D Parker; H J Dyson; M R Montminy; P E Wright
Journal:  Cell       Date:  1997-12-12       Impact factor: 41.582

5.  The Mechanism of p53 Rescue by SUSP4.

Authors:  Do-Hyoung Kim; Chewook Lee; Si-Hyung Lee; Kyung-Tae Kim; Joan J Han; Eun-Ji Cha; Ji-Eun Lim; Ye-Jin Cho; Seung-Hee Hong; Kyou-Hoon Han
Journal:  Angew Chem Int Ed Engl       Date:  2016-12-21       Impact factor: 15.336

6.  The acidic activation domains of the GCN4 and GAL4 proteins are not alpha helical but form beta sheets.

Authors:  M Van Hoy; K K Leuther; T Kodadek; S A Johnston
Journal:  Cell       Date:  1993-02-26       Impact factor: 41.582

7.  The Dynamic Landscape of the Full-Length HIV-1 Transactivator of Transcription.

Authors:  Vu To; Edis Dzananovic; Sean A McKenna; Joe O'Neil
Journal:  Biochemistry       Date:  2016-02-25       Impact factor: 3.162

8.  Binding of the three-repeat domain of tau to phospholipid membranes induces an aggregated-like state of the protein.

Authors:  Georg Künze; Patrick Barré; Holger A Scheidt; Lars Thomas; David Eliezer; Daniel Huster
Journal:  Biochim Biophys Acta       Date:  2012-04-06

9.  Structural characterization of the native NH2-terminal transactivation domain of the human androgen receptor: a collapsed disordered conformation underlies structural plasticity and protein-induced folding.

Authors:  Derek N Lavery; Iain J McEwan
Journal:  Biochemistry       Date:  2008-02-20       Impact factor: 3.162

10.  The carboxy-terminal domain of xeroderma pigmentosum complementation group C protein, involved in TFIIH and centrin binding, is highly disordered.

Authors:  Simona Miron; Patricia Duchambon; Yves Blouquit; Dominique Durand; Constantin T Craescu
Journal:  Biochemistry       Date:  2008-01-05       Impact factor: 3.162

View more
  7 in total

Review 1.  Intrinsically disordered proteins and proteins with intrinsically disordered regions in neurodegenerative diseases.

Authors:  Orkid Coskuner-Weber; Ozan Mirzanli; Vladimir N Uversky
Journal:  Biophys Rev       Date:  2022-06-08

Review 2.  Targeting Intrinsically Disordered Proteins through Dynamic Interactions.

Authors:  Jianlin Chen; Xiaorong Liu; Jianhan Chen
Journal:  Biomolecules       Date:  2020-05-11

Review 3.  Transient Secondary Structures as General Target-Binding Motifs in Intrinsically Disordered Proteins.

Authors:  Do-Hyoung Kim; Kyou-Hoon Han
Journal:  Int J Mol Sci       Date:  2018-11-15       Impact factor: 5.923

Review 4.  Salient Features of Monomeric Alpha-Synuclein Revealed by NMR Spectroscopy.

Authors:  Do-Hyoung Kim; Jongchan Lee; K H Mok; Jung Ho Lee; Kyou-Hoon Han
Journal:  Biomolecules       Date:  2020-03-10

Review 5.  Liquid-Liquid Phase Separation by Intrinsically Disordered Protein Regions of Viruses: Roles in Viral Life Cycle and Control of Virus-Host Interactions.

Authors:  Stefania Brocca; Rita Grandori; Sonia Longhi; Vladimir Uversky
Journal:  Int J Mol Sci       Date:  2020-11-28       Impact factor: 5.923

6.  Analysis of Protein Disorder Predictions in the Light of a Protein Structural Alphabet.

Authors:  Alexandre G de Brevern
Journal:  Biomolecules       Date:  2020-07-20

7.  Interplay of Structural Disorder and Short Binding Elements in the Cellular Chaperone Function of Plant Dehydrin ERD14.

Authors:  Nikoletta Murvai; Lajos Kalmar; Bianka Szalaine Agoston; Beata Szabo; Agnes Tantos; Gyorgy Csikos; András Micsonai; József Kardos; Didier Vertommen; Phuong N Nguyen; Nevena Hristozova; Andras Lang; Denes Kovacs; Laszlo Buday; Kyou-Hoon Han; Andras Perczel; Peter Tompa
Journal:  Cells       Date:  2020-08-07       Impact factor: 6.600

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.