| Literature DB >> 32831102 |
Pernille Seiffert1,2, Katrine Bugge1,2, Mads Nygaard1,2, Gitte W Haxholm1,2, Jacob H Martinsen1,2, Martin N Pedersen3, Lise Arleth3, Wouter Boomsma4, Birthe B Kragelund5,6.
Abstract
BACKGROUND: Class 1 cytokine receptors (C1CRs) are single-pass transmembrane proteins responsible for transmitting signals between the outside and the inside of cells. Remarkably, they orchestrate key biological processes such as proliferation, differentiation, immunity and growth through long disordered intracellular domains (ICDs), but without having intrinsic kinase activity. Despite these key roles, their characteristics remain rudimentarily understood.Entities:
Keywords: CIDER; Cytokine receptors; Disorder; IDDomainSpotter; IDPs; IDRs; NMR; SAXS; SLiM; Signaling; Structural biology; Transmembrane receptors
Year: 2020 PMID: 32831102 PMCID: PMC7444064 DOI: 10.1186/s12964-020-00626-6
Source DB: PubMed Journal: Cell Commun Signal ISSN: 1478-811X Impact factor: 5.712
Fig. 1The C1CR family - structures and ICD isoforms. a Structural model of the full-length human PRLR [35] in the membrane, with the ECD in light grey, the TMD in pink and a total of six ICD chains shown in blue shades to represent its disordered conformational ensemble. b Left: A representative sketch of a prototypical C1CR (blue) in a membrane (green) having a long ICD with its length given in %. Box1 and Box2 are highlighted in orange. The red shades highlight the approximate extent and position of small ICD fragments of different C1CR-ICDs having their structure solved in complex with signaling proteins. Right: Three-dimensional structures of signaling proteins (grey surfaces) in complex with C1CR-ICD fragments (red cartoon and sticks), being the FERM-SH2 domain of JAK2 in complex with EPOR-ICD279–334 (pdb code 6E2Q) (top), SOCS2 in complex with GHR-ICD591–603 (pdb code 6I5J, 6I5N) (middle top), the PTB domain of IRS-1 in complex with IL-4Rα490–500 (pdb code 1IRS) (middle bottom) and the PDZ domain from syntenin in complex with IL-5Rα417–420 (pdb code 1OBZ) (bottom). The grey arrows point to the approximate positions of the ICD fragments (red shades) on the representative ICD sketch. c The 29 receptors of the C1CR family with an ICD and their isoforms. The length of the ICDs are to scale and red indicates the length of unique sequences differing from the long-form. The receptors are divided into 5 groups according to [33].
Fig. 2Intrinsic disorder and compositional bias of C1CR-ICDs. a Disorder prediction for C1CR common receptors and group 1. The disorder propensity, ranging from 0 to 1 was predicted using IUPred2A (blue), ANCHOR (black) and Pondr-fit VSL2 (red) and is plotted as a function of residue number. The boundaries between the ECD (white background), TMD (yellow background) and ICD (orange background) were predicted using TMHMM v. 2.0. The sequence numbering includes the signal peptide. b Fractional differences in composition between the C1CR-ICDs (dark grey) or a set of IDPs (light grey), and a set of folded proteins calculated for each amino acid type (see methods for details). Negative values denote that the amino acid is less frequent than in folded proteins, and positive values denote that the amino acid is more frequent than in folded proteins. The arrows indicate the directions of “more” abundant than in folded proteins, and “less” abundant than in folded proteins. The error bars indicate the 90% confidence interval of the estimated frequencies, calculated using a per-protein bootstrapping procedure with 1000 iterations [105]. c IDDomainSpotter profiles of group 1 C1CR-ICDs. Profiles display scores for +Arg,Lys-Asp,Glu (green), +Cys (brown), +Pro (purple), + Ile,Leu,Val (red) and + Phe,Tyr,Gly (blue) calculated over 15 residue windows for each of the ICDs
Overview of isoforms of the cytokine class 1 receptor family differing in the ICD
| Class 1 Cytokine Receptors with Intracellular Domains | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Name# | Iso | Alternative | Uniprot | ICD sequence | Residuesa | Unique | Unique | ICD length | ICD start | |
| GROUP 1 - HOMODIMERIC RECEPTORS | 1,2b | PRLRfl1 ΔS12 | 6221 5212 | 364 | 259 | |||||
| 4 | SF1a | 376 | 338 | 376 | 118 | 259 | ||||
| 5 | I | 349 | 337 | 349 | 91 | 259 | ||||
| 6,8c | SF1b6 Δ4-SF1b8 | 2886 2178 | 2866 2158 | 2886 2178 | 30 | 2596 1888 | ||||
| 9 | SF1c | 309 | 286 | 309 | 51 | 259 | ||||
| 1,4d | GHRfl1 GHRd34 | 6381 6164 | 351 | 2881 2664 | ||||||
| 2 | GHRtr GHR1-279 | 297 | 292 | 297 | 10 | 288 | ||||
| 3 | GHR1-277 | 294 | 292 | 294 | 7 | 288 | ||||
| F | FL | 508 | 236 | 273 | ||||||
| T | Truncated | 328 | 306 | 328 | 56 | 273 | ||||
| 1 | C-mpl-P | 635 | 121 | 515 | ||||||
| 2 | C-mpl-K | 579 | 523 | 579 | 66 | 514 | ||||
| GROUP 2 - gp130 FAMILY | 1 | YRKREWIKETFYPDIPNPENCKALQFQKSVCEGSSALKTLEMNPCTPNNVEVLETRSAFPKIEDTEIISPVAERPEDRSDAEPENHVVVSYCPPIIEEEIPNPAADEAGGTAQVIYIDVQSMYQPQAKPEEEQENDPVGGAGYKPQMHLPINSTVEDIAAEEDLDKTAGYRPQANVNTWNLVSPDSPRSIDSNSEIVSFGSPCSINSRQFLIPPKDEDSPKSNGGGWSFTNFFQNKPND | 1097 | 239 | 859 | |||||
(IL-31Rβ) | 1 | OSMRfl IL-31Rβ | KSQWIKETCYPDIPDPYKSSILSLIKFKENPHLIIMNVSDCIPDAIEVVSKPEGTKIQFLGTRKSLTETELTKPNYLYLLPTEKNHSGPGPCICFENLTYNQAASDSGSCGHVPVSPKAPSMLGLMTSPENVLKALEKNYMNSLGEIPAGETSLNYVSQLASPMFGDKDSLPTNPVEAPHCSEYKMQMAVSLRLALPPPTENSSLSSITLLDPGEHYC | 979 | 218 | 762 | ||||
| 1 | G-CSF-R1 | 836 | 188 | 649 | ||||||
| 3 | G-CSF-R3 | 863 | 680 | 707 | 215 | 649 | ||||
| 4 | GCSF-R4 | 783 | 750 | 783 | 134 | 650 | ||||
(GLMR) | 1, 2, 6,12 | V32 GPL74512 | Q8NI17-11 Q8NI17-22 Q8NI17-66 8NI17-1212 | 7321 7642 6226 74512 | 194 | 5391 5712 4296 55212 | ||||
| 3,5 | v4 v1 | Q8NI17-33 Q8NI17-55 | 6623 6815 | 6393 6395 | 6493 6495 | 111 | 5523 5715 | |||
| 9 | GPL560, short | Q8NI17-99 | 560 | 9 | 552 | |||||
| 10 | GPL610 | Q8NI17-1010 | 611 | 593 | 598 | 60 | 552 | |||
| 11 | GLP626 | Q8NI17-1111 | 626 | 593 | 613 | 75 | 552 | |||
(gp130) | 1,3 | P40189-11 P40189-33 | NKRDLIKKHIWPNVPDPSKSHIAQWSPHTPPRHNFNSKDQMYSDGNFTDVSVVEIEANDKKPFPEDLKSLDLFKKEKINTEGHSSGIGGSSCMSSSRPSISSSDENESSQNTSSTVQYSTVVHSGYRHQVPSVQVFSRSESTQPLLDSEERPEDLQLVDHVDGGDGILPRQQYFKQNCSQHESSPDISHFERSKQVSSVNEEDFVRLKQQISDHISQSCGSGQMKMFQEVSAADAFGPGTEGQVERFETVGMEAATDEGMPKSYLPQTVRQGGYMPQ | 9181 8573 | 277 | 6411 5803 | ||||
| 1,3 | Q99665-11 Q99665-33 | 8621 7763 | 221 | 6421 5563 | ||||||
| 2 | Q99665-2 | 659 | 650 | 659 | 18 | 642 | ||||
| 1 | IL23R1 | NRSFRTGIKRRILLLIPKWLYEDIPNMKNSNVVKMLQENSELMNNNSSEQVLYVDPMITEIKEIFIPEHKPTDYKKENTGPLETRDYPQNSLFDNTTVVYIPDLNTGYKPQISNFLPEGSHLSNNNEITSLTLKPPVDSLDSGNNPRLQKHPNFAFSVSSVNSLSNTIFLGELSLILNQGECSSPDIQNSVEEETTMLLENDSPSETIPEQTLLPDEFVSCLGIV | 629 | 253 | 377 | |||||
| 2, 5, 6,7 | IL23R2F22 IL23R2F35 IL23R6 6 IL23R57 | Q5VWK5-22 Q5VWK5-55 Q5VWK5-66 Q5VWK5-77 | 3752 2645 3746 2277 | 28 | 3482 2375 3476 2007 | |||||
| 1 | SGRCYHLRHKVLPRWVWEKVPDPANSSSGQPHMEQVPEAQPLGDLPILEVEEMEPPPVMESSQPAQATAPLDSGYEKHFLPTPEELGLLGPPRPQVLA | 636 | 98 | 539 | ||||||
| 1 | Long | 662 | 94 | 569 | ||||||
| 2 | Short | 660 | 659 | 660 | 92 | 569 | ||||
(OBR) | B | 13.2, OBRb | 1165 | 303 | 863 | |||||
| A | 6.4, HuB219.3 | P48357-2 | 896 | 892 | 896 | 34 | 863 | |||
| C | 12.1 OBRa | P48357-3 | 958 | 892 | 958 | 96 | 863 | |||
| D | Hub219.2 | P48357-4 | 906 | 892 | 906 | 44 | 863 | |||
| GROUP 3 – SOLUBLE ALFA CHAINS | 1 | Long | RFKKTWKLRALKEGKTSMHPPYSLGQLVPERPRPTPVLVPLISPPVSPSSLGSDNTSSHNRPDARDPRSPYDISNTDYFFPR | 468 | 82 | 387 | ||||
| HCR1 | Membrane Form | 422 | 33 | 390 | ||||||
| HCR2 | Soluble Form, sIL11RA | Q14626-2 | 390 | 1 | 390 | |||||
| GROUP 4 – SLONG CHAIN RECEPTOR CHAINS | 1 | H20 | 459 | 196 | 264 | |||||
| 3 | H1 | P16871-2 | 298 | 293 | 298 | 35 | 264 | |||
| 1,2 | Q01113-11 Q01113-22 | 5211 5002 | 230 | 2921 2712 | ||||||
| 3 | Q01113-3 | 342 | 332 | 342 | 16 | 327 | ||||
(IL-15Rβ) | 1 | NCRNTGPWLKKVLKCNTPDPSKFFSQLSSEHGGDVQKWLSSPFPSSSFSPGGLAPEISPLEVLERDKVTQLLLQQDKVPEPASLSSNHSLTSCFTNQGYFFFHLPDALEIEACQVYFTYDPYSEEDPDEGVAGAPTGSSPQPLQPLSGEDDAYCTFPSRDDLLLFSPSLLGGPSPPSTAPGGSGAGEERMPPSLQERVPRDWDPQPLGPPTPGVPDLVDFQPPPELVLREAGEEVPDAGPREGVSFPWSRPPGQGEFRALNARLPLNTDAYLSLQELQGQDPTHLV | 551 | 286 | 266 | |||||
| 1 | KTHPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPWSPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQNSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADGEDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTSPGDEGPPRSYLRQWVVIPPPLSSPGPQAS | 538 | 283 | 256 | ||||||
| 1,3 | P24394-11 P24394-33 | CYVSITKIKKEWWDQIPNPARSRLVAIIIQDAQGSQWEKRSRGQEPAKCPHWKNCLTKLLPCFLEHNMKRDEDPHKAAKEMPFQGSGKSAWCPVEISKTVLWPESISVVRCVELFEAPVECEEEEEVEEEKGSFCASPESSRDDFQEGREGIVARLTESLFLDLLGEENGGFCQQDMGESCLLPPSGSTSAHMPWDEFPSAGPKEAPPWGKEQPLHLEPSPPASPTQSPDNLTCTETPLVIAGNPAYRSFSNSLSQSPCPRELGPDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPETWEQILRRNVLQHGAAAAPVSAPTSGYQEFVHAVEQGGTQASAVVGLGPPGEAGYKAFSSLLASSAVSPEKCGFGASSGEEGYKPFQDLIPGCPGDPAPVPVPLFTFGLDREPPRSPQSSHLPSSSPEHLGLEPGEKVEDMPKPPLPQEQATDPLVDSLGSGIVYSALTCHLCGHLKQCHGQEDGGQTPVMASPCCGCCCGDRSSPPTTPLRAPDPSPGGVPLEASLCPASLAPSGISEKSKSSSSFHPAPGNAQSSSQTPKIVNFVSVGPTYMRVS | 8251 8103 | 575 | 2511 2363 | |||||
| 1,2 | IL-3Rb | P32927-11 P32927-22 | RFCGIYGYRLRRKWEEKIPNPSKSHLFQNGSAELWPPGSMSAFTSGSPPHQGPWGSRFPELEGVFPVGFGDSEVSPLTIEDPKHVCDPPSGPDTTPAASDLPTEQPPSPQPGPPAASHTPEKQASSFDFNGPYLGPPHSRSLPDILGQPEPPQEGGSQKSPPPGSLEYLCLPAGGQVQLVPLAQAMGPGQAVEVERRPSQGAAGSPSLESGGGPAPPALGPRVGGQDQKDSPVAIPMSSGDTEDPGVASGYVSSADLVFTPNSGASSVSLVPSLGLPSDQTPSLCPGLASGPPGAPGPVKSGFEGYVELPPIEGRSPRSPRNNPVPPEAKSPVLNPGERPADVSPTSPQPEGLLVLQQVGDYCFLPGLGPGPLSLRSKPSSPGPGPEIKNLDQAFQVKKPPGQAVPQVPVIQLFKALKQQDYLSLPPWEVNKPGEVC | 8971 9032 | 437 | 4611 4672 | ||||
| GROUP 5 – SHORT TAIL RECEPTOR CHAINS | 1,3 | TLSP-R IL-XR | Q9HC73-11 Q9HC73-33 | KLWRVKKFLIPSVPDPKSIFPGLFEIHQGNFQEWITDTQNVAHLHKMAGAEQESGPEEPLVVQLAKTEAESPRMLDPQTEEKEASGGSLQLPHQPLQGGDVVTIGGFTFVMNDRSYVAL | 3711 2593 | 119 | 2531 1413 | |||
| 1,2 | IL-2Rg | P31785-11 P31785-22 | ERTMPRIPTLKNLEDLVTEYHGNFSAWSGVSKGLAESLQPDYSERLCLVSEIPPKGGALGEGPGASPCNQHSPYWAPPCYTLKPET | 3691 1792 | 86 | 2841 942 | ||||
| 1 | IL-13RA1 | P78552-1 | KRLKIIIFPPIPDPGKIFKEMFGDQNDDTLHWKKYDIYEKQTKEETDSVVLIENLKKASQ | 427 | 60 | 368 | ||||
| 1 | IL-13RA2 | Q14627-1 | RKPNTYPKMIPEFFCDT | 380 | 17 | 364 | ||||
| 1,5 | Q01344-11 Q01344-55 | 4201 2115 | 59 | 3621 1535 | ||||||
| 4 | Q01344-4 | 378 | 265 | 378 | 17 | 362 | ||||
| 1, 7, 8 | GMRα | P15509-11 P15509-77 P15509-88 | 4001 4347 2678 | 54 | 3471 3817 2148 | |||||
| 2 | P15509-2 | 410 | 376 | 410 | 64 | 347 | ||||
| 1 + 2 | SP11 SP22 | P26951-11 P26951-22 | RRYLVMQRLFPRIPHMKDPIGDSFQNDKLVVWEAGKAGLEECLVTEVQVVQKT | 3781 3002 | 53 | 3261 2482 | ||||
#Full names of receptors are: PRLR prolactin receptor; GHR growth hormone receptor; EPOR erythropoietin receptor; TPOR thrombopoietin receptor; (alternative name: myeloproliferative leukemia protein, MPL); LIFR leukemia inhibitory factor receptor; OSMR oncostatin-M-specific receptor subunit beta (alternative name: IL-31 receptor subunit beta); G-CSFR granulocyte colony-stimulating factor receptor; IL-31Rα, interleukin-31 receptor subunit alpha (alternative name: GLMR, Gp130-like monocyte receptor); IL-6Rβ interleukin 6 receptor beta (alternative name: gp130, glycoprotein 130); IL-12Rβ2 interleukin 12 receptor beta 2; IL-23R interleukin 23 receptor; IL-27R interleukin 27 receptor; IL-12Rβ1 interleukin 12 receptor beta 1; LEP-R leptin receptor (alternative name OB-R, obesity receptor); IL-6Rα interleukin 6 receptor alpha; IL-11Rα interleukin 11 receptor alpha; IL-7Rα interleukin 7 receptor alpha; IL-9Rα interleukin 9 receptor alpha; IL-2Rβ interleukin 2 receptor subunit beta (alternative name; IL-15Rβ, interleukin-15 receptor subunit beta); IL-21R interleukin 21 receptor; IL-4Rα interleukin 4 receptor alpha; βc cytokine receptor common subunit beta; CRLF2 cytokine receptor-like factor 2 (alternative name TSLPR, thymic stromal lymphopoietin protein receptor); γc cytokine receptor common subunit gamma (alternative name IL-2Rγ interleukin 2 receptor subunit gamma); IL-13Rα1 interleukin 13 receptor alpha 1; IL-13Rα2 interleukin 13 receptor alpha 2; IL-5Rα interleukin 5 receptor alpha; GM-CSFRα granulocyte-macrophage colony-stimulating factor receptor subunit alpha; IL-3Rα interleukin 3 receptor subunit alpha
aResidue numbering is with the signal peptide included
bIsoform 2: missing residues 24–124, but ICD is identical
cSF1b, Isoform 8 missing 1–71 ICD identical to Isoform 6
dIsoform 4: A24D & 25–46 missing, but ICD is identical
Fig. 3Short linear motifs form the basis for signaling choreography. a Sequence logo of the amino acid distribution in C1CR group 1 Box1 and Box2 motifs. Letter-height indicates the relative abundance of the given amino acid at the specific position. Red indicates an acidic residue, blue - a basic residue, green – a polar residue, black – a hydrophobic residue and pink – a neutral residue [102]. b SLiM and phosphorylation site mapping in C1CR group 1 ICDs. Regions with SLiMs are highlighted and colored according to the amount of overlap. Black indicates regions with no overlap. Yellow indicates regions with two overlapping SLiMs, whereas red indicates regions with more than two overlapping SLiMs. Predicted phosphorylation sites are indicated by grey diamonds [107] and confirmed and well-characterized sites by black diamonds. c Zoom on the membrane proximal 67-residue region of GHR-LF-ICD containing overlapping binding sites. Box1 and Box2 are highlighted in blue. Proteins were illustrated in IBS 1.0.3 [114]
Properties of disorder in C1CRs
| IDP characteristic | Key descriptors for C1CR ICDs |
|---|---|
| Specific amino acid composition | The C1CR-ICDs have a unique amino acid composition that distinguishes them from both folded proteins and IDPs. |
| Disordered | All the ICDs of the C1CRs are predicted disordered throughout their sequences, but has been shown experimentally only for the GHR and PRLR |
| Rich in SLiMs | #Several SLiMs are common to groups of the C1CRs, in particular • BOX1 motifs (JAK/SH2) ΦΦP.ΦP.P (JAK2) ΦP. P (JAK1) ΦΦP.ΦP.[P/Φ].[P/Φ](JAK3/TYK2) • 14–3-3 R[^DE](0,2)[^DEPG][ST][^PRIKGN]-P R[^DE](0,2)[^DEPG][ST][^P]* • SOCS2/3 pY [AFILVWY].[AFILVWY] (loose SH2-motif) • PDZ ..[ST].[ACVILF]* (Class 1) ..[VLIFY].[ACVILF]* (Class 2) ..[DE].[ACVILF]* (Class 3) • TRAF2/6 [PSAT].[QE] E (TRAF2) [P].[Q]..D (TRAF2) [P].[Q]..[FYWHDE] (TRAF6) • STAT [Y]..[P] (STAT1) [Y]..[Q] (STAT3) [Y][VLTFIC].. (STAT5) (promiscuous) [Y]..[F] (STAT6) • Phospho-degrons D [S]G.(2,3[ST] [LIVMP].(0,2)(T)P..[ST] • Dileucine motifs [D/E]...[L/I][L/I] [D/E]..LL • Tyrosine-based internalization motifs Y..[LMVIF] Only very few SLiMs have been addressed experimentally and only 7 three-dimensional structures exist of complexes. |
| Rich in PTMs | All C1CR-ICDs have numerous predicted phosphorylation sites distributed along the chain, but only few have been confirmed by MS or by mutational studies. Some SLiMs are regulated by phosphorylation |
| Alternative splice variants | Out of a total of 29 C1CRs, 16 have at least two isoforms differing in their ICDs, but up to five ICD isoforms are seen for some receptors (PRLR, IL-31R). Isoforms allow for network rewiring by insertion and deletion of specific SLiMs |
| Dynamic conformational ensemble | The CIDER analysis and measured |
| Multispecificity | Overlapping SLiMs dominates C1CR-ICDs and allow competition as a regulatory mechanism. This is made possible as the disordered chain can adapts to several different binding partner. |
#:^means that it cannot be a residue of this kind; * indicates the negatively charged C-terminal; For the JAK binding motifs, the similar PXP motif is shaded in grey. Φ illustrate ant hydrophobic residue
Fig. 4Network rewiring by disorder. Top; Comparison of a long and short isoform of PRLR. a Structural propensities of the PRLR-LF-ICD compared to PRLR-SF1b-ICD illustrated by differences in NMR secondary chemical shifts (SCSs) of Cα. Red coloring indicates lower Cα SCSs in PRLR-SF1b-ICD compared to PRLR-LF-ICD, and hence less helicity. b NMR chemical shift perturbations upon addition of POPC/POPS SUVs to PRLR-SF1b-ICD and c intensity ratios of PRLR-SF1b-ICD in the absence and the presence of POPC/POPS SUVs. Red circles highlight prolines and/or unassigned residues and shaded areas loss of NMR signal upon addition. Bottom; Predicted and selected binding SLiMs in the isoforms of d PRLR, e IL-31Rα (GLMR), and f LEPR (OBR). Introduction of potential new binding SLiMs in the unique sequences are indicated by the sequence of the SLiM, and the presence of SLiMs in the LF of the isoforms is indicated with a “+”. When a SLiM is not present, this is indicated by a “-”. The length of the ICD is indicated by the scale bars on top, and red lines illustrate the length of the unique sequence. “*” indicates a different SLiM compared to the binding site identified by mutations in [148]. Alternative names for the isoforms are given in the second row. For other C1CR isoforms, see Table S2
Fig. 5Conformational properties of C1CR-ICDs. a Plot of conformational parameters provided by CIDER analyses of the longest isoforms of the C1CR-ICDs. For GM-SCFRα, both isoform 1 and 2 were analyzed as they differed > 50%. Each receptor group is highlighted with different colors; 1 – blue, 2 – green, 3 -yellow, 4 – red, and 5 - purple. When sequences have a Pro warning, i.e. a Pro fraction > 0.15, the ICD is highlighted with a P on top. The numbers at the top are the compositional region assigned by CIDER. # residues are the numbers of residues in the ICD, FCR is the fraction of charged residues (FCR = f+ + f−), NCPR is the net charge per residue (NCPR = f+ − f−), κ is a measure of the charge mixing across the sequence (normalized between 0 and 1) and Ω additionally takes Pro mixing into account (normalized between 0 and 1). The dotted lines at 0.4, 0.3, and 0.1 in the FCR plot represent the borders for very highly charged (above), highly charged (above) and highly depleted (lower) in charges, respectively [160]. The area between the dotted lines at 0.1 and − 0.1 in the NCPR plot represents the range occupied by 70% of human IDPs [160]. b Experimental SAXS data (spheres) and Gaussian random coil model fit (lines) for PRLR-LF-ICD at different concentrations of NaCl (see color key), with the residuals from the fit shown below in units of standard deviations. c Pair distance distribution functions for the PRLR-LF-ICD SAXS data. The color coding is the same as in b)