| Literature DB >> 31626641 |
Ruchi Lohia1, Reza Salari1, Grace Brannigan1,2.
Abstract
The role of electrostatic interactions and mutations that change charge states in intrinsically disordered proteins (IDPs) is well-estEntities:
Year: 2019 PMID: 31626641 PMCID: PMC6821141 DOI: 10.1371/journal.pcbi.1007390
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Sequence-based decomposition of the BDNF prodomain.
a) The two functional domains of precursor BDNF: the disordered prodomain considered in this manuscript and the structured mature domain BDNF (mBDNF). b) The mean hydrophobicity (〈H〉) per residue (top), given by the Kyte-Dolittle [65] score averaged over a three residue window, and scaled to fit between 0 and 1 was digitized (bottom) according to a cutoff at 〈H〉 > 0.37. Four or more contiguous residues above the cutoff were identified as forming a hydrophobic blob. Eight hydrophobic “h” blobs (darkgrey) are identified along with 3 “p” blobs of low hydrophobicity (light grey). c) The diagram of IDP states proposed by Das and Pappu [21], based on fraction of positive (f+) and negative (f−) charged residues, and annotated by the location of the simulated BDNF prodomain and each blob identified in panel b. d) Location of simulated BDNF prodomain and each blob on an Uversky diagram [66] of IDPs and globular proteins, as a function of absolute net charge per residue (|NCPR|) and 〈H〉, with the boundary line between folded and disordered proteins given by the equation in the legend. e) Blobs identified in panel b, colored according to (i) the region of the Das and Pappu [21] diagram in panel c or (ii) sign of net charge, where red is negatively-charged, blue is positively-charged, and white is neutral. The blob h2b contains the Val66Met SNP and is marked with star. Additional properties of the blob sequences can be found in Table 1.
Sequence based properties of hydrophobic (h) and linker (p) blobs identified in the BDNF prodomain, as shown in Fig 1.
| Region | Group | Blob | N | NCPR | 〈 | FCR | Sequence | R | P | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| I | p1 | p1 | 8 | 0.00 | 0.37 | 0.25 | 0.13 | 0.13 | 0.8 | EANIRGQG | 2 | 0.00 |
| h1 | h1a | 8 | 0.13 | 0.52 | 0.13 | 0.00 | 0.13 | 1.0 | GLAYPGVR | 1 | 0.13 | |
| h1b | 6 | -0.17 | 0.49 | 0.17 | 0.17 | 0.00 | 0.1 | TLESVN | 1 | 0.00 | ||
| p2 | p2 | 7 | 0.29 | 0.34 | 0.29 | 0.00 | 0.29 | 0.4 | GPKAGSR | 2 | 0.14 | |
| h2 | h2a | 9 | -0.11 | 0.58 | 0.11 | 0.11 | 0.00 | 0.7 | GLTSLADTF | 1 | 0.00 | |
| h2b(V66) | 8 | -0.38 | 0.54 | 0.38 | 0.38 | 0.00 | 0.3 | HVIEELLD | 4 | 0.00 | ||
| h2b(M66) | 8 | -0.38 | 0.50 | 0.38 | 0.38 | 0.00 | 0.3 | HMIEELLD | 4 | 0.00 | ||
| II | p3 | p3 | 15 | -0.13 | 0.21 | 0.53 | 0.33 | 0.20 | 0.1 | EDQKVRPNEENNKDA | 3 | 0.06 |
| III | h3 | h3a | 4 | -0.25 | 0.45 | 0.25 | 0.25 | 0.00 | N/A | DLYT | 2 | 0.00 |
| h3b | 5 | 0.20 | 0.60 | 0.20 | 0.00 | 0.20 | N/A | RVMLS | 1 | 0.00 | ||
| h3c | 5 | -0.20 | 0.49 | 0.20 | 0.20 | 0.00 | N/A | QVPLE | 1 | 0.20 | ||
| h3d | 7 | -0.14 | 0.70 | 0.14 | 0.14 | 0.00 | 1.0 | PLLFLLE | 1 | 0.14 | ||
| V66 Seq | 91 | -0.09 | 0.44 | 0.26 | 0.18 | 0.09 | 0.2 | 2 | 0.07 | |||
| M66 Seq | 91 | -0.09 | 0.44 | 0.26 | 0.18 | 0.09 | 0.2 | 2 | 0.07 |
Number of residues in the blob
Net charge per residue
Mean hydrophobicity, average of Kyte-Dolittle [65] scores for each residue in the blob scaled to fit between 0 and 1
Fraction of charged residues
Fraction of negatively charged residues
Fraction of positively charged residues
Charge distribution parameter κ as defined by Das and Pappu [21], calculated using CIDER [67]
Region in phase diagram proposed by Das and Pappu [21] (Fig 1c)
Fraction of Proline residues
Fig 2Comparison of MD and NMR observables.
a) ΔδC (top), ΔδC (middle), ΔδC-ΔδC (bottom) values from NMR at 280K (black lines) [63] and MD at 300K for the V66 (a) and M66 (b) sequences. The gray region represents a discrepancy of more than 1 ppm from NMR secondary chemical shifts. Root-mean-squared deviation (RMSD) represents the deviation between the NMR and MD values. Error at each residue is calculated as the standard error in the mean, where n = 1088 is the product of the total number of replicas simulated and the average number of roundtrips per replica. Panels are annotated by a blob representation of the prodomain, as in Fig 1e(i); vertical grey lines in each panel represent the blob boundaries.
Fig 3Effects of Val66Met on secondary structure.
a) ΔδC-ΔδC values for the V66 and M66 sequences from NMR [63]. Values on top are equivalent to the two NMR curves shown in Fig 2 (bottom panel), while the difference between the two curves is shown at the bottom. b) Helix (top) or β (bottom) propensity for each simulated residue of the 300K replica, defined as the probability of a given residue being part of a sequence of four or more consecutive residues whose dihedral angles place them in the helical (left) region or β (right) region of the Ramachandran map (further described in Methods). Errors represent standard error of a Bernoulli trial with n samples, where n = 1088 is the product of the total number replicas and the average number of roundtrips per replica. c) Difference (M66-V66) between probabilities of secondary structure formation of a given length, for helix (top) and β (bottom). Panels are annotated by a blob representation of the prodomain, as in Fig 1e(i); vertical grey lines in each panel represent the blob boundaries. d) Contact probability for each residue pair within the h2 group for V66 (top) and M66 (bottom) sequences. Each residue in group h2 is annotated with a circle representation and contacts found in at least 50% of the frames are represented with an edge. e) Difference (M66-V66) between the contact probabilities shown in panel d. Contacts with a population difference of at least 15% between the V66 and M66 sequences are represented by an edge.
Fig 4Detection of tertiary enrichment.
To decouple short-range and long-range structural correlations, this work grouped segments of the protein into blobs using sequence, and then compared contacts between the blobs to those expected for an analogous self-avoiding heteropolymer (SAHP). The SAHP was parameterized by extracting local properties (size and shape) of blobs from the real protein (RP) trajectory.
Fig 5Effect of Val66Met on contacts between blobs.
a) Blob-blob contact probability for the V66 self-avoiding heteropolymer (SAHP) from Monte Carlo simulations (further described in Methods). The black boxes mark the regions identified. b) Blob-blob contact probability shown in panel a for the V66 (left) and M66 (right) sequences of the real protein (RP). The x and y axes are annotated with cartoon representation of the prodomain; circles are drawn to the scale of each blob’s size. c) Population of contacts in SAHP, RP (top) and enrichment in RP contacts with respect to SAHP contacts (bottom) for each region pair. The errors represent standard errors (n = 1088 as described in Methods). d) Difference (M66-V66) between the contact probabilities shown in panel b. e) Differences shown in panel d with respect to SAHP; interactions more frequently found in M66 or V66 sequence are in blue and orange respectively.
Fig 6Secondary structure coupling between blobs on either side of the p3 linker.
β propensities at each residue in the V66 sequence (top) and the M66 sequence (bottom) for four clusters. Frames were first clustered by whether the h3b-h2b (a) or h3b-h1a (b) contact was formed (purple) or broken (green), and then by whether β structure was present (solid) or absent (dashed) in h2b (panel a) or h1a (panel b). The dark-gray window indicates the contacting blob that is constrained to have high or vanishing values by construction of the cluster, while the white window indicates the contacting blob without constrained secondary structure. If the contact is coupled to simultaneous β-strand formation, the peak within the white window for the solid purple curve should be significantly higher than other curves. Errors represent standard error of a Bernoulli trial with n number of samples, where n is the product of the total number of unique replicas in a given cluster and the average number of roundtrips per replica. Panels are annotated by a blob representation of the prodomain, as in Fig 1e(i).
Fig 7Effect of secondary structure in group h2 on which residues form the cross-boundary h2-h3 contact.
a) Contact probability at each residue in h2b with each residue in h3 for V66 (left) and M66 (right) sequences. b) Difference (M66-V66) between the contact probabilities shown in panel a. c) Representative conformations of V66 sequence (top) and M66 sequence (bottom) showing preferred residue-level contacts in VDW representations. Residues are colored by residue type: blue:basic, red:acidic, cyan:polar, grey:hydrophobic except methionine, Met: yellow. The chain is colored according to the Das and Pappu diagram in Fig 1c. Tubes represent hydrophobic “h” blobs whereas lines represent non-hydrophobic linker “p” blobs. d) Contact probability between residues 63-69 and each residue in h3ab, when respective secondary structure is formed at each residue, for both the V66 (left) and M66 (right) sequences. Residue labels are colored according to residue type: blue:basic, red:acidic, grey:hydrophobic/polar and Met: yellow.
Fig 8Simulation convergence.
a) Trajectory (left) and distribution (right) of R and R for the 300K replicas. The shaded region represents the equilibration period discarded from the distribution and from all analysis presented in Results and Discussion. Experimental values of R from NMR diffusion [63] are 2.24±0.1 nm for the V66 sequence and 2.20±0.1 nm for the M66 sequence, and are indicated by dashed lines. b) Trajectory of helix (top) and β (bottom) propensity at residue 66 for the 300K replicas of both sequences. c) Trajectory of enrichment of total Region I-Region III contacts relative to SAHP in the 300K replica. The trajectory shows averages over a 100 ns moving window in panel a, b and c. d) Number of replicas forming the V66-V94 contact (top) and the M66-M95 contact (bottom) vs the simulation time. e) The number of round trips completed by each replica over the 1.2 μs production period.
Fig 9Parameterization of self-avoiding heteropolymer.
a) 〈R〉 vs 〈R〉 for each blob of V66 sequence. Blobs are colored according to the Das and Pappu diagram in Fig 1c. Statistical error was smaller than the circles used for the representation of each blob. b) The distribution of normalized excess distances across all blob-pairs in the V66 RP, where |i − j| > 1. c) Relationship between the radius of gyration R, end to end distance R, and excess distance d, calculated for each blob or blob pair using a RP trajectory. d) The SAHP is a chain with each monomer representing a blob of the RP and modeled as a hard sphere. Each monomer i has radius aR and is separated from monomer i + 1 by bond length (R + R)/2. Bond lengths are constrained and bond angles can rotate freely.