| Literature DB >> 31797598 |
Ramiz Somjee1,2, Diana M Mitrea, Richard W Kriwacki.
Abstract
Biomolecular condensates form through a process termed phase separation and play diverse roles throughout the cell. Proteins that undergo phase separation often have disordered regions that can engage in weak, multivalent interactions; however, our understanding of the sequence grammar that defines which proteins phase separate is far from complete. Here, we show that proteins that display a high density of charged tracts within intrinsically disordered regions are likely to be constituents of electrostatically organized biomolecular condensates. We scored the human proteome using an algorithm termed ABTdensity that quantifies the density of charged tracts and observed that proteins with more charged tracts are enriched in particular Gene Ontology annotations and, based upon analysis of interaction networks, cluster into distinct biomolecular condensates. These results suggest that electrostatically-driven, multivalent interactions involving charged tracts within disordered regions serve to organize certain biomolecular condensates through phase separation.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31797598 PMCID: PMC6939312
Source DB: PubMed Journal: Pac Symp Biocomput ISSN: 2335-6928
Figure 1.Net charge per residue plots of NPM1 (a) and another nuclear protein, SURF6 (b). Regions of predicted disorder highlighted in purple. Highlighted charged tracts have area greater than 1 Process diagram for the calculation of ABTdensity (c).
Table showing the ABTscore and ABTdensity value ranges for different percent ranges of proteins. The designated groups and number of proteins (N) are based on ABTdensity.
| Protein Percent Range | ABTscore Value Range | ABTdensity Value Range | Mean ABTdensity Value | N |
|---|---|---|---|---|
| Top 5% (Group 1) | 64–650 | 0.21–0.94 | 0.29 | 537 |
| ≤5% to <15% (Group 2) | 36–54 | 0.12–0.21 | 0.21 | 1094 |
| ≤15% to < 30% (Group 3) | 22–36 | 0.10–0.14 | 0.14 | 1642 |
| Remaining (Group 4) | 0–22 | 0–0.10 | 0.06 | 7673 |
Figure 2.Scatter plot showing correlation between ABTscore values (blue data points) and ABTdensity values (orange data points), and the number of disordered residues in each protein with one or more disordered regions (a). Enhanced box plots showing ABTscore (b) and ABTdensity (c) distributions for the whole proteome, nucleolar proteome, stress granule (SG) proteome, and NUP98 interactors. P values are reported when a protein set’s mean is different from the whole proteome’s at a p<0.05.
Figure 3.Heatmap of the Gene Ontology enrichment analysis for processes annotations of proteins in Groups 1 through 4. Asterisks indicate significance at p≤0.05.
Figure 4.Interaction network for Group 1. Nodes represent proteins and edges represent physical or genetic interactions. Orphan proteins are not shown. Each color represents clusters described in Table 3. Yellow, nucleolar proteins; green, proteins associated with nucleosomes and heterochromatin; cyan, proteins associated with transcription bodies; and blue, proteins associated with protein degradation.
Table showing the results of the interaction enrichment analysis for each Group 1–3. Enrichment P-value
| Group | Observed Interactions | Expected Interactions | Interaction Fold | Enrichment |
|---|---|---|---|---|
| 1 (n=513) | 2624 | 1289 | 2.04 | <10−16 |
| 2 (n=1045) | 7147 | 4700 | 1.52 | <10−16 |
| 3 (n=1579) | 10382 | 7021 | 1.22 | <10−16 |
Table showing Gene Ontology (GO) terms associated with proteins within each cluster from the interaction network of Group 1 (see Figure 4) and identification of the potential condensate each cluster represents. N is the number of proteins in each cluster.
| Cluster | N | GO Process | GO Function | GO Component | Condensate |
|---|---|---|---|---|---|
| Yellow | 60 | rRNA processing, ribosome biogenesis | snoRNA binding, translation initiation factor activity, RNA helicase activity | Nucleolus, Cajal body | Nucleoli |
| Green | 36 | Histone, and chromatin binding, dimerization activity | Histone, and chromatin binding, dimerization activity | Nucleosome, heterochromatin, PML body, | Nucleosome and heterochromatin |
| Cyan | 34 | RNA polymerase activity, transcription initiation activity | RNA polymerase activity, transcription initiation activity | RNA polymerase complex | Transcription bodies |
| Blue | 18 | Ubiquitin, ubiquitin-like, protein transferase activity | Ubiquitin, ubiquitin-like, protein transferase activity | Ubiquitin ligase complex, transferase complex | Protein Degradation |