| Literature DB >> 24972093 |
Djamel Harbi1, Paul M Harrison1.
Abstract
Prions are transmissible, propagating alternative states of proteins. Prions in budding yeast propagate heritable phenotypes and can function in large-scale gene regulation, or in some cases occur as diseases of yeast. Other 'prionogenic' proteins are likely prions that have been determined experimentally to form amyloid in vivo, and to have prion-like domains that are able to propagate heritable states. Furthermore, there are over 300 additional 'prion-like' yeast proteins that have similar amino-acid composition to prions (primarily a bias for asparagines and glutamines). Here, we examine the protein functional and interaction networks that involve prion, prionogenic and prion-like proteins. Set against a marked overall preference for N/Q-rich prion-like proteins not to interact with each other, we observe a significant tendency of prion/prionogenic proteins to interact with other, N/Q-rich prion-like proteins. This tendency is mostly due to a small number of networks involving the proteins NUP100p, LSM4p and PUB1p. In general, different data analyses of functional and interaction networks converge to indicate a strong linkage of prionogenic and prion-like proteins, to stress-granule assembly and related biological processes. These results further elucidate how prions may impact gene regulation, and reveal a broader horizon for the functional relevance of N/Q-rich prion-like domains.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24972093 PMCID: PMC4074094 DOI: 10.1371/journal.pone.0100615
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic representation of the relationship between the four data sets KPs, EPDs, EPNs and NQPs.
Figure 2Disorder content of known prions (KPs), experimental prionogenic domains (EPDs) and prion negatives (EPNs).
Monte Carlo samples (n = 10000) of the same total protein length as each of the three data sets were made and the total amount of disorder (from the DISOPRED2 program [25]) was annotated. A fractional piece of one protein was used to make up the exact residue count for the sample size. The plot shows the distribution of disorder content for these samples for the KP set. The actual observed value is indicated by an arrow. The % of samples that have greater disorder than the observed value for each data set is indicated in the table below the histogram.
Figure 3The protein interaction network for the EPD data set.
We drew this original picture using the publicly distributed Cytoscape tool that can be used for depicting networks [49], with the data sets of protein interactions that we derived (as described in ) as input. We have coloured the nodes as follows: --- known prions (KP) = BLACK; --- other proteins in the EPD data set = GREY; --- EPN data set = YELLOW; --- NQPs that are also prion predictions made using the PrionScan algorithm = CYAN; --- any other NQP = DARK BLUE; --- other interactors = BROWN. The non-amyloid prion accessory protein STD1 that underlies the [GAR+] prion [50] (Q02794) is part of the NQP data set that we derived, since it has an N/Q-rich domain. We have coloured its node at the lower right of the network, ORANGE. The prion/prionogenic proteins are labelled with their UniProt accessions and standard gene names. The three EPD hubs are pointed out with red arrows. A red box surrounds common interactors between the LSM4 and PUB1 proteins.
Enrichments for different sets of sequences in the interactor lists for prion, prionogenic and prion-like proteins in budding yeast*.
|
| KP (10/5796 proteins) | EPD (27/5796 proteins) | NQP (354/5796 proteins) | EPN (18/5796 proteins) | Disordered proteins (>0.5 disordered) not in NQP data set (251/5796 proteins) |
|
| 1/152 (NS) | 5/152 (0.010) | 20/152 (NS) | 1/152 (NS) | 6/152 (NS) |
|
| 5/314 (0.010) | 7/314 (0.019) |
| 7/314 (0.0074) | 9/314 ( |
|
| 1/259 (NS) | 7/259 (0.0074) | 46/259 (0.0050) | 3/259 (NS) | 7/259 (0.0013, depletion) |
|
| 20/4405 (NS) |
| 251/4405 | 46/4405 (0.0047) |
|
*The interactor lists are in the rows of the table, and the sets that are tested as enriched/depleted or not, are in the columns. These sets are explained in the main text. At the head of each column is given the total number of proteins of each set type, and the total number of interactions involving them. In each cell, is given the number of interactors that are members of the sets tested as enriching/depleted, expressed as a fraction of the total number of interactors. In brackets is given the hypergeometric probability for this enrichment/depletion, with NS for non-significant (P-value threshold = 0.05). Values that are significant enrichments after Holm-Bonferroni correction are in bold, significant depletions in italics.
These P-values become non-significant after a Holm-Bonferroni correction over all tests performed (totalling 72).
Enrichments of proteins with and without protein-binding domains (PBDs), in the interactor lists of the EPDs*.
|
| NQPs | Proteins containing PBDs but not NQP domains | Proteins containing NQP domains but not PBDs | Proteins containing NQP domains and PBDs | Disordered proteins (>0.5 disordered) not in NQP data set |
|
|
| 89/314 |
| 15/314 (NS) |
|
|
|
| 8/78 |
| 9/78 (0.006) | 0/78 (0.005, depletion) |
|
| 27/236 (NS) | 81/236 | 21/236 (NS) | 6/236 (0.048) | 9/236 (0.022, depletion) |
*As for Table 1.
**Proteins containing PBDs are those containing predicted coiled-coil regions or protein-binding domains defined specifically as such, in InterPro (see for details). The enrichment of proteins containing PBDs in the list of their own interactors is very highly significant (P = 4e−52).
As for Table 1.