| Literature DB >> 26054620 |
Charles E Chapple1, Benoit Robisson1, Lionel Spinelli2, Céline Guien1, Emmanuelle Becker1, Christine Brun3.
Abstract
Moonlighting proteins are a subclass of multifunctional proteins whose functions are unrelated. Although they may play important roles in cells, there has been no large-scale method to identify them, nor any effort to characterize them as a group. Here, we propose the first method for the identification of 'extreme multifunctional' proteins from an interactome as a first step to characterize moonlighting proteins. By combining network topological information with protein annotations, we identify 430 extreme multifunctional proteins (3% of the human interactome). We show that the candidates form a distinct sub-group of proteins, characterized by specific features, which form a signature of extreme multifunctionality. Overall, extreme multifunctional proteins are enriched in linear motifs and less intrinsically disordered than network hubs. We also provide MoonDB, a database containing information on all the candidates identified in the analysis and a set of manually curated human moonlighting proteins.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26054620 PMCID: PMC4468855 DOI: 10.1038/ncomms8412
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1MoonGO: the EMF identification pipeline.
Overlapping clusters are extracted from a PPI network using OCG. Clusters are annotated according to the GO annotations of their constituent proteins. Potential EMFs are then identified at the intersection of clusters involved in unrelated biological processes according to PrOnto GO term association probabilities.
Dissimilar biological processes.
| 53.5 | Nitrogen or nucleic acid metabolism | Signalling |
| 7.3 | Nucleic acid metabolism | Localization |
| 6.8 | Macromolecular metabolic process | Transport |
| 5.9 | RNA metabolic process | Signalling |
| 4.7 | Nucleic acid metabolism | Transport |
| 4.5 | Gene expression | Transport |
| 3.6 | Macromolecular metabolic process | Localization |
| 2.9 | Nucleic acid metabolism | Physiological processes |
| 2.7 | Nucleic acid metabolism | Phosphorus metabolism |
| 1.2 | Nucleic acid metabolism | Development |
Top 10 dissimilar function pairs by the percentage of candidates identified.
Candidate features.
Figure 2Protein features.
(a) Protein degree. (b) Protein isoforms. (c) The number of Pfam domains (including PfamB) predicted on each protein. (d) Protein disorder as calculated by disopred. The numbers shown are the percentage of a protein's residues that are disordered. Outliers are not shown. Red dots indicate mean values and the camel dots between Candidates and Hubs are the values of the known moonlighting proteins.
Figure 3ELMs per residue.
The plots show the number of ELMs divided by the length of each protein.
Figure 4Radar plots.
Radar plots showing the characteristics that were significantly different in candidates with respect to Multi-NCs (a) and hubs (b). Mean values are plotted for all features except association with cancer, where fold overrepresentation is shown. For Shortest Paths, the outermost data point is the most connected, that is, has the smallest Shortest Path value. For all others, the outermost data point is the one with the greatest value. These plots are purely descriptive and were built once the candidates were found. The features they describe were not used to identify the candidates.