| Literature DB >> 30626316 |
Morteza H Chalabi1, Vasileios Tsiamis1, Lukas Käll2, Fabio Vandin3, Veit Schwämmle4.
Abstract
BACKGROUND: Translational and post-translational control mechanisms in the cell result in widely observable differences between measured gene transcription and protein abundances. Herein, protein complexes are among the most tightly controlled entities by selective degradation of their individual proteins. They furthermore act as control hubs that regulate highly important processes in the cell and exhibit a high functional diversity due to their ability to change their composition and their structure. Better understanding and prediction of these functional states demands methods for the characterization of complex composition, behavior, and abundance across multiple cell states. Mass spectrometry provides an unbiased approach to directly determine protein abundances across different cell populations and thus to profile a comprehensive abundance map of proteins.Entities:
Keywords: Co-regulation; Protein complex; Statistics
Mesh:
Substances:
Year: 2019 PMID: 30626316 PMCID: PMC6327379 DOI: 10.1186/s12859-018-2573-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Schema of entire workflow to investigate protein complex behavior. Models and randomization methods that were not used in the final assessment of CORUM complexes are shown in grey. For a more detailed description of the workflow, see Methods
Summary of scoring models and randomization methods
| Model | Abbrv. | Output |
| Mean correlation | MCOM | Similarity to averaged abundance profile |
| Pairwise correlation | PCOM | Sum of pairwise similarities |
| Factor analysis | FAM | Weights for protein contribution to full set |
| Randomization | Abbrv. | Basis |
| Independent sampling | IS | Mix all values |
| Protein-centered sampling | PCS | Keep protein profiles |
| Protein- and cell type- centered sampling | PTCS | Keep protein and cell type profiles |
Fig. 2Comparison of models for significant co-regulations. Number of complexes (a) and proteins (b) with significant abundance profiles according to the different scoring models and randomizations calculated for different thresholds for their false discovery rate (FDR)
Fig. 3Performance of scoring models measured by robustness to 50%, 75% and 100% artificially added random proteins. Proteins were categorized into complex subunits and random proteins. True positive and false positives rates (TPR and FPR) were given by the fraction of true positives and false positives at a given FDR threshold. MCOM and FAM models lead to better performance. Only slight difference between PCS and PTCS randomizations can be observed
Fig. 4Examples for complexes with highly co-regulated proteins. Upper panels: hierarchical clustering of pairwise correlations between protein abundance profiles. The sidebars show the significance of MCOM and FAM models (PCS randomization). Middle panels: Network visualization of profile similarities. Edge widths correspond to pairwise correlations. Grey tones of the proteins depict FDR significance calculated by PCOM (PCS randomization). Lower panels: STRINGdb (version 10) networks of proteins. Edge width is given by interaction confidence