| Literature DB >> 32345598 |
Abstract
Protein subcellular localization is an essential and highly regulated determinant of protein function. Major advances in mass spectrometry and imaging have allowed the development of powerful spatial proteomics approaches for determining protein localization at the whole cell scale. Here, a brief overview of current methods is presented, followed by a detailed discussion of organellar mapping through proteomic profiling. This relatively simple yet flexible approach is rapidly gaining popularity, because of its ability to capture the localizations of thousands of proteins in a single experiment. It can be used to generate high-resolution cell maps, and as a tool for monitoring protein localization dynamics. This review highlights the strengths and limitations of the approach and provides guidance to designing and interpreting profiling experiments.Entities:
Keywords: Omics; cell biology; cell fractionation; cellular organelles; mass spectrometry; organellar proteomics; proteomics; spatial proteomics; systems biology
Mesh:
Substances:
Year: 2020 PMID: 32345598 PMCID: PMC7338086 DOI: 10.1074/mcp.R120.001971
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Fig. 1.Generic workflow for generating organellar maps through proteomic profiling. Cells are lysed, and released organelles are partially separated by differential centrifugation (top) or density/velocity gradient centrifugation (bottom). Differential pellets or gradient fractions are analyzed by quantitative mass spectrometry. For each protein, an abundance distribution profile across the fractions is obtained. Organelles have overlapping but distinct profiles, and proteins predominantly associated with the same organelle have similar profiles. Dimensionality reduction tools (such as PCA) reveal groups of proteins with similar subcellular localization. By overlaying organellar marker proteins (color coded), the identity of clusters is revealed. Machine learning can be used to assign proteins to the nearest cluster. (Organellar map reproduced from (12)).
Global organelle profiling studies with large subcellular localization datasets (since 2016)
| Profiling method/Lab | Separation technique | Quantification strategy | Clustering method | Visualization | Cell type/tissue | Refs | Localization data available from |
|---|---|---|---|---|---|---|---|
| Protein Correlation Profiling (PCP) (Mann Lab) | Velocity gradient centrifugation | Label-free | HC, PCP, SVM | HC | Mouse liver | ( | |
| HyperLOPIT (Lilley Lab) | Density gradient centrifugation | TMT | SVM | PCA | Mouse ES cells | ( | |
| SVM | PCA | U2OS (human) | ( | Supplemental Data of Ref. | |||
| SVM | PCA, t-SNE | Arabdidopsis thaliana callus | ( | Supplemental Data of Ref. | |||
| SVM | PCA | Saccharomyces cerevisiae | ( | ||||
| SVM | PCA | Cyanobacterium synechocystis | ( | ||||
| LOPIT variant (Cristea Lab) | Density gradient centrifugation | TMT | RF, NN, SVM | t-SNE | Human fibroblasts | ( | Supplemental Data of Ref. |
| Dynamic Organellar Maps (Borner Lab) | Differential centrifugation | SILAC | SVM | PCA | HeLa (human) | ( | |
| Label-free; SILAC; TMT | SVM | PCA | Mouse primary neurons | ( | Supplemental Data of Ref. | ||
| SILAC | SVM | PCA | MutuDC (mouse) | ( | |||
| LOPIT-DC (Lilley Lab) | Differential centrifugation | TMT | SVM | PCA | U2OS (human) | ( | |
| Prolocate (Lobel Lab) | Differential and density gradient centrifugation | iTRAQ, TMT | PCP | PCA | Rat liver | ( | |
| SubCellBarCode (Lehtiö Lab) | Differential centrifugation and detergent extraction | TMT | SVM | t-SNE | A431, U251, MCF7, NCIH322, HCC827 (human) | ( |
Other main resources for spatial proteomes are the imaging-based Human Cell Atlas (https://www.proteinatlas.org/humanproteome/cell), the first proximity labelling based organellar map (https://cell-map.org/), the UniProt database (https://www.uniprot.org), and the Compartments database (https://compartments.jensenlab.org/Search).
Abbreviations: HC, Hierarchical Clustering; NN, Neural Networks; PCA, Principal Component Analysis; PCP, Protein Correlation Profiling; RF, Random Forest; SVM, Support Vector Machine; t-SNE, t-distributed Stochastic Neighbor Embedding; UMAP, Uniform Manifold Approximation and Projection for Dimension Reduction.
Choosing the right spatial proteomics approach
| Research Question | Method | Strength | |||
|---|---|---|---|---|---|
| Single protein | Static | Where is protein X? | Localization database ( | Fast, multiple sources for cross-referencing | |
| Static/Dynamic | Where is protein X? | [Microscopy] | Multi-compartment localizations and transient interactions captured | ||
| Is protein X associated with compartment Y? | Proximity labelling (APEX, BioID with protein X as bait) | ||||
| Single subcellular compartment/location | Static/Dynamic | What is the composition of compartment Y? | Proximity labelling (APEX, BioID using organelle-specific markers as baits) | Very sensitive | |
| Single organelle profiling | No constructs/cell lines | ||||
| Global–all compartments and locations, the complete spatial proteome | Static | What is the composition of all organelles in a given cell type? | Multi organelle profiling (gradient centrifugation; long gradients for high resolution; differential centrifugation for higher throughput) | No labelling reagents, no tagging/cell line generation; relatively rapid; a single experiment covers thousands of proteins; peptide level data. | |
| Proximity labelling (multiple baits for every compartment) | Very sensitive, multi-compartment localizations | ||||
| Imaging (one cell line or antibody per protein) | Direct visualization, also in relation to other structures/proteins; multi-compartment localizations | ||||
| Dynamic | Which proteins change subcellular localization upon a specific perturbation, drug treatment, genetic alteration etc? Which organelles change composition upon perturbation? | Membrane-nucleus-cytosol split | Simple, robust, deep coverage from one experiment, little MS measurement time | ||
| Multi organelle profiling (most robust by differential centrifugation) | Sensitive, deep coverage from one experiment | ||||
See also (5), including the supplemental data, for a detailed discussion.
Compartments resolved by different implementations of organellar profiling
| Refs | Profiling method | Cell type | Organelles | Other compartments |
|---|---|---|---|---|
| ( | HyperLOPIT | Mouse ES cells | Endo; ER/Golgi; Lys; Mito; Nuc; Pex; PM | Cyt; ribosome; proteasome; actin cytoskeleton; extracellular matrix |
| ( | Dynamic Organellar Maps | HeLa (human) | Endo; ER; ERGIC; ER_HC; Golgi; Lys; Mito; Nuc; Pex; PM | Cyt; large protein complexes; actin binding proteins |
| ( | LOPIT variant | Fibroblasts (human) | ER; Golgi; Lys; Mito; Pex; PM | Cyt |
| ( | Prolocate | Rat liver cells | ER; Golgi; Lys; Mito; Nuc; Pex; PM | Cyt |
| ( | PCP | Mouse liver cells | Endo; ER; Golgi; Lys; Mito; Nuc; Pex; PM | Cyt; lipid droplets |
| ( | LOPIT-DC | U2OS (human) | ER; Golgi; Lys; Mito; Nuc; Pex; PM | Cyt; ribosome; proteasome |
| ( | SubCellBarCode | A431, U251, MCF7, NCIH322, HCC827 (human) | Secretory 1 (Golgi, Endo/Lys); Secretory 2 (ER, Pex); Secretory 3 (ER, Mito); Secretory 4 (PM); Nuc; Mito | Cyt/cytoskeleton |
Only one recent representative study is shown per laboratory and method.
Abbreviations: Endo, endosome; ER, endoplasmic reticulum; ERGIC, ER-Golgi intermediate compartment; ER_HC, ER High curvature; Golgi, Golgi apparatus; Lys, lysosome; Mito, Mitochondria; Nuc, Nucleus; Pex, peroxisome; PM, plasma membrane.
*(21) predominantly used mixed compartment classifiers.
Applications of comparative global organellar profiling
| Method | Research Question/Application | Reference |
|---|---|---|
| Dynamic Organellar Maps | EGF signaling | ( |
| Disease mechanism of AP-4 deficiency syndrome | ( | |
| AP-5 mediated protein transport | ( | |
| Characterization of drug action to enhance cross presentation in dendritic cells | ( | |
| LOPIT variant | HCMV infection | ( |
| LOPIT-DC | Tethering complexes of the Golgi | ( |
| PCP | Non-alcoholic fatty liver disease in mice | ( |
| SubCellBarCode | EGF signaling | ( |
Fig. 2.Organellar profiling of the core proteasome performed with different MS quantification strategies. HeLa cell lysates were fractionated into five pellets by differential centrifugation (Dynamic Organellar Maps approach (17)). The relative abundances of the 14 proteasomal core subunits (PSMA1 to PSMB7) were quantified across the fractions (normalized to sum 1). Because the core proteasome is a stable complex, the profiles are expected to be identical, and deviations largely reflect quantification error. Quantification was achieved by four different approaches (all based on data-dependent acquisition): label-free quantification using the MaxLFQ (43) algorithm; quantification against an invariant SILAC (44) heavy reference; labeling with TMT10-plex and MS3 quantification (SPS method; (47)); and labeling with EASI-tag 6-plex (46) and MS2 quantification. A, Relative abundance profiles; subunits in light gray, means in dashed black. SILAC quantification produces a tight profile cluster with finely nuanced resolution of small differences in the low abundance fractions (1–3). The LFQ profiles show substantially more scatter. TMT profiles are tighter than the LFQ profiles, but have a flatter shape in the first three fractions. The EASI-tag profile has the largest dynamic range. B, Profile scatter, i.e. distribution of (Manhattan) distances of the 14 profiles in A) to the average profile. Boxes show mean (line) and 1st to 3rd quartile, whiskers 5th-95th percentile; data points outside this range are not shown. SILAC quantification has the lowest scatter (smallest mean and tightest distribution), whereas LFQ has the highest scatter. TMT10 and EASI-tag 6 show similar intermediate levels. C, PCA plot of the 14 core proteasome subunit profiles shown in A). PCA was jointly applied to all 4 × 14 = 56 profiles, but each plot only shows the profiles obtained with the indicated quantification method; all plots have the same scale, center, and PC loadings. PSMA and PSMB subunits are color coded. SILAC quantification shows the tightest cluster, and largely resolves the A/B subcomplexes. TMT and EASI-tag show partial resolution and intermediate cluster tightness. SILAC, LFQ and TMT data were published previously (17); EASI-tag profiles were also generated in house (our unpublished data). All raw files were processed with MaxQuant (49). Importantly, the same sample set was used for LFQ, SILAC and EASI-tag quantification; for technical reasons, a very similar replicate of this set was used for TMT quantification. The LFQ profiles were obtained by reprocessing the SILAC raw files with detection of light peptides only (heavy reference channel ignored). The total MS analysis time was similar for all samples (32–40 h per map), as was the quality of instrumentation (LFQ/SILAC: Orbitrap HF; EASI-tag: Orbitrap HFX; TMT: Orbitrap Lumos).