| Literature DB >> 34046695 |
Saiful Effendi Syafruddin1, M Aiman Mohtar1, Teck Yew Low2, Adaikkalam Vellaichamy3, Nisa Syakila A Rahman1, Yuh-Fen Pung4, Chris Soon Heng Tan5.
Abstract
Protein-protein interactions are fundamental to various aspects of cell biology with many protein complexes participating in numerous fundamental biological processes such as transcription, translation and cell cycle. MS-based proteomics techniques are routinely applied for characterising the interactome, such as affinity purification coupled to mass spectrometry that has been used to selectively enrich and identify interacting partners of a bait protein. In recent years, many orthogonal MS-based techniques and approaches have surfaced including proximity-dependent labelling of neighbouring proteins, chemical cross-linking of two interacting proteins, as well as inferring PPIs from the co-behaviour of proteins such as the co-fractionating profiles and the thermal solubility profiles of proteins. This review discusses the underlying principles, advantages, limitations and experimental considerations of these emerging techniques. In addition, a brief account on how MS-based techniques are used to investigate the structural and functional properties of protein complexes, including their topology, stoichiometry, copy number and dynamics, are discussed.Entities:
Keywords: Affinity purification coupled to mass spectrometry (AP-MS); Co-fractionation mass spectrometry (coFrac-MS); Cross-linking mass spectrometry (XL-MS); Proximity-dependent biotinylation coupled to MS (PDB-MS); Thermal proximity coaggregation (TPCA)
Mesh:
Substances:
Year: 2021 PMID: 34046695 PMCID: PMC8159249 DOI: 10.1007/s00018-021-03856-0
Source DB: PubMed Journal: Cell Mol Life Sci ISSN: 1420-682X Impact factor: 9.207
A table listing the pros and cons of MS-based methodologies for identifying and quantifying PPIs
| Methods | Pros | Cons |
|---|---|---|
| AP-MS | Co-immunoprecipitation can be performed without tagged baits expressed at physiological levels to identify endogenous PPIs | Co-IP with untagged baits is limited by the availability of antibodies, and the low expression levels of baits |
| Epitope tagging provides an alternative for purifying proteins lacking suitable antibodies | Epitope tags may interfere with the functions and solubility of the baits | |
| Transient transfection of tagged baits enhance their expression, thus improving the efficiency and throughput of the pulldowns | Ectopic expression of tagged baits may promote misfolding and mislocalization of the baits, promoting background contamination and spurious interactions | |
| PDB-MS | Allows detection of PPIs among both soluble and membrane proteins, as well as enriching for PPIs that are transient, weak, low abundance or have high turnover [ | May react with biotin-phenol and H2O2 to produce reactive radicals resulting in cellular toxicity (APEX) [ |
| Avoids post-lysis artefacts [ | The accessibility and labelling efficiency of the biotinylating enzyme are locality-dependent, as its orientation and topology within the protein complex may impede its performance | |
| The affinity of biotin to streptavidin is robust yet reversible. Hence, highly stringent conditions for sample denaturing, solubilization, capture, wash and extraction of biotinylated proteins can be employed to maximize the recovery of hydrophobic proteins while minimizing nonspecific background contaminants | The high affinity of the streptavidin–biotin interaction may hinder the recovery of highly biotinylated proteins. PDB-MS suffers from false positives in the forms of high-abundance background proteins or artefacts from endogenous biotinylation | |
| The labelling time for different enzyme varies from 1 min to 24 h [ | ||
| XL-MS | Crosslinking reagents can covalently connect two or more non-covalently interacting proteins, regardless of the duration and strength of the interaction. As such, even transient and weak PPIs can be preserved [ | The low efficiency (~ 1–5%) of crosslinking reagents, which often results in marginal crosslinks, where only the top 20–30% of proteins are detected |
| When used in combination with X-ray crystallography, CryoEM, NMR and native MS, the spatial constraint data from XL-MS can guide molecular modelling, construct a connectivity map for determining subunit topology, and map the dynamic behaviour of the protein complex [ | The crosslinking reaction time may be relatively long (~ 30 min). Excessively long reaction time may result in large, crosslinked protein aggregates | |
| To expand the number and coverage of crosslinks, alternative modes of crosslinking can be employed, such as carboxyl-targeting reagents [ | A crosslinker covalently links two linear peptides, giving rise to a hybrid dipeptide that can dramatically expand the search space during spectra matching, giving rise to the 'n-square problem' [ | |
| Co-Frac-MS | CoFrac-MS has high throughput, and it provides global identification and quantification of native protein complexes in one setting | False positives constitute a significant problem in the form of chance co-elution |
| It can be operated without genetic manipulation and overexpression, thereby inferring endogenous, physiologically relevant interactome [ | ||
| CoFrac-MS combined with quantitative proteomics can delineate the relative distribution of a protein in multiple co-elution features. Thus, the stoichiometries and dynamics of a target protein within different co-isolated complexes can be simultaneously elucidated [ | ||
| TPCA | TPCA permits system-wide profiling of protein complex dynamics, and it requires neither antibodies nor epitope tagging [ | The current version of TPCA is limited to studying the dynamics of known or predicted protein complexes across cellular state and physiological conditions. Need to incorporate existing interaction data with graph/network clustering algorithms to identify novel protein complexes [ |
| Little preparation time is required. It allows most of the study of protein complexes in situ and in vivo | ||
| TPCA profiling can be rapidly deployed to unravel the assembly state of protein complexes across cellular state, cell type, tissue and physiological conditions to provide insight into their functions in normal and diseased cells |
Fig. 1The AP-MS workflow. A A specific antibody can be used to selectively capture an untagged protein of interest (POI) that is expressed at physiological levels from the protein lysate. This untagged POI binds to other protein interactors directly or indirectly. Subsequently, beads conjugated with protein A/G are added to the protein mixture to capture the antibodies together with the protein assemblies. This is then followed by the washing and elution step to release the POI and its interactors for LC–MS/MS analysis. B For bait proteins lacking suitable antibodies, the POI can be genetically fused with an epitope tag, such as FLAG-tag or HA-tag. This bait-tag fusion construct can then be transfected transiently or stably into selected cell lines. Subsequently, resins conjugated to anti-epitope tag antibodies are added so that the POI and its interactors can be selectively enriched
Fig. 2The PDB-MS workflow. In PDB, a biotin ligase (BioID), a horseradish peroxidase (HRP) or a peroxidase (APEX) is genetically fused to a selected bait protein and expressed in a chosen cell line. In vivo labelling is achieved by adding biotins (BioID) or biotin phenols (APEX) to the cells, whereby these molecules are converted to reactive biotin intermediates. These reactive intermediates then diffuse away from the enzyme in a distance-dependent manner to covalently modify lysine (BioID) or tyrosine (APEX) residues located in close proximity. After performing cell lysis in harsh, denaturing conditions, biotinylated proteins are enriched using resin conjugated with streptavidin or neutravidin for subsequent quantitative proteomics analysis
Fig. 3The XL-MS workflow. Chemical crosslinking can be performed in vitro using extensively purified protein assemblies or in vivo using intact cells. The first step of chemical crosslinking involves adding a selected crosslinker to the protein mixture or cells. After chemical crosslinking, crosslinked proteins are digested to yield peptides. Typically, three types of cross-linked peptides are produced, i.e., the mono-linked peptides, the loop-linked peptides and the cross-linked peptides, among the many unlabelled peptides and unreacted crosslinkers. Due to the heterogeneity, the total pool of proteolyzed peptides is subjected to fractionation to enrich cross-linked peptides, subsequently mass-analysed by LC–MS/MS
Fig. 4The coFrac-MS workflow. Samples are lysed in mild conditions to preserve the integrity of protein complexes, separated under native or near-native conditions using column chromatography or native gel electrophoresis into fractions. Each fraction is then individually subjected to quantitative, bottom-up LC–MS/MS analysis. With the assistance of dedicated computational algorithms, the abundance of each protein is then plotted as co-migration profiles across fractions to construct an interactome network
Fig. 5The TPCA workflow. TPCA can be performed on intact cells or cell lysate. Lysed samples are first divided into an equal amount of aliquots and subjected to heat treatment with an increasing temperature gradient. Heat treatment induces denaturation and coaggregation of interacting proteins, which then co-precipitate. Upon centrifugation, the supernatant consisting of soluble proteins from different temperature treatment is retrieved for isobaric TMT-labelling and quantitative LC–MS/MS analysis. The abundance of each soluble proteins identified and quantified is then plotted against the temperatures to generate the “protein melting curve”