Analysis of intact proteins by native mass spectrometry has emerged as a powerful tool for obtaining insight into subunit diversity, post-translational modifications, stoichiometry, structural arrangement, stability, and overall architecture. Typically, such an analysis is performed following protein purification procedures, which are time consuming, costly, and labor intensive. As this technology continues to move forward, advances in sample handling and instrumentation have enabled the investigation of intact proteins in situ and in crude samples, offering rapid analysis and improved conservation of the biological context. This emerging field, which involves various ion source platforms such as matrix-assisted laser desorption ionization (MALDI) and electrospray ionization (ESI) for both spatial imaging and solution-based analysis, is expected to impact many scientific fields, including biotechnology, pharmaceuticals, and clinical sciences. In this Perspective, we discuss the information that can be retrieved by such experiments as well as the current advantages and technical challenges associated with the different sampling strategies. Furthermore, we present future directions of these MS-based methods, including current limitations and efforts that should be made to make these approaches more accessible. Considering the vast progress we have witnessed in recent years, we anticipate that the advent of further innovations enabling minimal handling of MS samples will make this field more robust, user friendly, and widespread.
Analysis of intact proteins by native mass spectrometry has emerged as a powerful tool for obtaining insight into subunit diversity, post-translational modifications, stoichiometry, structural arrangement, stability, and overall architecture. Typically, such an analysis is performed following protein purification procedures, which are time consuming, costly, and labor intensive. As this technology continues to move forward, advances in sample handling and instrumentation have enabled the investigation of intact proteins in situ and in crude samples, offering rapid analysis and improved conservation of the biological context. This emerging field, which involves various ion source platforms such as matrix-assisted laser desorption ionization (MALDI) and electrospray ionization (ESI) for both spatial imaging and solution-based analysis, is expected to impact many scientific fields, including biotechnology, pharmaceuticals, and clinical sciences. In this Perspective, we discuss the information that can be retrieved by such experiments as well as the current advantages and technical challenges associated with the different sampling strategies. Furthermore, we present future directions of these MS-based methods, including current limitations and efforts that should be made to make these approaches more accessible. Considering the vast progress we have witnessed in recent years, we anticipate that the advent of further innovations enabling minimal handling of MS samples will make this field more robust, user friendly, and widespread.
Mass spectrometry (MS) has become
an indispensable tool for protein analysis.[5,6] This
is exemplified by the large-scale quantitative analysis of complex
cellular systems, including the investigation of clinical samples
by proteomics.[3,7−9] In this approach,
proteins are enzymatically digested prior to MS analysis, and it is
the peptide fragments that are identified and quantified. An alternative
strategy, which is the focus of this Perspective, is MS-based analysis
of intact proteins, in which the full-length polypeptide is examined.[10−13] When native conditions are used for such an analysis, not only the
intact protein is detected but also the three-dimensional structure
of the protein and its noncovalent interactions with protein partners
and/or biomolecules are preserved.[10−12] This mode of analysis
has been mainly the regime of native MS electrospray ionization (ESI)
experiments, which has yielded unprecedented insights into the array
of coexisting protein isoforms, the crosstalk between post-translational
modifications (PTMs) and the structure and function of protein complexes
ranging from ribosomes through viral capsids to membrane protein complexes.Top-down analysis of intact proteins allows the identification
and PTM mapping of individual protein components in the protein complex,
while native MS studies the upper levels of the protein complex organization.[2,14,15] Both approaches, however, usually
call for substantial sample preparation. Current top-down protocols,
for example, often include cleanup strategies like protein precipitation,
molecular weight cutoff ultrafiltration, liquid chromatography (LC)
separation approaches (such as reversed phase, size exclusion, ion
exchange and chromatofocusing), and capillary electrophoresis techniques
(such as capillary zone electrophoresis and capillary isoelectric
focusing).[14] Conversely, native MS studies
are usually assayed with purified proteins,[16] but their purification procedures require pronounced time and labor
investments and frequently involve prior molecular tagging strategies
to enable affinity purification techniques and multiple biochemical
isolation steps. Moreover, as most of the commonly used solutions
for purification or storage of protein complexes contain buffers,
salts, and solubilizing agents that are largely incompatible with
MS, a buffer exchange step into a volatile buffer is necessary.[16]Therefore, methods for the rapid analysis
of proteins that do not
require multiple preparation steps, i.e., can be analyzed in their
crude state, offer great advantages. Obviating the multistep sample
preparation procedures and/or the need for protein purification will
overcome this labor-intensive and costly process. There is also the
advantage of better preserving the native state of the protein, thus
maintaining PTMs and noncovalent associations of the protein with
other biomolecules, such as ligands and cofactors, which are likely
to be lost during lengthy procedures. Shortening the time lag between
sample production and characterization will also be highly beneficial
for screening programs of engineered constructs by enabling the rapid
selection and ranking of lead protein candidates. Moreover, the ability
to rapidly analyze proteins may also assist clinical diagnostics initiatives
and pharmaceutical applications.Traditionally, ESI-based methods
have been used for the investigation
of intact proteins. Indeed, there are emerging examples of harnessing
the native MS method for the characterization of proteins from crude
samples.[17−21] Recent studies have also put forward the power of in situ ambient surface sampling and matrix-assisted laser desorption ionization
(MALDI) imaging MS methods for the analysis of proteins from surfaces,
cells, and tissues with minimal sample handling (see reviews in refs (2 and 22−27)). Although these in situ methods have been mainly
harnessed for the investigation of metabolites and peptides, they
are now being expanded toward protein analysis. In this Perspective,
we highlight recent MS-based studies that take advantage of different
sampling platforms to rapidly analyze intact proteins (Figure ). We review the most relevant
technical challenges associated with these approaches and present
ideas about how to improve their performance and promote utilization
in the future. We anticipate that the development of rapid protein
analysis approaches will progress in the next decade, with increasing
applications and usage.
Figure 1
Scheme summarizing the various mass spectrometry
ionization techniques
for intact protein analysis from crude samples. In situ analysis of intact proteins is enabled by MS-based imaging approaches
such as MALDI, DESI, LESA, and nano-DESI. These methods have successfully
been used to probe intact proteins from tissue sections, dried blood
spots, and microbial colonies. In contrast, the direct MS approach
is based on an ESI-MS analysis of crude solution samples. It enables
the assessment of cellular lysates of intracellular overexpressed
proteins in bacterial systems or the growth medium of eukaryotic secretion
systems. For further reading, several comprehensive reviews on the
different ionization methods have been published.[1−4]
Scheme summarizing the various mass spectrometry
ionization techniques
for intact protein analysis from crude samples. In situ analysis of intact proteins is enabled by MS-based imaging approaches
such as MALDI, DESI, LESA, and nano-DESI. These methods have successfully
been used to probe intact proteins from tissue sections, dried blood
spots, and microbial colonies. In contrast, the direct MS approach
is based on an ESI-MS analysis of crude solution samples. It enables
the assessment of cellular lysates of intracellular overexpressed
proteins in bacterial systems or the growth medium of eukaryotic secretion
systems. For further reading, several comprehensive reviews on the
different ionization methods have been published.[1−4]
MALDI
Imaging MS
MALDI is a “soft” technique that
can be used for
the ionization of large biomolecules.[28,29] The method
uses UV laser irradiation to photovolatilize samples that are cocrystallized
with an energy-absorbent organic matrix (Figure ). The activation of the sample–matrix
mixture gives rise to matrix evaporation, which carries the sample
with it into the mass spectrometer.[30] While
the exact mechanism of ion generation is not fully understood, MALDI
typically generates singly charged ions with small populations of
doubly and triply charged ions.[31] As a
result of its ability to analyze samples directly from surfaces, MALDI-MS
can be combined with histology in a process known as imaging mass
spectrometry to obtain the spatial distribution of proteins directly
from tissue sections.[32]MALDI imaging
of intact proteins directly from mammalian tissues
started in the early 2000s. In one of the first examples, the spatial
distribution of intact thymosin β.4 (∼5 kDa) and S100
calcium binding protein A4 (S100A4) (10 kDa) was mapped.[33] More recent examples involve determining the
expression, localization, and PTM profile of intact lens crystalline
proteins (α, β, and γ).[34,35] The method has also been extensively used for the study of intact
histones in MCF7breast cancer cells, which have indicated that histone
H3 (15.2 kDa) and H4 (11.3 kDa) are highly abundant in tumor regions.[36] MALDI examination of mouse brains has revealed
the existence of multiple histone H1 variants and their PTMs (20–25
kDa mass range).[37] This study also enabled
mapping of the distribution of H1 variants in the brain, providing
information not accessible by other methods, given the limited availability
of suitable antibodies.Identifying tissue- or tumor-specific
proteoforms is challenging,
especially if the mass difference between variants is small, as in
the case of different point mutations, oxidation (16 Da), disulfide
bond formation (2 Da), or amide formation (1 Da).[38] The resolving power of time-of-flight (TOF) mass analyzers
may not be sufficient in such cases.[39] However,
coupling a MALDI source with high-resolution mass analyzers such as
Fourier transform ion cyclotron resonance (FTICR)[40] and Orbitrap instruments[41] that
have high resolving powers and mass accuracy, can overcome this limitation
and resolve highly similar proteoforms, including proteins with isobaric
isotopic distributions. A recent MALDI-FTICR high-resolution imaging
analysis of glioblastomamouse brain tissue has identified a large
number of intact proteins in the range of 4–15 kDa, including
heavily modified histones. The imaging resolved many of the PTMs of
histones H2, H3, and H4, whose acetylation and methylation status
was dependent on whether tumor or healthy regions were analyzed.[40] Similarly, in another high-resolution MALDI-FTICR
imaging study, sections of mousekidneys infected with Staphylococcus aureus were analyzed.[42] Twelve modified variants were identified for S100A8 (10.1
kDa), a protein subunit of the heterodimer calprotectin, with one
form of the oxidized protein found to localize in the center of infectious
foci, where staphylococcal microcolonies reside.[42] The high mass and spatial resolution of MALDI-FTICR imaging
was used recently to determine the localization of the humaninsulin
protein (INS) and to distinguish between the two mouseinsulin proteins
INS1 and INS2 within the pancreatic islets of Langerhans (Figure ), demonstrating
this method’s ability to resolve highly similar proteins.[43]
Figure 2
MALDI-FTICR MS imaging of intact insulin from pancreatic
tissues.
Spectra and images of a mouse (A) and human (B) pancreas indicating
the localization of the two mouse insulin isoforms INS1 and INS2 and
the human protein INS, respectively, within the islets of Langerhans.
The theoretical mass distribution is highlighted in blue (for mouse)
and green (for human). Adapted with permission from ref (43). Copyright (2019) Elsevier.
MALDI-FTICR MS imaging of intact insulin from pancreatic
tissues.
Spectra and images of a mouse (A) and human (B) pancreas indicating
the localization of the two mouseinsulin isoforms INS1 and INS2 and
the human protein INS, respectively, within the islets of Langerhans.
The theoretical mass distribution is highlighted in blue (for mouse)
and green (for human). Adapted with permission from ref (43). Copyright (2019) Elsevier.The benefit of MALDI imaging is that intact proteins
are analyzed
directly from tissues, without the isolation and purification process.
Another significant advantage is the spatial molecular landscape it
provides. In recent years, this aspect has been rapidly progressing
with the development of instruments equipped with low-diameter laser
beams, suitable for high spatial resolution imaging.[44,45] There is no doubt that the ability to resolve the spatial localization
of multiple proteins within a single section of pathological tissue
can enable the detection of disease candidates and improve our understanding
of pathophysiology. Nevertheless, the method still requires extensive
tissue preparation techniques.[46] Tissues
must be pretreated with a matrix, and although multiple matrix options
and coating methods are available, the commonly used matrices are
acidic and contain organic solvents, which tend to denature the proteins
of interest.[47,48] Thus, while intact proteins can
be ionized in the presence of detergents and highsalt concentrations,[48] the preservation of protein–protein or
protein–ligand interactions is challenging.[25,49]Top-down strategies provide an efficient approach for the
protein
identification of MALDI imaging data, wherein intact protein measurements
are followed by on-tissue MS/MS analysis. However, the typical low-charge
ions generated by MALDI not only yield high m/z values, which may limit the detection of the intact protein,
but also restrict the fragmentation efficiency.[50,51] Current attempts to overcome this hurdle focus on supercharging
matrices, which are expected to increase the charge state of ions
generated from tissue.[52,53] Another inherent limitation of
MALDI imaging is that it is mainly suitable for small proteins, up
to 25 kDa.[24] However, given the recent
developments in the field, it is expected that in the future the method
will expand toward the detection and imaging of higher-mass proteins.
Progress in this direction is already being made by the use of the
matrix ferulic acid, which remarkably increases signal acquisition
in the mass range from 20 to 150 kDa.[54] An additional promising direction involves a new detector that enables
proteins up to 110 kDa to be detected directly from the tissue and
proteins of up to 70 kDa to be spatially resolved.[55]
Ambient Surface MS
Ambient surface sampling techniques
are performed with minimal
sample handling at atmospheric pressure and make use of ESI.[2] Specifically, the analysis of intact protein
involves various liquid extraction techniques, in which target molecules
are removed from the sample surface and extracted into a solvent before
the ionization process.[56] These liquid
extraction approaches can be categorized into three main classes (Figure ): (1) Spray-based
techniques, such as desorption electrospray ionization mass spectrometry
(DESI–MS), in which charged droplets are sprayed directly onto
a sample of interest, while extracting the analytes from the surface
and producing gaseous ions that are directed into the mass spectrometer.[57] (2) Direct liquid extraction techniques, such
as liquid extraction surface analysis (LESA), in which the sample
is extracted from the surface by dispensing a solvent on its surface
from a pipet tip. This creates a liquid junction between the tip and
the sample surface, allowing analytes to dissolve and reaspire with
a conductive pipet tip prior to ESI-MS.[2] (3) Flow-based techniques, such as nanospray desorption electrospray
ionization (nano-DESI), in which a solvent bridge is created between
two capillaries and the surface of interest. The main difference between
nano-DESI and LESA is the configuration of the capillaries. In LESA,
only one capillary is responsible for sample extraction and ionization,
while in nano-DESI two capillaries are used, a primary capillary which
directs the solvent to the surface, where the sample material is dissolved,
and a second capillary, which collects the sample and directs it into
the nano-ESI mass spectrometer.[58] The liquid
junction created in nano-DESI is typically smaller than that of LESA,
due to the low flow rate of the liquid onto the surface[59]Studies using liquid extraction surface
sampling techniques for in situ analysis of intact
proteins are starting to emerge.
Hemoglobin, due to its high abundance in blood, represents one of
the main examined systems. Using a DESI FTICR platform, measurements
from hemolysate cells have allowed the identification of the intact
hemoglobin α and β chains (∼16 kDa) as well as
the released heme group (616 Da).[60] Another
ambient surface MS study, this time involving a LESA-based analysis
of intact proteins from dried blood spots, demonstrated this method’s
ability to identify multiple hemoglobin variants that lead to sickle
cell disease and thalassemia in a rapid (<10 min) and unambiguous
manner.[61,62] Nano-DESI, in combination with light microscopy,
has too been employed to investigate hemoglobin, specifically, the
variation in its expression during mice development from fetal to
adult stages (Figure ).[63]
Figure 3
Mouse hemoglobin expression during development
is characterized
by coupling nano-DESI and microscopy. The data show the variety of
hemoglobin subtypes and isoforms over time, preformed directly on
spinal cords tissue sections at different stages of mouse development.
High expression levels of fetal isoforms are detected during early
embryonic stages (E12.5–E15.5). During the developmental transition
from fetal to adult, the hemoglobin forms gradually change: the fetal
isoforms decline in number over time and disappear altogether at the
postnatal stage, at which time the adult variant becomes the dominant
form. Adapted with permission from ref (63). Copyright (2013) National Academy of Sciences.
Mouse hemoglobin expression during development
is characterized
by coupling nano-DESI and microscopy. The data show the variety of
hemoglobin subtypes and isoforms over time, preformed directly on
spinal cords tissue sections at different stages of mouse development.
High expression levels of fetal isoforms are detected during early
embryonic stages (E12.5–E15.5). During the developmental transition
from fetal to adult, the hemoglobin forms gradually change: the fetal
isoforms decline in number over time and disappear altogether at the
postnatal stage, at which time the adult variant becomes the dominant
form. Adapted with permission from ref (63). Copyright (2013) National Academy of Sciences.While the above examples were all conducted under
denaturing conditions
using high concentrations of organic solvents, more recent native
MS studies have opted to employ ammonium acetate-based solvents.[22,64] In these studies, LESA-MS detected the intact tetrameric hemoglobin
complex (∼64 kDa) bound to its heme group in dried blood spots[64] and in thin tissue sections of mouse liver.[22]High-resolution instruments enable resolving
isotopic patterns
of proteins from tissue sections, which are then combined with spatial
mapping. This has been exemplified by nano-DESI imaging of lymphoma
and healthy thymus sections, in which truncated forms of proteins,
such as thymosin β-4 and ubiquitin, were largely found in the
tumor, particularly in the thymus region infiltrated by cancerous
cells.[65] Notably, such truncations are
highly likely to escape detection by the immuno-based techniques commonly
used in biochemical studies, highlighting the benefits of this MS-based
method. Similarly, a top-down LESA-MS analysis of healthy and diseased
liver tissues was able to unambiguously distinguish between two different
variants of a protein biomarker, the liver fatty acid binding protein
(FABP1). The two highly similar forms of FABP1 differ only by one
amino acid substitution, Thr → Ala, with the Ala variant displaying
increased risk of developing nonalcoholic fatty liver disease.[66] Another benefit of this approach, in comparison
to proteomic analysis or immunohistochemistry, derives from the ability
to define the assembly state of the analyzed protein complex. This
has been shown in a LESA-based analysis of kidney tissues, which identified
the presence of a 42 kDa homotrimer,[23] whose
stoichiometry was confirmed by MS/MS.The ambient surface techniques
have also been expanded to the extraction
of periplasmic and cytosolic proteins directly from living bacterial
colonies, offering the means to analyze bacterial growth, communication,
and response to external factors.[67] In
one study using the E. coli K-12 laboratory
model strain, a LESA extraction pipet tip was maneuvered into contact
with the colony. Protein identification using top-down MS revealed
the identity of six different proteins, among them DNA-binding and
stress-resistant proteins, ranging in mass between 6.5 and 15.5 kDa.[67]Unlike bacterial colonies, the analysis
of yeast colonies is not
possible using contact LESA because of their thick cell walls. Therefore,
a combination of LESA-MS and electroporation was explored to lyse
yeast colonies directly from growth agar media and release intact
proteins for LESA-MS analysis.[68] Electroporation
of different yeast species, such as Saccharomyces cerevisiae, Candida glabrata, and Cryptococcus neoformans, enabled the identification
of different intact proteins, ranging in mass from 5 to 15 kDa. Two
of the S. cerevisiae proteins and 20
of the C. glabrata and C. neoformans proteins were detected, for the first
time, in their intact form.To reduce the overlap of charge
states and increase the protein
identification capacity, in situ ambient surface
approaches were recently integrated with ion mobility separation.
Significant improvement in the signal-to-noise ratio was achieved
by the separation of highly complex overlapping signals of liver tissues
through DESI-MS imaging via coupling the measurement with traveling
wave ion mobility mass spectrometry (TWIMS). In this research, the
intact mass and spatial distribution of the hemoglobin α and
β chains (∼16 kDa), a fatty acid binding protein (14.3
kDa), and a 10 kDa heat shock protein (CH10) could be detected.[69] In a similar effort to increase sensitivity
so as to detect proteins that would have otherwise remained buried
in the noise, DESI-MS was coupled to a high field asymmetric waveform
ion mobility spectrometer (FAIMS), which allowed the detection of
up to 16 proteoforms in mouse kidney, mouse brain, and human ovarian
samples. The different expression levels of members of the S100 protein
family were imaged directly in healthy and cancer breast tissue samples
(Figure A).[70] Moreover, when such experiments are performed
using native-like conditions, the collision cross sections of folded
proteins can be profiled directly from tissues, as demonstrated by
LESA-TWIMS for β-thymosin (4.9 kDa), ubiquitin (8.5 kDa), and
tetrameric hemoglobin (64 kDa) (Figure B).[71] This type of analysis
opens the way to studying protein–protein interactions or protein
misfolding in disease tissues.
Figure 4
Intact-protein analysis by ambient surface
MS. (A) DESI-MS images
of normal and cancerous breast tissue samples reveal the different
expression levels of intact S100 proteins and hemoglobin. The detected
upregulation of the S100 proteins in breast cancer tissues is in accordance
with previous records. Adapted with permission from ref (70) Copyright (2018) American
Chemical Society. (B) LESA-MS analysis under native conditions of
a mouse kidney section. (Left) Microscope image of a mouse kidney
section overlaid with a pixel grid. (Right) Mass spectrum of an intact
hemoglobin heterotetramer (+16) and heterodimer (+11) sampled directly
from the tissue section labeled by a red square in the left panel,
which marks the position of the renal pelvis, which is rich in blood
vessels. Adapted with permission from ref (71). Copyright (2020) American Chemical Society.
Intact-protein analysis by ambient surface
MS. (A) DESI-MS images
of normal and cancerous breast tissue samples reveal the different
expression levels of intact S100 proteins and hemoglobin. The detected
upregulation of the S100 proteins in breast cancer tissues is in accordance
with previous records. Adapted with permission from ref (70) Copyright (2018) American
Chemical Society. (B) LESA-MS analysis under native conditions of
a mouse kidney section. (Left) Microscope image of a mouse kidney
section overlaid with a pixel grid. (Right) Mass spectrum of an intact
hemoglobin heterotetramer (+16) and heterodimer (+11) sampled directly
from the tissue section labeled by a red square in the left panel,
which marks the position of the renal pelvis, which is rich in blood
vessels. Adapted with permission from ref (71). Copyright (2020) American Chemical Society.Taken together, ambient surface analysis techniques
benefit from
the fact that sample preparation or disruption procedures are not
required, enabling preserving a more biologically relevant environment
compared to MALDI imaging methods.[2] Moreover,
the multiple charge states generated by electrospray ionization enables
top-down analysis by a range of techniques, such as collision-induced
dissociation (CID), electron capture dissociation (ECD), or electron
transfer dissociation (ETD).[2] Another major
advantage is the integration of ion mobility separation with the imaging
workflows to analyze folded proteins and protein complexes in a spatially
defined manner. Nevertheless, the method is still limited by the ionization
efficiency of protein assemblies, restricting the analysis of low-abundance
and large species.Incomplete desolvation and the formation
of protein–protein
and protein–contaminant clusters have been shown to be responsible
for the mass-dependent loss of sensitivity in large protein analyses.[72] Recent progress in mitigating these limitations
has, however, been achieved by using solvent additives[73] and improving platform setups.[74,75] These technical advances suggest that it may become possible to
analyze the spatial distribution of less-abundant intact protein complexes
in tissues, aiding disease understanding and therapeutic avenues.
Direct
MS
Direct MS is a new approach for the analysis of recombinant
proteins
under native conditions that does not require prior protein purification
while allowing the rapid analysis of intact, overproduced proteins
from crude samples, with minimal sample handling.[76] In this method, overexpressing bacterial cells are directly
lysed in a volatile native MS compatible solution, and only the clearance
of cell debris and insoluble materials is required before measurement.[18] In a similar manner, secreted proteins from
eukaryotic hosts can be directly sprayed into the mass spectrometer
from a precleared culture medium, following buffer exchange into a
native MS-compatible solution.[17] Thus,
analysis is based on the ESI-MS measurement of samples in solution,
unlike the surface-based methods described above (Figure ). The method can be carried
out on multiple mass spectrometers, such as Orbitrap and quadrupole
time-of-flight (QTOF)-based mass spectroscopy platforms, that enable
intact protein detection.[17−19]The key requirement for
direct MS analysis is the overproduction
of the target protein, such that it will outperform the cellular levels
of the endogenous proteins.[17−19] In its basis, the method relies
on a drawback of MS, its limited dynamic range, wherein low-abundance
proteins are masked by higher-abundance ones.[77] This inherent property is turned into an advantage that enables
biased detection of the highly produced target protein, disregarding
the lower-abundant endogenous proteins. Thus, signal suppression enables
overcoming the need for prior protein purification.Recombinant
proteins produced in E. coli, the most
commonly used bacterial expression system,[78,79] can accumulate several milligrams of protein per milliliter of lysate,
making them an excellent system for direct MS analysis.[18] Production quantities are usually lower in eukaryotic
hosts, challenging the ability to detect proteins that are overexpressed
intracellularly. Secretion systems, however, overcome this bottleneck
by transferring the recombinant protein from the cell to the growth
medium, making it the dominant protein in the culture despite the
presence of the background endogenous proteins in the medium.[80] Therefore, for direct MS, the use of protein-free
growth media is recommended in order to eliminate ion suppression
and superposition of signals due to a high background of medium proteins,
particularly when expression levels are low.[17]The direct MS method has been applied to a broad array of
proteins
and protein complexes, ranging from 20 kDa monomers to 185 kDa protein
complexes.[17,18] The approach is particularly
useful for characterizing engineered proteins, as it provides rapid
assessment of their quality along with structural input for iterative
redesign and optimization.[17,19] This aspect has been
shown in the analysis of designed antibodies, demonstrating not only
the proper folding, assembly state, and glycan modification of the
generated antibodies but also their improved expression levels and
stability compared to the wild type forms.[20] In another example, the method was utilized for a “quick
and dirty” determination of proteins’ interactions and
function.[19] RAB1A, a Ras-related protein,
was suspected to bind the 20S proteasome and inhibit its proteolytic
activity. MS/MS experiments using a cell lysate expressing RAB1A in
the presence of the 20S proteasome confirmed the binding of the two,
while time course analysis of the levels of a 20S proteasome substrate
in this lysate confirmed that RAB1A coordinates the proteasome activity.[19]Recently, direct MS was adjusted so as
to measure pairwise interaction
strengths from crude lysates (Figure ).[21] The approach is based
on the double mutant-cycle method, wherein the two target residues
are mutated both separately and in combination, usually to alanine,
and the energetic effects of the mutations are determined.[81] An earlier study has shown that pairwise interaction
energies can be determined from a single mass spectrum.[82] On the basis of this understanding, four different
proteins, consisting of the two wild type (wt) and two mutated (mut)
variants, were coexpressed in the same bacterial cells.[21] The spectrum measured from the crude lysate
showed the four coexisting complexes (wt/wt, wt/mut, mut/wt, and mut/mut),
from which the coupling energy was calculated, obviating the need
to purify each of the four proteins and determine individual binding
constants for each of the generated dimers.
Figure 5
Determining the strengths
of intermolecular pairwise interactions
between residues at the interfaces of interacting proteins by direct
MS. A wild type colicin endonuclease (E) and a wild type immunity
(Im) protein and two respective mutants (mut), in which the two interface
residues (N83 and N31) were mutated to alanine, were overexpressed
in the same bacterial cells. The crude lysate contained the four different
complexes constituting the WT and mutated proteins. The pairwise interaction
energy was calculated from a single high-resolution native mass spectrum
directly from the crude lysate by measuring the intensities of the
complexes formed by the two WT proteins (red peaks), the complex of
each WT protein with a mutant protein (blue and orange peaks), and
the complex of the two mutant proteins (green peaks). Yellow dots
in the dimer cartoons represent the mutated interface residues. Inset
shows the structures of the E-Im dimer. The mutated residues are labeled.
Adapted with permission from refs (3 and 21). Copyright
(2020) Springer Nature and Copyright (2018) American Chemical Society,
respectively.
Determining the strengths
of intermolecular pairwise interactions
between residues at the interfaces of interacting proteins by direct
MS. A wild type colicin endonuclease (E) and a wild type immunity
(Im) protein and two respective mutants (mut), in which the two interface
residues (N83 and N31) were mutated to alanine, were overexpressed
in the same bacterial cells. The crude lysate contained the four different
complexes constituting the WT and mutated proteins. The pairwise interaction
energy was calculated from a single high-resolution native mass spectrum
directly from the crude lysate by measuring the intensities of the
complexes formed by the two WT proteins (red peaks), the complex of
each WT protein with a mutant protein (blue and orange peaks), and
the complex of the two mutant proteins (green peaks). Yellow dots
in the dimer cartoons represent the mutated interface residues. Inset
shows the structures of the E-Im dimer. The mutated residues are labeled.
Adapted with permission from refs (3 and 21). Copyright
(2020) Springer Nature and Copyright (2018) American Chemical Society,
respectively.Unlike the in situ surface techniques described
above, the direct MS method, although less restricted by the protein
or protein complex size, is constrained to the analysis of overproduced
recombinant targets rather than endogenous proteins. Consequently,
while it provides in-depth analysis of PTMs and profiles of coexisting
variants,[17,19] it should be noted that the relative abundance
of the different modified forms of the target protein may not reflect
the true biological distribution. Moreover, the method is limited
to the use of expression systems that enable the target proteins to
become the dominant component in the sample. Therefore, current applications
using eukaryotic expression systems are limited to the analysis of
recombinant secreted proteins rather than those produced intracellularly.
This limitation may be overcome by the development of improved expression
plasmids that can scale up the yield. Another way to overcome the
relative abundance limitation is to simplify existing fractionation
methods employed in the native MS analysis of complex protein lysates.[83−85] Currently, however, these strategies require a significant amount
of sample handling, which is not only labor intensive but may also
result in the loss of important noncovalent associations. The recent
development of charge detection MS,[86,87] which allows
detailed analyses of high mass and heterogeneous samples, is also
expected to advance direct characterization of endogenous proteins
from crude lysates. Alternatively, data processing techniques may
be adapted from the in-cell NMR method, in which background signals
are significantly reduced by subtracting from the original spectrum
a spectrum acquired on a control sample of cells transfected with
an empty vector, where protein expression did not occur.[88]We expect that further extension of this
method will involve its
progression to high-throughput analysis. Recent progress toward this
direction has been made by the development of an online buffer exchange
method for clarifying cell lystaes.[89] Similarly,
improvements in cell lysis techniques, like microwell arrays,[90] high-throughput electromechanical lysis,[91] or pressure-based methods for tissue lysis,[92] may facilitate throughput analysis. Another
expected future direction involves coupling to microfluidic systems,
which will enable the continued analysis of cultures grown in a bioreactor
and the capture in real time of the state of protein production.
Concluding
Remarks
Herein, we propose that the MS analysis of intact
proteins from
crude samples, with minimal perturbations to their structure, modifications,
and/or assembly, opens up multiple opportunities for rapid characterization
and high-throughput screening. Specifically, the approaches described
above, although fundamentally differing from each other in terms of
the characteristics of the sample (surface analysis vs solution),
all benefit from the relative simplicity and feasibility across different
MS platforms. They also benefit from the minimal amount of sample
required, the absence of labeling, the reduced time gap between sample
production and characterization, and the in-depth information afforded
by MS.We anticipate, for example, that the direct MS method
that involves
solution analysis will facilitate protein engineering and successful
mass production of recombinant proteins for both industrial and pharmaceutical
applications, achieving batch-to-batch consistency and maximum productivity
at a reasonable cost. The screening of surfaces, as exemplified by
the ambient ionization methods, is foreseen to be translated into
benchtop analytical devices for on-site tests, such as intraoperative
tumor margin assessment and environmental, forensics, and defense
applications.[27] Moreover, given the promising
results of MALDI and ambient MS methods, these methods are expected
to be highly beneficial for the rapid identification of microorganisms,
analysis of biofluids, and diagnostic studies.[26] Thus, a critical aspect in the coming years will be to
keep evolving these approaches into a robust and user-friendly set
of technologies.Despite the diverse potential applications
of the MS techniques
for protein analyses with minimal handling, they are mostly used in
certain specialized laboratories, and their translation into everyday
life practice is hindered due to multiple challenges. For instance,
all the described methods, regardless of the applied ionization method,
will benefit from the development of robust, easy-to-use, cost-effective
devices that can be employed routinely by nonexperts. Automation of
platforms and their integration with robotic laboratories are also
expected to expand the outreach of the techniques. There is also a
need for improving the ionization efficiency, overcoming signal suppression
issues, and developing high-throughput capabilities. Considering that,
irrespective of the method by which the protein sample is introduced
into the mass spectrometer, fragmentation of the intact protein must
be used for identification (as CID, ECD, and ETD), improvements in
these capabilities along with automated assignment tools will advance de novo sequence analysis and unambiguous interpretation.
Moreover, as the advanced methods outlined in this Perspective are
often adjusted to one analytical aspect, combining different types
of techniques may bring new advances. Finally, as the field grows
to maturity, standardization of the methodologies for clinical, industrial,
and pharmaceutical applications also will be required.
Authors: Jeremy L Norris; Dale S Cornett; James A Mobley; Malin Andersson; Erin H Seeley; Pierre Chaurand; Richard M Caprioli Journal: Int J Mass Spectrom Date: 2007-02-01 Impact factor: 1.986
Authors: Joscelyn Sarsby; Nicholas J Martin; Patricia F Lalor; Josephine Bunch; Helen J Cooper Journal: J Am Soc Mass Spectrom Date: 2014-09-03 Impact factor: 3.109
Authors: Oliver J Hale; Emma K Sisley; Rian L Griffiths; Iain B Styles; Helen J Cooper Journal: J Am Soc Mass Spectrom Date: 2020-03-11 Impact factor: 3.109
Authors: Oleg Klykov; Mykhailo Kopylov; Bridget Carragher; Albert J R Heck; Alex J Noble; Richard A Scheltema Journal: Mol Cell Date: 2022-01-20 Impact factor: 17.970
Authors: Sohail Jahid; Jose A Ortega; Linh M Vuong; Isabella Maria Acquistapace; Stephanie J Hachey; Jessica L Flesher; Maria Antonietta La Serra; Nicoletta Brindani; Giuseppina La Sala; Jacopo Manigrasso; Jose M Arencibia; Sine Mandrup Bertozzi; Maria Summa; Rosalia Bertorelli; Andrea Armirotti; Rongsheng Jin; Zheng Liu; Chi-Fen Chen; Robert Edwards; Christopher C W Hughes; Marco De Vivo; Anand K Ganesan Journal: Cell Rep Date: 2022-04-05 Impact factor: 9.995