Clement M Potel1,2, Simone Lemeer1,2, Albert J R Heck1,2. 1. Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences , Utrecht University , Padualaan 8 , 3584 CH Utrecht , The Netherlands. 2. Netherlands Proteomics Centre , Padualaan 8 , 3584 CH Utrecht , The Netherlands.
Pioneering
work by, among others,
the groups of Hunt and Mann at the start of this century opened up
the era of mass spectrometry (MS)-based phosphoproteomics.[1−3] Through advances in sample preparation, phosphopeptide enrichment,
mass spectrometric detection, peptide sequencing, and dedicated database
search algorithms, this field has matured and it is nowadays possible
to monitor thousands of protein phosphorylation events qualitatively
and quantitatively.[4−8] Such phosphoproteomics studies are important, as reversible protein
phosphorylation is regulating nearly all biological processes and
the deregulation of phosphorylation has been associated with the onset
of various pathologies.[9−15] Notwithstanding this maturation and broad acceptance of mass spectrometry
based phosphoproteomics by the research community, the mass spectrometric
analysis of phosphorylation remains demanding, mainly due to the often
severe substoichiometric levels of protein phosphorylation and the
intrinsic lability of the phosphate group, hampering both enrichment
and unambiguous sequencing analysis of phosphopeptides. Here we review
the increasing knowledge gathered about the mechanisms behind the
fragmentation of phosphopeptide ions and describe additionally recent
advances made to improve and facilitate the identification and site
localization of phosphorylated peptides. Since our earlier related
review in 2009,[16] several fragmentation
methods have been introduced and successfully applied to phosphopeptides
sequencing, such as electron capture (ECD), electron transfer induced
dissociation (ETD), and hybrid fragmentation techniques such as EThcD
(a combination of ETD and higher energy collisional dissociation (HCD))
and AI-ETD (a combination of ETD with infrared photoactivation). A
plethora of data has been gathered on endogenous Ser, Thr, Tyr phosphorylation
but also on large synthetic phosphopeptide libraries, providing new
insight into the mechanisms behind phosphopeptide fragmentation. Moreover,
data on phosphopeptides harboring phosphorylated histidine, arginine,
or lysine have become available, showing some distinct fragmentation
behaviors. Through all this acquired knowledge on phosphopeptide fragmentation
and site localization, MS-based phosphoproteomics has developed into
a respectable and valuable tool for the broader life science research
community.
Sequencing Phosphopeptides Using Collision Induced Dissociation
Basic
Principles of Collision Induced Dissociation Applied to
Peptide Ions
During collision induced dissociation (CID),
gas-phase peptide ions are subjected to collisions with inert gas
atoms or molecules, leading to the incremental increase of the peptide
ions internal energies and ultimately to their unimolecular dissociation
into fragment neutrals and ions, whereby the ions are used to elucidate
the peptide sequence or to pinpoint the modification sites following
data analysis. CID methods are the most prominent ion activation methods
in the field of bottom-up proteomics, due to their high efficiency
and ease of implementation. However, this ease of implementation cannot
conceal the complexity of energy transfer and dissociation mechanisms
involved, discussed in detail in various recent reviews.[17−19] When considering applications of CID to (phospho)proteomics, one
refers almost exclusively to the so-called low-energy CID, during
which collision energies in the range of 1–200 eV are used.
Nonelastic collisions with the inert gas molecules lead to the transfer
of translational into internal energy, mainly in the form of vibrational
energy. Low-energy CID is a stepwise process and multiple collisions
(tens to hundreds) are necessary to accumulate sufficient internal
energy to reach a threshold energy necessary to overcome for the gas-phase
unimolecular dissociation reaction to happen.[20,21] For this reason, CID presents relatively long activation times in
comparison with alternative dissociation techniques discussed later.
During this process, the internal energy is quickly (<10–12 s) equilibrated through the vibrational degrees of freedom of the
peptide ions. This statistical redistribution of internal energy implies
that following CID, low-energy dissociation pathways are preferably
accessed.The main dissociation pathways occurring during CID
are charge-directed. In positive ion mode, the mobile proton model,
introduced by Gaskell, Wysocki, and co-workers,[22−27] provides a generally accepted model to explain CID fragmentation
behavior of protonated peptides. Peptide gas-phase ions formed by
soft ionization techniques (i.e., electrospray ionization) are characterized
by their relatively low level of internal energy: ionizing protons
are thus localized at the energetically favored positions, mainly
at basic residues side chains (histidine, lysine, and arginine) and/or
at the N-terminal amine. Increase of the internal energy of the peptide
upon CID facilitates the mobilization of the protons to alternative,
less favored sites along the backbone. Protonation of the nitrogen
atom of the peptide backbone amide bond results in the weakening of
the amide bond, and the subsequent oxazolone pathway results preferentially
in the formation of the well-known complementary b and y fragment
ions[17] (Figure ).
Figure 1
Peptide ion fragmentation ladder with the complementary
a/x, b/y,
and c/z ion series displayed. Upon CID, the oxazolone pathway, resulting
in the preferential formation of the b/y ion series is energetically
favored after protonation of the nitrogen atom of the amide bond.
Peptide ion fragmentation ladder with the complementary
a/x, b/y,
and c/z ion series displayed. Upon CID, the oxazolone pathway, resulting
in the preferential formation of the b/y ion series is energetically
favored after protonation of the nitrogen atom of the amide bond.
Neutral Losses of Serine
and Threonine Phosphorylated Peptides
Particular for phosphorylated
peptides, low-energy relaxation pathways
can additionally lead to the loss of the rather labile phosphate group,
either as neutral phosphoric (H3PO4) or meta-phosphoric
(HPO3) acids, competing with the above-described backbone
fragmentation.[6,16] We will first focus on the mechanisms
of neutral losses, while the impacts of such neutral losses on phosphopeptides
identification and phosphosites localization are discussed later.
As CID dissociation pathways are mainly charge-directed, the mobility
of the ionizing protons bears a crucial importance in the type of
gas-phase reactions involved. Three major classes of proton mobility
conditions have been defined,[28] depending
on the number of ionizing protons (i.e., the peptide’s charge
state) and the number of basic residues present in the peptide sequence.
If the number of charges is higher to the number of basic residues,
the conditions are considered as mobile. Limited mobility corresponds
to cases, wherein the number of basic residues is higher than or equal
to the number of protons. Because of the very high gas-phase basicity
of arginine residues, large amounts of energy are necessary to mobilize
a proton initially sequestered at such an arginine side-chain. Conditions
are thus considered as nonmobile if the number of arginine residues
is higher than or equal to the number of charges carried by the peptide.It is well established that upon CID fragmentation, phosphoserine
and phosphothreonine (pST) peptides mainly fragment through elimination
of H3PO4.[6,16] Thisphosphoric acid
loss can originate from a direct loss of H3PO4 or combined losses of HPO3 and a water molecule from
a residue’s side chain containing a hydroxyl group. It was
also observed that the propensity of pST phosphopeptides to undergo
neutral loss is inversely correlated with the proton mobility (i.e.,
neutral loss is more likely to occur under limited proton mobility
conditions).[29] Over the years, several
alternative mechanisms have been proposed for the observed phosphate
neutral losses, including β-elimination, E2 eliminations, and
nucleophilic substitutions involving diverse nucleophiles displacing
the phosphate group. Gronert et al. originally obtained similar activation
barriers for the different dissociation mechanisms, implying that
neutral loss could occur via multiple pathways.[30] Based on computational and experimental evidence, it is
now accepted that under mobile proton conditions, H3PO4 neutral loss via a SN2 mechanism is favored, initiated
by the mobilization of a proton to one of the oxygen atoms of the
phospho-group.[29,31,32] This protonation makes the β-carbon linked to the charged
phospho-group electrophilic and thus suitable for a subsequent nucleophilic
attack from a neighboring nucleophile group. Neutral loss then preferably
occurs via formation of a five-membered oxazoline ion after nucleophilic
attack from the N-terminal carbonyl oxygen, expelling the phospho-group
(Figure ).
Figure 2
Neutral H3PO4 elimination from phosphopeptides
under high (A) and low (B) proton mobility conditions. Note that alternative
SN2 mechanisms have been proposed, depending on the nature
of the nucleophilic attacking group. Here, the energetically favored
formation of the five-membered oxazoline ion is displayed. The energetically
favored HPO3 neutral loss mechanism is displayed in part
C. Reproduced from Modeling of the gas-phase phosphate group loss
and rearrangement in phosphorylated peptides, Rožman, M., J. Mass Spectrom.2011, Vol. 46, pp. 949–955 (ref (31)). Copyright 2011 Wiley.
Neutral H3PO4 elimination from phosphopeptides
under high (A) and low (B) proton mobility conditions. Note that alternative
SN2 mechanisms have been proposed, depending on the nature
of the nucleophilic attacking group. Here, the energetically favored
formation of the five-membered oxazoline ion is displayed. The energetically
favored HPO3 neutral loss mechanism is displayed in part
C. Reproduced from Modeling of the gas-phase phosphate group loss
and rearrangement in phosphorylated peptides, Rožman, M., J. Mass Spectrom.2011, Vol. 46, pp. 949–955 (ref (31)). Copyright 2011 Wiley.It was initially hypothesized that neutral losses via charge-remote
mechanisms were favored in case of nonmobile or limited mobility conditions,
with a β-elimination reaction leading to the formation of dehydroalanine
or dehydrobutyrine in the case of phosphoserine or phosphothreoninepeptides, respectively.[33] However, Palumbo
et al. showed that charge-directed mechanisms were favored even under
nonmobile conditions.[29] This observation
was rationalized by the formation of a hydrogen bond between the arginineguanidinium group and the phosphate group. By withdrawing electrons,
this interaction creates an electrophilic group suitable for the SN reaction, can assist the phosphate leaving group,
and the basic group can act as a charge neutralizer by transferring
its proton to the phosphate group to form H3PO4. Rožman confirmed by a combination of quantum mechanics and
RRKM modeling that neutral loss via charge-directed mechanisms are
indeed also energetically favored in limited proton mobility conditions,
through interaction of positively charged guanidinium with the phospho-group
(Figure ).[31]This indicates that the proton mobility
conditions do not significantly
influence the neutral loss mechanism but rather affect the competition
between the neutral loss and backbone fragmentation channels. Under
high mobile proton conditions, energetic and kinetic properties of
the neutral loss pathway match those of the classical oxazolone pathway.
Backbone fragmentation mechanisms, which are primarily charge-directed,
necessitate a larger amount of activation energy when the proton mobility
is limited, while Rožman calculated that activation barriers
for neutral loss mechanisms under both high and low mobile proton
conditions (displayed in Figure A,B) were somewhat similar.[31] Hence, under limited proton mobility conditions, the neutral loss
reaction can become energetically and kinetically favored in comparison
with peptide backbone fragmentation. To illustrate this point, Lanucara
et al. were able to reduce the extent of neutral loss and thus increase
backbone fragmentation by increasing the proton mobility of phosphopeptides
via enzymatic removal of the basic C-term residue.[34]Laskin et al. recently showed that the phosphoric
acid neutral
loss is likely a two-step process under limited proton mobility conditions,
whereby the phosphate abstraction is followed by the dissociation
of the ion–molecule complex, the phosphopeptide sequence influencing
the overall observed reaction kinetics.[35] They concluded that phosphate neutral loss under mobile conditions
exhibited a relatively high dissociation barrier with a loose transition
state, while neutral loss under nonmobile conditions is characterized
by a lower dissociation barrier with a tight transition state.A bias toward higher neutral loss abundance during CID of serine
phosphorylated peptides in comparison with threonine phosphorylated
peptides has been documented, especially under high mobile proton
conditions, and can be explained by a difference in fragmentation
dynamics.[29,31] Meta-phosphoric (HPO3) neutral
loss can also occur (Figure C), albeit much less frequently than the phosphoric acid neutral
loss, potentially leading to the gas-phase rearrangement of the phosphate
group. This meta-phosphate loss can be accompanied by a water loss
from residues containing a hydroxyl moiety (in particular, nonmodified
serine and threonine residues) or carboxyl groups, cumulatively resulting
in an apparent phosphoric acid loss.[36,37] This implies
that the dehydration site is not indicative of the position of the
phosphorylation site.Cui et al. showed that the proton mobility
conditions also influence
the competition between the direct loss (i.e., SN2 mechanism)
and the combined loss, the probabilities of the latter increasing
under limited proton mobility and nonmobile proton conditions.[36] This is in accordance with the calculations
made by Rožman, revealing that HPO3 loss can compete
with H3PO4 loss under limited proton mobility
conditions.[31] In an effort to limit the
neutral loss extent, a chemical derivatization strategy mitigating
charge-directed mechanisms leading to phosphate neutral loss was developed,[38] while addition of a dinuclear gallium complex
improved the stability of the phospho-ester bond during CID.[39]In an extensive study, factors influencing
the occurrence and abundance
of neutral losses under CID conditions were analyzed by Brown et al.[40] They verified that the extent of neutral loss
is highly sequence-dependent, whereby the proton mobility alone cannot
explain the differences observed from peptide to peptide sequence.
For this study, they compiled more than 30 000 ion trap based
CID fragmentation spectra originating from tryptic phosphopeptides.
They confirmed the correlation between neutral losses and proton mobility
(Figure A) and showed
that neutral loss was on average 3-fold higher for b-ions in comparison
with y-ions, indicating that proper phosphosite localization should
rely more on y-ions series.
Figure 3
Fraction of signal attributable to neutral loss
observed in spectra
derived from 5749 monophosphorylated pS and pT tryptic peptides using
ion trap CID fragmentation (A). The extent of neutral loss correlates
with proton mobility but does not fully explain the observed behavior.
The observed fraction of signal intensity attributable to H3PO4 neutral loss, as a function of the proximity of the
phosphorylation site to both peptide termini, is displayed in part
B. Reprinted by permission from Springer. J. Am. Soc. Mass
Spectrom., Large-Scale Examination of Factors Influencing
Phosphopeptide Neutral Loss during Collision Induced Dissociation.
Brown, R., Stuart, S. S., Houel, S., Ahn, N. G., Old, W. M. Vol. 26, 2015, pp. 1128–1142 (ref (40)). Copyright 2015.
Fraction of signal attributable to neutral loss
observed in spectra
derived from 5749 monophosphorylated pS and pT tryptic peptides using
ion trap CID fragmentation (A). The extent of neutral loss correlates
with proton mobility but does not fully explain the observed behavior.
The observed fraction of signal intensity attributable to H3PO4 neutral loss, as a function of the proximity of the
phosphorylation site to both peptide termini, is displayed in part
B. Reprinted by permission from Springer. J. Am. Soc. Mass
Spectrom., Large-Scale Examination of Factors Influencing
PhosphopeptideNeutral Loss during Collision Induced Dissociation.
Brown, R., Stuart, S. S., Houel, S., Ahn, N. G., Old, W. M. Vol. 26, 2015, pp. 1128–1142 (ref (40)). Copyright 2015.The study of Brown et al. also
elucidated that the phosphosite
proximity to the predominantly protonated N-terminus enhances the
magnitude of the neutral loss (Figure B) in mobile conditions. To explain this behavior,
the authors proposed a mechanism in which the formation of a hydrogen
bond between the N-terminus, which is the least basic group protonated
with high occupancy, and the phospho-group is favoring the SN2 reaction leading to neutral loss. Such effect is not observed in
limited conditions as the N-terminus is less likely to be protonated.Surprisingly, basic residues, which are supplying the protons under
mobile conditions and promoting neutral loss reaction in limited and
nonmobile proton conditions, suppress neutral loss when directly adjacent
to the phosphosite, which the authors explained by steric hindrance
due to the formation of a strong hydrogen bond preventing the formation
of the oxazoline ring. More generally, under mobile conditions, strong
hydrogen bonds between the phospho-group and proximal basic residues
impede the phospho-group protonation or interaction with the less
stable charge donor protonated N-terminus, which, in fine, favors
backbone fragmentation following the mobile proton model. Additionally,
Brown et al. determined that neutral water and ammonia loss can compete
with phosphate neutral loss and that the presence of aspartic and
glutamic acids promote neutral loss, while proline, by reducing the
backbone flexibility, can hamper neutral loss by limiting distal interaction
between the phospho-group and basic residues.
Neutral Losses of Tyrosine
Phosphorylated Peptides
Tyrosine phosphorylated peptides
exhibit some specific fragmentation
characteristics. Upon CID, tyrosine phosphorylated (pY) peptide ions
can occasionally exhibit a meta-phosphoric (HPO3) neutral
loss but to a markedly lower extent when compared to neutral losses
observed during CID of serine and threonine phosphorylated peptides.[6,16] The SN2 reaction resulting in the H3PO4 elimination is hampered due to steric hindrance from the
aromatic group, which also stabilizes the C–O bond cleaved
during H3PO4 loss via resonance. HPO3 loss occurs via a charge-remote mechanism, while only limited H3PO4 loss is observed, which can only result from
a concomitant loss of HPO3 and water. In accordance with
an early report that neutral loss of the phosphate moiety from pY
peptides is charge dependent,[41] Everley
et al. recently reported that phosphate neutral loss from tyrosine
phosphorylated peptides drastically increases after isobaric labeling
(Figure ), during
which a primary amine is replaced by a tertiary amine of high gas-phase
basicity, thus substantially affecting the proton mobility.[42] In fact, 97% of peptides for which the neutral
loss was observed were of low proton mobility, highlighting the fact
that phosphotyrosine neutral loss should be seriously considered during
database searches of iTRAQ- and TMT-labeled phosphotyrosine peptides.
While intense neutral losses after CID of labeled pST peptides have
also been reported,[43,44] as far as we know the impact
of isobaric labeling on the extent of neutral loss during CID fragmentation
of pST peptides has yet to be quantified, but a similar trend can
be expected.
Figure 4
Isobaric labeling enhances neutral loss in phosphotyrosine
peptides.
Tyrosine phosphorylated peptides were reported as exhibiting neutral
loss when the neutral loss intensity was >33% of the base peak.
A
substantial increase in neutral loss is observed upon isobaric labeling,
which is caused by the lowering of the proton mobility. Note that
under ETD conditions (purple) no neutral loss was observed. Reproduced
from Everley, R. A.; Huttlin, E. L.; Erickson, A. R.; Beausoleil,
S. A.; Gygi, S. P. J. Proteome Res.2017, 16, 1069–1076 (ref (42)). Copyright 2017 American
Chemical Society.
Isobaric labeling enhances neutral loss in phosphotyrosinepeptides.
Tyrosine phosphorylated peptides were reported as exhibiting neutral
loss when the neutral loss intensity was >33% of the base peak.
A
substantial increase in neutral loss is observed upon isobaric labeling,
which is caused by the lowering of the proton mobility. Note that
under ETD conditions (purple) no neutral loss was observed. Reproduced
from Everley, R. A.; Huttlin, E. L.; Erickson, A. R.; Beausoleil,
S. A.; Gygi, S. P. J. Proteome Res.2017, 16, 1069–1076 (ref (42)). Copyright 2017 American
Chemical Society.
Resonance Excitation versus
Beam-Type CID
Low-energy
CID is achieved through either resonance excitation or beam-type collisional
dissociation.[20] While basically the same
fragmentation rules apply and both techniques are well suited for
the analysis of nonmodified peptides, several factors make beam-type
CID superior to resonance excitation in the analysis of phosphopeptides.[45] Ion trap CID (IT-CID) corresponds to the resonance
excitation of trapped peptide ions by applying a supplemental voltage
matching the precursor secular frequency. The depth of the trapping
potential well is such that, to prevent the ejection of ions, the
precursor can only be excited to a few electron volts of kinetic energy.
Hence, hundreds of low-energy collisions, over long activation times
(>10 ms), are necessary to build up enough internal energy to enable
fragmentation. Because of these long activation times, IT-CID is considered
as a slow-heating technique, during which extensive gas-phase rearrangements
can occur prior to dissociation. In the case of the fragmentation
of phosphorylated peptides, evidence for gas-phase rearrangement of
a phosphate moiety to another nonmodified serine or threonine residue
has been reported,[46] albeit with limited
impact on the phosphosite localization.[47,48] Activation
times in beam-type CID (including higher energy collisional dissociation,
HCD[49]) are shorter, as precursor ions are
not excited by resonance but instead are accelerated into the neutral
gas bath of a collision cell. The high focusing power of radiofrequency
(rf) multipole collision cells enables higher energy collisions with
the neutral gas without significant ion loss, decreasing the activation
time to approximately 0.1 ms. This reduces potential gas-phase rearrangement
of the phosphate moiety,[50] for which the
fastest reaction occurs on a millisecond time-scale.[31] More importantly, if neutral losses are common upon beam-type
CID, HCD has been shown to generate less phosphate neutral loss in
comparison with IT-CID and is thus better suited to pinpoint more
accurately phosphorylation sites.[36] In
addition, Diedrich et al. demonstrated that the use of a stepped HCD
collision energy (fragmentation at multiple collision energies) can
result in an increase of identified phosphopeptides and phosphosites
localization confidence.[51]During
IT-CID, only precursor ions are excited by resonance while fragments
fall off-resonance. For phosphopeptides this can result in the generation
of high abundant noninformative ions corresponding to the phosphate
neutral loss(es) from the precursor ion without further backbone fragmentation.
To address this issue, multistage activation (MSA) has been applied
to increase phosphopeptide identification through subsequent activation
of the neutral loss product ions.[52] In
contrast, during beam-type CID, all ions are activated and fragments
can undergo multiple sequential fragmentation events, resulting in
the generation of richer fragmentation spectra.[53] Fragments resulting from neutral loss can thus be subsequently
dissociated into sequence-informative b and y-ions, facilitating phosphopeptide
identification. As such, HCD fragmentation of phosphopeptides has
been reported to yield in general better identification scores than
IT-CID based fragmentation.[45,54,55] In some cases, the generation of an x-ion resulting from the fragmentation
of an ion that underwent phosphoric neutral loss (Figure ) can aid the phosphosite localization.[50] Similarly, even if neutral loss via elimination
mechanisms is a minor dissociation channel, Pilo et al. recently showed
that CID fragmentation of a dehydroalanine-containing product yields
atypical c/z ions after cleavage of the N–Cα bond of
the dehydroalanine residue. These low abundance c/z ions were used
to pinpoint the phosphosite within a serine phosphorylated peptide[56] but do not constitute absolute diagnostic ions
as dehydroalanine can also originate from serinedehydration.
Figure 5
Formation of
the atypical 4,5-dihydrooxazolyl x ion, resulting
from the fragmentation of an ion that underwent a phosphoric acid
loss via the SN2 mechanism but cannot fragment following
the classic oxazolone pathway. Note that the formation of an x-ion
of the same mass is possible (albeit infrequent) during beam type
fragmentation of nonphosphorylated peptides, but a quality filtering
based on the higher intensity of the x-ions over y-ions enables one
to filter out nonphosphorylated peptides. According to two studies,
the x-ion was present in 24 and 36% of spectra deriving from HCD fragmentation
of pST peptides.[50,53] Reproduced from Kelstrup, C.
D.; Hekmat, O.; Francavilla, C.; Olsen, J. V. J. Proteome
Res.2011, 10, 2937–2948
(ref (50)). Copyright
2011 American Chemical Society.
Formation of
the atypical 4,5-dihydrooxazolyl x ion, resulting
from the fragmentation of an ion that underwent a phosphoric acid
loss via the SN2 mechanism but cannot fragment following
the classic oxazolone pathway. Note that the formation of an x-ion
of the same mass is possible (albeit infrequent) during beam type
fragmentation of nonphosphorylated peptides, but a quality filtering
based on the higher intensity of the x-ions over y-ions enables one
to filter out nonphosphorylated peptides. According to two studies,
the x-ion was present in 24 and 36% of spectra deriving from HCD fragmentation
of pST peptides.[50,53] Reproduced from Kelstrup, C.
D.; Hekmat, O.; Francavilla, C.; Olsen, J. V. J. Proteome
Res.2011, 10, 2937–2948
(ref (50)). Copyright
2011 American Chemical Society.Beam-type CID (e.g., HCD) spectra are also characterized
by the
presence of abundant informative immonium ions in the low-mass region.[53,57,58] Immonium ions result from two
sequential cleavages and are particularly abundant in the case of
residues presenting both a heteroatom and an aromatic ring that can
provide charge stabilization. While commonly observed after beam-type
CID, such ions are usually difficult to detect after IT-CID due to
the lack of sequential fragmentation and low-mass cutoff applied,
which is inherent to the use of the rf trapping field. Immonium ions
bear a particular importance in phosphoproteomics, as the phosphotyrosineimmonium ion possesses a unique m/z value and can thus serve as a diagnostic tool. Diedrich et al. showed
that the pY immonium ion abundances scale up with the HCD normalized
energy,[51] and in addition to serve diagnostic
purposes, such ions have been used in analytical strategies to scan
for phosphotyrosine precursors.[59] The chance
to observe the diagnostic pY immonium ion is also directly correlated
with the abundance of the tyrosine phosphorylated peptides. In our
laboratory, at normal HCD collision energy, the immonium ion is observed
for ∼90% of the pY phosphopeptides after pY immunoprecipitation,
while only for ∼20% of the low abundant pY phosphopeptides
present in samples in which pSTY are coenriched. Everley et al. also
recently showed that the low proton mobility resulting of isobaric
labeling alters the probability of observing the pY diagnostic immonium
ion,[42] which is in line with the fact that
the immonium ion is only observed at high collision energy in the
case of iTRAQ labeled pY peptides.[43]Another significant difference between the two techniques lies
in the fact that beam-type dissociation processes are usually coupled
with high-resolution detection of fragments by time-of-flight (TOF)
or Fourier transform (FT) Orbitrap mass analyzers, where IT-CID is
often coupled with low-resolution ion trap detection. While ion traps
exhibit higher sensitivity and often faster duty cycles, measuring
the fragment ions with high mass accuracy and high signal-to-noise
ratio is essential to achieve confident phosphosite localization.
In the early days of HCD, CID-IT was reported to outperform beam-type
CID in term of identification numbers (but not in terms of scoring),
which was imputed to the slower duty cycle time of HCD.[54] Today, thanks to instrumental progress over
the past decade, beam-type CID (and more particularly HCD) outperforms
CID-IT for the identification of phosphopeptides and subsequent phosphosite
localization.[45]
Electron Transfer/Capture
Dissociation of Phosphopeptides
Principles of Electron Transfer/Capture Dissociation
In MS-based proteomics, electron capture/transfer induced dissociation
(ExD) techniques have become viable alternatives to the collisional
activation techniques discussed above.[60] Introduced in 1998, electron capture dissociation (ECD) for peptide
and protein sequencing relies on the capture of a low-energy electron
(generally below 1 eV) by a multicharged gas-phase cation.[61] Electron capture forms an electronically excited
charge-reduced radical cation, initiating radical-ion dissociation
reactions. Quadrupole ion traps cannot efficiently trap electrons
due to the low mass cutoff of the rf field, and for this reason, ECD
was initially performed in FT ion cyclotron resonance (FTICR) instruments,
as the strong magnetic fields applied enable simultaneous trapping
of analytes and electrons. Introduced in 2004, electron transfer dissociation
(ETD) is an alike technique involving an ion/ion reaction during which
an anion reagent with low electron affinity (usually fluoranthene)
transfer an electron to a polycharged peptide cation.[62] By applying unbalanced rf in an ion trap, it is possible
to obtain charge-sign independent trapping, i.e., it is possible to
efficiently trap both anionic reagent and cationic peptides. The widespread
availability, low cost, and robustness of ion trap instruments as
well as its compatibility with chromatographic time scales largely
helped to popularize ETD, making it nowadays the preferred ExD technique
in the field of bottom-up proteomics. Notably, a compact ECD cell
that can be implemented on Orbitrap and Q-TOF platforms may rejuvenate
ECD in the near future.[63,64]Besides the differences
in electron capture/transfer, ECD and ETD hold much in common from
a mechanistic point of view. Different dissociation mechanisms have
been proposed and are discussed in several recent reviews.[65−67] Very briefly, the two well-accepted mechanisms (named Cornell[61] and Utah–Washington[68] mechanisms) result in the formation of an unstable aminoketyl
radical, leading ultimately to the cleavage of the peptide backbone
bond between the amidenitrogen atom and the Cα atom of the
neighboring residue (N–Cα bond). This preferred fragmentation
pathway involves random cleavage along the peptide backbone and produces
complementary even-electron c-type and odd-electron z•-type ions series. When considering its application to phosphoproteomics,
ExD-based fragmentations possess several advantageous features. First,
the phosphate group does not possess bound anionic states and exhibits
negative electron and hydrogen affinity;[69] hence, phosphorylation of side chain residues are virtually nonreactive
upon ExD. ExD also enables, in theory, to achieve better sequence
coverage through the generation of more complete ion series, which
is an important factor for confident phosphorylation site localization,
as discussed later. Finally, after electron capture/transfer, backbone
cleavages occur faster than the rate of internal vibrational energy
redistribution, diminishing the undesired neutral losses of the phosphate
moieties. This ability of cleaving strong backbone bonds while preserving
weak phosphoester bonds, which are labile upon vibrational activation,
was reported early on and still constitutes one of the major advantages
of ExD dissociation over CID-based methods, especially for phosphopeptides
fragmentation.[70−73]However, ExD techniques are hampered by two major bottlenecks.
The first one is the strong dependency of the ExD efficiency on the
precursor charge density, as inter alia electron
capture cross sections and ion/ion reaction rates are highly dependent
on the peptide charge state.[74,75] As a result, ExD of
doubly protonated peptides generally suffers from lower fragmentation
efficiency when compared to CID/HCD.[76] During
ETD, the competition between undesirable proton transfer from the
peptide to the anion reagent and electron transfer is also influenced
by the peptide charge state, the detrimental proton transfer being
more prominent in the case of doubly charged peptides.[77] The second major bottleneck is that ExD necessitates
longer reaction times in comparison with CID activation times, especially
when considering beam-type CID. Indeed, while ExD are fast processes,
they suffer from low fragmentation frequency and require long reaction
times to achieve extensive precursor ion dissociation. For example,
ETD reaction times to reach optimal fragmentation are in the order
of 100 ms for doubly charged precursor and less than 50 ms for triply
charged peptides.[78] These longer cycle
times (compared to CID) have an important impact on the number of
identifiable (phospho)peptides, as discussed later, and as such limits
the dynamic range and depth of ETD experiments.The capture
or transfer of low-energy electrons does not always
disrupt noncovalent interactions, which can result in the product
ions staying together, termed nondissociative ExD. The prevalence
of these phenomena, termed sometimes ECnoD/ETnoD, during which such
complexes appear as nondissociated charge reduced precursor ions,
is also directly related to the precursor charge density.[79] Indeed, Coulombic repulsion between positively
charged side chains favors dissociation, and for the same charge state,
a higher m/z increases the chance
of noncovalent interactions. Remarkably, detected tryptic phosphopeptides
have been reported to present a higher average charge state when compared
to nonmodified tryptic peptides,[80,81] which would
favor dissociation of ExD products. However, this increase of average
charge state is caused by an increase of trypsin miscleavages due
to strong in-solution interactions between arginine/lysine side chains
and phosphate groups,[82,83] and in reality phosphopeptides
exhibit on average lower charge density than nonmodified tryptic peptides.[80] Because the ETD reaction is performed in a linear
ion trap at relatively high pressure (∼10–3 Torr) in comparison with ECD (∼10–10 Torr),
internal energies of the ETD fragment ions are reduced through collisional
cooling, which further hampers their subsequent dissociation. Phosphopeptide
chemical derivatization, or the addition of a dinuclear zinc complex,
which selectively binds to the phosphate group thus increasing the
phosphopeptide charge state, notably led to improved ExD.[84−86]Moreover, ECnoD/ETnoD is notably problematic for phosphorylated
peptides because of the strong intramolecular noncovalent interactions
in the gas phase between the phospho-group and the side chains of
basic amino acid residues, which have been reported to be able to
survive ExD[87] and even low-energy CID.[88] Cooper and co-workers showed that such gas-phase
interactions, identified as being salt bridges and ionic hydrogen
bonds, influence the observed fragmentation patterns as well as fragment
abundances and overall have deleterious effects on ECD, which can
be mitigated by performing ECD on precursors of higher charge state.[89,90] Using synthetic model phosphopeptides, they later demonstrated that
the structures of phosphopeptides, and not only their sequences influence
ECD fragmentation behaviors.[91] Moss et
al. reported an unusual ECD behavior resulting in the preferred neutral
loss of phosphoric acid after ECD (but not ETD) of a phosphorylated
pentapeptide, which the authors explained by a dipole-guided electron
capture at the arginine side chain.[69] While
the presence of a phosphate group can influence ExD fragmentation
of phosphopeptides, Chen et al. demonstrated that it has little effect
on the ECD behavior of phosphoproteins, as fragmentation patterns
of phosphorylated and nonphosphorylated proteins were highly similar.[92]
ExD with Supplemental Activation to Extend
Sequence Coverage
In ExD, supplemental activation enables
to circumvent some of the
above-described issuesthrough the introduction of additional energy
to the precursor ions, increasing the efficiency of ExD-based techniques,
especially for the fragmentation of lower charge peptides. This extra
energy is mainly supplied as vibrational energy via collisional activation
or infrared photoexcitation. In contrast with ECD, the collisional
cooling occurring during the ETD reaction mitigates the effects of
preactivation. CID activation during ECD is nearly impossible due
to the high vacuum restrictions of FTICR instruments, while resonance
excitation during the ETD reaction has undesirable effects, as it
would increase the precursor velocity and impair good spatial overlap
between the two ion clouds, thus negatively impacting the ion/ion
reaction rate. For this reason, CID activation can either occur before
or after ECD or after ETD. ETcaD, during which the charge-reduced
precursor is activated by resonance to disrupt noncovalent interactions
was introduced in 2007 and greatly alleviated ETnoD issues.[93]In 2012, the Heck lab introduced EThcD
fragmentation, which corresponds to an all-ions dual fragmentation,
i.e., both unreacted species and product ions resulting from ETD are
subjected to subsequent HCD fragmentation.[94] This resulted in extensive backbone fragmentation via the concomitant
generation of both c/z and b/y ion series. EThcD was later applied
to phosphoproteomics (Figure ), leading to significant increases in sequence coverage when
compared to HCD or ETD alone, along with an increase in confidence
for site localization.[95] In addition, EThcD
proved to be less time-consuming than ETcaD, due to the shorter activation
times of beam-type CID.
Figure 6
EThcD fragmentation spectrum of a doubly phosphorylated
peptide.
Complete sequence coverage is achieved while neutral losses are minimal,
enabling unambiguous localization of both phosphosites. Reproduced
from Frese, C. K.; Zhou, H.; Taus, T.; Altelaar, A. F. M.; Mechtler,
K.; Heck, A. J. R.; Mohammed, S. J. Proteome Res.2013, 12, 1520–1525 (ref (95)). Copyright 2013 American
Chemical Society.
EThcD fragmentation spectrum of a doubly phosphorylated
peptide.
Complete sequence coverage is achieved while neutral losses are minimal,
enabling unambiguous localization of both phosphosites. Reproduced
from Frese, C. K.; Zhou, H.; Taus, T.; Altelaar, A. F. M.; Mechtler,
K.; Heck, A. J. R.; Mohammed, S. J. Proteome Res.2013, 12, 1520–1525 (ref (95)). Copyright 2013 American
Chemical Society.The Coon lab recently
introduced activated ion ETD (AI-ETD) for
phosphoproteomics (Figure A).[80] During AI-ETD, peptides are
activated during the ETD reaction by infrared photoactivation, as
photon irradiation does not impede the ETD process. Therefore, in
contrast with ETcaD and EThcD, the application of supplemental energy
does not lead to an increase in duty cycle time. During infrared multiphoton
dissociation (IRMPD), analytes are irradiated by low energy photons
(λ = 10.6 μm; corresponding to an energy of ∼0.1
eV per photon). Absorption of hundreds of photons over long activation
times is necessary to trigger dissociation, thus enabling intramolecular
vibrational energy redistribution. For that reason, the stepwise activation
process can be compared to the one of slow-heating CID and preferably
induces the formation of b/y ions series. Notably, the P–O
stretch presents a remarkably high infrared absorption cross-section,
making IRMPD of phosphopeptides a particularly efficient process.[96,97] As a consequence, AI-ETD of phosphopeptides resulted in the generation
of more b/y ions when compared with the dissociation of nonmodified
peptides.[98]
Figure 7
Comparison of ETD and
AI-ETD of a singly phosphorylated, doubly
protonated peptide (A). AI-ETD enables complete sequence coverage
and confident phosphosite localization. In contrast, during ETD, the
ETnoD process is preventing the achievement of full sequence coverage
and site localization. The extent of neutral phosphate moiety loss
observed upon ETD, ETcaD, EThcD, AI-ETD (at 12 W and 15 W laser power),
AI-ETD+ (AI-ETD at 15 W followed by additional IRMPD) and HCD is displayed
in parts B and C. Reproduced from Riley, N. M.; Hebert, A. S.; Dürnberger,
G.; Stanek, F.; Mechtler, K.; Westphall, M. S.; Coon, J. J. Anal. Chem.2017, 89, 6367–6376
(ref (80)). Copyright
2017 American Chemical Society.
Comparison of ETD and
AI-ETD of a singly phosphorylated, doubly
protonated peptide (A). AI-ETD enables complete sequence coverage
and confident phosphosite localization. In contrast, during ETD, the
ETnoD process is preventing the achievement of full sequence coverage
and site localization. The extent of neutral phosphate moiety loss
observed upon ETD, ETcaD, EThcD, AI-ETD (at 12 W and 15 W laser power),
AI-ETD+ (AI-ETD at 15 W followed by additional IRMPD) and HCD is displayed
in parts B and C. Reproduced from Riley, N. M.; Hebert, A. S.; Dürnberger,
G.; Stanek, F.; Mechtler, K.; Westphall, M. S.; Coon, J. J. Anal. Chem.2017, 89, 6367–6376
(ref (80)). Copyright
2017 American Chemical Society.
Extent of Neutral Losses Following ETD-Based Activation
Naturally, supplying additional activation energy deposited as vibrational
energy can lead to the neutral losses of the phosphate moiety, which
could mitigate the benefits of ETD fragmentation. Riley et al. recently
investigated neutral loss extent upon fragmentation of endogenous
phosphopeptides using different ETD-based techniques.[80] As expected, ETD fragmentation did not induce significant
neutral losses (Figure B,C). ETcaD and EThcD exhibited low phosphate neutral losses, to
a similar extent. Observed neutral losses mainly resulted from phosphate
loss of b-ions (which is consistent with previous report that b-ions
are more prone to neutral losses[40]). AI-ETD
exhibited the highest magnitude of neutral losses among all ETD-based
techniques, which can be explained by the slow-heating of all ions
and high efficiency of infrared photons absorption by phosphopeptides.
These neutral losses however did not impede phosphopeptide identification
and localization, as AI-ETD led to the identification of the highest
number of localized phosphopeptides.
Noncanonical Phosphorylations
Besides the well-studied
pSTY phosphorylations, it is established that phosphorylation can
also occur on six other residues (i.e., His, Arg, Lys, Cys, Asp, and
Glu).[99] The amount of information on the
gas-phase fragmentation behavior of these less common phosphorylation
events is however still limited in comparison with the STY phosphorylation.
Because histidine phosphorylation occurs on the aromatic imidazole
side-chain, direct H3PO4 neutral loss via SN2 or elimination mechanisms is impossible and can thus only
occur via combined losses of HPO3 and a water molecule.
Still, in contrast with CID of tyrosine phosphorylated peptides, a
prominent H3PO4 loss is observed upon IT-CID,[100,101] alongside less abundant HPO3 and H5PO5 ancillary neutral losses (Figure A).[102] The high
magnitude of neutral losses can be attributed to the lower energy
required to break the N–P bond in comparison with the activation
threshold of the O–P bond.[103] Oslund
et al. investigated the source of the water molecules in the combined
neutral loss pathway and deduced that the water loss mainly originated
from the carboxylic moieties at the C-terminal residue and from aspartate/glutamate
side chains.[102] When comparing the fragmentation
behavior of pHispeptides with those of pSTY peptides, they concluded
that the neutral loss triplet was preferentially observed during fragmentation
of pHispeptides (∼40% of pHispeptides against less than 5%
for pSTY peptides) and used the presence of this specific neutral
loss to trigger additional subsequent fragmentation.
Figure 8
(A) IT-CID fragmentation
spectra of phosphopeptides of sequence
TSHYSIMAR, phosphorylated at the histidine (top) or adjacent tyrosine
residue (bottom). Reproduced from Oslund, R. C.; Kee, J.-M.; Couvillon,
A. D.; Bhatia, V. N.; Perlman, D. H.; Muir, T. W. J. Am. Chem.
Soc.2014, 136, 12899–12911
(ref (102)). Copyright
2014 American Chemical Society. (B) HCD spectra of a pHis peptide.
The phosphosite is unambiguously localized and the neutral loss triplet
as well as the low mass diagnostic phosphohistidine immonium ion are
observed. Reprinted by permission from Macmillan Publishers Ltd.:
NATURE. Widespread bacterial protein histidine phosphorylation revealed
by mass spectrometry-based proteomics. Potel, C. M., Lin, M.-H., Heck,
A. J. R., Lemeer, S. Nat. Methods2018, Vol. 15, pp. 187–190 (ref (107)). Copyright 2018.
(A) IT-CID fragmentation
spectra of phosphopeptides of sequence
TSHYSIMAR, phosphorylated at the histidine (top) or adjacent tyrosine
residue (bottom). Reproduced from Oslund, R. C.; Kee, J.-M.; Couvillon,
A. D.; Bhatia, V. N.; Perlman, D. H.; Muir, T. W. J. Am. Chem.
Soc.2014, 136, 12899–12911
(ref (102)). Copyright
2014 American Chemical Society. (B) HCD spectra of a pHispeptide.
The phosphosite is unambiguously localized and the neutral loss triplet
as well as the low mass diagnostic phosphohistidineimmonium ion are
observed. Reprinted by permission from Macmillan Publishers Ltd.:
NATURE. Widespread bacterial protein histidine phosphorylation revealed
by mass spectrometry-based proteomics. Potel, C. M., Lin, M.-H., Heck,
A. J. R., Lemeer, S. Nat. Methods2018, Vol. 15, pp. 187–190 (ref (107)). Copyright 2018.Arginine phosphorylated peptides
also exhibited intense H3PO4 neutral loss from
the precursor ions upon CID,[104,105] sometimes accompanied
by H5PO5 loss but not
HPO3. Notably, a pArgpeptide exhibited the same neutral
loss pattern as a pHispeptide.[104] As for
pHispeptides, the additional water loss originates from the C-terminus
or residue side chains containing carboxyl and hydroxyl functions,
the extent of observed neutral loss(es) is dependent on the proton
mobility and size of the pArgpeptide.[104]The extent of the phosphate group neutral loss, combined with
the
fact that it involves losses from at least two distinct functional
groups within the peptide complicates accurate localization of N-phosphosites.
Moreover, gas-phase rearrangements of N-phosphorylation to the C-terminus
or other acceptors upon CID fragmentation have been reported.[100,104,106] Low resolution IT-CID and HCD
fragmentation of synthetic pArgpeptides led to a false localization
rate of 10–20%.[104] It however has
to be noted that such false localization constituted false negative
identification of N-phosphorylation (i.e., mislocalization of the
N-phosphosite at another residue). Several groups have compared the
fragmentation patterns of synthetic pHispeptides with those of identified
endogenous pHispeptides to achieve confident identification/localization
of pHispeptides,[101,102,107] while a pArg spectral library was built to study arginine phosphorylation
in a bacterium.[108] ETD was also reported
to improve confident phosphosite localization of both pArg[104,105] and pHis,[100] and it was demonstrated
that the pHisimmonium ion of specific m/z 190.0367 can be used as a diagnostic ion[107] (Figure B).Lysine phosphorylation is the least studied of the three
possible
N-phosphorylations, and its analysis by mass spectrometry is somehow
more delicate. While CID was successfully used for the study of pHis/pArg,
CID of pLyspeptides resulted in the complete loss of the phosphate
group, mainly as phosphoric acid.[109,110] ECD and ETD
enabled the identification of lysine phosphorylated peptides,[109,110] but in contrast with pArg, gas-phase scrambling of the phosphate
group was observed even upon ETD.[111] Concerning
cysteine phosphorylation, intense H3PO4 and
HPO3 neutral losses were reported upon vibrational activation.[112,113] Bertran-Vicente et al. reported that while HCD resulted in complete
loss of the phosphate group for some pCyspeptides, EThcD permitted
the unambiguous localization of cysteinephosphosites.[113]We could not find any information concerning
the fragmentation
behavior of pAsp and pGlupeptides, which can be explained by the
fact that no chemical approach to synthesize pAsp and pGlupeptides
exists as well as no tailored enrichment method.[114] Finally, pyrophosphorylations of Ser and Thr were confidently
identified and localized by EthcD fragmentation, following a triggering
approach when a diagnostic neutral loss doublet of 98 and 178 Da was
observed upon CID fragmentation.[115]
Identification
of Phosphopeptides
Different peptide sequence matching search
engines and algorithms
coexist, such as Mascot,[116] Sequest,[117] Andromeda,[118] MSAmanda,[119] and MS-GF+,[120] that
all can be used to match experimental tandem mass spectra to peptide
sequences contained in a database. In addition to computing a score
reflecting the reliability of peptide identification, all these search
engines utilize a target-decoy strategy to estimate the false discovery
rate (FDR).[121] During the decoy search,
MS2 spectra are searched against decoy sequences derived
from the randomization or reversal of the original target database’s
sequences, enabling an estimation of the number of random matches
(false positives) in the target search and to control the number of
false discoveries, which constitutes a second reliability measure
for peptide identification. It could be logical to think that unambiguous
identification of phosphopeptides by CID is more difficult than identification
of their nonphosphorylated counterparts. Indeed, the fact that neutral
losses compete with backbone fragmentation (i.e., there are less b
and y ions susceptible to match theoretical fragment masses) combined
with the increase of the number of fragments obtainable due to the
increase of accessible dissociation channels could hinder the identification
of phosphopeptides by standard search engines and give rise to more
spurious matches. This would be particularly true in the case of low-resolution
ion trap CID, during which substantial nonsequence informative neutral
loss from the precursor ion can occur. However, by producing richer
spectra due to the possibility of sequential fragmentation events
at higher energies, combined mostly with high-resolution determination
of fragment masses, beam-type CID/HCD largely circumvents these issues.
This was clearly illustrated in the analysis of two large libraries
of synthetic phosphopeptides and their nonphosphorylated counterparts
(>100 000 peptides each) by HCD fragmentation, which surprisingly
revealed that phosphopeptides were in fact easier to identify[122] (i.e., the FDR values were significantly lower
in comparison with their nonphosphorylated peptides and this at any
score value, Figure ).
Figure 9
Identification of phosphopeptides using HCD fragmentation seems
to be simpler than the identification of their nonphosphorylated counterparts.
Here, the false discovery rate (FDR) is plotted as a function of the
Andromeda score. Similar trends were observed when the Mascot search
engine was used. Reprinted by permission from A large synthetic peptide
and phosphopeptide reference library for mass spectrometry-based proteomics.
Marx, H., Lemeer, S., Schliep, J. E., Matheron, L., Mohammed, S.,
Cox, J., Mann, M., Heck, A. J. R., Kuster, B. Nat. Biotechnol.2013, Vol. 31, pp. 557–564
(ref (122)). Copyright
Springer Nature, Nature Biotechnology 2013.
Identification of phosphopeptides using HCD fragmentation seems
to be simpler than the identification of their nonphosphorylated counterparts.
Here, the false discovery rate (FDR) is plotted as a function of the
Andromeda score. Similar trends were observed when the Mascot search
engine was used. Reprinted by permission from A large synthetic peptide
and phosphopeptide reference library for mass spectrometry-based proteomics.
Marx, H., Lemeer, S., Schliep, J. E., Matheron, L., Mohammed, S.,
Cox, J., Mann, M., Heck, A. J. R., Kuster, B. Nat. Biotechnol.2013, Vol. 31, pp. 557–564
(ref (122)). Copyright
Springer Nature, Nature Biotechnology 2013.Moreover, no significant biases in favor of identification
of pS,
pT, or pY peptides as well as any biases toward amino acids sequence
motifs were observed. The latter has some important implications,
as it signifies that observed statistically enriched phosphorylation
motifs derive from specific kinase activities and not from enhanced
CID fragmentation due to the presence of proline and aspartic acid
residues.[17] This study, also revealed that
in terms of number of identified phosphopeptides, HCD outperformed
ETD fragmentation, once again underpinning that neutral losses observed
during HCD fragmentation do not significantly impair the identification
of phosphopeptides. Surprisingly, in this study, ETD did not surpass
HCD fragmentation in identifying highly charged precursors. If the
overlap between phosphopeptides identified by both techniques is relatively
high (∼70%), it however appears that HCD and ETD remain complementary
dissociation methods. It should be noted that the synthetic library
was made by sequential permutation of amino acids around well-known
phospho-motifs, generating many in sequence (and chemical nature)
alike monophosphorylated peptides, which may not fully represent the
authentic phosphoproteome.ETD accompanied by supplemental activation
usually gives rise to
a higher confidence in phosphopeptide identification, notably through
the generation of dual ion series, resulting in higher scores. Riley
et al. reported that EThcD and AI-ETD both resulted in an ∼15%
increase of the median identification score in comparison with HCD,
while ETcaD yields significantly lower scores (∼17% decrease
of the median score when compared to HCD).[80] In the original EThcD paper, Frese et al. reported an ∼28%
increase of the average score using EThcD fragmentation in comparison
with HCD.[95] Still, as extracted from different
available studies comparing different fragmentation techniques, it
appears that as of today, HCD remains the gold standard dissociation
technique in phosphoproteomics, due to its higher speed and efficiency.[80,95,122,123] Alternative dissociation techniques can yield more confident identification
scores but come often at the expense of the total number of identified
phosphopeptides (Figure ). Possibly in the future, ExD methods could compete with
HCD, if the activation times could be diminished, giving similar duty
cycles.
Figure 10
Comparison and overlap of the number of phosphopeptides identified
by using different fragmentation methods. Due to its shorter cycle
time, HCD outperforms all ETD-based techniques. Between the ETD-based
techniques, AI-ETD yields the highest number of phosphopeptide identifications,
followed by EThcD, ETcaD, and finally ETD. Reproduced from Riley,
N. M.; Hebert, A. S.; Dürnberger, G.; Stanek, F.; Mechtler,
K.; Westphall, M. S.; Coon, J. J. Anal. Chem.2017, 89, pp 6367–6376 (ref (80)). Copyright 2017 American
Chemical Society.
Comparison and overlap of the number of phosphopeptides identified
by using different fragmentation methods. Due to its shorter cycle
time, HCD outperforms all ETD-based techniques. Between the ETD-based
techniques, AI-ETD yields the highest number of phosphopeptide identifications,
followed by EThcD, ETcaD, and finally ETD. Reproduced from Riley,
N. M.; Hebert, A. S.; Dürnberger, G.; Stanek, F.; Mechtler,
K.; Westphall, M. S.; Coon, J. J. Anal. Chem.2017, 89, pp 6367–6376 (ref (80)). Copyright 2017 American
Chemical Society.
Phosphorylation Sites Localization
Although the identification
of phosphopeptides by MS-based proteomics has become more facile,
a serious challenge in today’s phosphoproteomics remains the
unambiguous localization of the phosphosites(s) within the identified
phosphopeptides, which is essential to understand the roles of phosphorylation
events. While as described above, neutral losses do not impair phosphopeptide
identification, it can seriously hamper the correct identification
of the localization of the phosphosite, as fragments resulting from
a neutral loss and nonmodified fragments (or nonmodified fragments
associated with commonly observed water loss in the case of phosphoric
acid loss) have the same mass and are thus undistinguishable. Moreover,
confident phosphosite localization often requires achieving complete
peptide sequence coverage, as several candidate phosphorylation sites
(i.e., Ser, Thr, Tyr, His, etc.) are often present within the same
peptide. The assessment of phosphosite localization confidence usually
requires dedicated additional computational approaches, as phosphosite
localizations reported by commonly used search engines can be quite
unreliable. To do so, two main paradigms exist: (i) computing the
probability of an incorrect match for each phosphopeptide isoforms
or (ii) using the difference between scores of the different phosphopeptide
isoforms. The former category regroups algorithms such as A-score,[124] PhosphoRS (or PtmRS),[125] the PTM-score of Andromeda,[118] Slomo,[126] while the latter corresponds to the Mascot
delta score,[127] the SLIP score in Protein
Prospector,[128] and Luciphor.[129] Most of these approaches are relatively similar
in term of basic principles (the score calculation of most search
engines being also probability based), but several factors could explain
the reported discrepancies between localization strategies. For example,
(i) the type of fragments considered, (ii) the peak depth, (iii) whether
the phosphate group neutral losses are used (iv) whether the algorithm
was designed for low or high mass resolution and accuracy measurements,
(v) how the localization probability is derived from the calculated
probabilities of a random match for each candidate.The pioneering
A-score method[124] will, for example, only
consider site-determining fragments (i.e., only ions enabling to discriminate
two candidates), while other algorithms consider all fragments in
the localization probability calculation. For the PTM-score, neutral
loss fragments will be automatically considered if they result in
an increase of the score while in phosphoRS, the user can choose to
use such fragments. In the case of fragmentation techniques inducing
substantial neutral losses, considering such peaks is resulting in
a significant increase of the number of localized phosphosites.[80] Nevertheless, one has to keep in mind that such
fragments possess the same mass than nonmodified fragments that suffered
water loss, which is commonly observed upon collisional activation
and as such it has been demonstrated that considering phosphate neutral
losses can lead to an increase of the false localization rate.[130]In the following, we will focus on two
of the most commonly used
algorithms, phosphoRS and the PTM-score of Andromeda to demonstrate
how localization probabilities are calculated and to illustrate potential
differences between localization algorithms. The peak depth, i.e.,
how many of the most intense peaks per m/z windows are considered for score calculation influences
the probability calculation, as the more peaks are considered, the
more the probability for a match to be random match increases. Peak
picking enables to eliminate low intensity peaks that can originate
from electric or chemical noise or coisolation of other precursor
ions, both of which can interfere with correct localization. The way
of determining the peak depth for each spectrum is different between
the two algorithms: it will be determined as the peak depth value
yielding the higher score for the PTM-score or as the higher score
difference between the different phosphopeptide isoforms if site-determining
fragments are present in phosphoRS. For the calculation of the PTM-score,
the peak depth is uniform across all m/z windows, while it is individually determined for each m/z window in phosphoRS. After peak picking, a value
of n matching theoretical fragments out of the N total number of peaks contained in the spectra is obtained.
The probability of each phosphopeptide isoform match to be random
is then determined by calculation of the probability of n matched peaks out of a total of N peaks to be random
(in other words, the smaller the probability, the more confident the
match), by applying a cumulative binomial distribution:The calculation of the probability p of a random peak to be matched is also different between
the two algorithms. For the PTM-score calculation, p is equal to the peak depth divided by the m/z window (100 m/z). This
derives from the fact that the PTM-score was first introduced for
ion trap detection, for which a mass tolerance of 0.5 Da is common
(i.e., there is 4% of chance of a peak to be a random match if 4 peaks
are picked in a 100 m/z window).
The PTM-score is thus using a conservative way of calculating p and significantly overestimates its value in the case
of high-resolution data. In phosphoRS, p is defined
asN corresponds to
the total
number of picked peaks, d to the fragment mass tolerance,
and w to the full mass range of the spectrum. Hence,
the value of p is adapted to the mass accuracy of
the mass analyzer. Finally, in both cases localization probabilities
are inferred from the differences of the values of P calculated for the different isoforms (Figure ).
Figure 11
Workflows for the calculation of phosphosite
localization probabilities
by phosphoRS (left), and MaxQuant/Andromeda (PTM-score, right). Reproduced
from Taus, T.; Köcher, T.; Pichler, P.; Paschke, C.; Schmidt,
A.; Henrich, C.; Mechtler, K.J. Proteome Res.2011, 10, 5354–5362 (ref (125)). Copyright 2011 American
Chemical Society. Reproduced from Cox, J.; Neuhauser, N.; Michalski,
A.; Scheltema, R. A.; Olsen, J. V.; Mann, M. J. Proteome Res.2011, 10, 1794–1805 (ref (118)). Copyright 2011 American
Chemical Society. Reprinted from Cell Rep., Vol. 8, Sharma, K., D’Souza, R. C. J., Tyanova, S., Schaab,
C., Wiśniewski, J. R., Cox, J., Mann, M. Ultradeep Human Phosphoproteome
Reveals a Distinct Regulatory Nature of Tyr and Ser/Thr-Based Signaling.
pp. 1583–1594. (ref (131)). Copyright 2014, with permission from Elsevier.
Workflows for the calculation of phosphosite
localization probabilities
by phosphoRS (left), and MaxQuant/Andromeda (PTM-score, right). Reproduced
from Taus, T.; Köcher, T.; Pichler, P.; Paschke, C.; Schmidt,
A.; Henrich, C.; Mechtler, K.J. Proteome Res.2011, 10, 5354–5362 (ref (125)). Copyright 2011 American
Chemical Society. Reproduced from Cox, J.; Neuhauser, N.; Michalski,
A.; Scheltema, R. A.; Olsen, J. V.; Mann, M. J. Proteome Res.2011, 10, 1794–1805 (ref (118)). Copyright 2011 American
Chemical Society. Reprinted from Cell Rep., Vol. 8, Sharma, K., D’Souza, R. C. J., Tyanova, S., Schaab,
C., Wiśniewski, J. R., Cox, J., Mann, M. Ultradeep Human Phosphoproteome
Reveals a Distinct Regulatory Nature of Tyr and Ser/Thr-Based Signaling.
pp. 1583–1594. (ref (131)). Copyright 2014, with permission from Elsevier.
False Localization Rate
Estimation
Most of the localization
algorithms score the confidence of the localization but do not estimate
the false localization rate (FLR), and instead arbitrary score cut-offs
are used, such as the 0.75 cutoff recommended for the Andromeda-PTMscore,
corresponding to so-called class I phosphosites.[1] Indeed, the estimation of the FLR is less straightforward
than the FDR estimation as a similar target-decoy approach is no longer
valid, as for an identified phosphopeptide with a given sequence an
incorrect localization of a phosphosite does not correspond to a random
match as many fragments will match both correct and incorrect localizations.[132] Therefore, reversal/randomization of sequences
contained in the database will not provide an accurate estimation
of false localizations. However, the addition of noncanonical phosphorylations
into the equation, combined with the continuous increase in the number
of MS2 spectra generated per experiment obviously increases
the risk of false localization, making FLR control more and more essential,
as it enables the user to make an informed decision about localization
score cutoff. One elegant strategy, used in the SLIP approach,[128] is to perform searches during which decoy phosphorylations
on glutamic acid (E) and proline (P) residues are allowed. There are
two rationales behind the choice of these two amino acids: the combined
occurrence of E and P matches the one of S and T (∼16%), and
E and P residues are present in some of the most dominant phosphorylation
motifs, ensuring the presence of decoy sites in the vicinity of the
real phosphosites. Luciphor also uses a target decoy approach to estimate
the FLR, in which decoy phosphopeptides are generated by placing the
phosphorylation on each residue present in the sequence.[129] As the number of decoy sites is usually higher
than the number of target sites, this constitutes a more conservative
approach.
Performance Comparison between Different Phosphosite Localization
Tools
While all localization tools perform adequately (i.e.,
present a low FLR), comparing their performance can be complicated
by the fact that most localization tools are tied to specific search
engines. This is for example the case for two of the most popular
localization tools, ptmRS and the PTM-score.[133] It is thus difficult to determine if observed discrepancies in confidently
localized phosphosites originate from the difference between search
engines or from the localization tools. Such differences between the
different search engines will not be discussed here. Analyzing synthetic
phosphopeptide libraries, for which the correct phosphorylation sites
are known is the only situation in which the true FLR can be determined.
However, one has to keep in mind that analyzing synthetic peptides
often does not reflect the complexity of real biological samples,
which present higher dynamic range and/or coisolation of peptides,
both resulting in the potential generation of lower quality spectra.Marx et al. compared the PTM-score, phosphoRS, and the Mascot delta
score performances on correctly localizing phosphosites from a large
phosphopeptides library.[122] Keeping both
the FDR and FLR at 1%, the Andromeda-PTM-score and Mascot-phosphoRS
score proved to be significantly more sensitive than the Mascot delta
score and exhibited similar numbers of correctly identified and localized
phosphopeptides. However, phosphosites localized by the two search
engine/localization tools presented a low overlap of around 50% after
analysis of both the synthetic phosphopeptides libraries and a biological
sample. Similarly, phosphoRS, A-score, and MD-score only presented
an overlap of 50% when considering unique unambiguously localized
phosphosites from a biological sample after database search with the
same search algorithm,[125] indicating that
there is still much room for improvement in computational strategies
for confident phosphosite localization. Notably, all localization
tools slightly underestimated the false localization rate at high
localization probability scores. In contrast, at a 1% FDR at the phosphopeptides
level, false discovery rates were significantly overestimated at lower
localization probability,[125,134] illustrating that
this discussion is still not closed. Finally, to our knowledge, the
robustness of these search algorithms has not been tested against
large libraries of multiple phosphorylated peptides.Besides
peak-probability based approaches, simulating HCD spectra
of phosphopeptides of enzymatically dephosphorylated peptides, while
laborious, enabled accurate phosphosite localization and presented
a lower FLR than A-score and ptmRS.[130] Recently,
Yang et al. developed a semi supervised learning algorithm to assess
the confidence of each amino acids obtained by de novo sequencing.[135] Interestingly, this algorithm
was used to calculate localization probabilities of phosphorylation
sites within phosphopeptides identified by either de novo sequencing or database search and significantly outperformed the
peak-probability based algorithms A-score and ptmRS both in terms
of number of identifications and localization accuracy after a database
search of synthetic phosphopeptides libraries.
Comparison of Different
Fragmentation Techniques
While
we concluded that HCD performs very well in term of identification
of phosphopeptides, it has been demonstrated that confident localization
of a phosphosite becomes increasingly difficult when the number of
putative phosphosites increase, and the closer another phosphorylation
site candidate is from the actual phosphosite.[122,134] Interestingly, this issue is much less observed with ETD fragmentation,
underlining that both the lack of neutral loss during ETD and the
more comprehensive coverage achieved via ETD random cleavages are
beneficial for correct phosphosite localization.[122] More generally, localization probability cutoff to reach
a 1% FLR is significantly higher in the case of HCD when compared
to ETD-based fragmentation methods.[122,125] By using
the Marx et al. data set, Wiese et al. demonstrated that in the case
of ETD fragmentation, a localization probability cutoff of 0.55 (lowest
reported localization probability threshold) was sufficient to fall
below 1% FLR after analyzing the data with the combination the Andromeda-PTM-score
at a 1% FDR.[134] In contrast, the authors
reported that a cutoff of 1 (maximum localization probability) was
necessary to reach a FLR below 1% in the case of HCD fragmentation.
Ferries et al. recently reported similar results with a different,
albeit significantly smaller synthetic phosphopeptide library.[123] They showed that with EThcD fragmentation,
the lowest localization probabilities calculated correspond to a FLR
below 1% (Figure ). This is in agreement with the initial report on EThcD of Frese
et al., who reported that EThcD fragmentation led to a significant
increase of the proportion of correct phosphosite localizations when
compared to HCD.[95]
Figure 12
False localization rate
determination in four different analytical
peptide fragmentation strategies: HCD and EThcD fragmentation with
high-resolution (Orbitrap, OT) or low-resolution (Ion trap, IT) fragment
ion detection. Reproduced from Ferries, S.; Perkins, S.; Brownridge,
P. J.; Campbell, A.; Eyers, P. A.; Jones, A. R.; Eyers, C. E. J. Proteome Res.2017, Vol. 16, pp 3448–3459 (ref (123)). Copyright 2017 American Chemical Society.
False localization rate
determination in four different analytical
peptide fragmentation strategies: HCD and EThcD fragmentation with
high-resolution (Orbitrap, OT) or low-resolution (Ion trap, IT) fragment
ion detection. Reproduced from Ferries, S.; Perkins, S.; Brownridge,
P. J.; Campbell, A.; Eyers, P. A.; Jones, A. R.; Eyers, C. E. J. Proteome Res.2017, Vol. 16, pp 3448–3459 (ref (123)). Copyright 2017 American Chemical Society.In addition, Ferries et al. also reported that
high-resolution
measurements of fragments (i.e., within the Orbitrap) strongly aids
to reduce false localizations. The Olsen group showed that increasing
Orbitrap resolution to 60k after HCD fragmentation as well as the
maximum injection time resulted in an increase in localization probabilitiesthrough the increase of spectra quality (at the cost of longer cycle
times).[136] Both Marx et al. and Wiese et
al. concluded that tyrosine phosphorylation is easier to localize
by HCD than serine and threonine phosphorylation.[122,134] This can be explained by the low tendency of pY peptides to undergo
neutral loss. Wiese et al. reported that a PTM-score localization
probability cutoff of 0.86 was sufficient to reach a FLR value below
1%, indicating that in the case of pY peptides, HCD is the preferred
fragmentation technique. Finally, Ferries et al. showed that localization
confidence logically decreases with the number of phosphorylation
present on a peptide, ETD-based techniques being more suited for the
accurate localization of multiple phosphorylation.[123] This is of importance as multiphosphorylated peptides represented
between 20 and 40% of identified phosphopeptides in recent phosphoproteomics
studies on mammalian cells.[81,137]
Concluding
Remarks
The field of MS-based phosphoproteomics
has seriously matured over the past decade and has been adopted by
dozens of research groups worldwide. Notwithstanding the achieved
successes and wide adaptation by the community, one should not forget
key remaining challenges, which can only be tackled by introducing
new technologies. To make such further advances in MS-based phosphoproteomics,
a deep understanding of peptide fragmentation mechanisms is essential,
whereby we hope that this review provides the interested researcher
a good starting point by summarizing the current knowledge. With new
peptide activation methods in mass spectrometry also come new search
algorithms and more advanced algorithms to determine with higher accuracy
the exact amino-acid being modified by the phosphorylation. Such information
is critical for the understanding of the role of that particular event.
Our review of the literature clearly illustrates that especially confident
site localization is a remaining challenge, whereby current approaches
still disagree with each other too much.Several new technologies
are on the horizon for MS-based phosphoproteomics that we intently,
and due to space limitations, did not discuss here. Alternative radical-induced
dissociation techniques presenting different fragmentation patterns
when compared to classic ExD have recently shown interesting preliminary
results when applied to the fragmentation of phosphopeptides,[138−140] but it remains unsure if they will be compatible with high-throughput
phosphoproteomics. Among the different photoactivation-based dissociation
methods applied to phosphopeptides sequencing,[141−144] UV-photodissociation (UV-PD) has emerged as a credible alternative
to CID/ExD. UV-PD has been exploited to phosphopeptides to identify
and site-localize the phosphate moiety. Early results indicate that
UV-PD is compatible with the high-throughput identification of phosphopeptides
while the propensity of the phosphate group to remain bound to the
peptide during fragmentation increases when compared to HCD.[145−147] Top-down and middle-down phosphoproteomics approaches, which have
greatly benefited from recent technological advances in alternative
fragmentation techniques may improve the identification and site localization
of protein phosphorylations and be ideally suited to not only map
multiple modifications on a peptide or protein at once but also determine
phospho-isoform abundance, reaction kinetics and to study functional
cross-talk between post-translational modifications.[80,148−155]While ionization in negative ion mode still suffers from a
lack
of efficiency when compared with the positive ion mode, negative ETD,
UVPD, and AI-ETD showed some promising results and could help to study
acid-labile phosphorylation events.[146,156−158] Ion mobility separation coupled to MS can provide and additional
layer of separation, allowing potentially the separation of phosphopeptide
isomers.[159−161] The availability of such synthetic phosphopeptide
isomers can also be used in targeted MS approaches, whereby the retention
time of the peptide and the comparison with the isotopically labeled
standard aid the confidence in identification, site localization,
and quantification.[162,163] In short, a lot of effort is
made by the proteomics research community to tackle important persisting
issues in MS based phosphoproteomics. It will be interesting to see
in a decade from now what the field will have further achieved. For
sure a lot of novel biological new insights, hopefully based on well-reproducible
and confident phosphoproteomics data.
Authors: Amanda M Palumbo; Scott A Smith; Christine L Kalcic; Marcos Dantus; Paul M Stemmer; Gavin E Reid Journal: Mass Spectrom Rev Date: 2011-02-03 Impact factor: 10.946
Authors: Samantha Ferries; Simon Perkins; Philip J Brownridge; Amy Campbell; Patrick A Eyers; Andrew R Jones; Claire E Eyers Journal: J Proteome Res Date: 2017-08-11 Impact factor: 4.466
Authors: Christian K Frese; A F Maarten Altelaar; Henk van den Toorn; Dirk Nolting; Jens Griep-Raming; Albert J R Heck; Shabaz Mohammed Journal: Anal Chem Date: 2012-10-31 Impact factor: 6.986
Authors: Joshua E Mayfield; Michelle R Robinson; Victoria C Cotham; Seema Irani; Wendy L Matthews; Anjana Ram; David S Gilmour; Joe R Cannon; Yan Jessie Zhang; Jennifer S Brodbelt Journal: ACS Chem Biol Date: 2016-12-01 Impact factor: 5.100
Authors: Nicholas M Riley; Alexander S Hebert; Gerhard Dürnberger; Florian Stanek; Karl Mechtler; Michael S Westphall; Joshua J Coon Journal: Anal Chem Date: 2017-04-17 Impact factor: 6.986
Authors: Agathe Marcelot; Ambre Petitalot; Virginie Ropars; Marie-Hélène Le Du; Camille Samson; Stevens Dubois; Guillaume Hoffmann; Simona Miron; Philippe Cuniasse; Jose Antonio Marquez; Robert Thai; François-Xavier Theillet; Sophie Zinn-Justin Journal: Nucleic Acids Res Date: 2021-04-19 Impact factor: 16.971
Authors: Laura K Muehlbauer; Alexander S Hebert; Michael S Westphall; Evgenia Shishkova; Joshua J Coon Journal: Anal Chem Date: 2020-12-03 Impact factor: 6.986
Authors: Hussain Dahodwala; Prashant Kaushik; Vijay Tejwani; Chih-Chung Kuo; Patrice Menard; Michael Henry; Bjorn G Voldborg; Nathan E Lewis; Paula Meleady; Susan T Sharfstein Journal: Curr Res Biotechnol Date: 2019-10-05