The gut-derived incretin hormone, glucagon-like peptide-1 (GLP1), plays an important physiological role in attenuating post-prandial blood glucose excursions in part by amplifying pancreatic insulin secretion. Native GLP1 is rapidly degraded by the serine protease, dipeptidyl peptidase-4 (DPP4); however, enzyme-resistant analogues of this 30-amino-acid peptide provide an effective therapy for type 2 diabetes (T2D) and can curb obesity via complementary functions in the brain. In addition to its medical relevance, the incretin system provides a fertile arena for exploring how to better separate agonist function at cognate receptors versus susceptibility of peptides to DPP4-induced degradation. We have discovered that novel chemical decorations can make GLP1 and its analogues completely DPP4 resistant while fully preserving GLP1 receptor activity. This strategy is also applicable to other therapeutic ligands, namely, glucose-dependent insulinotropic polypeptide (GIP), glucagon, and glucagon-like peptide-2 (GLP2), targeting the secretin family of receptors. The versatility of the approach offers hundreds of active compounds based on any template that target these receptors. These observations should allow for rapid optimization of pharmacological properties and because the appendages are in a position crucial to receptor stimulation, they proffer the possibility of conferring "biased" signaling and in turn minimizing side effects.
The gut-derived n class="Gene">incretin hormone, n class="Gene">glucagon-like peptide-1 (GLP1), plays an important physiological role in attenuating post-prandial blood glucose excursions in part by amplifying pancreatic insulin secretion. Native GLP1 is rapidly degraded by the serine protease, dipeptidyl peptidase-4 (DPP4); however, enzyme-resistant analogues of this 30-amino-acid peptide provide an effective therapy for type 2 diabetes (T2D) and can curb obesity via complementary functions in the brain. In addition to its medical relevance, the incretin system provides a fertile arena for exploring how to better separate agonist function at cognate receptors versus susceptibility of peptides to DPP4-induced degradation. We have discovered that novel chemical decorations can make GLP1 and its analogues completely DPP4 resistant while fully preserving GLP1 receptor activity. This strategy is also applicable to other therapeutic ligands, namely, glucose-dependent insulinotropic polypeptide (GIP), glucagon, and glucagon-like peptide-2 (GLP2), targeting the secretin family of receptors. The versatility of the approach offers hundreds of active compounds based on any template that target these receptors. These observations should allow for rapid optimization of pharmacological properties and because the appendages are in a position crucial to receptor stimulation, they proffer the possibility of conferring "biased" signaling and in turn minimizing side effects.
n class="Chemical">Peptides
derived from the processing of the polyn class="Chemical">peptide preproglucagon
define the axis of a synchronized, choreographed system of maintaining
glucose homeostasis and regulating other aspects of physiological
function.[1] In particular, three distinct
peptides, glucagon-like peptide-1 (GLP-1; referred to as GLP1 in this
paper), glucagon-like peptide-2 (GLP2), and glucagon are processed
from the precursor preproglucagon into mature functioning entities
and play distinct roles in glucose metabolism.[2] An additional peptide, produced by K-cells in the GI tract, glucose-dependent
insulinotropic polypeptide (GIP), complements this axis. GLP1 is produced
in enteroendocrine cells of the gut, and processed to the active form,
GLP1(7–36)NH2. Upon food ingestion, active GLP1
is secreted into the bloodstream and initiates multifaceted actions
via activation of its cognate GLP1 receptor (GLP1R) expressed in several
target tissues that synergistically work to restore normoglycemia.[3] GLP1R stimulation triggers cAMP production and
amplification of glucose-induced insulin secretion from pancreatic
β-cells resulting in lowered blood glucose levels. This action
also results in diminution of glucagon secretion and β-cell
proliferation. In parallel, thispeptide hormone delays gastric emptying
through vagal afferents and suppresses appetite via neurons in the
paraventricular nucleus of the hypothalamus.[4] Based in large part on the function of endogenous GLP1, physiologic
glycemia is markedly lower after an oral glucose load compared to
an equivalent parenteral load—this phenomenological observation
has been termed the “incretin” effect.[5] Because insulin secretion is glucose-dependent, this process
is self-regulated, and the risk of hypoglycemia remains low.[1]
The efficacy of n class="Gene">GLP1 as a treatment for
n class="Disease">type 2 diabetes (T2D) has
long been validated in human subjects, as infusion of exogenous GLP1
to diabeticpatients results in near-normalization of blood sugar
levels.[6] However, native GLP1 cannot be
used as a therapeutic, as it is rapidly inactivated by the ubiquitous
serine proteasedipeptidyl peptidase-4 (DPP4), which cleaves the N-terminal residues His7-Ala8 from the intact peptide (t1/2 ∼ 90 s) resulting in a shorter inactive
fragment, GLP1(9–36)NH2.[7] This observation has motivated development of GLP1 based T2D therapeutics
with enhanced protease resistance and delayed renal clearance. The
first GLP1 mimetic clinically approved was exenatide, an equipotent
reptilian homologue that shows 50% amino acid similarity with humanGLP1.[8] Success of glycemic control by exenatide
inspired the development of liraglutide, a lipidated form of GLP1.[9] Liraglutide forms a heptamer in solution[12] and also binds the abundant plasma protein albumin
resulting in delayed renal clearance.[13] These compounds require twice- and once-daily injections, respectively.
Further development has led to a derivative that is attached to the
Fc fragment of IgG4 (dulaglutide)[14] and
another that uses addition of a lipid to enhance albumin binding
along with substitution of aminoisobutyric acid (Aib) at position
two (semaglutide)[9,15] rendering these two compounds
functional in vivo a week after injection. The future
of this class of compounds has been further cemented by new guidance
from the American Diabetes Association (ADA) and the European Association
for the Study of Diabetes (EASD) which recommends GLP1R agonists as
the first injectable before insulin for T2D.[16] Small molecule inhibitors of DPP4 have also entered clinical use,[17] although these compounds do not result in weight
loss, and the long-term safety profile is still uncertain as there
are more than 30 known endogenous substrates of DPP4.[18] Recent efforts to overcome the liability of enzyme catalyzed
hydrolysis have seen widespread use of Aib in position 8, but a previous
compound (taspoglutide)[19] containing this
noncanonical amino acid was withdrawn from Phase III clinical trials
due to allergic reactions at the injection site.[20] Taspoglutide contains two Aib residues, at positions 8
and 35, to provide protection from proteases. Given that compounds
in the clinic (e.g., semaglutide) contain Aib at position 8, and have
good safety profiles, it is unclear whether the drug substance itself
or the pharmaceutical formulation is responsible for the adverse reaction
in the case of taspoglutide.[21]
Prior
attempts at making n class="Gene">GLP1 and related n class="Chemical">peptides more stable
to proteolysis have relied on a number of strategies. The use of an
α/β peptide scaffold that changes the backbone of the
construct,[22] incorporation of fluorinated
amino acids at strategic positions,[23] modification
of side chains with saccharides,[24] and
the use of aminoisobutyric acid (Aib) in position 8 (second from the N-terminus)[25] have been the most
fruitful. While these strategies provide proteolytic stability to
various degrees, previous attempts at modifying the sacrosanct N-terminal histidine residue, crucial to receptor activation,
have led to greatly compromised GLP1 function.[26]
We describe in tn class="Chemical">his report that n class="Chemical">N-alkyl modifications
of several peptide ligands (GLP1, GIP, glucagon, GLP2, and a “triagonist”
peptide) of the secretin family of receptors are refractory to DPP4
proteolysis while simultaneously maintaining full activity at respective
cognate receptors. Because the strategy is applicable to all developed
peptide ligand templates, modulation of activity and retaining stability
is possible with N-terminal alkylation. Given the
large number of low-molecular-weight aldehydes (estd ∼25,000)
commercially available and the ease of installation, the number of
derivatives for each peptide template to be useful clinically can
easily number in the hundreds providing a blueprint for pharmacokinetic
and pharmacodynamic optimization. In addition, because the N-terminus of the peptide ligand interacts deep in the membrane
embedded region of the receptor, so as to say in the “belly
of the beast”, it provides an avenue to select constructs that
may trigger “biased” signaling resulting in fewer side
effects.[11,27] We note that Aib residues commonly employed
to protect from DPP4 cleavage are used internally within the sequence,
and do not confer ability to interact with the hitherto undescribed
ligand binding pocket.
Results and Discussion
n class="Chemical">Peptides
are inherently susceptible to proteolytic cleavage, and
tn class="Chemical">his deficit has usually been addressed by introduction of unnatural
or noncanonical amino acids, by backbone or side chain modification,
or by addition of exogenous groups that provide serum stability by
binding to plasma components. In the case of GLP1, it has been challenging
to decouple enzyme mediated hydrolysis from receptor activation. The
ubiquitous serine proteaseDPP4 removes the N-terminal
dipeptideHis7Ala8 of GLP1 with a short half-life (t1/2 < 2 min), resulting in a truncated peptide, GLP1(9–37)NH2, with greatly diminished efficacy and potency. The active
site of DPP4 has an arrangement of functionality that positions two
glutamates for substrate recognition (Glu205 and Glu206), and mutating
either or both glutamate(s) to alanine(s) renders the enzyme incompetent
at catalysis (Figure d). The carboxylate groups from Glu205/Glu206 form electrostatic
interactions between the positively charged amino terminus of the
substrate and position the labile amide proximal to the catalytic
hydroxyl of Ser630 that is primed for nucleophilic attack. Previous
studies have focused on amino acid alterations of GLP1 to increase
stability toward DPP4, but this strategy has usually come at the expense
of GLP1R potency and/or efficacy. The loss in receptor activity underscores
the importance of His7 in GLP1. Therefore, instead of altering His7,
we first focused our attention on abolishing the N-terminal charge in GLP1 to prevent enzyme recognition.
Figure 1
(a) Cryo-EM
structure of the GLP1:GLP1R complex (PDB: 5VAI).[10] GLP1 (ribbon,
gold) bound to cognate G protein-coupled
receptor, GLP1R (ribbon and surface, gray), with solid gray lines
representing approximate locations of the cellular membrane that separate
the extracellular domain (ECD) from the transmembrane domain (TMD)
of GLP1R. (b) Illustration of receptor amino acids within 4 Å
of the N-terminal histidine of GLP1. Residue numbers
denote the Wootten nomenclature[11] for class
B GPCRs. To note, human GLP1R contains R3105.40, but this
PDB structure contains an alanine mutation at this position. The dashed
pink circle indicates approximate space where N-terminal
decorations may reside. (c) Flattened 2D rendering of the interactions
of GLP1R with His7 of GLP1. R299ECL2 forms two hydrogen
bonds with the N-terminal histidine (dashed line,
gray) and a putative cation−π interaction of the guanidine
group of R299ECL2 with the imidazole of His7 of GLP1 (dotted
line, maroon). Select neighboring side chains of the receptor are
shown at approximate positions relative to GLP1. (d) 2D depiction
of DPP4 active site, with a known inhibitor bound (valine–pyrrolidide,
blue, PDB: 1N1M). Important electrostatic interactions (dashed lines, gray) occur
between the primary amine of the substrate and Glu205 and Glu206 of
DPP4 (highlighted yellow). The carbonyl of the first amide bond is
anchored by a hydrogen bond to Asn710 (dashed line, gray). The catalytic
triad (Ser630, His710, and Asp708) forms a hydrogen bonding network
(dashed lines, gray) and is positioned proximal to the pyrrole ring.
If the structure bound were a peptide substrate, the labile amide
bond would be located close to the canonical nucleophile Ser630.
(a) Cryo-EM
structure of the n class="Gene">GLP1:n class="Gene">GLP1R complex (PDB: 5VAI).[10] GLP1 (ribbon,
gold) bound to cognate G protein-coupled
receptor, GLP1R (ribbon and surface, gray), with solid gray lines
representing approximate locations of the cellular membrane that separate
the extracellular domain (ECD) from the transmembrane domain (TMD)
of GLP1R. (b) Illustration of receptor amino acids within 4 Å
of the N-terminal histidine of GLP1. Residue numbers
denote the Wootten nomenclature[11] for class
B GPCRs. To note, humanGLP1R contains R3105.40, but this
PDB structure contains an alanine mutation at this position. The dashed
pink circle indicates approximate space where N-terminal
decorations may reside. (c) Flattened 2D rendering of the interactions
of GLP1R with His7 of GLP1. R299ECL2 forms two hydrogen
bonds with the N-terminal histidine (dashed line,
gray) and a putative cation−π interaction of the guanidine
group of R299ECL2 with the imidazole of His7 of GLP1 (dotted
line, maroon). Select neighboring side chains of the receptor are
shown at approximate positions relative to GLP1. (d) 2D depiction
of DPP4 active site, with a known inhibitor bound (valine–pyrrolidide,
blue, PDB: 1N1M). Important electrostatic interactions (dashed lines, gray) occur
between the primary amine of the substrate and Glu205 and Glu206 of
DPP4 (highlighted yellow). The carbonyl of the first amide bond is
anchored by a hydrogen bond to Asn710 (dashed line, gray). The catalytic
triad (Ser630, His710, and Asp708) forms a hydrogen bonding network
(dashed lines, gray) and is positioned proximal to the pyrrole ring.
If the structure bound were a peptide substrate, the labile amide
bond would be located close to the canonical nucleophile Ser630.
Acylation of the N-Terminus of GLP1 Provides
DPP4 Protection but Diminishes Receptor Activity
An obvious
stn class="Species">rategy to overcome n class="Gene">GLP1 degradation by DPP4 is removal of the N-terminal charge, crucial to substrate recognition, via
acylation, as in 1-GLP1 (Figure ). Prior reports have established that 1-GLP1 is resistant to DPP4 proteolysis; however, it results
in a 60- to 97-fold decrease[28,29] in potency at the receptor.
In our hands, 1-GLP1 stimulated GLP1R with an EC50 of 116 pM, a 58-fold loss in potency compared to native
GLP1 (EC50 = 2.5 pM; Table , Figure S2). This result
is consistent with other examples of N-acylation
of GLP1, for example, with pyroglutamyl[26] or hexanoyl[28] groups, that show vastly
reduced affinity and activation of GLP1R. All acyl groups suffer from
this fate, as the receptor has evolved to recognize substrates that
are processed via proteolytic cleavage of the precursor to reveal
the canonical peptide of correct length, which thus lacks an acyl
(amide) bond at the N-terminus. Processing in intestinal
L-cells of a larger polypeptide, preproglucagon, first results in
production of GLP1(1–37) which is further cleaved to active
fragments GLP1(7–37) or GLP1(7–36)NH2. Improperly
processed fragments such as GLP1(1–37), GLP1(8–37),
and GLP1(6–37) are unable to induce insulin secretion at normal
levels.[29] It is only once GLP1 is proteolyzed
to display the free N-terminal histidine in GLP1(7–37)
or GLP1(7–36)NH2 that native insulin release action
is observed.[30]
Table 1
Receptor
Activation of N-Modified Peptidesa
Peptideb
pEC50 ± SEMc
EC50 (pM)c
nd
GLP1R
GLP1
11.6 ± 0.1
2.5
3
1-GLP1e
10.0 ± 0.2
115.9
3
2-GLP1
11.5 ± 0.1
2.8
3
3-GLP1
11.2 ± 0.1
6.7
3
4-GLP1
11.4 ± 0.3
6.9
3
5-GLP1
11.4 ± 0.07
3.8
3
6-GLP1
12.0 ± 0.2
1.1
2
7-GLP1
11.3 ± 0.2
6.5
3
8-GLP1
10.8 ± 0.2
17.6
2
9-GLP1
11.7 ± 0.4
2.8
3
10-GLP1
11.4 ± 0.02
4.4
3
11-GLP1
11.6 ± 0.7
4.7
3
12-GLP1
11.6 ± 0.1
2.7
2
13-GLP1
11.3 ± 0.2
5.3
2
14-GLP1
10.6 ± 0.1
25.3
2
15-GLP1
11.4 ± 0.1
3.8
2
16-GLP1
11.0 ± 0.03
9.6
2
17-GLP1
10.4 ± 0.05
41.6
3
18-GLP1
11.1 ± 0.06
8.4
4
19-GLP1
7.5 ± 0.1
3.3 × 104
2
Liraglutide
11.6 ± 0.2
3.2
3
2-Liraglutide
11.4 ± 0.03
4.0
3
Exenatide
11.6 ± 0.02
2.6
3
2-Exenatide
11.6 ± 0.06
2.9
3
2-Triagonist(Ala2)
11.2
6.9
1
GIP
6.92
1.2 × 105
1
Glucagon
9.31
490
1
GIPR
GIP
12.2 ± 0.04
0.6
3
1-GIP
10.3 ± 0.2
54.6
2
2-GIP
12.6 ± 0.02
0.3
3
Glucagon
8.07
8.6 × 103
1
GLP1
8.30
5.0 × 103
1
2-Triagonist(Ala2)
11.5
3.3
1
GCGR
Glucagon
11.5 ± 0.04
3.4
5
2-Glucagon
11.6 ± 0.1
2.3
5
7-Glucagon
9.49 ± 0.1
325
2
GLP1
6.18
6.6 × 105
1
GIP
6.30
5.1 × 105
1
2-Triagonist(Ala2)
11.4
4.5
1
GLP2R
GLP2
10.7 ± 0.1
21
3
2-GLP2
11.2 ± 0.1
7
3
Potency of synthesized
peptides
using HEK293 cells expressing GLP1R, GIPR, GCGR, or GLP2R and luciferase
reporter system. Results are separated by target receptors.
See Figure for peptide structures.
EC50 is the concentration
of peptide required for half-maximal activity of the targeted receptor.
pEC50 = −log(EC50) ± standard error
of the mean (SEM) of independent experiments where applicable.
Number of independent experiments
that were run in triplicate or quadruplicate.
Peptides were incubated at 37 °C
overnight before stimulating transfected cells.
Potency of synthesized
n class="Chemical">peptides
using HEK293 cells expressing GLP1R, GIPR, GCGR, or GLP2R and luciferase
reporter system. Results are separated by target receptors.
See Figure for n class="Chemical">peptide structures.
Figure 2
Library of N-terminally modified peptides. (a)
Alignment of GLP1 and related peptides with positions and numbering
above each residue (gray). GLP1 starts with amino acid 7 based on
established convention.[3] Blue residues
are homologous to GLP1, and residues highlighted yellow are conserved
between all peptides. Liraglutide and triagonist contain a lysine
(, maroon) modified with a γ-glutamic
acid spacer and by palmitoylation (right). Semaglutide contains a
modified lysine (, orange) with
two oliogoethylene glycol (OEG) spacers, γ-glutamic acid, and
octadecanedioic acid (right). “X” denotes the noncanonical
amino acid, aminoisobutyric acid (Aib, bottom right). (b) Native amino
acid sequences are modified with N-terminal chemical
modifications 1–19 resulting in a
library of peptides, nominally “R-Peptide” where “R”
is the number referencing the N-terminus modification
and “Peptide” indicates the template sequence as in
(a). Semaglutide and triagonist peptides were also assembled with
Aib2Ala mutation denoted as R-semaglutide(Ala2) and R-triagonist(Ala2),
respectively.
EC50 is the concentn class="Species">ration
of peptide required for half-maximal activity of the targeted receptor.
pEC50 = −log(EC50) ± standard error
of the mean (SEM) of independent experiments where applicable.
Number of independent experiments
that were run in triplicate or quadruplicate.n class="Chemical">Peptides were incubated at 37 °C
overnight before stimulating transfected cells.
We show here that removal of charge
by alkylation (for example,
with a trifluoroethyl group) is a viable stn class="Species">rategy. Since tn class="Chemical">his modification
does not introduce an acyl (amide) bond at the N-terminus,
it is tolerated by the receptor. A second approach is to increase
steric bulk at the N-terminus so that the binding
of the free amine in the enzyme active site is compromised. Surprisingly,
the receptor is again accommodating of a large variety of such bulky
groups introduced as N-alkyl tethers.
We then
investigated alternative stn class="Species">rategies that are capable of
altering the pKa of the n class="Chemical">N-terminal amine. The cryo-EM structure of a GLP1-GLP1R complex (PDB: 5VAI)[10] provides a basis to inform which His7 modifications on
the ligand could be accommodated within the membrane embedded region
of the receptor (Figure a). Within the binding pocket of GLP1R, there are receptor side chains
that make key noncovalent interactions with His7 of GLP1 (Figures b, S1A). Notably, R299ECL2 is involved in a π–cation
interaction with the aromatic imidazole ring of His7, as well as W3065.36 and I3095.39 that form hydrogen bonds, but
the primary N-terminal amine of GLP1 in this structure
is devoid of contacts with the receptor (Figure c). The only residue within 4 Å of the
free amine is V2373.40, and this hydrophobic residue is
not proximal enough to GLP1 to contribute toward intermolecular interactions
between receptor and ligand. Although thisamine is not perceived
to be interacting with GLP1R, removal of the primary amine results
in a 15-fold loss in potency illustrating the importance of the amino
terminus.[31] Not only has the free positive
amino terminus been determined to be crucial, but the methyl imidazole
side chain of His7 has also been recalcitrant to modification efforts.
For example, substitution of His7 with Ala, Arg, Lys, or Tyr results
in losses in both potency (174- to 1280-fold) and efficacy (12–66%
of native peptide).[32] Further interrogation
with six analogues that differ in the number and position of nitrogens in the methyl imidazole side chain
revealed potency losses from 2.1-fold (H2.2-GLP1(8–36), Figure S1B) to 475-fold (H3.1-GLP1(8–36), Figure S1B) without a significant improvement
in DPP4 resistance. Further, a His7Tyr modification leads both to
decrease in receptor affinity (10-fold) and EC50 values
of stimulatory activity (13-fold).[28] These
observations serve as a foundation to make chemical modifications
at this location to prevent DPP4 proteolysis while simultaneously
retaining receptor efficacy and potency.
We hypothesized that
modification of the amino terminus via alkylation
through the agency of a trifluoroethyl group would afford an equipotent
and equiefficacious n class="Gene">GLP1 analogue that is impervious to n class="Gene">DPP4 catalyzed
inactivation. We envisioned that proteolytic stability would result
from the electron withdrawing character and steric bulk of the 2,2,2-trifluoroethyl
group. Alkylation of the α-amine of glycine with the trifluoroethyl
moiety reduces the pKa of the ammonium
species[33] from 9.6 to 5.3, and we envisioned
that such a modification would result in removal of charge and simultaneously
eliminate the acyl functionality at the N-terminus
that GLP1R recognizes as an improperly processed ligand.
N-Trifluoroethyl Modification of GLP1 and of
Related Receptor Agonists Result in Maintenance of Receptor Activity
We assessed the potency and efficacy of 2-n class="Gene">GLP1 (Figure ) at n class="Gene">GLP1R by transfecting HEK293 cells with three cDNA constructs
that encode for the target G protein-coupled receptor, GLP1R, CRE6-luciferase reporter, and β-galactosidase
to determine transfection efficiency. After 24 h, 2-GLP1 was titrated and the amount of luciferase produced after 4–6
h was used to calculate EC50 values, that were found to
be comparable to native GLP1 (Table ). This was a surprising and remarkable discovery given
that the free amino terminus of GLP1 has long been considered crucial
for receptor binding and activation.[28] Additionally, 2-GLP1 was entirely refractory to DPP4 catalyzed hydrolysis
(vide infra). To the best of our knowledge, this
is the first instance that an N-terminal modification
of GLP1 has resulted in an equipotent and equally efficacious GLP1R
agonist.
Library of N-terminally modified n class="Chemical">peptides. (a)
Alignment of n class="Gene">GLP1 and related peptides with positions and numbering
above each residue (gray). GLP1 starts with amino acid 7 based on
established convention.[3] Blue residues
are homologous to GLP1, and residues highlighted yellow are conserved
between all peptides. Liraglutide and triagonist contain a lysine
(, maroon) modified with a γ-glutamic
acid spacer and by palmitoylation (right). Semaglutide contains a
modified lysine (, orange) with
two oliogoethylene glycol (OEG) spacers, γ-glutamic acid, and
octadecanedioic acid (right). “X” denotes the noncanonical
amino acid, aminoisobutyric acid (Aib, bottom right). (b) Native amino
acid sequences are modified with N-terminal chemical
modifications 1–19 resulting in a
library of peptides, nominally “R-Peptide” where “R”
is the number referencing the N-terminus modification
and “Peptide” indicates the template sequence as in
(a). Semaglutide and triagonist peptides were also assembled with
Aib2Ala mutation denoted as R-semaglutide(Ala2) and R-triagonist(Ala2),
respectively.
n class="Gene">GLP1R agonists currently used
clinically (liraglutide, n class="Chemical">exenatide,
and semaglutide) operate by similar modes of binding and activation;
we therefore evaluated the effect of N-terminal alkylation
on these peptide ligands. For these GLP1R agonists, the potency and
efficacy toward the receptor were not altered. The potency of 2-exenatide (2.9 pM) was essentially unchanged from that of
exenatide (Figure , Table ). A similar
trend was seen for 2-liraglutide and 2-semaglutide(Ala2) (Figure , Table , Figure S3). The fact that this same N-terminal
modification is useful in multiple GLP1R agonists confirms the robustness
of this chemical grafting method and fueled our investigation to probe
for stability against DPP4 action.
LC-MS Analysis of DPP4
Stability for GLP1R Agonists
After having determined that n class="Chemical">N-trifluoroethyl histidine
modified n class="Gene">GLP1 is accommodated by GLP1R, we investigated if the change
in the N-terminal pKa influenced DPP4 proteolysis.[33] Native
GLP1 or 2-GLP1 was incubated with and without DPP4 at
37 °C and the reaction mixture analyzed by ESI LC-MS after 16
h. In the presence of DPP4, native GLP1 was cleaved resulting in a
distinct shift in retention time, from 22.2 to 23.3 min, with an accompanying
reduction in mass by 208.1 Da indicating removal of the His7Ala8dipeptide
from the N-terminus. The product of GLP1 incubated
with DPP4 eluted at the same time as the control GLP1(9–37)NH2, further confirming that the product resulted from dipeptide
cleavage (Figure a).
When 2-GLP1 was incubated with DPP4 overnight, no change
in retention time or mass was observed, illustrating that the N-trifluoroethyl alkylation of GLP1 prevented DPP4 hydrolysis
likely due to both decrease in pKa and
addition of steric bulk on the N-terminal amine.
This modification abolishes the interaction between the amino terminus
of 2-GLP1 and Glu205 and Glu206 in the DPP4 active site,
rendering it refractory to enzyme recognition and hydrolysis. We also
explored whether the modified peptides are inhibitors of the enzyme
or are just plainly not recognized as substrates. Steady-state kinetics
with a known inhibitor (linagliptin) and substrate (GlyPro-pNA, Figure S4) reveal that these compounds are neither
inhibitors (no reduction in Km) nor bind
to the enzyme (no change in Vmax). These
data suggest that either removal of charge or addition of bulk at
the N-terminus of GLP1 abrogates recognition by DPP4,
the frontline protease.
Figure 3
(a) LC-MS/MS total ion chromatogram depicting
the stability of GLP1 (maroon) and 2-GLP1 (pink) with (shaded)
and without (nonshaded) DPP4. GLP1 incubated with DPP4 shows the same
retention time as control GLP1(9–36) (gray) indicating excision
of dipeptide His7Ala8 to give cleaved, c, peptide. 2-GLP1 incubated with DPP4 exhibits no change in retention time or mass.
(b) LC-MS/MS total ion chromatogram depicting the stability of exenatide (purple) and 2-exenatide (light purple)
with (shaded) and without DPP4 (nonshaded). Exenatide incubated with
DPP4 results in a mixture of cleaved, c, and native (unreacted), n,
exenatide. The retention time of cleaved exenatide, c, is the same
as control exenatide(3–39) (gray). 2-Exenatide incubated with DPP4 is unreactive with no change in retention time
or mass. (c) LC-MS/MS total ion chromatogram depicting the stability
of liraglutide (navy) and 2-liraglutide (light
blue) with (shaded) and without DPP4 (nonshaded). Liraglutide incubated
with DPP4 results in a mixture of cleaved, c, and native (unreacted),
n, liraglutide. 2-Liraglutide incubated with DPP4 does
not undergo reaction with unchanged retention time and mass.
(a) LC-MS/MS total ion chromatogram depicting
the stability of n class="Gene">GLP1 (maroon) and 2-n class="Gene">GLP1 (pink) with (shaded)
and without (nonshaded) DPP4. GLP1 incubated with DPP4 shows the same
retention time as control GLP1(9–36) (gray) indicating excision
of dipeptideHis7Ala8 to give cleaved, c, peptide. 2-GLP1 incubated with DPP4 exhibits no change in retention time or mass.
(b) LC-MS/MS total ion chromatogram depicting the stability of exenatide (purple) and 2-exenatide (light purple)
with (shaded) and without DPP4 (nonshaded). Exenatide incubated with
DPP4 results in a mixture of cleaved, c, and native (unreacted), n,
exenatide. The retention time of cleaved exenatide, c, is the same
as control exenatide(3–39) (gray). 2-Exenatide incubated with DPP4 is unreactive with no change in retention time
or mass. (c) LC-MS/MS total ion chromatogram depicting the stability
of liraglutide (navy) and 2-liraglutide (light
blue) with (shaded) and without DPP4 (nonshaded). Liraglutide incubated
with DPP4 results in a mixture of cleaved, c, and native (unreacted),
n, liraglutide. 2-Liraglutide incubated with DPP4 does
not undergo reaction with unchanged retention time and mass.
n class="Chemical">Exenatide is more resistant to n class="Gene">DPP4 than GLP1;
however, it is still
susceptible to enzymatic degradation as judged by LC-MS analysis.
After overnight incubation with DPP4, exenatide was partially cleaved
resulting in a mixture containing an additional product (tR = 6.1 min). The elution profile and identified mass
(−194.1 g/mol) corresponded to the peptide fragment exenatide(3–39)
(Figure b). When 2-exenatide was subjected to DPP4 incubation, no cleavage
products were observed. Our results can be explained with the same
rationale as 2-GLP1, where recognition of the substrate
and cleavage are prevented by removal of charge and addition of steric
bulk at the N-terminus.
Liraglutide contains
the same N-terminal motif
as n class="Gene">GLP1, and in the absence of n class="Gene">serum albumin, it is susceptible to
proteolysis. When we incubated liraglutide with DPP4 overnight, we
saw MS signatures of two unique compounds by LC-MS (Figure c). The first compound eluted
at the same time (tR = 9.3 min) and had
the same MW as liraglutide. The second compound eluted later and had
a mass that was 208.1 Da lower than liraglutide, suggestive of DPP4
removal of the HisAla dipeptide. When the N-terminally
modified peptide2-liraglutide was incubated with DPP4
under similar conditions, no change in mass or retention time was
observed indicating the failure of DPP4 to process and inactivate
substrate.
N-Trifluoroethyl Modification
Is Accommodated
by Most Secretin Family Receptors
Three n class="Chemical">peptide ligands—n class="Gene">glucagon,
GIP, and GLP2—fall within the same secretin family as GLP1
and contain five conserved amino acids: Gly4, Thr6, Phe22, Trp31,
and Leu32 (Figure ). Each peptide elicits various physiological responses from increasing
blood glucose to stimulating intestinal growth. Although the first
three N-terminal amino acids are dissimilar, all
peptides are cleaved by DPP4. We sought to ascertain if fluoroalkylation
would be compatible with the cognate receptors for glucagon, GIP,
and GLP2secretinpeptides.
n class="Chemical">Glucose-dependent n class="Gene">insulinotrophic
polypeptide (GIP), like GLP1, is a gut hormone, and functionally complements
the latter in accounting for the incretin effect after food ingestion.[34] GIP works through a different class B receptor
(GIPR)[35,36] and has overlapping (amplification of insulin
release from pancreatic β-cells) as well as distinct (modulation
of fat cell metabolism) functions compared to GLP1.[7,5] In
the diabetic state, the function of GIP is greatly diminished,[37] possibly as a result of GIPR down-regulation.[38,39] However, there is now increasing evidence that this defect is reversible
once chronic hyperglycemia is alleviated,[40] e.g., after treatment with GLP1 analogues. There are some sequence
similarities between GLP1 and GIP, but the N-terminus
is occupied by a Tyr instead of His and this change provides specificity
at GIPR. We synthesized an α-amino-modified tyrosine to incorporate
into the full-length peptides yielding 2-GIP. Again,
utilizing a luciferase-based assay, we transfected HEK293 cells with
cDNAs encoding for the receptor of interest, GIPR, CRE6-luciferase reporter, and a β-galactosidase
control.[23] The day following transfection,
cells were agonized with native GIP or 2-GIP and production
of luciferase was quantified. 2-GIP showed similar potency
as native GIP at the cognate receptor, GIPR (EC50 = 0.6
pM and 0.3 pM, respectively, Table ). Similar to GLP1, this small molecular modification
resulted in a GIP analogue with complete proteolytic stability without
compromising receptor stimulatory activity (vide infra). The minimal change in potency and efficacy of 2-GIP cements the view that this molecular grafting method by alkylation
is suitable for use at multiple receptors of the secretin family.
n class="Gene">Glucagon originates from pancreatic islets and, in contrast to
the incretins, elevates glycemia by inducing n class="Disease">hepatic gluconeogenesis
and mobilization of glucose into blood. However, during balanced pharmacological
co-stimulation of GLP1R, GIPR, and glucagon receptors with low concentrations
of corresponding agonists, glucagon further amplifies the anorectic
effect of the incretins by enhancing energy expenditure via receptors
on adipocytes and in the brain, thereby further promoting a loss in
body weight.[1,41] The glycemic liability of glucagon
disappears in combination drug regimens with GLP1 and GIP mimetics
where the latter are dominant in reducing blood glucose. A recently
engineered GLP1/GIP/glucagon triple agonist was highly effective in
treating diabetes and obesity in mice.[42] Akin to GIP and GLP1, glucagon is also a substrate for DPP4 albeit
suffers hydrolysis at a slower rate (t1/2 = 5–6 min).[43,44] Glucagon possesses the same N-terminal HisXaa motif as GLP1 and the N-terminal histidine was previously found to be essential for receptor
activation.[45] We synthesized glucagon with
the N-trifluoroethyl modification, 2-glucagon, and tested its ability to bind and activate its cognate glucagon
receptor (GCGR). Consistent with our findings above, the addition
of the N-terminal modification did not influence
potency—2-glucagon was slightly more potent than
native glucagon, with an EC50 of 2.3 pM compared to 3.4
pM of native glucagon in cellular assays (Table ).
n class="Gene">Glucagon-like peptide-2 (n class="Gene">GLP2) is
an agonist of another secretin
family receptor, GLP2R. The 33-residue peptide is cosecreted with
GLP1 and is a mediator of mucosal proliferation and enhances the activity
of several absorptive enzymes resulting in optimal intestinal uptake
of nutrients.[46,47] We synthesized an N-terminally modified GLP2 analogue to determine if this trifluoroethyl
decoration would also be useful at its cognate receptor GLP2R. After
synthesizing 2-GLP2, cells overexpressing GLP2R were
treated with 2-GLP2 or native GLP2. We discovered that
the fluorinated derivative 2-GLP2 was more potent than
native GLP2 with EC50 of 7 pM compared to 21 pM of native
GLP2 and was completely refractory to DPP4 catalyzed proteolysis.
This was the most impressive improvement in potency of a secretinpeptide that we observed. Taken together, we have shown that multiple
peptides within the secretin family tolerate or are actually functionally
enhanced by the small N-trifluoroethyl modification.
Because variations in potency was observed for alkylated n class="Gene">secretin
n class="Chemical">peptides at their cognate receptors, we then modified a peptide known
to activate three of the secretin receptors (GLP1R, GIPR, and GCGR).
Previous work by Finan et al. noticed a similarity
in amino acid sequences between GLP1, GIP, and glucagon and generated
a single peptide constructed with a synergistic effect on body weight
reduction.[42] Thispeptide, named “triagonist”,
contains Aib at position two to prevent DPP4 cleavage. Our goal was
to eliminate Aib and utilize our novel N-terminal
modification to retain receptor stimulatory activity while simultaneously
preserving DPP4 resistance. We synthesized a N-trifluoroethyl
triagonist with alanine in position 2, 2-triagonist(Ala2). Fluoroalkylation of the triagonist resulted in potencies in the
single picomolar range, remarkably close to the respective native
peptide ligands at their cognate receptors (Table , Figure S5).
In order to exn class="Chemical">amine the scope and limitation of receptors that
are able to recognize ligands modified through tn class="Chemical">his strategy, we focused
on the δ-opioid receptor and its native ligand, MetEnk, which
can also be inactivated by DPP4.[48,49] In order to
test if the trifluoroethyl modification protects the native peptide
from hydrolysis and retain activity at the receptor, the activity
of 2-MetEnk was tested at the native δ-receptor.
This small modification abolished activity of the native peptide completely,
most likely due to the importance of the primary amine for receptor
activation (Figure S6) and thus circumscribing
the scope of such alterations to the secretin family of receptors
and their ligands.
N-Trifluoroethyl Alkylation
Confers Superior
DPP4 Stability
We have established that altering the pKa of the n class="Chemical">N-terminal amine of
n class="Gene">GLP1 contributes to increased DPP4 stability, as judged by LC-MS.
We then investigated how EC50 is modulated upon incubation
with DPP4, which provides a more sensitive analysis of the remaining
full-length peptide. We incubated 2-GLP1 and GLP1 with
DPP4 at 37 °C overnight. The peptides were then added directly
to HEK293 cells overexpressing GLP1R. The receptor was stimulated
for 4–6 h and production of luciferase was quantified and concentration
response curves with and without DPP4 incubation were plotted. When
GLP1 was incubated with DPP4 overnight, there was a shift in potency
(EC50) from 3 pM to 2.8 nM corresponding to a massive 800-fold
shift due to proteolysis (Figure a and Table ). A significant difference in potency was also observed after
incubation of liraglutide with DPP4 shifting an average of 33-fold
from an EC50 of 8.6 pM to 283 pM (Figure b, Table ). Only a minimal loss (∼1.7-fold) in potency
of exenatide was observed after DPP4 incubation from 3.3 to 5.6 pM
(Table ).
Figure 4
Representative
concentration–response curves of unmodified
peptides (GLP1, liraglutide, GIP, and glucagon; a–d) or N-trifluoroethyl analogues (2-GLP1, 2-liraglutide,
2-GIP, and 2-glucagon; e–h) incubated overnight
with DPP4 or vehicle prior to diluting into microtiter plates containing
HEK293 cells overly expressing receptors (GLP1R, GIPR, or GCGR) and
reporter CRE6-luciferase. Luciferase
production corresponds directly to activation of cognate GPCR via
a cAMP dependent pathway, normalized to 100% maximal activity, and
resultant fold-loss in EC50 upon DPP4 incubation is listed
when applicable. Error bars represent SEM for three independent experiments
(n = 3).
Table 2
Stability towards DPP4 Proteolysis
of N-Modified
Peptidesa
–DPP4
+DPP4
Peptideb
pEC50 ± SEMc
EC50 (pM)c
nd
pEC50 ±
SEMc
EC50 (pM)c
nd
Fold-shift (↓)e
GLP1R
GLP1
11.6 ± 0.2
3.5
5
8.83 ± 0.3
2.8 × 103
5
800.0
1-GLP1
9.98 ± 0.2
99.1
3
9.93 ± 0.2
106.4
3
1.1
2-GLP1
11.5 ± 0.1
3.5
3
11.3 ± 0.1
5.4
3
1.5
3-GLP1
11.4 ± 0.2
5.5
3
11.2 ± 0.1
6.4
3
1.2
4-GLP1
11.4 ± 0.5
7.4
2
11.0 ± 0.4
17.1
2
2.3
5-GLP1
11.3 ± 0.1
4.8
3
11.3 ± 0.1
5.0
3
1.0
6-GLP1
11.8
1.55
1
11.5
3.23
1
2.1
7-GLP1
11.5 ± 0.3
3.6
2
11.5 ± 0.3
3.1
2
0.9
8-GLP1
11.1
7.2
1
10.9
12.4
1
1.7
10-GLP1
11.4 ± 0.07
4.0
3
11.4 ± 0.08
4.6
3
1.2
12-GLP1
11.2 ± 0.5
10.8
2
10.9 ± 0.3
14.0
2
1.3
19-GLP1
7.53 ± 0.09
3.0 × 104
2
6.91 ± 0.1
1.3 × 105
2
4.3
Liraglutide
11.2 ± 0.2
8.6
4
9.59 ± 0.1
283
4
32.9
2-Liraglutide
11.0 ± 0.3
12.8
3
11.1 ± 0.05
8.7
3
0.7
Exenatide
11.5 ± 0.1
3.3
3
11.3 ± 0.05
5.6
3
1.7
2-Exenatide
11.4 ± 0.07
4.3
3
11.4 ± 0.02
4.4
3
1.0
GIPR
GIP
11.9 ± 0.08
1.4
3
9.77 ± 0.3
226.5
3
161.8
2-GIP
12.2 ± 0.04
0.6
3
12.1 ± 0.03
0.90
3
1.5
GCGR
Glucagon
11.3 ± 0.1
5.9
4
9.42 ± 0.4
870
4
147.5
2-Glucagon
11.4 ± 0.2
4.9
4
11.5 ± 0.2
3.7
4
0.8
GLP2R
GLP2
10.4 ± 0.06
39.3
4
8.40 ± 0.02
4.0 × 103
3
101.8
2-GLP2
11.1 ± 0.2
9.6
2
11.1 ± 0.05
9.0
2
0.9
Potency of synthesized peptides
using HEK293 cells expressing GLP1R, GIPR, GCGR, or GLP2R and luciferase
reporter system. Results are separated by target receptors. Peptides
were incubated at 37 °C overnight with and without DPP4 before
incubation of transfected cells.
See Figure for
structures of synthesized peptides.
EC50 is the concentration
of peptide required for half maximal activity of the targeted receptor.
pEC50 = −log(EC50) ± standard error
of the mean (SEM) of independent experiments where applicable.
Number of independent experiments
conducted in triplicate or quadruplicate.
Calculated by the ratio (EC50 with DPP4)/(EC50 without DPP4)
Representative
concentn class="Species">ration–response curves of unmodified
n class="Chemical">peptides (GLP1, liraglutide, GIP, and glucagon; a–d) or N-trifluoroethyl analogues (2-GLP1, 2-liraglutide,
2-GIP, and 2-glucagon; e–h) incubated overnight
with DPP4 or vehicle prior to diluting into microtiter plates containing
HEK293 cells overly expressing receptors (GLP1R, GIPR, or GCGR) and
reporter CRE6-luciferase. Luciferase
production corresponds directly to activation of cognate GPCR via
a cAMP dependent pathway, normalized to 100% maximal activity, and
resultant fold-loss in EC50 upon DPP4 incubation is listed
when applicable. Error bars represent SEM for three independent experiments
(n = 3).
Potency of synthesized n class="Chemical">peptides
using n class="CellLine">HEK293 cells expressing GLP1R, GIPR, GCGR, or GLP2R and luciferase
reporter system. Results are separated by target receptors. Peptides
were incubated at 37 °C overnight with and without DPP4 before
incubation of transfected cells.
See Figure for
structures of synthesized n class="Chemical">peptides.
EC50 is the concentn class="Species">ration
of peptide required for half maximal activity of the targeted receptor.
pEC50 = −log(EC50) ± standard error
of the mean (SEM) of independent experiments where applicable.
Number of independent experiments
conducted in triplicate or quadruplicate.Calculated by the n class="Species">ratio (EC50 with DPP4)/(EC50 without DPP4)
Because n class="Gene">DPP4 is known to cleave multiple substn class="Species">rates, we interrogated
the protease stability of secretinpeptides: GIP, glucagon, and GLP2.
Peptides were preincubated with DPP4, then titrated into a microtiter
plate containing HEK293 cells overexpressing the receptor of interest
(GIPR, GCGR, or GLP2R). Each peptide showed diminution in receptor
stimulatory activity after DPP4 incubation. The tabulated loss for
GIP was 162-fold, for glucagon 147-fold, and for GLP2 102-fold (Table , Figure c,d).
After determining
the loss in potency upon n class="Gene">DPP4 incubation, we
assessed if tn class="Chemical">his loss could be prevented by N-trifluoroethyl
alkylation. All fluorinated peptides 2-GLP1, 2-liraglutide, 2-exenatide, 2-GIP, 2-glucagon, and 2-GLP2 showed retention of potency and efficacy
upon preincubation with DPP4. An alternative to Aib protease protection
in semaglutide is made available by these results, by retaining alanine
at position 2 and alkylation of the N-terminus to
give 2-semaglutide(Ala2) (Figure S3b,c). This pointed to a new, vigorous approach to prevent
DPP4 proteolysis among various peptides of the secretin family without
compromising receptor stimulatory function.
N-Terminal
Modifications on GLP1 Extend beyond
−CH2CF3
In order to establish
the molecular parameters that best complement the receptor binding
pocket in the membrane embedded region that is assumed to be elastic, we varied the alkyl tethers for various physicochemical
attributes such as size (7-n class="Gene">GLP1, 8-n class="Gene">GLP1, 10-GLP1, and 19-GLP1), hydrophobicity (7-GLP1, 8-GLP1, and 10-GLP1), geometry
(7-GLP1 and 8-GLP1), charge (12-GLP1, 13-GLP1, 14-GLP1, 15-GLP1, and 16-GLP1), polarizability (2-GLP1, 6-GLP1, 8-GLP1, 9-GLP1, 10-GLP1, and 11-GLP1), stereochemistry (12-GLP1, 13-GLP1, 14-GLP1, 15-GLP1, and 16-GLP1), and electronegativity (2-GLP1, 3-GLP1, and 6-GLP1). A polar appendage
with multiple hydroxyl groups (mannitol, 19-GLP1) was
also tested.
A structure activity relationship (n class="Disease">SAR) approach
was employed to explore an assortment of N-terminal
decon class="Species">rations. The majority of N-alkyl groups were
installed by reductive amination directly on resin by reaction of
the α-amino group and the corresponding aldehyde.[50] After incubation of the peptide and aldehyde,
the resulting imine was reduced with NaBH4 to afford the
alkylated product creating a structurally diverse library of amine
decorations (Figure ).
The ethyl, propyl, and isobutyl modifications of n class="Gene">GLP1 introduce
small aliphatic groups on the N-terminus to afford 3-n class="Gene">GLP1, 4-GLP1, and 5-GLP1. These
modifications resulted in constructs that are close in potencies to
native GLP1 and were able to stimulate the receptor with full efficacy.
In addition, the largest group in 5-GLP1 resulted in
the most potent compound of the group suggesting that larger nonpolar
groups are able to penetrate the binding pocket and activate the receptor.
After observing the number of small aliphatic groups that n class="Gene">GLP1R
is able to accommodate, we investigated the size limit of these modifications.
Introduction of a larger perfluoroalkyl group as in 6-n class="Gene">GLP1 resulted in equipotent GLP1R activity (EC50 = 1.1 pM).
We placed a large hydrophobic methyl adamantyl group on the N-terminus of GLP1 (7-GLP1), and surprisingly,
this did not significantly diminish activation of GLP1R (∼3-fold
loss in potency). Extension with a flat biphenyl aromatic ring system 8-GLP1 resulted in a 9-fold change in EC50 to 17.6
pM. Based on these observations, one can conclude that large nonpolar
groups are compatible with GLP1R, but geometric attributes do play
a role in receptor activation. We estimate that the receptor is able
to accommodate aliphatic groups up to a volume of 130 Å3.
The observation that a greater reduction in potency with 8-n class="Gene">GLP1 containing a biphenyl group led us to remove the second
geometrically flat aromatic ring to yield a benzyl functionality (10-n class="Gene">GLP1). The potency recovered from a 9-fold loss to only
a 2-fold loss in potency compared to GLP1. The methyl imidazole containing 11-GLP1 was investigated to determine its effect and influence
on receptor binding and activation. A priori, the
expectation was an increase in potency because of the crucial role
the methyl imidazole side chain of His7 had for receptor binding and
activation; however, no increased potency was observed. The EC50 remained essentially unchanged compared to native GLP1 at
3 pM.
As the positively charged amino terminus of n class="Gene">GLP1 was previously
shown to be important for receptor binding and activation, we used
n class="Chemical">alanine aldehyde to yield construct 12-GLP1. This resulted
in essentially no change in potency at GLP1R (EC50 = 2.7
pM). More intriguingly, stereochemical differences play an important
role in receptor activation. We used reductive amination with the R or S stereoisomers of Garner’s
aldehyde to retain stereochemistry at the α-carbon and produce
stereoisomeric constructs equivalent to those that would be obtained
from the d-serine (13-GLP1) and l-serine
(14-GLP1) aldehydes after cleavage from peptide resin.[51] Remarkably, there was a 6.3-fold difference
in potencies, 4.0 pM and 25 pM, in favor of the S isomer. We also conducted a similar comparison with the aldehyde
of l-phenylalanine (15-GLP1) and d-phenylalanine
(16-GLP1). A difference of 3-fold was observed between
the two stereoisomers, with the S chirality favored
once again. This highlights the fact that stereochemical differences
in the ligand can be exploited to fine-tune receptor activation and
that the binding pocket has chirality that is responsive to changes
in the three-dimensional disposition of functionality.
In order
to gaina full understanding of the dynamics of tn class="Chemical">his pocket,
we synthesized two n class="Chemical">diazirine moieties to generate a probe capable
of cross-linking with the receptor. We synthesized the commonly used
trifluoromethylphenyl diazirine 17-GLP1 and assessed
the potency at the receptor. We observed a modestly diminished potency
(EC50 = 42 pM) which fell in line with the biphenyl compound 8-GLP1, suggesting that the large geometrically flat phenyl
ring cannot bear too much steric bulk and still fit within the receptor
pocket. This prompted the synthesis of a smaller alkyl diazirine, 18-GLP1, resulting in a construct with significantly improved
potency of 8.4 pM.
Finally, we interrogated whether the receptor
is tolerant of polar
uncharged functionality. We tested the n class="Chemical">mannitol derivative, 19-n class="Gene">GLP1, for potency and efficacy. With five hydroxyl groups
positioned in the pocket, the potency of this construct was massively
diminished with an EC50 of 33 nM (13,200-fold loss) while
efficacy was fully retained. This suggests that 19-GLP1 is able to bind the receptor, but the appendage is unable to penetrate
the receptor binding pocket in any meaningful way.
Overall,
these n class="Chemical">peptide constructs provide a framework for understanding
the types of modification that are topologically compatible with the
n class="Gene">GLP1R binding pocket, the volume that is available, the functional
groups that are complementary, and the three-dimensional chiral space
that the binding region presents to ligands. These observations should
help guide design of a diverse library of high potency, full efficacy
peptides that are also resistant to DPP4 catalyzed hydrolysis (Table ). It is likely that
subtle changes in the interaction of these ligands with the GLP1R
binding pocket will result in peptides that trigger biased signaling.
Stability Conferred by Fluoroalkylation Extends to Protease
Family Related to DPP4
n class="Gene">DPP4 belongs to a family of homologous
n class="Gene">serine proteases that are active against the same N-terminal motif and cleave GLP1 resulting in the inactive fragment
GLP1(9–37)NH2.[52] We interrogated
if trifluoroethylation of GLP1 increases stability toward enzymes
related to DPP4 (members of the S9B prolyl oligopeptidase subfamily
that have similar arrangement of active sites and recognition of substrates
by two glutamate residues), namely, dipeptidyl peptidase-9 (DPP9)
and fibroblast activation protein α (FAP). Peptidases DPP4,
DPP9, and FAP all cleave similar substrates: GLP1, GLP2, neuropeptide
Y (NPY), and peptide YY (PYY).[53] Prior
reports have documented the ability of DPP9 in cleaving GLP1 efficiently
with a half-life of 6 min[54]in
vitro as compared FAP, which gives GLP1 a t1/2 of 22 h.[53] After overnight
incubation of GLP1 or 2-GLP1peptides with vehicle, FAP,
or DPP9, samples were titrated into a microtiter plate containing
the luciferase reporter HEK293 cells overexpressing GLP1R. After the
receptors were stimulated for 4–6 h, the production of luciferase
was quantified as an index of GLP1R activation. GLP1 was cleaved by
FAP and DPP9 with DPP9 cleaving more peptide due to the shorter half-life
followed by FAP (Figure S7). This was not
the case for 2-GLP1, where we did not observe any change
in potency. This result indicates that 2-GLP1 is refractory
to DPP9 and FAP action and expands the protective umbrella through
this modification to other enzymes within the protease family.
In Vivo Glucose Tolerance Test
Blood
n class="Chemical">glucose levels are regulated by the ability of n class="Gene">GLP1 to promote insulin
secretion, and we explored if our compounds achieve the same. We compared
the ability of select peptides (2-GLP1, 7-GLP1, liraglutide, and 2-liraglutide) and GLP1
to stimulate insulin secretion and normalize glucose levels by performing in vivo glucose tolerance tests (Figure ). Before peptide administration, the blood
glucose of C57BL/6J mice was measured establishing an average fasting
glucose level of 71 mg/dL. Mice were injected peritoneally with vehicle, GLP1, 2-GLP1, 7-GLP1, liraglutide, or 2-liraglutide at doses of 1 or 0.1 mg/kg. After
1 h, mice were administered an oral glucose bolus and mice administered
vehicle showed a dramatic spike in blood glucose levels after 30 min
(Figure a). All mice
administered GLP1R agonists showed a less pronounced increase in blood
glucose, which returned to basal glucose levels at about 120 min post
glucose bolus. The area under the curve (AUC) between 0 and 120 min
was significantly different for all GLP1R agonists compared to the
vehicle (Figure b).
To determine circulation longevity of each peptide, we administered
a second bolus 5 h after the first glucose challenge. Lipidated analogues,
liraglutide and 2-liraglutide, initially dosed at 1 mg/kg
remained in circulation and prevented blood glucose level excursion
post 30 min after the second glucose challenge (Figure c). No significant difference between GLP1, 2-GLP1, 7-GLP1, and vehicle
was observed and 2-liraglutide performed just as well
as liraglutide at both doses. This demonstrates the utility
of N-alkylated peptides that are also lipidated to
avoid renal clearance, in maintaining blood glucose levels to the
same extent as compounds currently in clinical use. These findings
also underscore the initial expectation that these minimal modifications
do not affect in vivo biological function.
Figure 5
N-Trifluoroethyl alkylation and lipidation of 2-liraglutide performs as well as liraglutide at lowering
blood sugar levels in vivo. (a) Measured glucose
levels by tail vein prick for an oral glucose tolerance test (OGTT)
of fasted mice treated intraperitoneally (i.p., dotted line) with
vehicle, GLP1, 2-GLP1, 7-GLP1, liraglutide,
or 2-liraglutide at (1 mg/kg or 0.1 mg/kg as noted).
Glucose bolus was administered orally at time 0 and 240 min (gray,
upward arrow). (b) Average area under the curve (AUC) calculated from
0 to 120 min in part a. (c) Glucose levels 30 min past second glucose
challenge that occurred 5 h after first OGTT. Error represents the
average ± SEM (n = 5). P-values
compared to vehicle: **P < 0.01; ***P < 0.001; ****P < 0.0001.
n class="Chemical">N-Trifluoroethyl alkylation and n class="Chemical">lipidation of 2-liraglutide performs as well as liraglutide at lowering
blood sugar levels in vivo. (a) Measured glucose
levels by tail vein prick for an oral glucose tolerance test (OGTT)
of fasted mice treated intraperitoneally (i.p., dotted line) with
vehicle, GLP1, 2-GLP1, 7-GLP1, liraglutide,
or 2-liraglutide at (1 mg/kg or 0.1 mg/kg as noted).
Glucose bolus was administered orally at time 0 and 240 min (gray,
upward arrow). (b) Average area under the curve (AUC) calculated from
0 to 120 min in part a. (c) Glucose levels 30 min past second glucose
challenge that occurred 5 h after first OGTT. Error represents the
average ± SEM (n = 5). P-values
compared to vehicle: **P < 0.01; ***P < 0.001; ****P < 0.0001.
Serum Stability
Although Liraglutide has been shown
to have increased serum stability, it is still metabolically processed
by n class="Gene">DPP4 (Figures c
and 4b). Compound n class="Chemical">2-liraglutide, which is only minimally modified from liraglutide with the sole
addition of the N-terminal alkyl tether, but otherwise
has the same sequence, linker, and lipid side chain, has a longer
systemic half-life that is similar to semaglutide, but without the
use of Aib at position 2.[25] Compounds were
administered to rats via oral gavage (5 nmol/kg), and blood was collected
sublingually to quantify the amount of peptide that remained in circulation
via bioassay. The observed half-lives of liraglutide and 2-liraglutide were 3 and 5.5 h (Figure S8). These results
indicate that the designed compounds survive in the bloodstream intact
without damage by other metabolic processes and are active at the
cognate receptor. The protracted lifetime is similar to semaglutide,
the leading peptide-based compound in clinical use.[25]
Conclusion
Decades of research on
the designand modification of incretins
has yielded functional templates that have resulted in several therapeutics
for n class="Disease">type 2 diabetes and related comorbidities.[21] The N-terminus of n class="Gene">GLP1 has previously
been assumed to be intolerant of modifications as it resides in a
peptide domain crucial for intermolecular interactions with the receptor.
This is mainly because the range of ligand modifications hitherto
have relied primarily on acyl functionalities that the receptor has
evolved to recognize as improperly processed fragments. We demonstrate
in this study that not only is the N-terminal amenable
to chemical modification but it is also tolerant to a wide range of
functionality if the group is attached by alkylation, whereas acylation
is deleterious to activity. The compounds described in this study
are as efficacious and long-lived as the leading compounds in the
clinic today. In addition, the strategy is generalizable to all templates
for a range of receptors. Because the number of active compounds that
can be generated for each template is in the hundreds, optimization
of pharmacological properties (e.g., ligand bias and oral bioavailability)
should be possible.
The use of native n class="Gene">GLP1 as a therapeutic
is hindered by its extreme
lability, in that it is very short-lived with a half-life in vivo of less than 2 min, as it is degraded by the ubiquitous
n class="Gene">serine protease DPP4. We demonstrate in this study that the simultaneous
goal of rendering substrates refractory to enzyme catalyzed hydrolysis
and maintaining activity at the receptor is possible. We conducted
activity assays (cAMP production) as well as LC ESI-MS to establish
the resistance of the peptide constructs to DPP4. Through post-incubation
assays, we show that the designed peptides have protracted lifetimes in vivo and are also able to restore normoglycemia over
a prolonged period of time. We further show that the strategy is not
only applicable to GLP1 but also to leading compounds currently in
clinical use including liraglutide, exenatide, and semaglutide.
We extended the scope of such N-terminal modifications
to other receptors within the class B family, namely, n class="Gene">GCGR, n class="Gene">GIPR,
and GLP2R. These related receptors are also therapeutically relevant
and provide avenues for design and development to a range of clinically
useful peptide constructs. Collectively, this strategy is broadly
applicable for modification of ligands which activate receptors that
define the axis of glucose metabolism.
The results presented
here open up an array of opportunities that
have traditionally not been accessible. Class B G protein-coupled
receptors (GPCRs) which encompass n class="Gene">GLP1R, n class="Gene">GIPR, GCGR, and GLP2R all
interact with intracellular partner proteins upon activation by ligand.
Several G proteins such as Gαs, Gαi, Gαq, Gαo, and arrestins (β-arrestin-1,
β-arrestin-2) can serve in the effector role.[11,55,56] These partner/effector G proteins activate
one or more signaling pathways such as mobilization of intracellular
calcium and ERK1/2 phosphorylation (Gαs, Gαq, and Gαi/o or production of cAMP and insulin
secretion (Gαs).[57,58] β-Arrestins
control receptor internalization and/or mediate G protein independent
signaling that lead to cell proliferation or apoptosis in part through
activation of MAP kinases such as ERK1/2.[59] It is possible that binding of certain ligands to the receptor can
selectively engage particular effectors over others resulting in “biased”
signaling favoring one channel of activity over another. This has
led to the idea that selective modulation of certain channels may
result in ligands (therapeutics) that have fewer side effects sometimes
accompanying receptor stimulation.[55] The
molecular grafting method described in this study modifies the ligands
at the crucial N-terminal residues and introduces
new moieties in the receptor pocket interacting with a region of the
receptor that is buried deep in the membrane. We believe this provides
an avenue to achieve such signaling bias and studies along these lines
are ongoing in our laboratories.
Authors: Marlies V Hager; Lisa M Johnson; Denise Wootten; Patrick M Sexton; Samuel H Gellman Journal: J Am Chem Soc Date: 2016-11-04 Impact factor: 15.419
Authors: L B Knudsen; P F Nielsen; P O Huusfeldt; N L Johansen; K Madsen; F Z Pedersen; H Thøgersen; M Wilken; H Agersø Journal: J Med Chem Date: 2000-05-04 Impact factor: 7.446
Authors: Christoph Parthier; Martin Kleinschmidt; Piotr Neumann; Rainer Rudolph; Susanne Manhart; Dagmar Schlenzig; Jörg Fanghänel; Jens-Ulrich Rahfeld; Hans-Ulrich Demuth; Milton T Stubbs Journal: Proc Natl Acad Sci U S A Date: 2007-08-21 Impact factor: 11.205
Authors: Brian Finan; Bin Yang; Nickki Ottaway; David L Smiley; Tao Ma; Christoffer Clemmensen; Joe Chabenne; Lianshan Zhang; Kirk M Habegger; Katrin Fischer; Jonathan E Campbell; Darleen Sandoval; Randy J Seeley; Konrad Bleicher; Sabine Uhles; William Riboulet; Jürgen Funk; Cornelia Hertel; Sara Belli; Elena Sebokova; Karin Conde-Knape; Anish Konkar; Daniel J Drucker; Vasily Gelfanov; Paul T Pfluger; Timo D Müller; Diego Perez-Tilve; Richard D DiMarchi; Matthias H Tschöp Journal: Nat Med Date: 2014-12-08 Impact factor: 53.440
Authors: Jais R Bjelke; Jesper Christensen; Per F Nielsen; Sven Branner; Anders B Kanstrup; Nicolai Wagtmann; Hanne B Rasmussen Journal: Biochem J Date: 2006-06-01 Impact factor: 3.857
Authors: Jakob Leppkes; Nicole Dimos; Bernhard Loll; Thomas Hohmann; Michael Dyrks; Ariane Wieseke; Bettina G Keller; Beate Koksch Journal: RSC Chem Biol Date: 2022-05-19