Literature DB >> 33732907

Structural properties and peptide ligand binding of the capsid homology domains of human Arc.

Erik I Hallin1, Clive R Bramham1, Petri Kursula1,2,3.   

Abstract

The activity-regulated cytoskeleton-associated protein (Arc) is important for synaptic plasticity and the normal function of the brain. Arc interacts with neuronal postsynaptic proteins, but the mechanistic details of its function have not been fully established. The C-terminal domain of Arc consists of tandem domains, termed the N- and C-lobe. The N-lobe harbours a peptide binding site, able to bind multiple targets. By measuring the affinity of human Arc towards various peptides from stargazin and guanylate kinase-associated protein (GKAP), we have refined its specificity determinants. We found two sites in the GKAP repeat region that bind to Arc and confirmed these interactions by X-ray crystallography. Phosphorylation of the stargazin peptide did not affect binding affinity but caused changes in thermodynamic parameters. Comparison of the crystal structures of three high-resolution human Arc-peptide complexes identifies three conserved C-H…π interactions at the binding cavity, explaining the sequence specificity of short linear motif binding by Arc. We further characterise central residues of the Arc lobe fold, show the effects of peptide binding on protein dynamics, and identify acyl carrier proteins as structures similar to the Arc lobes. We hypothesise that Arc may affect protein-protein interactions and phase separation at the postsynaptic density, affecting protein turnover and re-modelling of the synapse. The present data on Arc structure and ligand binding will help in further deciphering these processes.
© 2021 The Authors.

Entities:  

Keywords:  Capsid homology; Crystal structure; Peptide binding; Postsynaptic density; Protein interaction; Specificity

Year:  2021        PMID: 33732907      PMCID: PMC7941041          DOI: 10.1016/j.bbrep.2021.100975

Source DB:  PubMed          Journal:  Biochem Biophys Rep        ISSN: 2405-5808


Introduction

Normal brain function depends on signal transduction across synapses, and the ability to modify the strength of synaptic transmission is key to learning and memory. Synaptic transmission and plasticity involve many proteins located on both the pre- and postsynaptic side of the synapse. A central protein in long-term synaptic plasticity is the activity-regulated cytoskeleton-associated protein (Arc) [1,2]. Arc is an immediate early gene expressed in excitatory, glutamatergic neurons following synaptic activation. Arc mRNA transports into dendrites, where it undergoes local translation near synapses. In neuronal activity-dependent synaptic plasticity, such as long-term potentiation (LTP) and long-term depression (LTD), Arc protein is rapidly expressed and degraded, indicating a transient and dynamic mode of action [1,[3], [4], [5], [6]]. Transient expression of Arc is also required for memory consolidation and postnatal developmental plasticity of the visual cortex [[7], [8], [9], [10]]. Arc is a flexible hub, interacting with many proteins located in the postsynaptic density (PSD) and the nucleus [11]. The abundance of AMPA-type glutamate receptors (AMPARs) at the synapse is a major determinant of synaptic transmission efficiency. Expression of Arc promotes endocytosis of AMPARs, by recruitment of the machinery for clathrin-mediated endocytosis, resulting in synaptic depression [[12], [13], [14]]. Arc was found to bind stargazin (Stg), an auxiliary subunit of AMPAR important for receptor trafficking [[15], [16], [17]]. Intriguingly, Arc is able to form capsid-like structures and transfer nucleic acids between neurons [[18], [19], [20], [21]]. The C-terminal domain of Arc (Arc-CT) consists of two structural repeats called the N-lobe (Arc-NL) and the C-lobe (Arc-CL). These domains are structural homologs to retroviral capsid proteins, but the Arc-NL domain differs by having a peptide binding site, binding to stargazin and other peptides [16,22]. The peptide binding site of the Arc-NL allows an extended peptide to bind across a hydrophobic pocket. The N-terminal tail of Arc-NL is located on top of the peptide in a parallel manner, and together with the peptide binding site forms a small β sheet with the ligand peptide in the middle. Accordingly, Arc-NL mostly interacts with the bound peptide backbone [23]. Based on alignments of Arc ligand peptides, the PxF/Y/H binding motif was proposed [16]. The histidine of the motif became questionable, when we showed that the GluN2A peptide, having PxH and suggested to bind to the Arc peptide binding pocket [16], failed to bind to hArc-NL [23]. The links between Arc ligand peptide sequence and its backbone conformation have not been studied. Further characterisation of the specificity determinants of the Arc-NL peptide binding site would be of high value for finding novel Arc binding partners. The PSD is a protein-rich assembly below the postsynaptic membrane, formed of large scaffolding proteins. These proteins carry a combination of protein interaction domains, which may interact with several alternative partners; the structure of the protein assembly can be regulated in an activity-dependent manner. The main PSD scaffolds include membrane-associated guanylate kinases (such as PSD95), Shanks, calmodulin-dependent protein kinase II, Homer, synaptic Ras GTPase-activating protein 1, and guanylate kinase-associated protein (GKAP). Scaffolding proteins of the PSD are targets for mutations in neurological disorders [24]. GKAP is a structural protein of the PSD, linking PSD95 to Shank, which accumulates during synaptic inactivity [25]. The binding site of GKAP for Shank is located at the C terminus of GKAP, interacting with the Shank PDZ domain [[26], [27], [28]]. GKAP has five sequence repeats located in its middle region that interacts with PSD95 [29,30], and one repeat overlaps with the previously suggested Arc binding site [16,23]. PSD95 is required for targeting of Arc to the PSD, where it is found in multiple PSD95 interaction complexes [31]. It is, therefore, possible that Arc expression contributes to activity-dependent remodelling of the PSD. To gain insight to human Arc signalling, we sought to characterise the structural determinants of ligand binding to the hArc-NL. By measuring the affinity between hArc-NL and various peptides, we show that the peptide binding site is sequence-specific. Mutation of the proline or the tyrosine of the PxF/Y motif resulted in loss of binding. We found two Arc interaction sites on GKAP, suggesting that Arc may bind to the repeat region of GKAP. By X-ray crystallography, we confirm the peptide binding by hArc-NL and provide the structure of hArc-CL, which does not have the peptide binding ability found in hArc-NL. Structure-based analyses give further insights into the structure and conserved features of hArc.

Material and methods

Recombinant protein production

The expression and purification of hArc NL and CL were performed as described [23]. In short, hArc-NL (residues 207–277), hArc-CL (residues 278–370), and hArc-CT (residues 206–396, containing both lobes) were expressed with a cleavable His-MBP tag in E. coli BL21, with isopropyl β-d-1-thiogalactopyranoside induction at +30 °C. The cells were lysed by sonication and the proteins purified by Ni-NTA affinity chromatography and size-exclusion chromatography. The affinity tag was removed with recombinant TEV protease during purification. After cleavage, the N terminus contained the 4-residue sequence GAMG prior to the Arc sequence. This sequence was predicted to not affect peptide binding to hArc-NL based on earlier structural data [16]; this was confirmed by crystal structures presented in this paper, as the visible part of hArc-NL, interacting with the peptides, starts at Thr211.

Peptide synthesis

Peptides were synthesised by GenScript (Piscataway NJ, USA), carrying N-terminal acetylation and C-terminal amidation. The sequences and nomenclature of the synthetic peptides are given in Table 1.
Table 1

Synthetic peptides used in the experiments.

PeptideSequence
pStg(wt)RIPSYRYR
pStg(P3A)RIASYRYR
pStg(Y5L)RIPSLRYR
pStg(rev)RYRYSPIR
pStg(phos)RIP-pSer-YRYR
pStg(long)RIPSYRYRYQRRSRSSSRSTE
Poly-ProPPPPPPPP
GKAP-R1EIPCRRMR
GKAP-R2PSPKVAAR
GKAP-R3HSPKLQIR
GKAP-R4TSPKFRSR
GKAP-R5MPGCFRMR
GKAP-R4(long)TSPKFRSRNESYMRAMST
Synthetic peptides used in the experiments.

Isothermal titration calorimetry

The affinity of peptides towards hArc lobe domains was measured using a MicroCal iTC200 (Malvern, UK) instrument. The peptide was injected into the cell containing the purified protein; the peptides were 2.5–5.0 mM and the protein 0.25–0.5 mM. The buffer used was 20 mM Tris-HCl, pH 7.4, with 150 mM NaCl, and the measurements were performed at +25 °C. The data were analysed using MicroCal Origin 7, using a one-site binding model.

Protein crystallisation

Crystals of hArc lobes were obtained by vapour diffusion. The proteins and peptides were in a buffer consisting of 20 mM of Tris-HCl and 150 mM of NaCl. The crystal of hArc-NL with pStg(wt) was made by mixing hArc-NL (20 mg/ml) with a 3-fold molar excess of pStg(wt). Hanging drops were set up by mixing 1 μl of the protein/peptide solution with 1 μl of the reservoir, consisting of 1.4 M trisodium citrate, and incubated at +4 °C. The crystals were soaked in well solution containing 20% glycerol as cryo-protectant for a few seconds before flash-cooling in liquid nitrogen. The crystal of hArc-NL with GKAP-R4 was made by mixing hArc-NL (20 mg/ml) with a 3-fold molar excess of the peptide. Hanging drops were made by mixing 2 μl of the protein/peptide mixture with 2 μl of the reservoir, consisting of 1.2 M of trisodium citrate, and incubated at +20 °C. A solution of 1.2 M trisodium citrate with 20% glycerol was used as cryo-protectant for these crystals. The crystal of hArc-NL with GKAP-R5 was made by mixing hArc-NL (20 mg/ml) with a 2-fold molar excess of peptide. Hanging drops were made by mixing 2 μl of the protein/peptide mixture with the reservoir, consisting of 1.2 M of trisodium citrate, and incubated at +4 °C. Crystals were picked and flash-frozen in liquid nitrogen. Crystals of hArc-CL were obtained by sitting-drop vapour diffusion. The drops were made by mixing 200 nl of the protein (15 mg/ml) with 100 nl of a reservoir, consisting of 16% PEG 8000, 40 mM monobasic potassium phosphate, and 20% glycerol. The drops were incubated at +20 °C, and crystals were picked and flash-frozen in liquid nitrogen.

X-ray diffraction data collection and structure refinement

Initial diffraction data for hArc-NL with pStg(wt) were collected on the I03 beamline at Diamond Light Source (Oxfordshire, UK). Data for hArc-NL with pStg(wt), GKAP-R4, and GKAP-R5 were collected on the P13 beamline [32] at EMBL/DESY (Hamburg, Germany). Data for hArc-CL were collected on the P14 beamline at EMBL/DESY (Hamburg, Germany). All data were collected at 100 K and processed using XDS [33]. Phasing was done by molecular replacement, using PDB entry 4X3H [16] for hArc-NL and PDB entry 4X3X [16] for hArc-CL as search models in Phaser [34]. The structures were refined using phenix.refine [35] and model building was done in Coot [36]. Structure validation was done using MolProbity [37].

Structure analyses

Network analyses on the protein structures were carried out on the NAPS server [38,39]. Structure similarity searches were done using Salami [40]. The pStg(phos) peptide was modelled into hArc-NL manually in Coot, based on the crystal structure with bound pStg(wt), and energy minimisation was carried out in YASARA [41]. Molecular dynamics (MD) simulations for hArc-NL without peptide, and with either pStg(wt) or pStg(phos) bound, were run with YASARA, essentially as described [42]. The AMBER14 force field [43] with the TIP3P model for explicit solvent was employed, while pressure and temperature were controlled with the YASARA densostat [41]. To avoid the problems known for the classic Berendsen thermostat and barostat, and to make sure that a proper ensemble is created also for small systems, YASARA uses weak coupling to the time average temperature and density (‘densostat’) instead of the instantaneous values. The systems were built in a dodecahedral cell, with 0.15 M NaCl providing physiological ionic strength. After energy minimisation, MD simulations were run for at least 280 ns at +25 °C, and trajectory analysis was carried out in YASARA. The first 50 ns of each simulation were taken as the equilibration period, based on lack of major fluctuations in Rg or RMSD values before that point.

Results and discussion

We set out to study the structure of the two lobes of human Arc-CT, and to better understand the binding of short linear peptide motifs to hARc-NL. We further wanted to observe possible peptide interactions of hArc-CL and the effects of Stg phosphorylation on the interaction with hArc-NL. Using a combination of ITC and X-ray crystallography, we provide an improved understanding of hArc structure and refine the linear peptide motif that interacts directly with the binding site on hArc-NL.

Peptide ligand binding by the human Arc lobe domains

To study the specificity of peptide binding by hArc-NL, we measured the affinity of a set of mutated versions of a known peptide ligand from Stg. The Stg peptide has shown the highest affinity to hArc-NL in our earlier studies [23] and has the suggested PxF/Y motif [16]. For clarity, we number the binding motif such that the conserved aromatic residue is P(0). By mutating the consensus sequence, affinity to hArc-NL is lost (Fig. 1A, Table 2). As controls, hArc-NL did not bind to a poly-proline peptide nor to the reverse sequence of Stg. The modifications P(-2)A and Y(0)L also resulted in loss of binding, showing that the binding site is specific and that the proline and tyrosine are important for binding to hArc-NL. The binding of pStg(wt) to hArc-NL was confirmed by X-ray crystallography of the complex (see below), and the binding mode is similar to the earlier lower-resolution structure of the same complex of rat Arc-NL [16]. Crystallisation screening of hArc-NL with the mutated peptides was unsuccessful, which is an additional indication that hArc-NL does not bind these peptides, since Arc-NL fails to crystallise without a bound peptide.
Fig. 1

Calorimetric analysis of hArc ligand peptide binding.

A. Titration of hArc-NL with pStg variants.

B. Titration of hArc-NL with GKAP repeat peptides.

C. Electron density for structure 4x3i [16], which clearly has electron density for Pro instead of Arg in the ligand peptide.

D. Comparison of hArc-NL and hArc-CT binding to Stg.

E. Comparison of hArc-NL and hArc-CL.

F. Comparison of short and long GKAP-R4 peptide binding to hArc-NL and hArc-CT.

Table 2

ITC analysis of Arc-peptide interactions. Only combinations of Arc domain and peptide with detectable binding in ITC are shown (see Fig. 1 for all binding curves). The hArc-CL construct did not bind to any peptide, and repeats 1–3 of GKAP did not show detectable binding to any construct in the experiments.

ProteinPeptideKd (μM)ΔH (kcal/mol)-TΔS (kcal/mol/°)
hArc-NLpStg(wt)33.6 ± 1.1−13.1 ± 0.1+7.0
hArc-NLpStg(phos)23.8 ± 1.4−66.5 ± 0.6+60.2
hArc-NLGKAP-R4714 ± 19−10.9 ± 0.4+6.6
hArc-NLGKAP-R52390 ± 314−8.2 ± 0.3+4.7
hArc-NLGKAP-R4(long)426 ± 12−7.2 ± 0.2+3.1
hArc-CTGKAP-R4763 ± 10−6.8 ± 0.1+2.5
hArc-CTGKAP-R4(long)391 ± 18−6.8 ± 0.2+2.2
hArc-CTpStg80 ± 4.0−12.0 ± 0.2+6.4
hArc-CTpStg(long)26.4 ± 1.2−20.2 ± 0.2+14.0
Calorimetric analysis of hArc ligand peptide binding. A. Titration of hArc-NL with pStg variants. B. Titration of hArc-NL with GKAP repeat peptides. C. Electron density for structure 4x3i [16], which clearly has electron density for Pro instead of Arg in the ligand peptide. D. Comparison of hArc-NL and hArc-CT binding to Stg. E. Comparison of hArc-NL and hArc-CL. F. Comparison of short and long GKAP-R4 peptide binding to hArc-NL and hArc-CT. ITC analysis of Arc-peptide interactions. Only combinations of Arc domain and peptide with detectable binding in ITC are shown (see Fig. 1 for all binding curves). The hArc-CL construct did not bind to any peptide, and repeats 1–3 of GKAP did not show detectable binding to any construct in the experiments. hArc-NL binding to GKAP repeat peptides was tested next, as one of these repeats was previously shown to be a ligand [16,23]. The affinity of hArc-NL towards the five repeat sequences of GKAP showed binding in two cases (Fig. 1B, Table 2). GKAP-R4, which was known to bind [16,23], showed higher affinity. GKAP-R5 also bound to hArc-NL, while the other repeats did not show measurable affinity using ITC (Fig. 1B). The binding of GKAP-R4 and GKAP-R5 was confirmed with X-ray crystallography, and the peptides were bound in the same peptide binding site of hArc-NL (see below). The selective binding of these two repeats reveals new information on hArc binding specificity. Both GKAP-R4 and GKAP-R5 have the phenylalanine of the PxF/Y motif, while the other repeats have Arg, Val, or Leu at this position. All repeats except for GKAP-R5 have the proline of the PxF/Y motif. Still, GKAP-R5 binds to hArc-NL. It has a neighbouring Pro residue one position before the canonical Pro, which in GKAP-R5 is replaced by a Gly. It is likely that the conserved Pro residue is important for both keeping an extended conformation for the binding peptide and in making specific interactions, such as C–H…π bonds towards Arc-NL. The extra Gly residue in GKAP-R5 provides flexibility needed to accommodate this altered motif into the binding site. Overall, the results show that both the Pro and Phe/Tyr residue of the motif are required for short peptide ligand binding to hArc-NL, but the position is not absolutely fixed. Using fluorescence polarisation spectroscopy, two other peptides were reported to bind to Arc-NL, which both have His instead of the Phe/Tyr residue of the PxF/Y motif [16]. Using ITC in our earlier work [23], we could not see any indication of Arc-NL binding to the GluN2A peptide. Indeed, Arc binds to another segment of GluN2A having a PxF/Y motif [22]. These observations indicate that the Phe/Tyr residue is important for binding. An unconventional CaMKII peptide was reported to bind to Arc-NL, in which the canonical Pro is replaced by Arg [16]. However, observation of the electron density of the corresponding crystal structure reveals that the peptide sequence in the structure is erroneous, as the density indeed shows a peptide with a Pro at the canonical position instead of Arg (Fig. 1C). This is a further indication of the sequence specificity of the binding pocket in Arc-NL. Arc-CL is a structural homologue of Arc-NL, and it could in theory have a similar ligand-binding site. It might be possible that a long peptide that binds to Arc-NL would continue into the corresponding region of Arc-CL, forming an extended peptide binding surface. However, we could not detect binding of hArc-CL to any of the tested peptides (Fig. 1D). Therefore, we tested longer versions of Stg and GKAP peptides that bind hArc-NL to see if Arc-CT, with both lobes present, has altered affinity towards them. However, the affinity towards the short and long peptides was comparable (Fig. 1E–F, Table 2), suggesting that hArc-CL did not interact with the C-terminal region of the longer peptide. A similar small increase in affinity could be seen towards the longer peptides for both hArc-NL alone and hArc-CT (Table 2), showing that hArc-CL is not involved. For the longer Stg peptide, hArc-CT bound with a higher favourable enthalpy than hArc-NL; this could be an indication of additional interactions between the peptide and hArc-CL. However, the change in affinity was small, and the binding entropy was more unfavourable, compensating for the enthalpy contribution. The NMR structure of the rat Arc-CT [22], consisting of both lobes, showed that the relative orientation of the two lobes could allow a peptide binding to the Arc-NL to continue into the Arc-CL if extended. However, it appears that peptide binding is unique to the mammalian Arc-NL. The structure of Drosophila Arc1 (dArc1) without bound peptides showed that the sequence upstream of the NL could fold into the peptide binding pocket [44]. This folding back would block peptide binding, and dArc may be unable to bind to peptide ligands like mammalian Arc-NL. The structures of dArc capsids [45] suggest the same. The Stg peptide that binds to human and rat Arc-NL does not bind to the two isoforms of dArc [46]. All peptides showed similar enthalpy-entropy compensation, when binding thermodynamics were assessed using ITC (Table 2). The observations are typical for a protein-ligand interaction in the micromolar affinity range [47,48]. Favourable enthalpy is brought about by hydrogen bonds as well as weaker interactions including C–H…π bonds and van der Waals interactions, while unfavourable entropy can be explained by overall loss of flexibility in the complex compared to molecules free in solution.

Structures of hArc-NL bound to ligand peptides

We aimed to obtain more detailed information on Arc-peptide interactions using X-ray crystallography. A comparison of the crystal structures of the different Arc-peptide complexes (Table 3) further highlights important details of peptide binding (Fig. 2). The structures of rat Arc-NL and Arc-CL were previously reported [16]. We report the hArc-NL structure bound to the pStg(wt) peptide at higher resolution (1.90 Å) (Fig. 2A). The structure of hArc-NL bound to GKAP-R4 shows that the peptide backbone and the PxF motif align well with pStg(wt) (Fig. 2B). hArc-NL complexed with GKAP-R5 is different, showing that the peptide backbone is shifted at the position of the canonical Pro at P(-2), so that the neighbouring Pro at P(-3) present in GKAP-R5 can get closer to the binding pocket (Fig. 2C). Based on these findings, the linear motif for Arc-NL binding can be expanded from PxF/Y to P(G)xF/Y.
Table 3

Crystallographic data processing and refinement.

StructurehArc-NL + pStg(wt)hArc-NL + GKAP-R4hArc-NL + GKAP-R5hArc-CL
PDB entry6TNO6TNQ6TQ06TN7
BeamlineP13P13P13P14
Wavelength0.9760.9760.9760.976
Space groupP21P21P21P22121
Unit cell parameters51.39 29.77 71.79 90 92.5 9050.80 29.37 71.36 90 93.4 9039.31 192.30 39.56 90 94.8 9037.21 38.29 61.01 90 90 90
Resolution range (Å)50–1.90 (1.95–1.90)50–1.30 (1.34–1.30)50–1.95 (2.00–1.95)50–1.67 (1.71–1.67)
Completeness (%)97.8 (97.9)89.5 (40.4)98.2 (97.3)92.2 (32.5)
Rsym (%)4.1 (38.6)2.7 (13.8)4.5 (178.6)5.1 (62.6)
Rmeas (%)4.8 (45.1)3.2 (17.9)5.3 (207.3)5.5 (69.9)
CC1/2 (%)99.9 (92.1)99.9 (96.3)99.9 (40.0)99.9 (85.7)
<I/σ(I)>18.6 (2.9)24.2 (4.6)13.8 (0.9)18.3 (2.6)
Redundancy3.6 (3.7)3.4 (2.0)3.7 (3.9)7.1 (4.5)
Wilson B2)36.322.953.835.2
Rwork/Rfree (%)18.2/21.613.2/17.022.1/26.324.2/26.8
Rmsd bond length (Å)0.0050.0160.0120.004
Rmsd bond angle (°)0.91.40.80.8
Ramachandran favoured/outliers (%)100/099.5/099.5/0.298.8/0
MolProbity score (percentile)1.74 (86th)1.68 (61st)1.62 (92nd)1.72 (77th)
Fig. 2

Structures of hArc-NL complexes with peptide ligands.

A. Stereo view of the hArc-NL complexed with pStg(wt).

B. Stereo view of the hArc-NL complexed with GKAP-R4.

C. Stereo view of the hArc-NL complexed with GKAP-R5.

D. Conserved C–H…π interactions of the Pro residue in the ligand peptide with Tyr227.

E. Two conserved C–H…π interactions of the peptide aromatic residue.

Crystallographic data processing and refinement. Structures of hArc-NL complexes with peptide ligands. A. Stereo view of the hArc-NL complexed with pStg(wt). B. Stereo view of the hArc-NL complexed with GKAP-R4. C. Stereo view of the hArc-NL complexed with GKAP-R5. D. Conserved C–H…π interactions of the Pro residue in the ligand peptide with Tyr227. E. Two conserved C–H…π interactions of the peptide aromatic residue. The key interactions in all complexes include three C–H…π hydrogen bonds that directly involve the P(G)xF/Y motif. Pro at P(-2/-3) forms a C–H…π interaction with the aromatic ring of Tyr227 in Arc (Fig. 2D). In addition, Tyr/Phe(0) is involved in two C–H…π interactions (Fig. 2E). The Cβ atom interacts with the side chain of Phe220, and the aromatic ring accepts a C–H…π bond from Phe271 of Arc. Other residues in the peptides also make conserved interactions with hArc-NL (Fig. 2). The visually strongest of these are main-chain β-sheet interactions of the bound peptide on both sides of the chain. Additionally, the conserved Arg at P(+1) and/or P(+3) form salt bridges to Glu215 of hArc-NL. Asn247 interacts with the side chain of the variable residue in the P(-1) position in all the three complexes, which may be one affinity determinant between different ligand motifs. The crystal structure of hArc-NL was subjected to MD simulations, with and without the pStg(wt) ligand. This was done in order to follow the dynamics of the Arc-NL N-terminal region that in peptide complexes, folds as a β strand on top of the peptide. The results indicate flexibility of the N terminus of hArc-NL in the absence of a peptide ligand (Fig. 3A). Furthermore, the radius of gyration during simulation is smaller for the peptide complex than the free protein (Fig. 3B–C). Hence, conformational changes in Arc may be linked to peptide ligand binding, as the long flexible linker between the N-terminal and C-terminal domains of Arc lies N-terminal to hArc-NL.
Fig. 3

Simulations and Stg phosphorylation.

A. Root mean-square fluctuations (RMSF) of hArc-NL C⍺ atoms without the peptide (black), with pStg(wt) (red), and with pStg(phos) (blue). Note the flexible N terminus in the absence of ligand peptides and the increased mobility of helix 2 in the presence of pStg(phos).

B. Radius of gyration during the simulations; colouring as in (A).

C. Structural snapshots from each simulation at a time point of 200 ns; colouring as in (A). Note the more open conformation of the N terminus in the absence of bound peptide.

D. ITC analysis of hArc-NL binding to pStg(phos).

E. Polar interactions of phospho-Ser228 200 ns into the simulation. Hydrogen bonds are shown as dashed lines, and residues making frequent contact with the phospho group are indicated. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)

Simulations and Stg phosphorylation. A. Root mean-square fluctuations (RMSF) of hArc-NL C⍺ atoms without the peptide (black), with pStg(wt) (red), and with pStg(phos) (blue). Note the flexible N terminus in the absence of ligand peptides and the increased mobility of helix 2 in the presence of pStg(phos). B. Radius of gyration during the simulations; colouring as in (A). C. Structural snapshots from each simulation at a time point of 200 ns; colouring as in (A). Note the more open conformation of the N terminus in the absence of bound peptide. D. ITC analysis of hArc-NL binding to pStg(phos). E. Polar interactions of phospho-Ser228 200 ns into the simulation. Hydrogen bonds are shown as dashed lines, and residues making frequent contact with the phospho group are indicated. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)

Phosphorylation of stargazin

Arc interacts with Stg through the binding site on Arc-NL, and Stg phosphorylation state is regulated during synaptic plasticity [49]. One phosphorylation site is located in the middle of the Stg peptide, at the P(-1) position of the binding motif. We tested if phosphorylation of the Stg peptide at Ser228 affected the binding affinity to hArc-NL. The binding entropy and enthalpy both changed, but binding affinity remained the same (Fig. 3D, Table 2), indicating that the Stg motif binds to hArc-NL regardless of the phosphorylation state at Ser228. Our result is in conflict with an earlier experiment, where a fluorescence polarisation competition binding assay showed that phosphorylation of a Stg peptide at Ser228 reduced the affinity to Arc-NL 20-fold [16]. The difference may be related to the fact that the peptide we used is longer. Like for the other studied peptides, a strong enthalpy-entropy compensation is present (Table 2), and the observed changes are in line with those expected, when comparing peptides with and without phosphorylation [50]. The high favourable enthalpy is a sign of electrostatic interactions between the phosphate group and the protein, while the large unfavourable entropy component is likely related to the more flexible nature of the phosphopeptide when free in solution, as well as reduced flexibility of Arc in the pStg(phos) complex. The structure of hArc-NL bound to pStg(wt) shows that Ser228 points away from the hydrophobic pocket, and phosphorylation should not generate major clashes. A model of hArc-NL bound to pStg(phos) was made based on the crystal structure to get an insight into additional interactions. The phospho-Ser228 side chain is surrounded by potentially interacting residues from all sides (Fig. 3E). The residues most likely making contact with the phosphate group include Asn247, His245, and Thr211. MD simulations further indicate that, in addition to Lys250, the Arg side chain two residues downstream on the peptide can make a salt bridge contact with the phospho-Ser228; this explains some of the favourable enthalpy and unfavourable entropy observed in ITC. At a coarser level, it can be seen that the phosphopeptide binding is accompanied by an increased flexibility of the first half of helix ⍺2 of hArc-NL (Fig. 3A).

Structure of hArc-CL and folding properties of the lobes

We crystallised hArc-CL, which has the same overall fold as hArc-NL (Fig. 4A). As expected, hArc-CL is similar to rat Arc-CL, but gives a higher resolution and shows the structure of the missing loop after the first helix (Fig. 4B). The largest variation between the rat and human Arc-CL is the tilt of the first helix (here termed ⍺0), which is a sign of flexibility of the structure, and could reflect varying conformations of the Arc-CT in the context of full-length Arc. The sequences of rat and human Arc-CL only differ by 3 residues, and the structures were expected to be similar. One of these residue variations is Phe333 in rat Arc, which is Leu for hArc. The smaller residue in hArc allows the second helix to come closer to the protein core. However, whether this is of functional relevance, remains to be studied.
Fig. 4

Structure of hArc-CL and structure analyses.

A. Superposition of hArc-NL (grey) and hArc-CL (rainbow colours).

B. Superposition of human (coloured) and rat (grey) Arc-CL.

C. DynaMine analysis of human full-length Arc.

D. Betweenness analysis of hArc-NL (left) and hArc-CL (right).

E. 3D view of the most central residues, based on the betweenness scores. Sidechains of residues scoring >0.5 are shown, and the top 3 central residues in each domain are coloured green. Left: hArc-NL, right: hArc-CL.

F. Structure-based sequence alignment of hArc-NL and hArc-CL.

G. Superposition of hArc-NL (grey) and hArc-CL (rainbow), with identified conserved sites highlighted. The central FL motif (blue) is conserved in the fold family [16], and may play a dual role in folding and ligand binding. At both ends of the peptide binding site of hArc-NL, both hArc-NL and hArc-CL have hydrophobic/aromatic clusters (green/pink). In addition, conserved charged residues (grey) may have functional relevance in the protein family. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)

Structure of hArc-CL and structure analyses. A. Superposition of hArc-NL (grey) and hArc-CL (rainbow colours). B. Superposition of human (coloured) and rat (grey) Arc-CL. C. DynaMine analysis of human full-length Arc. D. Betweenness analysis of hArc-NL (left) and hArc-CL (right). E. 3D view of the most central residues, based on the betweenness scores. Sidechains of residues scoring >0.5 are shown, and the top 3 central residues in each domain are coloured green. Left: hArc-NL, right: hArc-CL. F. Structure-based sequence alignment of hArc-NL and hArc-CL. G. Superposition of hArc-NL (grey) and hArc-CL (rainbow), with identified conserved sites highlighted. The central FL motif (blue) is conserved in the fold family [16], and may play a dual role in folding and ligand binding. At both ends of the peptide binding site of hArc-NL, both hArc-NL and hArc-CL have hydrophobic/aromatic clusters (green/pink). In addition, conserved charged residues (grey) may have functional relevance in the protein family. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.) Recent structural data on dArc have indicated its lobe domains have strong oligomerisation propensity, and the dArc-CL forms dimers, both in the crystal state and in capsids [[44], [45], [46]], similar to those formed by viral capsid proteins. In line with earlier data [23,46], the crystal forms of hArc-NL and hArc-CL here provide no evidence for such high-affinity symmetric oligomerisation of the hArc lobe domains, which may represent an important difference between mammalian Arc and dArc, related to the lack of a large N-terminal domain in dArc. A sequence-based prediction of chain dynamics was carried out for full-length Arc (Fig. 4C). The most rigid predicted region is the oligomerisation region [19] in the helical coil of Arc-NT. The Arc-CT helices are also predicted as rigid, while the termini and the central linker are flexible. All these aspects comply well with current experimental data. Residue interaction networks of both hArc-NL and hArc-CL were analysed (Fig. 4D) based on the crystal structures, in order to identify residues central to the lobe domain fold. The central residues correspond mainly to buried large hydrophobic residues, which are conserved in the protein family. A Phe residue at the same position of both hArc lobes has high betweenness in both structures (Fig. 4E). A structure-based sequence alignment of hArc-NL and hArc-CL (Fig. 4F) shows low sequence homology, with few conserved residues. In a larger sequence alignment [16], most of these residues were conserved across species, including the FL motif and the double Glu motif. An inspection of the vicinity of the hArc-NL peptide binding site in superposed hArc lobes (Fig. 4G) indicates that the charged cluster is away from the binding site, which is centered at the FL motif and flanked by hydrophobic and aromatic surfaces. The central FL motif could, hence, function in both folding of the lobes and ligand interactions. While we have no evidence for peptide ligand binding by the hArc-CL, a putative, currently unknown factor removing the bound ⍺0 helix from this site could induce binding of external ligands in a manner similar to the hArc-NL. The charged cluster is involved in a network of salt bridges in the dArc capsid [45], whereby the NL forms penta- and hexamers on the capsid surface. Whether this is a conserved functional feature on hArc, awaits the structure of the mammalian Arc capsid. A structural homology search with hArc-NL and hArc-CL returned, as expected, various capsid proteins from retroviruses, but also other similarly folded helical bundles. The most notable group concerns acyl carrier proteins (ACPs), which have a similar 4-helix bundle as the Arc NL and CL. The function of ACPs as carriers of acyl intermediates in fatty acid metabolism is not related to the function of Arc on the basis of current knowledge. For both viral capsid proteins and ACPs, it is noteworthy that these similarly folded proteins often have a sequence identity of less than 5% towards hArc. A selected set of hits is presented in Table 4 and Fig. 5. While it is possible that finding ACPs in the similarity search simply reflects the usefulness of 4-helix bundles for biomolecular interaction modules, to our knowledge, further implications of this structural similarity have not been studied.
Table 4

Structural homologs of hArc lobe domains. Shown are selected hits with high similarity from a Salami [40] search with the hArc-NL and hArc-CL structures. Both searches produce a long list of similar structures, with often low sequence identity. Examples of superposed structures are given in Fig. 5.

Search modelHomologuePDB entryC⍺ RMSD (Å)Sequence identity (%)reference
hArc-NL + pStg(wt)rat Arc NL + Stg4x3h0.57100[16]
dArc1 capsid6tas2.8824[45]
Toxoplasma mitochondrial ACP3ce73.443
Ty3 capsid6r232.845[51]
P. luminescens ACP6sm43.282[52]
E. faecalis ACP2n503.783[53]
hArc-CLrat Arc CL4x3x1.5696[16]
dArc1 CL6sid2.5222[46]
Rous sarcoma virus capsid protein1eoq2.9018[54]
rat Arc NL4x3h2.8710[16]
T. thermophilus ACP1x3o4.285
Ty3 capsid protein6r232.9712[51]
HML2 C-terminal dimer domain6sai2.9814[55]
Fig. 5

Selected structures with similarity to hArc-NL and hArc-CL.

Top row: selected structures similar to hArc-NL. The 3 last ones are different ACPs. Bottom row: selected structures similar to hArc-CL. The last structure is an ACP. See Table 4 for additional details.

Structural homologs of hArc lobe domains. Shown are selected hits with high similarity from a Salami [40] search with the hArc-NL and hArc-CL structures. Both searches produce a long list of similar structures, with often low sequence identity. Examples of superposed structures are given in Fig. 5. Selected structures with similarity to hArc-NL and hArc-CL. Top row: selected structures similar to hArc-NL. The 3 last ones are different ACPs. Bottom row: selected structures similar to hArc-CL. The last structure is an ACP. See Table 4 for additional details.

Arc and the PSD protein network

PSD scaffold proteins belong to several protein families, and they each carry distinct sets of protein interaction domains, which can be used to form multivalent contacts in the tight protein network of the PSD, but also for regulating the PSD molecular assembly in an activity-dependent manner. The latter is important for the participation of the PSD in LTP and LTD at excitatory glutamatergic synapses. The structure of the PSD varies between excitatory and inhibitory synapses [56]. Various PSD scaffold molecules are concentrated at different depths from the postsynaptic membrane [[57], [58], [59]]. The distribution of individual proteins can be regulated by synaptic activity [30,60], which in turn may result in changes in post-translational modifications as well as interaction partners. Some of the PSD proteins, such as the Shanks, have large molecular dimensions and are flexible, possibly enabling them to interact with other proteins at different depths of the PSD assembly [61]. We showed that hArc-NL can bind to multiple sites on GKAP. The function of this interaction remains unknown, but one possible scenario may be that Arc assists in the disassembly/reassembly process of PSD proteins. Large protein complexes in the PSD are formed through a number of relatively weak interactions between multidomain proteins, such as GKAP, PSD95, Shank, and Homer. Could Arc play a role as a rapid regulator and re-organiser of such interactive networks? Putative Arc-binding motifs are present in several components of the PSD assembly [31], but they have not been further characterised. Phase separation in the PSD is an emerging mechanism of formation of the PSD molecular network [62,63]. The formation of so-called membraneless organelles is central in many biological processes, and phase separation can be induced by properties of individual macromolecules and/or complexes under specific conditions. The structural details of forming a separated phase centered on PSD scaffold proteins, such as GKAP, remain to be elucidated. Recent work using truncated PSD proteins has shed light on protein domains and interactions required for PSD phase separation [64]. GKAP accumulates at the PSD during synapse inactivity and is removed by synaptic activation due to proteasome-coupled degradation [25]. Following synapse activation, Arc rapidly accumulates at the postsynapse, being thereafter degraded [1]. The ability of Arc to bind to several proteins in the PSD [31] could disrupt structural interactions, resulting in dissolution or loosening of the PSD scaffold, which would promote remodelling of the synapse. Disturbing the interaction between Stg and PSD95 increases AMPAR surface diffusion, preventing AMPAR accumulation at postsynaptic sites [65]. AMPAR removal from postsynaptic sites is an effect of Arc expression [13], suggesting that Arc expression and disturbing Stg-PSD95 interactions could result in the same outcome [6]. The C terminus of Stg interacts with the first two PDZ domains of PSD95 [66], while the entire Stg cytoplasmic domain was shown to be involved in the interaction [63]. The Stg C-terminal tail interacts with Arc-NL, and Arc could disturb the interaction between Stargazin and PSD95. The PSD95 GK domain interacts with the repeat regions of GKAP [29], and phosphorylation of the GKAP repeats is important for this interaction [67]. The repeats of GKAP contain both a putative Arc-binding site and a conserved binding site for PSD95 (Fig. 6A). We have shown that two of these repeats bind to Arc, suggesting that Arc could directly affect the interaction between PSD95 and GKAP. Structural data at the moment do not necessarily rule out simultaneous binding (Fig. 6B), and Arc oligomerisation could be an additional factor in regulating the interaction. A GKAP mutant incapable of binding PSD95 induced Shank aggregation and degradation in neurons, and Shank and GKAP form aggresomes that are degraded by proteasomes in the absence of PSD95 [68]. Due to the multiple possible ligands of Arc, as well as its oligomerisation properties [19,20], it is possible that a high local concentration of Arc disrupts both the Stg-PSD95 as well as the PSD95-GKAP interaction, allowing remodelling of the synapse. Both LTP and LTD require structural remodelling of the PSD. A potential role for Arc in regulating PSD phase separation (decondensation) is attractive, as it might explain why enhanced Arc synthesis is needed for consolidation of both LTP and LTD. Rapid Arc expression following LTP and LTD induction would interrupt interactions with PSD95 and promote remodelling of the PSD, while Arc degradation would be required to establish a new stable state. Clearly, further experimental studies into the GKAP system, focusing on Arc, are warranted.
Fig. 6

Arc and GKAP.

A. Alignment of the five GKAP repeats, including the neighbouring segment that binds to PSD95. Conserved residues in the PSD95-binding site are in bold, and the peptides used in this study are shaded grey.

B. 3D model of a ternary complex of Arc, GKAP, and the PSD95 GK domain. The GKAP repeat could sterically bind to both Arc-NL and PSD95 at the same time. Shown are the PSD95-GKAP complex [67], hArc-NL bound to GKAP-R4, and the NMR structure of rat Arc-CT [22].

Arc and GKAP. A. Alignment of the five GKAP repeats, including the neighbouring segment that binds to PSD95. Conserved residues in the PSD95-binding site are in bold, and the peptides used in this study are shaded grey. B. 3D model of a ternary complex of Arc, GKAP, and the PSD95 GK domain. The GKAP repeat could sterically bind to both Arc-NL and PSD95 at the same time. Shown are the PSD95-GKAP complex [67], hArc-NL bound to GKAP-R4, and the NMR structure of rat Arc-CT [22]. A further expansion of such a suggested Arc function could include regulation of the actin cytoskeleton. Arc-NL also binds to a peptide from WAVE1 or Wasf1 [16,23]. WAVE1 is an activator of the Arp2/3 complex that is involved in branching of actin filaments in dendritic spines [11]. The PSD is connected to actin through cortactin, which binds to Shank via its proline-rich domain [69,70]. The interaction of Arc with WAVE1 might be an additional way of affecting the actin cytoskeleton and aiding synaptic remodelling.

Conclusions

We have shown that hArc-NL binds to specific peptides, with a PxF/Y or P(G)xF/Y motif. Knowing these requirements of peptide binding will be helpful when trying to locate Arc binding sites in other proteins and for identifying new Arc binding partners. The peptide binding site of Arc is specific and able to bind to two positions of the GKAP protein, as well as to the phosphorylated and unphosphorylated Stg peptide. For any of the discussed Arc-NL ligands with short linear binding motifs analysed here and in other studies, we currently do not know if additional binding surfaces exist in the corresponding proteins. Such a setting would increase the affinity of the observed interactions, together with possible avidity effects brought about by Arc oligomerisation. The network of interactions between Arc and other PSD proteins is likely to affect the structural arrangement of the postsynaptic density.

Author contributions

EIH: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Validation; Visualization; Writing - original draft. CRB: Conceptualization; Funding acquisition; Project administration; Resources; Supervision; Writing - review & editing. PK: Conceptualization; Data curation; Formal analysis; Funding acquisition; Investigation; Project administration; Resources; Supervision; Validation; Visualization; Writing - original draft; Writing - review & editing.

Funding

This work was funded through a TOPPFORSK grant (249951) from the (to CRB) and an INFRASTRUCTURE grant (245828) from the .

Declaration of competing interest

The authors declare no conflicts of interest.
  69 in total

1.  NAPS update: network analysis of molecular dynamics data and protein-nucleic acid complexes.

Authors:  Broto Chakrabarty; Varun Naganathan; Kanak Garg; Yash Agarwal; Nita Parekh
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

2.  The Capsid Domain of Arc Changes Its Oligomerization Propensity through Direct Interaction with the NMDA Receptor.

Authors:  Lau Dalby Nielsen; Christian Parsbæk Pedersen; Simon Erlendsson; Kaare Teilum
Journal:  Structure       Date:  2019-05-09       Impact factor: 5.006

3.  Structure and self-association of the Rous sarcoma virus capsid protein.

Authors:  R L Kingston; T Fitzon-Ostendorp; E Z Eisenmesser; G W Schatz; V M Vogt; C B Post; M G Rossmann
Journal:  Structure       Date:  2000-06-15       Impact factor: 5.006

4.  MolProbity: More and better reference data for improved all-atom structure validation.

Authors:  Christopher J Williams; Jeffrey J Headd; Nigel W Moriarty; Michael G Prisant; Lizbeth L Videau; Lindsay N Deis; Vishal Verma; Daniel A Keedy; Bradley J Hintze; Vincent B Chen; Swati Jain; Steven M Lewis; W Bryan Arendall; Jack Snoeyink; Paul D Adams; Simon C Lovell; Jane S Richardson; David C Richardson
Journal:  Protein Sci       Date:  2017-11-27       Impact factor: 6.725

5.  NAPS: Network Analysis of Protein Structures.

Authors:  Broto Chakrabarty; Nita Parekh
Journal:  Nucleic Acids Res       Date:  2016-05-05       Impact factor: 16.971

6.  The SALAMI protein structure search server.

Authors:  Thomas Margraf; Gundolf Schenk; Andrew E Torda
Journal:  Nucleic Acids Res       Date:  2009-05-22       Impact factor: 16.971

7.  Computational analysis of phosphopeptide binding to the polo-box domain of the mitotic kinase PLK1 using molecular dynamics simulation.

Authors:  David J Huggins; Grahame J McKenzie; Daniel D Robinson; Ana J Narváez; Bryn Hardwick; Meredith Roberts-Thomson; Ashok R Venkitaraman; Guy H Grant; Mike C Payne
Journal:  PLoS Comput Biol       Date:  2010-08-12       Impact factor: 4.475

8.  Arc is a flexible modular protein capable of reversible self-oligomerization.

Authors:  Craig Myrum; Anne Baumann; Helene J Bustad; Marte Innselset Flydal; Vincent Mariaule; Sara Alvira; Jorge Cuéllar; Jan Haavik; Jonathan Soulé; José Maria Valpuesta; José Antonio Márquez; Aurora Martinez; Clive R Bramham
Journal:  Biochem J       Date:  2015-05-15       Impact factor: 3.857

9.  Differentiation and Characterization of Excitatory and Inhibitory Synapses by Cryo-electron Tomography and Correlative Microscopy.

Authors:  Chang-Lu Tao; Yun-Tao Liu; Rong Sun; Bin Zhang; Lei Qi; Sakar Shivakoti; Chong-Li Tian; Peijun Zhang; Pak-Ming Lau; Z Hong Zhou; Guo-Qiang Bi
Journal:  J Neurosci       Date:  2018-01-08       Impact factor: 6.167

10.  A functional role of postsynaptic density-95-guanylate kinase-associated protein complex in regulating Shank assembly and stability to synapses.

Authors:  Stefano Romorini; Giovanni Piccoli; Ming Jiang; Pasquale Grossano; Noemi Tonna; Maria Passafaro; Mingjie Zhang; Carlo Sala
Journal:  J Neurosci       Date:  2004-10-20       Impact factor: 6.167

View more
  3 in total

1.  High-affinity anti-Arc nanobodies provide tools for structural and functional studies.

Authors:  Sigurbjörn Markússon; Erik I Hallin; Helene J Bustad; Arne Raasakka; Ju Xu; Gopinath Muruganandam; Remy Loris; Aurora Martinez; Clive R Bramham; Petri Kursula
Journal:  PLoS One       Date:  2022-06-07       Impact factor: 3.752

2.  Development and Validation of Arc Nanobodies: New Tools for Probing Arc Dynamics and Function.

Authors:  Yuta Ishizuka; Tadiwos F Mergiya; Rodolfo Baldinotti; Ju Xu; Erik I Hallin; Sigurbjörn Markússon; Petri Kursula; Clive R Bramham
Journal:  Neurochem Res       Date:  2022-03-20       Impact factor: 4.414

3.  Phosphorylation-dependent control of Activity-regulated cytoskeleton-associated protein (Arc) protein by TNIK.

Authors:  Alicyia Walczyk-Mooradally; Jennifer Holborn; Karamjeet Singh; Marshall Tyler; Debasis Patnaik; Hendrik Wesseling; Nicholas J Brandon; Judith Steen; Steffen P Graether; Stephen J Haggarty; Jasmin Lalonde
Journal:  J Neurochem       Date:  2021-06-22       Impact factor: 5.546

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.