Literature DB >> 26517891

The centriolar protein CPAP G-box: an amyloid fibril in a single domain.

Erin E Cutts1, Alison Inglis1, Phillip J Stansfeld1, Ioannis Vakonakis1, Georgios N Hatzopoulos.   

Abstract

Centrioles are evolutionarily conserved cylindrical cell organelles with characteristic radial symmetry. Despite their considerable size (400 nm × 200 nm, in humans), genetic studies suggest that relatively few protein components are involved in their assembly. We recently characterized the molecular architecture of the centrosomal P4.1-associated protein (CPAP), which is crucial for controlling the centriolar cylinder length. Here, we review the remarkable architecture of the C-terminal domain of CPAP, termed the G-box, which comprises a single, entirely solvent exposed, antiparallel β-sheet. Molecular dynamics simulations support the stability of the G-box domain even in the face of truncations or amino acid substitutions. The similarity of the G-box domain to amyloids (or amyloid precursors) is strengthened by its oligomeric arrangement to form continuous fibrils. G-box fibrils were observed in crystals as well as in solution and are also supported by simulations. We conclude that the G-box domain may well represent the best analogue currently available for studies of exposed β-sheets, unencumbered by additional structural elements or severe aggregations problems.
© 2015 Authors; published by Portland Press Limited.

Entities:  

Keywords:  CPAP; b-sheet; centriole; fibril; structure

Mesh:

Substances:

Year:  2015        PMID: 26517891      PMCID: PMC4613516          DOI: 10.1042/BST20150082

Source DB:  PubMed          Journal:  Biochem Soc Trans        ISSN: 0300-5127            Impact factor:   5.407


Introduction

Centrioles are large cylindrical organelles, ubiquitous in eukaryotic cells, with a microtubule-based outer wall and characteristic 9-fold radial symmetry. We have limited structural information on most proteins found in centrioles but, so far, the few components resolved yielded exciting structures with the propensity to form large assemblies through oligomerization [1-4]. Recently, we and others, determined the structure of the C-terminal domain of the centriolar protein CPAP, termed the G-box [5-7]. The G-box domain adopts a unique conformation comprising a single continuous antiparallel β-sheet of approximately 7.5 nm length. Further, this domain oligomerizes in vitro in a concentration-depended manner and, in crystals, it self-associates through a head-to-tail interaction to form continuous β-sheets reminiscent of amyloid fibrils [5-7]. Functionally, CPAP is one of the five proteins, excluding microtubules, necessary for centriole formation. CPAP is a positive regulator of centriole length and, together with the centrosomal protein 120kDa (Cep120), Cep135 and the spindle and centriole associated protein 1 (SPICE1), it controls the centriole elongation process [8-11]. In centrioles, CPAP localizes close to the outer microtubule wall [12], where the α/β tubulin units impose a spatial periodicity of ∼8 nm [13]. Inspired by the similar length of the G-box, we proposed a model [5] in which G-box fibrils act as a molecular ruler aligning the centriolar core structure [14,15] with outer microtubules. Our model provides a mechanistic explanation for the abnormally long centrioles observed upon CPAP overexpression [11,16,17] and it crucially relies on the G-box domain to provide a platform for protein interactions important for centriole assembly. The importance of this domain to centriole assembly and consequently to human health, is demonstrated by a mis-sense mutation therein, E1235V that negatively affects centriole formation [6] and leads to primary microcephaly [18]. The G-box structure is the sole example from a unique protein fold. Although all-β proteins are common, they typically have a hydrophobic core formed by interactions between β-sheets as seen in the β-sandwich, β-helix or β-barrel folds. In these folds, open (non-hydrogen bonded) edges of β-strands are protected against interacting with other open β-strand edges, for example through steric hindrance by nearby loops or helices [19]. Single β-sheet proteins, in contrast, tend to form large aggregates through β-strand edge-to-edge interactions as well as exposed hydrophobic patches; such associations are well documented in amyloid fibrils where long parallel sheets, formed by β-strand interactions, sandwich hydrophobic surfaces [20-22]. As a result, biophysical studies of single β-sheets in isolation have primarily relied on protein analogues [23] and thus far only one short three-stranded β-sheet has been engineered [24]. To our knowledge, naturally occurring single-layered β-sheet proteins are restricted to the Borrelia burgdorferi outer surface protein A (OspA) [25] and the surface protein G (SasG) identified in Staphylococcus aureus [26]. However, OspA features globular domains capping its exposed single-layered central β-sheet, whereas the SasG topology does not define a continuous β-sheet. Hence, the G-box domain is, to our knowledge, the only available example of a repetitive class III single-antiparallel β-structure protein [27] and it can thus provide valuable information about the stability, flexibility and properties of such assemblies.

Materials and Methods

Details of the molecular dynamics (MD) simulations and small angle X-ray scattering (SAXS) methods are available as supporting information.

The β-sheet fold of the G-box domain

Our structure of the G-box domain from Danio rerio CPAP (4LD1, 4LD3 and 4LZF) [5] consists of 17 antiparallel β-strands that form a 75 Å (1 Å=0.1 nm) long and 23 Å wide β-sheet, with 168° of twist across its length (Figure 1A). Remarkably, all residues of this structure are solvent exposed, thus the G-box domain lacks a well-defined hydrophobic core. However, despite adopting a unique protein fold, all metrics of β-sheet geometry in the G-box domain match those observed in other systems. The mean twist per strand is 18°, which is comparable with that of globular proteins (30±30°) [28,29] and the overall β-sheet twist is left handed, consistent with reports that this is the most energetically favourable case [30]. Further, studies on the dependence of protein stability to β-strand lengths, using synthetic peptides forming antiparallel β-hairpins, found that seven residues per strand were optimal [31], this is also the length of most β-strands in the G-box domain.
Figure 1

Structural analysis of the CPAP G-box domain

(A) Schematic representation of G-box in two perpendicular views. The secondary structure elements, protein dimensions and the β-sheet twist are shown. (B) Superposition of the first five β-strands of all available G-box structures. 4LD1 in slate, 4LZ in red, 4LD3 in orange, 4MPZ in yellow, 4BY2_A in light green, 4BY2_B n dark green, 4BY2_C in lime green, 4BXR_A in marine, 4BXR_B in magenta and 4BXP in blue. (C) Representation of electrostatic and hydrophobic interactions on the surface of the G-box β-sheet. Hydrophobic residues are shown in grey in space-filling representation, whereas charged residues are in blue (for positive charge) and red (for negative charge).

Structural analysis of the CPAP G-box domain

(A) Schematic representation of G-box in two perpendicular views. The secondary structure elements, protein dimensions and the β-sheet twist are shown. (B) Superposition of the first five β-strands of all available G-box structures. 4LD1 in slate, 4LZ in red, 4LD3 in orange, 4MPZ in yellow, 4BY2_A in light green, 4BY2_B n dark green, 4BY2_C in lime green, 4BXR_A in marine, 4BXR_B in magenta and 4BXP in blue. (C) Representation of electrostatic and hydrophobic interactions on the surface of the G-box β-sheet. Hydrophobic residues are shown in grey in space-filling representation, whereas charged residues are in blue (for positive charge) and red (for negative charge). In addition to our model, further structures of the D. rerio CPAP G-box are now resolved (4BXP and 4BXR [6]), as well as structures from the Drosophila melanogaster CPAP G-box homologue (4BY2 and 4MPZ [6,7]). Although all structures show the same basic fold they differ in the number of β-strands formed, with the D. rerio G-box comprising 17–20 strands depending on the crystallographic model and the D. melanogaster domain having 14–15 strands. Superposition of the first five β-strands of these models shows that they differ in overall β-sheet geometry (Figure 1B), however no single point of large deviation could be identified. Rather, minor differences in inter-strand angles, amplified across the 14–20-strand long sheet, result in different β-sheet orientations. Further, β-sheet geometry is likely constrained and influenced by crystal packing forces, which naturally vary in these models due to different crystal contacts. The G-box domain lacks a canonical, well-defined hydrophobic core as all its amino acid side chains are solvent exposed, yet it is stable in solution as shown by thermal denaturation experiments [5]. The structure is stabilized through a network of hydrophobic contacts and salt bridges that form between adjacent β-strands (Figure 1C). On one face of the G-box β-sheet strings of hydrophobic residues, such as Val1038, Val1050 and Ile1056, mediate contacts between adjacent strands. Any gaps in the continuity of the hydrophobic network are complemented with salt bridges, such as between Lys1022 and Glu1030. The second face of the G-box β-sheet has similar motifs of stabilization and, in addition, contains multiple aromatic and arginine residues involved in π-stacking interactions.

The G-box domain has a stable but dynamic structure

In order to further investigate the unique nature of this domain, we used atomistic MD simulations to gain insight in the stability and flexibility of the structure. Starting from the crystallographic model 4LD1 we performed 100 ns-long simulations [5], throughout which the G-box structure remains stable as it retains the main chain hydrogen bonding network and secondary structure elements (Figure 2A; Supplementary Table S1; Supplementary Movie S1). Minimal distortions are visible in the terminal β-strands where the protein N- and C-termini tend to fold back and interact with the β-sheet; these distortions are transient and manifest as slightly higher than average inter-strand twist angles at the terminal β-strands (Figure 2B). The overall G-box β-sheet twist remains left-handed throughout the MD simulations although it flattens somewhat to 130±30° from the 168° starting point. In addition, an average length of seven amino acids is maintained for the majority of β-strands throughout the simulations (Figure 2A). Hence, the G-box structure retains an energetically favourable conformation with respect to its twist [30] and typical strand length [31]. Further domain motions in the MD simulations result in slight β-sheet bending, such that the average end-to-end distance is 72 Å compared with 75 Å in the starting model and transient fanning of β-strand termini as single hydrogen bonds break and reform.
Figure 2

Analysis of the G-box structure throughout 100 ns of atomistic MD simulations

(A) Secondary structure composition as function of amino acid residue and simulation time and (B) inter-strand twist angle as function of simulation time. The specific strand pairs are indicated. (C) One MD simulation of an I1067G substitution on β-strand 13 yielded large fluctuations in twist and eventually in breakage of the β-sheet. Shown here are (top) a snapshot of the G-box model at the end of this MD simulation and (bottom) inter-strand twist angles for the region surrounding the I1067G substitution.

Analysis of the G-box structure throughout 100 ns of atomistic MD simulations

(A) Secondary structure composition as function of amino acid residue and simulation time and (B) inter-strand twist angle as function of simulation time. The specific strand pairs are indicated. (C) One MD simulation of an I1067G substitution on β-strand 13 yielded large fluctuations in twist and eventually in breakage of the β-sheet. Shown here are (top) a snapshot of the G-box model at the end of this MD simulation and (bottom) inter-strand twist angles for the region surrounding the I1067G substitution. The variable number of β-strands resolved in the different crystallographic models of the G-box domain, as well as biophysical data showing that G-box variants harbouring single amino acid substitutions remain folded [5-7], led us to study the effects of truncations or substitutions on the G-box structure in MD simulations. We constructed in silico C-terminally or N-terminally truncated models of the G-box domain comprising just eight out of 17 strands; however, 100 ns-long MD simulations of these models did not show significant structural perturbations (Supplementary Tables S2 and S3 respectively). G-box variants harbouring amino acid substitutions V1050A, V1050S, Y995A, F1024A, F1069A or V1050A/I1056A also did not yield significant or reproducible differences in simulations compared with the wild-type domain (result not shown). These results suggest that the G-box structure is resistant to truncations or most substitutions. However, we were able to perturb the G-box structure using a single I1067G substitution, located on β-strand 13. Glycine residues are known to destabilize β-sheets [32] and in one out of three MD simulations of the I1067G G-box variant we observed large fluctuations in the inter-strand twist angle (Figure 2C; Supplementary Table S4) and increased loss of hydrogen bonds near the substitution, which resulted in the complete breakage of the β-sheet (Figure 2C).

The G-box domain forms fibril

In several crystallographic structures (4LD1, 4LD3, 4LZF, 4BY2 and 4MPZ), G-box domains associate through a head-to-tail interaction along the open ends of the first and last β-strands [5-7]. This interaction is stabilized by the formation of five hydrogen bonds and the burial of ∼420 Å2 of accessible surface area. The resulting assembly is a continuous β-sheet with ∼8 nm periodicity, reminiscent of amyloid fibrils (Figure 3A). MD simulations of this interface show that it persists with no significant perturbations (Supplementary Movie S2) and that it has a lower twist angle as measured between strand 17 of one monomer and strand 1 of the next, compared with the average inter-strand twist of the G-box domain (∼8° compared with ∼18°, Figure 3B; Supplementary Table S5). Further, whereas MD simulations of the isolated G-box domain showed some fluctuations at the terminal strands, these were substantially reduced in the oligomer suggesting that oligomerization stabilizes the G-box conformation (Figure 3B).
Figure 3

The CPAP G-box domain forms elongated fibrils

(A) Schematic representation of the G-box fibril in comparison with the α/β tubulin heterodimer. Three successive G-box domains are coloured green, magenta and green, with the β1 and β17 strands of each monomer and the fibril periodicity indicated. (B) Mean Rh and Rg of G-box oligomers in solution as a function of protein concentration, derived by DLS and SAXS respectively. (C) Analysis of the end-to-end interaction between G-box domains by MD simulations. The inter-strand twist angle is shown for the last G-box β-hairpin of the domain when simulated alone (in red) or as part of a fibril (in blue) and for the interface β-strands in black. (D) Ab-initio SAXS reconstruction of G-box oligomers at increasing protein concentration, with G-box monomeric and fibrillar structures fitted within. (E) Overlay of G-box models extracted from the MD simulations and used to build fibrillar oligomers (monomer to dimer to trimer). The calculated Rg for each oligomeric species is shown.

The CPAP G-box domain forms elongated fibrils

(A) Schematic representation of the G-box fibril in comparison with the α/β tubulin heterodimer. Three successive G-box domains are coloured green, magenta and green, with the β1 and β17 strands of each monomer and the fibril periodicity indicated. (B) Mean Rh and Rg of G-box oligomers in solution as a function of protein concentration, derived by DLS and SAXS respectively. (C) Analysis of the end-to-end interaction between G-box domains by MD simulations. The inter-strand twist angle is shown for the last G-box β-hairpin of the domain when simulated alone (in red) or as part of a fibril (in blue) and for the interface β-strands in black. (D) Ab-initio SAXS reconstruction of G-box oligomers at increasing protein concentration, with G-box monomeric and fibrillar structures fitted within. (E) Overlay of G-box models extracted from the MD simulations and used to build fibrillar oligomers (monomer to dimer to trimer). The calculated Rg for each oligomeric species is shown. To establish whether the G-box domain forms similar fibrils in solution we performed dynamic light scattering (DLS) and SAXS. As shown in Figure 3(C), the DLS and SAXS measurements show that G-box forms oligomers of increasing hydrodynamic radius (Rh) and radius of gyration (Rg) respectively, as a function of protein concentration. The ab initio reconstructions derived from the SAXS data are in good agreement with the monomeric G-box domain for the lowest sample concentration (Figure 3D). As the protein concentration increases, the ab initio models derived from SAXS data have increasingly elongated shape and match the dimensions of G-box oligomers, composed of 2 or 3 domains arranged in elongated fibrils as observed in the crystals (Figure 3D). Based on these data, we conclude that the G-box domain is inherently able to form oligomers in solution, in a manner probably similar to the fibrils observed in the crystal. However, it should be noted that the fit of the crystallographic fibrils to the elongated SAXS reconstructions is not ideal, probably due to sample polydispersity. We anticipate that the G-box domain in solution will form a mixture of oligomers with different lengths and, in addition that the domain flexibility observed in MD simulations (Supplementary Movie S1) will result in larger G-box oligomers having variable geometries. To better visualize such variable geometries we constructed in silico fibrils using 15 structurally diverse G-box models extracted from the MD simulations. As seen in Figure 3(E), fibrils of even just three G-box domains show a vast number of possible conformations.

Conclusion

The G-box structure has shown that proteins comprising isolated, single β-sheets are possible. Although the structure is unique, it obeys the same basic principles of favourable β-sheet formation, such as strand twist and length, as those noted for globular proteins. Conversely, lessons learned from studying the G-box domain may be more generally applicable. Thus, we believe that this domain constitutes an ideal system to study the structural, dynamic and thermodynamic properties of β-sheets, without contributions from additional structural elements. Furthermore, the G-box domain forms end-to-end fibrils that may be useful in studies of amyloidogenesis, yet be easier to characterise than true amyloids as G-box oligomers are soluble and reversible. More broadly, the CPAP G-box domain is just one example of structural ‘solutions’ that have evolved to underpin formation of centrioles, a remarkable protein assembly in eukaryotes. As we delve further into this system, it is intriguing to speculate what other structural ‘surprises’ we may uncover.
  32 in total

1.  Atomic structures of peptide self-assembly mimics.

Authors:  Koki Makabe; Dan McElheny; Valentia Tereshko; Aaron Hilyard; Grzegorz Gawlak; Shude Yan; Akiko Koide; Shohei Koide
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-08       Impact factor: 11.205

2.  Native architecture of the centriole proximal region reveals features underlying its 9-fold radial symmetry.

Authors:  Paul Guichard; Virginie Hachet; Norbert Majubu; Aitana Neves; Davide Demurtas; Natacha Olieric; Isabelle Fluckiger; Akinori Yamada; Kumiko Kihara; Yuichiro Nishida; Shigeharu Moriya; Michel O Steinmetz; Yuichi Hongoh; Pierre Gönczy
Journal:  Curr Biol       Date:  2013-08-08       Impact factor: 10.834

3.  Design of a 20-amino acid, three-stranded beta-sheet protein.

Authors:  T Kortemme; M Ramírez-Alvarado; L Serrano
Journal:  Science       Date:  1998-07-10       Impact factor: 47.728

4.  Tandem repeats in proteins: from sequence to structure.

Authors:  Andrey V Kajava
Journal:  J Struct Biol       Date:  2011-08-24       Impact factor: 2.867

5.  Conformation of twisted beta-pleated sheets in proteins.

Authors:  C Chothia
Journal:  J Mol Biol       Date:  1973-04-05       Impact factor: 5.469

6.  A thermodynamic scale for the beta-sheet forming tendencies of the amino acids.

Authors:  C K Smith; J M Withka; L Regan
Journal:  Biochemistry       Date:  1994-05-10       Impact factor: 3.162

7.  Caenorhabditis elegans centriolar protein SAS-6 forms a spiral that is consistent with imparting a ninefold symmetry.

Authors:  Manuel Hilbert; Michèle C Erat; Virginie Hachet; Paul Guichard; Iris D Blank; Isabelle Flückiger; Leanne Slater; Edward D Lowe; Georgios N Hatzopoulos; Michel O Steinmetz; Pierre Gönczy; Ioannis Vakonakis
Journal:  Proc Natl Acad Sci U S A       Date:  2013-06-24       Impact factor: 11.205

8.  Human microcephaly protein CEP135 binds to hSAS-6 and CPAP, and is required for centriole assembly.

Authors:  Yu-Chih Lin; Ching-Wen Chang; Wen-Bin Hsu; Chieh-Ju C Tang; Yi-Nan Lin; En-Ju Chou; Chien-Ting Wu; Tang K Tang
Journal:  EMBO J       Date:  2013-03-19       Impact factor: 11.598

9.  Structural analysis of the G-box domain of the microcephaly protein CPAP suggests a role in centriole architecture.

Authors:  Georgios N Hatzopoulos; Michèle C Erat; Erin Cutts; Kacper B Rogala; Leanne M Slater; Philip J Stansfeld; Ioannis Vakonakis
Journal:  Structure       Date:  2013-09-26       Impact factor: 5.006

Review 10.  Primary microcephaly: do all roads lead to Rome?

Authors:  Gemma K Thornton; C Geoffrey Woods
Journal:  Trends Genet       Date:  2009-10-21       Impact factor: 11.639

View more
  4 in total

1.  Structures of single-layer β-sheet proteins evolved from β-hairpin repeats.

Authors:  Qingping Xu; Matthew Biancalana; Joanna C Grant; Hsiu-Ju Chiu; Lukasz Jaroszewski; Mark W Knuth; Scott A Lesley; Adam Godzik; Marc-André Elsliger; Ashley M Deacon; Ian A Wilson
Journal:  Protein Sci       Date:  2019-08-02       Impact factor: 6.725

2.  Electron Microscopy Structural Insights into CPAP Oligomeric Behavior: A Plausible Assembly Process of a Supramolecular Scaffold of the Centrosome.

Authors:  Ana L Alvarez-Cabrera; Sandra Delgado; David Gil-Carton; Gulnahar B Mortuza; Guillermo Montoya; Carlos O S Sorzano; Tang K Tang; Jose M Carazo
Journal:  Front Mol Biosci       Date:  2017-03-27

3.  Interaction between the Caenorhabditis elegans centriolar protein SAS-5 and microtubules facilitates organelle assembly.

Authors:  Sarah Bianchi; Kacper B Rogala; Nicola J Dynes; Manuel Hilbert; Sebastian A Leidel; Michel O Steinmetz; Pierre Gönczy; Ioannis Vakonakis
Journal:  Mol Biol Cell       Date:  2018-01-24       Impact factor: 4.138

4.  A molecular mechanism for the procentriole recruitment of Ana2.

Authors:  Nasser M Rusan; Gregory C Rogers; Tiffany A McLamarrah; Sarah K Speed; John M Ryniawec; Daniel W Buster; Carey J Fagerstrom; Brian J Galletta
Journal:  J Cell Biol       Date:  2020-02-03       Impact factor: 10.539

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.