John T Melchior1, Ryan G Walker2, Allison L Cooke1, Jamie Morris1, Mark Castleberry1, Thomas B Thompson2, Martin K Jones3, Hyun D Song3, Kerry-Anne Rye4, Michael N Oda5, Mary G Sorci-Thomas6, Michael J Thomas7, Jay W Heinecke8, Xiaohu Mei9, David Atkinson9, Jere P Segrest3, Sissel Lund-Katz10, Michael C Phillips10, W Sean Davidson1. 1. Department of Pathology and Laboratory Medicine, University of Cincinnati, Cincinnati, Ohio, USA. 2. Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati, Cincinnati, Ohio, USA. 3. Department of Medicine, Division of Cardiovascular Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA. 4. School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia. 5. Children's Hospital Oakland Research Institute, Oakland, California, USA. 6. Department of Medicine, Section on Endocrinology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA. 7. Department of Pharmacology and Toxicology, Medical College of Wisconsin, Milwaukee, Wisconsin, USA. 8. Department of Medicine, University of Washington, Seattle, Washington, USA. 9. Department of Physiology and Biophysics, Boston University School of Medicine, Boston, Massachusetts, USA. 10. Division of Translational Medicine and Human Genetics, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.
Abstract
Apolipoprotein (apo)A-I is an organizing scaffold protein that is critical to high-density lipoprotein (HDL) structure and metabolism, probably mediating many of its cardioprotective properties. However, HDL biogenesis is poorly understood, as lipid-free apoA-I has been notoriously resistant to high-resolution structural study. Published models from low-resolution techniques share certain features but vary considerably in shape and secondary structure. To tackle this central issue in lipoprotein biology, we assembled a team of structural biologists specializing in apolipoproteins and set out to build a consensus model of monomeric lipid-free human apoA-I. Combining novel and published cross-link constraints, small-angle X-ray scattering (SAXS), hydrogen-deuterium exchange (HDX) and crystallography data, we propose a time-averaged model consistent with much of the experimental data published over the last 40 years. The model provides a long-sought platform for understanding and testing details of HDL biogenesis, structure and function.
Apolipoprotein (apo)A-I is an organizing scaffold protein that is critical to high-density lipoprotein (HDL) structure and metabolism, probably mediating many of its cardioprotective properties. However, HDL biogenesis is poorly understood, as lipid-free apoA-I has been notoriously resistant to high-resolution structural study. Published models from low-resolution techniques share certain features but vary considerably in shape and secondary structure. To tackle this central issue in lipoprotein biology, we assembled a team of structural biologists specializing in apolipoproteins and set out to build a consensus model of monomeric lipid-free humanapoA-I. Combining novel and published cross-link constraints, small-angle X-ray scattering (SAXS), hydrogen-deuterium exchange (HDX) and crystallography data, we propose a time-averaged model consistent with much of the experimental data published over the last 40 years. The model provides a long-sought platform for understanding and testing details of HDL biogenesis, structure and function.
Apolipoprotein (apo)A-I is the most abundant protein in high density lipoprotein (HDL). Although HDL contains up to 95 ‘accessory’ proteins (http://homepages.uc.edu/∼davidswm/HDLproteome.html) that modulate particle function, apoA-I is central to mediating HDL's structure, interactions and function. Indeed, it is a cofactor for lecithin:cholesterol acyl transferase, mediating HDL maturation via the esterification of free cholesterol in plasma[1]. Lipid-free (poor) apoA-I is also critical for HDL biogenesis via the cell surface transporter ATP-binding cassette A1 (ABCA1)[2]. Thus, there has been a sustained effort to understand apoA-I structure in both its lipid-free and -bound states. Lipid-free monomeric apoA-I represents the protein in its most nascent form. As such, it is a logical starting point for the investigation into the structural steps of HDL biogenesis.Unfortunately, apoA-I is difficult to study by traditional protein structural analyses like X-ray crystallography or nuclear magnetic resonance (NMR). It has a tendency to self-associate, bind hydrophobic substances, and exceptional structural flexibility. Indeed, multiple laboratories have been trying to crystalize the full-length protein for decades without success. However, two groups published crystal structures of tetrameric[3] and dimeric[4] truncation mutants of lipid-free apoA-I. The former, lacking the N-terminal 44 amino acids, was widely thought to mimic apoA-I configuration on a nascent, discoidal HDL particle. The latter structure lacked the C-terminal 58 amino acids and may be more reflective of lipid-free apoA-I. Independent studies of the same truncation mutant showed that most of the structural features of the crystal structure were borne out in solution[5]. These reports have provided critical insights into structural details of lipid-free apoA-I. However, they must still be extrapolated to the in vivo condition where i) the N- and C-termini are both present and ii) the protein is in aqueous solution.The difficulty of crystallizing full-length apoA-I has led to hundreds of studies using clever applications of lower resolution techniques such as circular dichroism (CD)[6], analytical ultracentrifugation[7], NMR[8], fluorescence spectroscopy[9], molecular dynamics[10], chemical cross-linking[11], hydrogen-deuterium exchange[12], electron paramagnetic resonance spectroscopy (EPR)[13,14], and others[15]. Cross-linking has been particularly useful for 3-D models because it provides distance constraint data under physiological conditions in solution. To date, there have been eight models proposed for full-length, monomeric apoA-I from four laboratories[10,11,16,17]. Examples are shown in Fig. 1. They all convey the generally held view that lipid-free apoA-I is a four-helix bundle in a pseudo-stable state poised to interact with lipid. However, looking more deeply, the models differ widely in shape, secondary structure placement, and the location of the N- and C-termini. Thus, the field still lacks the structural detail to understand how apoA-I binds lipids, interacts with ABCA1 to form mature HDL, exchanges between HDL, and mediates particle interactions with plasma-borne remodeling factors.
Figure 1
Published models of full-length lipid-free human apoA-I
There are 8 published models of lipid-free monomeric apoA-I that have coordinates available. Silva et al.[11] proposed a model based on chemical cross-linking and homology sequence threading techniques. Pollard et al.[16] generated their model using chemical cross-linking and cysteine point mutations. Segrest et al.[10] used chemical cross-linking and molecular dynamics simulations using information from the crystal structures of apoA-I truncation mutants as templates. Note: this paper also proposed 3 other models that differed in terms of length of simulation (one of which is shown here, 15 ns). Zhang et al.[17] produced molecular simulations using information from the crystal structures as initial templates. The models are shown in cartoon form with cylinders representing helical domains in panel (a). The N-terminal amino acids are shown space-filled in red while the C-terminal amino acids are in yellow. The alpha helical content of each model (as assessed by Pymol) is shown on the linear diagram for each model in panel (b) with each colored block representing helical domains and black lines representing random coil or loop regions.
In the absence of a high-resolution structure of lipid-free apoA-I but with the benefit of nearly 5 decades of spectroscopic and cross-linking data, we assembled a group of leading apoA-I structural investigators, the ApoA-I Working Group, to derive a consensus model that consolidates the apoA-I structural literature. In addition, we report new cross-linking data to round out a universal list of 95 distance constraints from 3 laboratories, utilizing multiple cross-linking reagents. We also included new molecular shape data from small-angle x-ray scattering (SAXS) and incorporated secondary structural information from hydrogen-deuterium exchange (H-DX)[12]. Using the crystal structure of dimeric apoA-I[4] as a template, we applied computer modeling and simulation techniques to derive the new model.
Results
Cross-linking
We used the homo-bifunctional cross-linkers CBDPS (Cyanur-biotin-dimercapto-propionyl-sulfo-succinimide) and BS3 (bis-sulfosuccinimidyl suberate), both NHS esters which cross-link lysine and, to a lesser extent, serine residues[18,19]. Wild-type (14NN) and isotopically labeled (15N) apoA-I were mixed 1:1 under denaturing conditions. This allowed i) unambiguous peptide identification and ii) confirmation that cross-links were not due to oligomerization[5]. After cross-linking, monomeric apoA-I was isolated by size exclusion chromatography (SEC). We routinely obtained >95% purity of cross-linked monomeric apoA-I as shown by SDS-PAGE (Supplementary Fig. 1). Supplementary Tables 1 and 2 list the 65 cross-linked peptide pairs identified (see Methods for criteria). All cross-links exhibited a dual peak pattern indicating intramolecular span[5,20]. 45 were identified in both BS3 and CBDPS treated samples with 15 unique to BS3 and 5 unique to CBDPS.
Small-Angle X-ray Scattering (SAXS)
We used SAXS to assess the shape of monomeric apoA-I in solution. ApoA-I's propensity for concentration dependent self-association posed a challenge since reliable SAXS data collection requires concentrations up to 4 mg/ml. We circumvented this by performing SAXS on apoA-I that had been locked into the monomeric state by cross-linking with BS3 or CBDPS (see Methods). Careful evaluation by SEC showed that this prevented further oligomerization. The SAXS parameters are shown in Supplementary Table 3. For both cross-linking reagents, scattering intensity increased proportionally with sample concentration and the Guinier range was linear at low scattering angles (not shown), indicating no concentration dependent effects and good data quality. No significant differences were observed in Rg (Guinier and real space), Dmax, and comparable molecular volumes were found across all concentrations. Additionally, scattering profiles and pairwise distribution plots for both samples were highly related (Fig. 2a and Fig. 2b). There may be small differences in the flexibility of differentially cross-linked apoA-I species. ApoA-I cross-linked with CBDPS shows a dip in q plot (Fig. 2c) and a plateau in the q plot (Fig. 2d) indicative of a folded, rigid structure. The BS3 treated sample exhibited a plateau in the q plot and lack of plateau in the q plot indicating a possible flexible domain. The deviation in dynamics is likely due to differences in cross-linking efficiency as much less CBDPS is required for linking oligomers vs. BS3. The Rg of both samples were similar (25.35 ± 0.15 Å and 25.34 ± 0.17 Å for CBDPS and BS3, respectively) and, taken with the pairwise distribution plots, suggested apoA-I had characteristics of a globular protein. Twenty-three independent envelope reconstructions were performed and averaged using DAMMIF[21] to generate a composite. Both DAMMIF reconstructions had normalized spatial discrepancies (NSD) between 0.5 and 0.7 (Supplementary Table 3) indicating good convergence of the independent reconstructions. The envelopes (Fig. 2e and Fig. 2f) serve as a low-resolution representation of the overall shape of the molecule which can be used as a rough tool for visualizing the fit of a model. ApoA-I cross-linked with CBDPS had a calculated volume of ∼69,400 Å3 compared to ∼79,500 Å3 for that cross-linked with BS3, consistent with the more flexible nature of the BS3 sample. Taken together, the data suggest minimal structural differences between the samples.
Figure 2
Molecular envelopes of monomeric apoA-I using SAXS
Plasma and recombinant apoA-I were cross-linked with CBDPS and BS3 and monomeric species were isolated by gel filtration chromatography and subjected to SAXS. Panel (a): Scattering profiles of monomeric apoA-I cross-linked with CBDPS and BS3. Panel (b): P(r) curves of monomeric apoA-I cross-linked with CBDPS and BS3. Panels (c and d): Flexibility plots of monomeric apoA-I cross-linked with CBDPS and BS3. Panels (e and f): DAMMIF ab initio molecular envelopes of recombinant apoA-I cross-linked with CBDPS and BS3, respectively.
Generation and Evaluation of the Model
The starting construct of the model was derived from the crystal structure of apoA-I1-184 as shown in Supplementary Figure 2. As Mei and Atkinson[4] proposed, an inflection point was first placed in H5, near the center of the long helix shared by two protomers (A and B) in the dimer. However, guided by cross-linking constraints and SAXS, the inflection point was shifted more C-terminal to residue 139 in helix 5. The C-terminal portion of monomer A was then folded back resulting in the juxtaposition of H6(A) with H5(A) (Supplementary Figure 2b). Using Pymol and Modeller v9.14, the missing residues 185-243 were threaded in and positioned guided by the relevant cross-linking constraints[10,11,16] in Supplementary Table 4. This construct was used as a base model for further refinement. Secondary structural data derived exclusively from H-DX[12] was next implemented onto the base model. This required conversion of helical segments observed in the crystal structure to random coil (and vice versa in some cases). Next, the entire model was simultaneously refined for fit to the SAXS scattering curves and the cross-linking data, while holding the secondary structure assignments as constant as possible. Lastly, the model was subjected to energy minimization using YASARA[22] and side chain rotamers were manually refined in COOT[23]. Please see Methods for more detail on each of these steps.The finalized model (Fig. 3) depicts monomeric lipid-free apoA-I as a helical bundle composed of three main helices: H1 (residues 8-35), H5 (81-115), and H6 (148-179). Three shorter helical regions are interspersed between these: H2 (37-45), H3 (54-64), and H4 (68-78). The remainder is random coil with two main stretches spanning residues 116-147 and the entire C-terminus 180-243. The C-terminus forms a globular domain that sits on the bundle at the N-terminal end of H5 and the C-terminal end of H6. The N- and C-termini are located near each other at the base of the bundle and opposite the bulk of the C-terminal globular domain. Proline residues at positions 7 and 66 reside in regions of transition between helix and random coil with residue 66 clearly mediating a turn structure. While most of the other prolines are located in random coil regions, prolines 99 and 165 occur in the middle of helix 5 and 6, respectively, without apparent disruption.
Figure 3
All atom model of full-length, lipid-free monomeric apoA-I
The final time-averaged model is shown where dark blue represents N-terminal helical α-helical domains H1 and H2, light blue represents α-helical domains H3, H4, H5 and H6, pink represents random coil, red represents the C-terminal 59 residues absent in the crystal structure and yellow shows the position of prolines on each model.
Compatibility of the model with experimental data old and new
A contact plot of the final structure is shown in Fig. 4a with experimental cross-links superimposed in blue and violations in red. Overall, 89 of 95 cross-links (both previously published and newly reported here) were consistent with the model. It is worth noting that this is 3-5x the number of constraints used in previous modeling attempts, significantly reducing the conformational possibilities that can fit the data. The six violations were, on average, within ∼2 Å of the allowable span. Fig. 4b shows the model agreement with the H-DX data of Chetty et al[12]. The new model has 235 of 243 residues (97%) assigned in keeping with the H-DX data. The small differences resulted from tradeoffs made for optimal cross-link and SAXS data fitting. These occurred near residue 36 which is predicted to form the minor H2 helix (also observed in the crystal structure[4]). The remaining differences involved 1-2 amino acids bookending α-helical regions. Fig. 4c shows the model superimposed onto both DAMMIF ab initio molecular envelopes generated from recombinant apoA-I cross-linked with CBDPS and BS3. The primary measure of agreement to SAXS was the model fit to the scattering curves. Fig. 4d and 4e shows that the model was in good agreement with both SAXS scattering curves (χ=1.13 and χ=0.81 for CBDPS and BS3, respectively). As a point of reference, our previous work compared a crystal structure of a truncation mutant of apoA-IV with similarly obtained SAXS data with a χ=0.97.
Figure 4
Evaluation of experimental cross-links, secondary structure, and SAXS data with respect to the model
Panel (a): Cross-linking data Supplementary Table 4 containing all published cross-links reported by Melchior, Silva[11], Pollard[16], and Segrest[10]. The X and Y axis of the plots indicate the residue number of apoA-I from 1-243. The diagonal line bisects the figure and the bottom/right shows intramolecular interactions. Dark green represents residues within 28 Å (upper limit for alpha-carbons cross-linked with CBDPS or BS3) and light green represents residues within 23 Å (upper limit for alpha-carbons cross-linked with MDA). Blue dots represent experimental cross-links that fall within the upper-limit distance constraint and red dots represent show those that exceed the allowable distance on the time-averaged model. Panel (b): Comparison of the secondary structure reported in the H-DX study by Chetty et. al.[12] and the time-averaged model. Rectangles represent α-helical segments and the lines represent random coil. Panel (c): Superposition of the model on molecular envelopes derived from monomeric apoA-I cross-linked with CBDPS and BS3. Panel (d): The theoretical scattering curve of the proposed model (black line) superimposed on the experimental scattering profile of apoA-I cross-linked with CBDPS and BS3. The fit to the respective scattering profiles are shown in parenthesis. Panel (e): The residual plot of the model (black) to experimental scattering data from apoA-I cross-linked with CBDPS and BS3.
We also reviewed the literature for additional structural data for lipid-free apoA-I collected under conditions where it was monomeric. Supplementary Table 5 summarizes numerous studies that used far UV circular dichroism (CD) to estimate α-helix and random coil contents. α-helical content ranged from 40% to 68% across 27 studies for an average of 52 ± 6 %, or 126 ± 14 residues. The new model, driven primarily by the H-DX restraints, matched exactly with 126 α-helical residues. Another technique for identifying areas of low structural organization is limited proteolysis. Supplementary Table 6 summarizes two studies that applied this technique to primarily monomeric apoA-I. Proteolytically susceptible sites (highlighted on the new model in Figure 5a) tended to occur in random coil areas near the molecule surface. The solvent accessibility of these sites in the new model, as assessed by VADAR[24], was highly consistent with the proteolysis data with only two of 13 reported cleavage sites (Tyr115, Phe57) showing solvent accessible areas of less than 20 Å2.
Figure 5
Predictions of the new model with respect to proteolytic susceptibility and stability
Panel (a) shows the proteolytic cleavage sites from Supplementary Table 6. Panel (b) shows a space-filled rendering of the model by residue hydrophobicity with hydrophilic residues in blue and hydrophobic residues in red. Panel (c) highlights aromatic clusters (orange) that may participate in molecular stabilization and the proximity of the N- and C-termini in the time-averaged model. Panel (d) shows the position of natural mutations in apoA-IIOWA (Gly 26) and apoA-IMilano (Arg 173) within one aromatic cluster. Panel (e) shows critical stabilizing residues mapped by Gorshkova et. al. for lipid-free apoA-I[33,42].
Supplementary Figure 3 summarizes how the new model compares to the 8 published models of monomeric apoA-I with respect to the experimental data outlined above. In most cases, the new model showed considerably improved compatibility with the data.
Discussion
Lipid-free apoA-I is of direct physiological relevance because it and its related forms (i.e. preβ1-HDL) are essential to de novo HDL biogenesis via ABCA1-mediated efflux of phospholipid and cholesterol, a critical anti-atherogenic pathway in human plasma[25]. This involves direct interaction of apoA-I with both ABCA1 and specialized domains in the plasma membrane [26,27]. The concentration of total apoA-I in human plasma is about 130 mg/dL (∼50 μM). At 5-10% of total apoA-I[28], lipid-free apoA-I in plasma is around 2 μM and should be mostly monomeric. In peripheral lymph (interstitial fluid) it is about 10-fold less concentrated (∼0.2 μM or about 5 μg/ml) corresponding to the Km for lipid efflux via ABCA1[29]. Thus, a detailed understanding of apoA-I in its most “nascent” biophysical state (lipid-free and monomeric) will aid in our understanding of HDL biogenesis and its eventual interaction with critical HDL mediators such as plasma lecithin:cholesterol acyl transferase. This problem has been pursued intensively for decades but a high resolution structural model of apoA-I's native form has remained elusive. The model described here represents a comprehensive multi-technique attempt to address this issue given apoA-I's recalcitrance to traditional high resolution structural techniques. We first describe the features of the model, touch on its implications for understanding HDL biology and then discuss its limitations.Figures 3 and 5 show that the model is consistent with previous proposals of a two-domain structure with a relatively organized N-terminal helical bundle and a mostly random coil C-terminal domain from residues 180-243. With respect to the C-terminal domain, we point out that our methodology does not fully preclude the existence of some helical structure, particularly highly unstable helices that cannot be not detected at the timescale of the H-DX experiments. Indeed, truncation of the C-terminus from residue 243 to residues 231 and 221 reduced CD-detectable helix content by 7 and 14 amino acid residues, respectively[30,31]. On the basis of such observations, Mei and Atkinson[31] suggested that the segment spanning residues 231-241 contains α-helical structure. Such a C-terminal helix is further supported by preliminary molecular dynamics simulations of the new model (Segrest et al, unpublished observation). Overall, the C-terminal domain sits on N-terminal α-helical domains H1 and H2 (dark blue) (Fig. 5) with the termini residing close to each other. This theme was also observed in the solution structure for apoA-IV[20] and may explain structural and energetic studies indicating that the N- and C-terminal domains of apoA-I act cooperatively to modulate its stability[32] and lipidation kinetics[5,33-36]. The space-filling illustration in Fig. 5b shows that the model is well packed, particularly in the N-terminal domain, with hydrophobic faces of the helices oriented toward the bundle core. However, there are clear instances of solvent exposed hydrophobic residues (red), possibly contributing to the protein's low thermodynamic stability[37]. Another intriguing feature is two potential networks of aromatic residues (orange) (Fig 5c). Aromatic residues play important roles in protein stability, protein-protein interactions, and tertiary folding[38]. Indeed, natural mutations such as G26R in apoA-IIOWA[39] and the R173C mutation in apoA-IMILANO[40] both result in substantial decreases in free energy of denaturation of lipid-free apoA-I[41] and both are central to an aromatic cluster that potentially stabilizes helices 1, 5, and 6 (Fig. 5d). Lastly, residues reported by Gorshova and colleagues[33,42] to be critical for apoA-I stability (Fig 5e, space-filled in red) exhibit remarkable co-localization with the aromatic clusters highlighted in Fig. 5c. The model predicts that additional mutations in these areas should affect stability and likely the propensity to oligomerize and interact with lipid.We considered the new model in light of apoA-I's propensity to self-associate. Fig. 6 compares the consensus model to the crystal structure of dimeric apoA-I1-184 and its monomer as postulated by Mei et al [4]. Comparing Fig. 6b and 6c, the full-length model shares numerous features with the postulated monomer. Both models show an N-terminal helix between residues 7 and 36 (blue) situated alongside another helix composed of residues 80-116 (green/yellow). Though these helices are bent more acutely in the full-length model, the inter-helical interactions are similar. Also, the N-termini of both models are in close proximity to a short helix of residues ∼66-76. The full-length model shows a large random coil region between residues 116-147 that acts as a turn allowing H6 (148-178, red) to double back and participate in the helical bundle. Although more ordered in the crystal model, a similarly centered turn (near residue 129) generates an intramolecular helical bundle. The main difference is that, in the full-length model, the position of H6 is shifted relative to the bundle, perhaps by the presence residues 185-243 that sit against the bundle (note how the red helix in Fig. 6c is shifted to the right vs 6b). Given the relative agreement with the monomeric model derived from the crystal structure, it follows that the full length protein may dimerize similarly as the crystal structure dimer. Fig. 6d and 6e show how this might occur. By placing a hinge point near residue 129, the C-terminal portion of full-length apoA-I can be extended to interact with a similarly extended apoA-I partner. Aside from the increased random coil structure in the consensus model dimer, many of the intermolecular alignments seen in the apoA-I1-184 dimer are highly plausible. Thus, we propose that the dimer/monomer conversion scheme proposed by Mei et al. for apoA-I1-184 is compatible with the new consensus model and may describe how the full-length protein dimerizes. Further experimental work will be required to test this hypothesis.
Figure 6
Comparison of the consensus apoA-I model to the apoA-I1-184 crystal structure
Panel (a): The dimeric apoA-I1-184 crystal structure is shown in cartoon rendering. For each protomer, the N-terminus is blue and gradates through cyan, green, yellow, and orange to the C-terminus (residue 184 in this case) in red. Panel (b) A postulated monomeric form of apoA-I1-184 as proposed by Mei et al.[4]. An inflection point was placed near the center of the long helix shared by two protomers (A and B) at about residue 126. Monomer A is rendered with the same color gradient as panel (a) until residue 126, at which point all remaining residues are colored gray. With its N-terminal portion colored gray, the C-terminal portion of monomer B was colored in a continuation of the gradient as though it was a single molecule with a turn introduced as indicated by the arrow. Panel (c): The consensus model of full-length apoA-I colored the same way as panel (a) with residue 182 being red. The residues that are missing from the crystal structure are colored in magenta. Panel (d): The apoA-I1-184 crystal structure rendered with all of molecule B in gray. Panel (e): Postulated model of dimeric full-length apoA-I. Molecule A is colored as for the monomer in panel (c) with residues 183-243 colored magenta. Molecule B is entirely gray.
We also considered the implications of the new model for apoA-Ilipid-binding and HDL biogenesis. There is a general consensus that lipid-binding is initiated when the relatively unstable C-terminal domain comes in contact with a lipid surface[31,43-45], either via a packing defect[46] or an ABCA1 mediated mechanism[26]. It is easy to visualize the periodic movement of the C-terminus away from the helical bundle in the consensus model. If it encounters lipid, the formation and/or stabilization of C-terminal helices provides a favorable ΔG[30] that drives further lipid association. There is also evidence that the N-terminus may play a role in initial lipid contact[35,47]. Freed of the C-terminal domain, the bundle helices can then unpack, reorienting the hydrophobic faces of the central part of apoA-I to penetrate the lipid surface[35]. One of the main benefits of having the consensus structure is that it gives a ‘road map’ to the initial position of the helices and offers a structural basis for how they may unpack during this transition. For example, Saito et al. speculated that H1 and H2, based on their similar low stabilities, may be in close opposition and unfold as a unit[30] early on in the lipid binding process. Our model shows that H2 (in our case, residues 50-80) forms a bent helix that actually wraps around H1, supporting this idea.There is considerable debate as to the exact subsequent steps involved in the creation of a nascent HDL particle. Key questions remain as to the role and timing of apoA-I dimerization (i.e. does it happen prior to, during or after lipidation?) and the exact mechanism and role of ABCA1 (does it play a role in apoA-I dimer formation and how does it physically load lipid onto apoA-I?). A recent cryo-EM structure of ABCA1 offers interesting possibilities[48], but a consensus on how this works has yet to materialize. In Figure 7, we show a simplified scheme consistent with that proposed by Mei et. al.[31] of how the consensus model could be unpacked to generate a ring-like conformation as proposed for the double belt structure of apoA-I in discoidal HDL particles[49]. We caution that this is only one of many possibilities and the role of apoA-I dimerization and ABCA1 was intentionally not addressed. Now that we have good ideas of the structures of both apoA-I and ABCA1, we look forward to exciting new work on this critical mechanism.
Figure 7
Hypothesis on lipid-binding and relationship to apoA-I arrangement on discoidal HDL
Panel (a): The consensus model of full-length apoA-I. Panel (b): Proposed arrangement of two molecules of apoA-I on discoidal HDL; the “double-belt” model. Panel (c): Molecule B from the double-belt model rotated 90° (top view). Panel (d): Cartoon representation of the time-averaged model in its folded state. Panel (e): Dissociation of helix 6 and C-terminus from the lipid-free model and formation of a helical region upon interaction of the CT with lipid. Panel (f): formation of additional alpha-helical segments upon lipid filling and unfurling of the N-terminal bundle. Panel (g): Endpoint as a belt on an HDL particle. The model and cartoon is colored based on alpha-helical repeats identified and reported by Segrest et. al. [49]. Rectangles in panels (d-g) represent alpha-helical segments.
Limitations of the model
Despite its consistency with much of the known structural data, we note that the model is still limited in resolution compared to NMR or X-ray crystallography. The general backbone configuration is likely correct, but more refined molecular interactions such as salt-bridging and hydrogen bonding remain unclear. Additionally, one might question whether it is possible to represent a highly dynamic protein with a single, time-averaged model. ApoA-I has ΔG of helix stabilization ranging from ∼3-5 kcal/mol[12]. At neutral pH and room temperature, H-DX studies have demonstrated complete hydrogen exchange into apoA-I α-helical segments occurring in ∼10 minutes (Supplementary Figure 4); i.e. all apoA-I helical segments have unfolded and reformed at least once in this timeframe. To put this in context, a stable globular protein like cytochrome C has a ΔG of helix stabilization of 10 kcal/mol with complete hydrogen exchange requiring ∼10 weeks. Given this degree of dynamics, it seems likely that, at physiological temperatures, apoA-I probably exists in an ensemble of related structures at any point in time. Indeed, the fact that some cross-links failed to fit the model (6 were close, but not strictly allowed) directly suggests differential conformations. Our model is a time-averaged structure derived from experimental data obtained on a longer time scale than typical secondary structure oscillations. For this reason, we think of it as a base model upon which hypothesized dynamics and conformational alterations can be further modeled and tested. However, because the majority of cross-links did fit the model and the SAXS patterns showed consistent molecular shapes, we argue that i) the consensus model likely reflects a majority of the molecules existing at a given time or ii) any conformers exhibit related shapes and structures.Another issue relates to the notion of solvent accessibility of the cross-linking reagents. While most cross-links fit the model in terms of Euclidian distance (‘as the crow flies’), many are impeded by some obstruction like a side chain from a non-participating residue or an adjacent helical domain. Given that the model represents a time-average of an ensemble of related apoA-I structures, cross-links that appear sterically hindered or solvent inaccessible could occur on an alternate conformation within the boundaries of the experimental system. Additional studies are needed to better define these boundaries and the extent of rearrangement apoA-I can achieve in vivo and in vitro. Also, the constant unfolding of helical domains over the course of the experiment could allow cross-links that might not be expected in a static structure. Nevertheless, previous reports have shown excellent consistency between observed cross-links in solution with crystal structures of apoA-I1-184 (5) and apoA-IV[20] and non-apolipoproteins[50-53] validating the approach. Importantly, instances of steric cross-linker hindrance were also routinely observed in these studies. For a more detailed discussion of potential limitations of the model, see the online supplement.In summary, we propose a new time-averaged model of the structure of apoA-I in solution that is, by and large, consistent with decades worth of experimental data. While still lacking amino acid level resolution, it provides a much-needed starting point upon which to test structural hypotheses with regard to loss-of-function mutations, interaction with cell-surface proteins, and HDL formation.
Online Methods
Protein expression and purification
Recombinant apoA-I was expressed and purified as previously described[54] with minor modifications. Briefly, a pET30 vector (Novagen) containing mature apoA-I (sequence confirmed by the Cincinnati Children's Hospital Sequencing Core on an Applied Biosystems 3730 DNA Analyzer) was transformed into BL-21 E. coli cells. Cells were grown at 37°C in Luria-Bertani (LB) culture media containing kanamycin to select transformants. Protein expression was induced by addition of isopropyl β-d-thiogalactopyranoside (IPTG) at 0.1 mM followed by shaking at 225 RPM for 2 h at 37°C. Cells were pelleted by centrifugation and stored at -20°C. Cells were thawed on ice, resuspended in binding buffer (5 mM imidazole, 500 mM NaCl, 20 mM Tris-HCl, pH 7.9) and lysed at 4°C by probe sonication. The insoluble cell components were pelleted by centrifugation and supernatant was applied to His-bind columns (Novagen), which were extensively washed and eluted with elution buffer (1 M imidazole, 0.5 M NaCl, 20 mM Tris pH 7.9). The His-tag was cleaved by tobacco etch virus (TEV) protease at a ratio of 20:1 for 2 h at room temperature. The sample was reapplied to the His-binding resin to remove uncut protein. Fractions with purified protein were pooled, dialyzed into 10 mM ammonium bicarbonate pH = 8.1 and lyophilized for storage at -80°C until ready for use. Lyophilized protein was solubilized in a Tris buffer containing 3M guanidine HCl at concentrations below 0.1 mg/ml for 2 h at 4°C and then exhaustively dialyzed into appropriate buffer for experimentation. Our recombinant protein contains an N-terminal glycine which does not affect any protein structural or functional index that we have examined[54]. The 15N-labeled version of apoA-I was expressed and purified with minor modifications to the system as previously described[20]. The experimental molecular weight of 15NapoA-I was 28,483.7 Da determined by direct injection mass spectrometry. This translated to 342/348 successfully labeled nitrogen atoms which was >98% labeling efficiency.
Mixing of wild-type and isotopically labeled protein
Recombinant 14N and 15NapoA-I were solubilized separately in 3M guanidine in Tris-HCl (1.5 M NaCl, 100 mM Tris-HCl, 10 mM EDTA, 0.2% Azide), pH = 8.0 at a concentration below 0.1 mg/ml to ensure complete unfolding and dissociation of higher order oligomers. The species were then mixed at a 1:1 molar ratio and dialyzed against PBS (140 mM NaCl, 2.7 mM KCl, 10.1 mM Na2HPO4, 1.8 mM KH2PO4, pH=7.4) to refold the protein. The 14N/15N recombinant apoA-I was further purified using size exclusion chromatography using a Superdex 200 gel filtration column (10/300 GL; GE Healthcare) on a ӒKTA FPLC system (GE Healthcare) in PBS. Fractions containing pure apoA-I were pooled and concentrated to between 1-2 mg/ml for cross-linking. Protein concentration was determined by the Markwell modified Lowry assay[55]. Purity was routinely >99% as determined by SDS-PAGE and mass spectrometry.
Cross-linking and Isolation of Monomeric ApoA-I
ApoA-I was cross-linked with bis-(sulfosuccinimidyl) suberate (BS3) (Thermo Scientific) or Cyanur-biotin-dimercapto-propionyl-sulfo-succinimide (CBDPS), a homo-bifunctional, biotin-tagged cross-linking agent (Creative Molecules Inc.)[56]. Samples were cross-linked at molar ratios of 50:1 (BS3) or 20:1 (CBDPS) which were determined based on optimal formation of oligomeric species. The reactions proceeded for 12 h at 4°C and were quenched by excess Tris-HCl. Monomeric species of apoA-I were isolated by size exclusion chromatography on three Superdex 200 gel filtration columns (10/300 GL; GE Healthcare) in series on a ӒKTA FPLC system (GE Healthcare) in PBS at a flowrate of 0.3 ml/min. 0.25 ml fractions were collected and analyzed by SDS PAGE. Fractions containing purified monomeric apoA-I were pooled, concentrated and quantified. Monomers undergoing MS analysis were dialyzed into 10 mM ammonium bicarbonate pH 8.1. 50 μg aliquots were digested with sequencing grade trypsin (Promega) at 1:20 mass ratio of trypsin to protein for 16 h at 37°C. Peptides were lyophilized to dryness and stored at -20°C until analysis by MS.
Peptide Purification with CBDPS
We utilized the biotin tag on CBDPS to isolate peptides that had been modified by the cross-linker[56]. ApoA-I was cross-linked with CBDPS-H8/D8 at a molar ratio of 20:1 cross-linker to protein. Spin cups were loaded with 150 μg of Ultralink streptavidin resin (Pierce, 53117) and washed 5× with 300 μl PBS by centrifuging at 4000 × g for 1 min at RT. 100 μg of digested peptides per 150 μl of slurry were incubated together on an orbital shaker for 1 h at RT. Unlabeled peptides were removed by centrifugation at 4000 × g for 1 min at RT. The resin was washed once with PBS and then eluted with two aliquots of 300 μl of 50% acetonitrile, 0.4% trifluoroacetic acid. The peptides were then lyophilized to dryness for MS analysis.
Mass Spectrometry and Identification of Cross-linked Peptides
Mass spectrometry analyses were performed as previously described[20]. Briefly, Nano-LC-MS/MS analyses were performed on a TripleTOF® 5600+ (AB Sciex, Toronto, Canada) coupled to an Eksigent (Dublin, CA) NanoLC-Ultra® nanoflow system. Dried samples were reconstituted in formic acid/H2O 0.1/99.9 (v/v), and 5 μl (∼1-2 μg of digest) was loaded onto C18 IntegraFrit™ trap column (New Objective, Inc.) at 2 μl/min in FA/H2O 0.1/99.9 (v/v) for 15 min to desalt and concentrate the samples. For the chromatographic separation, the trap-column was switched to align with the analytical column, Acclaim® PepMap100 (Dionex-Thermo Fisher Scientific). Peptides were eluted at 300 nL/min using a mobile phase gradient from 95% phase A (FA/H2O 0.1/99.9, v/v) to 40% phase B (FA/ACN 0.1/99.9 v/v) for 35 min (1% per min), then from 40% B to 85% B in 5 min with re-equilibration. Effluent was introduced to the mass spectrometer using a NANOSpray® III Source (AB Sciex, Toronto, Canada). The instrument was operated in positive ion mode for 65 min, where each cycle consisted of one TOF-MS scan (0.25 s accumulation time, in a 350 to 1500 m/z window) followed by 30 information dependent acquisition mode MS/MS-scans on the most intense candidate ions selected from initially performed TOF-MS scan during each cycle. Each product ion scan was operated under vendor-specified high-sensitivity mode with an accumulation time of 0.075 s and CE of 43 with an 8 unit scan range. The .wiff files were converted to Mascot generic files using PeakView® v1.2.0.3 software (AB Sciex). Cross-links were identified using SimXL[57] as previously described[5]. A user-defined error of 20 ppm was used for mass detection in all experiments. Two independent experiments were performed for apoA-I cross-linked with CBDPS and three independent experiments were performed for apoA-I cross-linked with BS3. Positive identification of cross-linked peptides required confirmation from the MS/MS fragmentation pattern and visual identification of mass peaks independently by two experienced analysts. Additionally, a unique cross-link had to appear in a minimum of two experiments for inclusion.
Small Angle X-ray Scattering
SAXS data were collected using the SIBYLS beamline (Berkeley, CA)[58]. Cross-linked monomeric apoA-I samples were shipped overnight at 4°C within 48 h of isolation to minimize aggregation artifacts. The cross-linked molecules were sampled at 4 mg/ml, 2 mg/ml, and 1 mg/ml at 10°C with four exposure times; 0.5, 1.0, 2.0, and 5.0 s. Scattering profiles showing evidence of radiation damage or particle repulsion were discarded. ScÅtter (SIBYLS) and ATSAS program suite (EMBL) were used for data analysis. 23 independent ab initio molecular envelopes were generated, averaged, and superimposed using DAMMIF (ATSAS, EMBL-Hamburg)[21]. The averaged molecular envelope graphics were rendered using UCSF Chimera.
Iterative Model Generation and Evaluation
A 3-D composite homology model of monomeric mutant apoA-I1-184 was generated from the dimeric crystal structure[4,59] using Modeller v9.14[60] as previously described[5]. The C-terminal 59 residues were modeled in incremental segments guided by experimental cross-linking data for respective regions using Modeller v9.14. Models were constrained with previously published cross-links[10,11,16] and newly identified cross-links using an upper bound of 23.0 ± 0.001 Å or 28.0 ± 0.001 Å from C-α to C-α for cross-links identified with MDA and BS3, respectively. The models primary and secondary structure were adapted and refined iteratively guided by hydrogen-deuterium exchange (H-DX) data published by Chetty and colleagues[12]. Models were evaluated against experimental data obtained from SAXS using the online server FoxS[61,62]. The final model was subjected to energy minimization using the YASARA server[22] and rotamers were manually refined using COOT[23]. Given the lack of a density map, rotamer position was determined using the most probable position without resulting in clashes. The reliability of the model was determined using MolProbity[63] and solvent accessibility to individual amino acids was determined using VADAR[24].
Data Availability
The .pdb file for the model is available for download from the online supplement or from the Davidson Lab homepage: (http://homepages.uc.edu/∼davidswm/structures.html). All other source data supporting the findings in this study are available upon request. A Life Sciences Reporting Summary for this article is available.
Authors: Stacey E Panagotopulos; Scott R Witting; Erica M Horace; David Y Hui; J Nicholas Maiorano; W Sean Davidson Journal: J Biol Chem Date: 2002-08-13 Impact factor: 5.157
Authors: Ricquita D Pollard; Brian Fulp; Michael P Samuel; Mary G Sorci-Thomas; Michael J Thomas Journal: Biochemistry Date: 2013-12-17 Impact factor: 3.162
Authors: Andrzej Witkowski; Gary K L Chan; Jennifer C Boatz; Nancy J Li; Ayuka P Inoue; Jaclyn C Wong; Patrick C A van der Wel; Giorgio Cavigiolio Journal: FASEB J Date: 2018-01-17 Impact factor: 5.191
Authors: Allison L Cooke; Jamie Morris; John T Melchior; Scott E Street; W Gray Jerome; Rong Huang; Andrew B Herr; Loren E Smith; Jere P Segrest; Alan T Remaley; Amy S Shah; Thomas B Thompson; W Sean Davidson Journal: J Lipid Res Date: 2018-05-17 Impact factor: 5.922
Authors: Maryam Zamanian-Daryoush; Valentin Gogonea; Anthony J DiDonato; Jennifer A Buffa; Ibrahim Choucair; Bruce S Levison; Randall A Hughes; Andrew D Ellington; Ying Huang; Xinmin S Li; Joseph A DiDonato; Stanley L Hazen Journal: J Biol Chem Date: 2020-02-25 Impact factor: 5.157