Literature DB >> 25423138

An experimentally robust model of monomeric apolipoprotein A-I created from a chimera of two X-ray structures and molecular dynamics simulations.

Jere P Segrest¹, Martin K Jones, Baohai Shao, Jay W Heinecke.

Abstract

High-density lipoprotein (HDL) retards atherosclerosis by accepting cholesterol from the artery wall. However, the structure of the proposed acceptor, monomeric apolipoprotein A-I (apoA-I), the major protein of HDL, is poorly understood. Two published models for monomeric apoA-I used cross-linking distance constraints to derive best fit conformations. This approach has limitations. (i) Cross-linked peptides provide no information about secondary structure. (ii) A protein chain can be folded in multiple ways to create a best fit. (iii) Ad hoc folding of a secondary structure is unlikely to produce a stable orientation of hydrophobic and hydrophilic residues. To address these limitations, we used a different approach. We first noted that the dimeric apoA-I crystal structure, (Δ185-243)apoA-I, is topologically identical to a monomer in which helix 5 forms a helical hairpin, a monomer with a hydrophobic cleft running the length of the molecule. We then realized that a second crystal structure, (Δ1-43)apoA-I, contains a C-terminal structure that fits snuggly via aromatic and hydrophobic interactions into the hydrophobic cleft. Consequently, we combined these crystal structures into an initial model that was subjected to molecular dynamics simulations. We tested the initial and simulated models and the two previously published models in three ways: against two published data sets (domains predicted to be helical by H/D exchange and six spin-coupled residues) and against our own experimentally determined cross-linking distance constraints. We note that the best fit simulation model, superior by all tests to previously published models, has dynamic features of a molten globule with interesting implications for the functions of apoA-I.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2014 PMID： 25423138 PMCID： PMC4263436 DOI： 10.1021/bi501111j

Source DB: PubMed Journal: Biochemistry ISSN： 0006-2960 Impact factor: 3.162

Clinical and epidemiological studies show a robust, inverse association of high-density lipoprotein cholesterol (HDL-C) levels and the risk for coronary vascular disease (CVD).[1,2] Moreover, hypercholesterolemic mice with genetically engineered deficiencies in proteins implicated in HDL metabolism have strongly atherosclerotic phenotypes,[3,4] providing compelling evidence that HDL is a key modulator of atherosclerosis in animal models. These observations have triggered intense interest in targeting HDL for therapeutic intervention. The relationship of HDL-C to CVD is complex, however, because lower HDL-C levels are not uniformly associated with an increased cardiovascular risk.[5−7] Moreover, several recent clinical studies have failed to show any benefit from the pharmacological elevation of HDL-C levels in statin-treated humans with established CVD.[8,9] These observations suggest that HDL-C levels provide limited information about HDL’s proposed cardioprotective effects. Therefore, the rational design of therapies and assessments of their benefits demand detailed knowledge of the assembly and molecular structure of the cardioprotective form of HDL. The anti-atherogenic nature of HDL has been most firmly related to reverse cholesterol transport, in which apoA-I, HDL’s major protein, accepts cholesterol from artery wall cells, counteracting lipid loading.[8] Thus, the level of atherosclerosis is markedly increased in hypercholesterolemic mice that lack apoA-I,[11,12] while transgenic expression of apoA-I inhibits atherosclerosis in a wide variety of hypercholesterolemic animal models.[13−16] ApoA-I is secreted into the bloodstream as a lipid-free or lipid-poor monomer (preβ HDL), which is widely believed to be the most common acceptor of sterol transported out of cells.[17] The circulating form of mature human apoA-I contains 243 amino acids encoded by exon 3 (residues 1–43) and exon 4 (residues 44–243). The region encoded by exon 4 contains 10 tandem 11/22 residue repeats, termed helical repeats 1–10 (H1–H10, respectively), which form a series of lipid-binding class A amphipathic helices. ApoA-I accepts sterol from ABCA1, a transmembrane ATPase, to become discoidal HDL. In this configuration, two antiparallel molecules of apoA-I form a belt around the edge of a disk of phospholipid and cholesterol.[18−23] This dimerization of apoA-I is driven in part by the formation of salt bridges among the 10 helical repeats.[20] The flexible nature of apoA-I plays a key role in the lipidation and assembly of HDL by ABCA1,[24] but this property hampers high-resolution structural studies of the full-length protein under physiologically relevant conditions. Fortunately, molecular modeling provides an alternative approach to understanding HDL’s structure. Two all-atom models for lipid-free full-length human apoA-I have been proposed by Silva et al.[25] and Pollard et al.[26] Both models are helix bundle structures constructed as best fit explanations of the cross-linked peptides that are identified when apoA-I is exposed to chemical cross-linkers and then analyzed by high-precision mass spectrometry. In both models, H5, the central helical repeat of apoA-I, forms a helical hairpin. The Silva model is essentially an antiparallel six-helix bundle structure, while the Pollard model is a more loosely packed helix bundle structure containing multiple short helical segments. Two crystal structures have also been determined for lipid-free human apoA-I: the N-terminally truncated structure (Δ1–43) of Borhani et al.[18] and the high-resolution C-terminally truncated (Δ185–243) crystal structure of Mei and Atkinson.[27] The Borhani structure is 93% helical and contains an antiparallel four-helix bundle with an elliptical ring shape, a model more consistent with the structure of apoA-I bound to discoidal HDL particles.[18] The structure of Mei and Atkinson is less helical and forms a more compact helix bundle, which is more consistent with the biophysical and biochemical properties of lipid-free apoA-I.[27] There are, however, substantial similarities between the two structures, e.g., the pattern of salt bridges between helical repeats 4 and 6 and pairwise helical repeats 5.[28] In their two models for monomeric apoA-I, Silva et al.[25] and Pollard et al.[26] determined distance constraints by measuring maximal potential distances between the α-carbons of cross-linked amino acids in two different peptides. While driven by data, this approach has several limitations. (i) Cross-linked peptides provide no information about secondary structure. (ii) A secondary structure can be folded in many ways to create a best fit. (iii) Ad hoc folding of a secondary structure may not produce a stable conformation in which the hydrophobic and hydrophilic residues are oriented correctly. In the studies presented here, we used a different approach. Starting from first principles based on known protein structures, we generated an initial model, which we then subjected to molecular dynamics (MD) simulations. We ranked the resulting models by how well they conformed to two published data sets: domains predicted to be helical by H/D exchange and six spin-coupled residues. We tested the best fit model by chemically cross-linking peptides. The proposed structure, a molten globule, has interesting implications for the function of apoA-I.

Materials and Methods

Initial Models

We created an initial crystal model by taking parts of the recent crystal structure for dimeric C-terminally truncated apoA-I by Mei and Atkinson[27] [Protein Data Bank (PDB) entry 3R2P] and parts of the crystal structure for the tetrameric N-terminally truncated apoA-I of Borhani et al.[18] (PDB entry 1AV1). We began by building a monomer that was as complete as possible from each crystal structure by selecting residues from different chains so that there would be a continuous backbone connection and the monomer was as compact as possible. The parts from dimeric C-terminally truncated apoA-I were residues 3–130 of chain A and residues 131–182 of chain B. This formed a C-terminally truncated monomer of residues 3–182. The parts from tetrameric N-terminally truncated apoA-I were residues 43–67 of chain A, residues 68–130 of chain D, residues 131–192 of chain C, and residues 193–243 of chain B. This formed an N-terminally truncated monomer of residues 43–243. The two terminally truncated monomers are shown in Figure S1 of the Supporting Information. An immediate observation is that the structures are quite similar, especially with respect to the main double helix consisting of residues 68–182. To build the initial crystal model, we formed a complete monomer by joining residues 3–182 of our C-terminally truncated monomer to residues 183–243 from our N-terminally truncated monomer. To insert residues 193–243 into the hydrophobic cleft seen in residues 3–182 (Figure 1d), we repositioned residues 193–243 by bending them slightly and stretching them at several points along the backbone. This aligned them with the aromatic residues of the cleft, as explained in more detail in Results. The hairpin between residues 192 and 193 in the N-terminally truncated monomer was instrumental in joining the long helix of residues 131–182 of the C-terminally truncated monomer around the bottom of the hybrid crystal model to residues 193–243 of the N-terminally truncated monomer via the short helix of residues 183–192. Additionally, residues 1 and 2 were taken from a previous simulation of ours of monomeric apoA-I at 500 K. The two residues were in a random coil structure and were joined to residue 3.

Figure 1

Creation of a lipid-free apoA-I monomer model from a combination of the C-terminally truncated (Δ185–243)apoA-I[27] and N-terminally truncated (Δ1–43)apoA-I[18] crystal structures. (a and b) Depth-cued ribbon images showing the creation of a monomer from the crystal structure of the (Δ185–243)apoA-I dimer of Mei and Atkinson:[27] (a) crystal dimer[27] and (b) monomeric (Δ185–243)apoA-I created by forming a hairpin from H5. Because residues 1 and 2 were missing from the crystal structure, they were added from a previous MD simulation. (c–i) Development of the initial crystal model. Color codes for ribbon models: H5, green; other helical domains, red; nonhelical domains, green; prolines in panels a–c, e–g, and i, space-filling yellow. Color codes for space-filling models: aromatic residues, magenta; nonaromatic hydrophobic residues, gold; prolines, yellow. (c) Reorientation of the ribbon model of the (Δ185–243)apoA-I monomer in panel b. (d) Space-filling model of the structure in panel c showing its deep hydrophobic cleft. (e) Ribbon model of residues 183–243 from the N-terminally truncated crystal structure.[18] (f) Space-filling model of the structure in panel e. (g) Alignment of aromatic and hydrophobic residues between the (Δ185–243)apoA-I monomer model (1–182) and the N-terminally truncated fragment. (h) Initial crystal model (space-filling) for the full-length monomer. (i) Ribbon model of the structure in panel h. Residues 1–182 are colored cyan and residues 183–243 red.

Molecular Dynamics Simulations

All-atom MD was performed using NAMD[29] after solvating and ionizing the initial structure as described previously.[30,31] After minimization and heating to 500 K for 50 ps, the MD simulation consisted of a 30 ns equilibration at 500 K. Postprocessing of trajectories was analyzed by writing VMD-Tcl scripts.

ApoA-I Model Figures

All molecular model figures were created with the PyMOL Molecular Graphics System (version 1.3, Schrödinger).

Root-Mean-Square Deviation and Root-Mean-Square Fluctuation

The root-mean-square deviation (rmsd) of protein α-carbon atoms was calculated for the apoA-I monomer over the entire 30 ns duration of the simulation by using the RMSD Trajectory Tool extension in VMD. The root-mean-square fluctuation (rmsf) of protein α-carbon atoms was calculated for the apoA-I monomer over the entire 30 ns duration of the simulation by using the “measure rmsf” command in VMD.

Measurement of α Helicity

The fraction of all protein residues that were α-helical was measured every time increment over the course of the 30 ns trajectory, using VMD’s implementation of the protein secondary structure determination program, STRIDE.[32]

Average 10–20 and 15 ns Models

This average structure was created by averaging the protein atom coordinates over all frames of the simulation from 10 to 20 ns by using the “measure avpos” command in VMD after aligning over all atoms with respect to the 10 ns frame. Because averaging creates possible close contacts between atoms and deforms bond lengths, torsions, and dihedrals, the structure was then subjected to 10000 steps of steepest-descent energy minimization. The result is called the 10–20 ns model. Because the protein structure at 15 ns is kinetically expanded by the 500 K temperature of the simulation, we performed a 0.1 ns linear MD simulation cooling of the 15 ns structure to 310 K. The result is called the 15 ns model.

Sum of Spin-Coupled Residue Distances of >20 Å and Number of Spin-Coupled Distances of >20 Å

To determine the sum of the spin-coupled residue distances of >20 Å and the number of spin-coupled distances of >20 Å, we measured distances between the protein α-carbons for the spin-coupled residues by writing VMD-Tcl scripts for the individual models as well as over the 30 ns trajectory. See Results for more details.

Relative Residue to Residue Proximities Shown as Protein Contact Maps

To obtain protein contact maps for the 15 ns model and over the 30 ns trajectory, we counted the number of times that α-carbon atoms were within 23 Å of each other for every time increment of that simulation. The results were then combined and are shown as a contour plot of one color (for the one frame) or four colors (for the trajectory) indicating the quartile of counts (except for a count of 0, which is white) relative to the maximal number over all pairs of residues, respectively, using the graphics software GNUPLOT (www.gnuplot.info).

Sum of the Violation Distance Algorithm

The violation distance algorithm attempts to measure, given a set of distance constraints such as experimental cross-links, the amount that a given structure violates those constraints. For each experimentally determined cross-link, x,y, where x and y are the residue numbers of the lysine pairs (and the N-terminus) of the ith cross-link, a violation will occur if the minimal distance between the Cα atoms of x and y, d(x,y), is greater than the 23 Å length of the cross-linking reagent MDA[33] (a distance of <28 Å for the reagent used by Silva et al.[25] and Pollard et al.[26]). The sum of violations, ∑[d(x,y) – r], where r is the length of the cross-linker, plotted against the number of individual violations, is the extent to which any given model is consistent with the cross-linking data. The best model will be closest to the x,y origin. Because we can also examine MD trajectories instead of given structures, any single Cα–Cα distance ≤ r for a given x,y in any MD trajectory frame indicates no violation distance. Finally, for each cross-link x,y pair, we count only the minimal violation distance to calculate the sum of violation distances.

Isolating HDL and ApoA-I

Plasma was prepared from EDTA-anticoagulated blood collected from healthy adult subjects who had fasted overnight. HDL (density of 1.125–1.210 g/mL) was isolated from plasma by sequential ultracentrifugation and depleted of apolipoproteins E and B100 by heparin-agarose chromatography.[34] ApoA-I was purified from HDL by ion exchange chromatography.[34] The protein concentration was determined with the Lowry assay (Bio-Rad, Hercules, CA), with albumin as the standard. The Human Studies Committees at the University of Washington School of Medicine approved all protocols involving human material.

Cross-Linking Reactions and Liquid Chromatography–Electrospray Ionization Mass Spectrometry (LC–ESI-MS)

Previous studies demonstrate that malondialdehyde (MDA), a major product of lipid peroxidation that is also a reactive carbonyl peptide cross-linker, markedly impairs the ability of lipid-free apoA-I to promote the efflux of cholesterol from cells by the ABCA1 pathway.[35] To further test the fit of the models to experimental data, we used MS to detect apoA-I peptides that had been cross-linked with MDA, a major product of lipid peroxidation.[35,2] Reactions of freshly prepared MDA with lipid-free apoA-I (5 μM, 0.14 mg/mL) were conducted at 37 °C for 24 h in 50 mM sodium phosphate buffer (pH 7.4) containing 100 μM diethyleneaminepentaacetic acid to chelate redox-active metal ions.[36] Reactions were initiated by adding MDA and terminated by adding a 20-fold molar excess (relative to MDA) of aminoguanidine.[35,37] The apoA-I monomer, dimer, and multimer were separated by high-resolution size exclusion chromatography using fast protein liquid chromatography (two Superdex 200 10/300 GL columns, GE Healthcare). The monomer fractions were collected, combined, and concentrated using Amicon Ultra-4 filters (Millipore) before being digested. Concentrated fractions were digested with sequencing grade modified trypsin (Promega) or endoproteinase Glu-C (from Staphylococcus aureus V8, Roche Applied Science) in 50 mM NH4HCO3 (pH 7.8).[35,38] Proteolytic digests were desalted by solid phase extraction (OASIS HLB cartridge, Waters Corp., Milford, MA) and then analyzed by LC–ESI-MS/MS in positive ion mode on a hybrid linear ion trap-Orbitrap instrument (Thermo Fisher, San Jose, CA) equipped with a nanoflow UPLC system (NanoAcquity, Waters Corp.). MS survey scans were performed from m/z 200 to 2000 at a resolution of 60000 and ion populations of 5 × 105. Data-dependent analyses were performed using MS survey scans followed by data-dependent selection of the five most abundant precursor ions for MS/MS. Every sample was analyzed twice. Singly charged precursors were rejected from the data-dependent scan in the first run, and singly, doubly, and triply charged precursors were rejected from data-dependent selection in the second run to obtain optimal MS/MS scans for cross-linked peptides that usually present as ≥4+ ions. Potentially cross-linked peptides were identified using a spreadsheet containing the masses of each possible peptide pair (or a single peptide for intrapeptide cross-links) plus the mass of the MDA cross-linker (K+36+K).[35,2] All peptide identifications were confirmed by inspection of the MS/MS spectra, using the following criteria: (i) mass being within 10 ppm of the calculated mass, (ii) corresponding MS/MS spectrum exhibiting at least 50% of the b and y series of theoretical fragment ions expected, and (iii) all major fragmentation ions being consistent with the peptide identification.

Results

The Crystal Structures of C-Terminally Truncated ApoA-I and N-Terminally Truncated ApoA-I Potentially Interact via a Hydrophobic Cleft

The similarity of the two models created by the selection of individual fragments of different chains from the structures of Borhani et al.[18] and Mei and Atkinson[27] shown in Figure S1a,b of the Supporting Information led us to consider the possibility that a combination of these two general conformations might provide a model for the structure of lipid-free apoA-I in solution. Figure 1a shows a ribbon model of the (Δ185–243)apoA-I crystal structure determined by Mei and Atkinson.[27] The H5 region of the structure has been proposed to function as a hinge that allows the extended form of apoA-I to fold back on itself to interconvert the protein between a monomer and dimer in solution.[27] This structure is topologically identical to a monomer in which helix 5 forms a helical hairpin, converting the intermolecular H4/H6 pairings of the dimer into monomeric intramolecular H4/H6 pairings (Figure 1b). We observed that the space-filling model of the monomer created this way has a remarkable hydrophobic cleft running the length of the molecule. The cleft contains only aliphatic (gold) and aromatic (magenta) residues (Figure 1c,d). Inspection of the C-terminal portion of the (Δ1–43)apoA-I crystal structure determined by Borhani et al.[18] suggested that residues 183–243 of the C-terminus (Figure 1e,f) [residues 183 and 184 are not resolved in the (Δ185–243)apoA-I crystal structure] would fit snuggly into the hydrophobic cleft (Figure 1d). Only small bending and stretching of residues 183–243 aligned four of the cleft’s five aromatic residues (Figure 1d) with the four aromatic residues in the fragment of Borhani et al. (Figure 1g). We therefore docked this peptide (Figure 1e,f) with the monomeric (Δ185–243)apoA-I structure in Figure 1b to create an all-atom model for full-length monomeric lipid-free apoA-I (Figure 1h,i) that we term the initial crystal model. Importantly, we generated this model of full-length apoA-I by combining the two crystal structures of opposite ends of the protein into a hybrid structure.

MD Simulations with a Temperature Jump Decrease the Helicity of the Initial Crystal Model to Physiologically Plausible Levels

The experimentally determined helicity for monomeric lipid-free human apoA-I is 55 ± 5%.[25,39,40] Our hybrid structure had significantly more helicity (74%). We therefore used MD simulations of the initial crystal model to explore additional conformational space. As MD simulations at 310 K produced only minor structural changes, we subjected the initial crystal model to MD simulations at 500 K for 30 ns. The rmsd of this simulation is shown in Figure S2 of the Supporting Information; the rmsd of the full model flattens out between 15 and 28 ns but increases between 28 and 30 ns (Figure S2a of the Supporting Information), suggesting at least a portion of the model is not equilibrating. Restriction of the rmsd to the model’s core helical domains 1, 4 and 5 (see Figure 6a for the definition) shows that the core is equilibrated from 15 ns to the 30 ns end (Figure S2b of the Supporting Information). The rmsd of the noncore domains, however, increased between 28 and 30 ns (Figure S2c of the Supporting Information), and thus, these domains account for the corresponding increase in the rmsd of the full model.

Figure 6

Ribbon comparison of the four alternatively open and closed structures: 15 ns (Closed1) → 20 ns (Open1) → 25 ns (Closed2) → 28 ns (Open2). Color code: H10, red; H5, green; H4, cyan; H6, magenta; remainder of structure, gray; N-termini, yellow arrowheads; C-termini, red arrowheads. In both transitions from open to closed structures, H5 and H10 move away from each other and from the main body of the protein exposing the H4–H5–H6 hairpin so that a second apoA-I rotated along its long axis by 180° can dimerize with the first through intermolecular interactions of the H4–H5–H6 region.

Changes in the helicity of the initial crystal model with simulation time decreased for the first 10 ns and then reached a relatively steady state between 10 and 20 ns (white dotted line in Figure 2a), indicating that the ensemble of structures reached some sort of conformational stability. The simulation between 10 and 20 ns exhibited ∼58% helicity, in excellent agreement with the value of 55 ± 5% reported in the literature for lipid-free monomeric apoA-I.[25,39,40]

Figure 2

Plots of changes in properties of the initial crystal model during a 30 ns MD simulation at 500 K. The double-headed arrows mark the 10–20 ns intervals. The filled arrowheads mark the simulations at 15 ns. (a) Change in fraction helicity with time. The white dotted line is a linear trendline for changes in helicity between 10 and 20 ns. (b) Sum of the distances between Cα atoms of the 15-residue pair combinations from an average center for the six spin-coupled residues predicted by Lagerstedt et al.[50] to define a central plane. (c). Total number of distances of >20 Å from the 15 combinations of distances between the six spin-coupled residues. In all panels, the empty diamonds represent the initial crystal model.

Testing Candidate MD Simulation Models against Published Experimental Data

Using site-directed spin-label electron paramagnetic resonance spectroscopy (EPR), Lagerstedt et al.[41] identified six spin-coupled residues (G26, L44, L64, S167, G217, and K226) in lipid-free human apoA-I. In the initial crystal model, all but G26 are exposed to solvent and four (L44, L64, G217, and K226) are in loops on the surface of our 15 ns model (Figure 3c); the two that are not, G26 and S167, are small and in physical contact with each other.

Figure 3

Goodness of fit of models to the helical regions predicted by Chetty et al.[40] and the spin-coupled residues predicted by Lagerstedt et al.[50] proposed to define a central plane. (a) Bar graph showing the goodness of fit of helicity of five models (initial crystal, 10–20 ns, 15 ns, Silva, and Pollard) to the position of the five helical regions (designated 1–5) predicted by Chetty et al.[40] using H/D exchange. (b and c) Relaxed-eyed stereo ribbon images illustrating distances and positions of the six spin-coupled residues (Cα, magenta spheres) predicted by Lagerstedt et al.[50] to define a central plane: (b) initial crystal model and (c) 15 ns model. Distances between residues are denoted by dotted lines. Color code: H5 (residues 121–143), green; prolines, yellow spheres; H10 (residues 221–243), red; remainder of the helical residues, cyan; remainder of random coil residues, orange. To be spin-coupled, residue pairs must be within ∼20 Å of each other. The fact that six residues distributed along the full length of the apoA-I sequence, mostly on the surface, are spin-coupled suggests that the six form a spatially compact cluster in the solution structure of monomeric apoA-I.[41] We therefore analyzed distances between the six spin-coupled residues over the 30 ns of MD simulation. As seen in Figure 2b, the sum of distances for the 15 combinations among the six residues (a measure of their spatial compactness) reaches its lowest value in the region of seeming helical equilibration (10–20 ns, double-headed arrow). In Figure 2c, the number of distances of ≥20 Å reaches a minimum of 2–3 near the 15 ns structure in the middle of the region of helical equilibration defined in Figure 2a. On the basis of the best fit of our MD simulation of the initial crystal model with the spin-labeled data, we selected three models for additional analysis: (i) the initial crystal model, (ii) the average atom coordinates of the MD structural ensemble between 10 and 20 ns, termed the 10–20 ns model, and (iii) the 15 ns structure in the middle of the 10–20 ns region, termed the 15 ns model, based on the 15 ns frame of the full 30 ns ensemble. We selected the latter model because it was in the middle of both the zone of helix stability and the zone of spin-coupled spatial compactness. The rmsd between the 15 ns and 10–20 ns models is 5.4 Å, suggesting that the structures in the 10–20 ns range are not greatly different. In the 15 ns model, cooled from 500 to 310 K, the radius of gyration of the apoA-I monomer was smaller than in the original structure, decreasing from 20.7 to 20.2 Å. This change reduced the monomer’s diameter of gyration by 1.08 Å. We term the 310 K cooled-down structure the 15 ns model. The average 10–20 ns structure, because there is no solvent present, could not be subjected to a similar cool-down procedure. Chetty et al.[40] used hydrogen/deuterium (H/D) exchange coupled with proteolysis and mass spectrometry to define the positions of helical domains in human full-length monomeric lipid-free apoA-I. The method quantifies the H/D exchange rate of measurable amide hydrogens at known positions throughout the native structure with the exchange rate expected for an unstructured chain. The authors reported three extended and two short α-helices (residues 7–44, 54–65, 70–78, 81–115, and 147–178, designated 1–5, respectively, in Figure 3a). The overall helical content of 52% predicted by the H/D exchange study agrees well with the values reported in the literature.[25,39,40] We graphically compared the helical regions from the H/D exchange results of Chetty et al.[40] with those predicted by our models (the initial crystal model, the 10–20 ns model, and the 15 ns model). We also compared the two all-atom models for human monomeric apoA-I reported by Silva et al.[25] and Pollard et al.[26] Inspection of Figure 3a suggests that our three models fit the five domains predicted to be helical by H/D exchange significantly better than the Silva or Pollard model. Of our models, the 15 ns model appeared to provide the best overall fit. We used the secondary structure determination program STRIDE to quantify the goodness of fit between the experimental H/D exchange data[40] and the different models. We determined the secondary structure “match” for each residue: only residues in the model that match residues in the H/D data (i.e., helix to helix or nonhelix to nonhelix) are counted. The results are reported as (i) the total number of matching residues and (ii) the fraction of matched residues. The 15 ns model matched the H/D data better than the initial crystal model or the 10–20 ns model (Table 1). Further, each of our models matched the H/D exchange data significantly better than either the Silva model or the Pollard model (Table 1). The total helicities of the 10–20 ns model and the 15 ns model were 56 and 59%, respectively, in good agreement with the value of 55 ± 5% reported in the literature.[25,39,40]

Table 1

Comparison of Helical Properties of Models to H/D Data

model	helical residues	fraction helicitya	H/D match	fraction match
H/D (Chetty et al.[40])	126	0.52	243	1.00
initial crystal model	180	0.74	171	0.70
10–20 ns model	137	0.56	168	0.69
15 ns model	144	0.59	175	0.72
Silva model[25]	127	0.52	146	0.60
Pollard model[26]	131	0.54	136	0.56

Monomeric apoA-I is 55 ± 5% helical.

Monomeric apoA-I is 55 ± 5% helical. We next compared the fit of our models and the Silva[25] and Pollard[26] models with the EPR spin-coupled data,[41] using the following metrics: (i) the sum of distances (the total pairwise distances among Cα atoms of all six residues) and (ii) the number of distances between residues of >20 Å (the distance monitored by spin coupling). Each of our models fits the spin-coupled data better than either the Silva model or the Pollard model (Table 2); again, the 15 ns model provides the best fit. Panels b and c of Figure 3 compare the locations of the six spin-coupled residues in stereo images of two of our models, the initial crystal model and the 15 ns model. Each pair distance is denoted by a dotted line and distance in angstroms. In the initial crystal model, the six residues are scattered over the length of the monomer (Figure 3b). In the 15 ns model, the six residues form a spatially compact cluster. It is noteworthy that five of the six residues form a plane that is approximately equidistant from the long ends of the model (Figure 3c).[41]

Table 2

Fits of Models to Spin-Coupled Data

model	sum of distances (Å)	no. of distances of >20 Å
initial crystal model	88.2	8
10–20 ns model	70.2	4
15 ns model	67.9	3
Silva[25] model	129.4	12
Pollard[26] model	104.8	10

To further test the fit of the models to experimental data, we used mass spectrometry to detect apoA-I peptides that we had cross-linked with MDA. We exposed full-length lipid-free human apoA-I (0.14 mg/mL, predominantly monomeric) to MDA (100 μM) in 50 mM sodium phosphate buffer (pH 7.4) containing 100 μM DTPA. Reactions were initiated by adding MDA and terminated by adding a 20-fold molar excess (relative to MDA) of aminoguanidine. Following high-resolution size exclusion chromatography, monomeric apoA-I was digested with trypsin. Then the peptide digest was analyzed by MS/MS on a high-mass resolution Orbitrap instrument. This approach identified 41 cross-linked pairs of peptides with a high degree of confidence (based on the mass accuracy of the peptide–peptide adduct and the tandem mass spectrum of the peptide). The cross-linked peptides we detected and their tandem mass spectra are summarized in Tables 3 and 4 and Figure S3 of the Supporting Information.

Table 3

Intrapeptide (adjacent) Cross-Links (K+36+K) Identified in the FPLC Monomer Fraction of ApoA-I Exposed to MDAa

X-linked Lys residues	peptide involved	theoretical mass (Da)	experimental mass (Da)	error (ppm)	protease	product yield (%)	relative intensity
N_Tb–K12	1–13	1604.73	1604.74	4.67	Glu-C	49.68	196.78
K12–K23	11–27	1914.03	1914.04	4.51	trypsin	49.92	197.74
K40–K45c	28–59	3626.80	3626.82	4.76	trypsin	27.52	108.99
K94–K96c	93–111	2415.23	2415.24	4.26	Glu-C	7.42	29.39
K106–K107c^,d	97–116	2681.27	2681.28	4.31	trypsin	44.42	175.93
K133–K140d	132–149	2201.14	2201.15	4.71	trypsin	47.53	188.23
K195–K206d	189–215	3057.60	3057.62	5.30	trypsin	25.10	99.43
K206–K208c^,e	206–212	792.46	792.46	0.14	Glu-C	56.77	224.86
K238–K239c^,e	236–243	1030.55	1031.55	0.10	Glu-C	25.93	102.68

N-Terminal amino group.

Cross-linked peptides can be considered short-range with respect to the protein sequences.

Also identified by Glu-C digestion.

Also identified by trypsin digestion.

Table 4

Interpeptide (distant) Cross-Links (K+36+K) Identified in the FPLC Monomer Fraction of ApoA-I Exposed to MDAa

X-linked Lys residues	peptide involved	theoretical mass (Da)	experimental mass (Da)	error (ppm)	protease	product yield (%)	relative intensity
N_t×K195	1–10	3289.58	3289.59	3.71	trypsin	3.85	15.25
	189–206
K12–K226	1–13	2960.44	2960.45	3.86	Glu-C	1.29	5.12
	224–235
K23–K59	13–27	3567.83	3567.85	5.13	trypsin	26.07	103.25
	46–61
K23×K239	13–27	2289.22	2289.22	0.60	trypsin	2.54	10.07
	239–243
K40–K94	28–45	2963.54	2963.55	3.86	trypsin	1.51	5.98
	89–96
K40×K133	28–45	3184.67	3184.68	4.90	trypsin	4.37	17.29
	132–140
K40×K239	28–45	2635.38	2635.38	0.85	trypsin	5.54	21.94
	239–243
K59–K195	46–61	3945.01	3945.02	3.43	trypsin	1.99	7.88
	192–198
K59–K239	46–61	2520.32	2520.32	2.26	trypsin	3.19	12.63
	239–243
K77–K195c	71–78	1934.90	1934.90	1.12	Glu-C	3.98	15.76
	192–198
K77×K208	71–78	3665.85	3665.87	3.36	trypsin	0.69	2.71
	207–215
K88×K195	84–94	3398.69	3398.70	4.94	trypsin	0.72	2.85
	192–198
K94×K226	89–96	3564.92	3564.93	4.69	trypsin	1.27	5.03
	216–238
K94×K239	89–96	1569.86	1569.86	3.50	trypsin	5.87	23.25
	239–243
K96–K195	95–106	3514.79	3514.80	4.38	trypsin	5.88	23.29
	189–206
K96×K208	95–106	2499.33	2499.34	3.65	trypsin	3.52	13.96
	207–215
K96×K239	95–106	2090.10	2090.10	1.01	trypsin	5.64	22.33
	239–243
K107×K239	107–116	2050.01	2050.02	0.91	trypsin	1.88	7.45
	239–243
K118–K133c	114–125	2475.36	2475.36	1.80	Glu-C	12.63	50.01
	129–136
K118–K140b	117–123	2813.50	2813.50	0.60	trypsin	44.59	176.59
	134–149
K118×K195c	114–125	2356.25	2356.25	2.37	Glu-C	2.03	8.03
	192–198
K118×K208	117–123	1917.10	1917.10	1.65	trypsin	0.92	3.64
	207–215
K118–K226	114–125	2893.55	2893.56	1.06	Glu-C	8.94	35.42
	224–235
K118×K239	114–125	1507.87	1507.87	0.05	trypsin	4.56	18.06
	239–243
K133×K239	132–140	1790.98	1790.98	0.50	trypsin	2.60	10.31
	239–243
K140–K239	134–149	2547.33	2547.34	1.68	trypsin	3.51	13.92
	239–243
K182–K195	180–183	1314.68	1314.68	0.15	Glu-C	9.48	37.55
	192–198
K182×K226	180–183	1851.97	1851.98	1.69	Glu-C	13.61	53.91
	224–235
K195–K226	192–198	2211.10	2211.11	1.36	Glu-C	6.03	23.89
	224–235
K195–K239b	189–206	2666.39	2666.39	1.99	trypsin	4.45	17.63
	239–243
K206–K226	206–212	2148.17	2148.17	2.40	Glu-C	5.97	23.64
	224–235
K226–K239	224–235	2387.26	2387.26	1.76	Glu-C	3.31	13.11
	236–243

ApoA-I (5 μM) was incubated with 100 μM MDA, and the cross-linked peptides were identified as described in footnote a of Table 3.

Also identified by Glu-C digestion.

Also identified by trypsin digestion.

ApoA-I (5 μM) was incubated with 100 μM MDA at 37 °C in phosphate buffer (pH 7.4) for 24 h. Reactions were initiated by adding MDA and terminated by adding a 20-fold molar excess (relative to MDA) of aminoguanidine. The reaction mixture was then concentrated, and the monomer was isolated by FPLC. The combined monomer fractions were further concentrated and digested with trypsin or Glu-C. Proteolytic digests were desalted by solid phase extraction prior to MS analysis and analyzed on a hybrid linear ion trap Orbitrap instrument. N-Terminal amino group. Cross-linked peptides can be considered short-range with respect to the protein sequences. Also identified by Glu-C digestion. Also identified by trypsin digestion. ApoA-I (5 μM) was incubated with 100 μM MDA, and the cross-linked peptides were identified as described in footnote a of Table 3. Also identified by Glu-C digestion. Also identified by trypsin digestion. We applied two methods to establish the goodness of fit of the models with the cross-linking data. First, we used the sum of violation distances algorithm recently described by Kalisman et al.[42] to analyze the goodness of fit of different models to cross-link violation distances. We assumed a maximal Cα distance of linkable lysine residues of 23 Å, defined as 20 Å + 3.0 Å, the estimated coordinate error for two mobile surface residues.[33] For each of the six models, the number of violations and the sum of minimal violations were plotted on the x- and y-axes, respectively (Figure 4a). In this plot, the initial crystal model is farthest from the origin, while the Pollard, Silva, and 10–20 ns models, in rank order, fall progressively closer to the origin. The best fit for any model (i.e., the one falling closest to the origin) was that of the 15 ns model. For comparison, a full trajectory analysis of the 0–30 ns MD simulation at 500 K, termed the full trajectory model, is also shown in Figure 4a. It should be noted that, in the latter analysis, every structure in the simulation was considered and the minimum of each individual violation distance was found among possibly different structures.

Figure 4

Graphic analyses of chemical cross-link data for different models. (a) Graphic analysis of the goodness of fit of the five models plus the entire 30 ns trajectory to cross-link violation distances, number vs sum, assuming a cross-linking distance cutoff of 23 Å. (b) Overlay of 41 chemical cross-linking data points onto a static contact map plot of Cα distances of ≤23 Å for the 15 ns model. Small purple circles represent the positions of all potential cross-links, red circles the most probable one-third of cross-links, yellow circles the median probable one-third of cross-links, and blue circles the least probable one-third of cross-links. Red arrowheads denote four loops and turns numbered from the N-terminus and blue arrowheads the N- and C-termini. Two cross-links (182–195 and 195–206) at turn/loop 4 are denoted with red bull’s-eyes and four cross-links of H5 to H10 (118–226, 118–239, 133–239, and 140–239) with magenta bull’s-eyes, and the cross-link supporting an H5 hairpin (118–140) is denoted with a blue bull’s-eye. For the second method for analyzing the goodness of fit, we plotted each chemical cross-link as a colored dot onto the background of a two-dimensional contact map of all 23 Å residue-to-residue contacts, focusing on the static 15 ns model (Figure 4b). The apparent abundance of each cross-linked peptide (monitored as ion current) is indicated by the color of the corresponding dot. For comparison, we also plotted the cross-linking data on the contact map calculated over the full trajectory (Figure S4 of the Supporting Information). Overall, the abundant peptides exhibited an excellent fit to the 15 ns model (red dots in Figure 4b). The majority of those cross-linked peptides were in the most mobile regions of the models: at the N- and C-termini (blue arrowheads) and near hairpins and loops represented by the junction of contact zones with (and at right angles to) the diagonal line of equality (red arrowheads). Figure 5a is a stereo ribbon image of the 15 ns model. The five domains predicted to be helical by H/D exchange and the four domains that are helical in the 15 ns model but not helical as predicted by H/D exchange[40] are colored red and green, respectively. Figure 5b illustrates the general conformation of the 15 ns model in the form of a stereo cartoon in which the 10 helical segments are shown as cylinders. The following are key features of the model. (i) The C-terminal H10 (red) is nestled under the folded H5 helical hairpin (green) covering the antiparallel H4–H6 array (residues 100–120 and 144–164). (ii) The C-terminal and N-terminal domains, residues 226–239 (red) and 7–31 (yellow), respectively, are in contact, forming an antiparallel helical array. (iii) H5 and the N- and C-terminal segments are in the proximity of each other on the side of the model facing the viewer, a feature also found in the Silva[25] model.

Figure 5

Relaxed-eyed stereo views of the 15 ns model. (a) Mapping of the location of the five helical domains predicted by Chetty et al. using H/D exchange[40] onto a relaxed-eyed stereo ribbon representation of the 15 ns model. Helical domains [marked as 1–5 (see Figure 3a)] of the 15 ns model predicted to be helical are colored red and helical domains of the 15 ns model predicted not to be helical are green. The remainder of the nonhelical structure is colored gray. Prolines are shown as yellow spheres. The N-terminus is marked with a yellow N and the C-terminus with a red C. (b) Cartoon representation of the 15 ns model in same orientation as panel a. Helical domains are represented as cylinders and nonhelical domains as random coils. Color code: H5, green; H10, red; N-terminal helical domain, yellow; other helical domains, magenta; K40 and Y192, cyan and magenta spheres, respectively. The N- and C-termini are indicated as in panel a.

Discussion

By combining published crystal structures of the C-terminal and N-terminal portions of apoA-I, we derived an initial all-atom crystal model for the full-length monomeric protein. Using MD simulations, we modified that initial crystal model into a 10–20 ns model and a 15 ns model. Ribbon comparison of the four alternatively open and closed structures: 15 ns (Closed1) → 20 ns (Open1) → 25 ns (Closed2) → 28 ns (Open2). Color code: H10, red; H5, green; H4, cyan; H6, magenta; remainder of structure, gray; N-termini, yellow arrowheads; C-termini, red arrowheads. In both transitions from open to closed structures, H5 and H10 move away from each other and from the main body of the protein exposing the H4–H5–H6 hairpin so that a second apoA-I rotated along its long axis by 180° can dimerize with the first through intermolecular interactions of the H4–H5–H6 region. The 10–20 ns region was used to construct the 10–20 ns model because it represented a zone of helix stability and spin-coupled spatial compactness. The 15 ns frame was used to create the 15 ns model because it represented the middle of the zone of helix stability and maximal spin-coupled spatial compactness. This choice is supported by the cross-linking data (Figure 4a) and the observation that the core helical domains of the model, CH1, -4, and -5 [except for the spikes associated with the closed–open transitions (Figure 6)], are equilibrated at 15 ns (red circle in Figure S2b of the Supporting Information). The nonhelical core of the simulated structure reaches initial equilibration at 10 ns (cyan line in Figure S2c of the Supporting Information) but begins a steep increase in rmsd after 27 ns, in part because of significant changes in the position of H10 in the model (see below). Examination of the simulation frames between 27 and 30 ns shows that (i) H/D helical domain 2 has become nonhelical (a particular problem because this domain was shown by Chetty et al.[40] to be helical); (ii) H10 is trapped between CH1 and a portion of H6 and is beginning to separate from its previous interaction with H4 and H5 (Figure S5a of the Supporting Information); and (iii) for the first time, a gap has appeared between the H4 and H6 domains (Figure S5b of the Supporting Information). The gap and the loss of H/D helical domain 2 would suggest that the simulation is becoming unstable, although it is possible that the 30 ns structure represents a new transitional conformation for monomeric apoA-I. We will pursue the latter possibility at a later date. We believe that the 15 ns model (Figure 5) represents the best available static structure for human lipid-free monomeric apoA-I; it fits all three experimentally determined parameters (helical domains, spin-coupled residues, and chemical cross-linking results) better than either of the two previously published all-atom models.[25,26] Because the helical structure of monomeric human apoA-I is known to be very flexible and to unfold and refold in a dynamic manner,[40] the 15 ns model, although static, illustrates in a clear fashion the most important conformational features of the ensemble of structures in thermal equilibrium (Figures 4a and 5). The Chetty[40] H/D exchange data suggest that residues 121–143 (H5) and 180–243 (H8–H10) are not helical. However, EPR studies led Oda et al.[43] to conclude that H10 in monomeric apoA-I is at least partially helical. We propose that H5 and H8–H10 (green helical domains in Figure 5a) are in motion, forming and breaking helices sufficiently rapidly that H/D exchange cannot detect them reliably [perhaps similar to what is happening at 30 ns (Figure S5 of the Supporting Information)]. In support of this proposal, a comparison of the 15 ns model with the 20 ns structure (Figure 6) demonstrates that H5 and H10 have moved away from each other and from the main body of the protein, exposing tandem helical domains H4 and H6. This conformational transition happens twice during the 30 ns MD simulation. The two open structures can be recognized as two upward blips in the rmsd plot (empty arrowheads in Figure S2a of the Supporting Information). The closed structures appear at 15 and 23 ns and the open structures at 20 and 28 ns (Figure 6). Although we performed the simulation at 500 K, we found little change in overall helicity between 10 and 30 ns (58 to 55%). We surmised from the H/D data[40] that the structure’s core conformation remains relatively intact [red helical domains 1 (residues 7–31), 4 (residues 84–120), and 5 (residues 146–174) in Figure 5a]. Evidence of core stability is provided by the rmsd data (Figure S2 of the Supporting Information) and by the more extensive stability analyses shown in Figure 7. An rmsf plot of the fluctuation of each residue during the full simulation (Figure 7a) shows that core helix (CH) domains 1, 4, and 5 (along with smaller H/D helical domains 2 and 3) are more stable than most of the remainder of the model.

Figure 7

Stability analysis of the apoA-I monomer core CH1, CH4, and CH5 domains. (a) Root-mean-square fluctuation (rmsf) of all residues of the initial crystal model of apoA-I over the entire course of the 30 ns MD simulation at 500 K. Red boxes show H/D helical regions 1, 4, and 5 (residues 7–44, 81–115, and 147–178, respectively) that represent the core helical domains of the monomer model. Colored circles (L22 and L163, yellow; G26 and S167, gray; E92, red; K96, blue; Y100, Y104, Y108, and Y166, magenta; L159, green) denote key residues in panel b that stabilize the crossover contacts among the three core helical domains. (b) Relaxed-eye stereo view of the CH1, CH4, and CH5 domains (transparent red ribbons identified by pink numerals) of the 15 ns model. Hydrophobic residues of the core are shown as green sticks. G26 and S167 (gray space-filling model) form a hydrogen-bonded pair at the crossover between CH1 and CH5. E92 (red space-filling model), K96 (blue space-filling model), and Y166 (magenta space-filling model) form a cation−π complex between CH4 and CH5. Y100, F104, and W108 (magenta, dotted space-filling model–stick combination) form an aromatic cup around L159 (green sticks) between CH4 and CH5. L22 on CH1 (yellow space-filling model) packs tightly against the G26–S167 hydrogen bond. L163 on CH5 (yellow space-filling model) is tightly packed into the angle between S167 and Y166. Y115 on CH4 associates with three surrounding hydrophobic residues to help hold the upper ends of CH4 and CH5 together. Similar hydrophobic interactions occur between CH4 and CH5 at the bottom. We examined the molecular packing of the three core helical domains to identify stabilizing structural elements. The three helical domains are in contact at crossover zones of CH1 with CH5 and CH5 with CH4; CH1 makes no contact with CH4, leaving a cleft for interaction with H10. Hydrophobic residue contact is pronounced in the crossover zones of CH1 with CH5 and CH5 with CH4 (Figure 7b). Importantly, detailed examination of these two crossover zones shows several specific forms of molecular interactions (Figure 7b). (i) S167 in H5 forms a hydrogen bond with the backbone carbonyl of G26 in CH1, a common motif in helix–helix interaction in nonpolar environments.[44] This hydrogen bond is not present in the initial crystal model. (ii) The formation of a salt bridge from E92 to K96 on CH4 creates a cation−π bond between K96 and Y166 of CH5.[44] (iii) Three aromatic residues on CH4 (Y100, F104, and W108) form what we call a π cup around L159 on CH5. (iv) L22 on CH1 packs tightly against the G26–S167 hydrogen bond. (v) L163 is tightly packed into the angle between S167 and Y166 on CH5. These 11 residues involved in molecular interactions at the crossover zones of CH1 with CH5 and CH5 with CH4 when plotted onto the rmsf plot of Figure 7a define the least mobile regions of CH1, CH4, and CH5. Other interactions, e.g., between Y115 and three surrounding hydrophobic residues and several just below E92 (Figure 7b), hold the ends of CH4 and CH5 together. Thus, in the MD simulation, the core remains relatively stable during three transitions involving H5 and H10: closed → open → closed → open (Figure 6). These observations are consistent with a wide range of biophysical studies[24] demonstrating that lipid-free apoA-I exhibits a two-domain structure in solution: a more rigid N-terminal domain (residues 1–189, red helical domains in Figure 5a) and a less organized flexible C-terminal domain (residues 190–243, green helical domains in Figure 5a). The stability of the initial crystal model’s secondary structure to MD simulations at 500 K is unusual because our 500 K MD simulations of the models of Silva[25] and Pollard[26] resulted in much greater changes in total helicity (Figure S6 of the Supporting Information): the Silva model went from 52% helicity at 0 ns to 28% at 30 ns (Figure S6a of the Supporting Information), while the Pollard model went from 54% helicity at 0 ns to 28% at 10 ns (Figure S6b of the Supporting Information). Our model for monomeric apoA-I is stable even when compared with one of the apolipophorin III proteins found in many insects.[45] An MD simulation of the crystal structure of that four-helix bundle apolipoprotein showed no plateau in helix stability, which decreased linearly from 80% at 0 ns to 50% at 20 ns (Figure S6c of the Supporting Information). A protein that conserves a nativelike secondary structure but has no tightly packed interior is defined as a molten globule, which supposedly is an intermediate thermodynamic state that clearly differs from either a tightly folded, highly stable structure or the denatured state.[46] Our MD simulations, which suggest that lipid-free apoA-I maintains a core complex of intact folded helices with dynamic closing and opening of H5 and H10, are consistent with the definition of a molten globule. We suggest that apoA-I is highly dynamic, with a nativelike secondary structure that binds its targets tightly and specifically in the absence of side chain close packing.[46] Consistent with our findings, biophysical studies suggest that apoA-I is a molten globule,[39] a state suggested to mediate the incorporation of lipid into plasma.[39] To sample the lipid affinity of our lipid-free apoA-I monomer model (Figure 5), we coarse grained the 15 ns model and performed a 20 μs coarse grained MD simulation of the model in a periodic solvent box containing 10 randomly distributed monomeric POPC; the results are shown in Figure S7 of the Supporting Information. After simulation, seven POPC bind, forming a small monolayer in the monomeric apoA-I “lipid cup”. This model shares similarities with a model for the preβ-apoA-I monomer recently proposed by Jayaraman et al.[47] that is based upon lipid-poor monomeric apoA-I released by heating HDL. (i) Their particle contains 8 or 9 PCs and 1 CE. (ii) In their model, the C-terminal H10 folds back against a H4–H5–H6 “bucket” (see the cup formed by H4–H5–H6 and H10 in Figure S7d of the Supporting Information) derived from the crystal structure of (Δ185–243)apoA-I.[27] The open conformation of apoA-I at 20 and 28 ns suggests a mechanism for dimerization and accumulation of lipid into the discoidal HDL form. The 20 and 28 ns structures expose the H4–H6 hairpin domains (Figure 6), so that two similarly structured apoA-I monomers rotated relative to one another along their long axes by 180° are able to dimerize. This association would be driven by the formation of intermolecular salt bridges among the H4, H5, and H6 domains of the two monomers and result in close planar proximity of the six spin-coupled residues in the dimer, as proposed by Lagerstedt et al.[41] The addition of lipid (Figure S7 of the Supporting Information) and the unfolding of the helical remainder of the two apoA-I would provide the correct topology and lower the activation energy for dimerization. The dimerized lipid-poor apoA-I would then be poised to grow into a fully lipidated LL5/5 double-belt discoidal particle.[20] As noted by Mei and Atkinson,[27] the paired antiparallel H5 repeats between opposing chains of the apoA-I dimer form leucine zipper motifs. It seems likely that these are the principal motifs for aligning the pairwise antiparallel H5 repeats of the apoA-I dimer in the lipid-free state. For example, in the open 20 ns state (Figure 6), the leucine zippers might “zip-lock” the double belt into the H4–H6 antiparallel pair and ultimately the full LL5/5[20] rotamer registration. As noted elsewhere,[28] however, this leucine zipper motif may not be important for maintaining the apoA-I dimer after apoA-I has acquired lipid, reflecting the marked increase in helicity and the formation of salt bridges between the proteins.[20] A key element of our model is the loop/turn in the vicinity of Y192 that allows H10 to contact H5, a conformational motif whose formation and breaking we propose modulates protein dimerization and lipid binding. Our cross-linking data support all elements of this model. The existence of the loop/turn [loop 4 (Figure 4b)] is supported by cross-links 182–195 and 195–206. Contact of the H5 hairpin with H10 is supported by cross-links 118–226, 118–239, 133–239, and 140–239 (Figure 4b). The H5 hairpin itself is supported by cross-link 118–140 (Figure 4b). Table S1 of the Supporting Information compares cross-linked peptides identified in this study with those identified by Silva et al.[25] and Pollard et al.[26] Eight cross-links are common to all three studies (Figure S8a of the Supporting Information). Of the lysine cross-links identified in our study that support the H10–H5 contact model (Figure S8b of the Supporting Information), only one, cross-link 118–140 indicating a H5 hairpin, is common to all three studies. None of the other six lysine cross-links found in our study that support the H10–H5 contact model were reported by Silva et al.[25] or Pollard et al.[26] Although it does not directly support the Y192 loop, the ability of Pollard et al.[26] to cross-link L200C and L233C mutations supports our contention that helix 10 is extremely mobile. We tested the models experimentally by reacting monomeric apoA-I with MDA, which cross-links proteins and also impairs the ABCA1 activity of lipid-free apoA-I.[35] Because MDA creates cross-links in apoA-I between H10 (C-terminus) and H5,[35] cross-linking should hinder apoA-I’s ability to form discoidal HDL, a key step in reverse cholesterol transport, by locking H10 to H5, preventing the transition from the closed to open form (Figure 6). It is significant that HDL isolated from human atherosclerotic lesions is modified by MDA and that elevated levels of MDA-modified HDL have been detected in blood of subjects with established coronary artery disease. Thus, activation of the MDA pathway might impair sterol efflux from artery wall cells in vivo by locking apoA-I into an unfavorable conformation.[35] Other studies suggest that Tyr192 of lipid-free apoA-I is also a key regulator of sterol efflux by the ABCA1 pathway.[48] In our models, Y192 is the first aromatic residue in residues 183–243 of the C-terminal fragment of the Borhani crystal structure that aligns with the aromatic residues of the hydrophobic cleft of the crystal structure of Mei and Atkinson (Figure 1g). Thus, the π–π interactions between Y192 and W72 (Figure 1g) may be important in wrapping the C-terminal fragment around one end of the helix bundle structure (Figure 7b) to create the H10–H5 juxtaposition that we propose regulates interactions of apoA-I with lipids. Consistent with this possibility, we have shown that Y192 and the adjacent E191 are the only completely conserved adjacent pair of residues in apoA-I of the eight completely conserved residues found in 31 orthologs of apoA-I from mammals to fish.[49] Finally, Pollard et al.[26] have emphasized the degree of cross-linking connectivity of K40 to six other residues, K59, K118, K133, K140, K182, and K239. As shown in Figure 5b, K40 is within a long loop that extends from residue 32 to 55 and is expected to be highly mobile. In fact, three of the six connected residues, including K239 at the opposite end of the primary sequence from K40, are <23 Å from K40 in our 15 ns model, and all six are <17 Å from K40 at some point in the full trajectory model (Table 5).

Table 5

Minimal Distances (angstroms) from K40

residue	15 ns model	full trajectory model
K59	22.3	11.6
K118	32.4	16.1
K133	18.4	16.0
K140	28.4	13.9
K182	24.7	15.2
K239	17.4	6.7

In conclusion, we combined portions of the crystal structures of human apoA-I of Borhani et al.[18] and Mei and Atkinson[27] and used MD simulations to develop a refined, dynamic structure of lipid-free monomeric human apoA-I that is consistent with the molten globule state. This model robustly fits three experimentally determined parameters: (i) domains predicted to be helical by H/D exchange,[40] (ii) spin-coupled residues defining a spatially compact cluster in the solution structure of monomeric apoA-I,[50] and (iii) high-precision mass spectrometry/peptide analysis with cross-linking chemistry. Because our model, schematically illustrated in Figure S9 of the Supporting Information, cycles between closed and open states, it suggests a mechanism whereby monomeric apoA-I might dimerize and interact with lipid during the process of reverse cholesterol transport. Although our proposed model is consistent with the existing low-resolution data, it awaits ultimate experimental verification by higher-resolution methods.

47 in total

1. The structure of human lipoprotein A-I. Evidence for the "belt" model.

Authors: V Koppaka; L Silvestro; J A Engler; C G Brouillette; P H Axelsen
Journal: J Biol Chem Date: 1999-05-21 Impact factor: 5.157

2. A three-dimensional molecular model of lipid-free apolipoprotein A-I determined by cross-linking/mass spectrometry and sequence threading.

Authors: R A Gangani D Silva; George M Hilliard; Jianwen Fang; Stephen Macha; W Sean Davidson
Journal: Biochemistry Date: 2005-03-01 Impact factor: 3.162

Review 3. Pathways for oxidation of high-density lipoprotein in human cardiovascular disease.

Authors: Baohai Shao; Michael N Oda; Tomas Vaisar; John F Oram; Jay W Heinecke
Journal: Curr Opin Mol Ther Date: 2006-06

Review 4. New insights into the regulation of HDL metabolism and reverse cholesterol transport.

Authors: Gary F Lewis; Daniel J Rader
Journal: Circ Res Date: 2005-06-24 Impact factor: 17.367

5. Crystal structure of truncated human apolipoprotein A-I suggests a lipid-bound conformation.

Authors: D W Borhani; D P Rogers; J A Engler; C G Brouillette
Journal: Proc Natl Acad Sci U S A Date: 1997-11-11 Impact factor: 11.205

6. ApoA-I deficiency causes both hypertriglyceridemia and increased atherosclerosis in human apoB transgenic mice.

Authors: E Voyiaziakis; I J Goldberg; A S Plump; E M Rubin; J L Breslow; L S Huang
Journal: J Lipid Res Date: 1998-02 Impact factor: 5.922

7. Quantification of malondialdehyde and 4-hydroxynonenal adducts to lysine residues in native and oxidized human low-density lipoprotein.

Authors: J R Requena; M X Fu; M U Ahmed; A J Jenkins; T J Lyons; J W Baynes; S R Thorpe
Journal: Biochem J Date: 1997-02-15 Impact factor: 3.857

8. Novel changes in discoidal high density lipoprotein morphology: a molecular dynamics study.

Authors: Andrea Catte; James C Patterson; Martin K Jones; W Gray Jerome; Denys Bashtovyy; Zhengchang Su; Feifei Gu; Jianguo Chen; Marcela P Aliste; Stephen C Harvey; Ling Li; Gilbert Weinstein; Jere P Segrest
Journal: Biophys J Date: 2006-03-31 Impact factor: 4.033

9. Somatic gene transfer of human ApoA-I inhibits atherosclerosis progression in mouse models.

Authors: P Benoit; F Emmanuel; J M Caillaud; L Bassinet; G Castro; P Gallix; J C Fruchart; D Branellec; P Denèfle; N Duverger
Journal: Circulation Date: 1999 Jan 5-12 Impact factor: 29.690

Review 10. Using tandem mass spectrometry to quantify site-specific chlorination and nitration of proteins: model system studies with high-density lipoprotein oxidized by myeloperoxidase.

Authors: Baohai Shao; Jay W Heinecke
Journal: Methods Enzymol Date: 2008 Impact factor: 1.600

12 in total

Review 1. Lipid-free Apolipoprotein A-I Structure: Insights into HDL Formation and Atherosclerosis Development.

Authors: Xiaohu Mei; David Atkinson
Journal: Arch Med Res Date: 2015-06-03 Impact factor: 2.235

2. Multiscale Simulations of Biological Membranes: The Challenge To Understand Biological Phenomena in a Living Substance.

Authors: Giray Enkavi; Matti Javanainen; Waldemar Kulig; Tomasz Róg; Ilpo Vattulainen
Journal: Chem Rev Date: 2019-03-12 Impact factor: 60.622

3. An Evaluation of the Crystal Structure of C-terminal Truncated Apolipoprotein A-I in Solution Reveals Structural Dynamics Related to Lipid Binding.

Authors: John T Melchior; Ryan G Walker; Jamie Morris; Martin K Jones; Jere P Segrest; Diogo B Lima; Paulo C Carvalho; Fábio C Gozzo; Mark Castleberry; Thomas B Thompson; W Sean Davidson
Journal: J Biol Chem Date: 2016-01-11 Impact factor: 5.157

Review 4. Structural stability and functional remodeling of high-density lipoproteins.

Authors: Olga Gursky
Journal: FEBS Lett Date: 2015-03-05 Impact factor: 4.124

5. Apolipoprotein A1 Forms 5/5 and 5/4 Antiparallel Dimers in Human High-density Lipoprotein.

Authors: Yi He; Hyun D Song; G M Anantharamaiah; M N Palgunachari; Karin E Bornfeldt; Jere P Segrest; Jay W Heinecke
Journal: Mol Cell Proteomics Date: 2019-01-18 Impact factor: 5.911

6. High-Density Lipoprotein Biogenesis: Defining the Domains Involved in Human Apolipoprotein A-I Lipidation.

Authors: Ricquita D Pollard; Brian Fulp; Mary G Sorci-Thomas; Michael J Thomas
Journal: Biochemistry Date: 2016-08-23 Impact factor: 3.162

7. Structural Stability and Local Dynamics in Disease-Causing Mutants of Human Apolipoprotein A-I: What Makes the Protein Amyloidogenic?

Authors: Madhurima Das; Christopher J Wilson; Xiaohu Mei; Thomas E Wales; John R Engen; Olga Gursky
Journal: J Mol Biol Date: 2015-11-10 Impact factor: 5.469

8. Probing the C-terminal domain of lipid-free apoA-I demonstrates the vital role of the H10B sequence repeat in HDL formation.

Authors: Xiaohu Mei; Minjing Liu; Haya Herscovitz; David Atkinson
Journal: J Lipid Res Date: 2016-06-17 Impact factor: 5.922

Review 9. Structural Insights into High Density Lipoprotein: Old Models and New Facts.

Authors: Valentin Gogonea
Journal: Front Pharmacol Date: 2016-01-12 Impact factor: 5.810

10. Structural and Functional Analysis of the ApolipoproteinA-I A164S Variant.

Authors: Jonathan Dalla-Riva; Jens O Lagerstedt; Jitka Petrlova
Journal: PLoS One Date: 2015-11-25 Impact factor: 3.240