| Literature DB >> 29395654 |
Michael Schneider1, Adam Belsom2, Juri Rappsilber3.
Abstract
Observing the structures of proteins within the cell and tracking structural changes under different cellular conditions are the ultimate challenges for structural biology. This, however, requires an experimental technique that can generate sufficient data for structure determination and is applicable in the native environment of proteins. Crosslinking/mass spectrometry (CLMS) and protein structure determination have recently advanced to meet these requirements and crosslinking-driven de novo structure determination in native environments is now possible. In this opinion article, we highlight recent successes in the field of CLMS with protein structure modeling and challenges it still holds.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29395654 PMCID: PMC5854373 DOI: 10.1016/j.tibs.2017.12.006
Source DB: PubMed Journal: Trends Biochem Sci ISSN: 0968-0004 Impact factor: 13.807
Studies and Modeling Resources for Crosslink-Driven Protein Tertiary Structure Modeling
| Study | Crosslinking/mass spectrometry | Data analysis | Protein structure modeling |
|---|---|---|---|
| Young | BS3 crosslinker with MALDI-postsource decay mass spectrometry | Automated Spectrum Assignment Program (ASAP) | Scoring of threaded models with crosslink constraints |
| Kahraman | Simulated crosslinks | Simulated crosslinks | XWalk algorithm for computing sSASD for crosslinks. Can be used for validation and visualization |
| Kahraman | Crosslink data from the literature | Crosslink data from the literature | Comparative modeling and |
| Hofmann | Simulated crosslinks | Simulated crosslinks | |
| Matthew Allen Bullock | Crosslink data from the literature | Crosslink data from the literature | JWalk algorithm for crosslink modeling using sSASD. Development of scoring metric that accounts for nonaccessible residues |
| Belsom | Sulfosuccinimidyl 4,4′-azipentanoate (sulfo-SDA) crosslinker with liquid chromatography–MS | Xi | Guided model-based search |
| Degiacomi | Crosslink data from the literature | Crosslink data from the literature | Crosslink modeling using shortest solvent-accessible distance and explicit modeling of protein flexibility (DynamXL) |
| Brodie | Several zero-length and short-range crosslinkers. Liquid chromatography–MS | Isotopically Coded Cleavable Cross-Linking Analysis Software Suite and Kojak | Replica exchange discrete molecular dynamics |
Figure 1Overview of a Crosslinking Experiment for Protein Structure Determination. (A) As the first step in standard (homobifunctional) crosslinking, the crosslinker reacts with a specific reactive residue and then a second one to form a crosslink. Photo-crosslinking with photoactivatable reagents follows the same workflow. However, in the nucleophilic reaction step, only one side of the crosslinker reacts with the protein. The other side is activated by UV light and then reacts with the protein to form the crosslink. (B) The experimenter digests the protein using proteases (usually trypsin). The resulting peptides are then subjected to mass spectrometry. Specialized database search software reads out the crosslinks from the mass spectrometry data. The crosslinks then form the input to data-driven protein structure modeling. (C) Photo-crosslinkers such as sulfosuccinimidyl 4,4′-azipentanoate react on one side with lysine (and S/T/Y) and can react with any amino acid on the other side. This leads to a high crosslink density (the sequence of the protein is depicted by the circle; the crosslinks are shown as lines). These crosslinks can be leveraged for structural modeling. The reaction specificity of standard homobifunctional crosslinkers targets lysines (and S/T/Y residues to a lesser extent). This limits the density of the resulting crosslink network.
Figure 2Effect of Crosslinking/Mass Spectrometry (CLMS) Data in Conformational Space Search. De novo protein structure modeling searches the conformational space of the protein for the lowest energy conformation, which usually coincides with the native structure. However, the energy landscape is rugged, and the energy of the native state might be close to the energy of other local minima. This makes search difficult because there might be no clear gradient toward the native structure. Using CLMS data as residue–residue constraints transforms the energy landscape by deepening the energy well of the native structure. This also makes the energy landscape less rugged and provides a gradient toward the native state. This makes it easier to search for the native conformation and therefore leads to more frequent sampling of nativelike structures in de novo structure modeling calculations.
Figure 3Using Photo-Crosslinking/Mass Spectrometry (CLMS) Crosslinkers for Structure Modeling. (A) Photo-crosslinking of human serum albumin (HSA) with sulfosuccinimidyl 4,4′-azipentanoate leads to 1495 links at 20% false-discovery rate. (B) The distance distribution of crosslinked residues follows a log-normal distribution. Most crosslinks are between residues with Cα distances below 20 Å. (C) The combination of high-density CLMS data with computational protein modeling is able to recapitulate the HSA domain structures. Here, we show the results for domain C of HSA. Models are shown in color, while the native structure is shown in gray. Using high density-CLMS (HD-CLMS) data from purified HSA samples leads to modeled structures with a root mean square deviation (RMSD) of 2.9 Å. (D) Using HD-CLMS data from HSA samples in blood serum leads to models with an RMSD of 3.8 Å to the native structure. (E) RMSD distribution of low-energy computed models using CLMS data from purified HSA (red), from HSA in blood serum (orange), and without CLMS data (blue). Using CLMS data shifts the RMSD distribution toward lower RMSD values. Thus, the CLMS effectively guides conformational space search and allows to sample nativelike, low-RMSD structures more frequently. Adapted from [18].
Figure 4Challenges in Crosslinking/Mass Spectrometry (CLMS)-Driven Structure Determination. (A) One of the current challenges in crosslinking for structure determination is the uneven distribution of digestion sites in the protein sequence. Long-sequence stretches without trypsin digestion sites generate large peptides that are unsuitable for MS analysis. Consequently, no links can be detected in these regions. (B) Using alternative proteases or multiple enzymes for digestion could alleviate this issue by cutting these regions into smaller peptides, which can be detected in the MS. (C) Another current challenge for CLMS structure analysis is β-sheets. β-Sheets form compact structure arrangements and the distance between two β-strands is ∼5 Å. Current crosslinkers generate distance constraints of 20–35 Å [20–25 Å for sulfosuccinimidyl 4,4′-azipentanoate (sulfo-SDA)]. This is not sufficient to resolve β-sheet arrangements. (D) We speculate that using photo-amino acids could alleviate the issue, where the crosslinker formed by the side chain should lead to tighter distances constraints in the 10 Å range. Adapted from [53].