Literature DB >> 29741581

Jwalk and MNXL web server: model validation using restraints from crosslinking mass spectrometry.

Joshua M A Bullock¹, Konstantinos Thalassinos^1,2, Maya Topf¹.

Abstract

Motivation: Crosslinking Mass Spectrometry generates restraints that can be used to model proteins and protein complexes. Previously, we have developed two methods, to help users achieve better modelling performance from their crosslinking restraints: Jwalk, to estimate solvent accessible distances between crosslinked residues and MNXL, to assess the quality of the models based on these distances.
Results: Here, we present the Jwalk and MNXL webservers, which streamline the process of validating monomeric protein models using restraints from crosslinks. We demonstrate this by using the MNXL server to filter models made of varying quality, selecting the most native-like. Availability and implementation: The webserver and source code are freely available from jwalk.ismb.lon.ac.uk and mnxl.ismb.lon.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities: Chemical Gene Species

Mesh：

Substances：

Year: 2018 PMID： 29741581 PMCID： PMC6184817 DOI： 10.1093/bioinformatics/bty366

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 Introduction

Crosslinking Mass Spectrometry (XL-MS) is an experimental method that can generate sparse structural information on proteins, complementary to traditional structural techniques. Briefly, an XL-MS experiment consists of (a) crosslinking of your target protein (or protein complex); (b) digesting the crosslinked protein and (c) identifying via MS which residues are crosslinked. This results in restraining information that can then be used to model the protein of interest. Jwalk (Bullock ) is a program that calculates the Solvent Accessible Surface Distance (SASD), which is defined as the shortest distance between two residues across the surface of the protein (Kahraman ). The SASD is a theoretically more correct approximation of the distance between crosslinked residues than the commonly used Euclidean distance, because the Euclidean distance permits travelling through the protein mass whereas a crosslinker cannot travel through protein mass. Other methods exist that calculate the SASD, while also considering side-chain flexibility (Degiacomi ). Crosslinks can also act as a proxy for solvent accessibility, as residues must be solvent exposed if they are to be crosslinked. Utilizing this extra information, we created the crosslink scoring function called Matched and Non-accessible Crosslink Score (MNXL), which we found to outperform more conventional methods when scoring protein monomers (Bullock ). In order to streamline the XL-MS modelling procedure, we have incorporated these two developments (MNXL and Jwalk) into two webservers. Both programs are also standalone and freely available to download.

2 Implementation

Jwalk and MNXL are written in Python 2.7. For a full description of Jwalk see (Bullock ). Jwalk outputs a list of all the SASDs and Euclidean distances between target residues in a .txt file along with a .pdb file that contains all the SASD paths modelled using glycine pseudo atoms. On the Jwalk webserver (Fig. 1A), SASD paths are visualized with JSmol (Hanson ). MNXL takes as input a list of experimental crosslinks and Jwalk .txt output files (which can also be provided by the user independently). The SASDs of the experimental crosslinks are then scored using the MNXL scoring function (Bullock ). Using the webserver, users can also go directly from .pdb file to MNXL score (instead of running Jwalk separately).

Fig. 1.

(A) Partial screenshot of the Jwalk result page with PDB id. 1HRC shown. (B) The models of the test-case superposed in grey, with the best scoring model based on PDB id. 1QL3 (blue) and the native 1HRC (green). (C) Results table showing MNXL is able to select the lowest Cα-RMSD model MNXL outputs the scores for each model in a .txt file. Higher scores indicate better models. Additionally, MNXL outputs the number of crosslinks that are matched, violating and non-accessible to aid model assessment. The source code for both Jwalk and MNXL is available under a Creative Commons license at http://topf-group.ismb.lon.ac.uk/Software.html.

3 Results

To demonstrate the utility of the Jwalk/MNXL web server, we show the ability for the combination of Jwalk and MNXL to filter comparative models made with different templates. Five different models of the horse heart cytochrome C crystal structure (PDB id: 1HRC) were made with MODELLER (Eswar ) using templates of various quality, taken from the HHPred server (Alva ) with probability score > 96% in all cases (Fig. 1B). The five template PDB ids (and associated sequence identity) are: 1QL3 (42%), 5LO9 (20%), 1H32 (17%), 2MTA (17%) and 2C1D (19%), respectively. These comparative models were then uploaded into the MNXL webserver [along with the experimentally observed crosslinks taken from XLdb (Kahraman )]. The MNXL score was able to successfully select the model made using template 1QL3, which is the model with the lowest Cα-RMSD to the native structure (Fig. 1C)––for further discussion of the results see Supplementary Material.

4 Conclusion

We have created webservers for MNXL and Jwalk, two methods that can be used to validate models using restraints from XL-MS and demonstrated how it can be useful in filtering comparative models built from different templates. These webservers are designed to be user-friendly, in order to make it easier for the novice user to make better use of their crosslinking data. We hope to expand these platforms to incorporate the modelling of protein complexes in the near future. Click here for additional data file.

6 in total

1. Comparative protein structure modeling using MODELLER.

Authors: Narayanan Eswar; Ben Webb; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper; Andrej Sali
Journal: Curr Protoc Protein Sci Date: 2007-11

2. Accommodating Protein Dynamics in the Modeling of Chemical Crosslinks.

Authors: Matteo T Degiacomi; Carla Schmidt; Andrew J Baldwin; Justin L P Benesch
Journal: Structure Date: 2017-09-28 Impact factor: 5.006

3. Xwalk: computing and visualizing distances in cross-linking experiments.

Authors: Abdullah Kahraman; Lars Malmström; Ruedi Aebersold
Journal: Bioinformatics Date: 2011-06-11 Impact factor: 6.937

4. The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry.

Authors: Joshua Matthew Allen Bullock; Jannik Schwab; Konstantinos Thalassinos; Maya Topf
Journal: Mol Cell Proteomics Date: 2016-05-05 Impact factor: 5.911

5. The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis.

Authors: Vikram Alva; Seung-Zin Nam; Johannes Söding; Andrei N Lupas
Journal: Nucleic Acids Res Date: 2016-04-29 Impact factor: 16.971

6. Cross-link guided molecular modeling with ROSETTA.

Authors: Abdullah Kahraman; Franz Herzog; Alexander Leitner; George Rosenberger; Ruedi Aebersold; Lars Malmström
Journal: PLoS One Date: 2013-09-17 Impact factor: 3.240

6 in total

11 in total

Review 1. Automated Modeling and Validation of Protein Complexes in Cryo-EM Maps.

Authors: Tristan Cragnolini; Aaron Sweeney; Maya Topf
Journal: Methods Mol Biol Date: 2021

2. Integrative Mass Spectrometry-Based Approaches for Modeling Macromolecular Assemblies.

Authors: Andy M Lau; Argyris Politis
Journal: Methods Mol Biol Date: 2021

Review 3. Principles for Integrative Structural Biology Studies.

Authors: Michael P Rout; Andrej Sali
Journal: Cell Date: 2019-05-30 Impact factor: 41.582

4. SurA is a cryptically grooved chaperone that expands unfolded outer membrane proteins.

Authors: Dagan C Marx; Ashlee M Plummer; Anneliese M Faustino; Taylor Devlin; Michaela A Roskopf; Mathis J Leblanc; Henry J Lessen; Barbara T Amann; Patrick J Fleming; Susan Krueger; Stephen D Fried; Karen G Fleming
Journal: Proc Natl Acad Sci U S A Date: 2020-10-22 Impact factor: 11.205

5. Structural mass spectrometry decodes domain interaction and dynamics of the full-length Human Histone Deacetylase 2.

Authors: Zoja Soloviev; Joshua M A Bullock; Juliette M B James; Andrea C Sauerwein; Joanne E Nettleship; Raymond J Owens; D Flemming Hansen; Maya Topf; Konstantinos Thalassinos
Journal: Biochim Biophys Acta Proteins Proteom Date: 2022-01-18 Impact factor: 3.036

6. Cross-linking mass spectrometry reveals the structural topology of peripheral NuRD subunits relative to the core complex.

Authors: Cornelia G Spruijt; Cathrin Gräwe; Simone C Kleinendorst; Marijke P A Baltissen; Michiel Vermeulen
Journal: FEBS J Date: 2020-12-14 Impact factor: 5.542

7. PyXlinkViewer: A flexible tool for visualization of protein chemical crosslinking data within the PyMOL molecular graphics system.

Authors: Bob Schiffrin; Sheena E Radford; David J Brockwell; Antonio N Calabrese
Journal: Protein Sci Date: 2020-07-03 Impact factor: 6.725

Review 8. Computational modeling of RNA 3D structure based on experimental data.

Authors: Almudena Ponce-Salvatierra; Katarzyna Merdas; Chandran Nithin; Pritha Ghosh; Sunandan Mukherjee; Janusz M Bujnicki
Journal: Biosci Rep Date: 2019-02-08 Impact factor: 3.840

9. A general chemical crosslinking strategy for structural analyses of weakly interacting proteins applied to preTCR-pMHC complexes.

Authors: Réka Mizsei; Xiaolong Li; Wan-Na Chen; Monika Szabo; Jia-Huai Wang; Gerhard Wagner; Ellis L Reinherz; Robert J Mallis
Journal: J Biol Chem Date: 2021-01-08 Impact factor: 5.486

Review 10. Structural Proteomics Methods to Interrogate the Conformations and Dynamics of Intrinsically Disordered Proteins.

Authors: Rebecca Beveridge; Antonio N Calabrese
Journal: Front Chem Date: 2021-03-11 Impact factor: 5.221