| Literature DB >> 29736599 |
Abstract
Ion mobility mass spectrometry (IM/MS) can provide structural information on intact protein complexes. Such data, including connectivity and collision cross sections (CCS) of assemblies' subunits, can in turn be used as a guide to produce representative super coarse-grained models. These models are constituted by ensembles of overlapping spheres, each representing a protein subunit. A model is considered plausible if the CCS and sphere-overlap levels of its subunits fall within predetermined confidence intervals. While the first is determined by experimental error, the latter is based on a statistical analysis on a range of protein dimers. Here, we first propose a new expression to describe the overlap between two spheres. Then we analyze the effect of specific overlap cutoff choices on the precision and accuracy of super coarse-grained models. Finally, we propose a method to determine overlap cutoff levels on a per-case scenario, based on collected CCS data, and show that it can be applied to the characterization of the assembly topology of symmetrical homo-multimers. Graphical Abstract.Entities:
Keywords: Ion mobility, super coarse-grain; Molecular modeling; Native mass spectrometry; Protein assembly
Mesh:
Substances:
Year: 2018 PMID: 29736599 PMCID: PMC6318233 DOI: 10.1007/s13361-018-1974-2
Source DB: PubMed Journal: J Am Soc Mass Spectrom ISSN: 1044-0305 Impact factor: 3.109
Figure 1Relationship between sphere-overlap and their CCS (a). The overlap between two spheres ranges from 0 (spheres touching or not in contact) to 100% (the smaller sphere is completely embedded in the larger one). (b) The histogram shows the distribution of optimal overlap Obest in our benchmark set of 1988 protein pairs. Data can be fitted with a Gaussian curve (solid line) centered at an overlap level |O| equal to 22.6%. As comparison, the Gaussian fitting the distribution of Ostruct is also shown (dotted line). Three proteins, featuring three different overlap levels are shown: lactamase (PDB: 1M6K, Obest = 2%), BanLec (PDB: 5EXG, Obest = 29%), and the peroxidase HORF6 (PDB: 1PRX, Obest = 50%). (c) The gray area shows the error in CCS value connected to the choice of an overlap interval of a specific size (e.g., |O| ± 15 indicates an overlap interval from 7.6 to 37.6%). Each interval choice is connected to a certain likelihood of including the specific Obest value for the complex under study, shown with a palatinate colored line. (d) Relationship between ratio of CCS of individual components and that of complex, against spheres overlap. Given measured CCS of individual components and complex, the ideal overlap between spheres representing protein subunits can be predicted (solid line). The relationship holds independently from the relative radius of the spheres representing the binding partners: both homo- and hetero-multimers are equally distributed along the same trend
Figure 2Testing the predictive power of overlap confidence intervals. For three different homo-multimers (from left to right: PDBs 4I88, 1D2N, and 1G41), we produced a range of super coarse-grained models according to different candidate topologies. We then assessed whether the correct topology could be identified (indicated with a tick mark in each case), by filtering the models according to both their CCS matching with the known value (3% error, gray region), and the amount of overlap between their subunits. Red vertical bands indicate overlap confidence intervals defined by the constant cutoff method, blue bands by our adaptive cutoff one, and purple bands regions where both methods agree. To be considered acceptable, a topology must have its trend line within the region at the interception between the gray and red (or blue) areas. The constant cutoff method produced both false positives and false negatives, whereas our adaptive cutoff method always identified the correct topology