| Literature DB >> 35619705 |
Dario F Marzella1, Farzaneh M Parizi1, Derek van Tilborg1,2, Nicolas Renaud3, Daan Sybrandi4, Rafaella Buzatu1, Daniel T Rademaker1, Peter A C 't Hoen1, Li C Xue1.
Abstract
Deeper understanding of T-cell-mediated adaptive immune responses is important for the design of cancer immunotherapies and antiviral vaccines against pandemic outbreaks. T-cells are activated when they recognize foreign peptides that are presented on the cell surface by Major Histocompatibility Complexes (MHC), forming peptide:MHC (pMHC) complexes. 3D structures of pMHC complexes provide fundamental insight into T-cell recognition mechanism and aids immunotherapy design. High MHC and peptide diversities necessitate efficient computational modelling to enable whole proteome structural analysis. We developed PANDORA, a generic modelling pipeline for pMHC class I and II (pMHC-I and pMHC-II), and present its performance on pMHC-I here. Given a query, PANDORA searches for structural templates in its extensive database and then applies anchor restraints to the modelling process. This restrained energy minimization ensures one of the fastest pMHC modelling pipelines so far. On a set of 835 pMHC-I complexes over 78 MHC types, PANDORA generated models with a median RMSD of 0.70 Å and achieved a 93% success rate in top 10 models. PANDORA performs competitively with three pMHC-I modelling state-of-the-art approaches and outperforms AlphaFold2 in terms of accuracy while being superior to it in speed. PANDORA is a modularized and user-configurable python package with easy installation. We envision PANDORA to fuel deep learning algorithms with large-scale high-quality 3D models to tackle long-standing immunology challenges.Entities:
Keywords: computational immunology; computational structural biology; integrative modelling; large-scale 3D-modelling; peptide:MHC
Mesh:
Substances:
Year: 2022 PMID: 35619705 PMCID: PMC9127323 DOI: 10.3389/fimmu.2022.878762
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Overview of the MHC molecules. (A) 3D structure of a pMHC-I complex (PDB ID: 1DUZ). The α chain is divided in IMGT defined domains by shades of light blue. The β-2 Microglobulin chain is shown in light orange. The peptide is shown in red. (B) 3D structure of a pMHC-II complex (PDB ID: 1AQD). The alpha chain is divided in IMGT defined domains by shades of light blue. The β chain is divided in IMGT defined domains by shades of orange. The peptide is shown in red.
Figure 2Overview of PANDORA pMHC-I protocol and its performance on 835 pMHC-I complexes with X-ray structures. (A) PANDORA schematic flowchart. An allele type and peptide sequence of a target pMHC-I case are given as input. This information is used to identify the best matching template structure from a local database of pMHC-I structures. The target MHC is then modelled on top of the template and its peptide (red) is superposed on the template peptide. The anchor positions (specified by the user or by other tools, see section 2.4) are then specified as fixed. MODELLER then generates 20 loop models maintaining the anchor restrained. Finally, all models are scored with MODELLER internal molpdf scoring function. (B) Sampling performance of PANDORA in our cross-docking benchmark experiment. Histogram of the lowest backbone L-RMSD models is shown. (C) Success rate of Backbone L-RMSD at different thresholds according to CAPRI criteria: High-quality (L-RMSD <1 Å), Medium, (<2 Å), Acceptable (<5 Å), and Incorrect (<10 Å) (Lensink et al., 2020). (D) Complete performance of PANDORA (modelling + scoring). Histogram of the backbone L-RMSD of the best molpdf models is shown. (E) Example of an average-quality 3D model generated with PANDORA. The target peptide (PDB ID: 3I6L) is marked in red; the template structure (PDB ID: 3WL9) is marked in blue; the model structure is marked in orange.
Figure 3PANDORA comparison with the state-of-the-art methods. Y axes represent PANDORA L-RMSD per case while X axes represent the L-RMSD of the same case modelled with the reported method. The dotted line indicates the average L-RMSD for each method. (A) Difference between PANDORA best molpdf model Cα L-RMSD and DockTope reported Cα L-RMSD on 133 cases (PANDORA cross-docking against DockTope cross-docking); (B) Difference between PANDORA best molpdf model backbone + Cβ L-RMSD and GradDock reported backbone + Cβ L-RMSD on 65 cases (PANDORA cross-docking against GradDock cross-docking); (C) Difference between PANDORA best L-RMSD model Cα L-RMSD and APE-Gen reported Cα L-RMSD on 509 cases (PANDORA cross-docking against APE-Gen self-docking).
Figure 4PANDORA’s case studies on with non-canonical cases. The images are oriented to present the most representative view of the difference between models and target. (A) PANDORA produced better models than using canonical anchor positions in terms of backbone L-RMSD of cases. (B) A typical case (target PDB ID: 1DUY, template PDB ID: 1AO7 in Red, peptide=LFGYPVYV) with non-canonical anchor positions Blue with (actual anchors: P1 and P9(Ω). (C) Case study on the 10-mer from PDB structure 3BEW (Red). L-RMSD with default settings: 2.02 Å (Yellow); L-RMSD using secondary structure restraints: 0.80 Å (Blue). (D) Case study on the 15-mer from PDB structure 4U6Y (Red). L-RMSD with default settings: 3.32 Å (Yellow); L-RMSD using secondary structure restraints: 1.50 Å (Blue).