Literature DB >> 33103906

Deciphering the Structural Enigma of HLA Class-II Binding Peptides for Enhanced Immunoinformatics-based Prediction of Vaccine Epitopes.

Deepyan Chatterjee1,2, Pragya Priyadarshini2, Deepjyoti K Das1, Khurram Mushtaq1, Balvinder Singh2, Javed N Agrewala1,3.   

Abstract

Vaccines remain the most efficacious means to avoid and eliminate morbid diseases associated with high morbidity and mortality. Clinical trials indicate the gaining impetus of peptide vaccines against diseases for which an effective treatment still remains obscure. CD4 T-cell-based peptide vaccines involve immunization with antigenic determinants from pathogens or neoplastic cells that possess the ability to elicit a robust T helper cell response, which subsequently activates other arms of the immune system. The available in silico predictors of human leukocyte antigen II (HLA-II) binding peptides are sequence-based techniques, which ostensibly have balanced sensitivity and specificity. Structural analysis and understanding of the cognate peptide and HLA-II interactions are essential to empirically derive a successful peptide vaccine. However, the availability of structure-based epitope prediction algorithms is inadequate compared with sequence-based prediction methods. The present study is an attempt to understand the structural aspects of HLA-II binders by analyzing the Protein Data Bank (PDB) complexes of pHLA-II. Furthermore, we mimic the peptide exchange mechanism and demonstrate the structural implication of an acidic environment on HLA-II binders. Finally, we discuss a structure-guided approach to decipher potential HLA-II binders within an antigenic protein. This strategy may accurately predict the peptide epitopes and thus aid in designing successful peptide vaccines.

Entities:  

Keywords:  HLA-II binders; immunoinformatics; molecular docking; molecular dynamics simulation; peptide HLA-II interaction; peptide vaccines; structural immunology; structure guided vaccine designing; torsion angles

Mesh:

Substances:

Year:  2020        PMID: 33103906      PMCID: PMC7640962          DOI: 10.1021/acs.jproteome.0c00405

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


Introduction

The CD4 T-cell response is initiated by the presentation of peptides in context with human leukocyte antigen II (HLA-II), expressed on the surface of professional antigen presenting cells (APCs). During the “cut first, bind later” model,[1] HLA-II-mediated antigenic processing and presentation begins by internalization of deleterious proteins, transformed cellular contents, or pathogens by phagocytosis into a cellular organelle, the endosome. The endosome then enters a vesicular pathway to form a lyso-endosomal complex, within which the internalized components are degraded via the acidic and proteolytic activity of lysosomes, which typically have a pH of 4.5 to 5.[2,3] The final stage in the functional maturation of HLA-II begins inside the endosome, which now has multiple antigenic peptides competing to present on the polymorphic antigen binding cleft of HLA-II allomorphs. Through a process of peptide exchange, the epitope exhibiting efficacy is able to displace the endogenous class-II-associated invariant chain peptide (CLIP) to form an HLA-II and peptide complex (pHLA-II) dictated by the hydrogen bonds and noncovalent interactions between them.[4] This complex migrates toward the extracellular membrane to further present and interact with the T-cell receptor (TCR) displayed on the CD4 T cells.[5] Furthermore, the open-ended antigen binding clefts of the heterodimeric HLA-II molecule enable peptides with size of 10- to 25-mer to form the pHLA-II.[6] However, anchoring residues within the binding pocket of the antigen binding cleft typically interact with the 9-mer region of the peptide. Furthermore, they also dictate the binding preference of the TCR-pHLA-II ternary complex.[7,8] Consequently, this region on the peptide is termed as the binding core. Complementing this pathway is the “bind first, cut later” model, wherein epitopes are first scanned within the entire length of the antigenic protein by HLA-II allomorphs. Subsequently, cathepsin cleaves them outside the HLA-II binding groove to generate the free peptides that now bind to the HLA-II allotype before forming the TCR-pHLA-II complex.[1] Vaccines have yielded unparalleled success in eliciting robust acquired immunity against infectious and pathological conditions, thus substantially reducing the morbidity and mortality associated with the disease. Classical vaccines constitute the attenuated form of live microbes or complex virulent components.[9,10] Although such formulations are durable and induce long-lasting immunological memory,[11] certain studies indicate probable drawbacks that may be attributed to the superfluous contents of the vaccines, which may instigate reactogenicity such as anaphylaxis, allergic shock, or even autoimmunity.[12] Moreover, producing a stable attenuated strain of a pathogenic organism with an assurance of a lack of reversion to virulence while retaining immunogenic potency is a challenging task.[13] To overcome such potential problems, peptide vaccines are gaining considerable impetus, as evident by a search of the clinical studies repertoire (https://clinicaltrials.gov/). A total of 182 in phase-I, 115 in phase-II and 7 in phase-III trials are presently underway for peptide vaccine candidates worldwide. Such vaccines aim at inducing long-term prophylactic or therapeutic protection in addition to overcoming the challenges of classical vaccine formulation. These vaccine formulations consist of only defined immunogenic epitopes from a desired antigen that preferentially elicits a protective immune response, and they are devoid of the uncharacterized components of a whole-cell-based vaccine. In addition, virulent reversion or inactivation associated with the classical whole-cell-based vaccines is improbable with synthetic peptide vaccines.[14,15] A potent cell-mediated immune response can be instigated by peptide vaccines by the HLA-II-mediated processing and presentation of the peptides by the APCs to a TCR expressed on the surface of CD4 T cells. Therefore, an important prerequisite of this approach is the identification of peptide epitopes that show permissive binding, which means, the ability to bind an array of HLA-II allomorphs, and exhibit high immunogenicity. Additionally, such epitopes must not share sequence or structural homology with the host proteins, thus avoiding the chance of generating autoreactivity.[14] Advancement in the field of immunoinformatics has transformed various aspects of vaccinomics through methods like promiscuous epitope prediction and the in silico design of vaccine candidates.[16] Major servers that enable the prediction of promiscuous peptide candidates for HLA-II are based on the linear sequence of the peptide. Although these algorithms are easy to use and fast, some of the key challenges render inconsistencies in their predicted results. These include a distinct possibility of false-positives and false-negatives during prediction and the inability to effectively distinguish self-antigens from non-self-antigens. Furthermore, various prediction servers may also present different results when compared among themselves for the same data set, thus adding to the uncertainty. Thus the algorithms used in designing sequence-based servers may create more confusion than conclusions.[17] Structure-based approaches have been proposed as a potential way to overcome some of the previously described challenges in the field of designing peptide vaccines.[18] The present study is divided into four parts and is an attempt to understand the structural attributes among the available crystal structures of the pHLA-II complex. The study dialectically presents an enumeration for predicting peptide vaccine candidates that elicit a CD4 T-cell response. We start by statistically evaluating the dihedral angles of peptides within the crystal structure of the pHLA-II complex, analyzing their 9-mer peptide binding core, and assessing the structural contribution of the amino acid physiochemical properties on its secondary structure. Second, we present a quantitative scale for predicting the efficacy of peptides for HLA-II allomorphs by in silico mimicking the peptide exchanging mechanism.[4] Furthermore, we demonstrate the seminal contribution of the acidic microenvironment of the lyso-endosome on the structural conformation adopted by HLA-II binding peptides. Finally, we apply the insights gained in evaluating the crystal structures for their ability to distinguish HLA-II binders from nonbinders. This approach aims at addressing the challenges faced in the prediction of CD4 T-cell epitopes by sequence-based in silico tools. Subsequently, we discuss an empirically derived workflow that will enable the prediction of accurate epitopes that will significantly enhance the success of peptide-based vaccines.

Materials and Methods

Ramachandran Plots and Interpolation Model of HLA-II Epitopes

The dihedral angles, phi (φ) and psi (ψ), of the crystallized HLA-II binding peptides were measured using the Ramachandran window plot of the molecular visualization tool DeepView (Swiss-PDB viewer).[19] Next, using GraphPad Prism version 6.01, an interpolation curve was plotted through the observed values of dihedral angles among amino acid residues in the crystal structure of these HLA-II binding peptides after grouping these peptides according to their size (Table S1). The equation used to interpolate the data points was a second-order polynomial defined by y = b0 + b1*x + b2*x2. The statistical significance of the generated interpolation curve was tested by means of the observed p value, where p values ≤0.05 indicated significant deviation of the data set from the model.

B-Factor Analysis of HLA-II Epitopes

B-factor values of the studied peptides were obtained from the Protein Data Bank (PDB) coordinate files, and visual depictions were plotted using the B-factor coloring option available in PyMOL (https://pymol.org/pymol.html). Next, we grouped the studied peptides according to their size. Subsequently, for each amino acid in a peptide of a particular length, we calculated the normalized B-factor (B) value according to the following equationFinally, we computed the mean and standard deviation (SD) of the obtained normalized B-factor values to evaluate them across the different position for all of the peptides.

Molecular Docking to Assess the Efficacy of HLA-II Epitopes

Peptide exchange involved during the processing and presentation of HLA-II epitopes[4,6] was mimicked by means of molecular docking and the subsequent refinement of the protein–protein docking solution using the online servers PatchDock[20] and FireDock,[21] respectively. First, we identified the amino acid residues residing within the 4 Å region of the peptide binding site on the HLA-II allomorphs using PyMOL. This region was defined as the receptor binding sites of the studied HLA-II allomorphs. Next, the crystal structure of the CLIP peptide, obtained from PDB ID 3QXA, was used as a ligand to perform molecular docking on the defined peptide binding site of the HLA-II molecules. We also performed redocking of the same antigenic cocrystallized peptide on the HLA-II peptide binding site. Both of these docking steps were subsequently followed by molecular refinement to obtain the free-energy change (ΔG) value. Finally, the binding affinity of the best ranked docked complex was compared with that of the docked CLIP, revealed by the negative ΔG for the docked CLIP and cocrystallized antigenic peptides, to interpret their efficacy.

Two Sample Logo Assessment of the Prevalence of Amino Acids in HLA-II Binders and Nonbinders

MHCBN version 4.0[22] was used to attain an experimentally tested list of 1327 HLA-II binders and 2150 HLA-II nonbinders, respectively. Subsequently, peptides were grouped according to their size, and Two Sample Logo (http://www.twosamplelogo.org/) was used to determine and visualize the propensity of amino acids at different positions on both of the lists.

Ab initio Structure Prediction of Peptides at pH 7 Using PEP-FOLD

The online server PEP-FOLD[23] (http://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD3/) was used to predict the 3D structure of the peptides. FASTA sequences of the studied peptides were used as an input, and no reference models were provided to PEP-FOLD, and thus the 3D ab initio structures of peptides were generated at a neutral pH with the default settings. Cluster representatives from only the top five clusters were selected for the subsequent structural analysis.

Ab initio Prediction of Peptide Structures at pH 4.5 by MD Simulations and Clustering

A linear structure of the studied peptides, whereby the dihedral angles φ and ψ were kept at 180° and 180°, was generated using the AgrusLab molecular modeling tool (http://www.arguslab.com/arguslab.com/ArgusLab.html). Subsequently, these were subjected to a minimization of 10 000 steps, which included the first 5000 steps of steepest descent and the next 5000 steps of conjugate gradient with a restraint weight of 10 kcal/(mol·Å2) on the backbone atoms of the peptides. Next, the system was gradually heated for 400 ps at a constant volume to 300 K. The equilibration step was performed for 4 ns in the NPT ensemble. Finally, a production run was performed for 100 ns at pH 4.5 with a salt concentration of 0.1 M. The protonation states of residues were maintained according to the pH of the system. All molecular dynamics (MD) simulations were done in an explicit environment using Amber16, and the trajectories were visualized and analyzed using Visual Molecular Dynamics (VMD)[24] and the cpptraj module.[25] Next, using the cpptraj module of AmberTool 16, we extracted every fifth frame from the 50 000 frames generated during the MD simulation and appended these snapshots in a separate PDB file format. These extracted 10 000 structures were subjected to clustering analysis using the MaxCluster software suite (http://www.sbg.bio.ic.ac.uk/maxcluster/). The Algorithm used for clustering was the nearest-neighbor method,[26] with initial clustering size kept at default, that is, three. The obtained clusters, their size and spread, along with the cluster representative, that is, the centroid structure, are listed in the Table S2. Only the cluster representatives of the top five bins or fewer were used for further structural analysis.

Comparing the Root-Mean-Square Deviation and Radius of Gyration of the Ab Initio Predicted Peptides with Their Crystal Counterparts

Superposition server SuperPose (http://wishart.biology.ualberta.ca/SuperPose/)[27] was used to compute the global backbone root-mean-square deviation (RMSD) between the cluster representatives of the ab initio predicted structures and their crystal structure counterparts. Similarly, the radius of gyration (RG) for all of the respective structures was computed using the online version of CRYSOL (https://www.embl-hamburg.de/biosaxs/atsas-online/crysol.php) with default settings.

Characterization of the Helical Content and Charge on the Peptides at Different pH Values

The freely assessable servers Agadir (http://agadir.crg.es/) and PROTEIN CALCULATOR version 3.4 (http://protcalc.sourceforge.net/) were used to compute the helical content and the net charge on the studied peptides in different pH ranges. Amino acid sequences of the studied peptides were used as the input for both the servers.

Evaluating the Structure and Efficacy of Experimentally Tested HLA-II Binders and Nonbinders

The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/)[28] was used to identify peptides that are experimentally tested for their ability to bind HLA-II allomorphs. The peptides with lengths of 13-, 14-, and 15-mer in size were chosen as the test peptides to evaluate the present study. Structures of all of the peptides were first modeled by subjecting to MD simulations at pH 4.5 following the previously given protocol. This was followed by clustering to obtain the top five representative structures using MaxCluster and computing the dihedral angles using DeepView (Swiss-PDB viewer).[19] For the structural analysis, the interpolation curve was plotted through the observed values of the dihedral angles at each position across the length of the test peptides. This was then compared with the values of cocrystallized peptides of 13-, 14-, and 15-mer used in the study. To assess the efficacy of the test peptides, docking and refinement were performed using PatchDock[20] and FireDock[21] on the cluster representatives of all of the test peptides onto HLA-II allomorphs (DRB1*0101; PDB ID 1SJE, DPA1*0103 DPB1*0201; PDB ID 4P57, DQB1*0603; PDB ID 1UVQ). No outliers were excluded from the study.

Statistical Analysis

All statistical analyses was done using GraphPad Prism version 6.01. Graphs were plotted using GraphPad Prism version 6.01 or MS Excel. The statistical significance was computed using the paired parametric t test or the unpaired t test, and the level of significance was determined using p values, where p > 0.05 was nonsignificant (NS) and P ≤ 0.05 was considered significant (p ≤ 0.05: *; p ≤ 0.01: **; p ≤ 0.001: ***).

Results

Interpreting the Structure Relatedness among HLA-II-Binding Peptides

Screening of the cocrystal structures of HLA-II binders from PDB, which had a size of not less than 10-mer, revealed a total of 30 peptides. Of the obtained peptides, 20 were presented on HLA-II type DR, 4 on HLA-II type DP, and 6 on HLA-II type DQ. Along with the HLA-II type, these were also shortlisted according to the size of the binding peptide; subsequently, dihedral angles of the peptide were computed at each position and were listed (Table S1, sheets 1–3). For the peptides grouped by size, an interpolation curve of a second-order polynomial (quadratic) function was plotted along the dihedral values, whereby the mean and the SD are depicted as red lines and dotted blue lines in Figure A. Instances of extreme values of dihedral angles are isolated and are distinctively outside the interpolation model, indicating the structural semblance among HLA-II binding peptides. This can be statistically verified by the nonsignificant P values of deviation, with the only exception being the φ angles of peptides of 11-mer size. Next, to illustrate the static occupancy and dynamic mobility among amino acids of the HLA-II binding peptide, we visualized their B-factor values by coloring them according to low and high B-factor segments, depicted in blue and red, respectively, in Figure B. Furthermore, we normalized the B-factor values of all amino acid residues in the peptides and then plotted the mean and SD at each position along the peptide, grouped according to size in Figure C. The color spectrum and heterogeneity among B-factor values in the flanking region, as opposed to the central residues, depict the prevalence of proximal structural conservation among amino acids that may interact with the HLA-II peptide binding core. This observation is in accordance with previous reports.[8] Interestingly, each peptide in the study had a distinct set of amino acids; however, the interpolation curve across the dihedral angles also indicated less SD in the peptide central region, as opposed to the flanking sections (Figure A).
Figure 1

HLA-II bound peptides exhibited structural relatedness across binders of various lengths. The lists of cocrystallized structures of peptides presented on HLA-II were segregated according to their size. (A) The nonsignificant deviation in “p values” for the interpolation curve of the second-order polynomial function plotted across dihedral angles indicates structural relatedness among HLA-II epitopes, with the exception of phi (φ) angles in 11-mer peptides. (B) The color spectrum obtained during B-factor analysis indicates mobility among amino acids in the flanking region, as opposed to core residues. (C) The plotted “mean ± SD” of the normalized B-factor values in amino acids of peptides grouped according to their size corroborates structural conservation among residues in the proximal region. The number of peptides belonging to specified sizes are indicated for panels A and C. In panel A, p > 0.05 was considered as nonsignificant.

HLA-II bound peptides exhibited structural relatedness across binders of various lengths. The lists of cocrystallized structures of peptides presented on HLA-II were segregated according to their size. (A) The nonsignificant deviation in “p values” for the interpolation curve of the second-order polynomial function plotted across dihedral angles indicates structural relatedness among HLA-II epitopes, with the exception of phi (φ) angles in 11-mer peptides. (B) The color spectrum obtained during B-factor analysis indicates mobility among amino acids in the flanking region, as opposed to core residues. (C) The plotted “mean ± SD” of the normalized B-factor values in amino acids of peptides grouped according to their size corroborates structural conservation among residues in the proximal region. The number of peptides belonging to specified sizes are indicated for panels A and C. In panel A, p > 0.05 was considered as nonsignificant.

Evaluating the Structural Semblance within the Peptide Core That Interacts with the Anchor Positions on the HLA-II Bed

To further corroborate the existence of structural semblance across the peptide binding core that interacts with the HLA-II bed, we arranged the peptides according to the HLA-II allomorphs, viz. DR, DP, and DQ. The 9-mer core binding segments of the bonded peptides were manually identified by their interaction corresponding to the anchor positions of the HLA-II bed. Subsequently, the obtained segments were also cross-verified in the literature[29] and are highlighted in yellow in Figure A. Next, the computed mean and SD of the dihedral angles within the binding core are represented according to all of the HLA allomorphs in Figure B and also by the subtypes of HLA-DR, viz. DR B1, DR B5, and DR B3, respectively, in Figure C. The highly flexible flanking regions of the peptides are attributed to the open-ended cleft of HLA-II,[8] and this can be observed in peptides represented by pep8, pep12, pep13, pep1, pep5, and pep29, as a few examples (Figure A). Furthermore, comparing the 9-mer peptide binding core across all of the studied HLAs enabled us to structurally evaluate them irrespective of their sizes and also nullify the flexible segments within the flanking region of the HLA-II binder. Interestingly, dihedral angle ψ of the core binding regions across all 30 HLA bonded peptides appeared to be more structurally conserved across the entire 9-mer length, except for position 3. However, apart from positions 2, 7, 8, and 9 of the core regions, dihedral angle φ of the HLA-II binders does not appear to share such similarity across the entire 9-mer length (Figure B). Furthermore, the high SD of ψ angles at position 3 can be attributed to the peptides presented on the HLA-II allomorphs, DP and DQ, as peptides presented across the three different HLA-II subtypes of DR exhibit a low SD to the computed mean of ψ angles at position 3 (Figure C).
Figure 2

Identifying the 9-mer binding core of peptides presented in association with HLA-II enabled their evaluation, irrespective of their sizes. Anchored residues on the open-ended HLA-II cleft interacted with the 9-mer regions of peptides. (A) 9-mer regions of the studied peptides are highlighted in yellow, and visual analysis reaffirmed the existence of flexible flanking residues. (B) The mean value of the psi (ψ) angles of the studied peptides indicates that HLA-II subtypes exhibited similarity across the entire length, baring position 3, whereas the phi (φ) angles share similar values at positions 2, 7, 8, and 9 of the peptide binding cores. (C) Assertion of the structural similarity in dihedral angles across HLA-II subtypes: DR (B1, B3, B5), DP, and DQ. For panels B and C, structural similarity is established by comparing the mean and standard deviation (SD) of the dihedral angles, φ and ψ, at each position, and the number of peptides used for the computation is also indicated.

Identifying the 9-mer binding core of peptides presented in association with HLA-II enabled their evaluation, irrespective of their sizes. Anchored residues on the open-ended HLA-II cleft interacted with the 9-mer regions of peptides. (A) 9-mer regions of the studied peptides are highlighted in yellow, and visual analysis reaffirmed the existence of flexible flanking residues. (B) The mean value of the psi (ψ) angles of the studied peptides indicates that HLA-II subtypes exhibited similarity across the entire length, baring position 3, whereas the phi (φ) angles share similar values at positions 2, 7, 8, and 9 of the peptide binding cores. (C) Assertion of the structural similarity in dihedral angles across HLA-II subtypes: DR (B1, B3, B5), DP, and DQ. For panels B and C, structural similarity is established by comparing the mean and standard deviation (SD) of the dihedral angles, φ and ψ, at each position, and the number of peptides used for the computation is also indicated.

Interpreting the Prevalence of Amino Acid Type and the Structural Effect of Its Physiochemical Properties for the Residing Residues on the 9-mer HLA-II Binding Core

Twenty proteinogenic α-amino acids have structurally distinct side-chain (R group) moieties and thus can be divided according to their physicochemical properties.[30] We evaluated the frequency of different amino acids within the studied 9-mer peptide binding core by plotting a frequency chart of individual amino acids present in it, which are depicted by a sequence logo map according to their position in Figure A. Any significant prevalence of a particular type of amino acid at a particular position could not be attributed within the entire region of the core peptide. This may be due to the low data set of only 30 available crystal structures of human HLA-II cocrystallized with peptides that are available in PDB and are used in the present study. Furthermore, to validate the stated observation, we shortlisted an HLA-II peptide data set of known (experimentally tested) binders and nonbinders according to their size from a previous study.[22] Subsequently we plotted the frequency of amino acids at each position, which is depicted in two sample logos in Figure S1. In accordance with the fundamental working principal of sequence-based HLA-II prediction algorithms, we observed the differential preference of specific amino acid types at particular positions among the peptide data sets of known HLA-II binders and nonbinders.
Figure 3

Elucidating the effect on dihedral angles due to the physiochemical properties of amino acid residues within the 9-mer HLA-II binding core. (A) The sequence logo plot depicts the frequency of 20 amino acid types at each position of the 9-mer HLA-II binding core. (B) Computed “mean ± SD” of 20 amino acids grouped by physiochemical properties of the amino acid residues residing in the 9-mer peptide binding core, indicating the contribution of the R group to the behavior of dihedral angles.

Elucidating the effect on dihedral angles due to the physiochemical properties of amino acid residues within the 9-mer HLA-II binding core. (A) The sequence logo plot depicts the frequency of 20 amino acid types at each position of the 9-mer HLA-II binding core. (B) Computed “mean ± SD” of 20 amino acids grouped by physiochemical properties of the amino acid residues residing in the 9-mer peptide binding core, indicating the contribution of the R group to the behavior of dihedral angles. Additionally, in Figure B, we evaluated the mean and SD of dihedral angles φ and ψ among the 20 amino acid types, grouped according to five different physiochemical properties ((i) Nonpolar, aliphatic R groups, (ii) aromatic R groups, (iii) positively charged R groups, (iv) negatively charged R groups, and (v) polar, uncharged R Groups), and their position on the HLA-II binding core. Looking back to the high SD of ψ angles at the third position of the core peptide (Figure B), we observed that the amino acids glycine, phenylalanine, and proline may occupy positions on the sterically disallowed segments of the Ramachandran plot at this position (Figure B). This may be well accredited to fact that glycine has no side chain, so it can occupy dihedral angles on all four quadrants of Ramachandran plot. Similarly, the pyrrolidine ring of proline conformationally restricts it from adopting a helix breaker structure.[31] In essence, comparing the dihedral angles among the amino acids grouped by physiochemical properties enables us to imply the observation that conformationally, ψ angles tend to form a linear shape and are clustered together. However, opposed to ψ angles, the multitude of φ angles in amino acids of the core peptide occupied positions around a much larger region, thus their scattering on the upper left quadrant of the Ramachandran plot (Table S1, sheet 3). Therefore, HLA-II binding peptides are predominantly linear and are devoid of any particular secondary structural component, that is, α helix or β sheet.

Peptide Exchange between CLIP and Antigenic Epitopes May Be Mimicked via Molecular Docking, Thus Providing a Quantitative Scale to Predict HLA-II Binders

The antigenic presentation pathway across HLA-II molecules involves a peptide exchange mechanism within the compartment of the lyso-endosome, and during this process, only a high-affinity antigenic peptide replaces the HLA-II-associated invariant chain peptide, CLIP.[4] We have successfully mimicked these events using an in silico molecular docking approach. In Figure , we have compared the binding efficacies of the crystal structure of the CLIP peptide (obtained from PDB ID 3QXA) and the antigenic peptides that we presently study against the appropriate HLA-II receptors. As revealed by the negative ΔG for CLIP and the cocrystallized antigenic peptide, the antigenic peptide has a much stronger affinity as compared with the CLIP peptide in 26 of the 30 available structures of HLA-II subtypes (Figure A–C). The HLA-II bonded peptide represented by PDB ID 1KLU was among the four instances where the antigenic peptide exhibits a stronger affinity than the CLIP peptide (Figure A). Interestingly both the 1KLU bonded peptide and another peptide represented by PDB ID 1KLG were part of a study by Sundberg et al., which demonstrated the altered potential of wild-type and mutant peptides (represented by PDB IDs 1KLU and 1KLG, respectively) to induce the CD4 T-cell response in the context of a melanoma tumor.[32] Remarkably, in accordance with tolerance mechanisms,[33] the present in silico docking simulation was able to account for the inconsequential difference, at the level of sequence and structure (Table S1),[32] between the wild-type and mutant peptides, which can be observed in the altered binding efficacy (ΔG values) for its HLA-II subtype, DRB1*0101 (Figure A).
Figure 4

Efficacy of potential HLA-II binders can be evaluated by comparing the docking score with the CLIP peptide. (A–C) 26 times out of 30, antigenic epitopes exhibited stronger efficacy than the CLIP peptide, as observed by the negative free-energy change (ΔG), after docking them to the binding cavity of HLA-II subtypes. (A) Among the exceptions, the peptide represented by PDB ID IKLU is a wild type from the triose-phosphate isomerase (TPI) enzyme and exhibits less efficacy in comparison with CLIP for its HLA-II receptor against its mutant counterpart peptide, represented by PDB ID 1KLG. This is in accordance with the results observed by Sundberg et al. and the tolerance mechanism. RMSDs of redocked peptides are shown in comparison with (D) their crystal counterparts and (E) the CLIP structure obtained from PDB ID 3QXA. Redocked peptides and crystal counterparts are shown in green and blue, respectively. (F) Overall, the low range of RMSD values established the validity of the molecular docking.

Efficacy of potential HLA-II binders can be evaluated by comparing the docking score with the CLIP peptide. (A–C) 26 times out of 30, antigenic epitopes exhibited stronger efficacy than the CLIP peptide, as observed by the negative free-energy change (ΔG), after docking them to the binding cavity of HLA-II subtypes. (A) Among the exceptions, the peptide represented by PDB ID IKLU is a wild type from the triose-phosphate isomerase (TPI) enzyme and exhibits less efficacy in comparison with CLIP for its HLA-II receptor against its mutant counterpart peptide, represented by PDB ID 1KLG. This is in accordance with the results observed by Sundberg et al. and the tolerance mechanism. RMSDs of redocked peptides are shown in comparison with (D) their crystal counterparts and (E) the CLIP structure obtained from PDB ID 3QXA. Redocked peptides and crystal counterparts are shown in green and blue, respectively. (F) Overall, the low range of RMSD values established the validity of the molecular docking. To further validate our observation that docking scores are reflective of peptide–HLA cocrystal structures, RMSD comparisons of the redocked peptide structures with their cocrystallized antigenic counterparts and with the crystal structure of CLIP peptide (PDB ID 3QXA) were performed (Figure D,E), respectively. Subsequently, the low range of RMSD values, as shown, endorsed the molecular docking approach as a means to study the peptide exchange mechanism (Figure F). Of note, the crystal structure of 3QXA has a CLIP peptide bound to HLA-DR1, and the relatively higher RMSD in comparison with the CLIP peptide docked onto different HLA-II alleles is due the sequence and structural variations among the HLA-II allomorphs (Figure E,F). In essence, we may consider this to be a successful attempt to mimic the peptide exchange mechanism on a quantitative in silico scale and also to establish a Goldilocks zone for predicting HLA-II binders and potentially even their immunogenicity. Furthermore, this also has the potential to be used as a scale to quantify the efficacy and promiscuity of the antigenic peptides for HLA-II and thus to assist in designing potential candidates for peptide vaccines.

Acidic Conditions within the Lyso-Endosomal Compartment Impart Significance on the Charge and Structure of Antigenic Peptides

Structures of peptides are highly dynamic in nature, and the physicochemical environment can substantially contribute to their secondary structural components.[34,35] Antigenic peptides presented on the HLA-II surface are processed inside the acidic and degradative environment of the lyso-endosomal compartment, typically having a pH of 4.5.[2] We decided to test the structural implication of such an environment on the peptides. Amino acids are zwitterions; however, with decreasing pH, they typically become anionic, as the H+ ion adds to the carboxyl group (COO–). This can be observed in the studied peptides, that is, a rise in the net charge with the change of pH from neutral to acidic (Figure A). A progressive decrease in the helical content of the studied peptides with the decrease in the pH from 8.6 was also noticed (Figure S2A). Therefore, we have predicted the 3D structures of the studied peptides at pH 7 (using PEP-FOLD) and pH 4.5 (using MD simulation), followed by structurally evaluating their cluster representatives at both of the pH values in the context of their crystal structure counterpart (Figure S2B, Table S2). Comparing the average RG values among the top five cluster representatives of the ab initio predicted structures reveals distinct structural conformations among the same peptides at different pH values. Interestingly, the average RG of the predicted peptide structures at pH 4.5 was between the RG values of the crystal structure and the predicted peptide structures at pH 7 (Figure B). Subsequently, we computed the RMSDs of the cluster representatives of the ab initio predicted peptide structures with their crystal structure counterparts. The comparison of the RMSDs presented in Figure C demonstrates that the MD simulation under acidic conditions significantly reduced the predicted peptide structural deviation compared with its counterpart at neutral pH 7. Furthermore, 26 times out of 30, at least one or more cluster-representative peptide obtained via post-pH-mediated MD simulation had an RMSD with its crystal counterpart of <5 Å (Figure D).
Figure 5

MD simulation suggests that the acidic conditions of the lyso-endosomal compartment helped peptides to adopt a linear conformation, assisting their interaction with the HLA-II. (A) A significant rise in the net charge distribution on the peptides was observed at acidic pH 4.5 compared with neutral pH 7. Cluster representatives of the ab initio predicted structures of peptides, obtained at pH 7 (by PEP-FOLD) and pH 4.5 (by MD simulation), were evaluated by crystal structure. (B) A box plot comparing the radii of gyration of peptides suggested that MD simulations at acidic pH also linearize the peptide structures, leading to their resemblance to the crystal structures of the peptides. (C,D) Overall, a significant lessening of structural deviation was observed in MD-simulated peptides at pH 4.5, in contrast with that at neutral pH. Furthermore, in 26 out of 30 peptides, a cluster representative of the MD-simulated peptide at pH 4.5 had an RMSD with crystal structure of <5 Å. Each dot signifies the cluster representatives, and the bar diagram depicts the mean ± SEM. (A,C) Students’ paired t test and unpaired t test were used to determine the significance of deviation in the net charge on the peptides and the RMSD with the crystal structure of peptides. *, p ≤ 0.05; **, p ≤ 0.01; ***, p ≤ 0.001.

MD simulation suggests that the acidic conditions of the lyso-endosomal compartment helped peptides to adopt a linear conformation, assisting their interaction with the HLA-II. (A) A significant rise in the net charge distribution on the peptides was observed at acidic pH 4.5 compared with neutral pH 7. Cluster representatives of the ab initio predicted structures of peptides, obtained at pH 7 (by PEP-FOLD) and pH 4.5 (by MD simulation), were evaluated by crystal structure. (B) A box plot comparing the radii of gyration of peptides suggested that MD simulations at acidic pH also linearize the peptide structures, leading to their resemblance to the crystal structures of the peptides. (C,D) Overall, a significant lessening of structural deviation was observed in MD-simulated peptides at pH 4.5, in contrast with that at neutral pH. Furthermore, in 26 out of 30 peptides, a cluster representative of the MD-simulated peptide at pH 4.5 had an RMSD with crystal structure of <5 Å. Each dot signifies the cluster representatives, and the bar diagram depicts the mean ± SEM. (A,C) Students’ paired t test and unpaired t test were used to determine the significance of deviation in the net charge on the peptides and the RMSD with the crystal structure of peptides. *, p ≤ 0.05; **, p ≤ 0.01; ***, p ≤ 0.001. We may conclude that HLA-II bonded peptides are predominantly linear and devoid of any regular secondary structural features (Figure B, Table S1).[8] Interestingly, the studied peptides adopted a completely different conformation and were predominantly helical at neutral pH. The MD simulation study suggested that an acidic environment may restrict a peptide’s ability to adopt a helical conformation and allow it to remain linear, which may facilitate its interaction with the HLA-II receptor.

Comparing the Experimentally Known HLA-II Binders with Nonbinders Revealed a Distinct Structural Orientation and Binding Efficacy

We applied the insights gained while analyzing the peptide HLA-II cocrystallized complexes to evaluate the experimentally established HLA-II binders from nonbinders. A total of six test peptides of 13-, 14-, and 15-mer sizes were selected from the PRIDE database[28] (Figure A). Importantly, the HLA-II binders used (T_pep2_B, T_pep4_B, and T_pep6_B, shown in blue) are critical peptides that were also used to test and prove the accuracy of the latest generation of the HLA-II predictor MARIA.[36] Although test peptides T_pep4_B and T_pep6_B were known HLA-II binders, they were reported as nonbinders by NetMHCIIpan with percentile scores of 30 and 10, respectively (>90% score is considered for binders). The remaining three peptides were randomly selected nonbinders, which are neoantigens that were reported in a negative immunogenicity assay (T_pep1_NB, T_pep3_NB, and T_pep5_NB, shown in red).[37−40] As per the used protocol in the present study, we simulated the peptides at pH 4.5 and performed clustering to obtain five representative model structures of the test peptides. The structural comparison of the interpolated dihedral angles of test peptides against those of the cocrystallized peptides used in the study (Figure ) is shown in Figure B–D and Figure S3. Primarily, post-MD simulation, binder test peptides adopted a structural conformation similar to that of cocrystallized peptide structures, whereas the nonbinding counterparts adopted a distinctively dissimilar conformation. To further contemplate the observation, we checked the efficacy of the test peptides against three HLA-II allomorphs (Figure E–G). In seven out of nine instances, that is, 77.7%, the known binder test peptides had better efficacy for HLA-II allomorphs over the known nonbinders. Notably, the two instances where the known nonbinder peptide had better affinity, and efficacy for HLA-DP and DQ, were the 14-mer test peptides, despite the distinctive difference in the dihedral angles (Figure C,F). Furthermore, the overall efficacy of the known binder test peptides was lower compared with the CLIP peptide. However, this variation could potentially be attributed to the conformation of the ligand (whether it is of a cocrystal structure for CLIP or an ab initio modeled structure for the test peptides) used for docking. We also explored the ability of our methodology to identify three well-characterized and validated immunodominant SARS-CoV-2 CD4 epitopes that bind HLA-DRB1*04:01.[41] The predicted peptide structures obtained after pH-mediated MD simulations were linear, and a stable efficacious binding was observed when they docked onto their cognate HLA-DRB1*04:01, with an average affinity score (ΔG) of around −40 kcal/mol (Figure S3D–G).
Figure 6

Distinct structure and binding efficacy differentiate the HLA-II binders from nonbinders. (A) A list of test peptides with established immunogenicity were selected from the PRIDE database to differentiate HLA-II nonbinders from binders, depicted in red and blue, respectively. (B–D) Interpolation curves across the dihedral angles of test peptides indicate a dissimilar conformation of nonbinders in comparison with the crystal peptides and HLA-II binding test peptides. (E–G) This was also substantiated in terms of the overall low efficacy of nonbinding test peptides as compared with the HLA-II binding test peptides.

Distinct structure and binding efficacy differentiate the HLA-II binders from nonbinders. (A) A list of test peptides with established immunogenicity were selected from the PRIDE database to differentiate HLA-II nonbinders from binders, depicted in red and blue, respectively. (B–D) Interpolation curves across the dihedral angles of test peptides indicate a dissimilar conformation of nonbinders in comparison with the crystal peptides and HLA-II binding test peptides. (E–G) This was also substantiated in terms of the overall low efficacy of nonbinding test peptides as compared with the HLA-II binding test peptides. In essence, there may be a subtle difference in the structural orientation of HLA-II binding peptides, and the use of pH-mediated MD simulations coupled to docking can be a potential method to differentiate such peptides.

Discussion

Vaccines against diseases like HIV, hepatitis C, etc. still remain elusive,[42−44] and many available vaccination strategies for diseases such as tuberculosis, malaria, etc. call for improvements.[45,46] Clinical trials of peptide vaccines corroborate their incipient efficiency as an approach for developing or refining the efficiency of vaccine candidates (https://clinicaltrials.gov/). Interestingly, identifying conserved immunodominant epitopes among serological variants of pathogens may aid in designing vaccine candidates with a broad spectrum protection against the serovars.[47] Furthermore, a desirable single or multifragmented epitope may be limited in inducing not only a cell-mediated immune response but also potent humoral immunity[48] and T–B cell reciprocity.[7,49] Therefore, chemically synthesized single or multiple peptide epitopes, with appropriate adjuvants and excipients, may form a safer alternate to whole-cell vaccines in imparting prophylactic or therapeutic protection. The emergence of immunoinformatics-based tools and techniques has significantly contributed to the understanding of diverse aspects of immunology, such as the etiology of autoimmunity, tumor immunology, and so on.[50,51] In the relatively young field of vaccinomics,[52] algorithms that make predictions are also gaining a considerable impetus in envisaging immunomodulators[16] and vaccine adjuvant.[53] The fundamental premise of in silico prediction in peptide vaccines is to shortlist hitherto unknown epitopes as potential vaccine candidates. However, the majority of CD4 epitope prediction tools are still based on the sequence-based approach. The efficacy of such algorithms is inextricably linked to not just the quantity but also, more importantly, the quality of the data set and the training model employed. Although a few of these algorithms intrepidly claim to be reliable, fast, and simple to use, the limited availability of translatable in vivo applications in the literature suggests the disquieting prospect of such sequence-based methods. Interestingly, structure-based epitope prediction has the ability to overcome the demerits of sequence-based methods.[18] Thus the raison d’etre for structure-based screening approaches over the available sequence-based algorithms is ideally to set a benchmark for the following criteria: (i) Overlapping fragments must be scanned from an array of antigens to eliminate immunosuppressive regions, and a selected number of immunodominant epitopes must be shortlisted. (ii) Such epitopes must not elicit autoreactivity, thus ensuring that they do not exhibit sequence and structural homology with the host proteins. (iii) The promiscuity of such epitopes to transcend the challenge of the extensive HLA-II polymorphism must be considered, ensuring that the result is relevant from a global perspective. (iv) A judicious balance of sensitivity and specificity must be assured at all times, thus eliminating a false sense of reliability while performing the costly wet bench assays of the screened epitopes. Another important value addition in structure-based T-cell epitope prediction could be the subsequent post-differentiation assessment of multimeric pHLA-II and TCR complexes to predict the distinct CD4 T-cell phenotypes that predominantly should include Th1, Th2, and Th17 and induced regulatory T (iTreg) cells.[54] The cytokine milieu during the TCR activation also plays a central role in determining the fate during the effector function.[54,55] Interestingly in silico resources also exist to evaluate a peptide’s ability to induce cytokines such as IFN-γ, IL-4, IL-10, and IL-17,[56−59] rendering them an essential resource for predicting peptide vaccine candidates. The present study is an attempt to aid in resolving the conundrums involved the in silico prediction of CD4 epitopes,[17] which is essential for designing peptide vaccines. Dihedral angle analysis of the available cocrystallized HLA-II peptide is a potential way to structurally evaluate and distinguish binders from nonbinders. The analysis of the structural aspects of 9-mer core binding segments, along with their amino acid sequences, can form the basis to assess the cognate pHLA-II interaction. We also demonstrate the notable effect of an acidic microenvironment in the lyso-endosome on the peptide structure. The role of acidic, denaturing, and reducing conditions within the degradative environment of lysosomes may not only be limited to imparting a significant structural contribution on the dynamic conformations of a peptide generated during the “cut first, bind later” pathway but also may affect the kinetics of immunogenicity by unfolding antigenic proteins, rendering them accessible to HLA-II allomorphs for the “bind first, cut later” model. Thus for deciphering a potential HLA-II binder, MD simulations at pH 4.5 may be used to predict the starting structure(s) and to obtain the dihedral angles for the peptide conformation. Next, predicted structures can be compared with the dihedral angles of crystal structures of antigenic peptides, illustrated in the interpolation model. Furthermore, a potential HLA-II binder must possess the ability to replace CLIP to present itself on the TCR expressed on CD4 T cells.[4] Our study hitherto demonstrates that molecular docking can be used effectively to obtain the peptide efficacy for an HLA-II receptor. Importantly, this can also be further extended to study the promiscuity of potential HLA-II binders by studying their efficacy for different HLA-II allomorphs. The previously described structure-guided approach can be used as a distinct filter, in combination with statistical tests at each step, for assessing quantitatively and qualitatively the potential HLA-II binders and their promiscuity. The analysis of predictions of the test peptides (known binders and nonbinders) taken from PRIDE databases also corroborates the approach presented in this study. Furthermore, peptides from SARS-CoV-2 have also been shown to be efficient binders using this methodology. A schematic overview, with the stated canonical approach for interpreting HLA-II binders, within antigenic proteins is depicted in Figure . Structure-based methods, when implemented in a high-throughput pipeline, may be limited by the requirement of computational power to perform MD simulations, leading to a delay in the availability of results to users. In addition, the limited availability of crystal structures of HLA-II allomorphs with diverse antigenic peptides and the lack of positive and negative controls make it difficult to obtain the accuracy, sensitivity, and specificity of structure-based prediction algorithms. However, these methods are likely to score better than sequence-based methods for predicting potential vaccine epitopes. Thus a separate study employing a hybrid approach involving the structure-based prediction and wet bench validation may provide the requisite standardized data set for feeding into algorithms to be implemented in the in silico vaccine prediction techniques. Studies on similar lines have been performed for cancer epitopes and infective diseases;[60−62] however, the data set with the peptide HLA-II binding stoichiometry remains missing.
Figure 7

Empiricist layout for deciphering the HLA-II binding epitopes. The scanning process is initiated by generating overlapping peptide fragments of requisite size from the antigenic protein. These peptides can be subjected to MD simulations in a pH environment, followed by clustering to select representative structures. These structures can next be subjected to the first filter, that is, based on their torsion angles, a comparative structural analysis with a known data set to assert the likelihood of interaction with HLA-II. Molecular docking can serve as a second filter to evaluate the efficacy of a potential HLA-II binder and sort them according to their promiscuity.

Empiricist layout for deciphering the HLA-II binding epitopes. The scanning process is initiated by generating overlapping peptide fragments of requisite size from the antigenic protein. These peptides can be subjected to MD simulations in a pH environment, followed by clustering to select representative structures. These structures can next be subjected to the first filter, that is, based on their torsion angles, a comparative structural analysis with a known data set to assert the likelihood of interaction with HLA-II. Molecular docking can serve as a second filter to evaluate the efficacy of a potential HLA-II binder and sort them according to their promiscuity. In conclusion, our study attempts to holistically understand the diverse structural aspects of the cognate peptide and HLA-II interactions, thereby laying down the foundation for a structure-guided screening methodology to discern epitopes for peptide vaccines.
  62 in total

Review 1.  Folding, aggregation and molecular recognition in peptides.

Authors:  I L Karle
Journal:  Acta Crystallogr B       Date:  1992-08-01

2.  In silico tools for predicting peptides binding to HLA-class II molecules: more confusion than conclusion.

Authors:  Uthaman Gowthaman; Javed N Agrewala
Journal:  J Proteome Res       Date:  2007-11-23       Impact factor: 4.466

Review 3.  Past, present and future: 30 years of HIV research.

Authors:  Françoise Barré-Sinoussi; Anna Laura Ross; Jean-François Delfraissy
Journal:  Nat Rev Microbiol       Date:  2013-10-28       Impact factor: 60.633

4.  VMD: visual molecular dynamics.

Authors:  W Humphrey; A Dalke; K Schulten
Journal:  J Mol Graph       Date:  1996-02

Review 5.  The cytokine milieu in the interplay of pathogenic Th1/Th17 cells and regulatory T cells in autoimmune disease.

Authors:  Stewart Leung; Xuebin Liu; Lei Fang; Xi Chen; Taylor Guo; Jingwu Zhang
Journal:  Cell Mol Immunol       Date:  2010-04-12       Impact factor: 11.530

6.  Stereochemical punctuation marks in protein structures: glycine and proline containing helix stop signals.

Authors:  K Gunasekaran; H A Nagarajaram; C Ramakrishnan; P Balaram
Journal:  J Mol Biol       Date:  1998-02-06       Impact factor: 5.469

7.  Predicting HLA class II antigen presentation through integrated deep learning.

Authors:  Binbin Chen; Michael S Khodadoust; Niclas Olsson; Lisa E Wagar; Ethan Fast; Chih Long Liu; Yagmur Muftuoglu; Brian J Sworder; Maximilian Diehn; Ronald Levy; Mark M Davis; Joshua E Elias; Russ B Altman; Ash A Alizadeh
Journal:  Nat Biotechnol       Date:  2019-10-14       Impact factor: 54.908

Review 8.  Peptide-based synthetic vaccines.

Authors:  Mariusz Skwarczynski; Istvan Toth
Journal:  Chem Sci       Date:  2015-12-17       Impact factor: 9.825

9.  FireDock: a web server for fast interaction refinement in molecular docking.

Authors:  Efrat Mashiach; Dina Schneidman-Duhovny; Nelly Andrusier; Ruth Nussinov; Haim J Wolfson
Journal:  Nucleic Acids Res       Date:  2008-04-19       Impact factor: 16.971

10.  Designing of interferon-gamma inducing MHC class-II binders.

Authors:  Sandeep Kumar Dhanda; Pooja Vir; Gajendra P S Raghava
Journal:  Biol Direct       Date:  2013-12-05       Impact factor: 4.540

View more
  2 in total

1.  Empirical Study of Large-Scale HLA Simulation of Parallel Region-Matching Knowledge Recognition Algorithm Based on Region Matching.

Authors:  Guohua Zhu; Haizhou Wang
Journal:  Comput Intell Neurosci       Date:  2022-04-11

2.  Proteomics and Its Application in Pandemic Diseases.

Authors:  Suman S Thakur
Journal:  J Proteome Res       Date:  2020-11-06       Impact factor: 4.466

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.