Literature DB >> 17254312

Prediction of desmoglein-3 peptides reveals multiple shared T-cell epitopes in HLA DR4- and DR6-associated pemphigus vulgaris.

Joo Chuan Tong1, Tin Wee Tan, Animesh A Sinha, Shoba Ranganathan.   

Abstract

BACKGROUND: Pemphigus vulgaris (PV) is a severe autoimmune blistering skin disorder that is strongly associated with major histocompatibility complex class II alleles DRB1*0402 and DQB1*0503. The target antigen of PV, desmoglein 3 (Dsg3), is crucial for initiating T-cell response in early disease. Although a number of T-cell specificities within Dsg3 have been reported, the number is limited and the role of T-cells in the pathogenesis of PV remains poorly understood. We report here a structure-based model for the prediction of peptide binding to DRB1*0402 and DQB1*0503. The scoring functions were rigorously trained, tested and validated using experimentally verified peptide sequences.
RESULTS: High predictivity is obtained for both DRB1*0402 (r2 = 0.90, s = 1.20 kJ/mol, q2 = 0.82, s(press) = 1.61 kJ/mol) and DQB1*0503 (r2 = 0.95, s = 1.20 kJ/mol, q2 = 0.75, s(press) = 2.15 kJ/mol) models, compared to experimental data. We investigated the binding patterns of Dsg3 peptides and illustrate the existence of multiple immunodominant epitopes that may be responsible for both disease initiation and propagation in PV. Further analysis reveals that DRB1*0402 and DQB1*0503 may share similar specificities by binding peptides at different binding registers, thus providing a molecular mechanism for the dual HLA association observed in PV.
CONCLUSION: Collectively, the results of this study provide interesting new insights into the pathology of PV. This is the first report illustrating high-level of cross-reactivity between both PV-implicated alleles, DRB1*0402 and DQB1*0503, as well as the existence of a potentially large number of T-cell epitopes throughout the entire Dsg3 extracellular domain (ECD) and transmembrane region. Our results reveal that DR4 and DR6 PV may initiate in the ECD and transmembrane region respectively, with implications for immunotherapeutic strategies for the treatment of this autoimmune disease.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17254312      PMCID: PMC1764484          DOI: 10.1186/1471-2105-7-S5-S7

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


Background

Pemphigus vulgaris (PV) is a severe autoimmune blistering skin disorder due to loss of integrity of normal intercellular attachments within the epidermis and mucosal epithelium. Strong association of PV to the major histocompatibility complex (MHC) class II alleles DRB1*0402 and DQB1*0503 have been reported in the literature [1-6] with over 95% of PV patients possessing one or both of these alleles [7,8]. The molecular basis and immunological consequences of this dual HLA association has thus far not been elucidated. The target antigen of PV, desmoglein (Dsg) 3, is a 130-kDa transmembrane glycoprotein that belongs to the cadherin superfamily of cell adhesion molecules [9]. In the early stage of disease, patients demonstrate autoimmunity only to Dsg3 and develop mucosal blisters; while at the later stage, patients exhibit non-cross-reactive immunity to both Dsg3 and Dsg1 [10]. Despite several reports of T-cell specificities within Dsg3 [11-16], the number is limited and the role of T-cells in the pathogenesis of PV remains poorly understood. Bioinformatic tools are now commonly used in facilitating T-cell epitope discovery [17-19]. Computational methods for predicting MHC-binding peptides include procedures based on sequence motifs [13,16], quantitative matrices [20-22], decision trees [23,24], artificial neural networks [25,26], hidden Markov models [27] and support vector machines [28-30]. However, despite recent advances in sequence-based predictive techniques, effective models for DRB1*0402 and DQB1*0503 have been lacking, mainly due to the paucity of sufficient peptides as training data [31] as well as register-shifts (or positional differences between the core nonameric peptide in the binding groove) [31] and polymorphisms in the binding registers (A.A. Sinha, unpublished results). To date, few prediction techniques for MHC class II molecules have been developed using three-dimensional models as the dual issues of model quality and discrimination technique must be addressed [32]. Our strategy for prediction of T-cell epitopes involves the three-dimensional modeling of peptide/MHC complexes using a hybrid docking approach that integrates the strengths of Monte Carlo simulations and homology modeling [31,33,34]. In an earlier study, we have successfully discriminated disease-implicated from non-disease implicated and protective alleles in PV based on structural interaction rules [31]. A complementary scoring function has now been developed for effective identification of DRB1*0402 and DQB1*0503 epitopes. We investigated the T-cell epitope repertoire of the entire Dsg3 glycoprotein and show the existence of multiple extracellular and intracellular specificities within the Dsg3 self-antigen. Further analysis reveal that DRB1*0402 and DQB1*0503 share similar specificities by binding peptides at different core recognition regions. These data provide a molecular rationale for the association of PV to two distinct HLA molecules and impact our understanding of the mechanism of HLA mediated control of disease.

Results and Discussion

DRB1*0402 predictive model

The DRB1*0402 model shows excellent correlation with experimental binding affinities (r2 = 0.90, s = 1.20 kJ/mol, q2 = 0.82, s= 1.61 kJ/mol). The accuracy of the prediction model was further evaluated using (i) peptides with experimental IC50 values obtained from biochemical studies and (ii) Dsg3 peptides with T-cell proliferation values obtained from functional studies. Three threshold binding energy values (Table 1) that define levels of specificities suitable for practical applications [34] were used to evaluate the accuracy of the DRB1*0402 prediction model on the biochemical dataset (Test set 1) – LMH (low-, medium-, high-affinity binders; AROC = 0.93); MH (medium- and high-affinity binders; AROC = 0.86) and H (high-affinity binders only; AROC = 0.81). The results indicate that, overall, our DRB1*0402 peptide-binding models are highly accurate (AROC = 0.93). SP = 0.80 offers high-sensitivity predictions, whereas SP = 0.95 results in very few false positives but fewer true positives. The prediction results for our model were consistent with expected binding patterns of DRB1*0402 peptides and provided a sensitivity of 70% (SP = 0.95) for DRB1*0402-binding peptides.
Table 1

Prediction accuracy of the DRB1*0402 peptide-binding model.

Specificity (SP) LevelGroupSensitivity (SE)Binding Energy Threshold (kJ/mol)
SP = 0.80LMH0.78-25.55
MH0.81-25.79
H0.65-26.64
SP = 0.90LMH0.75-25.79
MH0.52-26.94
H0.30-28.83
SP = 0.95LMH0.70-26.94
MH0.42-27.72
H0.25-30.57

Sensitivity (SE) values and binding energy thresholds for DRB1*0402 peptide-binding model at specificity (SP) levels 0.80, 0.90 and 0.95.

Next, the predictive performance of the DRB1*0402 model was tested on the functional dataset of 25 peptides (Test set 2) with experimental T-cell proliferation values/responses using the decision thresholds defined above. All experimental positives (Table 2) are predicted with binding energy values of -26.64 kJ/mol or less (high-affinity binders; SE = 0.65, SP = 0.80). Our simulation results indicate that DRB1*0402 can bind Dsg3 96–112 at threshold -26.64 kJ/mol (SE = 0.65, SP = 0.80) in contrast to previous qualitative study results [31], albeit at a binding energy (-27.09 kJ/mol) close to the threshold. Dsg3 512–526 (ranked #10), Dsg3 78–93 (ranked #12) and Dsg3 96–110 (ranked #19) are predicted binders (SE = 0.65, SP = 0.80) that did not stimulate T-cell responses in the relevant experiments [11,13,16]. Noteworthy, peptides Dsg3 78–93 [13] and Dsg3 96–110 [16] are fully contained within Dsg3 78–94 (ranked #11) and Dsg3 96–112 (ranked #22), both of which are experimental true positives identified in an independent study [11]. For Dsg3 78–93, the preferred binding register identified by our models is 79–87, which is shows a 5-residue shift in the binding register from the preferred core of 82–90 for Dsg3 78–94. There is no change in the binding register observed for Dsg3 96–110 and Dsg3 96–112. It is possible that variability in the binding register (#12) or flanking peptide residues (#19) lead to a change in T-cell stimulation.
Table 2

Predicted binding affinities of Dsg3 peptide sequences to DRB1*0402.

RankPeptideSequencePredicted BE (kJ/mol)References
1Dsg3 342–358SVKLSIAVKNKAEFHQS-31.46Veldman et al. (2004)
2#Dsg3 342–356SVKLSIAVKNKAEFH-30.88Moesta et al. (2002)
3Dsg3 205–221GTPMFLLSRNTGEVRTL-30.44Veldman et al. (2004)
4* #Dsg3 846–860LDSLGPKFKKLAEIS-30.17Moesta et al. (2002)
5Dsg3 380–396GIAFRPASKTFTVQKGI-29.97Riechers et al. (1999)
6@Dsg3 67–81RNPIAKITSDYQATQ-29.94Moesta et al. (2002)
7Dsg3 786–800MNFLDSYFSQKAFAC-29.78Moesta et al. (2002)
8Dsg3 190–204LNSKIAFKIVSQEPA-29.74Wucherpfennig et al. (1995)
9Dsg3 189–205HLNSKIAFKIVSQEPAG-29.24Veldman et al. (2004)
10*Dsg3 512–526SARTLNNRYTGPYTF-29.21Wucherpfennig et al. (1995)
11Dsg3 78–94QATQKITYRISGVGIDQ-28.37Veldman et al. (2004)
12*Dsg3 78–93QATQKITYRISGVGID-28.30Wucherpfennig et al. (1995)
13Dsg3 191–205NSKIAFKIVSQEPAG-28.16Moesta et al. (2002)
14Dsg3 206–220TPMFLLSRNTGEVRT-28.02Wucherpfennig et al. (1995)
15Dsg3 210–226LLSRNTGEVRTLTNSL-27.98Veldman et al. (2004)
16Dsg3 251–265CECNIKVKDVNDNFP-27.88Wucherpfennig et al. (1995)
17Dsg3 250–266QCECNIKVKDVNDNFPM-27.68Veldman et al. (2004)
18Dsg3 483–499VRVPDFNDNCPTAVLEK-27.48Veldman et al. (2004)
19* @Dsg3 96–110PFGIFVVDKNTGDIN-27.47Moesta et al. (2002)
20Dsg3 762–776QSGTMRTRHSTGGTN-27.28Wucherpfennig et al. (1995)
21Dsg3 161–177IFMGEIEENSASNSLVM-27.09Hertl et al. (1998)
22Dsg3 96–112PFGIFVVDKNTGDINIT-27.09Veldman et al. (2004)
23@Dsg3 963–977ERVICPISSVPGNLA-26.87Moesta et al. (2002)
24*Dsg3 97–111FGIFVVDKNTGDINI-26.58Wucherpfennig et al. (1995)
25@Dsg3 810–824NDCLLIYDNEGADAT-24.00Moesta et al. (2002)

Predicted binding affinities of Dsg3 peptide sequences to DRB1*0402. '#', '@', '*' indicate binding, non-binding and non-immunostimulatory in the relevant experiments respectively.

DQB1*0503 predictive model

The DQB1*0503 prediction model correlate well with experimental data (r2 = 0.95, s = 1.20 kJ/mol). The DQB1*0503 model outperforms the prediction models done by Rognan et al. (1999) on training datasets of 5 HLA-A*0204 (r2 = 0.85, s= 2.40 kJ/mol) and 37 H2-Kk (r2 = 0.78, s= 3.16 kJ/mol) peptide sequences and is consistent with our DRB1*0402 prediction model (r2 = 0.90, s = 1.20 kJ/mol, q2 = 0.82, s= 1.61 kJ/mol) which is trained using the same number of data points. The cross-validation coefficient q2 and the standard error of prediction sare stable, with q2 = 0.75 and s= 2.15 kJ/mol. This iterative regression procedure validates the internal consistency of the scoring function in the current model, rendering it suitable for predictions on the test dataset obtained from functional studies. The accuracy of the DQB1*0503 prediction model was assessed on a dataset of 6 (5 stimulatory and 1 non-stimulatory peptides) Dsg3 peptides with known T-cell proliferation values (Table 3). All DQB1*0503-specific Dsg3 stimulatory peptides can be effectively discriminated from the background at the prediction threshold -26.64 kJ/mol.
Table 3

Predicted binding affinities of Dsg3 peptide sequences to DQB1*0503.

RankPeptideSequencePredicted BE (kJ/mol)References
1Dsg3 206–220TPMFLLSRNTGEVRT-30.53Wucherpfennig et al. (1995)
2Dsg3 189–205HLNSKIAFKIVSQEPAG-29.10Veldman et al. (2004)
3Dsg3 78–94QATQKITYRISGVGIDQ-27.14Veldman et al. (2004)
4Dsg3 190–204LNSKIAFKIVSQEPA-26.88Wucherpfennig et al. (1995)
5Dsg3 251–265CECNIKVKDVNDNFP-26.65Wucherpfennig et al. (1995)
6*Dsg3 762–786QSGTMRTRHSTGGTNKDYADGAISM-22.42Wucherpfennig et al. (1995)

Predicted binding affinities of Dsg3 peptide sequences to DQB1*0503. '*' indicates non-immunostimulatory in the relevant experiments.

Disease progression in PV

A variety of studies have demonstrated that a limited set of epitopes may be present in early disease, and intra-molecular epitope spreading may occur during disease transition at the B-cell level [10]. Our data support the existence of multiple immunodominant T-cell epitopes that may be responsible for both disease initiation and propagation (Figure 1 and Figure 2). These findings are in line with T-cell proliferation data obtained from DR4 and DR6 PV patients [11,13,15,16,31]. Our analysis showed that the potential Dsg3 T-cell epitope repertoire is well distributed throughout all five extracellular domains (ECDs) (ECD1: Dsg3 50–158; ECD2: Dsg3 159–268; ECD3: Dsg3 269–383; ECD4: Dsg3 386–499; ECD5: Dsg3 500–615) [35]. A large number of DQB1*0503-specific Dsg3 peptides were predicted to exist within the transmembrane region (Dsg3 616–640; Figure 2). Noteworthy, 13 of these peptides were found among the top 20 predictions (Table 4), suggesting that disease initiation may begin at the transmembrane region for DR6 PV patients. Such an initiation step would not be unique to PV as other studies have revealed disease initiation at the transmembrane region [36,37]. For DRB1*0402, 90% (18/20) of the top 20 predictions were predicted to exist within the ECDs, suggesting that disease initiation in DR4 and DR6 PV patients may be different. Although two DRB1*0402-specific intracellular peptides (Dsg3 762–786, Dsg3 786–800) have been reported [13,16], and were correctly detected by our model (SP = 0.80, SE = 0.65), relatively few intracellular peptides from the region, Dsg3 641–999, are predicted by our methods. It remains to be determined what proportion of the predicted epitopes are actually generated via antigen processing events in vivo, or remain subdominant or "cryptic", such that they are not available for recognition during the initial immune response. It is likely that immune responses develop against secondary epitopes at later stages of disease progression, as a result of intramolecular epitope spreading, and it will be important to delineate the cascade of disease relevant epitopes in temporal sequence in future work.
Figure 1

Predicted DRB1*0402-specific T-cell epitope repertoire within Dsg3. Location of predicted DRB1*0402-specific T-cell epitopes along the Dsg3 glycoprotein – extracellular domain (colored yellow; Dsg3 50–615), transmembrane region (colored green; Dsg3 616–640), and intracellular domain (colored pink; Dsg3 641–999). The red line indicates decision threshold -26.64 kJ/mol for high-affinity binders (SP = 0.80, SE = 0.65).

Figure 2

Predicted DQB1*0503-specific T-cell epitope repertoire within Dsg3. Location of predicted DQB1*0503-specific T-cell epitopes along the Dsg3 glycoprotein – extracellular domain (colored yellow; Dsg3 50–615), transmembrane region (colored green; Dsg3 616–640), and intracellular domain (colored pink; Dsg3 641–999). The red line indicates decision threshold -26.64 kJ/mol for high-affinity binders (SP = 0.80, SE = 0.65).

Table 4

Top 20 predicted Dsg3 peptide sequences binding to DQB1*0503 and DRB1*0402.

RankDQB1*0503DRB1*0402


PeptideSequencePredicted B.E. (kJ/mol)PeptideSequencePredicted B.E. (kJ/mol)
1Dsg3 627–641GLLLLLLAPLLLLTC-39.23Dsg3 601–615TTSPGTRYGRPHSGR-34.64
2Dsg3 626–640LGLLLLLLAPLLLLT-38.43Dsg3 263–277NFPMFRDSQYSARIE-33.86
3Dsg3 624–638LLLGLLLLLLAPLLL-37.87Dsg3 844–858SFLDSLGPKFKKLAE-33.46
4Dsg3 629–643LLLLLAPLLLLTCDC-37.82Dsg3 264–278FPMFRDSQYSARIEE-33.22
5Dsg3 625–639LLGLLLLLLAPLLLL-37.65Dsg3 599–613YPTTSPGTRYGRPHS-33.10
6Dsg3 628–642LLLLLLAPLLLLTCD-37.47Dsg3 352–366KAEFHQSVISRYRVQ-32.88
7Dsg3 620–634AIGLLLLGLLLLLLA-36.48Dsg3 598–612SYPTTSPGTRYGRPH-32.76
8Dsg3 623–637LLLLGLLLLLLAPLL-36.44Dsg3 442–456AEIKFVKNMNRDSTF-32.60
9Dsg3 618–632PAAIGLLLLGLLLLL-36.36Dsg3 440–454KTAEIKFVKNMNRDS-32.56
10Dsg3 293–307LDEEYTDNWLAVYFF-36.28Dsg3 439–453SKTAEIKFVKNMNRD-32.36
11Dsg3 622–636GLLLLGLLLLLLAPL-36.23Dsg3 52–66VKFAKPCREGEDNSK-32.13
12Dsg3 619–633AAIGLLLLGLLLLLL-36.20Dsg3 186–200EPNHLNSKIAFKIVS-32.10
13Dsg3 294–308DEEYTDNWLAVYFFT-36.15Dsg3 354–368EFHQSVISRYRVQST-32.10
14Dsg3 621–635IGLLLLGLLLLLLAP-35.67Dsg3 247–261LSTQCECNIKVKDVN-32.08
15Dsg3 297–311YTDNWLAVYFFTSGN-35.28Dsg3 262–276DNFPMFRDSQYSARI-32.07
16Dsg3 483–497VRVPDFNDNCPTAVL-35.17Dsg3 845–859FLDSLGPKFKKLAEI-32.07
17Dsg3 616–630LGPAAIGLLLLGLLL-34.86Dsg3 954–968LLTQNVIVTERVICP-31.93
18Dsg3 317–331EIQTDPRTNEGILKV-34.75Dsg3 351–365NKAEFHQSVISRYRV-31.87
19Dsg3 295–309EEYTDNWLAVYFFTS-34.67Dsg3 344–358KLSIAVKNKAEFHQS-31.85
20Dsg3 136–150DVEKPLILTVKILDI-34.63Dsg3 441–455TAEIKFVKNMNRDST-31.72

The nonameric core peptide in the binding groove is shown in bold, underlined text.

DRB1*0402 and DQB1*0503 cross reactivity

An in-depth analysis was performed to investigate the extent of overlap in the Dsg3 peptide-binding repertoires of DRB1*0402 and DQB1*0503. A panel of 936 15mer Dsg3 sequences were generated using an overlapping sliding window of size 15 across the entire Dsg3 glycoprotein and modeled into the binding grooves of both DRB1*0402 and DQB1*0503 (explained in detail in Peptide docking). Both the DRB1*0402 and DQB1*0503 alleles are particularly efficient in binding Dsg3-derived peptides. Furthermore, a significant level of cross-reactivity was observed between DRB1*0402 and DQB1*0503. Of the 936 overlapping 15mer peptides generated from the entire Dsg3 glycoprotein investigated in this study, 539 (57%) were predicted high-affinity binders to both alleles at threshold -26.64 kJ/mol. The computer simulation results are shown in Figures 1 and 2. Noteworthy, three previously defined immunoreactive segments of the Dsg3 extracellular domains (Dsg3 145–192, 240–303 and 570–614) [38] were also predicted by our models at this specific threshold. Among the known Dsg3 peptides (Tables 2 and 3), only 2 (Dsg3 191–205 and 762–776) were predicted to bind DRB1*0402 alone, consistent with our earlier qualitative results from structural studies alone [31]. 18 (Dsg3 78–94, 96–112, 161–177, 189–205, 190–204, 205–221, 206–220, 210–226, 250–266, 251–265, 342–356, 342–358, 376–392, 380–396, 483–499, 786–800, 810–824, 963–977) were predicted to bind to both DRB1*0402 and DQB1*0503. These observations are of particular interest in that both DRB1*0402 and DQB1*0503 are strongly linked to PV [39,40], indicating that common or overlapping dominant epitopes may be responsible for inducing disease in DR4 and DR6 patients respectively.

DRB1*0402 and DQB1*0503 peptide binding specificities

The basis for the high degree of cross-reactivities between DRB1*0402 and DQB1*0503 was subjected to further analysis. Our data support the existence of multiple binding registers within a candidate binding peptide that serve as recognition sites for DRB1*0402 and DQB1*0503, an observation previously noted for DQB1*0302 binding peptides [34]. Of 936 Dsg3 sequences, 614 were predicted high-affinity binders with 76% displaying 2 or more registers that can be docked into the binding groove of DRB1*0402 (Figure 3). Similar results are obtained for DQB1*0503, with 673 predicted high-affinity binders and 57% exhibiting 2 or more binding registers (Figure 3). DRB1*0402 and DQB1*0503 predicted consensus binding sequences number 539. A striking aspect of this analysis is that DRB1*0402 and DQB1*0503 were predicted to bind a large portion of these peptides (354/539 or 66%) at different binding registers. Noteworthy, this difference was detected in 70% (7/10) of Dsg3 peptides known to bind both DRB1*0402 and DQB1*0503 (Table 5). For example, the consensus binding peptide Dsg 205–221 showed ΔG values less than the decision threshold -26.64 kJ/mol, with the 211–219 and 212–220 registers being the preferred binding modes for DRB1*0402 and DQB1*0503 respectively. We propose that DRB1*0402 and DQB1*0503 share similar specificities by binding peptides at different binding registers.
Figure 3

Predicted binding registers for Dsg3 peptides bound to DRB1*0402 and DQB1*0503. The frequency of Dsg3 peptides docked to DRB1*0402 (colored gray) and DQB1*0503 (colored white) is shown as a function of the number of predicted binding registers.

Table 5

Preferred binding registers of experimentally determined Dsg3 peptides for PV implicated alleles.

PeptideAlleleSequencePeptideAlleleSequence
Dsg3 78–94DRB1*0402QATQKITYRISGVGIDQDsg3 250–266DRB1*0402QCECNIKVKDVNDNFPM
DQB1*0503QATQKITYRISGVGIDQDQB1*0503QCECNIKVKDVNDNFPM
Dsg3 96–112DRB1*0402PFGIFVVDKNTGDINITDsg3 251–265DRB1*0402CECNIKVKDVNDNFP
DQB1*0503PFGIFVVDKNTGDINITDQB1*0503CECNIKVKDVNDNFP
Dsg3 161–177DRB1*0402IFMGEIEENSASNSLVMDsg3 342–356DRB1*0402SVKLSIAVKNKAEFH
Dsg3 189–205DRB1*0402HLNSKIAFKIVSQEPAGDsg3 342–358DRB1*0402SVKLSIAVKNKAEFHQS
DQB1*0503HLNSKIAFKIVSQEPAGDQB1*0503SVKLSIAVKNKAEFHQS
Dsg3 190–204DRB1*0402LNSKIAFKIVSQEPADsg3 376–392DQB1*0503NVREGIAFRPASKTFTV
DQB1*0503LNSKIAFKIVSQEPADsg3 380–396DRB1*0402GIAFRPASKTFTVQKGI
Dsg3 191–205DRB1*0402NSKIAFKIVSQEPAGDsg3 483–499DRB1*0402VRVPDFNDNCPTAVLEK
Dsg3 205–221DRB1*0402GTPMFLLSRNTGEVRTLDsg3 762–776DRB1*0402QSGTMRTRHSTGGTN
DQB1*0503GTPMFLLSRNTGEVRTLDsg3 786–800DRB1*0402MNFLDSYFSQKAFAC
Dsg3 206–220DRB1*0402TPMFLLSRNTGEVRTDsg3 810–824DRB1*0402NDCLLIYDNEGADAT
DQB1*0503TPMFLLSRNTGEVRTDsg3 963–977DRB1*0402ERVICPISSVPGNLA
Dsg3 210–226DRB1*0402LLSRNTGEVRTLTNSL

Preferred binding registers of experimentally determined Dsg3 peptides for PV implicated alleles. Best fitting registers with lowest predicted binding energies are in bold, underlined text.

Conclusion

Although PV was first reported by Hippocrates in 400 B.C., few T-cell specificities within Dsg3 have been identified to date [11-16], and the role of MHC and T-cells in the pathogenesis of PV still remains poorly understood. Collectively, the results of this study provide interesting new insights into the pathology of PV. This is the first report of high-level of cross-reactivity between both PV-implicated alleles, DRB1*0402 and DQB1*0503, as well as the existence of a potentially large number of T-cell epitopes throughout the entire Dsg3 ECD and transmembrane region. Our data strongly indicates that multiple initial epitopes may be responsible for both disease initiation and progression. In addition, our modelling results revealed that DR4 and DR6 PV may initiate at different regions of Dsg3. DR4 PV may initiate in the ECD while DR6 PV may begin at the transmembrane region. It remains to be determined what proportion of predicted DRB1*0402 and DQB1*0503 binders are capable of stimulating PV-implicated alleles and autoreactive T-cells. If experimental analysis reveals limited sets of predicted Dsg3 peptides capable of eliciting functional responses, control over autoreactivity may rest at the T-cell level, rather than the level of determinant selection by MHC molecules. This will have a direct impact in the design of immunotherapeutic strategies for the treatment of this autoimmune disease.

Methods

MHC sequence data were obtained from IMGT-HLA database [41]. To identify potential structural templates available in the Protein Data Bank (PDB) [42] for model building, a sequence similarity search was performed using PSI-BLAST [43] running on the servers at NCBI and the highest quality templates (with the best resolution, highest sequence similarity and minimal number of missing residues) were selected among the returned results. The crystal structures of DRB1*0401 (PDB code 1D5Z) and DQB1*0602 (PDB code 1UVQ) were selected as templates for DRB1*0402 (97.9% identity) and DQB1*0503 (93.0% identity) respectively.

Model building

The program MODELLER [44] was employed for comparative modeling of both DRB1*0402 and DQB1*0503. The models are constructed by optimally satisfying spatial constraints obtained from the alignment of the template structure with the target sequence and from the CHARMM-22 force field [45]. The structures were relaxed by conjugate gradient minimization, using the Internal Coordinate Mechanics (ICM) 3.0 package [46].

Experimental binding data

Two sets of data are used in this study: (i) peptides with experimental IC50 values from biochemical studies and (ii) peptides with experimental T-cell proliferation values/responses from functional studies. Dataset I (Additional file 1: Table S1) comprises 59 DRB1*0402-specific peptides derived from biochemical studies (20 high-affinity, 11 medium-affinity and 13 low-affinity binders and 15 non-binders). Peptides are classified based on their experimental IC50 values (high-affinity binders: IC50 ≤ 500 nM, medium-affinity binders: 500 nM < IC50 ≤ 1500 nM, low-affinity binders: 1500 < IC50 ≤ 5000 nM and non-binders: 5000 < IC50). Dataset II (Additional file 2: Table S2) consists of 25 DRB1*0402-specific Dsg3 peptides and 14 DQB1*0503-specific Dsg3 peptides with T-cell proliferation values/responses [11-13].

Peptide docking

Overlapping 15-mer peptides are generated from the Dsg3 sequence. An overlapping sliding window of size nine is applied to each 15-mer peptide to generate all combinations of binding registers to be modeled into the binding grooves of DRB1*0402 and DQB1*0503. Docking was performed with the Empirical Conformational Energy Program for Peptides 3 force field parameters (ECEPP/3) [46] and MMFF partial charges [47] on a 4-CPU SGI Origin 3200 workstation using an extension of the protocol [33,34]: (i) pseudo-Brownian rigid body docking of peptide fragments to the ends of the binding groove, (ii) central loop closure by satisfaction of spatial constraints, (iii) refinement of the backbone and side-chain atoms of the core recognition residues and receptor contact regions within 4.00Å radius [31,33] and (iv) extension of flanking peptide residues by satisfaction of spatial constraints.

Empirical free energy functions

The scoring function presented in this study is based on the free energy potential in ICM3.0 package [47]. The binding free energy function is partitioned into three terms [34] expressed by the equation: ΔG = αΔGH + βΔGS + γΔGEL + C.     (1) ΔGH is the hydrophobic energy computed as the product of solvent accessible surface area (determined by rolling a sphere of 1.40Å radius along the surface of the molecule) by the surface tension. ΔGS refers to the entropic contribution from the protein side-chains computed from the maximal burial entropies for each type of amino acid and their relative accessibilities. ΔGEL denotes the electrostatic term composed of coulombic interactions between receptor and ligand and the desolvation of partial charges transferred from an aqueous medium to a protein core environment, and is determined by the numeric solution of the Poisson equation using an implementation of the boundary element algorithm [48-50]. The constant term C accounts for entropy change in the system due to the decrease of free molecular concentration and the loss of rotational/translational degrees of freedom upon binding [51].

Training, testing and validation

Two computational models are trained in this study – one model for the prediction of peptide binding to DRB1*0402 and the other for DQB1*0503. DRB1*0402-specific peptide data derived from biochemical studies with experimental IC50 values was divided into training and test sets. The training set comprised 8 (5 binding and 3 non-binding) Dsg3 sequences with experimentally determined binding registers (from Dataset I). Two external sets of test data were used: (i) Test set 1: 51 peptides with experimental IC50 values (20 high-affinity binders, 11 medium affinity binders, 9 low affinity binders and 11 non-binders) from biochemical studies, and (ii) Test set 2: all DRB1*0402-specific Dsg3 peptides from Dataset II, with known T-cell proliferation values. DQB1*0503 prediction model was trained using Dsg3 peptide data from functional studies in the absence of relevant biochemical data. The training set comprised 8 (5 stimulatory and 3 non-stimulatory) sequences from Dataset II. For each peptide sequence, T-cell proliferation value [13] is mapped to a theoretical IC50 value in accordance with expected binding patterns of Dsg3 binding peptides (Sinha et al., personal communications). The performance of the prediction model was subsequently evaluated on an external set of 6 peptides with known T-cell proliferation values. Coefficients (α, β, γ) and the constant term C in Equation 1 were derived using standard least-square multivariate regression analyses of the training set, followed by leave-one-out analysis to assess to quality of the scoring function [52]. For each model, the entire procedure is repeated 8 times to reduce noises in all computations, the results averaged and the observed error rate is used to estimate the expected error rate upon generalization to new data. The optimal scoring function selected from each cross-validation analysis was further assessed using sensitivity (SE), specificity (SP) and receiver operating characteristic (ROC) analysis [34]. SE = TP/(TP+FN) and SP = TN/(TN+FP), indicate percentages of correctly predicted binders and non-binders, respectively. TP (true positives) represents experimental binders with at least one predicted binding register and TN (true negatives) for experimental non-binders with no predicted binding register. FN (false negatives) denotes experimental binders predicted as non-binders and FP (false positives) represents experimental non-binders predicted as binders. The accuracy of our predictions was assessed by ROC analysis where the ROC curve is generated by plotting SE as a function of (1-SP) for various classification thresholds. The area under the ROC curve (AROC) provides a measure of overall prediction accuracy, AROC <70% for poor, AROC>80% for good and AROC>90% for excellent predictions [34]. In this study, we assessed SE for three values of SP (80%, 90% and 95%) that are considered useful in practice. All regression and validation results, including correlation coefficient (r2), standard deviation (s), cross-validation coefficient (q2), standard error of prediction (spress), SE, SP, AROC, coefficients and constant terms in the scoring function, are recorded.

Authors' contributions

JCT carried out the computational simulation studies and drafted the manuscript. AAS carried out the immunoassays. TWT, AAS and SR participated in the design of the study and interpretation of data. SR developed the project and finalized the manuscript.

Additional file 1

Table-S1 Click here for file

Additional file 2

Table-S2 Click here for file
  45 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Prediction of the binding energy for small molecules, peptides and proteins.

Authors:  M Schapira; M Totrov; R Abagyan
Journal:  J Mol Recognit       Date:  1999 May-Jun       Impact factor: 2.137

3.  Prediction of HLA-DQ3.2beta ligands: evidence of multiple registers in class II binding peptides.

Authors:  Joo Chuan Tong; Guang Lan Zhang; Tin Wee Tan; J Thomas August; Vladimir Brusic; Shoba Ranganathan
Journal:  Bioinformatics       Date:  2006-03-01       Impact factor: 6.937

4.  An empirical method for the prediction of T-cell epitopes.

Authors:  M P Davenport; I A Ho Shon; A V Hill
Journal:  Immunogenetics       Date:  1995       Impact factor: 2.846

5.  Predicting peptides that bind to MHC molecules using supervised learning of hidden Markov models.

Authors:  H Mamitsuka
Journal:  Proteins       Date:  1998-12-01

6.  Development and characterization of desmoglein-3 specific T cells from patients with pemphigus vulgaris.

Authors:  M S Lin; S J Swartz; A Lopez; X Ding; M A Fernandez-Vina; P Stastny; J A Fairley; L A Diaz
Journal:  J Clin Invest       Date:  1997-01-01       Impact factor: 14.808

Review 7.  HLA class II restriction of autoreactive T cell responses in pemphigus vulgaris: review of the literature and potential applications for the development of a specific immunotherapy.

Authors:  R Riechers; J Grötzinger; M Hertl
Journal:  Autoimmunity       Date:  1999       Impact factor: 2.815

8.  Absorption of pathogenic autoantibodies by the extracellular domain of pemphigus vulgaris antigen (Dsg3) produced by baculovirus.

Authors:  M Amagai; T Hashimoto; N Shimizu; T Nishikawa
Journal:  J Clin Invest       Date:  1994-07       Impact factor: 14.808

9.  Structural basis for major histocompatibility complex (MHC)-linked susceptibility to autoimmunity: charged residues of a single MHC binding pocket confer selective presentation of self-peptides in pemphigus vulgaris.

Authors:  K W Wucherpfennig; B Yu; K Bhol; D S Monos; E Argyris; R W Karr; A R Ahmed; J L Strominger
Journal:  Proc Natl Acad Sci U S A       Date:  1995-12-05       Impact factor: 11.205

10.  Modeling the bound conformation of Pemphigus vulgaris-associated peptides to MHC Class II DR and DQ alleles.

Authors:  Joo Chuan Tong; Jeff Bramson; Darja Kanduc; Selwyn Chow; Animesh A Sinha; Shoba Ranganathan
Journal:  Immunome Res       Date:  2006-01-21
View more
  9 in total

1.  Efficacy of HLA-DRB1∗03:01 and H2E transgenic mouse strains to correlate pathogenic thyroglobulin epitopes for autoimmune thyroiditis.

Authors:  Yi-chi M Kong; Nicholas K Brown; Jeffrey C Flynn; Daniel J McCormick; Vladimir Brusic; Gerald P Morris; Chella S David
Journal:  J Autoimmun       Date:  2011-06-17       Impact factor: 7.094

Review 2.  The Immunogenetics of Autoimmune Blistering Diseases.

Authors:  Diana Kneiber; Eric H Kowalski; Kyle T Amber
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 2.622

3.  pDOCK: a new technique for rapid and accurate docking of peptide ligands to Major Histocompatibility Complexes.

Authors:  Javed Mohammed Khan; Shoba Ranganathan
Journal:  Immunome Res       Date:  2010-09-27

Review 4.  Immune response in pemphigus and beyond: progresses and emerging concepts.

Authors:  Giovanni Di Zenzo; Kyle T Amber; Beyza S Sayar; Eliane J Müller; Luca Borradori
Journal:  Semin Immunopathol       Date:  2015-11-23       Impact factor: 9.623

Review 5.  A reciprocal HLA-Disease Association in Rheumatoid Arthritis and Pemphigus Vulgaris.

Authors:  Vincent Van Drongelen; Joseph Holoshitz
Journal:  Front Biosci (Landmark Ed)       Date:  2017-01-01

6.  Detection of autoreactive CD4+ T cells by MHC class II multimers in HLA-linked human autoimmune diseases.

Authors:  Karolin Wieber; Christine L Zimmer; Michael Hertl
Journal:  J Clin Invest       Date:  2021-05-03       Impact factor: 14.808

7.  Detection of rare autoreactive T cell subsets in patients with pemphigus vulgaris.

Authors:  Alexandra Polakova; Leonie Kauter; Adina Ismagambetova; Dario Didona; Farzan Solimani; Kamran Ghoreschi; Michael Hertl; Christian Möbs; Christoph Hudemann
Journal:  Front Immunol       Date:  2022-09-20       Impact factor: 8.786

8.  In silico characterization of immunogenic epitopes presented by HLA-Cw*0401.

Authors:  Joo Chuan Tong; Zong Hong Zhang; J Thomas August; Vladimir Brusic; Tin Wee Tan; Shoba Ranganathan
Journal:  Immunome Res       Date:  2007-08-20

9.  Immunological hotspots analyzed by docking simulations: evidence for a general mechanism in pemphigus vulgaris pathology and transformation.

Authors:  Joo Chuan Tong; Animesh A Sinha
Journal:  BMC Immunol       Date:  2008-06-19       Impact factor: 3.615

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.