Literature DB >> 35330015

Epitope Mapping of Pathogenic Autoantigens on Sjögren's Syndrome-Susceptible Human Leukocyte Antigens Using In Silico Techniques.

Shivai Gupta1, Danmeng Li2, David A Ostrov2, Cuong Q Nguyen1,3,4.   

Abstract

Sjögren's syndrome (SjS) is characterized by lymphocytic infiltration and the dysfunction of the salivary and lacrimal glands. The autoimmune response is driven by the effector T cells and their cytokines. The activation of the effector helper T cells is mediated by autoantigen presentation by human leukocyte antigen (HLA) class II molecules of antigen-presenting cells. Studies using familial aggregation, animal models, and genome-wide association demonstrate a significant genetic correlation between specific risk HLAs and SjS. One of the key HLA alleles is HLA-DRB1*0301; it is one of the most influential associations with primary SjS, having the highest odds ratio and occurrence across different ethnic groups. The specific autoantigens attributed to SjS remain elusive, especially the specific antigenic epitopes presented by HLA-DRB1*0301. This study applied a high throughput in silico mapping technique to identify antigenic epitopes of known SjS autoantigens presented by high-risk HLAs. Furthermore, we identified specific binding HLA-DRB1*0301 epitopes using structural modeling tools such as Immune Epitope Database and Analysis Resource IEDB, AutoDock Vina, and COOT. By deciphering the critical epitopes of autoantigens presented by HLA-DRB1*0301, we gain a better understanding of the origin of the antigens, determine the T cell receptor function, learn the mechanism of disease progression, and develop therapeutic applications.

Entities:  

Keywords:  T cells; autoantigens; human leukocyte antigen (HLA); major histocompatibility complex (MHC)

Year:  2022        PMID: 35330015      PMCID: PMC8953074          DOI: 10.3390/jcm11061690

Source DB:  PubMed          Journal:  J Clin Med        ISSN: 2077-0383            Impact factor:   4.241


1. Introduction

Sjögren’s syndrome (SjS) is a chronic, systemic autoimmune disease that affects the exocrine glands of the body (salivary and lacrimal glands), which may occur in conjunction with another autoimmune disease [1]. It is estimated that approximately 4 million Americans are affected, making SjS the second most common autoimmune disease after rheumatoid arthritis (RA) [2,3,4]. SjS is a multifactorial disease related to genetic, hormonal, and environmental factors. Based on animal models and candidate gene association studies, susceptibility to developing SjS has been strongly associated with human leukocyte antigen (HLA) class II genes, particularly the HLA-DR and DQ alleles [5]. As with most autoimmune diseases, associations of HLA class II loci with SjS have been described and vary in different ethnic groups [6,7,8,9,10]. In most studies, when an HLA association with primary (p)SjS was demonstrable, a stronger association between HLAs and autoantibody titers could be found to the anti-Ro/SSA and anti-La/SB autoantibody responses. The HLA-DR3 haplotype is associated with SjS and exists within a region with extended linkage disequilibrium not observed in other places in the genome [8]. It is important to note that specific HLA-DR and -DQ alleles have been observed to present autoantigens in SjS (i.e., M3R, α-fodrin, Ro (SSA), and La (SSB)) in different ethnic populations [7,11,12]. The structural determination of disease-relevant peptide–HLA and HLA–peptide–TCR complexes is crucial for the elucidation of the molecular mechanisms responsible for the development of T cell reactivity that promotes autoimmune disease [13]. The HLA–peptide–T cell receptor (TCR) interactions that determine self and non-self-discrimination are guided by a set of rules that malfunction in immunopathology in autoimmunity [14]. There are certain principles that govern HLA restriction and TCR docking geometry with a variety of molecular mechanisms that could affect HLA–peptide–TCR interactions [15]. Autoreactive T cells are directly generated and activated by the mechanisms such as atypical HLA–peptide–TCR binding orientation, low-affinity peptide binding that facilitate thymic escape, TCR-mediated stabilization of weak peptide–HLA interaction, and presentation of peptides in a different binding register [13,16,17,18,19]. The peptide binding register refers to the ∼9-mer window of a peptide that sits directly within the peptide-binding groove at a given time. Alterations in this register, whereby the same peptide binds a peptide-binding groove utilizing a different 9-mer window, have an altered impact on the generation of autoreactive T cells [20]. Further, autoreactive TCRs can bind self-peptide–HLA complexes with a conventional binding topology and a high affinity as seen in type 1 diabetes and multiple sclerosis, thus highlighting the potential role of the peptide binding register in increasing the risk of autoimmune disease [21]. The first HLA class II associations in SjS described were at the DR3 [22,23] and DR2 [23,24] loci in Caucasian populations [25]. Together these two HLA sub-types were shown to account for up to 90% of the MHC association in patients who had SjS, which have been further confirmed in the majority of subsequent studies evaluating northern European cohorts [24]. In 2005, Anaya and colleagues [26] demonstrated that the HLA-DRB1*0301-DQB1*0201 haplotype was associated with pSjS in Latin Americans. The HLA-DR3 allele is one of the predominant alleles in SjS. The purpose of this study was to use a high-throughput in-silico mapping technique to identify antigenic epitopes binding to known risk HLAs of SjS, with significant emphasis on HLA-DR3 allele. Additionally, we sought to investigate the molecular mimicry of the antigenic epitopes by determining the homology to viral and bacterial pathogens that can bind structurally to individual HLAs.

2. Materials and Methods

2.1. In Silico Binding Affinities of Peptides for HLA-DR3 and Other Risk Alleles

The Immune Epitope Database (IEDB)—La Jolla Institute for Allergy and Immunology (LIAI), La Jolla, CA, USA—hosts a series of machine learning (ML) based tools, each trained on specific datasets of an experimental peptide-MHC binding affinity matrix. These tools encompass the common approaches of ML, namely, linear regression (LR) and utilization of artificial neural networks (ANN). The SMM-align methodology predicted the peptide-MHC binding affinity by fitting a weight matrix that relates peptide sequence to end-point binding affinity value. The datasets used for the IEDB prediction tool SMM-align included complete UniProt protein sequences of human Ro52, Ro60, La, M3R and α-fodrin. Quantitative measurements were selected by choosing the binding assay of identifying the IC50 value, and this was validated by the ΔG measurement values indicating the best possible position the peptide would fit in when being presented to the T cell. The human HLA type II alleles were predicted to bind to 15-mer peptides with a core peptide region of 10 amino acids. Finally, the result sets were analyzed, and the top predicted binders were identified.

2.2. PDB Structures of HLA-DR3 and Predicted Peptide Docking

The 1A6A HLA-DR3 MHC II-peptide binding complex was extracted from the Protein Data Bank and was used as a template for the crystal structure in COOT, and geometry regularization in PHENIX modeling (Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5). The peptides were mutated using COOT [27] with rotamers that represent a local energy minimum of torsional angles. The geometry of the resulting complex was regularized in PHENIX. Autodock Vina was used for molecular docking after water, and other atoms were removed, with no presence of peptide [28]. Then, the positions of the peptides with the lowest binding energy (ΔG) were complexed using PHENIX. PyMOL (https://pymol.org/2/) accessed on 15 November 2021 v1.7.2 was used to generate molecular graphic images. The site for docking was prepared by removing all water molecules, and the protonation of HLA-DR3 residues was carried out with the SYBYL-X software. Sets of spheres were used to describe potential binding pockets on the molecular surface of HLA-DR3. The four pockets that were determined for molecular docking were determined using the SPHGEN program [28]. This program generated a grid of points that reflected the shape of the selected site, which were filtered through another program called CLUSTER [29]. CLUSTER grouped the selected spheres to define the points used by the following software called DOCK [29]. DOCK was able to match potential ligand atoms with spheres [28]. The next step used the intermolecular van der Waals and columbic AMBER energy scoring coupled with contact scoring and bump filtering. These additional characteristics were applied to the DOCK program algorithm. Atomic coordinates for all predicted peptides positioned in the selected structural pocket in 1000 different orientations were scored, and based on predicted polar (H bond) and nonpolar (van der Waals) interactions, the best images were obtained. PYMOL was used to generate molecular graphic images.
Figure 1

Predicted Ro52 peptide binders docked on HLA-DRB1*0103. (A,B) Based on the prediction by SMM-align (stabilization matrix alignment) using IEDB, the top two peptides with IC50 values of 28 and 29 were docked onto the crystal structure of HLA-DRB1*0103 PDB structure 1A6A, and the most optimum predicted position of docking is indicated for both peptides-NPWLILSEDRRQVRL and ANPWLILSEDRRQVR with the core sequence of LSEDRRQVR and LILSEDRRQ, respectively. (C,D) The highlighted region indicates the presence of the peptides in the three-dimensional structure of the Ro52 protein.

Figure 2

Predicted Ro60 peptide binders docked on HLA-DRB1*0103. (A,B) SMM-align predicted top two peptides FTFIQFKKDLKESMK and TFIQFKKDLKESMKC with IC50 values of 75 were docked onto the crystal structure of HLA-DRB1*0103 PDB structure 1A6A, and the most optimum predicted position of docking is indicated. Both predicted peptides have the same core sequence FKKDLKESM. (C,D) The highlighted region indicates the presence of the peptides in the three-dimensional structure of the Ro60 protein.

Figure 3

Predicted La peptide binders docked on HLA-DRB1*0103. (A,B) SMM-align predicted top two peptides ALKKIIEDQQESLNK and EALKKIIEDQQESLN with IC50 values of 49 and 50 were docked onto the crystal structure of HLA-DRB1*0103 PDB structure 1A6A, and the most optimum predicted position of docking is indicated. Both predicted peptides have the same core sequence IIEDQQESL. (C,D) The highlighted region indicates the presence of the peptide in the three-dimensional structure of the La protein.

Figure 4

Predicted M3R peptide binders docked on HLA-DRB1*0103. (A,B) Based on the prediction by SMM-align using IEDB the top two peptides with IC50 values of 120 and 121 were docked onto the crystal structure of HLA-DRB1*0103 PDB structure 1A6A, and the most optimum predicted position of docking is indicated for both peptides AWVISFVLWAPAILF and ISFVLWAPAILFWQY with the core sequence of AWVISFVLW and VLWAPAILF, respectively. (C,D) The highlighted region indicates the presence of the peptides in the three-dimensional structure of the M3R protein.

Figure 5

Predicted α-fodrin peptide binders docked on HLA-DRB1*0103. (A,B) Based on the prediction by SMM-align using IEDB, the top two peptides with IC50 values of 12 were docked onto the crystal structure of HLA-DRB1*0103 PDB structure 1A6A, and the most optimum predicted position of docking is indicated for both peptides SHDLQRFLSDFRDLM and HDLQRFLSDFRDLMS with the core sequence of SHDLQRFLS and FLSDFRDLM, respectively. (C,D) The highlighted region indicates the presence of the peptides in the three-dimensional structure of the α-fodrin protein.

2.3. Homology Determination for Viral and Bacterial Peptides

The software UniProt—Basic Local Alignment Search Tool (https://www.uniprot.org/blast/, accessed on 15 November 2021) was used to find regions of local similarity between sequences with an E-threshold of 10 amino acids in the viral and bacterial databases of UniProt. The top-scoring homology results were verified by the PubMed database (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 15 November 2021), recorded, and are presented in Tables 14–17.

3. Results

3.1. In Silico Antigenic Mapping of High-Risk Autoantigens Presented on HLA-DR3

HLA class II alleles play an important role in the regulation of the immune responses against the Ro and La ribonucleoproteins. The generation of these autoantibodies has been correlated with the alleles DRB1*03:01, DQA1*05:01, and DQB1*02:01 in SjS patients [30,31]. As indicated in Table 1, among all the risk alleles that have been identified in SjS, the DRB1*03:01 allele was found to be a significant risk factor in many ethnicities [32]. According to the European League Against Rheumatism (EULAR) classification criteria for pSjS, the Ro60 antibodies are one of the leading indicators of the onset of disease in patients. DRB1*03:01 haplotype is found to associate with DPB1* 02:01 allele and TNF- α2 alleles in SjS patients [33]. Different HLA alleles can also be protective in nature for varied autoimmune diseases, as presented in Table 2. In DR3 transgenic mice, Ro60 has been shown to induce a strong T and B cell response [34]. Using DRB1*03:01 as the model allele for this study, we sought to map the antigenic epitopes of the five most dominant antigens associated with SjS, which include Ro52, Ro60, La, muscarinic receptor type III (M3R), and α-fodrin as indicated in Table 3. We conducted the mapping using the artificial neural networks of NetMHCIIpan from the Immune Epitope Database and Analysis Resource (IEDB) as a predictive method to identify the 15-amino acid peptides that could potentially be presented by DRB1*03:01. As indicated in Table 4, Table 5, Table 6, Table 7 and Table 8, predicted peptides with the lowest IC50 values, an indicator of half the maximal inhibitory concentration that characterizes the effectiveness of a peptide in substituting a high-affinity molecule for binding to MHC class II represents the binding affinity of that peptide. The lower the IC50 values, the stronger the binding to DRB1*03:01 by the predicted peptides. IEDB primarily uses the threshold of IC50 to select the strong peptide binders respective to the MHC. Peptide binding to MHC class II protein is based on the discrete anchor residues at pockets 1, 4, 6/7, and 9 [35] and these anchor peptide-binding motifs can be used to predict the specific T cell response.
Table 1

Identifying high-risk human leukocyte antigen (HLA) alleles in Sjögren’s syndrome (SjS).

Country of Origin/PopulationHLA Alleles ConnotationAuto-Antibodies IdentifiedReferences
U.S.A./American CaucasianHLA-B8ND *[36]
U.S.A./American CaucasianHLA-Dw3ND *[22]
U.S.A./American CaucasianHLA-Dw3-HLA-B8ND *[12]
U.S.A./American CaucasianHLA-DRw3-HLA-B8Antinuclear antibodies Ro60[37]
U.S.A./American CaucasianHLA-DRw3-HLA-B8Ro52[18]
U.S.A./American CaucasianHLA-DRw52SS-A[38]
Japan/Japanese populationHLA-DRB1*0301SS-A and SS-B[39]
HLA-DRB3*0101
HLA-DQA1*0501/DQB1*0201
Japan/Japanese populationHLA-DRB1*0405SS-A and SS-B[39]
HLA-DRB4*0101
HLA-DQA1*0301/DQB1*0401
Japan/Japanese populationHLA-DRw53Ro/SS-A and La/SS-B[40]
Japan/Japanese populationHLA-DRB1*8032/DQA1*0103/DQB1*0601Ro/SS-A and La/SS-B[39,41]
HLA-DRB1*8032
HLA-DRB1*0405-DRB4*0101
HLA-DQA1*0301
HLA-DQB1*0401
China/Chinese populationHLA-DRB1*0803SS-A and SS-B[39]
HLA-DQA1*0103/DQB1*0601
Mexico/Mexican populationHLA-DRB1*01:01Ro/SS-A and La/SS-B[42]
HLA-B*35:01
Colombia/Mestizo Colombian populationHLA-DRB1*0301Ro/SS-A and La/SS-B[26,32,43]
HLA-DQB1*0201
Israel/Israeli Jewish/GreekHLA-DQA1*001SS-A, and SS-B[44]
HLA-DQA1*0201/DQB1*0501-Jewish
HLA-DQA1*0501-Greek
Greece/Greek populationHLA-DRB1*0301Ro/SSA andanti-La/SSB[45]
Spain/Spanish populationHLA-Cw7Ro/SSA and anti-La/SSB[46]
HLA- DRB1*0301
HLA-DR11
France/French populationHLA-DRB1*1501ND *[33]
HLA- DRB1*0301
HLA-DQB1*0201
HLA-DQB1*0602
France/French populationHLA-DRB1*0301anti-SSA and/or anti-SSB[31]
HLA-DQB1*02
Italy/Italian populationHLA-DRB1*0301anti-Ro/SSA[47]
Denmark/Danish populationHLA-Dw2ND *[23]
Denmark/Danish populationHLA-DQA1*0501anti-SSA and/or anti-SSB[48]
HLA-DQB1*0201
HLA-DQA1*0301
Finland/Finnish populationHLA-DRB1*0301anti-SS-A/Ro and anti-SS-B/La[49]
HLA-DQA1*0501
HLA-DQB1*0201
Norway/Norwegian Caucasian populationHLA-DRB1*0301Ro/SSA and La/SSB[50]
Norway/Norwegian Caucasian populationHLA-DRB1*0301anti-La/SSB strong positive association with DQA1*0501anti-Ro/SSA and anti-La/SSB autoantibody response was positively associated with DRB1*03, DQB1*02 and DRB1*03/DRB1*15-DQB1*02/DQB1*0602[19]
HLA-DQB1*02
HLA-DQA1*0501
United Kingdom/British Caucasian populationHLA-DRB1*0301Ro/SSA and La/SSB[51]
HLA-DRw52
Australia/Australian populationHLA-DRB1*0301Ro/SSA and La/SSB[16]
HLA-DQA1*0501
HLA-DQB1*02
Tunisian populationHLA-DQB1 CAR1/CAR2ND *[52]
European and African American populationHLA-DQB1*0201SSA[53]
HLA-DQA1*0101

ND *—not determined.

Table 2

Identifying protective HLA alleles in different autoimmune diseases.

DiseaseProtective HLA Class II AlleleReferences
Graves’ diseaseHLA-DRB1*07[54]
HLA-DQB1*02
HLA-DQA1*02
Hashimoto’s thyroiditisHLA-DRB1*07[55]
HLA-DQB1*02
HLA-DQA1*02
Rheumatoid arthritisHLA-DRB1*0103[56]
HLA-DRB1*07
HLA-DRB1*1201
HLA-DRB1*1301
HLA-DRB1*1501
Multiple sclerosisDRB1*14-DQB1*06-DQA1*0102[9]
Type 1 diabetesDRB1*14-DQB1*06-DQA1*0102[57]
DRB1*15-DQB1*06-DQA1*01
Systemic lupus erythematosusDR4[58]
DR5
DR11
DR14
Table 3

Peptides for SjS that have been tested in vivo.

PeptideAmino AcidsAmino AcidSequenceIn Vivo ConfirmationReferencesHLA-DR3IC50
M3R205–237LFWQYFVGKRTVPPGECFIQFLSEPTITFGTAINOD/LtJ mice[59]GECFIQFLSEPTITF473
208–227QYFVGKRTVPPGECFIQFLSImmunization of young female NOD/LtJ mice on autoimmune response[60]QYFVGKRTVPPGECF8607
Part of second extracellular loop
213–228KRTVPPGECFIQFLSEBALB/c[61]KRTVPPGECFIQFLS50,000
514–527NTFCDSCIPKTFWNBALB/c[61]NTFCDSCIPKTFWNL6549
MTLHSNSTTSPLFPNISSSWVHSPSEAGLP, N1C57BL/6j (B6) mice (M3R+/+)M3R−/− miceRag1−/− mice[62]PNISSSWVHSPSEAG4760
VHSPSEAGLPLGTVSQLDSYNISGTSGNFS, N2LPLGTVSQLDSYNIS6028
NISQTSGNFSSNDTSSDPLGGHTIWQV, N3TSGNFSSNDTSSDPL6471
FTTYIIMNRWALGNLACDLW, Extracellular loop 1FTTYIIMNRWALGNL955
QYFVGKRTVPPGECFIQFLSEP, Extracellular loop 2QYFVGKRTVPPGECF8607
VLVNTFCDSCIPKTYWNLGY, Extracellular loop 3VLVNTFCDSCIPKTY5219
H 441–465PAGGTDCSLPMIWAQKTNTPADVFISJL/L (H-2s) A/J(H-2a)[63]TDCSLPMIWAQKTNT2068
H 316–335KARIHPFHILIALETYKTGHSJL/L (H-2s) BALB/c (H-2d)A/J(H-2a)[63]IHPFHILIALETYKT1485
H 306–325EKLCNEKLLKKARIHPFHILSJL/L (H-2s)[63]EKLLKKARIHPFHIL1721
H 26–45QVTDMNRLHRFLCFGSEGGTSJL/L (H-2s)[63]QVTDMNRLHRFLCFG2266
H 401–425MVVTREKDSYVVAFSDEMVPCPVTSJL/L (H-2s) A/J(H-2a)[63]REKDSYVVAFSDEMV2879
H 481–505IALREYRKKMDIPAKLIVCGMSTNGSJL/L (H-2s)[63]REYRKKMDIPAKLIV622
H 201–225YITKGWKEVHELYKEKALSVETEKLBALB/c (H-2d)[63]VHELYKEKALSVETE2191
H 241–265ELEVIHLIEEHRLLTNHLKSBALB/c (H-2d)A/J(H-2a)[63]VIHLIEEHRLLTNHL130
Ro52Full peptideFull proteinNew Zealand Mixed Mice (NZMZ) 2758[64]NPWLILSEDRRQVRL28
Ro60480–494AIALREYRKKMDIPAAnimals were immunized with peptide Ro480–494[65,66]AIALREYRKKMDIPA1876
274–290QEMPLTALLRNLGKMTAnimals were immunized with peptide Ro274–290[65,66]EMPLTALLRNLGKMT1598
274–290Human QEMPLTALLRNLGKMTAmino acid sequences of the human 60-kd Ro peptides used for immunization of BALB/c mice[67]EMPLTALLRNLGKMT1598
Mouse QEMPLTALLRNLGKMT
413–428Human VAFSDEMVPCPVTTDM
Mouse VAFACDMVPFPVTTDM
Rabbit VAFSDEMVPCPLTTDM
480–495Human AIALREYRKKMDIPAVAFSDEMVPCPVTTD20,917
Mouse AVALREYRKKMDIPAAIALREYRKKMDIPA1876
La1–107GYVDISLLVSFNKMKKLTTDGKLIARALKSSSVVELDLEGTRIRRKKPLGERPKDEEERTVYVELLPKNVTH [68]MKKLTTDGKLIARAL136
243–345KAKKRAQKDGVGQAASEVSKESRDLEFCSTEEEKETDRKGDSLSKVKRKHKKKHKERHKMGEEVIPLRVLSKTEWMDLKKEYLALQKASMASLKKTISQSKTEWMDLKKEYLAL922
111–242EQAAKAIEFLNNPPEEAPRKPGIFPKTVKNKPIPSLRVAEEKKKKKKKKGRIKKEESVQAKESAVDSSSSGVCKATKRPRTASEGSEAETPEAPKQPAKKKKKRDKVEASSLPEARAGKRERCSAEDEDCLSSSGVCKATKRPRTA561
Table 4

HLA-DR3 allele with predicted peptides of human Ro52.

AlleleStartEndLengthCore SequencePeptide SequenceIC50Percentile RankAdjusted Rank
HLA-DRB1*03:0129731115LSEDRRQVRNPWLILSEDRRQVRL28.000.100.10
HLA-DRB1*03:0129631015LILSEDRRQANPWLILSEDRRQVR29.000.110.11
HLA-DRB1*03:0129831215LSEDRRQVRPWLILSEDRRQVRLG29.000.110.11
HLA-DRB1*03:0129931315LSEDRRQVRWLILSEDRRQVRLGD29.000.110.11
HLA-DRB1*03:0130031415LSEDRRQVRLILSEDRRQVRLGDT29.000.110.11
HLA-DRB1*03:0130131515LSEDRRQVRILSEDRRQVRLGDTQ90.000.910.91
HLA-DRB1*03:0130231615LSEDRRQVRLSEDRRQVRLGDTQQ93.000.950.95
HLA-DRB1*03:0119721115LEKDEREQLLQELEKDEREQLRIL129.001.601.60
HLA-DRB1*03:0119621015LEKDEREQLQLQELEKDEREQLRI137.001.601.60
HLA-DRB1*03:0119821215LEKDEREQLQELEKDEREQLRILG137.001.601.60
Table 5

HLA-DR3 allele with predicted peptides of human Ro60.

AlleleStartEndLengthCore SequencePeptide SequenceIC50Percentile RankAdjusted Rank
HLA-DRB1*03:0112614015FKKDLKESMFTFIQFKKDLKESMK75.000.700.70
HLA-DRB1*03:0112714115FKKDLKESMTFIQFKKDLKESMKC75.000.700.70
HLA-DRB1*03:0112513915LFTFIQFKKLFTFIQFKKDLKESM76.000.740.74
HLA-DRB1*03:0124425815LIEEHRLVRVIHLIEEHRLVREHL78.000.760.76
HLA-DRB1*03:0112814215FKKDLKESMFIQFKKDLKESMKCG79.000.770.77
HLA-DRB1*03:0124525915LIEEHRLVRIHLIEEHRLVREHLL80.000.770.77
HLA-DRB1*03:0112914315FKKDLKESMIQFKKDLKESMKCGM81.000.790.79
HLA-DRB1*03:0124225615LIEEHRLVRLEVIHLIEEHRLVRE82.000.820.82
HLA-DRB1*03:0124325715LIEEHRLVREVIHLIEEHRLVREH82.000.820.82
HLA-DRB1*03:0124125515ELEVIHLIEELEVIHLIEEHRLVR83.000.830.83
Table 6

HLA-DR3 allele with predicted peptides of human La.

AlleleStartEndLengthCore SequencePeptide SequenceIC50Percentile RankAdjusted Rank
HLA-DRB1*03:013281515IIEDQQESLALKKIIEDQQESLNK49.000.360.36
HLA-DRB1*03:013271515IIEDQQESLEALKKIIEDQQESLN50.000.370.37
HLA-DRB1*03:013261515KEALKKIIEKEALKKIIEDQQESL51.000.380.38
HLA-DRB1*03:013291515IIEDQQESLLKKIIEDQQESLNKW51.000.380.38
HLA-DRB1*03:013301515IIEDQQESLKKIIEDQQESLNKWK54.000.400.40
HLA-DRB1*03:01911515ISEDKTKIRAELMEISEDKTKIRR131.001.601.60
Table 7

HLA-DR3 allele with predicted peptides of human M3R.

AlleleStartEndLengthCore SequencePeptide SequenceIC50Percentile RankAdjusted Rank
HLA-DRB1*03:0119220615AWVISFVLWAWVISFVLWAPAILF120.001.401.40
HLA-DRB1*03:0119520915VLWAPAILFISFVLWAPAILFWQY121.001.401.40
HLA-DRB1*03:0119320715VLWAPAILFWVISFVLWAPAILFW123.001.501.50
HLA-DRB1*03:0119420815VLWAPAILFVISFVLWAPAILFWQ123.001.501.50
HLA-DRB1*03:0119621015VLWAPAILFSFVLWAPAILFWQYF125.001.501.50
HLA-DRB1*03:0137538915ILNSTKLPSSTILNSTKLPSSDNL169.002.302.30
HLA-DRB1*03:0137438815ILNSTKLPSHSTILNSTKLPSSDN170.002.302.30
HLA-DRB1*03:0137138515LPGHSTILNLPGHSTILNSTKLPS171.002.302.30
HLA-DRB1*03:0137238615ILNSTKLPSPGHSTILNSTKLPSS171.002.302.30
HLA-DRB1*03:0137338715ILNSTKLPSGHSTILNSTKLPSSD171.002.302.30
HLA-DRB1*03:0154856215FRTTFKMLLNKTFRTTFKMLLLCQ198.002.702.70
HLA-DRB1*03:0154656015FRTTFKMLLLCNKTFRTTFKMLLL199.002.702.70
HLA-DRB1*03:0154956315FRTTFKMLLKTFRTTFKMLLLCQC199.002.702.70
HLA-DRB1*03:0154756115FRTTFKMLLCNKTFRTTFKMLLLC200.002.702.70
Table 8

HLA-DR3 allele with predicted peptides of human α-fodrin.

AlleleStartEndLengthCore SequencePeptide SequenceIC50Percentile RankAdjusted Rank
HLA-DRB1*03:011318133215SHDLQRFLSSHDLQRFLSDFRDLM12.000.010.01
HLA-DRB1*03:011319133315FLSDFRDLMHDLQRFLSDFRDLMS12.000.010.01
HLA-DRB1*03:011320133415FLSDFRDLMDLQRFLSDFRDLMSW12.000.010.01
HLA-DRB1*03:011322133615FLSDFRDLMQRFLSDFRDLMSWIN12.000.010.01
HLA-DRB1*03:0136337715FLADFRDLTLQRFLADFRDLTSWV26.000.060.06
HLA-DRB1*03:0136037415SYRLQRFLASYRLQRFLADFRDLT27.000.070.07
HLA-DRB1*03:0136137515FLADFRDLTYRLQRFLADFRDLTS27.000.070.07
HLA-DRB1*03:0136237615FLADFRDLTRLQRFLADFRDLTSW27.000.070.07
HLA-DRB1*03:0136437815FLADFRDLTQRFLADFRDLTSWVT28.000.100.10
HLA-DRB1*03:011323133715FLSDFRDLMRFLSDFRDLMSWING36.000.160.16
HLA-DRB1*03:011324133815FLSDFRDLMFLSDFRDLMSWINGI37.000.170.17
HLA-DRB1*03:0136537915FLADFRDLTRFLADFRDLTSWVTE83.000.830.83
HLA-DRB1*03:0136638015FLADFRDLTFLADFRDLTSWVTEM85.000.830.83
As presented in Table 4 and Figure 1, predicted Ro52 peptides on HLA-DR3 showed the core peptide with an anchor hydrophobic residue leucine at position 1, followed by a negatively charged residue at position 4 with the top-scoring aspartic acid residue and arginine residue at positions 6 and 9. The predicted Ro60 peptides showed a similar trend, in which lysine or phenylalanine (being predominantly a hydrophobic amino acid) was predicted at position 1 followed by a negatively charged residue at position 4, positively charged histidine, lysine, or arginine at position 6, and a positively charged residue at position 9 (Table 5 and Figure 2). The La predicted peptides also indicate the same pattern with isoleucine being present at position 1, followed by a positively charged or uncharged side chain amino acid (e.g., aspartic acid) at position 4 and a polar uncharged side chain at position 6 with a hydrophobic side chain at position 9 (Table 6 and Figure 3). The M3R predicted amino acid 9-mers indicate predominantly hydrophobic side-chained amino acids throughout the entire structure at positions 1, 4, 6, and 9 (Table 7 and Figure 4). Being a 240 KDa protein, α-fodrin showed a similar motif to Ro52, Ro60, and La with a hydrophobic amino acid at position 1 and predominantly charged amino acids at position 4 (predominantly negative) and 6 (predominantly positive), with a hydrophobic amino acid (i.e., lysine) at position 9, as shown in Table 8 and Figure 5. In summary, in silico antigenic epitope mapping of DRB1*03:01 allele with Ro52, Ro60, La, M3R, and α-fodrin showed that the general trend of all peptides predicted to bind have a backbone structure with position 1 being occupied by a hydrophobic residue, position 4 favors charged amino acids, position 6 favors negatively charged amino acids, and position 9 (especially for Ro52, Ro60) having a positively charged amino acid; α-fodrin was an anomaly preferring a hydrophobic residue at this position. La and M3R mostly indicate hydrophobic residues and amino acids with polar uncharged side chains. This also predicts the nature of the pockets in HLA DRB1*03:01, with positions 1 and 4 being rigid, whereas flexibility in the presentation of amino acids on positions 6 and 9 with either charged or hydrophobic amino acids.

3.2. Elucidating the Nature of Predicted Peptides Presented on Other Risk Alleles

As presented in Table 1, in addition to the HLA-DRB1*03:01 allele, there are other pertinent risk HLA alleles that were shown to associate with SjS. To further characterize the antigenic epitopes, we selected five different predominant alleles, specifically HLA-DRB1*01:01, HLA-DRB1*15:01, HLA DRB1*04:05, HLA-DRB4*01:01, and HLA-DRB3*01:01. As indicated in Table 9, Ro52 with the same trend of hydrophobic and charged peptides indicates a strong predictive binding by the NetMHCIIPan for the HLA-DRB1*01:01 and HLA DRB1*04:05 alleles with IC50 values that are lower than 50 nM. Most Ro60 potential binders showed higher IC50 predicted scores for most peptides identifying them to be poor binders (Table 10). La predicted peptides point toward having a slightly different amino acid composition for predicted peptides, with the second anchor position being primarily hydrophobic instead of negatively charged (Table 11). M3R peptides showed a wide disparity in predicted peptide binding for some alleles, suggesting that M3R antigens may be selectively processed and presented based on the presence of alleles such as HLA DRB1*04:05, HLA- DRB1*15:01, and HLA-DRB1*01:01 (Table 12). Lastly, α-fodrin peptide analysis indicates a slightly different sequence of peptides on most risk alleles, as indicated in Table 13. Following a similar pattern to the peptide composition presented, with slight deviations in HLA-DRB3*01:01 and HLA DRB1*04:05, it was observed that HLA-DRB1*01:01 and HLA- DRB1*15:01 had similar peptide presentation patterns to HLA-DRB1*03:01, indicating the higher probability of these alleles presenting the same peptides. In summary, in silico antigenic epitope mapping of HLA-DRB1*01:01, HLA-DRB1*15:01, HLA DRB1*04:05, HLA-DRB4*01:01, and HLA-DRB3*01:01 alleles with Ro52, Ro60, La, M3R, and α-fodrin showed that a similar trend of positions 1 and 4 having hydrophobic and positively charged residues but positions 6 and 9 being fluid to present either a charged or a hydrophobic amino acid for most predicted peptides.
Table 9

Predicted peptides on risk alleles for human Ro52.

AlleleCore SequencePeptide SequenceIC50
HLA-DRB1*01:01LKNLRPNRQRFLLKNLRPNRQLAN44.00
RFLLKNLRPCRQRFLLKNLRPNRQ52.00
HLA-DRB1*15:01TGPLRPFFSCAFTGPLRPFFSPGF122.00
LRPFFSPGFAFTGPLRPFFSPGFN123.00
HLA -DRB1*04:05EAGMVSFYNLDYEAGMVSFYNITD39.00
MVSFYNITDDYEAGMVSFYNITDH39.00
HLA-DRB4*01:01LKNLRPNRQRFLLKNLRPNRQLAN102.00
RFLLKNLRPCRQRFLLKNLRPNRQ110.00
HLA-DRB3*01:01KRADWKEVIIAIKRADWKEVIIVL229.00
EVEIAIKRAEVEIAIKRADWKEVI247.00
Table 10

Predicted peptides on risk alleles for human Ro60.

AlleleCore SequencePeptide SequenceIC50
HLA-DRB1*01:01LFTFIQFKKLFTFIQFKKDLKESM286.00
FKKDLKESMFTFIQFKKDLKESMK289.00
HLA-DRB1*15:01IQEIKSFSQCEVIQEIKSFSQEGR238.00
VIQEIKSFSGRGCEVIQEIKSFSQ251.00
HLA-DRB1*04:05LRLSHLKPSHKDLLRLSHLKPSSE75.00
LSHLKPSSEDLLRLSHLKPSSEGK75.00
HLA-DRB4*01:01TYYIKEQKLEGGTYYIKEQKLGLE228.00
KDLLRLSHLSHKDLLRLSHLKPSS237.00
HLA-DRB3*01:01LFTFIQFKKLFTFIQFKKDLKESM286.00
FKKDLKESMFTFIQFKKDLKESMK289.00
Table 11

Predicted peptides on risk alleles for human La.

AlleleCore SequencePeptide SequenceIC50
HLA-DRB1*01:01FNVIVEALSTDFNVIVEALSKSKA52.00
DFNVIVEALNRLTTDFNVIVEALS60.00
HLA- DRB1*15:01LHILFSNHGREDLHILFSNHGEIK34.00
DLHILFSNHQTCREDLHILFSNHG37.00
HLA-DRB1*04:05FNVIVEALSLTTDFNVIVEALSKS66.00
NRLTTDFNVNRLTTDFNVIVEALS67.00
HLA-DRB4*01:01EIMIKFNRLVPLEIMIKFNRLNRL75.00
IKFNRLNRLPLEIMIKFNRLNRLT77.00
HLA-DRB3*01:01DLDDQTCREDLDDQTCREDLHILF142.00
CREDLHILFLDDQTCREDLHILFS143.00
Table 12

Predicted peptides on risk alleles for human M3R.

AlleleCore SequencePeptide SequenceIC50
HLA-DRB1*01:01IAFLTGILAVVFIAFLTGILALVT9.00
LTGILALVTFIAFLTGILALVTII12.00
HLA- DRB1*15:01IIGNILVIVVTIIGNILVIVSFKV14.00
ILVIVSFKVIIGNILVIVSFKVNK14.00
HLA-DRB1*04:05VPPGECFIQVPPGECFIQFLSEPT7.00
FIQFLSEPTPPGECFIQFLSEPTI7.00
HLA-DRB4*01:01LVTIIGNILGILALVTIIGNILVI88.00
IGVISMNLFADLIIGVISMNLFTT98.00
HLA-DRB3*01:01GECFIQFLSGECFIQFLSEPTITF124.00
FLSEPTITFECFIQFLSEPTITFG127.00
Table 13

Predicted peptides on risk alleles for human α-fodrin.

AlleleCore SequencePeptide SequenceIC50
HLA-DRB1*01:01FQKIKSMAANGRFQKIKSMAASRR3.00
IKLLQAQKLMREKGIKLLQAQKLV5.00
HLA- DRB1*15:01WRRLKAQMILDRWRRLKAQMIEKR68.00
EVLDRWRRLNEVLDRWRRLKAQMI71.00
HLA-DRB1*04:05FRSSLSSAQHDAFRSSLSSAQADF38.00
HDAFRSSLSREAHDAFRSSLSSAQ39.00
HLA-DRB4*01:01KMREKGIKLKMREKGIKLLQAQKL5.00
IKLLQAQKLMREKGIKLLQAQKLV5.00
HLA-DRB3*01:01IQETRTYLLIQETRTYLLDGSCMV25.00
YLLDGSCMVQETRTYLLDGSCMVE25.00

3.3. Homology of Predicted Peptides Binding to HLA-DRB1*03:01 to Viral and Bacterial Proteins

Molecular mimicry is one of the main mechanisms by which infections might trigger autoimmune disease [69]. Several viruses and bacteria have been implicated as potential etiological agents in human patients, and specific viruses were determined to cause various clinical signs of SjS in animal models. However, there is still little information about the causative role in disease initiation and progression [70]. As presented, we have identified specific antigenic epitopes of the DRB1*03:01 allele with Ro52, Ro60, La, M3R, and α-fodrin proteins in silico. To determine whether these antigenic epitopes mimic viral and bacterial proteins, we utilized the BLAST tool to identify the amino acid homology between the SjS-associated antigenic epitopes of HLA-DRB1*03:01 and all known viral proteins in the Uniprot databases. As presented in Table 14, Ro52 peptides showed similarities between bat viruses such as Miniopterus schreibersii polyomavirus, and other plant-based pathogens. Ro60 peptides showed 100% homology between Botrytis (gray mold) viruses which have been stipulated to infect Botrytis (a major agricultural hazard) [71]. M3R peptides showed 88.9% homology between a variety of plant-based viral pathogens and affected the growth of agriculture and horticulture-based fungi (pests). La and human/mouse α-fodrin peptides indicate a similarity between Helenium virus and Caudovirales phages that belong to the family of multiple Carlaviruses that infect various ornamental plants [72]. As presented in Table 15 for bacterial proteins, Ro52 predicted core peptides showed 100% homology to Stigmatella aurantiaca and Cystobacter fuscus that are naturally occurring and a promising source for the discovery of new biologically active natural products [73,74]. Ro60 peptides did not indicate homology to any known bacterial peptides as represented. M3R peptides present a likeness with Desulfobacterales bacterium that has a sulfur-based metabolism [75]. La peptides showed 100% similarity between Pseudomonas species which is known to cause pneumonia and infections in blood [76], while α-fodrin peptides are homologous to certain aquatic and terrestrial bacteria with an 88.9% similarity. In summary, the results suggest that several environmental factors may be involved in the pathogenesis of SjS, with the main role being played by infectious agents for animals or plants, with molecular homologs acting as triggers that may contribute to disease progression in the existence of a predisposing genetic background.
Table 14

Homology of predicted peptides binding to HLA-DRB1*03:01 to viral proteins.

ProteinPredicted PeptideVirusProteinHomology with Sequence (Percentage)
Human Ro52LEKDEREQLMiniopterus schreibersii polyomavirus 1Large T antigen88.9%
Micromonas pusilla virus PL1Uncharacterized77.8%
Miniopterus schreibersii polyomavirus 1Small T antigen88.9%
Mouse Ro52MEMDLTMQR Wiseana iridescent virus (WIV) (Insect iridescent virus type 9)70%
Mouse Ro52KELAEKMEMMimivirus LCMiAC02Uncharacterized77.8%
Mouse Ro60LFTFIQFKKBotrytis virus X (isolate Botrytis cinerea/New Zealand/Howitt/2006) (BOTV-X)RNA replication100%
Human Ro60LFTFIQFKKBotrytis virus X (isolate Botrytis cinerea/New Zealand/Howitt/2006) (BOTV-X)B19:B22RNA replication protein100%
Human M3RAWVISFVLWPseudomonas phage PaMx74Putative membrane protein75%
Human M3RLPGHSTILNPepper mild mottle virus (strain Spain) (PMMV-S)Replicase large subunit88.9%
Odontoglossum ringspot virus (isolate Korean Cy) (ORSV-Cy)Replicase large88.9%
Tobacco mild green mosaic virus (TMGMV) (TMV strain U2)Replicase large subunit88.9%
Turnip vein-clearing virus (TVCV)Replicase large subunit88.9%
Youcai mosaic virus (YoMV)Replicase large subunit88.9%
Hoya necrotic spot virusMethyltransferase/RNA helicase88.9%
Odontoglossum ringspot virusMethyltransferase/RNA helicase88.9%
Virgaviridae sp.Replication-associated protein88.9%
Tobacco mild green mosaic virus (TMGMV) (TMV strain U2)Replicase large subunit88.9%
Brugmansia mild mottle virusMethyltransferase/RNA helicase88.9%
Streptocarpus flower break virusMethyltransferase/RNA helicase88.9%
Ribgrass mosaic virus (RMV)Methyltransferase/RNA helicase88.9%
Wasabi mottle virusMethyltransferase/RNA helicase88.9%
Piper chlorosis virusReplicase large subunit88.9%
Human LaKEALKKIIEHelenium virus S (HelVS)Helicase88.9%
Arthrobacter phage BoersmaDNA polymerase I100%
Human/Mouse α-fodrinSYRLQRFLAUncultured Caudovirales phageUncharacterized protein88.9%
Table 15

Homology of predicted peptides binding to HLA-DRB1*03:01 to bacterial proteins.

Human Ro52LSEDRRQVRStigmatella aurantiaca (strain DW4/3-1)Peptidase, M20 family100%
Cystobacter fuscus DSM 2262Acetylornithine deacetylase100%
Stigmatella aurantiaca (strain DW4/3-1)Peptidase, M20/M25/M40 family100%
Human Ro52LEKDEREQLGeobacter sp. (strain M21)Endopeptidase La100%
Seonamhaeicola marinus RNA polymerase sigma factor100%
Mouse Ro52MEMDLTMQR Sulfuriferula nivalis Phytoene synthase88.9%
Corallococcus exercitus Phytoene/squalene synthase88.9%
Corallococcus aberystwythensis Phytoene/squalene synthase88.9%
Corallococcus sp. CA047BPhytoene/squalene synthase88.9%
Corallococcus exercitus Phytoene/squalene synthase88.9%
Mouse Ro52KELAEKMEM Arenicella xantha RNA pol sigma factor100%
Gamma proteobacterium SS-5RNA pol sigma factor100%
Granulosicoccus antarcticus RNA pol sigma factor100%
Gammaproteobacteria bacterium RNA pol sigma factor100%
Granulosicoccus sp.RNA pol sigma factor100%
Candidatus Methyloumidiphilum RNA pol sigma factor100%
Gammaproteobacteria bacterium Fumarate flavoprotein100%
Tindallia magadiensis RNA pol sigma factor100%
Oceanospirillales bacterium RNA pol sigma factor100%
Cyanobacterium sp. IPPASRNA pol sigma factor100%
Cyanobacterium sp. HL-69RNA pol sigma factor100%
Culicoidibacter larvae RNA pol sigma factor100%
Chromobacterium violaceum RNA pol sigma factor100%
Cyanobacterium stanieri RNA pol sigma factor100%
Clostridium cellulovorans RNA pol sigma factor100%
Anaerolineaceae bacterium RNA pol sigma factor100%
Pseudobythopirellula maris RNA pol sigma factor100%
Bacteroidetes bacterium RNA pol sigma factor100%
Epulopiscium sp.RNA pol sigma factor100%
Betaproteobacteria bacterium RNA pol sigma factor100%
Fulvivirga imtechensis AK7 100%
Human M3RAWVISFVLW Planctomycetes bacterium Uncharacterized protein88.9%
Mouse M3RVLWAPAILF Desulfobacterales bacterium Site-2 protease family protein88.9%
Human LaKEALKKIIECandidatus Dojkabacteria bacteriumUncharacterized protein100%
Hydrogenimonas sp.Anthranilate phosphoribosyltransferase100%
candidate division WOR-3 bacteriumUncharacterized protein100%
Mouse LaQRYWQKILVPlanctomycetes bacterium SM23_25Uncharacterized protein88.9%
Mouse LaILVDRQAKLPseudomonas sp. NFR16Uncharacterized100.0%
Pseudomonas sp. Bc-hUncharacterized100.0%
Pseudomonas sp. GV021Uncharacterized100.0%
Pseudomonas abietaniphila Uncharacterized100.0%
Pseudomonas graminis DUF2914 family100.0%
Pseudomonas graminis Uncharacterized100.0%
Pseudomonas graminis Uncharacterized100.0%
Pseudomonas graminis DUF2914 domain100.0%
Pseudomonas graminis Uncharacterized100.0%
Pseudomonas sp.DUF2914 domain100.0%
Pseudomonas sp. NFACC02Uncharacterized100.0%
Pseudomonas sp. LP_7_YMDUF2914 domain100.0%
Pseudomonas sp. M47T1Uncharacterized100.0%
Pseudomonas eucalypticola DUF2914 domain100.0%
Pseudomonas sp. K1S02-6DUF2914 domain100.0%
Human/Mouse α-fodrinFLSDFRDLM Cocleimonas flava Uncharacterized88.9%
Verrucomicrobiales bacteriumUncharacterized88.9%
Planctomycetaceae bacteriumSH3 domain88.9%

3.4. Homology of Predicted Peptides Binding to Other Risk HLA Alleles to Viral and Bacterial Proteins

As indicated previously, we have also identified antigenic epitopes of HLA-DRB1*01:01, HLA-DRB1*15:01, HLA-DRB1*04:05, HLA-DRB4*01:01, and HLA-DRB3*01:01 alleles with Ro52, Ro60, La, M3R, and α-fodrin. To further determine if these antigenic epitopes mimic any known viral or bacterial proteins, we compared these peptide sequences using the Uniprot databases. As presented in Table 16, Ro60 predicted peptides of HLA-DRB1*01:01 showed homology with the RNA replication protein of Botrytis virus X. Salmonella phage SPFM12 showed a similarity to La peptides. In contrast, the M3R peptides indicated a 100% similarity with the Bacillus phage. While the α-fodrin peptides for this allele did not indicate a homology with any viral proteins, they were very similar to naturally occurring bacteria that are responsible for fermentation, such as Candidatus pseudoramibacter [77] and Eubacteriaceae bacterium, which is a pathogen that has been recently found to contribute to colorectal cancer initiation via promoting colitis [78]. HLA- DRB1*15:01 exhibited a homology for either bacteria or viruses for Ro52, Ro6,0, and La. Still, it yielded a 100% homology to the viral protein u (Vpu) protein of the human immunodeficiency virus 1 (HIV-1) to the M3R peptide (IIGNILVIV). Furthermore, HLA- DRB1*15:01 allele is predicted to present peptide EVLDRWRRL, which is very similar to many proteins found in Streptomyces and Saccharopolyspora species which have been investigated extensively for their bioactive natural pharmacological products [79]. The allele HLA-DRB4*01:01 was shown to present RFLLKNLRP peptide of Ro52. This specific peptide showed a 100% homology between the glycoprotein 120 (gp120) of HIV-1. The RFLLKNLRP peptide of Ro52 also showed homology with many phages and other viruses. Lastly, TYYIKEQKL peptide of Ro60 showed a similarity between the Ro-like RNA binding protein for the Streptomyces phage.
Table 16

Homology of predicted peptides binding to HLA alleles to viral proteins.

AlleleSjS ProteinCore SequenceVirusViral ProteinHomology
HLA-DRB4*01:01Ro52RFLLKNLRPHuman Immunodeficiency VirusGlycoprotein 120100.0%
Serratia phage 2050H2Uncharacterized87.5%
Klebsiella phage 31Endopeptidase Rz87.5%
Escherichia phage ECA2Endopeptidase87.5%
Leclercia phage 10164RHUncharacterized87.5%
Citrobacter phage SH1Endopeptidase87.5%
Citrobacter phage phiCFP-1Uncharacterized87.5%
Serratia phage SALSAEndopeptidase87.5%
Citrobacter phage SH2Endopeptidase Rz87.5%
Klebsiella phage KPP-5Endopeptidase87.5%
Leclercia phage 10164-302Uncharacterized87.5%
Enterobacter phage E-2Endopeptidase87.5%
Klebsiella phage NL_ZS_3Endopeptidase Rz87.5%
Serratia phage SM9-3YI-spanin87.5%
Escherichia phage LL2I-spanin87.5%
Salmonella phage phiSG-JL2Gp18.587.5%
Yersinia phage phiYeO3-12Endopeptidase87.5%
Enterobacter phage E-4Endopeptidase Rz87.5%
Enterobacter phage E-3Endopeptidase87.5%
Yersinia phage phiYe-F10Uncharacterized87.5%
Klebsiella phageendopeptidase87.5%
HLA-DRB1*01:01Ro60LFTFIQFKKBotrytis virus XRNA replication protein
100%Ro60TYYIKEQKLStreptomyces phageRo-like RNA binding protein88.9%
Streptomyces phageRo-like RNA binding protein88.9%
Streptomyces phage BeuffertRo-like RNA binding protein88.9%
Pyramimonas orientalis virusUncharacterized protein69.2%
KDLLRLSHLBotrytis virus XRNA replication100%
HLA-DRB1*01:01LaDFNVIVEALSalmonella phage SPFM12Uncharacterized88.9%
HLA-DRB3*01:01LaDLDDQTCRELeviviridae sp.RNA replicase beta chain64.3%
HLA-DRB1*01:01M3RIAFLTGILABacillus phage 031MP004Uncharacterized100%
Bacillus phage 055SW001Uncharacterized100%
Bacillus phage 022DV001Uncharacterized100%
Bacillus phage 031MP002Uncharacterized100%
Bacillus phage 031MP003Uncharacterized100%
HLA- DRB1*15:01M3RIIGNILVIVHuman immunodeficiency virus 1Protein Vpu100%
Compared to the bacterial proteins (Table 17), we found that Helicobacter sp. showed a 100% homology with the Ro60 peptide IKLLQAQKL. Helicobacter sp. has been found to cause chronic gastritis and plays an important role in peptic ulcer disease, gastric carcinoma, and gastric lymphoma. In addition, the homology between Ro60 peptide TYYIKEQKL of HLA-DRB4*01:01 and Fusobacterium necrophorum, a rare causative agent of otitis and sinusitis, indicates the linkage of an oral biology homologue [80]. HLA-DRB3*01:01 for both Ro52 and La peptides showed Virgibacillus massiliensis and Oscillospiraceae bacterium, which have been isolated from the human stool and may form a part of the microbiome, had a 100% homology with CREDLHILF (La) and KRADWKEVI (Ro52) [81,82]. The α-fodrin peptide IKLLQAQKL was indicated to be 100% similar to the glycosyltransferase protein of both Eubacteriaceae bacterium and Candidatus pseudoramibacter [83], which are microbes that have been observed in the gut [77]. Alterations in the gut and oral microbiota composition have previously been suggested as possible environmental factors in the etiology of pSjS and SLE [84]. In summary, the results suggest that different species in Table 17 belong to Bacteriodes, Actinomyces, and Lactobacillus that have been found in patients of both pSjS and SLE [85,86,87,88]. In conclusion to the results observed for bacterial homology, it is known that pSjS patients have less diversity in their gut microbiome with less abundant beneficial bacteria and more abundant opportunistic bacteria with pro-inflammatory activity compared with healthy individuals. Out of the primary homologs observed, most of them indicate a 100% homology to the three main bacterial species found in the gut, indicating the gut microbiome contribution in disease progression by molecular mimicry on a genetically predisposed background.
Table 17

Homology of predicted peptides binding to HLA alleles to bacterial proteins.

AlleleSjS ProteinCore SequenceBacteriaBacterial ProteinHomology
HLA -DRB1*04:05Ro52EAGMVSFYN Legionella moravica Ankyrin88.9%
Legionella sp. Km535Ankyrin repeat domain-containing protein88.9%
Ro52MVSFYNITD Legionella moravica Ankyrin88.9%
Legionella sp. Km535Ankyrin repeat domain-containing protein88.9%
HLA-DRB4*01:01Ro60TYYIKEQKLHelicobacter sp. 11S03491-1Protoporphyrinogen oxidase100%
Fusobacterium Uncharacterized100%
Fusobacterium Uncharacterized100%
HLA-DRB3*01:01LaCREDLHILF Virgibacillus massiliensis Uncharacterized100%
HLA-DRB4*01:01M3RLVTIIGNILUnculturedUncharacterized88.9%
HLA-DRB1*01:01Alpha FodrinIKLLQAQKL Eubacteriaceae Glycosyltransferase100.0%
Candidatus Pseudoramibacter Glycosyltransferase100.0%
HLA- DRB1*15:01Alpha FodrinEVLDRWRRL Desulfonatronum sp. Thioredoxin88.9%
Thermoleophilaceae bacterium Proline RNA ligase88.9%
Thermoleophilaceae bacterium Proline tRNA ligase88.9%
Nonomuraea nitratireducens DUF885 family protein88.9%
Nonomuraea phyllanthi DUF885 domain-containing protein88.9%
Firmicutes bacterium Biotin protein ligase88.9%
Firmicutes bacterium Biotin protein ligase88.9%
Streptomyces malaysiensis Putative non-ribosomal peptide synthetase100.0%
Streptomyces malaysiensis Non-ribosomal peptide synthetase100.0%
Streptomyces malaysiensis Carrier domain-containing protein100.0%
Aquisphaera giovannonii Phosphomannomutase/phosphoglucomutase100.0%
Streptomycetaceae bacterium Uncharacterized protein88.9%
Curtobacterium sp. MCPF17_047Uncharacterized protein100.0%
Nitriliruptorales bacterium DUF1932 domain-containing protein100.0%
Paracoccus homiensis Acetyltransferase (GNAT) family protein100.0%
Actinophytocola xanthii SnoaL-like domain-containing protein100.0%
Frigoribacterium sp. PhB160S-DNA-T family DNA segregation ATPase100.0%
Frigoribacterium sp. PhB107S-DNA-T family DNA segregation ATPase100.0%
Frigoribacterium sp. ACAM 257Cell division protein FtsK100.0%
Geodermatophilus sp. DF01_2Peptidase_M16_C domain-containing protein100.0%
Acidobacteria bacterium Uncharacterized protein100.0%
Nitrosococcus oceani C-27Transposase88.9%
Nitrosococcus oceani (strain)Y1_Tnp domain-containing protein88.9%
Dietzia sp. MeA6-2017Uncharacterized protein100.0%
Firmicutes bacterium Bifunctional ligase/repressor BirA100.0%
Dietzia sp. oral taxon 368Uncharacterized protein100.0%
Saccharopolyspora sp. ASAGF58Uncharacterized protein100.0%
Saccharopolyspora spinosa Uncharacterized protein100.0%
Chloroflexi bacterium Biotin [acetyl-CoA-carboxylase] ligase100.0%
Actinobacteria bacterium 13Biotin [acetyl-CoA-carboxylase] ligase100.0%
Pelagibaca abyssi Uncharacterized protein100.0%
Candidatus Kentron sp. LFYType III restriction enzyme88.9%
Planctomycetes bacterium Diguanylate cyclase88.9%
Candidatus Kentron sp. LFYType III restriction enzyme, res subunit88.9%
Candidatus Solibacter sp.3-isopropylmalate dehydratase large subunit88.9%
Hyalangium minutum Uncharacterized protein88.9%
Actinokineospora terrae AraC-type DNA-binding protein88.9%
Actinokineospora cianjurensis AraC-like DNA-binding protein88.9%
HLA-DRB4*01:01Alpha FodrinIKLLQAQKL Eubacteriaceae Candidatus Pseudoramibacter GlycosyltransferaseGlycosyltransferase100.0%100.0%

4. Discussion

HLA genes are the best documented genetic risk factors for the development of autoimmune diseases and could be directly involved in SjS [89]. This study shows the presence of a similar pattern of amino acids that may be presented by the HLAs based on their structure. The similarity and overlap in the peptides presented on different risk alleles suggest that the same antigenic peptides may be responsible for presenting different autoantigens and thereby initiating the autoimmune cascade. In addition, the results provide insight towards not only the genetic predisposition but also environmental and biological factors that contribute to the onset and progression of the disease. The peptide homology represents similarities in peptides presented to the immune system that shows homology to viral pathogens and bacteria that are both environmental triggers. Bacteria form part of the microbiome of an individual. Different amino acids present at specific positions in the biochemical structure may confer protection in the peptides presented. It has been shown in previous studies that, consistent with our results, the requirements of peptides for binding to HLA-DR3 vary among different DR3 binding peptides [90]. Similar to our results, the anchor peptides at different positions 1, 4, 6 and 9 indicate the absence of an anchor or the presence of only a weak anchor residue at either position 4 can be compensated for by the presence of a strong, positively charged anchor residue at position 6 in case of both viral antigens and autoimmune peptides [90,91]. Similar to the predicted peptide trend indicated, Verhagen et al. [92] showed that most insulin and pro-insulin peptides presented in type 1 diabetes also show a similar trend of hydrophobic residues at key anchor positions with a mix of charged residues preferred at other anchor locations. In Graves’ disease, arginine (a positively charged amino acid) has previously been reported to confer a high risk if present at a specific position (in the case of the processed peptide presentation), highlighting the importance of specific residues being present at specific positions for the onset of disease [93,94]. Additionally, we examined certain HLA alleles’ protective role that reduces the probability of specific antigen presentation. HLA-DRB1*01 allele has been proven to be negatively associated with pSjS, a result consistent with the Hungarian population in the study carried out by Kovacs et al. [95]. The protective role of the DRB1*01 allele was confirmed by a meta-analysis in which serological groups DR1 and DR7 were negatively associated with pSjS. However, further research is required in the area [32]. Investigating the cross-presentation of the autoantigen epitopes with bacteria or viruses can provide an important insight into a potential mechanism of disease initiation. The results showed predicted peptides of the five autoantigens exhibiting 100% homology to various reported gut commensal and oral bacteria. Additionally, viral infectious agents that may mimic SjS include hepatitis A, B or C, parvovirus B19, dengue, Epstein Barr virus (EBV), and HIV. Certain viruses express tropism for salivary and lacrimal glandular tissue, especially the herpesvirudae family, which is a large family of DNA viruses that includes cytomegalovirus (CMV), EBV, and human herpesvirus (HHV)-6,7,8. Several lines of epidemiological, serological, and experimental evidence implicate retroviral infections—especially human T-lymphotropic virus type (HTLV)-1, HIVs, human intracisternal A-type retroviral particle (HIAP)-I, and human rhinoviruses (HRV)-5—as triggering factors for the development of SjS. The gut is the most abundant site for bacteria, with nearly 1000 species having microbes that belong to four major phyla: Firmicutes, Bacteroidetes, Actinobacteria, and Proteobacteria. Bacteroidetes, along with Firmicutes, represent more than 90% of the entire plethora of microbes in the gut. Based on our findings, the bifunctional ligase/repressor protein of Firmicutes bacterium indicated a 100% homology for a peptide from α-fodrin for the allele HLA- DRB1*15:01. Eubacteriaceae bacterium, Pseudoramibacter, and other Firmicutes bacteria’s glycosyltransferases are indicated to have perfect homology to a predicted α-fodrin peptide for the allele HLA-DRB4*01:01. There are indications of multiple Candidatus bacterial species, which all belong to the Firmicutes phylum for multiple predicted peptides in different alleles, as observed in Table 15 and Table 17. Most indicated bacteria in the data presented had been found to be from three phyla (Firmicutes, Bacteroidetes, Actinobacteria) mentioned above that indicate the probability of bacterial peptides being similar to predicted salivary and lacrimal gland-based proteins that are presented on HLA’s and result in inflammation. In this study, we were able to predict antigenic epitopes or pathogenic peptides that may be presented in SjS based on a structure-based approach for the HLA cell surface protein. The finding may refine the etiology of the autoimmune process. As simplified in Figure 6, the disease progression is initiated by an environmental trigger like a viral infection on a genetically susceptible individual with a specific HLA allele. Salivary gland epithelial cells experience increased apoptosis and act as sources of pro-inflammatory cytokines such as IFN-γ. Macrophages are attracted to the region and act as the main agents for phagocytosis by participating in tissue destruction. Presentation of viral/bacterial antigens by MHC molecules on antigen-presenting cells leads to priming CD4+ T cells. With the help of T cells, B cells can form lymphocytic infiltrates or participate in ectopic germinal center formation where they can undergo class switching, affinity maturation, and differentiation into plasma cells that secrete high levels of antibodies. These antibodies may be cross-reactive against autoantigens such as Ro52, Ro60, La, α-fodrin, and M3R. The autoantibodies can form immune complexes by binding autoantigens and fixing complement or engaging Fc-γ receptors, further facilitating apoptosis. This process results in inflammation and tissue destruction through the recruitment of inflammatory cells and phagocytes to tissues. Apoptotic cells from damaged tissues can be taken up by phagocytes, which present novel autoantigens, supporting further priming and autoreactivity. Therefore, in order to understand the etiology and designing therapies, it is imperative that we understand the genetic factors and the environmental agents working together to create a suitable setting to initiate the autoimmune cascade.
Figure 6

Disease progression for individuals with genetic predisposition (specific HLA) and microbial trigger.

The apparent limitation of the study is that the peptide prediction is strictly based in silico. Since this is an in silico study, the results presented are theoretical and should be subjected to many of the same limitations implicit in the MHC binding affinity prediction tool(s) upon which it is based. Regardless, this is the first study that provides a comprehensive mapping of the antigenic epitopes based on the HLA structure. The advantage of this approach that we describe to map peptides will facilitate in identifying drugs and therapies specific and targeted to disease-susceptible HLA. As listed, many autoimmune diseases are associated with specific HLA alleles and high-resolution crystal structures exist for almost all MHC class II molecules. Strategies for the selection of HLA allele-specific peptides presented and testing their activity in experimental systems can be implemented. Further, this research will aid the ability to identify HLA allele-specific drugs based on the structure that will have applicability for treating autoimmune diseases and other HLA-associated conditions.
  95 in total

1.  Evidence for H2 consumption by uncultured Desulfobacterales in coastal sediments.

Authors:  Stefan Dyksma; Petra Pjevac; Kin Ovanesov; Marc Mussmann
Journal:  Environ Microbiol       Date:  2017-09-14       Impact factor: 5.491

2.  Interaction between innate immunity and Ro52-induced antibody causes Sjögren's syndrome-like disorder in mice.

Authors:  Barbara M Szczerba; Paulina Kaplonek; Nina Wolska; Anna Podsiadlowska; Paulina D Rybakowska; Paromita Dey; Astrid Rasmussen; Kiely Grundahl; Kimberly S Hefner; Donald U Stone; Stephen Young; David M Lewis; Lida Radfar; R Hal Scofield; Kathy L Sivils; Harini Bagavant; Umesh S Deshmukh
Journal:  Ann Rheum Dis       Date:  2015-02-05       Impact factor: 19.103

3.  Cepharanthine blocks TSH receptor peptide presentation by HLA-DR3: Therapeutic implications to Graves' disease.

Authors:  Cheuk Wun Li; Roman Osman; Francesca Menconi; Erlinda Concepcion; Yaron Tomer
Journal:  J Autoimmun       Date:  2020-01-21       Impact factor: 7.094

4.  The Sjogren's syndrome-associated autoantigen Ro52 is an E3 ligase that regulates proliferation and cell death.

Authors:  Alexander Espinosa; Wei Zhou; Monica Ek; Malin Hedlund; Susanna Brauner; Karin Popovic; Linn Horvath; Therese Wallerskog; Mohamed Oukka; Filippa Nyberg; Vijay K Kuchroo; Marie Wahren-Herlenius
Journal:  J Immunol       Date:  2006-05-15       Impact factor: 5.422

Review 5.  HLA class II peptide-binding and autoimmunity.

Authors:  J A Gebe; E Swanson; William W Kwok
Journal:  Tissue Antigens       Date:  2002-02

Review 6.  Viruses of botrytis.

Authors:  Michael N Pearson; Andrew M Bailey
Journal:  Adv Virus Res       Date:  2013       Impact factor: 9.937

7.  [Immunogenetics of the Sjogren's syndrome in southern Spain].

Authors:  R García Portales; M A Belmonte Lope; M T Camps García; P Ocón Sánchez; A Alonso Ortiz; M Guil García; E de Ramón Garrido
Journal:  An Med Interna       Date:  1994-02

8.  Relationshipp of HLA-Dw3 and HLA-B8 to Sjögren's syndrome.

Authors:  K H Fye; P I Terasaki; J P Michalski; T E Daniels; G Opelz; N Talal
Journal:  Arthritis Rheum       Date:  1978-04

9.  Subgingival microbiota dysbiosis in systemic lupus erythematosus: association with periodontal status.

Authors:  Jôice Dias Corrêa; Débora Cerqueira Calderaro; Gilda Aparecida Ferreira; Santuza Maria Souza Mendonça; Gabriel R Fernandes; E Xiao; Antônio Lúcio Teixeira; Eugene J Leys; Dana T Graves; Tarcília Aparecida Silva
Journal:  Microbiome       Date:  2017-03-20       Impact factor: 14.650

10.  Immune responses to Ro60 and its peptides in mice. I. The nature of the immunogen and endogenous autoantigen determine the specificities of the induced autoantibodies.

Authors:  U S Deshmukh; J E Lewis; F Gaskin; C C Kannapell; S T Waters; Y H Lou; K S Tung; S M Fu
Journal:  J Exp Med       Date:  1999-02-01       Impact factor: 14.307

View more
  1 in total

1.  Special Issue "Diseases of the Salivary Glands-Part II".

Authors:  Margherita Sisto
Journal:  J Clin Med       Date:  2022-09-22       Impact factor: 4.964

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.