Literature DB >> 26849221

Identifying Patient-Specific Epstein-Barr Nuclear Antigen-1 Genetic Variation and Potential Autoreactive Targets Relevant to Multiple Sclerosis Pathogenesis.

Monika Tschochner1, Shay Leary1, Don Cooper1, Kaija Strautins1, Abha Chopra1, Hayley Clark1, Linda Choo1, David Dunn1, Ian James1, William M Carroll2,3, Allan G Kermode1,2,3, David Nolan1,4.   

Abstract

BACKGROUND: Epstein-Barr virus (EBV) infection represents a major environmental risk factor for multiple sclerosis (MS), with evidence of selective expansion of Epstein-Barr Nuclear Antigen-1 (EBNA1)-specific CD4+ T cells that cross-recognize MS-associated myelin antigens in MS patients. HLA-DRB1*15-restricted antigen presentation also appears to determine susceptibility given its role as a dominant risk allele. In this study, we have utilised standard and next-generation sequencing techniques to investigate EBNA-1 sequence variation and its relationship to HLA-DR15 binding affinity, as well as examining potential cross-reactive immune targets within the central nervous system proteome.
METHODS: Sanger sequencing was performed on DNA isolated from peripheral blood samples from 73 Western Australian MS cases, without requirement for primary culture, with additional FLX 454 Roche sequencing in 23 samples to identify low-frequency variants. Patient-derived viral sequences were used to predict HLA-DRB1*1501 epitopes (NetMHCII, NetMHCIIpan) and candidates were evaluated for cross recognition with human brain proteins.
RESULTS: EBNA-1 sequence variation was limited, with no evidence of multiple viral strains and only low levels of variation identified by FLX technology (8.3% nucleotide positions at a 1% cut-off). In silico epitope mapping revealed two known HLA-DRB1*1501-restricted epitopes ('AEG': aa 481-496 and 'MVF': aa 562-577), and two putative epitopes between positions 502-543. We identified potential cross-reactive targets involving a number of major myelin antigens including experimentally confirmed HLA-DRB1*15-restricted epitopes as well as novel candidate antigens within myelin and paranodal assembly proteins that may be relevant to MS pathogenesis.
CONCLUSIONS: This study demonstrates the feasibility of obtaining autologous EBNA-1 sequences directly from buffy coat samples, and confirms divergence of these sequences from standard laboratory strains. This approach has identified a number of immunogenic regions of EBNA-1 as well as known and novel targets for autoreactive HLA-DRB1*15-restricted T cells within the central nervous system that could arise as a result of cross-reactivity with EBNA-1-specific immune responses.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26849221      PMCID: PMC4744032          DOI: 10.1371/journal.pone.0147567

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Epstein-Barr virus (EBV) is the only human-adapted member of the Lymphocryptovirus genus, belonging to a lineage of Old World primate gamma-1 herpesviruses that was transferred to a hominid ancestor approximately twelve million years ago, and which is now responsible for near-universal and lifelong human infection [1,2]. Viral transmission is generally via saliva, with evidence that age of infection is associated with cultural and socioeconomic factors [3]. Uniquely, chronic infection is established within ‘immortalised’ B-lymphocytes that are transformed by an array of viral proteins that functionally mimic host proteins to create long-lived memory cells [4,5]. Viral persistence is then promoted through mechanisms that reduce antigen presentation to the adaptive immune system [6], including the involvement of latency programs that limit viral protein expression to a minimal subset critical for replication; most notably Epstein-Barr Nuclear Antigen-1 (EBNA-1), which maintains host chromosomal attachment of viral episomal DNA thus linking viral and cellular replication cycles [7]. These mechanisms of viral persistence would predict limited viral sequence diversity, in keeping with the relatively slow evolutionary rate of EBV and other gamma-1 herpesviruses [2] and evidence of geographically-defined viral subtypes [8]. Nevertheless, evidence of diversifying selection involving latency genes including EBNA-1 has been identified [9], including preferential variation within human leukocyte antigen (HLA) binding sites (viral epitopes) suggesting that antigen presentation can promote HLA-specific viral escape mutations [10,11]. Thus, EBNA-1 is not immunologically ‘silent’ as once thought [12] but is an antigenic target for both CD4 and CD8 T-cell responses [12-14] as well as antibodies [15], in keeping with finely tuned immune surveillance mechanisms that generally maintain persistent but stable cycles of EBV infection involving both epithelial and B-lymphocyte compartments [5]. Within this paradigm, mechanisms of viral antigen display [13,14] and the general hierarchy of EBV-specific immune responses including regulatory as well as effector T cell responses are being examined [14,16-18]. These have particular relevance to the therapeutic application of EBV-specific T-cell adoptive immunotherapy against EBV-related malignancies including Burkitt’s and Hodgkin’s lymphoma and nasopharyngeal carcinoma [19], now supported by positive findings in clinical trials [20,21]. This strategy is underpinned by knowledge of EBV sequence diversity in tissue samples [9,22,23] and its utilisation to predict viral epitope targets [24]. Our own investigations have focused on multiple sclerosis, an inflammatory demyelinating disease of the central nervous system that often leads to neurodegeneration and long-term disability despite current treatment strategies [25]. While a comprehensive explanation of multiple sclerosis pathogenesis remains incomplete, it is clear that the major component of genetic risk is associated with the HLA-DR locus [26-29], thus implicating HLA-restricted antigen binding and presentation [30], as well as genetic determinants that predominantly relate to T-cell activation [29]. Several lines of evidence link Epstein-Barr virus-specific immunity to multiple sclerosis risk. Both serological [27,31,32] and CD4 T cell responses [33] directed against EBNA-1 have been associated with multiple sclerosis, with further evidence that EBNA-1-specific antibodies differentiate disease-discordant identical twins [34]. Several groups have demonstrated higher EBV seroprevalence in MS patients compared to controls and it has further been demonstrated that EBV infection late in life, in particular if manifested as infectious mononucleosis, increases a person’s MS risk [27, 35, 36]. A recent study has also explored the use of Epstein-Barr virus-specific adoptive immunotherapy for progressive multiple sclerosis, with promising preliminary results [37]. Further observations include that cerebrospinal fluid oligoclonal bands that are a hallmark of MS specifically can target EBNA-1 [38] and one group has additionally identified the presence of EBV-infected B cells within white matter MS lesions at all disease stages [39], although this result has not been replicated in other studies [40]. In this study, we have utilised DNA obtained from buffy coat samples of patients with multiple sclerosis to analyse EBNA-1 sequence variation using both Sanger and FLX ‘next-generation’ sequencing technologies, without any requirement for primary culture techniques or the creation of cell lines through ex vivo EBV transformation. We have then sought to identify potential HLA-DRB1*1501-restricted viral epitopes within the EBNA-1 protein sequence using standard HLA binding algorithms [41], and investigated potential homology with similarly HLA-restricted antigens in a dataset of human central nervous system proteins [42]. The results of this study, which follow from our previous investigations of the contributions made by HLA alleles and Epstein-Barr virus immunity to multiple sclerosis risk [27,28], highlight the divergence of autologous ‘wild-type’ EBNA-1 sequences from those of laboratory strains commonly used for experimental purposes, and suggest possible avenues of investigation that acknowledge both host and viral genetic diversity in higher-resolution analyses of the role of host-pathogenic interactions in autoimmunity [30,43,44].

Materials and Methods

Research participants

A total of 79 study participants in the Perth Demyelinating Disease Database (PDDD) were included in the study. The study protocol was approved by the Sir Charles Gairdner Hospital Human Research Ethics Committee, and written informed consent was obtained from all participants.

DNA extraction

DNA was isolated from buffy coats (stored at -80°C) using an automated robotic setup using Genfind according to the manufacturer’s instructions. Briefly, 100ul of buffy coat were lysed and Proteinase K added to rupture cell membranes and digest protein. DNA was then immobilized on magnetic particles by the addition of a magnetic bead binding reagent. DNA was separated from contaminants using a magnetic field and washing steps. DNA was eluted in 125μl from the magnetic particles. A minority of samples were manually extracted using Qiagen with the provided protocol. 200μl of buffy coat were lysed and Proteinase K added to remove protein and other contaminants. DNA was absorbed on to the silica-gel membrane during centrifugation of columns and then washed twice to ensure complete removal of any residual contaminants. Finally, samples are recovered from the membrane using 200μL elution buffer. Concentration of all eluted samples was determined using Nanodrop and 1 μl of each sample was loaded on a 1% agarose gel to test for presence and integrity of DNA. All samples were stored at 4°C until further use.

HLA typing

PCR and sequencing based HLA genotyping of the MS cohort resolved to at least the 4-digit level was performed as previously described using heterozygous ambiguity resolving primers where applicable [27,28].

EBV amplification

The N- and C-terminal ends excluding the glycine-alanine rich regions of the EBNA-1 gene were amplified using semi-nested PCRs and fully automated setup utilising Biomek FX robots. EBV reference strain B95-8 was extracted from a B95-8 transformed cell line and was diluted and used as a control in each EBV PCR. All PCR reactions were performed using Roche High Fidelity Taq in 25μl reactions with forward and reverse primers at a concentration of 25pmol/μl. All primers used for amplification have been previously published and named according to the position in the reference strain B95-8 [11,45,46], as summarised below: 107754F: TCCGGGCTGCGAGTAATTGG 107881F: GTCTGCACTCCCTGTATTCA 109111F: TCATCATCATCCGGGTCTCCACCGC 108160R: GGACACCATCTCTATGTCTTGGCC 109135R: GCGGTGGAGACCCGGATGATGATGA 109459R: CCCAAGTTCCTTCGTCGGTAGTCC 109759R: CTCCATCGTCAAAGCTGCA 109869R: CTGCCCTTCCTCACCCTCAT 109970R: CAACAGCACGCATGATGTCT For amplification of the N-terminal end the first round primer pair 107754F-109135R (PCR1) resulting in a 1381 base pair fragment (bp) was used. Detailed information about size and location of EBNA-1 PCRs with reference to the EBV strain B95-8 can be found in S1 Fig. Second round amplification was then performed using either 107881F-109135R (PCR2) or 107754F-108160R (PCR3), resulting in 1254bp or 406bp fragments respectively. C-terminal EBNA-1 PCR was performed as described previously [27]. Briefly, first round amplification with the primer pair EBV109111F-EBV109970R (PCR 4) resulted in an 859bp fragment. A semi-nested PCR was followed using the primer combination 109111F-109869R (PCR5) resulting in a final 758 base pair product. Alternatively, shorter semi-nested PCRs were performed using the primer combinations 109111F-109759R (PCR6, 648bp) and 109111F-109459R (PCR7, 348bp) respectively. In some cases PCRs with alternative primers have been performed. For an overview of primer pairs used in each PCR, nucleotide coordinates of primers within the B95-8 reference strain as well as primer melting temperatures, elongation times and product sizessee S1 Table. Successful PCR samples were purified using magnetic particles with AMPure (Beckman Coulter) on Biomek FX robots and stored at 4°C until further use.

Sanger sequencing and analysis

Samples were directly sequenced on an automated 96 capillary ABI 373 DNA Sequencer, followed by purification of sequencing products with magnetic particles using CleanSEQ (Beckman Coulter) on Biomek FX robots. Analysis of electropherograms was performed using the ASSIGN V4.0.1.36 software (Conexio Genomics). Threshold for mixture detection in Sanger sequencing has been established to be ~30%. For construction of the Phylogenetic tree, 53 MS sequences and reference strains B95-8, AG876, GD1 and HKNPC1 covering the majority of nucleotide positions B95-8: 109135–109815 (EBNA-1: 1186–1866) were included. Genetic distance was visualized using the Neighbour-joining method based on the p-distance model with pairwise deletion within the PHYLIP (Phylogeny Inference Package) version 3.695 [47].

454 FLX sequencing

For the 454 FLX sequencing strategy, 24 samples were pooled in a single FLX lane. 20 MS samples were selected based on successful Sanger sequencing for three epitopes of interest (EBNA-1 aa PPP: 401–416, AEG: 482–496 and MVF: 563–577). Three additional MS samples were included that displayed a band of correct size on a 1% agarose gel but were not successfully sequenced by Sanger methods. The B95-8 strain of EBV was used as a control in each EBV PCR and was also selected as a control for FLX sequencing. First round C-terminal EBV PCRs products with the primer combination EBV109111F-EBV109970R (PCR4) were used as templates to generate shorter second round PCRs for the selected deep sequencing samples. For PCR length and location please refer to S1 Fig. This nested PCR was performed with molecular barcoded primers. These tags consisted of eleven nucleotides unique extension to the 5’ end of the second round primers EBV109111F and EBV109869R (PCR5b) resulting in 780bp amplicons. This method facilitates sample multiplexing while also increasing the ability to accurately assign reads back to the sample. Resulting PCR amplicons were pooled at equimolar ratios (3x1011 copies each) to achieve similar number of reads. Standard Library was constructed using 454 Roche Titanium Chemistry protocol. The denatured DNA library was immobilized onto beads and emulsified with the amplification reagents in a water-in-oil mixture and clonally amplified (emPCR). Following emPCR, the capture beads with bound DNA were enriched according to the 454FLX titanium manual and used for pyrosequencing on one lane of an eight lane 454 FLX sequencing run according to the 454 sequencing manual. The reads obtained from the sequencing were separated according to the unique tags and linked back to the original samples, using the NextGENe software package from SoftGenetics, Inc (State College, PA, USA). Further analysis was performed with inhouse software.

FLX data analysis

The NextGENe software package version 2.3.0 (SoftGenetics, Inc, USA) was used to create a consensus sequence present at >45% for each sample based on the B95-8 EBV reference strain (GenBank accession number: V01555.2). For each sample, all reads were aligned to the consensus using a pairwise alignment. The pairwise alignments were combined into a multiple alignment by matching the reference positions for all aligned pairs. The aligned files were used to detect homopolymers, which are known to occur in FLX sequencing as artefacts. During analysis, all homopolymers not present in the respective consensus sequence have been excluded. Minorities present at <1% were not taken into consideration and insertion deletions were also excluded except for a strain specific in-frame insertion of three amino acids (glycine (GGA), aspartic acid (GAT), aspartic acid (GAC)) at position 2367 of EBNA-1 position (109818/109819 of reference strain B95-8) in several samples, which has been described previously [11]. All mutations detected were additionally manually analysed in the raw FLX data file of each sample to exclude any contribution of homopolymer errors. A nucleotide change was considered a real mutation if the mutation could not have been caused by a nucleotide insertion/deletion before or after the mutation. Additionally, mutations were only taken into consideration if they were present in at least three sequences, independent of the total number of individual reads per sample. Mutations within three nucleotides from the beginning or end of a read were excluded. Furthermore, unresolved nucleotide mixtures within reads were few, but indicated low signal quality in this position and were not taken into consideration.

Epitope predictions

HLA binding algorithms (NetMHCII, NetMHCIIpan) were utilized to identify potential HLA-DRB1*1501 class II HLA epitopes within the EBNA-1 Sanger derived sequences. Additionally, predictions were performed for two known HLA-DRB1*1501 class II EBNA-1 epitopes (denoted AEG and MVF) using all FLX consensus and Sanger sequences generated. All predicted 15mer peptides which resulted in HLA-DRB1*1501 strong (<50 nM) and weak binders (50–500 nM) were selected and tested for potential cross-reactivity on a dataset of CNS proteins enriched for axoglial proteins (human protein reference database (HPRD.org) and [42], as well as a selection of brain proteins derived from NCBI (S2 Table). Amongst these potential cross-reactive epitopes, a subset of epitopes sharing the majority of peptide amino acid residues within the epitope core HLA-binding sequence of nine amino acids, were identified.

Statistical analysis

We assessed whether EBNA-1 nucleotide polymorphism at each position was significantly associated with MS risk alleles by grouping alleles according to previous genetic analysis by our group [27]. Samples were categorized as carrying high MS risk, low MS risk or neutral risk alleles, respectively, if they carried at least one risk allele (HLA-DR1*08, *15, *16); at least one protective allele (HLA-DR1*04, *07, *09) and no risk allele; or two neutral risk alleles. Tables of nucleotide frequencies by risk groups were created at each nucleotide position and associations assessed by Fisher exact tests. We assessed clustering based on all 62 C-terminal Sanger sequences using the “partitioning around medoids” (PAM) method [48]. All nucleotide positions demonstrating some nucleotide variation were included and clusters displayed via a plot of the first two principal components [48]. Associations of HLA risk groups with viral clusters found in the cluster analysis were also assessed using Fisher exact tests. Analyses were carried out using TIBCO Spotfire S+ 8.2 (Somerville, MA).

Results

Epstein Barr Nuclear Antigen-1 (EBNA-1) was successfully amplified for bulk (Sanger) sequencing in 76 MS samples from DNA extracted buffy coats without any requirement for primary culture to enrich for EBV episomes. All samples were obtained from participants in the Perth Demyelinating Disease Database (PDDD) with confirmed MS, reflecting a wide range of age and disease severity (Fig 1). Females were more prevalent (73%) than males (27%) in the study population, with a slightly higher age (median: 53) compared to males (median: 49). Within this dataset, 62 samples were successfully sequenced across the EBNA-1 C-terminal region, which is known to contain the majority of MHC Class II-restricted T-cell epitopes [49-51], and a subset of 37 samples was sequenced in the N-terminal EBNA-1 region additionally. Sequences for both the N- and C-terminal end were obtained from 23 samples.
Fig 1

Age and gender distribution of patients recruited from the Perth Demyelinating Disease Database.

Wild-type EBNA-1 sequence variation

The C-terminal region of EBNA-1 demonstrated sequence variation within four major clusters with additional minor variation (Fig 2). In keeping with previous analyses of wild-type EBNA-1 sequences [9,11], wild-type sequences showed strong similarity to the EBV reference strain B95-8 in only a minority of cases (9/53, 16.9%) and none clustered with the type 2 AG876 strain. No nucleotide mixtures were identified by Sanger sequencing methods that would indicate the presence of multiple EBNA-1 populations, either as a result of mutation or superinfection with multiple strains. N-terminal sequence analysis showed high conservation and identified only three different variants: 19 samples demonstrated 100% sequence similarity with the N terminal sequence of the EBV reference strain B95-8, while 12 sequences matched the previously identified type 2 EBV strain AG876 which differs to B95-8 in the positions: Q16E, E18G, D24E, S27G and A85T [52]. Three of these positions (Q16E, E18G, S27G) have previously been described to occur in combination (44). The third variant occurred in seven samples and aligned well with AG876 but contained two mismatches to it: EBNA-1 amino acid positions: V70A), (Q74P.
Fig 2

Phylogenetic tree of C-terminal EBNA-1 Sanger sequences including the reference strains B95-8, AG876, GD1 and HKNPC1.

Phylogenetic tree covering nucleotide positions B95-8: 109135–109815; EBNA-1: 1186–1866; blue triangles: EBV reference strains; red dots: MS cases (n = 53).

Phylogenetic tree of C-terminal EBNA-1 Sanger sequences including the reference strains B95-8, AG876, GD1 and HKNPC1.

Phylogenetic tree covering nucleotide positions B95-8: 109135–109815; EBNA-1: 1186–1866; blue triangles: EBV reference strains; red dots: MS cases (n = 53).

Association of HLA and viral sequence variation

Association analysis of MS risk group with EBNA-1 sequence variation revealed nine positions at which individuals in the high risk group had nucleotide frequencies significantly differing from the other risk groups with p<0.05 –namely positions 1428 (P476), 1460 (A487), 1475, (S492), 1498 (D499), 1690 (M563), 1722 (V574), 1754 (T585), 1782 (R594) and 1785 (V595). Cluster analysis based on the C-terminal Sanger sequences described in Methods revealed two distinct populations (Fig 3), a main cluster (A) of 47 cases and a smaller cluster (B) of 15 cases. Across the nine positions with significant HLA association there were 373 consensus nucleotides and seven non-consensus among the 47 cases in cluster A and just 10 consensus nucleotides and 109 non-consensus in cluster B. Hence at these nine positions the two clusters were almost mutually exclusive.
Fig 3

Principal Component plot (Component 1 vs Component 2) for cluster analysis based on the C-terminal EBNA-1 Sanger sequences revealed two distinct populations of 47 (A) and 15 (B) cases.

Principal Component plot (Component 1 vs Component 2) for cluster analysis based on the C-terminal EBNA-1 Sanger sequences revealed two distinct populations of 47 (A) and 15 (B) cases. When assessed for HLA MS risk group, cluster A contained 5, 9 and 33 individuals classified as MS risk neutral, protective and high, respectively, compared with the smaller cluster (B) which contained 4, 6 and 5 individuals with neutral, protective and high MS risk. The high MS risk group was thus significantly over-represented in the large cluster (p = 0.016).

454 deep sequencing

Additional sequencing of the C-terminal EBNA-1 fragment (EBNA-1 nucleotide positions 1160–1906) was then undertaken utilising FLX technology for 23 samples and a B95-8 control. Among these, 14 samples and the B95-8 control had coverage of all positions between 80–1632 reads, while 10 samples had very low coverage of reads per position (average reads <15) and were not included in subsequent sequence minority analysis. As shown in Table 1, low-level EBNA-1 sequence variation could be detected although only two samples (samples 1 and 5) showed the presence of minor sequence variants at a level of ≥10%. Mixtures occurred at nucleotide positions 1190 (R397) and 1588 (P529) in sample 1 and at 14 different positions in sample 5: 1286 (V429), 1420 (G473), 1421 (S474), 1428 (P476), 1460 (A487), 1475 (S492), 1498 (D499), 1561 (L520), 1572 (T524), 1660 (P553), 1690 (M563), 1722 (V574), 1754 (T585), 1782 (R594) and 1785 (V595). Silent mutations occurred at amino acid positions G473, D499, L520, P529 and P553, whereas the other mutations lead to a mixture of wild type and variant amino acids: R397R/G, V429V/M, S474S/T, P476P/Q, A487A/T, S492S/C, T524/TI, M563M/I, V574V/G, T585T/P, R594R/K and V595V/A. Lowering this threshold to 5% revealed minority variants in 3.6% positions (27/749 nucleotides sequenced); increasing to 5.0% at a 2% cut-off (37/749 nucleotides) and 8.3% at a 1% cut-off (62/749 nucleotides). As noted in Table 1, most of the mutations that defined minority variants were unique to individual samples (42 individual positions) (Table 1).
Table 1

EBNA-1 quasispecies detected with FLX sequencing.

Minority EBV sequence variants at a level of ≥10% were detected in two samples only. Sequence mixtures present at a ≥5% threshold revealed minority variants in 3.6% of investigated nucleotide positions (27/749 nucleotides sequenced,), increasing to 5.0% at a ≥2% cut-off (37/749 nucleotides,) and 8.3% at a ≥1% cut-off (62/749 nucleotides,). Samples 7, 8 and 13 did not have minority species present at ≥1%.

EBNA-1 aa pos397410411422425429431432435435459459460469473474476484487492499507508520524525525528528529533534536539549550553559561563574585594595595
B95-8 aaRGEGGVPGEERRKRGSPGASDVFLTAAIIPLTLLPQPVYMVTRVV
B95-8 codonAGGGGGGAAGGCGGTGTGCCGGGAGAGGAGCGCCGCAAACGTGGTTCCCCGGGTGCTAGTGACGTGTTCCTAACTGCCGCCATTATTCCACTTACATTGCTCCCACAACCGGTCTATATGGTTACAAGGGTGGTG
EBNA-1 pos119012311232126512751286129212961304130513771378137914061420142114281453146014751498152015251561157215741575158315841588159816031608161616481651166016781684169017221754178217851786
variant aaGGQSSMTVstopGHRQSGTQGTCDMLLI$$$TPITSIPQPVYIGPKAA
variant codonGGGGGACAAAGCGATATGACGGTATAGGGGCACCGACAAAGTGGCACCCAGGGCACTTGTGATATGTTACTCATTSSCSSCRYTACTCCGATTACGTCGATCCCGCAGCCAGTATACATTGGTCCAAAGGCGGCT
sample 1, 1%AG         AG                  AG        AT    CT 
sample 1, 2%AG                            AG        AT      
sample 1, 5%AG                            AG               
sample 1, 10%AG                            AG               
sample 2, 1%                         CGCG    AGCT            
sample 3, 1%         AG ACAC                      AG         
sample 3, 2%            AC                                
sample 3, 5%            AC                                
sample 4, 1%                 CT                           
sample 4, 2%                 CT                           
sample 5, 1%  CG  AG        CTATAC AGATCT  ACCT  AG  AC     AG  GTGTACAGCT 
sample 5, 2%  CG  AG        CTATAC AGATCT  ACCT  AG  AC     AG  GTGTACAGCT 
sample 5, 5%  CG  AG        CTATAC AGATCT  ACCT  AG  AC     AG  GTGTACAGCT 
sample 5, 10%     AG        CTATAC AGATCT  ACCT           AG  GTGTACAGCT 
sample 6, 1%                                  AG          
sample 10, 1%              CT                              
sample 11, 1%              CT                              
sample 12, 1% AG   AG          AC AGATCT  ACCT           AG  GTGTACAGCT 
sample 12, 2% AG   AG          AC AGATCT  ACCT           AG  GTGTACAGCT 

Subsequent capital letters indicate both nucleotide mixtures detected. Single capital letters indicate amino acids present. Amino acid positions representing silent mutations have been indicated in bold. EBNA-1 = Epstein-Barr virus nuclear antigen-1, aa = amino acid, pos = position

EBNA-1 quasispecies detected with FLX sequencing.

Minority EBV sequence variants at a level of ≥10% were detected in two samples only. Sequence mixtures present at a ≥5% threshold revealed minority variants in 3.6% of investigated nucleotide positions (27/749 nucleotides sequenced,), increasing to 5.0% at a ≥2% cut-off (37/749 nucleotides,) and 8.3% at a ≥1% cut-off (62/749 nucleotides,). Samples 7, 8 and 13 did not have minority species present at ≥1%. Subsequent capital letters indicate both nucleotide mixtures detected. Single capital letters indicate amino acids present. Amino acid positions representing silent mutations have been indicated in bold. EBNA-1 = Epstein-Barr virus nuclear antigen-1, aa = amino acid, pos = position

Sequence conservation and relevance to HLA-DRB1*15 binding

As shown in Fig 3, EBNA-1 sequences were mapped against the standard reference strain B95-8 in order to illustrate known and putative HLA-DRB1*15-restricted epitopes and their relationship with EBNA-1 sequence conservation at the amino acid level. This approach, which utilises the ConSeq server [53], reflects amino acid conservation in terms of the influence of physicochemical properties of amino acid substitution as well as the frequency of sequence variation. Hence, ‘dips’ in the conservation plot (highlighted in red) reflect sites of variation that are likely to influence protein structure. Above these plots, predicted HLA-DRB1*15 binding sites are denoted along with the core binding regions, derived from NetMHCIIPan analysis [41]. As shown, this approach revealed two known HLA-DRB1*1501 restricted epitopes (AEGLRALLARSHVER (‘AEG’: aa 481–496) and MVFLQTHIFAEVLKD (‘MVF’: aa 562–577), as well as two overlapping putative epitopes covering a region between positions 502–543. In contrast, no HLA-DRB1*15 epitopes were identified within the N-terminal region of EBNA-1. The most frequent sequences at these epitope sites were AEGLRLLARHVER and VFLQTHIFAELKD (differences to B95-8 reference underlined). Variant sequences within these epitope regions are described in Table 2, along with comparisons of HLA-DRB1*15 binding affinity. As noted, there are no wild-type EBNA-1 variants within these known epitopes that would be predicted to abrogate HLA-DRB1*15 binding completely, although further studies will be required to establish if minor variations in binding affinity could influence the nature of the T cell response, noting that a previous study has demonstrated both regulatory and effector EBNA-1-specific CD4+ T cells with identical epitope specificity [18]. Interestingly, eight of the nine polymorphic sites from the cluster analysis fell within our predicted HLA-DRB1 epitopes including two changes in the previously described ‘AEG’ and ‘MVF’ epitopes, respectively.
Table 2

EBNA-1 sequence variation identified with next-generation sequencing technology, and impact on HLA-DRB1*15 binding affinity within known epitopes.

The two most frequent variants for the A) AEG epitope and B) MVF epitope are printed in bold, the most frequent variant is underlined. All epitopes are predicted to be weak binders (affinity 50nM-500nM).

Table 2A                NetMHCIINetMHCIIpan
nt B95-8gcagaaggtttaagagctctcctggctaggagtcacgtagaaaggpeptidecoreaffinity(nM)coreaffinity(nM)
aa B95-8AEGLRALLARSHVERAEGLRALLARSHVERRALLARSHV197LRALLARSH63
 ---------------AEGLRALLARSHVERRALLARSHV197LRALLARSH63
 -----V---------AEGLRVLLARSHVERRVLLARSHV87LRVLLARSH61
 ----K----------AEGLKALLARSHVERKALLARSHV165LKALLARSH88
 -----T----C----AEGLRTLLARCHVERLRTLLARCH267LRTLLARCH91
 -----A----S----AEGLRALLARSHVERRALLARSHV197LRALLARSH63
pos B95-8482483484485486487488489490491492493494495496     
Table 2B                NetMHCIINetMHCIIpan
nt B95-8atggtctttttacaaactcatatatttgctgaggttttgaaggatpeptidecoreaffinity(nM)Coreaffinity(nM)
aa B95-8MVFLQTHIFAEVLKDMVFLQTHIFAEVLKDMVFLQTHIF303VFLQTHIFA257
 ---------------MVFLQTHIFAEVLKDMVFLQTHIF303VFLQTHIFA257
 I----------G---IVFLQTHIFAEGLKDIVFLQTHIF261IVFLQTHIF259
 ------------F--MVFLQTHIFAEVFKDVFLQTHIFA281VFLQTHIFA270
 I----------V---IVFLQTHIFAEVLKDIVFLQTHIF275IVFLQTHIF181
 M----------G---MVFLQTHIFAEGLKDVFLQTHIFA272MVFLQTHIF386
 M----------V---MVFLQTHIFAEVLKDMVFLQTHIF303VFLQTHIFA257
pos B95-8563564565566567568569570571572573574575576577     

nt: nucleotides, aa: amino acid, pos: amino acid position, B95-8: wildtype EBV reference strain

EBNA-1 sequence variation identified with next-generation sequencing technology, and impact on HLA-DRB1*15 binding affinity within known epitopes.

The two most frequent variants for the A) AEG epitope and B) MVF epitope are printed in bold, the most frequent variant is underlined. All epitopes are predicted to be weak binders (affinity 50nM-500nM). nt: nucleotides, aa: amino acid, pos: amino acid position, B95-8: wildtype EBV reference strain

Identification of HLA-DRB1*15 epitopes within brain proteins homologous to EBNA-1

For this analysis we utilised three datasets enriched for CNS proteins [42], to identify candidate cross-reactive proteins that shared a propensity for HLA-DRB1*15 binding as well as homology to natural EBNA-1 sequences at these sites. This analysis was predicated on the hypothesis that (1) EBNA-1-specific T cell immunity reflects a standard model of HLA-restricted binding and antigen presentation, providing a long-term stimulus for T cell responses that could then (2) cross-react with CNS-specific antigens in a manner that requires HLA-restricted presentation but which may be less predictably associated with HLA binding affinity given the constraints of negative selection against high-affinity autoantigens, and the known altered topology of many autoreactive HLA-peptide-TCR interactions [54]. This approach is also informed by the previous demonstration of EBNA-1-specific CD4+ T cells capable of producing pro-inflammatory responses against myelin antigens in a seminal study by Lunemann and colleagues [33]. Thus, we initially selected EBNA-1 epitopes of interest based on HLA binding affinity, and then identified candidate CNS protein epitopes that would be predicted to bind HLA-DRB1*15 with sufficient affinity to allow antigen presentation (strong and weak binders with affinity threshold 500 nM) and which exhibited homology with the EBNA-1 epitope (threshold ≥3 residues within the 9 amino acid core binding region identified by NetMHCII and NetMHCIIPan analysis). Applying this approach to myelin proteins of known interest in MS research in the first instance reviewed in [55], we identified potential cross-reactive responses involving a number of major myelin antigens (Table 3) including experimentally confirmed HLA-DRB1*15-restricted epitopes associated with encephalitogenic T cell responses (asterisked) including 2',3'-cyclic-nucleotide 3'-phosphodiesterase, alpha B crystallin, myelin basic protein and oligodendrocyte-specific protein. Additionally, we identified several novel candidate antigens within glial fabrillary acidic protein, myelin proteolipid protein, neurofilament heavy polypeptide and myelin-oligodendrocyte glycoprotein.
Table 3

Putative HLA-DRB1*15 binders within autologous EBNA-1 peptide sequences and candidate myelin antigens.

Brain PeptidesEBNA-1 PeptidesPeptide Match CountBrain Binding ScoresEBNA-1 Binding ScoresEBNA-1 Protein PositionBrain Protein PositionBrain Protein Accession No(s)
GKLYSLGNGRWMLTLGSKTSLYNLRRGTTLX…XX.X.X…XX66267512370P09543
RGKLYSLGNGRWMLTGGSKTSLYNLRRGVA.X…XX.X.X....41347511369P09543
LYSLGNGRWMLTLAK *KTSLYNLRRGVALAI..XX.X.X....XX.30645514372P09543
SRGKLYSLGNGRWML *SKTSLYNLRRGVALAX…XX.X..X....4190513368P09543
LSPFYLRPPSFLRAP **TSLYNLRRGVALAIP.X…XX....X..X475251444ACP18852
LSPFYLRPPSFLRAP **TSLYNLRRGTALAIP.X…XX....X..X1351651544ACP18852
SPFYLRPPSFLRAPSSLYNLRRGVALAIPQX…XX....X..X.595451545AAB23453
KTKEGVLYVGSKTREWVAGVFVYGGSKTSL.......X.XXXX..54850332Q16143
EKTKEGVLYVGSKTRNWVAGVLVYGGSKTS........X.XXXX.674850231Q16143
TRLSLARMPPPLPTRLRVLLARSHVERTTE.X..XXX......X.4683248535P14136
KLALDIEIATYRKLLNLRRGIGLAIPQCLL.X…X..X....XX176323518356P14136
GKGRGLSLSRFSWGAPQCRITPLSRLPFGM…X…XXX…X.287384528131AAC41944
EFAPVLLLESHCAAA ***GLRVLLARSHVERTTXX......XX....X41012548379AAC41944
PGVLVLLAVLPVLLLEGLRVLLARSHVERT.X..XXXX…X…3018483153Q16653
GPLVALIICYNWLHRGPLRESIVCYFIVFLXXX…X.XX.....26373551219Q16653
GEGKVTLRIRNVRFSAEGLRTLLARCHVXR.XX..XX..X.....8916482106Q16653
DPFYWVSPGVLVLLA QKFENIAXGLRTLLA..X.....X…XXX134202475146Q16653
DPFYWVSPGVLVLLA PKFENIAEGLKLLLA..X.....X…XXX185438475146Q16653
CSAVPVYIYFNTWTTVAGVFVYGGXNTSLY…X.XX…XX…318504169AAA59565
ATYNFAVLKLMGRGTGTWVAGVLVYGGSKT.X....XX…X..X2198501261AAA59565
LLTFMIAATYNFAVLLVMTKPAPTCNIKVTX.....X.X.X..X.7032582254AAA59565
EEITEYRRQLQARTTENIAEGLRVLLARSHX.X.X..X.X.XX..376239479316P12036
EMRGAVLRLGAARGQENIAEGLRLLLARCHX.....XXX..XX..138154479152P12036
GAVLRLGAARGQLRLAEGLRLLLARCHVER…XXX..XX.....6788482155P12036
TRLSFTSVGSITSGYVAGVFVYGGSRTSLY....X…XX.XX.X20671504398P07196
KVVLIKNTLRSLEVLKFENIAEGLRLLLARX…X…XX.X…74336477137P23515
SLEVLNLSSNKLWTVGLRVLLARSHVERTT.X.XX…X....X.62125484147P23515
PGTLINLTNLTHLYLKTSLYNLRRGTALAI…X.XX…X.X..10277514184P23515
ENVSTTLRALAPRLMENIAEGLRALLARSHXX....XXXX..X..3932479187AAC25187
AGVLLILLALCALVAAEGLRTLLARCHVERX..X..XXX.X....38932482123AAC25187
NVSTTLRALAPRLMRNIAEGLRALLARSHVX....XXXX..X…2516480188AAC25187
CKPLVDILILPGYVQ ††IKDLVMIKPAPTCNI.X.XX.X…X....4812057865AAC25187
ELEKAMVALIDVFHQEGLKALLARSHVERTX..XX..X…X…376164833NP_006263
AFVAMVTTACHEFFEVCYFMVFLQTHIFAE....XX....X.X.X4577155876NP_006263
FGAEILKKIPGRVSTNIAEGLKALLARSHV..XX.XX....X…511648091NP_006746
GIRKFAADAVKLERMNLRRGIALAVQQCRL..X…X.XX…X.38732519311NP_006746
PILAVLLFSSLVLSPEGLRVLLARSHVERT..X.XXX..X.X…66848312P25189
ILAVLLFSSLVLSPAGLRVLLARSHVERTT.X.XXX..X.X....651648413P25189
EFAPVLLLESHCAAAEGLRVLLARSHVERTX…XXX..XX....1778483410P20916
VEFAPVLLLESHCAAAEGLRVLLARSHVER.X…XXX..XX…25016482409P20916
PGVLVLLAVLPVLLLEGLRVLLARSHVERT.X..XXXX…X…3018483153Q16653
VLGPLVALIICYNWLNLRRGVALAIPQCRL.X…XXX.X....X14832519217Q16653
GAEIRHVLVTLGEKMGTWVAGVLVYGGSKTX.....XXX..X.X.3588501106P60660
KLRRGDLPFVVPRRMNLRRGIALAVQQCRL.XXXX....X…X.498325191920P35579
AEELRARLTAKKQELAEGLRALLARSHVERXX.XXX.X.....X.466115482899P35579

Underlined is the core of the peptide. Binding score (nM) prediction using NetMHCII and NetMHCIIpan. Binding score <50: strong binder, binding score 50–500 weak binder.

*Known CNP epitope [56]

**Known aB-crystallin epitope [57]

***Known MBP epitope [58]

†Known MOG epitope [59]

††Known OSP epitope [60].

Underlined is the core of the peptide. Binding score (nM) prediction using NetMHCII and NetMHCIIpan. Binding score <50: strong binder, binding score 50–500 weak binder. *Known CNP epitope [56] **Known aB-crystallin epitope [57] ***Known MBP epitope [58] †Known MOG epitope [59] ††Known OSP epitope [60]. We then extended this analysis to a larger set of central nervous system antigens enriched for axoglial proteins that maintain myelinated nerves and nodes of Ranvier critical for saltatory conduction reviewed in [61], noting recent evidence that the axoglial apparatus may be targeted in the earliest phases of multiple sclerosis lesion development [62,63]. As described in Table 4, which presents a subset of results based on optimal EBNA-1 epitope binding and core match values ≥3, this analysis identified a larger set of potentially cross-reactive CNS proteins including neurofascin [62] as well as a number of other proteins involved in actin organisation and paranodal assembly like ankyrins, contactin-associated proteins as well as gelsolin [64].
Table 4

Extended axoglial brain protein dataset with HLA-DRB1*1501 predicted brain epitopes overlapping with predicted EBV binders.

Brain PeptidesEBNA-1 PeptidesPeptide Match CountBrain Binding ScoresEBNA-1 Binding ScoresEBNA-1 Protein PositionBrain Protein PositionBrain Protein Accession No(s)Brain Protein Genbank Description(s)
TGQFVYCGKKAQLNIAGVFVYGGSKTSLYN.X.XXX.X.X..X..203850584P6291760S ribosomal protein L8
GQFVYCGKKAQLNIGGVFVYGGSKTSLYNLX.XXX.X.X..X…237850685P62917 
QFVYCGKKAQLNIGNVFVYGGSKTSLYNLR.XXX.X.X..X....3021650786P62917 
GDRGKLARASGNYATGLRTLLARCHVERTTX.X..XXX......X32932484121P62917 
REEIHEYRRQLQARTFENIAEGLRVLLARS.X.X.X..X.X.XX.23132478309Q16352Alpha-internexin
EEIHEYRRQLQARTIENIAEGLRTLLARCHX.X.X..X.X.XX..18932479310Q16352 
VAELLATLQASSQAAIAEGLRTLLARSHVE.XX.X.XX.X.X…348280480235Q16352 
VASVLLEAGAAHSLAVAGVLVYGGSKTSLYXX.XX…X…XX.14022504545Q01484Ankyrin-2
DVASVLLEAGAAHSLWVAGVLVYGGSKTSL.XX.XX…X…XX17829503544Q01484 
ELLLERGAPLLARTKENIAEGLRPLLARCHX…X…XXXXX..112408479316Q01484 
RITCRLVKPQKLSTPRRGIGLAIPQCLLTPX....X..XX.X.XX227369520948AAA51732 
DIVKLLLPRGGSPHSNWVAGVLVYGGSKTS..X…X..XXX..X7448502583AAA51732 
GAYVKLLSKTPELNLGVLVYGGSKTSLYNLX..X…XXX…XX28330506162P27824Calnexin
TAEAIKALGAKHCVKIAEGLKALLARSHVE.XX..XXX.X…X.6916481209P30042ES1 protein homolog, mitochondrial
AEAIKALGAKHCVKEAEGLKALLARSHVERXX..XXX.X…X..7316482210P30042 
LSGESLGHLRSLGALGSKTSLYNLRRGVAL.X..XX..XX…XX40616512189P0C6S8Leucine-rich repeat and immunoglobulin-like
SGESLGHLRSLGALRSKTSLYNLRRGVALAX..XX..XX…XX.5416513190P0C6S8 
LSGESLGHLRSLGALGSKTSLYNLRRGIAL.X..XX..XX…XX40616512189P0C6S8 
AFLGLRQIRLLNLSNVALAIPQCRLTPLSR..X…X.XX..XX.3432524313P0C6S8 
LGLRQIRLLNLSNNLLAVQQCRLTPLSRLPX…X.XX..XX…2632526315P0C6S8 
AGVLLILLALCALVAAEGLRTLLARCHVXRX..X..XXX.X....38916482123NP_005593claudin-11 isoform 1
VLLILLALCALVATIGLRTLLARCHVERTT.X..XXX.X....X.34932484125NP_005593 
LYCIYVAIGQKRSTVRRGIALAIPQCRLTP…X..XX.X.X.X.12432521242P25705ATP synthase subunit alpha, mitochondrial
SPGWLADGSVRYPIVQPGPLRESIVCYFIV.XX.X....X.X.XX398361550302Q96GW7Brevican core protein
LLGRWKALLIPPSSPNLRRGIALAIPQCXL.X.X..XX.XX....7832519893Q96GW7 
EDSLECLRAMLSANIENIAEGLRALLARSHX…X.XXX.X....10532479661Q00610Clathrin heavy chain 1
SLECLRAMLSANIRQSLYNLRRGISLAIPQXX..XX…X..X.X3877516663Q00610 
GVLLILLALCALVATEGLRTLLARCHVERT..X..XXX.X....X38732483124O75508Claudin-11
VLLILLALCALVATIGLRTLLARCHVERTT.X..XXX.X....X.34932484125O75508 
ENLIVPGGVKTIEANAGVFVYGGSKTSLYN....X.XX.XX…X318850547Q14194Dihydropyrimidinase-related protein 1
DNLIVPGGVKTIEANAGVLVYGGSKTSLYN....X.XX.XX…X3512450547Q14195 
ELRREISYAIKNIHGNLRRGIALAIPQCXL.XXX.X..XX.....7432519383Q05193Dynamin-1
KELRREISYAIKNIHYNLRRGISLAIPQCR..XXX.XX.XX....116237518382Q05193 
DEKELRREISYAIKNSLYNLRRGISLAIPQ....XXX.XX.XX..24677516380Q05193 
EDLRRGLVMVKPGSIKDAIKDLVMTKPAPT.X....XXX.XX…16232576332P49411Elongation factor Tu, mitochondrial
DLRRGLVMVKPGSIKDGIKDLVMTKPAPTCX....XXX.XX....9332577333P49411 
IVFRGEHGFIGCRKVIVFLQTHIFAEGLKXXXX…X.X....X.23316563386Q16658Fascin
IVFRGEHGFIGCRKVIVFLQTHIFXEGLKDXXX…X.X....X.23332563386Q16658 
TGAQELLRVLRAQPVENIAEGLRVLLARSH....X.XXXX.X…5232479616P06396Gelsolin
GDSYIILYNYRHGGRGGSKTSLYNLRRGVAX.X…XXX.X.X..11347511471P06396 
KVKAHGKKVLGAFSDENIAEGLRVLLARSH…X.X..XX.X.X.4983247960P68871Hemoglobin subunit beta
AFLGLRQIRLLNLSNVALAIPQCRLTPLSR..X…X.XX..XX.3432524313P0C6S8Leucine-rich repeat and immunoglobulin-like
LGLRQIRLLNLSNNLLAVQQCRLTPLSRLPX…X.XX..XX…2632526315P0C6S8 
YSWGMAVNVYSTSITGNWVAGVLVYGGSKT..X…X.XX..X.X239850140Q15555Microtubule-associated protein RP/EB family member 2
WGMAVNVYSTSITQEWVAGVFVYGGSKTSLX…X.XX..X.X..252850342Q15555 
LPRWQLALAVGAPLLNLRRGIALAVQQCRL..X…XXXX....X973251936O94826Mitochondrial import receptor subunit TOM70
IFQKLMFKNAPTPQEIKDLVMTKPAPTCNIX....X.X.XXX…9016579114P13591Neural cell adhesion molecule 1
VKIFQKLMFKNAPTPDAIKDLVMTKPAPTC..X....X.X.XXX.4032577112P13591 
SLLVTRLQKALGVRQSLYNLRRGIALAVQQXX…X…XX.X.X121575165Q99798Aconitate hydratase, mitochondrial
SLLVTRLQKALGVRQSLYNLRRGIALAVSQXX…X…XX.X.X121585165Q99798 
SLLVTRLQKALGVRQSLYNLRRGIALAVPQXX…X…XX.X.X121725165Q99798Aconitate hydratase, mitochondrial
NKGIGLAIVRDLCRLRRGIGLAIPQCLLTP..XXXXXX…X…43636952114P16152Carbonyl reductase NADPH 1
RRGRLAVSFRFRTWDLRALLARSHVERTTD.X..XX.X…XX.X77407485379P78357Contactin-associated protein 1
LGAALRRCAVAATTRLRLLLARCHVERTTEX…X.XX.X..XX.4012774852P20674Cytochrome c oxidase subunit 5A, mitochondrial
LGAALRRCAVAATTRLRTLLXRCHVERTTXX…X.XX.X..XX.4012484842P20674 
GTRLSLARMPPPLPTGLRLLLARCHVERTTX.XX.XXX......X32613948434P14136Glial fibrillary acidic protein
VVFFNVPEKLRLPDAQKFENIAEGLRLLLA..X.X..X.XXX..X416382476134P78559Microtubule-associated protein 1A
TAYARLRGIEQAVQSSLYNLRRGIALAVQQ..X…XXX..XXX.48257516541Q16891Mitochondrial inner membrane protein
NTAYARLRGIEQAVQTSLYNLRRGIALAVQ…X…XXX..XXX34963515540Q16891 
FTLKVLTTRGVAERTEGLRVLLARSHVERT..X.XX..X…XXX209113483229O94856Neurofascin
TLKVLTTRGVAERTPGLKLLLARSHVERTT.XX.X..X…XXX.36896484230O94856 
FTLKVLTTRGVAERTEGLKLLLARSHVERT..XX.X..X…XXX20987483229O94856 
PTEIIAFSNRAEDFRKFENIADSLRALLAR..X.XX.X.XX…X8343647752P32119Peroxiredoxin-2
EIIAFSNRAEDFRKLENIADSLRALLARSHX.XX.X.XX…X..23319447954P32119 
GSVILLENLRFHVEEGSKTSLYNLRRGVALXX…X.XXX..X..33119512114P00558Phosphoglycerate kinase 1
AGSVILLENLRFHVEGGSKTSLYNLRRGVA.XX…X.XXX..X.26347511113P00558 
ASLQRVRRPVAMVMPTSLYNLRRGVALAIP.XX…XX.XX…X2685251595Q15149Plectin
LKKGLLSAEVARLLLSQKFENIAEGLRLLL..X....XX..XXXX704654753847Q15149 
VRVFRIFKLSRHSKGVPQCRITPLSRLPFGX…XX..XXX…X13475528299P16389Potassium voltage-gated channel subfamily A member 2
VRVFRIFKLSRHSKGVSQCRITPLSRLPFGX…XX..XXX…X13427528299P16389 
LFPGVALLLAAARLALRRGVALAIPQCRLTX..XXXX.....XX.1314135208P30101Protein disulfide-isomerase A3
ALFPGVALLLAAARLNLRRGVALAIPQCRL.X..XXXX.....XX2423335197P30101 
PAIRLLYAKRPGIGLGSRTSLYNLRRGIGL.....XX..X.XXXX40250512190O75061Putative tyrosine-protein phosphatase auxilin
RLLYAKRPGIGLSPSTSLYNLRRGIALAVS..XX..X.XX.X..X46664515193O75061 
SFASDPILYRPVAVASKTSLYNLRRGVALAX..X…X.X.XX.X2539051397P14618Pyruvate kinase PKM
VEGSFVYKGGKIYKVVAGVFVYGGSKTSLYX.X.XXX.X.X....120100504105P31150Rab GDP dissociation inhibitor alpha
ARKKKLLEAQSHFRKAEGLKLLLARSHVERX…XXX.X.XX…212864822074Q13813Spectrin alpha chain, non-erythrocytic 1
TTCIELGKSLLARKHENIAEGLKLLLARSH....X..X.XXXX.X622544791968Q01082 

Underlined is the core of the peptide. Binding score (nM) prediction using NetMHCII and NetMHCIIpan. Binding score <50: strong binder, binding score 50–500 weak binder.

Underlined is the core of the peptide. Binding score (nM) prediction using NetMHCII and NetMHCIIpan. Binding score <50: strong binder, binding score 50–500 weak binder.

Discussion

In this study we have proven the feasibility of obtaining EBNA-1 sequences directly from buffy coat samples, without any requirement for primary cultures that could theoretically be associated with preferential selection of viral sequence variants through ex vivo expansion. In this regard our findings are in keeping with those of Burrows and colleagues [11], who demonstrated similar patterns of EBNA-1 sequence variation predominantly within the C-terminal region in both MS cases and controls, in a study that did involve primary B lymphocyte cultures. Both studies, as well as a more recent analysis of spontaneously outgrown human lymphoblastoid cell lines [65] are in agreement in demonstrating that the majority of autologous sequences do not align closely with the widely used B95-8 laboratory strain–a result that is perhaps not surprising given that this strain was originally identified following transfusion-associated EBV in an elderly woman and subsequently selected for its ability to efficiently immortalise B lymphocytes [66]. Our results are also in agreement with other studies that have identified similar patterns of EBNA-1 [67] and EBNA-2 [68] sequence variation when comparing MS cases and controls, albeit at low resolution in these cases, suggesting that MS susceptibility is not likely to be readily explained by an ‘encephalitogenic strain’ of EBV. We have also explored sequence variation within individual samples using next-generation sequencing techniques, to investigate if the presence of multiple viral sequence variants could indicate sites of immune selection pressure–as has been suggested previously for EBNA-1 [11]–and/or that infection with multiple EBV strains could represent a risk factor for MS disease as has been previously proposed [69]. We identified nucleotide mixtures present at 10 percent in two samples. Mixtures were primarily caused by point mutations leading to amino acid changes in 12 different positions compared to silent mutations in five positions only. This could indicate EBV superinfection or viral immune escape. While we were able to identify low-level EBNA-1 sequence variation in these samples (involving 8.3% of nucleotides at a 1% threshold), our results do not support a strong influence of intraindividual EBV sequence variation in MS disease risk and we cannot exclude that some of these point mutations are due to technical artefacts. It is however interesting to note that EBNA-1 sequence conservation described in Fig 4 (reflecting genetic variation as well as the impact this has on amino acid properties), does appear to map to HLA-DRB1 binding regions, although we were unable to identify natural sequence variants that were associated with abrogation of HLA binding using the NetMHCIIPan prediction algorithm. However, in the HLA-viral sequence variation association analyses, we could identify eight out of nine EBNA-1 polymorphic nucleotide positions significantly associated with MS risk alleles within these HLA-DRB1 binding regions, including two in the previously described HLA-DRB1*15 ‘AEG’ and ‘MVF’ epitopes respectively, noting in each case that the more common (wild-type) viral sequence was favoured in the presence of disease-associated HLA-DR alleles. These differences will be explored further in terms of their impact on epitope-specific CD4+ T-cell immune responses, acknowledging in relation to MS pathogenesis that important differences may relate to the selection of regulatory versus effector EBNA-1-specific CD4+ T cells [18], rather than simply reflecting immune evasion.
Fig 4

EBNA-1 sequence conservation and relationship to NetMHCII predicted HLA-DRB1*DR15-restricted epitopes derived from patient MS sequences.

Shown are EBNA-1 amino acid positions 473–593. Predicted epitopes: blue boxes, known epitopes: black underlined, predicted epitope core: grey shade.

EBNA-1 sequence conservation and relationship to NetMHCII predicted HLA-DRB1*DR15-restricted epitopes derived from patient MS sequences.

Shown are EBNA-1 amino acid positions 473–593. Predicted epitopes: blue boxes, known epitopes: black underlined, predicted epitope core: grey shade. We have also explored the potential for autoantigens to be selected by cross-reactive EBNA-1-specific T cells, according to a shared propensity for HLA-DRB1*15-restricted antigen presentation as well as evidence of sequence homology. This concept is in keeping with previous experimentally-proven examples of this phenomenon (albeit without identification of specific epitopes involved) [35], with additional support from observations that HLA-DRB1*15-restricted immune responses are characterised by a relatively high level of TCR degeneracy that would favour cross-reactivity [70]. These results are preliminary and require experimental confirmation of their functional validity.The main purpose of this analysis was to create a platform for experimental design that acknowledges natural patient derived EBNA-1 sequence variants as the basis for epitope selection, while also expanding the possibilities of identifying novel candidate CNS antigens that may have a role in MS pathogenesis. As noted by Ben-Nun and colleagues [55] and Lassmann and colleagues [71], MS research is increasingly moving away from reductionist experimental models towards an interest in a wide array of myelin and axoglial antigen targets, which would be in keeping with a model of MS pathogenesis that allows for cross-reactive T cell (as well as humoral) responses that are initially driven by viral-specific responses–with EBNA-1 representing a legitimate candidate target based on previous work [27, 31–39]. These observations, along with continuing evidence of patient-specific heterogeneity of MS lesion pathology [72] and oligoclonal TCR repertoire [73] would support a model of MS disease pathogenesis in which virus-specific immunity, which is oligoclonal in nature as determined by viral sequence variation seen in this and other studies [9-11] as well as by polymorphic HLA-restricted antigen presentation, could then trigger cross-reactive autoimmune responses. We now hope to investigate these possibilities further, with a particular focus on the roles of both antigen-presenting B cells as well as antigen-specific T cells in provoking inflammatory immune responses. In this respect, we would anticipate that targeted T-cell immunotherapy is likely to require a patient-specific approach as recently performed by Pender and colleagues [37], while targeting EBV-infected B cells may have the potential to provide a more universal treatment strategy, particularly in light of recent evidence that antigen-experienced B cells within the central nervous system in MS cases are likely to be derived from the peripheral blood and lymph nodes [74,75].

Location of EBNA-1 PCR primers given with reference to nucleotide position of the B95-8 strain and PCR size given as base pairs (bp).

Start and stop indicate the EBNA-1 gene. Purple indicates position of known epitopes. (TIF) Click here for additional data file.

Primers used for EBV amplification.

(DOC) Click here for additional data file.

Human brain proteins included in the analysis from NCBI database.

(DOC) Click here for additional data file.

Sanger sequence Genbank Accession numbers.

(DOC) Click here for additional data file.
  72 in total

1.  Widespread sequence variation in Epstein-Barr virus nuclear antigen 1 influences the antiviral T cell response.

Authors:  Melissa J Bell; Rebekah Brennan; John J Miles; Denis J Moss; Jacqueline M Burrows; Scott R Burrows
Journal:  J Infect Dis       Date:  2008-06-01       Impact factor: 5.226

2.  Localization and characterization of gelsolin in nervous tissues: gelsolin is specifically enriched in myelin-forming cells.

Authors:  J Tanaka; K Sobue
Journal:  J Neurosci       Date:  1994-03       Impact factor: 6.167

3.  Epitope distribution in ordered and disordered protein regions. Part B - Ordered regions and disordered binding sites are targets of T- and B-cell immunity.

Authors:  Mirjana D Pavlović; Davorka R Jandrlić; Nenad S Mitić
Journal:  J Immunol Methods       Date:  2014-04-12       Impact factor: 2.303

4.  B cells populating the multiple sclerosis brain mature in the draining cervical lymph nodes.

Authors:  Joel N H Stern; Gur Yaari; Jason A Vander Heiden; George Church; William F Donahue; Rogier Q Hintzen; Anita J Huttner; Jon D Laman; Rashed M Nagra; Alyssa Nylander; David Pitt; Sriram Ramanan; Bilal A Siddiqui; Francois Vigneault; Steven H Kleinstein; David A Hafler; Kevin C O'Connor
Journal:  Sci Transl Med       Date:  2014-08-06       Impact factor: 17.956

5.  Multiple sclerosis.

Authors:  Alastair Compston; Alasdair Coles
Journal:  Lancet       Date:  2008-10-25       Impact factor: 79.321

6.  Genome-wide analysis of wild-type Epstein-Barr virus genomes derived from healthy individuals of the 1,000 Genomes Project.

Authors:  Gabriel Santpere; Fleur Darre; Soledad Blanco; Antonio Alcami; Pablo Villoslada; M Mar Albà; Arcadi Navarro
Journal:  Genome Biol Evol       Date:  2014-04       Impact factor: 3.416

7.  Structural and dynamical insights on HLA-DR2 complexes that confer susceptibility to multiple sclerosis in Sardinia: a molecular dynamics simulation study.

Authors:  Amit Kumar; Eleonora Cocco; Luigi Atzori; Maria Giovanna Marrosu; Enrico Pieroni
Journal:  PLoS One       Date:  2013-03-26       Impact factor: 3.240

8.  Structural requirements for binding of an immunodominant myelin basic protein peptide to DR2 isotypes and for its recognition by human T cell clones.

Authors:  K W Wucherpfennig; A Sette; S Southwood; C Oseroff; M Matsui; J L Strominger; D A Hafler
Journal:  J Exp Med       Date:  1994-01-01       Impact factor: 14.307

9.  Fine-mapping the genetic association of the major histocompatibility complex in multiple sclerosis: HLA and non-HLA effects.

Authors:  Nikolaos A Patsopoulos; Lisa F Barcellos; Rogier Q Hintzen; Catherine Schaefer; Cornelia M van Duijn; Janelle A Noble; Towfique Raj; Pierre-Antoine Gourraud; Barbara E Stranger; Jorge Oksenberg; Tomas Olsson; Bruce V Taylor; Stephen Sawcer; David A Hafler; Mary Carrington; Philip L De Jager; Paul I W de Bakker
Journal:  PLoS Genet       Date:  2013-11-21       Impact factor: 5.917

10.  Epstein-Barr virus-specific adoptive immunotherapy for progressive multiple sclerosis.

Authors:  Michael P Pender; Peter A Csurhes; Corey Smith; Leone Beagley; Kaye D Hooper; Meenakshi Raj; Alan Coulthard; Scott R Burrows; Rajiv Khanna
Journal:  Mult Scler       Date:  2014-02-03       Impact factor: 6.312

View more
  4 in total

Review 1.  Tolerance regeneration by T regulatory cells in autologous haematopoietic stem cell transplantation for autoimmune diseases.

Authors:  Kevin Hendrawan; Malini Visweswaran; David D F Ma; John J Moore
Journal:  Bone Marrow Transplant       Date:  2019-10-16       Impact factor: 5.483

Review 2.  The Multiple Sclerosis (MS) Genetic Risk Factors Indicate both Acquired and Innate Immune Cell Subsets Contribute to MS Pathogenesis and Identify Novel Therapeutic Opportunities.

Authors:  Grant P Parnell; David R Booth
Journal:  Front Immunol       Date:  2017-04-18       Impact factor: 7.561

Review 3.  Viruses and endogenous retroviruses in multiple sclerosis: From correlation to causation.

Authors:  A-F A Mentis; E Dardiotis; N Grigoriadis; E Petinaki; G M Hadjigeorgiou
Journal:  Acta Neurol Scand       Date:  2017-05-23       Impact factor: 3.209

4.  Epstein-Barr Functional Mimicry: Pathogenicity of Oncogenic Latent Membrane Protein-1 in Systemic Lupus Erythematosus and Autoimmunity.

Authors:  Melissa E Munroe; Jourdan R Anderson; Timothy F Gross; Laura L Stunz; Gail A Bishop; Judith A James
Journal:  Front Immunol       Date:  2021-02-03       Impact factor: 7.561

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.