| Literature DB >> 36207326 |
Mariliis Jaago1,2, Annika Rähni1,2, Nadežda Pupina1, Arno Pihlak1, Helle Sadam1,2, Jürgen Tuvikene1,2,3, Annela Avarlaid2, Anu Planken4, Margus Planken4, Liina Haring5, Eero Vasar6,7, Miljana Baćević8, France Lambert9, Eija Kalso10,11, Pirkko Pussinen12, Pentti J Tienari13, Antti Vaheri14, Dan Lindholm15,16, Tõnis Timmusk1,2, Amir M Ghaemmaghami17, Kaia Palm18.
Abstract
Immunity to previously encountered viruses can alter response to unrelated pathogens. We reasoned that similar mechanism may also involve SARS-CoV-2 and thereby affect the specificity and the quality of the immune response against the virus. Here, we employed high-throughput next generation phage display method to explore the link between antibody immune response to previously encountered antigens and spike (S) glycoprotein. By profiling the antibody response in COVID-19 naïve individuals with a diverse clinical history (including cardiovascular, neurological, or oncological diseases), we identified 15 highly antigenic epitopes on spike protein that showed cross-reactivity with antigens of seasonal, persistent, latent or chronic infections from common human viruses. We observed varying degrees of cross-reactivity of different viral antigens with S in an epitope-specific manner. The data show that pre-existing SARS-CoV-2 S1 and S2 cross-reactive serum antibody is readily detectable in pre-pandemic cohort. In the severe COVID-19 cases, we found differential antibody response to the 15 defined antigenic and cross-reactive epitopes on spike. We also noted that despite the high mutation rates of Omicron (B.1.1.529) variants of SARS-CoV-2, some of the epitopes overlapped with the described mutations. Finally, we propose that the resolved epitopes on spike if targeted by re-called antibody response from SARS-CoV-2 infections or vaccinations can function in chronically ill COVID-19 naïve/unvaccinated individuals as immunogenic targets to boost antibodies augmenting the chronic conditions. Understanding the relationships between prior antigen exposure at the antibody epitope level and the immune response to subsequent infections with viruses from a different strain is paramount to guiding strategies to exit the COVID-19 pandemic.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36207326 PMCID: PMC9540097 DOI: 10.1038/s41598-022-20849-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Descriptions of clinical cohorts of COVID-19 naïve individuals.
| No | Cohort (abbreviation) | Study group | Group size (n) | Gender | Age (mean ± standard deviation (SD)), years | Origin | Case or ctrl | Previous studies |
|---|---|---|---|---|---|---|---|---|
| 1 | CAD | No-CAD | 32 | 17M/15 F | 60.2 ± 8.3 | Finland | Ctrl | [ |
| Stable-CAD | 32 | 26M/6F | 64.3 ± 8.4 | Finland | Case | [ | ||
| Acute coronary syndrome | 32 | 24M/8F | 61.3 ± 8.2 | Finland | Case | [ | ||
| 2 | MI | HC | 61 | 25M/36F | 46.2 ± 15.5 | Estonia | Ctrl | [ |
| MI | 50 | 29M/13F/10 not available (NA) | 66.8 ± 12.6 | Estonia | Case | [ | ||
| 3 | T2D and foot ulcer | T2D | 25 | 21M/4F | 69.0 ± 8.7 | Belgium | Case | None |
| 4 | MS | MS | 20 | 4M/16F | 32.3 ± 7.8 | Finland | Case | [ |
| 5 | BC | BC | 57 | 0M/57F | 55.7 ± 7.2 | Finland | Case | [ |
| 6 | FEP | HC | 30 | 15M/15F | 24.0 ± 6.1 | Estonia | Ctrl | [ |
| FEP | 30 | 16M/14F | 25.6 ± 4.9 | Estonia | Case | [ | ||
| 7 | SZ | HC | 30 | 12M/18F | 42.1 ± 18.2 | Finland | Ctrl | [ |
| SZ | 30 | 12M/18F | 41.9 ± 18.4 | Finland | Case | [ | ||
| 8 | Donors | HC | 109 | 64M/45F | 40.6 ± 11.6 | Estonia | Ctrl | [ |
Figure 1MVA-defined epitopes on SARS-CoV-2 spike protein. Alignment profiles of the most abundant and common immune response features from MVA immunoprofiling data of COVID-19 naïve subjects (n = 538), including altogether 22,949 unique epitopes characterising core motif sequences (hypergeometric p-value < 10–3 for core epitope recognition) on the primary sequence of SARS-CoV-2 S protein (UniProtKB code: P0DTC2). The abundance of aligned core epitope sequences (black Specific, primary y-axis) in peaks was significantly higher (**p < 0.01, ***p < 0.001) compared to random alignment (light gray random, primary y-axis). Of aligned core epitopes, 111 were with exact matches in all amino acid positions (blue specific, secondary y-axis). Regions of primary sequence with alignment load of > 2 motifs (the calculated theoretical random) (gray random line, secondary y-axis) were considered as potentially immunogenic and included in further analysis. Alignment profiles were visualised as centred moving averages across 9 amino acids. 15 epitopes (designated S1.1….10 on subunit 1 of S (S1) and S2.1….5 on subunit 2 of S (S2)) from MVA data analysis (predicted) were defined on SARS-CoV-2 S protein (x-axis) with representative consensus sequences shown. Common mutations observed in emerging SARS-CoV-2 variants were mapped to MVA-identified S epitopes and highlighted with red labels. Most common variants with enhanced transmissibility of SARS-CoV-2 variants with L452R, E484K/Q, and H655Y mutations[99,109,123] encompass epitopes in S1.5, S1.6, or S1.10 respectively. Primary y-axis—abundance of specific- vs random-aligned sequences with ≥ 4 matching positions (includes also exactly-matching sequences). Secondary y-axis—abundance of aligned motif sequences with all positions matching. SARS-CoV-2 S protein domains are adapted from Wrapp et al.[54] with additional information on RBD from Yuan et al.[53]. SS signal peptide 1–12; NTD N-terminal domain (13–303); RBD receptor binding domain (319–542); S1/S2 S1 subunit end and S2 start site (683–686); S2’ S2’ protease cleavage site (815–816); FP fusion peptide (816–833); HR1 heptad repeat 1 (908–985); CH central helix (986–1035); CD connector domain (1076–1141); HR2 heptad repeat 2 (1163–1202); TM transmembrane domain (1214–1234).
The epitopes identified by MVA on SARS-CoV-2 spike glycoprotein are validated by using external data showing partial overlap with antigenic domains reported for COVID-19 naïve and diseased and with neutralising epitopes.
| Epitope identification (a) | Amino acid position (b) | Sequence (c) | Representative epitope (d) | Data on antigenic regions from other studies (e) | S mutants of variants of SARS-CoV-2 (f) | |||
|---|---|---|---|---|---|---|---|---|
| Bio-informatic predictions in COVID-19 naïve | Serostudies | IEBD: B cell neutralising antibody response | ||||||
| COVID-19 naïve | COVID-19 diseased | |||||||
| S1.1 | 26–34 | PAYTNSFTR | NSF.R | P26 | P26S[ | |||
| S1.2 | 47–58 | VLHSTQDLFLPF | V..S..D…P | – | Q52R[ | |||
| S1.3 | 170–185 | YVSQPFLMDLEGKQGN | L..K.GN | 178–185 | ||||
| S1.4 | 384–390 | PTKLNDL | PTKL..L | 384–390 | ||||
| S1.5 | 445–471 | VGGNYNYLYRLFRKSNL-KPFERDISTE | K….DI.T | 469–483[ | 445–471 | G446S L452M L452Q L452R L452R/Q*[ G446S | ||
| S1.6 | 481–495 | NGVEGFNCYFPLQSY | N.VE.F | 469–483[ | 481–495 | E484K/Q*[ E484A F486V F490S[ Q493/K | ||
| S1.7 | 514–523 | SFELLHAPAT | S…LH…T | 514–523 | ||||
| S1.8 | 570–582 | ADTTDAVRDPQTL | PQTL | 550–570[ | 553–570[ 550–570[ 532–588[ 560–616[ | A570 | A570D[ | |
| S1.9 | 599–612 | TPGT | GTN.S | 592–620[ | 588–644[ 560–616[ | – | ||
| S1.10 | 650–660 | LIGAEHV | L..A…..SY | 652–661[ | 655–672[ 616–672[ | H655 | H655Y[ N658S | |
| S2.1 | 757–768 | GSFCTQLNRALT | T.LNR | 757–769[ | 766–785[ | – | N764K | |
| S2.2 | 804–815 | QILPDPSKPSKR | I.P…KP | 810–816[ 785–805[ 810–830[ | 809–826[ 787–822[ 810–816[ 785–805[ 810–830[ 812–868[ | – | ||
| S2.3 | 858–869 | LTVLPPLLTDEM | V.P.L…E | 867–880[ | 812–868[ | – | T859N[ | |
| S2.4 | 937–944 | SLSSTASA | SL.S…A | 915–946[ | – | |||
| S2.5 | 1151–1161 | ELDKYFK | EL….K…S | 1157–1164[ | 1146–1166[ | 1147–1158[ 1146–1166[ | 1148–1158 | |
The following information is given in columns: unique identification (a), amino acid position (b), amino acid sequence with glycosylation patterns bolded and underlined (c), representative epitope consensus sequence (d), immunogenic regions with amino acid positions on S indicated from other informatic and seroreactivity studies and from IEBD data on B cell neutralising antibody response (see Table S5) (e), frequent mutations described in new variants of being monitored (VBM) and of concern (VOC) of SARS-CoV-2 S[122], where the highly mutated Omicron (B.1.1.529) sublineages (designated with “O”) that have enhanced transmissibility and mutations that show differential (often escape from) neutralising antibody response (marked with “*”) are shown in (f) as A1, A2, A3, A4, A5—Omicron sublineages, BA1, BA2, BA3, BA4 and BA4 respectively.[2,71,99,109,122–126].
N neutralising/protective antibodies shown, **, high IgG titre was associated with poor clinical outcome (development of pneumonia, needing care in the intensive unit or needing assisted pulmonary ventilation).
Figure 215 SARS-CoV-2 S protein epitopes mimic common viral protein antigens and self-proteins implicated in normal development and disease. Pathogen proteins and human proteome were accessed from UniProtKB and aligned with 13,500 most abundant IgG-bound 12-mer peptides containing one representative SARS-CoV-2 S protein epitope (from S1.1 to S2.5) using standalone BLAST with alignment criteria customised to short sequences. (A,B) Viral and human proteins mimicked by SARS-CoV-2 S protein epitopes are depicted on violin plots and were identified using blastp alignment analysis at E-value ≤ 0.05 (except for visualising SARS-CoV, SARS-CoV-2, HKU1, and OC43 alignments where E-value was not restricted). Each dot represents the relative abundance of IgG response to a peptide in one sample from the cohort of SARS-CoV-2 naïve subjects (n = 538). See Tables S3 and S4 for detailed information on pathogen and human protein alignments, full protein names and accession codes.
Figure 3Differential antibody response to SARS-CoV-2 spike protein in COVID-19 diseased and naïve individuals. (A) Individual MVA immunoprofiles of antibody response to 15 epitopes of SARS-CoV-2 S in COVID-19 naïve samples shown as a ratio of specific vs random alignments. Ratios are visualised as centred moving averages across 9 amino acids. Y-axis—identifiers of samples, IgG pool refers to human pooled IgG sample, numbers 1–8 refer to COVID-19 naïve subjects (same as in B,C), Predicted—MVA-delineated epitopes on SARS-CoV-2 S; S1.1 to S1.10—epitopes on subunit 1 of S (dark blue); S2.1 to S2.5—epitopes on subunit 2 of S (orange colour); total abundance—calculated total abundance of IgG-bound peptides aligned to S per individual sample; colour bar (ratio)—ratio of alignment loads of specific vs random (scrambled) IgG-bound peptide sets. (B) Seroreactivity of COVID-19 naive (n = 8) and COVID-19 patients (n = 2) to spike protein subunits S1 and S2. 50 ng of SARS-CoV-2 S protein recombinant subunits S1 and S2 were immobilised on nitrocellulose slides and incubated with human serum/plasma samples to measure immunoreactivity to SARS-CoV-2 spike protein. Samples (n = 8) of selected subjects from COVID-19 naïve cohort (COVID-19 naïve 1–8) were used, alongside with the pooled human IgG sample (IgG pool, n = 2700 individual healthy donors, Sigma-Aldrich, # I4506). Samples (n = 2) taken at hospital intensive care admittance from patients (P1#1, P2#1) diagnosed with COVID-19 were included as positive controls for anti-spike immunoreactivity (COVID-19+, 1–2). Dots represent dot-ELISA data normalised to the sample from patient 1 (P1#1) (COVID-19+) separately for S1 and S2 subunits (100 represents 100%). Bar plot shows the sum of average of dot-ELISA results for S1 and S2 from independent experiments (n = 3). Error bars represent summarised SEM from independent S1 and S2 results. (C) Scatter plot depicts average ratio of SARS-CoV-2 S2 and S1 signals in dot-ELISA experiments (n = 3) from (B). Error bars represent SEM. (D) Heatmap shows the abundance of IgG-bound peptides containing the 15 epitopes of SARS-CoV-2 S protein in COVID-19 patients and controls. MVA immunoprofiles of serial samples from COVID-19 patients (n = 6) at different time points, where #1 is the sample taken at time of hospital admittance and “#n” where n designates the number of days from the first sample withdrawal. CTRL (n = 6) are age- and gender-matched healthy subjects selected from the cohort of 538 people. Relative abundance depicts the abundance of IgG-bound peptides containing the corresponding S epitopes, where values above 2000 are capped to 2000.
Figure 4Antibody response to immunodominant epitopes on SARS-CoV-2 spike protein as predictors of ill health. (A) Individual immunoprofiles of SARS-CoV-2 S protein epitopes in COVID-19 naïve subjects, grouped into Ctrl (n = 262) or Case (with presented chronic diseases) (n = 276). “low” or “high”—subjects with relatively lower or higher overall response to S, calculated based on abundance of IgG-bound peptides containing the epitopes (S1.1 to S2.5) on SARS-CoV-2 spike protein (see “Methods”). x-axis—individual samples sub-grouped by Abundance (blue colour bar) where colour intensity shows individual abundance of IgG-bound peptides containing epitopes of spike normalised with 97.5th percentile value for visualisation. Age (grey colour bar) in Ctrl; y-axis—epitopes on Spike; S1 epitopes on subunit 1 of S; S2 on subunit 2 of S; COVID-19 naïve Case sub-groups: BC breast cancer (n = 57), MS multiple sclerosis (n = 20), T2D type 2 diabetes (n = 25), CVD—cardiovascular disease (n = 114), ND—neuropsychiatric disorders (n = 60). (B) Multivariable logistic regression analysis was used to describe the associations of epitope seroresponse predictors with the acute and/or chronic disease outcomes. Figure shows receiver operating characteristic (ROC) curve of using response to epitopes S1.6, S1.8, and S2.1 on training data (80% subset, i.e. 431/538 samples) for classifying Case (n = 276) vs Ctrl (n = 262). Area under curve (AUC) = 0.74 with 95% CI = (0.70…0.79). On validation with test set of 20% (107/538) samples, the select model classified samples into Case vs Ctrl with balanced accuracy of 62.0% for “low” group and 65.2% for “high” group (Fig. S10C). (C) Separately, antibody response to epitope S1.6 was identified as prevalent among Ctrl group subjects, whereas immune responses to epitopes S1.8 and S2.1 were prevalent among Case group in COVID-19 naïve cohort. Mann–Whitney U test, **p < 0.01; ****p < 0.0001. Group sizes: Ctrl (n = 262), Case (n = 276); abundance—abundance of IgG-bound peptides containing respective epitopes.
| Reagent or resource | Source | Identifier |
|---|---|---|
| Human IgG reference pool | Sigma-Aldrich | Cat#i4506 |
| Rabbit anti-human IgG (H&L) (HRP) | Abcam | Cat#ab6759 |
| COVID-19 | Tartu University Hospital | COVID-19 |
| Coronary artery disease | Helsinki University Hospital | CVD (CAD) |
| Myocardial infarction | North Estonia Regional Hospital | CVD (MI) |
| T2D and T2D with foot ulcers (DFU) | University of Liege | T2D |
| Multiple sclerosis | Helsinki University Hospital | MS |
| Breast cancer | Helsinki University Hospital | BC |
| First episode psychosis, schizophrenia | Psychiatry Clinic of Tartu University Hospital, Helsinki University Hospital | ND (FEP; SZ) |
| Healthy donor | North Estonia Medical Blood Centre | HC |
| SARS-CoV-2 spike protein S1 subunit | Icosagen | Cat#P-305-100 |
| SARS-CoV-2 spike protein S2 subunit | Icosagen | Cat#P-306-100 |
| Ph.D.™-12 phage display peptide library (modified from original library) | New England Biolabs | Cat#E8110S |
| Catalysed signal amplification (CSA) system II, biotin-free, HRP, DAB+ | Dako (Agilent) | Cat#K1497 |
| Anti-CMV ELISA (IgG) | EUROIMMUN | Cat# EI 2570-9601 G |
| Anti-EBV-CA ELISA (IgG) | EUROIMMUN | Cat# EI 2791-9601 G |
| SPEXS2 algorithm | Courtesy of Egon Elbre | |
| ImageJ v. 1.53a | Schneider et al., 2012 | |
| RStudio v. 1.3.959 | RStudio Team, 2020 | |
| R “tidyverse” packages | Wickham et al., 2019 | |
| R “HPAanalyze” package | Tran et al., 2019 | |
| R “ggpubr” package | Courtesy of Kassambara | |
| R “ggbeeswarm” package | Courtesy of Clarke and Sherrill-Mix | |
| MS Office Excel 2016 | Microsoft Corporation | |
| Adobe Photoshop CS4 version 11.0 | Adobe Systems Inc | |
| Protein G magnetic beads | New England Biolabs | Cat#S1430S |
| Immune epitope database | Immune Epitope Database | |
| Human protein atlas v.20.0 | Uhlen et al., 2015[ | |
| UniProtKB human reference proteome | [ | |
| SARS-CoV-2 proteome sequences | [ | |