| Literature DB >> 33561453 |
Woochang Hwang1, Winnie Lei2, Nicholas M Katritsis3, Méabh MacMahon4, Kathryn Chapman1, Namshik Han5.
Abstract
SARS-CoV-2, which causes COVID-19, was first identified in humans in late 2019 and is a coronavirus which is zoonotic in origin. As it spread around the world there has been an unprecedented effort in developing effective vaccines. Computational methods can be used to speed up the long and costly process of vaccine development. Antigen selection, epitope prediction, and toxicity and allergenicity prediction are areas in which computational tools have already been applied as part of reverse vaccinology for SARS-CoV-2 vaccine development. However, there is potential for computational methods to assist further. We review approaches which have been used and highlight additional bioinformatic approaches and PK modelling as in silico methods which may be useful for SARS-CoV-2 vaccine design but remain currently unexplored. As more novel viruses with pandemic potential are expected to arise in future, these techniques are not limited to application to SARS-CoV-2 but also useful to rapidly respond to novel emerging viruses. CrownEntities:
Mesh:
Substances:
Year: 2021 PMID: 33561453 PMCID: PMC7871111 DOI: 10.1016/j.addr.2021.02.004
Source DB: PubMed Journal: Adv Drug Deliv Rev ISSN: 0169-409X Impact factor: 17.873
COVID-19 severity studies.
| Study | Omics | Moderate | Severe | Cell types | Data | DEG analysis | Ref |
|---|---|---|---|---|---|---|---|
| Overmyer et al. | Bulk RNA-seq | 51 | 55 | PBMC | Available | EBSeq | |
| Jain et al. | Bulk RNA-seq | 10 | 3 | PBMC | Available | DESeq2 | |
| Liu et al. | scRNA-seq | 3 | 6 | PBMC | NA | NA | |
| Xu et al. | scRNA-seq | 5 | 8 | PBMC | Available | Seurat | |
| Silvin et al. | scRNA-seq | 1 | 2 | PBMC | NA | NA | |
| Arunachalam et al. | Bulk RNA-seq | 4 | 12 | PBMC | Available | DESeq2 |
Fig. 1Antigen selection from transcriptomics analysis and protein interactome network analysis. (A) Venn diagram to show the numbers of DEGs in the COVID-19 moderate group and the severe group. (B) Bar graph to show the top 20 highly enriched functions of the up-regulated DEGs in the severe group. (C) SARS-CoV-2 proteins and their SHPs that are differentially expressed genes in the severe group. (D) Protein interactome network between the up-regulated DEGs and SHPs of nsp16.
Common B-cell epitope prediction tools.
| Tool | Method | Accuracy | Utility in SARS-CoV-2 vaccine design publications |
|---|---|---|---|
| Single AA propensity scale | 61.21 AUC with epitopes of 16 AA length [Liu et al., 2020] | [Rakib et al., 2020] | |
| Single AA propensity scale | Self-reported optimal accuracy of 75% | [Rakib et al., 2020] | |
| Single AA propensity scale | 60.76 AUC with epitopes length of 16 AA [Liu et al., 2020] | [Rakib et al., 2020] | |
| Single AA propensity scale | Unavailable | [Rakib et al., 2020] | |
| Single AA propensity scale | 52.70 AUC with epitopes length of 16 AA [Liu et al., 2020] | [Rakib et al., 2020] | |
| Multiple AA propensity scales Based on AA hydrophilicity, accessibility, flexibility, and secondary structure property | Self-reported optimal accuracy of 60% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Multiple AA propensity scales | Self-reported optimal accuracy of 99.29% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy 71.09% | [Tohidinia and Sefid 2020] | |
| ML-based | Self-reported optimal accuracy of 66.41% | [Behmard et al 2020] | |
| ML-based | Self-reported optimal accuracy of 72.94% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 0.62 AUC | [Rahman et al., 2020] | |
| ML-based | Self-reported optimal accuracy of 73% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy 68.50% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 0.758 AUC | [Singh et al., 2020] | |
| ML-based | Self-reported optimal accuracy of 72.9% | [Dar et al., 2020] | |
| ML-based | Self-reported optimal accuracy of 0.702 AUC | [Banerjee et al., 2020] | |
| ML-based | Self-reported optimal accuracy of 0.829 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 0.728 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 86% | [Anand et al., 2020] | |
| Algorithms using residue accessibility and spatial distance cut-off | Self-reported optimal accuracy of 75% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| DiscoTope method incorporated with spatial neighbourhood and half-sphere exposure information | Self-reported optimal accuracy of 0.824 AUC | [Forni et al., 2020] | |
| Thornon’s method integrated with residue clustering algorithm, MODELLER program in predicting and visualizing the epitope structures. | Self-reported optimal accuracy of 91% | [Rajesh Anand et al., 2020] | |
| Incorporating AA propensity scale with side chain orientation and solvent accessibility information of epitope/non-epitope | Self-reported optimal AUC of 75.38 | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Scoring based on comparison to local spatial information of surface residues of 82 antigen–antibody protein complexes | Self-reported optimal accuracy of 0.742 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Mimotope-based | Self-reported optimal sensitivity of 1.00, specificity of 0.839, and precision of 0.256 | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 89.4% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 0.597 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 86.6% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| ML-based | Self-reported optimal accuracy of 0.62 AUC on conformational epitopes | Not utilised/cited in any SARS-CoV-2 vaccine design publications for discontinuous B-cell epitope prediction. | |
This table shows some of the common B-cell epitope prediction tools and their reported accuracy or evaluated accuracy, if available. Their utilities in SARS-CoV-2 vaccine design papers are summarised briefly by the result they have indicated and their application on which protein on SARS-CoV-2, if mentioned. As seen, some tools with relatively high accuracies are yet to be utilised for SARS-CoV-2 vaccine designs.
Common T-cell epitope prediction tools.
| Tool | Method | Accuracy | Utility in SARS-CoV-2 vaccine design publications |
|---|---|---|---|
| Average relating binding (ARB) matrix-based prediction of binding propensity to MHC II | Self-reported optimal accuracy of 83% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Position specific scoring matrix-based predicting of MHC binding peptides | Self-reported optimal accuracy of 0.96 AUC for MHC II and 80% for MHC I | [He et al., 2020] | |
| A matrix-based method utilising pocket profiles of MHC binding peptides | Self-reported optimal sensitivity of 80% | [Dong et al., 2020] | |
| A deep neural model trained on amino acid sequences of MHC epitopes | Self-reported optimal accuracy of 94.18% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| A long short-term memory network trained on common or rare alleles of MHC epitopes | Self-reported optimal accuracy of 0.924 AUC | [Campbell et al., 2020] | |
| Heuristic method translating positional scanning combinatorial library data of MHC I and II epitope data into main and secondary anchor positions with preferred residues for MHC binding peptide predictions | Self-reported optimal accuracy of 0.935 AUC | [Martin and Cheng, 2020] | |
| Indirect method | Self-reported optimal accuracy of 0.952 PPV | [Liu et al., 2020] | |
| A peptide library of X-ray structure of HLA-DP2 proteins with prediction performed using AutoDock [Morris et al., 2009] and Rosetta Dock [Lyskov and Gray 2008] | Self-reported optimal accuracy of 0.900 AUC | [Can et al., 2020] | |
| Quantitative matrix-based approach in predicting MHC II binding based on proteochemometrics | Self-reported optimal accuracy of 93% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| SVM-based model in identifying HLA-DRB1*0401 binding peptides | Self-reported optimal accuracy of 86% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Specificity-determining residue (SDR) based MHC II binding prediction integrated for 95% of MHC II allelic variants | Self-reported optimal accuracy of 0.872 AUC | [Dar et al., 2020] | |
| A stabilized matrix alignment method based on MHC II epitope amino acids preferences by Gibs sampler and predicting using SVRMHC predictions | Self-reported optimal accuracy of 0.756 AUC | [Dong et al., 2020] | |
| ANN-based trained on MHC II binding epitopes with binding core and affinity results available upon query | Self-reported average accuracy of 0.855 AUC | [Behmard et al., 2020] | |
| Combination of NN-align, SMM-align, and the combinatorial peptide scanning library methods on IEDB | Self-reported average accuracy of 0.89 ± 0.05 AUC | [Yazdani et al., 2020] | |
| Proteasomal cleavage site prediction tool. | Self-reported optimal accuracy of 0.805 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Proteasomal cleavage site prediction tool. | Self-reported optimal accuracy of 0.85 AUC | [Shomuradova et al., 2020] | |
| Proteasomal cleavage site prediction tool. | Reported sensitivity of 0.88 and specificity of 0.57 in Gomez-Perosanz et al., 2019 | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Proteasomal fragement-TAP binding prediction tool | Self-reported optimal accuracy of 0.81 Pearson's correlation coefficient | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Proteasomal fragement-TAP binding prediction tool | Self-reported optimal accuracy of 88% | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Proteasomal fragement-TAP binding prediction tool | Self-reported optimal accuracy of 0.89 ± 0.03 Pearson’s correlation coefficient | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Stabilized matrix method (SMM) used to score sequence position in distinguishing TAP-binding epitopes | Self-reported optimal accuracy of 0.89 AUC | [Srivastava et al., 2020] | |
| MHC I binding prediction tool | Self-reported optimal accuracy of 0.94 AUC | [Campbell et al., 2020] | |
| MHC I binding prediction tool | Self-reported optimal accuracy of 0.976 AUC | [Ayyagari et al., 2020] | |
| MHC I binding prediction tool | Self-reported optimal accuracy of 0.8291 PPV | [Campbell et al., 2020] | |
| MHC I binding prediction tool | Self-reported optimal accuracy of 0.979 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| MHC I binding prediction tool | Self-reported optimal accuracy of 0.92 AUC | [Mishra, 2020] | |
| A convolutional neural network-based trained on HLA-Vec, an AA distributed representation of epitopes | Self-reported average accuracy of 0.836 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| A combinatorial optimization from OptiVax tools considering HLA allele frequencies | Unavailable | [Liu et al., 2020] | |
| A visualization tool for predicting antigen immunogenicity based on the HLA presentation | Unavailable | [Yamarkovich et al., 2020] | |
| Estimate baseline frequencies of TCR specificities by database of TCR sequence with known antigen specificities | Unavailable | [Shomuradova et al., 2020] | |
| NLP-based methods in predicting CD4+ or CD8+ TCR binders using large-scale TCR-peptide dictionaries | Self-reported optimal accuracy of 0.98 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| A CNN-based sequence-based predictor trained on AA sequence of peptide and CDR3 region of TCR β-chain | Self-reported optimal accuracy of 0.727 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| Multiple deep learning methods trained on data of MHC binding and TCR sequences from multimodal single-cell experiments | Self-reported optimal accuracy of 0.87 AUC | Not utilised/cited in any SARS-CoV-2 vaccine design publications | |
| SVM-based classifier applied on AA composition and length of IFN-γ inducing MHC class II peptides | Self-reported optimal accuracy of 81.39% | [Behmard et al., 2020] | |
| SVM-based classifier applied on IL4 inducing peptides for Th2 helper cell epitope prediction | Self-reported optimal accuracy of 75.76% | [Behmard et al., 2020] | |
| RF-based model trained on motif and residues of IL-10 inducing peptides | Self-reported optimal accuracy of 0.88 AUC | [Behmard et al., 2020] | |
This table shows some of the common T-cell epitope prediction tools and their reported accuracy or evaluated accuracy, if available. Their utilities in SARS-CoV-2 vaccine design papers are summarised briefly by the result they have indicated and their application on which protein on SARS-CoV-2, if mentioned. As seen, some tools with relatively high accuracies are yet to be utilised for SARS-CoV-2 vaccine designs.
Common formalisms used in modelling immunogenicity.
| Formalism | Advantages | Disadvantages | Application to SARS-CoV-2 Immune Response Studies |
|---|---|---|---|
| Admit analytical solution; | Has a finite dimensional state vector; | ||
| Admit analytical solution; | More complex than ODEs and DDEs; | ||
| Allow derivates with sudden changes in states over continuous model; | Relatively low scalability; | ||
| Able to exhibit complex system-level behaviour with qualitive results | Non-quantitative; | ||
| Explores behaviour of individual entities; | Non-quantitative and entities of properties are discrete; |
Fig. 2Summary of the advantages using computational approaches in vaccine development. Computational approaches can accelerate vaccine development at various stages. The top panel shows a traditional vaccine development process that requires at least 10 years of research and validation. In the centre, the diagram shows how different computational approaches (texts in teal) spur the vaccine development process. In the bottom panel, the timeline at the vaccine design stage has been enlarged. The box summarises the prospectives and challenges our review has proposed at each stage of our suggested computational-assisted vaccine design tools. Processes that have been associated as parts of reverse vaccinology are represented in filled-gold boxes.