Literature DB >> 30788259

In silico design of a triple-negative breast cancer vaccine by targeting cancer testis antigens.

Sepideh Parvizpour1, Jafar Razmara2, Mohammad M Pourseif1, Yadollah Omidi1,3.   

Abstract

Introduction: Triple-negative breast cancer (TNBC) is an important subtype of breast cancer, which occurs in the absence of estrogen, progesterone and HER-2 receptors. According to the recent studies, TNBC may be a cancer testis antigen (CTA)-positive tumor, indicating that the CTA-based cancer vaccine can be a treatment option for the patients bearing such tumors. Of these antigens (Ags), the MAGE-A family and NY-ESO-1 as the most immunogenic CTAs are the potentially relevant targets for the development of an immunotherapeutic way of the breast cancer treatment.
Methods: In the present study, immunoinformatics approach was used to design a multi-epitope peptide vaccine to combat the TNBC. The vaccine peptide was constructed by the fusion of three crucial components, including the CD8+ cytotoxic T lymphocytes (CTLs) epitopes, helper epitopes and adjuvant. The epitopes were predicted from the MAGE-A and NY-ESO-1 Ags. In addition, the granulocyte-macrophage-colony-stimulating factor (GM-CSF) was used as an adjuvant to promote the CD4+ T cells towards the T-helper for more strong induction of CTL responses. The components were conjugated by proper linkers.
Results: The vaccine peptide was examined for different physiochemical characteristics to confirm the safety and immunogenic behavior. Furthermore, the 3D-structure of the vaccine peptide was predicted based on the homology modeling approach using the MODELLER v9.17 program. The vaccine structure was also subjected to the molecular dynamics simulation study for structure refinement. The results verified the immunogenicity and safety profile of the constructed vaccine as well as its capability for stimulating both the cellular and humoral immune responses.
Conclusion: Based on our in-silico analyses, the proposed vaccine may be considered for the immunotherapy of TNBC.

Entities:  

Keywords:  Multi-epitope vaccine; T-cell epitope; T-helper epitope; Triple-negative breast cancer

Year:  2018        PMID: 30788259      PMCID: PMC6378095          DOI: 10.15171/bi.2019.06

Source DB:  PubMed          Journal:  Bioimpacts        ISSN: 2228-5652


Introduction

Breast cancer is the most prevalent malignancy and the leading cause of cancer death among women around the world. According to the statistics published by the National Cancer Institute (NCI) of United States, almost 14.5 million people live beyond a cancer diagnosis in 2014, while this figure is expected to rise up to nearly 19 million by 2024. It is also estimated that around one in nine American women will develop breast cancer during their lifetimes. The main risk factors for developing breast cancer are not clear. Many women with a high risk of breast cancer incidence do not get it, while many others without any known risk factor develop breast cancer. The impact of genetic factors on the mammary carcinogenesis has been proven, in which gene mutations involved in the development of breast cancer have been observed in around 10% of these patients.[1] Exploration of efficient prophylactic and therapeutic strategies against cancer need preliminary knowledge about the role of the immune system in controlling cancer, the origin and initiation of tumor, its progression and heterogeneity. A hypothesis for tumor origination and its heterogeneity is suggested via the clonal evolution model.[2] Based on the model, several mutations appear to happen during the time and accumulated within tumor cells. Many of these mutations are identified and removed by the immune system. However, a number of such mutations may sustain in the fittest cells, leading to heterogeneity and initiation of the cancerous cells.[3] Another hypothesis refers to the concept of cancer stem cells, which can avoid the anoikis with a capability of an extensive proliferation similar to the properties of stem cells. These cells seem to play a central role in the initiation, progression and recurrence of tumors.[4] In addition to the elimination of cancerous cells, it is essential to recognize and eradicate the cancer stem cells in order to treat cancer. Perhaps, the development of effective immunotherapy approaches might be one of the best strategies. In fact, the presence or absence of three known receptors plays a critical role in the diagnosis and treatment of the breast cancer, including estrogen receptors (ER), progesterone receptors (PR), and the epidermal growth factor receptor 2 (HER2). To propose a robust way for the treatment of breast cancer, it is important to target these receptors. In 15%-20% of all breast cancer, however, none of these receptors is expressed by the breast tumor as the ER-, PR- and HER2-negative cancer; the so-called a triple negative breast cancer (TNBC).[2] In the case of TNBC, the patients may show a high rate of inconsistency and failure in terms of the treatment, leading to recurrence, metastasis and finally death. As a result, the treatment of patients with TNBC is very difficult [4] and it is known as the most lethal form of the breast cancer.[2] Recently, cancer testis antigens (CTAs) have been reported as the cancer molecular markers, targeting of which can provide much more significant treatment strategy in cancer therapy.[5] Due to the expression of CTAs in the malignant tumors and their absence in the normal tissues except for the testicular germ cells, they can be valuable targets for cancer immunotherapy. They can also be used as potential mediators for the diagnosis of cancer cell.[6] Until now, more than 100 CTAs and their families have been recognized, which are expressed in a wide range of cancers, including breast cancer.[2] These antigens (Ags) encode immunogenic proteins, which can induce spontaneous humoral or cellular antitumor responses. Autoimmunity in the testis and placenta, however, is disallowed, in large part due to the down-regulation of the MHC and the production of immune-suppressive factors. The detailed systematic knowledge of manually curated CTAs has been collected in the Cancer Testis database (CTdatabase)[7] by the Ludwig Institute for Cancer Research, including immunogenic data, splicing variants, and the gene expression levels for each cancer type. The expression of CTAs in large portion of TNBC has recently been reported in several studies.[8,9] Further, NY-ESO-1 looks like one of the most immunogenic CTA, known to date, which is expressed in the TNBC. Patients with tumors containing expression of NY-ESO-1 frequently show humoral and cellular immune responses against this Ag. In addition, the human melanoma-associated antigen (MAGE) that refers to a family of CTAs categorized into the type I and II based on the tissue expression pattern. The type I MAGEs in the human contains the subfamily of MAGE-A, -B, and –C which are clustered on the X chromosome.[10] The type II MAGEs (i.e., MAGE-D, -E, -F, -G, -H, and –L) are expressed in several human tissues in the X chromosome.[11] A common MAGE homology domain (MHD) exists in both types having around 170 amino acids, which is conserved about 46% within all human MAGEs.[12] The q28 region of the X chromosome carries 12 homologous genes of MAGE-A (MAGE-A1 to -A12). The pieces of evidence accuse the expression of MAGE-A of the growth and progression of tumors.[13] It is also expressed frequently in the TNBC patients.[9,14] Recently, immunotherapies have been shown to play an outstanding role in the cancer therapy since they are well-tolerated and considered as robust treatment modalities for most of the pre-invasive and invasive tumors. Of the anticancer immunotherapies, the multi-epitope peptide cancer vaccines appear as the next generation immunomedicines designed to elicit the immune system responses against tumor cells. The anti-tumor immunity action may be enhanced through either intensifying the CD4+ helper T lymphocytes (HTLs) and CD8+ cytolytic T lymphocytes (CTLs) responses or preventing the immune response suppressors.[15] In addition, various agents can be conjugated to the peptide vaccines as an adjuvant, including granulocyte/macrophage colony-stimulating factor (GM-CSF) to augment the immune responses.[16] It was shown that the magnitude of both humoral and cellular immune responses to the vaccine constructs is enhanced by GM-CSF through professional antigen-presenting cells (APCs). The efficiency of a cancer vaccine is related to its ability for the stimulation of both HTL and CTL responses. The tumor peptide segments, which are basically identified as the T-cell immunodominant epitopes and presented by the major histocompatibility complex class I (MHC-I) molecules on the antigen presenting cells (APCs), are recognized and removed by the CTLs. These epitopes are important elements for the construction of epitope-based vaccines for cancer therapy. The epitope-based techniques for the vaccine design offer several advantages, including cost-effectiveness, safety, specificity and stability under different conditions due to the definition of epitopes. Nevertheless, the vaccine development by this approach may also face with some issues, including fast destruction by tissue and serum peptidases, inability in terms of effective activation of the HTLs and possible limitations in terms of the MHC molecules.[17] However, focusing on the molecular mechanisms of the involved Ags can lead to overcome these difficulties and improve the performance of the epitope vaccine. In this way, the computational methods play a critical role to reduce the complexity of vaccine development, whereas they can be used in different stages of the design process such as selection of proper Ags, carriers, and adjuvants. The emerging bioinformatics approaches have led researchers to propose several alternative strategies for the development of new‐generation vaccines through key approaches, including (i) immunoinformatics, (ii) reverse vaccinology, and (iii) structural vaccinology. In the current study, we intended to in-silico design of a multi-epitope polypeptide vaccine against TNBC. For the development of the vaccine, we capitalized on NY-ESO-1, MAGE-A3, MAGE-A10 and multi MAGE-A antigens as target Ags for the epitope prediction. In addition, GM-CSF was also added to the peptide as an important component in order to augment an antibody- and cell-mediated immunity. The selected components were linked together via appropriate linkers. The designed vaccine was analyzed using several immunoinformatics tools to evaluate and validate the efficacy and capability of the vaccine for the immunotherapy of TNBC.

Methods

Sequence retrieval and preparation

The UniProt database[18] was used to retrieve the amino acid sequence of the human melanoma-associated Ags, including MAGE-A3, MAGE-A10, multi-MAGE-A (MAGE-A-3,10,6,2,12,4) and NY-ESO-1. The sequences were submitted to some different bioinformatics tools for the assessment of antigenicity and prediction of epitope(s). In the sequel, proper amino acid linkers were used to link the derived peptide segments from helper Ag, main Ags and proper adjuvant. The obtained whole protein sequence was analyzed by several online servers to determine its characteristics. The sequence is also back-translated to the DNA sequence to adapt with E. coli codon using JCAT online server.[19] The resulting DNA sequence was optimized in order to clone and process its expression in the E. coli host.

Sequence analysis by immunoinformatics tools

The prediction of MHC class I binding peptides

T cells are major agents for the induction of immune response against foreign Ags. They are activated through binding of the antigenic fragment to the MHC molecules. Thus, it is a critical step in the vaccine design to predict which antigenic fragment can bind to the MHC molecule. IEDB server provides robust tools to look for the MHC class I and II binding common epitopes. The whole numbers of MHC allele interactions can be estimated using outcomes of the IEDB server. The server provides a set of options for users to choose their desired prediction technique, including artificial neural network (ANN), average relative binding (ARB), stabilized matrix method (SMM), SMM with a peptide: MHC binding energy covariance matrix (SMMPMBEC), scoring matrices derived from the combinatorial peptide libraries, consensus, and NetMHCpan.[20] Alleles were added to the list of candidates due to the lowest level of energy (E-total) in docking between each peptide and active site of their alleles. The 3D structure of these alleles has mostly been deposited to the protein data bank (PDB). In the cases lacking 3D structures of alleles in PDB, their structures were predicted by online servers. The MHC-peptide docking was done using the Hex v8.0.0 program. The final T cell epitopes have been chosen within the peptides having the most negative E-total value as a stable system. The LigPlot v1.4.5 program was used to visualize and analyze the hydrogen and hydrophobic interactions between the MHC alleles and the peptides.

The prediction of B-cell epitopes

Linear B-cell epitopes were predicted using BCPREDS online server. A novel method of a subsequence kernel with performance accuracy of 74.57% was used for the prediction of B-cell epitopes with BCPREDS specificity threshold of 0.75%.[21] Furthermore, DiscoTope 2.0 was used to designate the discontinuous B-cell epitopes from the 3D protein structure.[22] DiscoTope uses a novel epitope propensity amino acid score and surface accessibility calculation. Combination of the propensity scores of residues in spatial proximity and the contact numbers is used to calculate the final scores. The identification of epitopes was performed using the default threshold of 3.7, by which the sensitivity and specificity values of 0.47 and 0.75 were obtained, respectively.

Construction of multi-epitope vaccine sequence

The vaccine sequence was the product of the above prediction steps using immunoinformatics tools. The sequence is constructed by linking the high-scoring CTL and HTL epitopes together via appropriate linkers. In addition, GM-CSF was included at the N-terminal end of the vaccine as an adjuvant to improve its immunogenicity.

The prediction of allergenicity

Allergenicity of the constructed multi-epitope vaccine was assessed by submitting the sequence to the online AlgPred web server.[23] The prediction of allergenicity is technically done based on the similarity of the query sequence to the known protein sequences. The AlgPred server looks for MEME/MAST allergen motifs and predicts the sequence allergenicity in the case of the existence of a motif. The server uses an SVM-based module for the composition of amino acids or dipeptides, and hence, the prediction of allergen. The sequence is considered as an allergen protein if there is a hit in its BLAST search against 2890 allergen-representative peptides.

Analysis of physicochemical properties

Physicochemical properties (e.g., molecular weight, sequence length, instability index, aliphatic index, half-life, and grand average of hydropathicity) of the constructed multi-epitope vaccine were further analyzed using the ProtParam web tool.[24] The ProtParam calculates the physicochemical properties of a submitted sequence on the basis of pK values of different amino acids. Stability of a protein is revealed by the instability index, in which an index less than 40 is considered stable. It predicts the half-life based on the ‘N-end rule’, indicating that the N-terminal amino acids determine the degradation of a protein. The aliphatic index indicates the volume occupied by the aliphatic side chains, including alanine, valine, leucine, and isoleucine. In order to calculate the grand average of hydropathicity, the sum of hydropathy of all residues is divided by the total number of residues in a protein.

The prediction of secondary and tertiary structure

The secondary structure of the constructed vaccine was predicted by the PSIPRED online server.[25] The server works based on incorporating two feed-forward neural networks in order to analyze the output obtained from the PSI-BLAST. Further, the tertiary structure of the multi-epitope vaccine sequence was predicted through the homology modeling technique.[26] This prediction was performed by means of the MODELLER version 9.17 software.[27] The latter software is a standalone program applying an automated strategy for comparative modeling of a protein structure based on satisfaction of spatial restraints. UCSF Chimera[28] was also used for visualizing the 3D structures of proteins.

Molecular dynamic (MD) simulation

The constructed vaccine was subjected to the GROMACS 5.0.7 software package for the energy minimization. The structure of the model was refined with a molecular dynamics simulation using the GROMOS 96 force field. To this end, the model was placed in a cubic box with suitable size, and 13 Na+ ions were added to provide a neutralized environment. The MD simulation was done at 300 K for 30 ns. Afterward, the RMSD graph was drawn for analyzing the dynamic behavior and the protein structural changes.

The tertiary structure validation

Different tools were used to assess the resulting 3D-model, including ProSA-web,[29] PRO-CHECK and ERRAT.[30] The overall quality score is calculated by analyzing atomic coordinates of the model using the ProSA-web server. The server provides a plot of the z-score for experimentally determined structures deposited in the PDB. The Ramachandran plot of residue-by-residue stereochemical qualities of models is produced by PROCHECK. The server examines the backbone conformation based on a Psi/Phi Ramachandran plot. In addition, the statistics of the non-bonded atom-atom interactions within a database of reliable high-resolution crystallography structures is calculated by ERRAT.

Codon optimization of the vaccine sequence

Due to the frequent variations of codon usage among organisms, it is necessary to optimize the protein expression within the host cells. In addition, the formation of an unstable structure of mRNA and a large volume of G/C content possibly decrease the protein synthesis due to its negative effects on the initiation and elongation of the translation. Accordingly, the vaccine sequence was submitted to JCat server[31] for the codon optimization. The program calculates the codon adaptation index (CAI) and G/C content, and also predicts the secondary structure of mRNA.

Results

Sequence retrieval

The amino acid sequence of NY-ESO-1 (UniProt ID: P78358), MAGE-A3 (UniProt ID: P43357), MAGE-A10 (UniProt ID: P43363) and multi-MAGE-A proteins were retrieved from the UniProt database. The sequences were subjected to the immunoinformatics analysis in order to select CD8+ CTL and CD4+ helper epitopes. Further, the sequence of GM-CSF (UniProt ID: P04141) was used as the adjuvant.

Epitope prediction

The alleles were submitted to the IEDB server[32] for the prediction of the T-cell epitope. Tables 1, 2 and 3 show the outcomes of the IEDB server for the MHC class I prediction for NY-ESO-1, MAGE-A3, MAGE-A10, multi MAGE-A family, respectively. The results are shown in units of the IC50 score while a lower value specifies a higher affinity. Specifically, the peptides are classified into the high-, intermediate- and low-affinity based on the IC50 scores lower than 50, 500 and 5000 nM, respectively. Additionally, another score is calculated namely percentile rank by comparing the peptide's IC50 score against those of a set of random peptides from the Swiss-Prot database. The higher affinity is indicated by a small value of percentile rank. From the values shown in Tables 1 to 3, the best peptides were chosen based on both IC50 and percentile rank scores. The T-cell epitopes were docked with the MHC alleles. The docking energy (E-total) between each peptide and active site of its allele is shown in Tables 1 to 3. The sequences with the lowest E-total value (kcal/mol) were selected as the CTL epitopes. According to the literature, we found some of the epitopes that have experimentally been confirmed. Figs. 1, 2, and 3 represent the molecular docking of the lowest E-total value fragment between the NY-ESO-1, MAGE-A3, and MAGE-A10 antigens and HLA Class I alleles. The MAGE-A family sequences were aligned as shown in Fig. 4. Based on the alignment, two commonly aligned epitopes were chosen, and they have already been published by Graff-Dubois et al.[33] Similarly, the HTL epitopes were opted from the results obtained from the IEDB MHC-II prediction module based on the higher binding affinity with the MHC-II, as shown in Table 3.
Table 1

CD8+ T-cell epitope selection for NY-ESO-1, MAGE-A3 and MAGE-A10

HLA class I IEDB Position Length Method ic50 score Percentile rank E-total (kJ/mol) Published
NY-ESO-1
A*02:01 SLLMWITQC 157-165 9 ann 32.98 0.1 -540.1 Yes [34]
A*30:01 AARASGPGG 50-58 9 ann 21.2 0.1 -559.95 No
A*32:01 RLLEFYLAMP86-98 10ann/smm36.080.5-418.27No
B*07:02 TPMEAELARR98-109 10ann11.900.1-490.14No
B*40:02 KEFTVSGNI 124-132 9 ann 34.4 0.2 -552.45 No
B*40:02 RLLEFYLAMP85-98 10ann17.00.2-539.32 YES [35]
B*55:01 MPFATPMEA94-102 9ann23.700.2-419.10No
MAGE-A3
A*02:01 KVAELVHFL112-1209ann0.145.93-374.92 Yes [36]
A*02:01 FLWGPRALV 271-279 9 ann 0.1 12.02 -591.74 Yes [37]
A*24:02 IMPKAGLLI 195-203 9 ann 0.1 - -529.78 Yes [38]
A*01:01 ELMEVDPIGHL 165-174 9 ann 0.1 34.74 -648.0 No
A*03:01 RKVAELVHFLL 111-12210ann0.146.43-457.78No
A*02:06 GSVVGNWQYF137-14810ann0.145.46-486.05No
MAGE-A10
A*01:01 KLLTQDWVQ269-2809ann44.110.1-489.12No
A*01:01NGSDPRSFPL319-33010ann54.210.1-552.37No
A*02:01 SLLKFLAKV 310-318 9 ann 8.90 0.4 -663.25 Yes [39]
A*02:01 GLYDGMEHL 254-262 9 ann 9.09 0.8 -432.16 Yes [39]
A*03:01 LLFKYQMKE 145-153 9 ann 47.96 0.1 -664.36 No
A*03:01RAHAEIRKMS301-31310ann54.200.1-699.67No
A*11:01MLSDVQSMPK214-22310ann12.720.2-533.53No
A*31:01KFLAKVNGSD313-32410ann39.870.1-502.90No
A*31:01RSFPLWYEEA324-33510ann29.130.1-551.39No
A*31:01VIWEALNMMG245-25610ann53.00.2-438.42No

* Selected as the highest peptide binding affinity.

The selected sequences are shown in bold.

Table 3

CD4+ helper T-cell epitope selection for NY-ESO-1, MAGE-A3, MAGE-A10

IEDB Position Length Percentile rank ic50 score E-total (kJ/mol)
NY-ESO-1
HLA-DRB1*01:01 SRLLEFYLAMPFATP 85-99 15 2.05 13 -629.80
HLA-DRB1*01:06 HRQLQLSISSCLQQL 142-156 15 1.66 5.81 -590.80
HLA-DRB1*01:01 QCFLPVFLAQPPSGQ 164-178 15 4 40 -557.20
MAGE-A3
HLA-DRB1*01:01 ESEFQAALSRKVAEL 102-116 15 1.15 9 -475.11
HLA-DRB1*01:01AGLLIIVLAIIAREG199-203 153.0217-392.30
HLA-DRB1*01:01ACYEFLWGPRALVET267-281 154.1622-480.01
HLA-DRB1*01:06 FPVIFSKASSSLQLV 147-161 15 1.61 4.32 -637.50
HLA-DRB1*01:05PDLESEFQAALSRKV99-113 151.918.28-344.27
HLA-DRB1*01:04 ETSYVKVLHHMVKIS 280-294 15 2.41 6.55 -496.60
MAGE-A10
HLA-DRB1*01:01 PKTGILILILSIVFI 222-236 15 3.24 18 -821.30
HLA-DRB1*01:01 TGHSFVLVTSLGLTY 197-211 15 4.39 6 -620.60
HLA-DRB1*01:01 ARYEFLWGPRAHAEI 292-306 15 3.37 33 -624.00
HLA-DRB1*01:04AEIRKMSLLKFLAKV304-318 154.018.41-366.54

The selected sequences are shown in bold.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
* Selected as the highest peptide binding affinity. The selected sequences are shown in bold. The selected sequences are shown in bold. The selected sequences are shown in bold. Molecular docking of the lowest E-total value fragment of NY-ESO-1 (amino acids 124-132) with the HLA-B*40:02 active site. (A) The surface and cartoon representations of the complexes HLA-B*40:02 (sky blue) and NY-ESO-1 (gold) visualized by the UCSF Chimera software. (B) The interactions between the vaccine peptide and the MHC allele prepared by Ligplot+ software. The pink dotted lines show hydrogen bonds and the red spline curves indicate hydrophobic contacts. Molecular docking of the lowest E-total value fragment of MAGE-A3 (amino acids 165-174) with the HLA-A*01:01 active site. A) The surface and cartoon representation of the complexes HLA-A*01:01 (sky blue) and MAGE-A3 (red) visualised by UCSF Chimera software. B) The interactions between the vaccine peptide and the MHC allele prepared by Ligplot+ software. The pink dotted lines show hydrogen bonds and the red spline curves indicate hydrophobic contacts. Molecular docking of the lowest E-total value fragment of MAGE-A10 (amino acids 145-153) with the HLA-A*01:01 active site. A) The surface and cartoon representation of the complexes HLA-A*01:01 (sky blue) and MAGE-A10 (yellow) visualised by UCSF Chimera software. B) The interactions between the vaccine peptide and the MHC allele prepared by Ligplot+ software. The pink dotted lines show hydrogen bonds and the red spline curves indicate hydrophobic contacts. MAGE-A family sequence alignment. Two conserved epitopes are shown in red box.

Construction of the multi-epitope peptide vaccine

Based on the results obtained from immunoinformatics analysis, three regions from each sequence of NY-ESO-1, MAGE-A3, MAGE-A10 and two regions from multi-MAGE-A (Fig. 4) were selected as the CTL epitopes. In addition, the same number of regions were selected from each sequence as the helper epitopes (Table 3). The selected CTL and helper epitopes were arranged and fused together within the vaccine peptide by proper linkers. Fig. 5 shows the schematic diagram of vaccine domains and linker's sites.
Fig. 5
Schematic diagram of the vaccine construct consists of the adjuvant, helper epitopes and CTL epitopes, which joined together by proper linkers.

Allergenicity evaluation

The allergenicity of the vaccine construct was evaluated by submitting the sequence to the AlgPred server. Based on the result, the vaccine peptide was predicted to be non-allergic in the nature having a score of -1.1623. The threshold score was found to be -0.40, in which an input sequence is considered non-allergic for the scores less than this value.

Physicochemical analysis of vaccine constructs

Physicochemical characteristics of the vaccine construct were calculated by submitting the sequence to the ProtParam server. The molecular weight (MW) and theoretical isoelectric point (pI) value of the vaccine construct were 55.6 kDa and 5.70, respectively. The in vitro estimation of half-life in mammalian reticulocyte was found to be 30 hours. This value is estimated in vivo to be more than 20 and 10 hours in yeast and E. coli, respectively. The constructed vaccine obtained an instability index of 35.89 indicating stability of the protein. The aliphatic index is estimated to be 86.11 showing thermostability of the protein, in which higher value of this index indicates its more thermostability.[40] The grand average of hydropathicity (GRAVY) was found to be -0.062, while its negativity specifies its hydrophilicity in nature and ability to interact with water molecules.

The secondary structure prediction

The secondary structure of the constructed vaccine was predicted by the PSIPRED server. Fig. 6 shows the predicted secondary structure of the protein. The prediction revealed that the protein consists of 32.8% alpha helix (H), 3.8% beta strand and 63.4% coil (C). The output of the secondary structure prediction was used to refine the 3D model of the vaccine construct.
Fig. 6
The secondary structure prediction of the constructed vaccine prepared by PSIPRED. Based on prediction, the protein consists of alpha-helix (32.8%), beta strands (3.8%) and coils (63.4%).

Homology modeling

The amino acid sequence of the constructed vaccine was submitted to the PSI-BLAST server to look for the potential structural templates of homology modeling. Based on the PSI-BLAST search result, different parts of the construct showed significant alignments with different proteins along its amino acid sequence, including PDB-IDs: 5D71, 4V0P, 2WA0, 4RS1, and 2L27. Accordingly, based on multi-template homology modeling, multiple alignments of these proteins and the vaccine sequence was calculated and used to generate a 3D-model for the construct by the MODELLER 9.17 program.[27] The MODELLER program predicts the 3D-model of an input sequence based on its alignment with sequences of known protein structures as a template. The best model was chosen based on the DOPE and GA341 objective functions, while the higher GA341 and/or the lower DOPE indicates higher quality of a generated model.

The tertiary structure prediction and refinement

The best generated 3D-model was refined in three steps. The model was first submitted to a loop refinement procedure running by MODELLER 9.17. The output structure of the loop refinement was further refined by GalaxyRefine server in order to refine the whole protein structure.[41] The GalaxyRefine method first reconstructs side chains and performs side-chain repacking and following overall structure relaxation (mild and aggressive relaxation) by molecular dynamics simulation.

Energy minimization of the constructed vaccine

A molecular dynamics simulation was done for 30ns in order to minimize the energy level of the constructed vaccine. It can be observed from the Fig. 8(A) that the RMSD value of the vaccine structure is constant and did not change significantly after 6 ns. This observation illustrates that the simulation time was long enough to achieve an equilibrium structure. Additionally, an average temperature of 30 ns simulation at 300 K for the studied system was equal to 300 ± 0.5 K as shown in Fig. 8(B) . Fig. 9 shows superimposition of the initial model and the refined model. It is clear from the figure that the quality of the 3D model was increased after efficient loop refinement and energy minimization.
Fig. 8
Fig. 9
The 3D structure of the constructed vaccine Molecular dynamic simulation analysis. A) A RMSD for the constructed vaccine. B) An average temperature for the vaccine. Superimposition of the initial model and the refined model shown in yellow and blue colors, respectively. To obtain the refined model, quality of the initial model was refined after precise loop refinement and energy minimization.

Model evaluation

The quality of the constructed 3D-model was assessed before and after the refinement using different evaluation tools. The PROCHECK web tool was used to investigate the backbone conformation of the model based on a Psi/Phi Ramachandran plot. The quality of the model after refinement was improved up to 82.4% in the most favored region and 14.2% in generally allowed region, in which only 2.9% of residues were located in the outlier region. ERRAT is another model evaluation tool to examine the overall quality of a model for non-bonded atomic interactions. Evaluation of the model by the ERRAT server indicates an improvement in the structure from 26.4% to 65.2% after the refinement. Furthermore, ProSA-web was used to calculate an overall quality score for the model. The server calculates a z-score of -4.43, which is in the normal range of scores for the native proteins. The overall scores obtained for the model are considered reasonable regarding that the vaccine structure was constructed based on multiple templates with a low sequence identity and using the threading approach.[42,43]

Reverse translation

The sequence of vaccine sequence was submitted to the JCat server in order to generate a codon optimized DNA sequence. The gene sequence resulted in a codon adaptation index (CAI) of 1, where a CAI index more than 0.8 is acceptable for the high-level expression of a gene. The calculated frequency distribution in the gene was about 62%. The overall GC content of the gene was 40.89%, where the ideal range is from 30% up to 70%. The optimized gene does not contain the negative CIS elements and repeat sequences. The gene was cloned in E. coli vectors by introducing the NcoI and XhoI restriction sites to N and C- terminals of the sequence, respectively.

Identification of B-cell epitopes

The role of B-cell epitopes in cancer immunotherapy has been already proven. To predict the B-cell epitopes within the constructed vaccine, the full-length protein was submitted to BCPred and DiscoTope servers. Linear B-cell epitopes were predicted by BCPred as shown in Table S1, and discontinuous B-cell epitopes were determined by discoTope as listed in Table S2 (See online supplementary data).

Discussion

TNBC is known as one of the most difficult types of breast cancer for the treatment having a high probability of dissemination and recurrence. The rate of the survival of TNBC comparatively is lower than the other types of breast cancer, showing short overall survival, a high degree of malignancy and strong invasive potential. It is mostly diagnosed at advanced stages with a high risk of visceral metastasis.[44-46] Regarding these specific characteristics of TNBC and limited modalities of its treatment, immunotherapy can be considered as a relevant strategy for its prevention and treatment. Cancer vaccines generally work based on the cellular immunity deduced from the CTL responses.[47] The efficiency of a constructed vaccine depends on the selection of proper CTL epitopes as the most key component, helper epitopes and adjuvant.[48] In addition, the fusion of these components needs appropriate linkers in order to generate a protein with the desired functionality. In other words, the selection of improper linkers may result in a protein having different characteristics. Accordingly, it is critically important to choose accurate components for the construction of a peptide vaccine. To achieve this goal, the tools from immunoinformatics can provide useful facilities for high accurate vaccine design. The feasibility, accuracy and speed of in silico tools for the analysis and design of multi-epitope vaccines against different infectious diseases or even cancer has been demonstrated by different computational studies.[49,50] Peptide-based vaccines generally induce antigen-specific T-cell responses against cancer-related antigens when they are administered with proper adjuvants. Essentially, the adjuvant is a key component within a peptide vaccine for optimizing the functionality of dendritic cells and generating an appropriate immune response. GM-CSF has been used in a number of tumor-associated antigen (TAA) protein-based vaccines. The capability of the GM-CSF-based vaccines to induce potent antiviral and antitumor response in preclinical experiments has been demonstrated.[16,51] It provides a critical link between dendritic cells and T cells, whereas it activates dendritic cells through the mechanism known as GM-CSF-induced antitumor immunity. Consequently, the GM-CSF can be an efficient candidate immune adjuvant to induce antitumor responses.[52] Additionally, linkers play an essential role in the structural and functional behavior of a peptide vaccine. The selected components for the vaccine construction were joined together via different linkers. Two motifs were used to link CTL epitopes, including HEYGAEALERAG and AAY. The specific cleavage site for both proteasomal and lysosomal degradation system is provided by the HEYGAEALERAG sequence. Moreover, the epitope presentation is improved by the AAY sequence whereas the generated C-terminal after cleavage properly binds to the TAP transporter or other chaperons.[53,54] Furthermore, the predicted T-helper components were linked together via the GPGPG linker. These linkers stimulate HTL responses and the conserve conformational dependent immunogenicity of helpers as well as antibody epitopes.[55] The EAAAK sequence was also used as a rigid linker for controlling the molecular distances and reducing interferences between the domains.[54,56] Ultimately, the whole vaccine structure was constructed by linking the three major sections of the vaccine, namely adjuvant, T-helper epitopes and CTL epitopes via the above appropriate linkers. Several evaluations were carried out to investigate the immunological, physicochemical and structural features of the constructed vaccine. The protein sequence consists of 514 amino acids, which is considered a reasonable length for a cancer vaccine. A longer peptide enforces its presentation by dendritic cells increasing the induction strength of T-cell immune responses. However, sequences with a length less than 30 residues possibly bind to MHC-I or MHC-II molecules of nonqualified APCs.[57] In addition to the main goal of cancer vaccination in the induction of cellular arm of the host immune system, the maximum efficiency could be achieved by considering the HTL responses in the vaccine design. Furthermore, the critical role of CD4+ T-helper cells in the preservation of CTL responses has been demonstrated.

Conclusion

TNBC has emerged as an important subtype of breast cancer and the reason for a large number of deaths among women worldwide. In this work, we used the reverse vaccinology approach to design a multi-epitope based vaccine for the cancer immunotherapy using immunoinformatics tools. The vaccine was constructed by the fusion of multiple CTL and HTL epitopes via appropriate linkers in order to stimulate the humoral, cellular and innate immune responses. The vaccine was also included GM-CSF as an adjuvant to increase its immunogenicity. Based on several immunoinformatic analyses, the designed multi-epitope peptide can be efficiently used for the therapeutic or prophylactic purposes against TNBC. The successful immunogenicity of the proposed vaccine would be proved by the future in vitro and in vivo studies.

Acknowledgment

The authors like to acknowledge the Research Center for Pharmaceutical Nanotechnology at Tabriz University of Medical Sciences and also the Department of Computer Science at the Faculty of Mathematical Sciences at the University of Tabriz for their technical support.

Funding sources

The authors would like to thank Iranian Elite National Foundation “Bonyad Melli Nokhbegan” and the Iranian National Science Foundation (INSF) for the financial support (grant No: 95005404).

Ethical approval

None to be declared.

Competing interests

As the contributing author to this study, YO acts as the Editor of the Bioimpacts journal. It is hereby declared that his association with the journal has neither influenced the review process nor affected the acceptance of this study.

Authors contribution

SP and YO developed the concept of the study. SP, MMP and JR performed the analyses. SP and MMP drafted the manuscript. YO finalized the manuscript and the submission process. All the coauthors approved the submission of the manuscript.

Supplementary Materials

Supplementary data contains Tables S1-S2. √ Triple-negative breast cancer (TNBC) is an important subtype of breast cancer, which occurs in the absence of estrogen, progesterone and HER-2 receptors. Cancer testis antigens (CTAs) have been reported as one of the promising be a more significant therapeutic way for cancer treatment. √ In the present study, immunoinformatics approach was used to design a multi-epitope peptide vaccine to combat the TNBC malignancies. √ The vaccine peptide was constructed by the fusion of three crucial components, including the CD8+ cytotoxic T lymphocytes (CTLs) epitopes, helper epitopes and adjuvant. √ The homology modeling approaches were conducted for prediction of the 3D structure of the constructed cancer vaccine.
Table 2

Multi-MAGE-A family epitope selection[33]

Sequence MAGE-A family
CLGLSYDGLL1, 2, 3, 4, 6, 12
YLEYRQVPG 2, 3, 4, 6, 10, 12
YEFLWGPRA 1, 2, 3, 4, 6, 10, 12

The selected sequences are shown in bold.

  5 in total

1.  In Silico Design and Evaluation of PRAME+FliCΔD2D3 as a New Breast Cancer Vaccine Candidate.

Authors:  Mortaza Taheri-Anganeh; Amir Savardashtaki; Asma Vafadar; Ahmad Movahedpour; Zahra Shabaninejad; Amir Maleksabet; Ahmad Amiri; Younes Ghasemi; Cambyz Irajie
Journal:  Iran J Med Sci       Date:  2021-01

Review 2.  Review on Approved and Inprogress COVID-19 Vaccines.

Authors:  Amir Farnudian-Habibi; Mobina Mirjani; Vahideh Montazer; Shima Aliebrahimi; Iman Katouzian; Saeed Abdolhosseini; Ali Rahmani; Hossein Keyvani; Seyed Nasser Ostad; Mazda Rad-Malekshahi
Journal:  Iran J Pharm Res       Date:  2022-01-24       Impact factor: 1.962

3.  Designing a novel SOX9 based multi-epitope vaccine to combat metastatic triple-negative breast cancer using immunoinformatics approach.

Authors:  HemaNandini Rajendran Krishnamoorthy; Ramanathan Karuppasamy
Journal:  Mol Divers       Date:  2022-10-10       Impact factor: 3.364

4.  The cancer-testis antigen a-kinase anchor protein 3 facilitates breast cancer progression via activation of the PTEN/PI3K/AKT/mTOR signaling.

Authors:  Chuan-Hua Zhan; Dong-Shen Ding; Wei Zhang; Hong-Liang Wang; Zhe-Yu Mao; Guo-Jun Liu
Journal:  Bioengineered       Date:  2022-04       Impact factor: 6.832

5.  A multi-method and structure-based in silico vaccine designing against Echinococcus granulosus through investigating enolase protein.

Authors:  Mohammad Mostafa Pourseif; Mitra Yousefpour; Mohammad Aminianfar; Gholamali Moghaddam; Ahmad Nematollahi
Journal:  Bioimpacts       Date:  2019-03-08
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.