Literature DB >> 35057815

The polymorphism analysis and epitope predicted of Alphapapillomavirus 9 E6 in Sichuan, China.

Jiaoyu He1,2,3, Qiufu Li1,2, Shiyu Ma1,2, Tianjun Li1,2, Yuning Chen4, Yiran Liu1,2, Yanru Cui1,2, Jianying Peng1,2, Yunfan Shi1,2, Xia Wei1,2, Xianping Ding5,6,7.   

Abstract

BACKGROUND: The Alphapapillomavirus 9 (α-9 HPV) is a member of the Alphapapillomavirus genus and Papillomaviridae family. These viruses are almost all carcinogenic HPV, which is closely related to 75% of invasive cervical cancer worldwide, and has a high prevalence in Sichuan. The carcinogenic function is mainly realized by its E6 oncoprotein.
METHODS: Cell samples were collected by cervical scraped for HPV detecting and typing. HPV-16, HPV-31, HPV-33, HPV-52, HPV-58 5 α-9 genus HPV subtype positive samples were selected, their E6 gene was sequenced and analyzed. The positive selection sites of HPV E6 genes were estimated by PAML 4.8 server. The secondary and tertiary structure of E6 protein were predicted by PSIPred and Swiss-model. The T-cell antigen epitopes of E6 protein were predicted by IEDB.
RESULTS: α-9 HPV has a high prevalence in Sichuan, China. From 2012 to 2017, 18,067 cell cervical samples were collected, and 3135 were detected with α-9 HPV infection. Among which, 250 cases HPV-16 E6, 96 cases HPV-31 E6, 216 cases HPV-33 E6, 288 cases HPV-52 E6 and 405 cases HPV-58 E6 were successfully amplified, 17, 6, 6, 13, and 4 non-synonymous nucleotide mutations were respectively detected in HPV-16, 31, 33, 52, and 58 E6, 7 positive selection sites of α-9 HPV E6 were selected out (D32E of HPV-16 E6, K35N, K93N and R145I of HPV-33 E6, K93R of HPV-52 E6, K93N and R145K of HPV-58 E6). The structure and antigen epitopes of E6 protein with amino acid substitution differ from those of wild-type E6 protein, especially for the mutation located in the E6 positive selection site.
CONCLUSIONS: HPV E6 nucleotide non-synonymous mutation in the positive selection site influence the protein structure and decrease the antigen epitopes affinity of the E6 protein overall, making it more difficult for the HPV-infected cells to be detected by the immune system, and enhancing the HPV adaptability to the environment. Mutations influence the validity of HPV clinical diagnostic probes, the polymorphism analysis of α-9 HPV E6 enrich the data of HR-risk HPV in Sichuan China, and the detection probes designed with the polymorphism data in mind can improve the efficiency of clinical detection; Mutations influence epitopes affinity, the association of E6 polymorphism and epitope affinity can improve the design of therapeutic vaccine with good immunity and high generality antigen epitope; The above study all provide a good theoretical basis for the prevention and treatment of HPV-related diseases.
© 2022. The Author(s).

Entities:  

Keywords:  Alphapapillomavirus 9 genus HPV; Antigen epitope; E6 gene polymorphisms; Human Papillomavirus; Positive selection site; Protein structure

Mesh:

Substances:

Year:  2022        PMID: 35057815      PMCID: PMC8772103          DOI: 10.1186/s12985-021-01728-4

Source DB:  PubMed          Journal:  Virol J        ISSN: 1743-422X            Impact factor:   4.099


Introduction

Cervical cancer is the second most common cancer among women aged 15–44, 99.7% of the cervical cancers were found to be associated with high-risk (HR) Human Papilloma Virus (HPV) persistent infection [1]. The three main genera are α, β, γ, of which α genus is associated with anal and oral mucosal infection. α-9 genus HPV is almost all carcinogenic HPV, causing 75% invasive cervical cancer worldwide, and its carcinogenicity is mainly realized by E6 and E7 early proteins encoded by HPV E6 and E7. The carcinogenicity of HPV E6 protein is more evident than that of E7 protein, in terms of the cell cycle changes and the efficiency of HPV infected cells to permanent biochemical transformations [2, 3]. Without E7, E6 can connect and ubiquitin degradation the p53 protein via E6AP, interfere the cell cycle, activate telomerase and reverse transcriptase to accumulate the mutations, for infected cells immortalization and maintain immortalization, which is closely related to the function of HPV immortalization, cell transformation, and carcinogenesis [4-7]. At present, there is no specific drug for HPV treatment and mainly relies on the body's immune system to detect and eliminate the virus. Human leukocyte antigen (HLA) has the function of recognizing itself target by recognizing and stimulating CD8+ cytotoxic T lymphocytes (CTL), CD4+ helper T lymphocytes (Th) as well as binding antigen polypeptide to regulate the immune response, control and eliminate HPV infection [8-11]. The antigen epitopes are composed of specific amino acid sequences, are the targets of immune rejection [12-14]. HPV E6 protein has been considered as a potential target for the activation of T cells in immune response strategies and maybe an ideal target for HPV therapeutic vaccines [15, 16]. HPV is a high infectious and mutable virus, with different epidemic trends and mutation types in different regions and populations [17]. The polymorphism of HPV E6 oncogene is strong, its non-synonymous mutation changed E6 protein amino acid composition, which may relate to the differences in immune response and pathogenicity [18]. Some mutant strains can even fix their genes by mutations, enhancing their adaptability to the environment and changing its infection rate [19]. For example, L83V (L90V) of HPV-16 E6 in Swedish and Italian populations as well as D25E (D32E) of HPV-16 E6 in Japanese populations have been proven to be associated with the progression of cervical cancer [20-23]. In the current HPV vaccine design targeting E6 protein, the E6 mutants have almost never been considered. In the epitope-specific vaccine designed by Kelly L, the body's long-active T-cell response to HPV-18 was induced by targeting the reference sequence of HPV E6 and E7. Due to the relatively rapid mutation rate of HPV, the host response capacity, malignant tumor prevention and therapeutic efficacy of vaccine were changed great [15]. HPV has strong regional and population differences, the prevalence of α-9 HPV and the harmfulness of E6 oncoprotein are extremely high, and E6 polymorphism is closely related to the difference of immunogenicity, adaptability, and pathogenicity. Therefore, it’s urgent to study the genetic diversity, positive selection sites, antigen epitope, the protein structure of α-9 HPV E6 for providing data to realize the effective prevention and control of the disease in this region.

Materials and methods

Samples resource

The study was ethically approved by the Education and Research Committee and Ethics Committee of Sichuan University, Sichuan, China. Eighteen thousand sixty-seven specimens were randomly collected from January 2012 to December 2017 in Chengdu Women and Children's Center Hospital, Chengdu Jinjiang District Women and Children's Hospital, Angel Women's and Children's Hospital, Affiliated Hospital of Sichuan Reproductive Health Research Center, Sichuan Reproductive Health Research Center Affiliated Hospital, Shuangnan Hospital, Chengdu Song zi niao Sterility Hospital, Infertility Hospital Affiliated to Chengdu Medical College and Chengdu Jinsha hospital. Before sample collection, written informed consent was obtained from all patients or their guardians, and patient privacy is strictly protected. The cell specimens were collected randomly by cervical scraped and placed in − 20 °C antiseptic buffer (9 g NaCl, 10 g C6H5CO2Na, 1 L H2O).

Genomic DNA extraction and HPV typing

HPV DNA was extracted and evaluated using the Human Papillomavirus Genotyping Kit For 23 Types (Yaneng Bio, Shenzhen, China) according to the manufacturer’s guidelines.

PCR amplification and variant identification

The primers of α-9 HPV E6 were designed by PRIMER version 5.0 and NCBI (National Center for Biotechnology Information) Primer Blast based on the reference sequences, the primers and reference sequences used for the molecular characterization analysis of α-9 HPV E6 were shown in Additional file 1: Table S1 and synthesized by TSINGKE (Chengdu, China). The PCR reaction system consists of 5 µl HPV DNA, 13.1 µl ddH2O, 1 µl primers, 0.4 µl TransTaq DNA polymerase, 2.5 µl dNTPs, and 3 µl buffer. The reaction conditions were shown in Additional file 1: Table S1. The PCR products were visualized by gel electrophoresis in 2% agarose gel (Sangon Biotech Co., Ltd.). The target products of E6 were purified and sequenced by TSINGKE at least twice (Chengdu, China).

Sequence analysis

Genetic polymorphisms analysis of α-9 HPV E6 gene

The successfully amplified sequences was sequenced, and the sequences were analyzed by NCBI BLAST, Premier5, and DNAMAN5.2.2. Nucleotide mutations of α-9 HPV E6 sequence were determined according to the reference sequence in GenBank (Additional file 1: Table S1). Chi-square test was used to confirm the significance of data differences, and P < 0.05 was considered as significant differences between the data.

Selective pressure analysis of α-9 HPV E6

Phylogenetic Analysis by Maximum Likelihood 4.8 (PAML 4.8, http://abacus.gene.ucl.ac.uk/software/paml.html) was used to calculate the ratio (ω = dN/dS) between non-synonymous mutation rate (dN) and synonymous mutation rate (dS) to determine the α-9 HPV E6 gene-positive selection sites.

Amino acid composition and protein structure analysis of α-9 HPV E6

Mega6.0 software was used to translate the E6 nucleotide sequence into the E6 protein sequence. PSIPred (http://bioinf.cs.ucl.ac.uk/psipred/) and Swiss-model were used to analyze the secondary and tertiary structure of E6 protein.

T-cell antigen epitopes predicted analysis of α-9 HPV E6 protein

According to the Chinese major histocompatibility complex database (dbMHC) average frequency of HLA alleles, 13 HLA-I and 6 HLA-II alleles were selected (Additional file 1: Table S2). Based on the selected HLA alleles, the T-lymphocyte epitopes of α-9 HPV E6 protein were predicted by IEDB resource (http://www.iedb.org/). According to the method recommended by IEDB, lower the percentile rank (PR) of antigen epitopes is better the affinity, peptides with PR < 1.0 for HLA-I and peptides with PR < 5.0 for HLA-II were deemed to meaningful as well as selected for further analysis.

Results

The prevalence of α-9 HPV in Sichuan

Out of 18,067 samples, 6092 positive results were detected and 4466 were HR HPV, all of which belonged to α genus. The HPV positive samples of α-1, α-3, α-5, α-6, α-7, α-8, α-9, α-10, α-11 are 167 (2.74%), 25 (0.41%), 137 (2.25%), 438 (7.19%), 571 (9.37%,), 413 (6.78%), 3270 (53.68%), 1021 (16.76%), 50 (0.82%) respectively (Fig. 1). α-9 HPV accounting for 73.22% of HR HPV positive samples. Due to the small positive sample sizes of HPV-35 and HPV-67, five other HPV type (HPV-16, HPV-31, HPV-33, HPV-52, HPV-58) were selected for subsequent studies.
Fig. 1

Distribution of different HPVs positive samples from 2012 to 2017

Distribution of different HPVs positive samples from 2012 to 2017

Nucleotide polymorphisms and selective pressure analysis of α-9 HPV E6

250 HPV-16 E6 were successfully amplified, 162 (64.80%) HPV-16 E6 samples were variants, and 17 non-synonymous mutations were detected. 96 HPV-31 E6 were successfully amplified, 68 (70.80%) variants and 6 non-synonymous mutations were detected. 216 HPV-33 E6 were successfully amplified, 76 (35.19%) variants and 6 non-synonymous mutations were detected. 288 HPV-52 E6 were successfully amplified, 250 (86.80%) variants and 13 non-synonymous mutations were detected. 405 HPV-58 E6 were successfully amplified, 356 (87.90%) variants and 4 non-synonymous mutations were detected. Details of α-9 HPV E6 nucleotide polymorphisms were shown in Tables 1, 2, 3, 4 and 5. All the sequences were submitted to the GenBank, and accession numbers were obtained. (HPV16E6: MZ803036-MZ803058, HPV31E6: MZ803026-MZ803035, HPV33E6: MZ576479-MZ576485, HPV52E6: MZ803059-MZ803078, HPV58E6: MZ803079-MZ803087).
Table 1

Nucleotide mutation and amino acid substitution in HPV-16 E6

HPV-16 E6
No123456789101112131415161718192021
Location212304994969696101103103111185192194196253268313360452
MutationT–AG–AG–AA–GG–AT–AT–GT–CT–GT–GT–CT–CG–TG–TA–GC–GC–TT–GG–CA–CG–C
Frequency (%)0.803.200.800.800.800.8049.60.800.800.800.801.600.800.801.600.800.805.600.805.600.80
SubstitutionM1KR17GD32ND32ED32ED32EI34RL35VL35VR62IN65SP66AH85YL90VD105HE120DR151T

Compared with the HPV-16 E6 reference sequence (NC001526), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations

Table 2

Nucleotide mutation and amino acid substitution in HPV-31 E6

HPV-31 E6
No12345678910111213
Location2769141178190194205219228297321368413
MutationT–AC–TT–CC–TA–GA–GT–CA–GT–CG–AA–GA–GC–T
Frequency (%)8.3327.084.1731.254.172.084.1716.672.0831.2527.084.1735.42
SubstitutionH60YT64AK65RF69LK123RA138V

Compared with the HPV-31 E6 reference sequence (J04353), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations

Table 3

Nucleotide mutation and amino acid substitution in HPV-33 E6

HPV-33 E6
No12345678
Location105165221256279338434441
MutationA–CA–GG–CA–CA–CA–GG–TT–C
Frequency (%)19.447.873.247.4119.447.4115.2811.57
SubstitutionK35NS74TN86HK93NQ113RR145I

Compared with the HPV-33 E6 reference sequence (M12732.1), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations

Table 4

Nucleotide mutation and amino acid substitution in HPV-52 E6

HPV-52 E6
No123456789101112131415161718192021
Location618293118136136230237249255265270277277278315315366381412429
MutationG–AC–TG–CA–CC–GC–TG–AT–CG–TG–AG–AG–AA–CA–GA–GT–GT–AC–AT–AG–AA–G
Frequency (%)0.700.700.700.700.702.100.700.7085.409.700.700.701.402.1084.000.700.700.700.700.702.80
SubstitutionE21KL46VL46VR77KE89KK93RK93RK93RI105MI105MN122KN127IE138K

Compared with the HPV-52 E6 reference sequence (NC001592), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations

Table 5

Nucleotide mutation and amino acid substitution in HPV-58 E6

HPV-58 E6
No12345678
Location7894150198258279286434
MutationC–TG–CA–GC–TC–AA–CT–CG–A
Frequency (%)1.730.250.4958.520.2527.410.253.95
SubstitutionE32QD86EK93NR145K

Compared with the HPV-58 E6 reference sequence (D90400), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations

Nucleotide mutation and amino acid substitution in HPV-16 E6 Compared with the HPV-16 E6 reference sequence (NC001526), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations Nucleotide mutation and amino acid substitution in HPV-31 E6 Compared with the HPV-31 E6 reference sequence (J04353), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations Nucleotide mutation and amino acid substitution in HPV-33 E6 Compared with the HPV-33 E6 reference sequence (M12732.1), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations Nucleotide mutation and amino acid substitution in HPV-52 E6 Compared with the HPV-52 E6 reference sequence (NC001592), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations Nucleotide mutation and amino acid substitution in HPV-58 E6 Compared with the HPV-58 E6 reference sequence (D90400), the mutations are marked with the corresponding bases and amino acid, and those without changes are replaced with a dash (–). No. means the number of nucleotide mutations, Location means the sites of nucleotide mutations, Mutation means the style of nucleotide mutations, Frequency (%) means the percentage of nucleotide mutations, Substitution means the amino acid substitution that occurred by nucleotide mutations Calculated by Codeml software using Naive NEB and Bayes Empirical Bayes models, seven positive selection sites of α-9 HPV E6 were detected, there were D32E of HPV-16 E6, K35N, K93N, R145I of HPV-33 E6, K93R of HPV-52 E6, K93N, R145K of HPV-58 E6. In contrast, no reliable HPV-31 E6 positive selection site was selected out (Table 6).
Table 6

Positive section site of α-9 HPV E6

ModelHPV-16HPV-31HPV-33HPV-52HPV-58
M7NANANANANA
M832D**NA35 K**, 93 K**, 145R**93 K**93 K**, 145R**

M7 means NEB (Naive Empirical Bayes) model, M8 means BEB (Bayes Empirical Bayes) model. When the posterior probability was ≥ 0.9, the BEB method was used to identify the positive selection sites. P < 0.05 indicates that the results of M8 model are reliable, two asterisks means a posteriori probability ≥ 0.99, and NA means not apply

Positive section site of α-9 HPV E6 M7 means NEB (Naive Empirical Bayes) model, M8 means BEB (Bayes Empirical Bayes) model. When the posterior probability was ≥ 0.9, the BEB method was used to identify the positive selection sites. P < 0.05 indicates that the results of M8 model are reliable, two asterisks means a posteriori probability ≥ 0.99, and NA means not apply

The protein structure analysis of α-9 HPV E6

Nucleotides non-synonymous mutation changed the amino acid composition of protein, which affects the structure of the protein, while the protein function is mainly realized by its structures. With the help of Mega6.0, PSIPred and Swiss-model, the primary, secondary, and tertiary structure difference of α-9 HPV E6 protein reference and mutation sequence were revealed. In HPV-16 E6, I34R, L35V, R62I, P66A and L90V all located in β-fold, E120D, D32N, D32E located in the periphery of the spatial protein structure and close to the active region of znic granules. The amino acid number in the α-helix and β-sheet regions are different in protein reference and mutation sequence. Details are shown in Figs. 2 and 3.
Fig. 2

Secondary structure of HPV-16 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-16 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-16 E6-1 mutation sequence; c is the secondary structure pattern diagram constructed based on HPV-16 E6-2 mutation sequence, b and c are different in 32th amino acid. The black boxes are the difference areas between the reference and mutation sequence secondary structure.

Fig. 3

Tertiary structure of HPV-16 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-16 E6 reference sequence; b is the larlarian diagram of HPV-16 E6 reference sequence homology modeling; c is homology modeling structure of HPV-16 E6 mutation sequence; d is the larchian diagram of HPV-16 E6 mutation sequence homology modeling

Secondary structure of HPV-16 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-16 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-16 E6-1 mutation sequence; c is the secondary structure pattern diagram constructed based on HPV-16 E6-2 mutation sequence, b and c are different in 32th amino acid. The black boxes are the difference areas between the reference and mutation sequence secondary structure. Tertiary structure of HPV-16 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-16 E6 reference sequence; b is the larlarian diagram of HPV-16 E6 reference sequence homology modeling; c is homology modeling structure of HPV-16 E6 mutation sequence; d is the larchian diagram of HPV-16 E6 mutation sequence homology modeling In HPV-31 E6, T64A, K65R, and F69L located in α-helix, T60Y located in β-sheet region, and K123R, A138V located in the coil. Amino acid substitution has no influence on the secondary and tertiary structure (Figs. 4, 5).
Fig. 4

Secondary structure of HPV-31 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-31 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-31 E6 mutation sequence

Fig. 5

Tertiary structure of HPV-31 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-31 E6 reference sequence; b is the larlarian diagram of HPV-31 E6 reference sequence homology modeling; c is homology modeling structure of HPV-31 E6 mutation sequence; d is the larchian diagram of HPV-31 E6 mutation sequence homology modeling

Secondary structure of HPV-31 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-31 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-31 E6 mutation sequence Tertiary structure of HPV-31 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-31 E6 reference sequence; b is the larlarian diagram of HPV-31 E6 reference sequence homology modeling; c is homology modeling structure of HPV-31 E6 mutation sequence; d is the larchian diagram of HPV-31 E6 mutation sequence homology modeling S74T and Q113R located in α-helix of HPV-33 E6 protein, K93N located on the outer edge of E6 protein and near the zinc granule, the above amino acid substitutions all located in the active region of the protein. Amino acid substitution changed the number of amino acids in the α-helix and β-sheet region, as well as made the E6 protein show more contact with the environment (Figs. 6, 7).
Fig. 6

Secondary structure of HPV-33 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-33 E6 reference sequence; a is the secondary structure pattern diagram constructed based on HPV-33 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure

Fig. 7

Tertiary structure of HPV-33 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-33 E6 reference sequence; b is the larlarian diagram of HPV-33 E6 reference sequence homology modeling; c is homology modeling structure of HPV-33 E6 mutation sequence; d is the larlarian diagram of HPV-33 E6 mutation sequence homology modeling

Secondary structure of HPV-33 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-33 E6 reference sequence; a is the secondary structure pattern diagram constructed based on HPV-33 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure Tertiary structure of HPV-33 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-33 E6 reference sequence; b is the larlarian diagram of HPV-33 E6 reference sequence homology modeling; c is homology modeling structure of HPV-33 E6 mutation sequence; d is the larlarian diagram of HPV-33 E6 mutation sequence homology modeling R77K, E89K of HPV-52 E6 located in α-helix, N127I located in the β-sheet region, K93R situated on the outer edge of E6 protein and close to the zinc granules, all the amino acid substitutions found in the active region of the protein. Amino acid substitution increased the number of amino acids in the α-helix and β-sheet region, and the number of buried amino acids decreased (Figs. 8, 9).
Fig. 8

Secondary structure of HPV-52 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-52 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-52 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure

Fig. 9

Tertiary structure of HPV-52 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-52 E6 reference sequence; b is the larlarian diagram of HPV-52 E6 reference sequence homology modeling; c is homology modeling structure of HPV-52 E6 mutation sequence; d is the larlarian diagram of HPV-52 E6 mutation sequence homology modeling

Secondary structure of HPV-52 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-52 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-52 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure Tertiary structure of HPV-52 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-52 E6 reference sequence; b is the larlarian diagram of HPV-52 E6 reference sequence homology modeling; c is homology modeling structure of HPV-52 E6 mutation sequence; d is the larlarian diagram of HPV-52 E6 mutation sequence homology modeling E32Q, D86E, K93N and R145K are located in the coil of HPV-58 E6 protein, E32Q, K93N situated on the outer edge of E6 protein, and close to the zinc granule, belonging to the active region of E6 protein. Amino acid substitution increased the number of amino acids in the α-helix region and decreased the number of amino acids in the coil region (Figs. 10, 11).
Fig. 10

Secondary structure of HPV-58 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-58 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-58 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure

Fig. 11

Tertiary structure of HPV-58 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-58 E6 reference sequence; b is the larlarian diagram of HPV-58 E6 reference sequence homology modeling; c is homology modeling structure of HPV-58 E6 mutation sequence; d is the larlarian diagram of HPV-58 E6 mutation sequence homology modeling

Secondary structure of HPV-58 E6 comparing reference to the variant sequence. Note a is the secondary structure pattern diagram constructed based on HPV-58 E6 reference sequence; b is the secondary structure pattern diagram constructed based on HPV-58 E6 mutation sequence. The black boxes are the difference areas between the reference and mutation sequence secondary structure Tertiary structure of HPV-58 E6 comparing reference to the variant sequence. Note a is the homology modeling structure of HPV-58 E6 reference sequence; b is the larlarian diagram of HPV-58 E6 reference sequence homology modeling; c is homology modeling structure of HPV-58 E6 mutation sequence; d is the larlarian diagram of HPV-58 E6 mutation sequence homology modeling

The antigen epitopes analysis of α-9 HPV E6 protein

In HPV-16 E6 reference sequence, 97 HLA-I and 25 HLA-II epitopes were selected out, and epitope prediction results of variants were different, details were shown in Additional file 1: Tables S3 and S4. M1K made epitope affinity increase; R17G, D32N, D32E, I34R, L35V, P66A, H85Y and L90V changed epitope number and affinity; E120D and R151T made new epitopes appear. The effection of amino-acid substitution on HPV-16 E6 epitopes were summarized in Table 7.
Table 7

Effect of Amino-acid substution on T-cell epitopes of HPV-16 E6

SubstutionHPV-16 E6
EpitopesAllesEffection
M1K1-9KHQKRTAMFHLA-A*24:02Better affinity
R17G13-22QERPGKLPQLHLA-B*40:01Better affinity
9-19FQDPQERPGKLHLA-B*13:01
9-19FQDPQERPGKLHLA-C*08:01Affinity decreased
8-17MFQDPQERPRHLA-A*33:03Disappear
17-26RKLPQLCTELHLA-C*01:02
9-19FQDPQERPRKLHLA-C*03:04
7-17AMFQDPQERPRHLA-A*33:03
D32E26-34LQTTIHEIIHLA-B*13:01Better affinity
29-39TIHEIILECVYHLA-B*15:02Affinity decreased
29-37TIHEIILECHLA-A*02:01
29-38TIHEIILECVHLA-A*02:01
24-33TELQTTIHEIHLA-B*40:01New epitopes
25-33ELQTTIHEIHLA-B*13:01
32-39EIILECVYHLA-B*15:02
25-33ELQTTIHEIHLA-A*02:01
31-39HEIILECVYHLA-B*15:02
26-33LQTTIHEIHLA-B*13:01
29-39TIHDIILECVYHLA-B*46:01Disappear
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*02:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*02:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*05:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*02:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*05:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*05:01
D32N26-34LQTTIHNIIHLA-B*13:01Better affinity
24-35TELQTTIHNIILHLA-B*40:01
24-33TELQTTIHNIHLA-B*40:01New epitopes
27-35QTTIHNIILHLA-C*03:04
29-39TIHDIILECVYHLA-B*15:02Disappear
29-37TIHDIILECHLA-A*02:01
D32N29-38TIHDIILECVHLA-A*02:01Disappear
29-39TIHDIILECVYHLA-B*46:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*02:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*02:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*05:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*02:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*05:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*05:01
I34R29-39TIHDIRLECVYHLA-B*15:02Better affinity
29-39TIHDIRLECVYHLA-B*46:01Affinity decreased
24-35TELQTTIHDIRLHLA-B*40:01
25-34ELQTTIHDIRHLA-A*33:03New epitopes
29-37TIHDIILECHLA-A*02:01Disappear
26-34LQTTIHDIIHLA-B*13:01
29-38TIHDIILECVHLA-A*02:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*02:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*02:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*05:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*02:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*05:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*05:01
L35V29-39TIHDIIVECVYHLA-B*15:02Better affinity
29-38TIHDIIVECVHLA-A*02:01
29-39TIHDIIVECVYHLA-B*46:01
29-37TIHDIIVECHLA-A*02:01
35-45VECVYCKQQLLHLA-B*40:01New epitopes
24-35TELQTTIHDIILHLA-B*40:01Disappear
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*02:01
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*02:01
25-39ELQTTIHDIILECVYHLA-DQA1*01:01/DQB1*05:01
26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*02:01
L35V26-40LQTTIHDIILECVYCHLA-DQA1*01:01/DQB1*05:01Disappear
24-38TELQTTIHDIILECVHLA-DQA1*01:01/DQB1*05:01
P66A59-67IVYRDGNAYHLA-B*15:02Better affinity
59-67IVYRDGNAYHLA-B*46:01
59-67IVYRDGNAYHLA-C*03:02
66-76AYAVCDKCLKFHLA-A*24:02
61-69YRDGNAYAVHLA-A*11:01Affinity decreased
59-67IVYRDGNAYHLA-C*08:01
56-67DLCIVYRDGNAYHLA-B*15:02New epitopes
57-67LCIVYRDGNAYHLA-B*46:01
60-67VYRDGNAYHLA-B*15:02
58-67CIVYRDGNAYHLA-B*15:02
57-67LCIVYRDGNAYHLA-B*15:02
60-69VYRDGNPYAVHLA-A*24:02Disappear
H85Y85-95YYCYSLYGTTLHLA-A*24:02Better affinity
77-86YSKISEYRYYHLA-B*46:01
82-96EYRYYCYSLYGTTLEHLA-DRB1*15:02
80-94ISEYRYYCYSLYGTTHLA-DRB1*15:02
79-93KISEYRYYCYSLYGTHLA-DRB1*15:02
81-95SEYRYYCYSLYGTTLHLA-DRB1*15:02
78-92SKISEYRYYCYSLYGHLA-DRB1*15:02
83-97YRYYCYSLYGTTLEQHLA-DRB1*15:02
77-91YSKISEYRYYCYSLYHLA-DRB1*15:02
82-96EYRYYCYSLYGTTLEHLA-DPA1*01:03/DPB1*04:01
83-97YRYYCYSLYGTTLEQHLA-DPA1*01:03/DPB1*04:01
77-85YSKISEYRYHLA-C*03:02New epitopes
77-85YSKISEYRYHLA-B*46:01
77-85YSKISEYRYHLA-B*15:02
77-85YSKISEYRYHLA-B*58:01
77-86YSKISEYRYYHLA-C*03:02
76-85FYSKISEYRYHLA-A*24:02
H85Y80-94ISEYRYYCYSLYGTTHLA-DPA1*01:03/DPB1*04:01New epitopes
84-98RYYCYSLYGTTLEQQHLA-DPA1*01:03/DPB1*04:01
79-93KISEYRYYCYSLYGTHLA-DPA1*01:03/DPB1*04:01
85-99YYCYSLYGTTLEQQYHLA-DPA1*01:03/DPB1*04:01
84-98RYYCYSLYGTTLEQQHLA-DRB1*15:02
85-99YYCYSLYGTTLEQQYHLA-DRB1*15:02
72-86KCLKFYSKISEYRYYHLA-DRB1*12:02
81-95SEYRYYCYSLYGTTLHLA-DPA1*01:03/DPB1*04:01
81-90SEYRYYCYSLHLA-B*40:01Affinity decreased
82-90EYRYYCYSLHLA-A*24:02
80-88ISEYRHYCYHLA-C*03:02Disappear
77-86YSKISEYRHYHLA-C*03:02
76-86FYSKISEYRHYHLA-A*24:02
L90V86-95YCYSVYGTTLHLA-C*03:04Better affinity
88-95YSVYGTTLHLA-C*01:02
88-95YSVYGTTLHLA-C*03:02
88-95YSVYGTTLHLA-C*03:04
85-95HYCYSVYGTTLHLA-A*24:02
89-99SVYGTTLEQQYHLA-B*46:01
89-101SVYGTTLEQQYNKHLA-A*11:01
86-95YCYSVYGTTLHLA-A*24:02
87-95CYSVYGTTLHLA-A*24:02
89-99SVYGTTLEQQYHLA-A*11:01
89-99SVYGTTLEQQYHLA-C*03:02New epitopes
89-97SVYGTTLEQHLA-A*11:01
89-99SVYGTTLEQQYHLA-B*15:02Affinity decreased
82-90EYRHYCYSLHLA-A*24:02Disappear
81-90SEYRHYCYSLHLA-B*40:01
82-96EYRHYCYSLYGTTLEHLA-DPA1*01:03/DPB1*04:01
83-97YRHYCYSLYGTTLEQHLA-DPA1*01:03/DPB1*04:01
E120D118-126CPDEKQRHLHLA-C*08:01New epitopes
E120D118-126CPDEKQRHLHLA-C*03:04New epitopes
R151T150-158STTRRETQLHLA-C*01:02New epitopes
144-154MSCCRSSTTRRHLA-A*33:03
150-158STTRRETQLHLA-C*03:04
Effect of Amino-acid substution on T-cell epitopes of HPV-16 E6 125 HLA-I and 43 HLA-II epitopes of HPV-31 E6 reference sequence was selected out, and epitope of variants were different (Additional file 1: Tables S5, S6). H60Y, K65R changed epitope number and affinity, T64A decreased epitope number, and K123R, A138V made new epitope appear. The effection of amino-acid substitution on HPV-31 E6 epitopes were summarized in Table 8.
Table 8

Effect of Amino-acid substution on T-cell epitopes of HPV-31 E6

SubstutionHPV-31 E6
EpitopesAllesEffection
H60Y52-62IVYRDDTPYGVHLA-A*02:01Affinity decreased
54-62YRDDTPYGVHLA-C*08:01
52-60IVYRDDTPYHLA-B*15:02New epitopes
51-60TIVYRDDTPYHLA-B*15:02
52-60IVYRDDTPYHLA-B*46:01
52-60IVYRDDTPYHLA-C*03:02
57-65DTPYGVCTRHLA-A*33:03
53-62VYRDDTPHGVHLA-A*24:02Disappear
55-65RDDTPHGVCTKHLA-A*11:01
T64A55-65RDDTPHGVCTKHLA-A*11:01Disappear
K65R65-76RCLRFYSKVSEFHLA-A*24:02Better affinity
57-65DTPHGVCTRHLA-A*33:03New epitopes
65-79RCLRFYSKVSEFRWYHLA-DPA1*01:03/DPB1*04:01
55-65RDDTPHGVCTKHLA-A*11:01Disappear
K123R117-125RHLDKKRRFHLA-A*24:02New epitopes
A138V132-140WTGRCIVCWHLA-B*58:01New epitopes
Effect of Amino-acid substution on T-cell epitopes of HPV-31 E6 109 HLA-I and 41 HLA-II epitopes of HPV-33 E6 reference sequence was selected out, epitope of variants were different (Additional file 1: Tables S7, S8). K35N decreased epitope number and affinity; S74T, N86H, K93N and R145I changed epitope number and affinity; Q113R increased epitope affinity. The effection of amino-acid substitution on HPV-33 E6 epitopes were summarized in Table 9.
Table 9

Effect of Amino-acid substution on T-cell epitopes of HPV-33 E6

SubstutionHPV-33 E6
EpitopesAllesEffection
K35N35-43KPLQRSEVYHLA-C*03:02Disappear
35-49NPLQRSEVYDFAFADHLA-DQA1*01:01/DQB1*02:01Affinity decreased
S74T68-76RFLSKITEYHLA-B*46:01Better affinity
69-76FLSKITEYHLA-B*46:01
69-77FLSKITEYRHLA-A*33:03
68-77RFLSKITEYRHLA-A*33:03
68-76RFLSKITEYHLA-B*15:02
68-76RFLSKITEYHLA-A*24:02
68-76RFLSKITEYHLA-C*03:02
69-76FLSKITEYHLA-C*03:02
62-76ICKLCLRFLSKISEYHLA-DRB1*14:01Disappear
69-79FLSKISEYRHYHLA-B*15:02
74-83SEYRHYNYSVHLA-B*40:01
73-81ITEYRHYNYHLA-C*03:02Affinity decreased
63-77CKLCLRFLSKITEYRHLA-DRB1*12:02
64-78KLCLRFLSKITEYRHHLA-DRB1*12:02
62-76ICKLCLRFLSKITEYHLA-DRB1*12:02
61-75GICKLCLRFLSKITEHLA-DRB1*12:02
65-79LCLRFLSKITEYRHYHLA-DRB1*12:02
60-74FGICKLCLRFLSKITHLA-DRB1*12:02
66-80CLRFLSKITEYRHYNHLA-DRB1*12:02
64-78KLCLRFLSKITEYRHHLA-DRB1*14:01
63-77CKLCLRFLSKITEYRHLA-DRB1*14:01
65-79LCLRFLSKITEYRHYHLA-DRB1*14:01
N86H81-88YSVYGHTLHLA-C*03:02Better affinity
81-88YSVYGHTLHLA-C*01:02
81-88YSVYGHTLHLA-C*03:04
81-88YSVYGHTLHLA-C*08:01
80-88NYSVYGHTLHLA-A*24:02
78-88HYNYSVYGHTLHLA-A*24:02
79-88YNYSVYGHTLHLA-A*24:02
N86H82-92SVYGHTLEQTVHLA-A*02:01Better affinity
83-92VYGHTLEQTVHLA-A*24:02Affinity decreased
84-92YGHTLEQTVHLA-C*08:01
77-88RHYNYSVYGHTLHLA-A*24:02New epitopes
82-90SVYGHTLEQHLA-A*11:01
81-88YSVYGHTLHLA-B*46:01
80-88NYSVYGHTLHLA-C*01:02
N86H/K93N86-94HTLEQTVNKHLA-A*11:01Better affinity
85-94GHTLEQTVNKHLA-A*11:01
86-94HTLEQTVNKHLA-A*33:03
83-94VYGHTLEQTVNKHLA-A*11:01Affinity decreased
86-96HTLEQTVNKPLHLA-B*40:01New epitopes
82-93SVYGNTLEQTVKHLA-A*11:01Disappear
K93N88-96LEQTVNKPLHLA-B*40:01Better affinity
85-94GNTLEQTVNKHLA-A*11:01
86-94NTLEQTVNKHLA-A*11:01
86-94NTLEQTVNKHLA-A*33:03Affinity decreased
91-102TVNKPLNEILIRHLA-A*33:03
83-94VYGNTLEQTVNKHLA-A*11:01
82-94SVYGNTLEQTVNKHLA-A*11:01
91-99TVNKPLNEIHLA-C*08:01New epitopes
91-99TVNKPLNEIHLA-B*13:01
91-99TVNKPLNEIHLA-C*03:04
91-99TVNKPLNEIHLA-C*01:02
87-96TLEQTVNKPLHLA-B*40:01
82-93SVYGNTLEQTVKHLA-A*11:01Disappear
Q113R113-121REKKRHVDLHLA-B*40:01Better affinity
R145I141-149RSRRIETALHLA-C*03:04Better affinity
141-149RSRRRETALHLA-C*01:02Disappear
Effect of Amino-acid substution on T-cell epitopes of HPV-33 E6 95 HLA-I and 50 HLA-II epitopes of HPV-52 E6 reference sequence was selected out, epitope of variants were different (Additional file 1: Tables S9, S10). E21K, L46V, E89K, K93R and N127I changed epitopes number and affinity; 105 M increased epitope affinity; N122K decreased epitopes number and E138K decreased epitope affinity. The effection of amino-acid substitution on HPV-52 E6 epitopes were summarized in Table 10.
Table 10

Effect of Amino-acid substution on T-cell epitopes of HPV-52 E6

SubstutionHPV-52 E6
EpitopesAllesEffection
E21K17-27EVLEKSVHEIRHLA-A*33:03Better affinity
18-26VLEKSVHEIHLA-B*13:01Affinity decreased
18-26VLEKSVHEIHLA-A*02:01
19-28LEKSVHEIRLHLA-B*40:01
18-26VLEKSVHEIHLA-C*08:01
18-26VLEKSVHEIHLA-C*01:02
19-26LEKSVHEIHLA-B*40:01
10-21RTLHELCEVLEKHLA-A*11:01New epitopes
20-28EESVHEIRLHLA-B*40:01Disappear
82-93SLYGKTLEERVKHLA-A*11:01
18-28VLEESVHEIRLHLA-A*02:01
9-23PRTLHELCEVLEESVHLA-DQA1*01:01/DQB1*02:01
10-24RTLHELCEVLEESVHHLA-DQA1*01:01/DQB1*02:01
L46V46-55VFTDLRIVYRHLA-A*33:03Better affinity
42-51VYKFVFTDLRHLA-A*33:03
45-55FVFTDLRIVYRHLA-A*33:03
41-51EVYKFVFTDLRHLA-A*33:03
45-54FVFTDLRIVYHLA-B*46:01
40-47REVYKFVFHLA-B*40:01
46-54VFTDLRIVYHLA-B*46:01
42-50VYKFVFTDLHLA-A*24:02
45-54FVFTDLRIVYHLA-C*03:02
46-54VFTDLRIVYHLA-C*03:02
45-54FVFTDLRIVYHLA-B*15:02Affinity decreased
44-52KFVFTDLRIHLA-A*24:02
45-53FVFTDLRIVHLA-A*02:01
41-55EVYKFVFTDLRIVYRHLA-DPA1*01:03/DPB1*04:01
39-53RREVYKFVFTDLRIVHLA-DPA1*01:03/DPB1*04:01
43-57YKFVFTDLRIVYRDNHLA-DPA1*01:03/DPB1*04:01
38-52QRREVYKFVFTDLRIHLA-DPA1*01:03/DPB1*04:01
37-51LQRREVYKFVFTDLRHLA-DPA1*01:03/DPB1*04:01
L46V44-58KFVFTDLRIVYRDNNHLA-DPA1*01:03/DPB1*04:01Affinity decreased
36-50ELQRREVYKFVFTDLHLA-DPA1*01:03/DPB1*04:01
41-55EVYKFVFTDLRIVYRHLA-DRB1*14:01
42-56VYKFVFTDLRIVYRDHLA-DRB1*14:01
43-57YKFVFTDLRIVYRDNHLA-DRB1*14:01
40-54REVYKFVFTDLRIVYHLA-DRB1*14:01
41-55EVYKFVFTDLRIVYRHLA-DRB1*12:02
46-54VFTDLRIVYHLA-A*24:02New epitopes
40-50REVYKFVFTDLHLA-B*40:01
45-53FVFTDLRIVHLA-B*46:01
45-53FVFTDLRIVHLA-C*03:02
45-53FVFTDLRIVHLA-C*03:04
40-54REVYKFLFTDLRIVYHLA-DQA1*01:01/DQB1*05:01Disappear
39-53RREVYKFLFTDLRIVHLA-DQA1*01:01/DQB1*05:01
41-55EVYKFLFTDLRIVYRHLA-DQA1*01:01/DQB1*05:01
42-56VYKFLFTDLRIVYRDHLA-DQA1*01:01/DQB1*05:01
42-56VYKFLFTDLRIVYRDHLA-DRB1*12:02
40-54REVYKFLFTDLRIVYHLA-DQA1*01:01/DQB1*02:01
41-55EVYKFLFTDLRIVYRHLA-DQA1*01:01/DQB1*02:01
40-54REVYKFLFTDLRIVYHLA-DRB1*12:02
44-58KFLFTDLRIVYRDNNHLA-DRB1*14:01
43-57YKFLFTDLRIVYRDNHLA-DRB1*12:02
E89K82-92SLYGKTLKERVHLA-A*02:01Affinity decreased
86-94KTLKERVKKHLA-A*11:01
85-94GKTLKERVKKHLA-A*11:01
82-93SLYGKTLKERVKHLA-A*11:01
41-55EVYKFVFTDLRIVYRHLA-DPA1*01:03/DPB1*04:01
39-53RREVYKFVFTDLRIVHLA-DPA1*01:03/DPB1*04:01
43-57YKFVFTDLRIVYRDNHLA-DPA1*01:03/DPB1*04:01
38-52QRREVYKFVFTDLRIHLA-DPA1*01:03/DPB1*04:01
37-51LQRREVYKFVFTDLRHLA-DPA1*01:03/DPB1*04:01
E89K44-58KFVFTDLRIVYRDNNHLA-DPA1*01:03/DPB1*04:01Affinity decreased
36-50ELQRREVYKFVFTDLHLA-DPA1*01:03/DPB1*04:01
42-56VYKFVFTDLRIVYRDHLA-DRB1*14:01
41-55EVYKFVFTDLRIVYRHLA-DRB1*14:01
43-57YKFVFTDLRIVYRDNHLA-DRB1*14:01
40-54REVYKFVFTDLRIVYHLA-DRB1*14:01
41-55EVYKFVFTDLRIVYRHLA-DRB1*12:02
82-91SLYGKTLKERHLA-A*11:01Better affinity
82-91SLYGKTLKERHLA-A*33:03
75-89EYRHYQYSLYGKTLKHLA-DPA1*01:03/DPB1*04:01
81-89YSLYGKTLKHLA-A*11:01New epitopes
79-89YQYSLYGKTLKHLA-A*11:01
80-89QYSLYGKTLKHLA-A*11:01
82-94SLYGKTLEERVKKHLA-A*11:01Disappear
84-94YGKTLEERVKKHLA-A*11:01
40-54REVYKFLFTDLRIVYHLA-DQA1*01:01/DQB1*05:01
39-53RREVYKFLFTDLRIVHLA-DQA1*01:01/DQB1*05:01
41-55EVYKFLFTDLRIVYRHLA-DQA1*01:01/DQB1*05:01
42-56VYKFLFTDLRIVYRDHLA-DQA1*01:01/DQB1*05:01
42-56VYKFLFTDLRIVYRDHLA-DRB1*12:02
40-54REVYKFLFTDLRIVYHLA-DQA1*01:01/DQB1*02:01
41-55EVYKFLFTDLRIVYRHLA-DQA1*01:01/DQB1*02:01
40-54REVYKFLFTDLRIVYHLA-DRB1*12:02
44-58KFLFTDLRIVYRDNNHLA-DRB1*14:01
43-57YKFLFTDLRIVYRDNHLA-DRB1*12:02
K93R85-94GKTLEERVRKHLA-A*11:01Affinity decreased
82-94SLYGKTLEERVRKHLA-A*11:01
84-94YGKTLEERVRKHLA-A*11:01
88-96LEERVRKPLHLA-B*40:01
82-93SLYGKTLEERVRHLA-A*33:03New epitopes
91-99RVKKPLSEIHLA-B*13:01Disappear
K93R82-93SLYGKTLEERVKHLA-A*11:01Disappear
I105M97-105SEITIRCIMHLA-B*40:01Better affinity
N122K120-128NANKRFHNIHLA-C*08:01Disappear
120-128NANKRFHNIHLA-C*03:04
N127I127-135IIMGRWTGRHLA-A*33:03Affinity decreased
124-132RFHIIMGRWHLA-A*24:02
126-135HIIMGRWTGRHLA-A*33:03Better affinity
125-135FHIIMGRWTGRHLA-A*33:03
120-128NANKRFHIIHLA-C*03:04
120-128NANKRFHIIHLA-C*08:01
120-131NANKRFHIIMGRHLA-A*33:03New epitopes
121-135ANKRFHIIMGRWTGRHLA-DRB1*12:02
123-137KRFHIIMGRWTGRCSHLA-DRB1*12:02
122-136NKRFHIIMGRWTGRCHLA-DRB1*12:02
120-134NANKRFHIIMGRWTGHLA-DRB1*12:02
119-133VNANKRFHIIMGRWTHLA-DRB1*12:02
124-138RFHIIMGRWTGRCSEHLA-DRB1*12:02
124-132RFHNIMGRWHLA-B*58:01Disappear
E138K132-140WTGRCSKCWHLA-B*58:01Affinity decreased
Effect of Amino-acid substution on T-cell epitopes of HPV-52 E6 113 HLA-I and 44 HLA-II epitopes of HPV-58 E6 reference sequence was selected out, epitope of variants were different (Additional file 1: Tables S11, S12). D86E, K93N changed epitopes number and affinity, and R145K changed HLA-I epitope. The effection of amino-acid substitution on HPV-58 E6 epitopes were summarized in Table 11.
Table 11

Effect of Amino-acid substution on T-cell epitopes of HPV-58 E6

SubstutionHPV-58 E6
EpitopesAllesEffection
D86E81-88YSLYGETLHLA-C*03:04better affinity
84-92YGETLEQTLHLA-C*08:01
81-88YSLYGETLHLA-C*08:01
80-88NYSLYGETLHLA-A*24:02
79-88YNYSLYGETLHLA-A*24:02
78-88HYNYSLYGETLHLA-A*24:02
83-92LYGETLEQTLHLA-A*24:02Affinity decreased
82-91SLYGETLEQTHLA-A*02:01
84-92YGETLEQTLHLA-C*03:04
81-88YSLYGETLHLA-C*01:02
82-92SLYGETLEQTLHLA-B*13:01
85-92GETLEQTLHLA-B*40:01New epitopes
81-88YSLYGETLHLA-C*03:02
84-92YGETLEQTLHLA-C*01:02
84-92YGDTLEQTLHLA-C*03:02Disappear
D86E/K93N85-94GETLEQTLNKHLA-A*11:01Better affinity
86-94ETLEQTLNKHLA-A*11:01
82-94SLYGETLEQTLNKHLA-A*11:01Affinity decreased
84-94YGETLEQTLNKHLA-A*11:01
85-96GETLEQTLNKCLHLA-B*40:01New epitopes
86-94ETLEQTLNKHLA-A*33:03
82-93SLYGDTLEQTLKHLA-A*11:01Disappear
K93N88-96LEQTLNKCLHLA-B*40:01Better affinity
85-94GDTLEQTLNKHLA-A*11:01
84-94YGDTLEQTLNKHLA-A*11:01Affinity decreased
82-94SLYGDTLEQTLNKHLA-A*11:01
91-99TLNKCLNEIHLA-A*02:01New epitopes
82-93SLYGDTLEQTLKHLA-A*11:01Disappear
R145K137-145AVCWRPRRKHLA-A*11:01New epitopes
137-145AVCWRPRRRHLA-A*33:03Disappear
Effect of Amino-acid substution on T-cell epitopes of HPV-58 E6

Discussion

Cervical cancer is the second major malignant tumor in women in childbearing age and seriously threatens women's health. HR-HPV persistent infection is closely related to the occurrence and development of cervical cancer and other malignant diseases. α-9 HPV is almost all carcinogenic and associated with 75% cervical cancers. Sichuan is a multi-ethnic mixed residence area with a high prevalence rate of α-9 HPV. From 2012 to 2017, α-9 HPV positive samples accounted for 53.68% of all HPV positive samples and 73.22% of high-risk HPV positive samples, showing an increasing trend. α-9 genus HPV is highly prevalent and pathogenic; its carcinogenicity is mainly realized through E6 oncoproteins guided by the E6 gene through triggering the immortalization of infected cells. As an early gene of the virus, HPV E6 has a high mutation risk. In Sichuan, 21, 13, 8, 21, 8 nucleotide mutations were detected in HPV-16, HPV-31, HPV-33, HPV-52, HPV-58 E6 respectively, and among them, 17, 6, 6, 13, 4 were non-synonymous mutation. Gene non-synonymous mutations change the amino acid composition, and structure of the protein, as well as the functions of protein, are mainly realized by its structures. HPV E6 consists of one N-terminal (residues 1–36), one C-terminal (residues 147–158) and two Zinc fingers (residues 37–73 and 110–146, CxxC-(29x)-CxxC) three domains. The two Zinc finger binding domains form a deep pocket, which can mediate the most important tumor suppressor protein p53 ubiquitination degradation by binding to the "LXXLL" sequence of E6AP protein [24, 25]. 145–149 were PDZ domain-containing combined region that was the target of E6 protein for cellular transformation and the carboxy-terminal half being principally involved in p53 binding [26]. K93N of HPV-33 E6, K93R of HPV-52 E6, and K93N of HPV-58 E6 are located at the outer edge of E6 protein and near the zinc granule [27]. The N86H, R145I of HPV-33 E6 and D86E, R145K of HPV-58 E6 occurred in the same positions; K93N of HPV-33 E6, K93R of HPV-52 E6 and K93N of HPV-58 E6 all located in the 93rd of the E6 protein; those amino acid substitutions located in protein active region, can cause the E6 terminal and the trend of the carboxyl end structure disorder. Those protein conformational changes may lead to the differences in their ability to bind to the host p53 protein and other potential proteins, thus affecting the pathogenicity of α-9 HPV [29]. Positive selection sites make the gene frequency of the corresponding amino acid increasingly stable and enhance the species' adaptability to the environment [28]. According to the calculation, the positive selection site of HPV-16 E6 was D32E (128/250); HPV-33 E6 were K35N (42/216), K93N (42/216), R145I (33/216); HPV-52 E6 was K93R (252/288); HPV-58 E6 were K93N (111/405), R145K (16/405); These positive selection sites all belong to its high frequency non-synonymous mutation, suggesting that these positive selection sites, which contribute to the adaptation of α-9 HPV E6, have been widely spread. HPV E6 protein plays a key role in cervical cancer development. During HPV infection, the immune system will treat E6 protein as an antigen presentation to eliminate HPV infection and reduce the occurrence risk of HPV-related diseases with the help of body immunity [30]. Some specific mutations in HPV E6 may lead to the differences in the infection ability and pathogenicity of the virus. Positive selection sites of HPV-16 E6 D32E, D32N located in protein outer edge and next to the zinc granules; 6 HLA-II epitopes disappeared due to D32E/D32N; In Japan, D32E has been confirmed to be associated with the development of cervical cancer [31]; T-cell antigen epitopes affinity reduced due to D32E, D32N, that may lead to the persistent infection of virus and promote the development of cervical cancer. Positive selection sites K35N and K93N of HPV-33 E6 are close to the zinc granules, while R145I located in the E6 PDZ binding domain; K35N and R145I made 35-43KPLQRSEVY for HLA-C*03:02 and 141-149RSRRRETAL for HLA-C*01:02 disappear respectively, K93N changed the epitope number and affinity; Above three positive selection sites of HPV-33 E6 located in E6 protein active region, affect the protein conformation, function and reduced the immunogenicity of the peptide containing the above sites to a certain amount. The positive selection site K93R of HPV-52 E6 changed the epitope number and decreased the affinity of excellent epitopes. K93N and R145I of HPV-58 E6 reduced the affinity of excellent HLA-I antigen epitopes. Those positive selection sites reduced the immunogenicity of E6 overall, which may make HPV-infected cells more difficult to be detected by the immune system, and enhance HPV adaptability to the environment. No positive selection site was selected out in HPV-31 E6, and the high-frequency non-synonymous mutation sites enhanced the affinity and number of E6 epitopes, which may relate to its extremely low prevalence. Studies have found that mutations affect the efficiency of HPV vaccine [32], the protein structure and antigen epitope bioinformatics prediction method were introduced to analyze the influence of HPV E6 mutation on protein conformational and immunogenicity. We discussed the relationship between protein structure, positive selection site, antigen epitope and pathogenicity of α-9 HPV E6 protein in Sichuan was discussed for the first time. Amino acid substitution in positive selection sites may affect the virus infection efficiency, immunogenicity, and pathogenicity by altering their T-cell epitopes affinity to improve the survival ability of α-9 HPV as well as an adaptation to evolution. These results help explore the relationship between HPV E6 polymorphism and HPV affection capacity and its action mechanism to improve the therapeutic vaccine of α-9 HPV in Sichuan regions of China.

Conclusion

α-9 HPV is extremely prevalent in Sichuan, China. The positive selection site K93N of HPV-33 E6, K93R of HPV-52 E6 and K93N of HPV-58 E6 all occurred in the 93rd amino acid of E6 protein. N86H, R145I (positive selection site) of HPV-33 E6 and D86E, R145K (positive selection site) of HPV-58 E6 occurred in the same location of E6. α-9 HPV E6 positive selection sites that adaptive to the environment D32E, K35N, K93N, R145I, K93R, R145K have been widely spread, they all located in the E6 protein active region and altered their protein structure, as well as overall reduce the immunogenicity of the E6 protein, so that HPV infected cells are more difficult to be detected by the immune system and enhance the adaptability of α-9 HPV to the environment. E6 mutations in positive selection sites may affect the virus infection efficiency, immunogenicity by altering their protein structure, epitopes affinity to improve the survival ability of HPV. Additional file 1: Table S1. Primers and PCR condition used for the molecular characterization of α-9 HPV E6. Table S2. Average frequency of HLA-I and HLA-II alleles (> 5%) across the Chinese population. Table S3. The HLA-I predicted epitopes of HPV-16 E6. Table S4. The HLA-II predicted epitopes of HPV-16 E6. Table S5. The HLA-I predicted epitopes of HPV-31 E6. Table S6. The HLA-II predicted epitopes of HPV-31 E6. Table S7. The HLA-I predicted epitopes of HPV-33 E6. Table S8. The HLA-II predicted epitopes of HPV-33 E6. Table S9. The HLA-I predicted epitopes of HPV-52 E6. Table S10. The HLA-II predicted epitopes of HPV-52 E6. Table S11. The HLA-I predicted epitopes of HPV-58 E6. Table S12. The HLA-II predicted epitopes of HPV-58 E6.
  31 in total

Review 1.  Recognition of trophoblast HLA class I molecules by decidual NK cell receptors--a review.

Authors:  A King; S E Hiby; L Gardner; S Joseph; J M Bowen; S Verma; T D Burrows; Y W Loke
Journal:  Placenta       Date:  2000 Mar-Apr       Impact factor: 3.481

2.  Paternal inheritance of a female moth's mating preference.

Authors:  Vikram K Iyengar; H Kern Reeve; Thomas Eisner
Journal:  Nature       Date:  2002-10-24       Impact factor: 49.962

3.  The HPV E7 oncoprotein inhibits tumor necrosis factor alpha-mediated apoptosis in normal human fibroblasts.

Authors:  D A Thompson; V Zacny; G S Belinsky; M Classon; D L Jones; R Schlegel; K Münger
Journal:  Oncogene       Date:  2001-06-21       Impact factor: 9.867

4.  A population-based prospective study of carcinogenic human papillomavirus variant lineages, viral persistence, and cervical neoplasia.

Authors:  Mark Schiffman; Ana Cecilia Rodriguez; Zigui Chen; Sholom Wacholder; Rolando Herrero; Allan Hildesheim; Rob Desalle; Brian Befano; Kai Yu; Mahboobeh Safaeian; Mark E Sherman; Jorge Morales; Diego Guillen; Mario Alfaro; Martha Hutchinson; Diane Solomon; Philip E Castle; Robert D Burk
Journal:  Cancer Res       Date:  2010-03-30       Impact factor: 12.701

5.  HLA alleles and risk of cervical intraepithelial neoplasia among southwestern American Indian women.

Authors:  Melissa A Schiff; Raymond J Apple; Patricia Lin; J Lee Nelson; Cosette M Wheeler; Thomas M Becker
Journal:  Hum Immunol       Date:  2005-10-25       Impact factor: 2.850

6.  Association between human papillomavirus 16 E6 variants and human leukocyte antigen class I polymorphism in cervical cancer of Swedish women.

Authors:  Ingeborg Zehbe; Joannis Mytilineos; Ingrid Wikström; Rudi Henriksen; Lutz Edler; Massimo Tommasino
Journal:  Hum Immunol       Date:  2003-05       Impact factor: 2.850

7.  Novel E6 and E7 oncogenes variants of human papillomavirus type 31 in Brazilian women with abnormal cervical cytology.

Authors:  Bárbara Simas Chagas; Marcus Vinicius de Aragão Batista; Sergio Crovella; Ana Pavla Almeida Diniz Gurgel; Jacinto da Costa Silva Neto; Ivi Gonçalves Soares Santos Serra; Carolina Maria Medeiros Amaral; Valdir Queiroz Balbino; Maria Tereza Cartaxo Muniz; Antonio Carlos Freitas
Journal:  Infect Genet Evol       Date:  2013-02-10       Impact factor: 3.342

8.  Human papillomavirus 33 worldwide genetic variation and associated risk of cervical cancer.

Authors:  Alyce A Chen; Daniëlle A M Heideman; Debby Boon; Zigui Chen; Robert D Burk; Hugo De Vuyst; Tarik Gheit; Peter J F Snijders; Massimo Tommasino; Silvia Franceschi; Gary M Clifford
Journal:  Virology       Date:  2013-11-12       Impact factor: 3.616

9.  Structure of the E6/E6AP/p53 complex required for HPV-mediated degradation of p53.

Authors:  Denise Martinez-Zapien; Francesc Xavier Ruiz; Juline Poirson; André Mitschler; Juan Ramirez; Anne Forster; Alexandra Cousido-Siah; Murielle Masson; Scott Vande Pol; Alberto Podjarny; Gilles Travé; Katia Zanier
Journal:  Nature       Date:  2016-01-20       Impact factor: 49.962

10.  Epitope specificity and longevity of a vaccine-induced human T cell response against HPV18.

Authors:  Kelly L Smith; Amanda Tristram; Kathleen M Gallagher; Alison N Fiander; Stephen Man
Journal:  Int Immunol       Date:  2004-12-27       Impact factor: 4.823

View more
  1 in total

1.  Genetic variability of E6 and E7 genes of human papillomavirus type 58 in Jingzhou, Hubei Province of central China.

Authors:  Zhiping Yang; Chunlin Zhang; Ping Luo; Mengxia Ye; Quan Gong; Bing Mei
Journal:  Virol J       Date:  2022-04-22       Impact factor: 4.099

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.