| Literature DB >> 22911739 |
Ivan Sabol1, Mihaela Matovina, Ali Si-Mohamed, Magdalena Grce.
Abstract
BACKGROUND: The variation of the most common Human papillomavirus (HPV) type found in cervical cancer, the HPV16, has been extensively investigated in almost all viral genes. The E1 gene variation, however, has been rarely studied. The main objective of the present investigation was to analyze the variability of the E6 and E1 genes, focusing on the recently identified E1-1374^63nt variant. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 22911739 PMCID: PMC3404080 DOI: 10.1371/journal.pone.0041045
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of the HPV16 variants within different grades of cervical lesions.
| Patient cytological diagnosis | |||||||
| E1 status | Variant group | No. | U | NC | ASCUS | LSIL (%) | HSIL (%) |
| E1-1374∧63nt variant | E-G350 | 46 | 3 | 0 | 9 | 16 (34.8%) | 18 (39.1%) |
| E-G350 other | 5 | 0 | 0 | 1 | 2 (40%) | 2 (40%) | |
| E-C109/G350 | 22 | 2 | 0 | 3 | 4 (18.2%) | 13 (59.1%) | |
| Subtotal E1 variant | 73 | 5 | 0 | 13 | 22 (30.1%) | 33 (45.2%) | |
| E1 reference | E-r | 220 | 18 | 1 | 36 | 45 (20.5%) | 120 (54.5%) |
| E-r other | 20 | 1 | 0 | 5 | 5 (25%) | 9 (45%) | |
| E-G350 | 319 | 29 | 2 | 56 | 67 (21%) | 165 (51.7%) | |
| E-G350 other | 39 | 2 | 1 | 6 | 5 (12.8%) | 25 (64.1%) | |
| E-C109/G350 | 9 | 1 | 0 | 3 | 0 (0%) | 5 (55.6%) | |
| E-G131/G350 | 6 | 1 | 0 | 0 | 1 (16.7%) | 4 (66.7%) | |
| E-G350+ E-r | 3 | 0 | 0 | 0 | 0 (0%) | 3 (100%) | |
| Subtotal E | 616 | 52 | 4 | 106 | 123 (20%) | 331 (53.7%) | |
| NA1-b/r | 8 | 1 | 0 | 1 | 3 (37.5%) | 3 (37.5%) | |
| NA1 other | 2 | 0 | 0 | 0 | 0 (0%) | 2 (100%) | |
| AA-a/r | 2 | 0 | 0 | 1 | 1 (50%) | 0 (0%) | |
| Af1-b/r | 3 | 0 | 0 | 2 | 0 (0%) | 1 (33.3%) | |
| Af2-a/C109/G403 | 1 | 1 | 0 | 0 | 0 (0%) | 0 (0%) | |
| Subtotal Non-E | 16 | 2 | 0 | 4 | 4 (25%) | 6 (37.5%) | |
| Subtotal E1 reference | 632 | 54 | 4 | 110 | 127 (20.1%) | 337 (53.3%) | |
| Total | 705 | 59 | 4 | 123 | 149 (21.1%) | 370 (52.5%) | |
U, unknown diagnosis; NC, normal cytology; ASCUS, atypical squamous cells of undetermined significance; LSIL, low grade squamous intraepithelial neoplasia; HSIL, high grade squamous intraepithelial neoplasia.
variant names according to the recently proposed nomenclature [16]; E-G350 other, all other variants containing 350G and other variations but found in less than 5 samples each; E-r other, all other variants containing 350T with other variations.
No., number of samples.
this was statistically significant in comparison to E1 reference E-r (p = 0.0227), E1 reference E-G350 (p = 0.0326), E1 reference subtotal E (p = 0.0128) and subtotal E1 reference samples (p = 0.0143).
this was statistically significant in comparison to E1 reference subtotal E (p = 0.0452) and borderline significant in comparison with subtotal E1 reference samples (p = 0.0503).
Positions of nucleotide and amino acid changes within the whole genome and E1 sequences of the major HPV16 variants containing the E1-1374∧63 duplication.
| Curated referent sequence position | Referent sequence | Nucleotide changes (number observed/number sequenced) | Description of the variation | |
| E1-1374∧63 E-G350 | E1-1374∧63 E-C109/G350 | |||
| 24 | C | C (2/2) |
| Silent |
| 109 | T | T (2/2) |
| E6 silent |
| 350 | T |
|
| E6 83 L > V |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3058 | G |
| G (2/2) | E2 silent |
| 3410 | C |
|
| E2 219 P>S and E4 silent |
| 3979 | A |
|
| E5 44 I > L |
| 4042 | A |
|
| E5 65 I > V |
| 4211 ins | - |
|
| Silent Insertion of GTTT |
| 4211 ins | - |
| - (2/2) | Silent insertion of GTT |
| 4228 | T |
|
| Silent |
| 4234 | A |
| A (2/2) | Silent |
|
|
|
|
|
|
| 4563 | G |
| G (2/2) | L2 silent |
| 4938 | G |
|
| L2 silent |
|
|
|
|
|
|
| 5226 | A |
|
| L2 330 L>F |
| 5518 | A |
| A (2/2) | L2 428 I>L |
| 6434 | A |
|
| L1 292 T>A |
|
|
|
|
|
|
| 7193 | G |
|
| Silent |
| 7320 | A | A (2/2) |
| Silent |
|
|
|
|
|
|
| 7450 | T |
|
| Silent |
| 7521 | G |
|
| Silent |
novel nucleotide changes are highlighted in bold and underlined.
changes from the reference are highlighted in bold.
variations differentiating E1-1374∧63nt duplication containing E-G350 and E-C109/G350 variants.
variation present in all samples.
Figure 1Phylogenic clustering of E1-1374∧63nt variant sequences with other currently available published HPV16 whole genome sequences.
The whole genome sequences of four E1-1374∧63nt variant samples (ZG01-118, ZG01-258, ZG03-145 and ZG05-249) were aligned with the available published whole genome sequences and the phylogenetic tree was created using MEGA5. The leftmost part shows the percent nucleotide sequence differences calculated using MEGA5 to determine variant lineages according to a proposed taxonomic classification [18], [19]. The top part of the figure was adapted from the transcription map of HPV16 available at http://pave.niaid.nih.gov/. Sequences at respective positions are shaded in different shades of blue according to the distance from the E1-1374∧63nt variant. Dots "." indicate no change from referent sequence while "∼" and "INS" indicate insertions. The European (or A) lineage appears to be subdivided into 3 sublineages A1–A3, based on the tree topology and sequence percent difference similarly as previously described [17]–[19]. The A1 lineage contains the reference HPV16 genome, the A2 lineage contains the E-G350 related variants, including the E1-1374∧63nt variant samples and the E-12 variant described in Bhattacharjee et al. [15]. The A3 sublineage contains only the E-G131 sample, which exhibits the most difference from other sequences. Variant "E-12" was previously shown to be more prevalent in cancer samples [15], while the variation 7450 T>C was shown to be statistically more prevalent in cancer samples [49]. Positions where samples ZG01-118 and ZG01-258 differ have been sequenced for 3 additional samples and in each case only one of 5 sequenced samples has this change. Additional whole length E1 sequencing has revealed that changes at positions 1656 and 1692 were only present in 2 of 12 samples. Thus, all those changes are unlikely to be the major cause of the association of the E1-1374∧63nt E-G350 variant with low grade cervical lesions. The major difference between E1-1374∧63nt E-G350 variant and other variants are the positions 1053 and 1374, while the only differences between the E-G350 and E-C109/G350 sequenced variants are at positions 24 and 109.
Figure 2Prediction of E1 protein structure.
Panel A shows 3D structure comparison between superposed monomer of the previously solved BPV1 E1 helicase domain hexamer structure (cyan; PDB ID:2GXA) [25], HPV18 E1 helicase domain structure (yellow; PDB ID:1TUE) [26], BPV1 E1 DNA binding domain structure (red; PDB ID:1F08) [27] and the predicted referent E1 model (white). The sequence at the position where the duplication occurs within the E1-1374∧63nt variants is highlighted in green, as is the amino acid 63 that is also changed in the E1-1374∧63nt variants. Panel B depicts superposed structures of the referent (white) and E1-1374∧63nt variant (magenta) E1 models. As before the referent sequence is highlighted in green and the changes specific to E1-1374∧63nt variant are highlighted in red. It can be seen that the newly added 21 amino acids (highlighted in red) partially overlap the potentially phosphorylated threonines (highlighted in green) at positions 153 and 155 and possibly influence the phosphorylation regulation of the E1 function. Panels C and D show the hexamer structure derived from the predicted E1 reference or E1-1374∧63nt variant proteins, respectively. The individual monomers are colored differently. The original referent sequence is colored in green within the white referent E1 monomer and the E1 variant magenta monomer. The sequence duplication in the E1-1374∧63nt variants is located at the junction between 2 adjacent E1 monomers within the structure and is highlighted in red on panel D. Positions of potentially phosphorylated threonines and amino acid substitution at position 63 are only highlighted on the white and magenta monomers.