| Literature DB >> 26985902 |
Enrico Lavezzo1, Giulia Masi2, Stefano Toppo3, Elisa Franchin4, Valentina Gazzola5, Alessandro Sinigaglia6, Serena Masiero7, Marta Trevisan8, Silvana Pagni9, Giorgio Palù10, Luisa Barzon11.
Abstract
Different human papillomavirus (HPV) types are characterized by differences in tissue tropism and ability to promote cell proliferation and transformation. In addition, clinical and experimental studies have shown that some genetic variants/lineages of high-risk HPV (HR-HPV) types are characterized by increased oncogenic activity and probability to induce cancer. In this study, we designed and validated a new method based on multiplex PCR-deep sequencing of the E6/E7 region of HR-HPV types to characterize HPV intra-type variants in clinical specimens. Validation experiments demonstrated that this method allowed reliable identification of the different lineages of oncogenic HPV types. Advantages of this method over other published methods were represented by its ability to detect variants of all HR-HPV types in a single reaction, to detect variants of HR-HPV types in clinical specimens with multiple infections, and, being based on sequencing of the full E6/E7 region, to detect amino acid changes in these oncogenes potentially associated with increased transforming activity.Entities:
Keywords: E6/E7; cervical cancer; deep sequencing; genotyping; high-risk HPV; human papillomavirus; next-generation sequencing; pyrosequencing; subtype; variant
Mesh:
Substances:
Year: 2016 PMID: 26985902 PMCID: PMC4810269 DOI: 10.3390/v8030079
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
High-risk HPV types included in the design of the primer set.
| HPV Species | HPV Type | IARC Group | No. of Sequences in Nucleotide Database Covering the HPV E6/E7 Region |
|---|---|---|---|
| α 5 | 51 | 1 | 6 |
| α 6 | 56 | 1 | 126 |
| α 7 | 18 | 1 | 130 |
| 39 | 1 | 25 | |
| 45 | 1 | 104 | |
| 59 | 1 | 13 | |
| 68 | 2A | 92 | |
| α 9 | 16 | 1 | 1897 |
| 31 | 1 | 276 | |
| 33 | 1 | 82 | |
| 35 | 1 | 98 | |
| 52 | 1 | 363 | |
| 58 | 1 | 795 |
Primers targeting the full E6/E7 region of high-risk HPV types.
| HPV Species | Primer Type | Primer Sequence * | HPV Type and Variant § | Amplicon Length (HPV Type) |
|---|---|---|---|---|
| α 5 | F | AACCGAAAAGGGTTATGACCGA | 51 (all known variants) | 815 |
| R | TCTGCTGTACAACGCGAAGG | |||
| α 6 | F | AGGCAGCTTATTCTGTGTGGA | 56 (all known variants) | 873 |
| R | CAGAGTGGGCACGTTACTGT | |||
| α 7 | F | AGGGAGTRACCRAAAACGGT | 18, 39 (all known variants) | 814 (HPV18) 807 (HPV39) |
| R | GGAATGCTCGAAGGTCGTCT | 18 (all known variants) | ||
| R | CCCGTGAGGCTTCTACTACC | 39 (all known variants) | ||
| F | TGCAACCAAAAACGGTGCAT | 45 (all known variants) | 849 | |
| R | TTAGTTGCACACCACGGACA | |||
| F | GCATGGCACGCTTTGAGG | 59 (all known variants) | 806 | |
| R | GTTTGCTGCACACAAAGGACA | |||
| F | GGTCACGACCGAAAACGG | 68 (A2, B, D1, D2, E, F1, F2) | 815 | |
| R | AGCAGYTSYAGCTTCCGCA | 68 (C1, C2, D1, D2, E, F1, F2) | ||
| F | GKGACCGRAARCGGTCAT | 68 (C1, C2) | 830 | |
| R | AACAGCTGYTSTAGTGTCCG | 68 (A2, B) | ||
| α 9 | F | AGGGYGTAACCGAAAACGGT | 52, 58 (all known variants) | 801 (HPV52) 828 (HPV58) |
| R | CCGGGGCACACAACTTGTAA | 52 (all known variants) | ||
| R | ACAGCTAGGGCACACAATGG | 58 (A1, A2, A3, B1, B2, D1) | ||
| R | GCTGTAGGGTTCGTSCTTCA | 58 (C, D2) | 785 | |
| F | AGGGCGTAACCGAAATCGGT | 16 (A1, A2, A3, A4, D1, D2, D3) | 830 | |
| R | TGAGAACAGATGGGGCACAC | 16 (A1, A2, A3, B1, B2, C, D1, D2, D3) | ||
| F | TTGMACCGAAACCGGTTAGT | 16 (B1, B2, C) | 807 | |
| R | RCAGATGGGGCACACAATTC | 16 (A4) | ||
| F | GGTGAACCGAAAACGGTTGG | 31 (all known variants) | 793 | |
| R | GGGGCACACGATTCCAAATG | |||
| F | AAGTAGGGTGTAACCGAAAGCG | 33 (all known variants) | 787 | |
| R | TGCTGTATGGTTCGTAGGTCAC | |||
| F | ACGGTTGCCATAAAAGCAGAA | 35 (all known variants) | 827 | |
| R | TCTCTGTGAACAGCCGGGG |
F: forward; R: reverse; * The 454-specific adapters were added to the 5′ end of both F and R primers, together with a 10-base multiplex identifier; § For each primer, the target genotype is reported; the variants that are covered by the corresponding primer are reported within round brackets.
Comparison of HPV typing results obtained by Sanger sequencing and LiPA.
| Sample ID | LiPA | Sanger Sequencing | |
|---|---|---|---|
| HPV Type | HPV Type | Scores (Fw, Rw) * | |
| 004 | HPV16 | HPV16 | 49, 38 |
| 088 | HPV16 | HPV16 | 48, 42 |
| 198 | HPV16 | HPV16 | 45, 42 |
| 281 | HPV16, | HPV16 | 46, 45 |
| 387 | HPV16 | HPV16 | 47, 47 |
| 644 | HPV16 | HPV16 | 47, 46 |
| 007 | HPV18 (HPV39) | HPV18 | 36, 15 |
| 201 | HPV31 | HPV31 | 30, 15 |
| 235 | HPV31 (HPV52, | HPV31 | 30, 36 |
| 995 | HPV31 (HPV52, | HPV31 | 50, 47 |
| 351 | HPV33, HPV11 (HPV52, | HPV33 | 31, 46 |
| 901 | HPV33 (HPV52, | HPV33 | 29, 45 |
| 805 | HPV35, | HPV35 | 46, 45 |
| 843 | HPV35, | HPV35 | 46, 48 |
| 787 | HPV39, | HPV39 | 46, 48 |
| 860 | HPV39 | HPV39 | 49, 49 |
| 225 | HPV45, | HPV45 | 45, 31 |
| 292 | HPV45 | HPV45 | 47, 43 |
| 913 | HPV45 | HPV45 | 46, 38 |
| 096 | HPV51 | HPV51 | 37, 43 |
| 772 | HPV51 | HPV51 | 39, 40 |
| 933 | HPV51 | HPV51 | 42, 42 |
| 014 | HPV52 | HPV52 | 37, 41 |
| 199 | HPV52, HPVX | HPV52 | 33, 48 |
| 082 | HPV56 | HPV56 | 15, 30 |
| 095 | HPV56 | HPV56 | 34, 23 |
| 099 | HPV58 | HPV58 | 23, 34 |
| 108 | HPV58 | HPV58 | 16, 26 |
| 203 | HPV58, | HPV58 | 43, 35 |
| 206 | HPV58 | HPV58 | 47, 35 |
| 890 | HPV58 (HPV52) | HPV58 | 28, 43 |
| 6517 | HPV58, | HPV58 | 26, 14 |
| 002 | HPV59 | HPV59 | 28, 26 |
| 041 | HPV59, | HPV59 | 49, 45 |
| 043 | HPV59, | HPV59 | 39, 39 |
| 541 | HPV68 (HPV39) | HPV68 | 49, 40 |
Note: The presence of HPV types reported within brackets is defined as possible according to LiPA. HPVX: Presence of HPV type/s not identifiable by LiPA. HPV types marked in italics are non-high-risk HPV types (not included in the design of the NGS primer set). * Electropherogram quality values calculated by Sequencing Analysis Software 5.3.1 (ThermoFisher): scores were considered high (≥20), medium (≥15 and <20) or low (<15).
Comparison of HPV typing results obtained by deep sequencing and LiPA.
| Sample ID | LiPA | 454 Deep Sequencing | |||
|---|---|---|---|---|---|
| HPV Type | HPV Type | Total No. Reads (% of the Total) | No. Forward Reads | No. Reverse Reads | |
| 8517 | HPV18, HPV31, (HPV39, HPV52, | HPV31 | 50,660 (93.6) | 25,876 | 24,784 |
| HPV18 | 3435 (6.3) | 1738 | 1697 | ||
| 507 | HPV18, HPV52, | HPV52 | 39,860 (72.5) | 19,435 | 20,425 |
| HPV18 | 14,913 (27.1) | 7629 | 7284 | ||
| 151 (0.3) | 81 | 70 | |||
| 665 | HPV16, HPV31, | HPV31 | 69,627 (99.4) | 36,230 | 33,397 |
| HPV16 | 408 (0.6) | 214 | 194 | ||
| 731 | HPV18 | 42,878 (99.9) | 23115 | 19,763 | |
| 631 | HPV18, HPV56, (HPV39, | HPV18 | 62,182 (97.4) | 33,380 | 28,802 |
| HPV56 | 1564 (2.4) | 845 | 719 | ||
| 385 | HPV18, HPV31, (HPV39, HPV52, | HPV31 | 45,947 (83.7) | 26,291 | 19,656 |
| HPV18 | 8915 (16.2) | 5328 | 3587 | ||
| HPV52 | 22 (0.04) | 11 | 11 | ||
| 877 | HPV18, HPV52, | HPV18 | 32,879 (68.6) | 14,681 | 18,198 |
| 13,895 (29) | 6048 | 7847 | |||
| HPV52 | 1012 (2.1) | 421 | 591 | ||
| 137 (0.3) | 82 | 55 | |||
| 260 | HPV16, HPV39, | HPV39 | 59,838 (72.9) | 29,785 | 30,053 |
|
| 15,042 (18.3) | 7274 | 7768 | ||
| HPV16 | 6145 (7.5) | 2921 | 3224 | ||
| 942 (1.1) | 423 | 519 | |||
Notes: The presence of HPV types reported within brackets is defined as possible according to LiPA. HPV types marked in italics are non-high-risk HPV types (not included in the design of the NGS primer set). HPV types marked in bold represent discordant results between LiPA and 454 deep sequencing. HPVX: Presence of HPV type/s not identifiable by LiPA.
Figure 1(A–C) Phylogenetic trees of HPV E6/E7 sequences obtained from clinical samples (red and blue labels for sequences obtained with 454 pyrosequencing and Sanger, respectively) and references (black labels) belonging to the α-9 (A); α-7 (B); α-5 and alpha-6 (C) species inferred by using the Maximum Likelihood method based on the Tamura 3-parameter model [51] (A) and the Hasegawa-Kishino-Yano model [52] (B,C). The percentage of bootstrap replications in which the associated samples clustered together is shown next to the branches (over 1000 iterations). A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories: +G, parameter = 0.6006 (A), 1.1395 (B), 1.6034 (C)). Tree branch lengths are proportional to the number of substitutions per site. There were a total of 741, 796, and 695 positions, respectively, in the final dataset of phylogenetic trees (A–C). Evolutionary analyses were conducted in MEGA6 [40] and displayed with FigTree [53]. Sample labels include sample ID; reference sequence labels include the GenBank accession number and the ID of variant lineages/sublineages (variant lineages are designed as A, B, etc.; variant sublineages are designed as A1, A2, etc.).
HPV type and variant classification through phylogenetic analysis and SNP score.
| HPV Type | Bootstrap Value | HPV Intra-Type Variant | Sample IDs | |
|---|---|---|---|---|
| Phylogenetic Analysis | SNP Score | |||
| 16 | 1.00 | A | A | 281, 260, 877, 665, 198, 644, 088 |
| 16 | 1.00 | C | C | 387 |
| 16 | 0.70 | D | A, B, C, D * | 004 |
| 18 | 0.99 | A | A | 507, 631, 665, 8517, 877, 731, 007 |
| 18 | 0.91 | B | B | 385 |
| 31 | 1.00 | B | B | 995, 385, 665, 235 |
| 31 | 1.00 | C | C | 201, 631, 260, 8517 |
| 33 | 1.00 | A | A | 901, 351 |
| 35 | 1.00 | A | -** | 805, 843 |
| 39 | 0.99 | A | A | 260, 787, 860 |
| 45 | 0.90 | A | A | 225 |
| 45 | 0.84 | B | B | 913, 292 |
| 51 | 1.00 | A | A | 096, 933, 772 |
| 52 | 0.75 | A | A | 199, 014, 507, 877, 385 |
| 56 | 0.91 | B | B | 082, 095, 631 |
| 58 | 0.78 | A | A | 206, 6517, 203, 108 |
| 58 | 0.96 | B | B | 890, 099 |
| 59 | 0.95 | B | B | 002, 043, 041 |
| 68 | 1.00 | A, B * | A, B * | 541 |
* Unable to discriminate among indicated lineages. ** The SNP score for HPV35 was not computed since only one variant is known.
Figure 2HPV variant assignment based on the SNP score. SNP score is reported in the x axis for the reference variant genomes (A) and for the samples sequenced in this study (B). HPV sequences detected in samples were assigned to the variant that obtained the highest score. The maximum score achievable for each genotype is reported in each panel and is labeled as “SNPs.”