| Literature DB >> 34696378 |
Maisa Pinheiro1, Ariana Harari2, Mark Schiffman1, Gary M Clifford3, Zigui Chen4, Meredith Yeager1,5, Michael Cullen1,5, Joseph F Boland1,5, Tina Raine-Bennett6, Mia Steinberg1,5, Sara Bass1,5, Yanzi Xiao1, Vanessa Tenet3, Kai Yu1, Bin Zhu1, Laurie Burdett1,5, Sevilay Turan1,5, Thomas Lorey7, Philip E Castle1,8, Nicolas Wentzensen1, Robert D Burk2,9, Lisa Mirabello1.
Abstract
Human papillomavirus (HPV) type 31 (HPV31) is closely related to the most carcinogenic type, HPV16, but only accounts for 4% of cervical cancer cases worldwide. Viral genetic and epigenetic variations have been associated with carcinogenesis for other high-risk HPV types, but little is known about HPV31. We sequenced 2093 HPV31 viral whole genomes from two large studies, one from the U.S. and one international. In addition, we investigated CpG methylation in a subset of 175 samples. We evaluated the association of HPV31 lineages/sublineages, single nucleotide polymorphisms (SNPs) and viral methylation with cervical carcinogenesis. HPV31 A/B clade was >1.8-fold more associated with cervical intraepithelial neoplasia grade 3 and cancer (CIN3+) compared to the most common C lineage. Lineage/sublineage distribution varied by race/ethnicity and geographic region. A viral genome-wide association analysis identified SNPs within the A/B clade associated with CIN3+, including H23Y (C626T) (odds ratio = 1.60, confidence intervals = 1.17-2.19) located in the pRb CR2 binding-site within the E7 oncogene. Viral CpG methylation was higher in lineage B, compared to the other lineages, and was most elevated in CIN3+. In conclusion, these data support the increased oncogenicity of the A/B lineages and suggest variation of E7 as a contributing risk factor.Entities:
Keywords: HPV31; cervical carcinogenesis; viral genomic variation; viral methylation
Mesh:
Year: 2021 PMID: 34696378 PMCID: PMC8540939 DOI: 10.3390/v13101948
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1HPV31 phylogenetic tree of lineages and sublineages and the characteristics of each sublineage by case-control status and race/ethnicity are illustrated for the PaP Study and by world region for the IARC collection as indicated. Footnote. Controls = Cervical intraepithelial neoplasia (CIN) grade 1 or lower (≤CIN1); CIN2 = CIN grade 2; CIN3+ = CIN grade 3 and cancer; AfAm = African-American; LatinAm = Latin-American. The maximum likelihood (ML) tree was constructed using RAXML version 8.2.12, with bootstrap supports indicated on or near branches. Bar plots represent the percent of each corresponding sublineage by case-control status, race/ethnicity or world region as indicated with numbers of samples provided under the colored squares. p = Fisher’s exact test.
HPV31 lineage associations with CIN3+, and effect modification of race/ethnicity, in the PaP cohort.
| Lineages | Controls | CIN3+ | OR | 95% CI |
| |||
|---|---|---|---|---|---|---|---|---|
| N | % | N | % | |||||
|
| ||||||||
| C | 521 | 50.4% | 89 | 35.6% | ref | |||
| A | 326 | 31.5% | 103 | 41.2% |
|
|
| |
| B | 187 | 18.1% | 58 | 23.2% |
|
|
| |
| Total | 1034 | 100.0% | 250 | 100.0% | ||||
|
| ||||||||
| C3 | 322 | 31.1% | 55 | 22.0% | ref | |||
| A1 | 267 | 25.8% | 78 | 31.2% |
|
|
| |
| A2 | 59 | 5.7% | 25 | 10.0% |
|
|
| |
| B1 | 35 | 3.4% | 9 | 3.6% | 1.51 | 0.69 | 3.31 | |
| B2 | 152 | 14.7% | 49 | 19.6% |
|
|
| |
| C1 | 58 | 5.6% | 6 | 2.4% | 0.61 | 0.25 | 1.47 | |
| C2 | 37 | 3.6% | 4 | 1.6% | 0.63 | 0.22 | 1.85 | |
| C4 | 104 | 10.1% | 24 | 9.6% | 1.35 | 0.8 | 2.29 | |
| Total | 1034 | 100.0% | 250 | 100.0% | ||||
|
| ||||||||
|
| ||||||||
| White | 143 | 50.4% | 64 | 63.4% |
|
|
| |
| Hispanic | 81 | 28.5% | 19 | 18.8% | 0.58 | 0.33 | 1.02 | |
| African-American | 18 | 6.3% | 3 | 3.0% | 0.45 | 0.13 | 1.57 | |
| Asian | 42 | 14.8% | 15 | 14.9% | 1.00 | 0.53 | 1.90 |
|
| Total | 284 | 100.0% | 101 | 100.0% | ||||
|
| ||||||||
| White | 90 | 54.9% | 29 | 53.7% | 0.95 | 0.51 | 1.77 | |
| Hispanic | 42 | 25.6% | 15 | 27.8% | 1.12 | 0.56 | 2.23 | |
| African-American | 11 | 6.7% | 3 | 5.6% | 0.82 | 0.22 | 3.05 | |
| Asian | 21 | 12.8% | 7 | 13.0% | 1.01 | 0.41 | 2.54 | 0.976 |
| Total | 164 | 100.0% | 54 | 100.0% | ||||
|
| ||||||||
| White | 237 | 49.0% | 48 | 57.1% | 1.39 | 0.87 | 2.22 | |
| Hispanic | 132 | 27.3% | 16 | 19.0% | 0.63 | 0.35 | 1.12 | |
| African-American | 51 | 10.5% | 6 | 7.1% | 0.65 | 0.27 | 1.57 | |
| Asian | 64 | 13.2% | 14 | 16.7% | 1.31 | 0.7 | 2.47 | 0.113 |
| Total | 484 | 100.0% | 84 | 100.0% | ||||
Footnote: Controls = Cervical intraepithelial neoplasia (CIN) grade 1 or lower; CIN3+ = CIN grade 3 and cancer; OR = Odds ratio and 95% confidence intervals (CI) from logistic regression; For race/ethnicity, the reference group is all other races combined except the tested group. p = Wald-test for heterogeneity. Significant p-values are bolded.
Figure 2HPV31 viral genome wide association analysis and SNPs associated with CIN3+ in the PaP Cohort. Red circles indicate the 13 nonsynonymous SNPs significantly different between cases and controls after false discovery rate (FDR) correction for multiple comparisons. Dashed thicker line represents the FDR significance threshold. Dashed thinner line represents the logistic regression 0.05 significance threshold. y-axis represents p-values in logarithm scale. x-axis represents HPV31 genome positions and viral gene regions indicated by the colored key below the figure.
Rare variant burden analysis for all HPV31 lineages, and within the A lineage, in the PaP Cohort.
| Viral Gene/Region | No. Individuals with Variants (%) |
| ||||
|---|---|---|---|---|---|---|
| All HPV31 lineages ( | ||||||
| Controls ( | CIN3+ cases ( | |||||
| E1 | 223 | 21.6% | 45 | 18.0% | 0.214 | 0.535 |
| E2 | 148 | 14.3% | 34 | 13.6% | 0.772 | 0.875 |
| E4 | 65 | 6.3% | 8 | 3.2% | 0.064 | 0.320 |
| E5 | 58 | 5.6% | 13 | 5.2% | 0.799 | 0.875 |
| E6 | 61 | 5.9% | 12 | 4.8% | 0.501 | 0.835 |
| E7 | 21 | 2.0% | 3 | 1.2% | 0.389 | 0.778 |
| L1 | 236 | 22.8% | 42 | 16.8% |
| 0.320 |
| L2 | 375 | 36.3% | 92 | 36.8% | 0.875 | 0.875 |
| URR | 181 | 17.5% | 41 | 16.4% | 0.679 | 0.875 |
| WG | 724 | 70.0% | 164 | 65.6% | 0.175 | 0.535 |
|
| ||||||
|
|
| |||||
| E1 | 61 | 18.7% | 9 | 8.7% |
| 0.163 |
| E2 | 35 | 10.7% | 6 | 5.8% | 0.146 | 0.292 |
| E4 | 19 | 5.8% | 1 | 1.0% | 0.074 | 0.185 |
| E5 | 13 | 4.0% | 2 | 1.9% | 0.335 | 0.497 |
| E6 | 12 | 3.7% | 6 | 5.8% | 0.348 | 0.497 |
| E7 | 5 | 1.5% | 1 | 1.0% | 0.674 | 0.674 |
| L1 | 74 | 22.7% | 14 | 13.6% |
| 0.163 |
| L2 | 114 | 35.0% | 39 | 37.9% | 0.593 | 0.672 |
| URR | 53 | 16.3% | 19 | 18.4% | 0.605 | 0.672 |
| WG | 225 | 69.0% | 60 | 58.3% |
| 0.163 |
Footnote: Controls = Cervical intraepithelial neoplasia (CIN) grade 1 or lower; CIN3+ = CIN grade 3 and cancer; L1 = Late gene 1; L2 = Late gene 2; E1 = early gene 1; E2 = early gene 2; E4 = early gene 4; E5 = early gene 5; E6 = early gene 6; E7 = early gene 7; URR = untranslated regulatory region; WG = whole HPV31 genome. p = Fisher’s exact test. Significant p-values are bolded.
Figure 3Viral methylation levels across all CpG sites tested by HPV31 lineages C, A and B, shown for CIN3+ cases and controls. The y-axis represents the percent methylation levels. x-axis represents each individual lineage. Footnote: p-values from Mann–Whitney (Wilcoxon Rank Sum) test. ns = non-significant; *** = p < 0.001; ** = p < 0.01; * = p < 0.05.
Top CpG sites with high methylation associated with CIN3+ for each HPV31 lineage in the PaP Cohort.
| Gene | Lineage | CpG | Controls | CIN3+ | Difference |
| AUC | 95% CI | OR | 95% CI |
| ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Median |
| Median | ||||||||||||||
| E2 | A | 3414 | 22 | 1.89 | 32 | 6.19 | 4.30 ** | 6.6 × 10−7 | 4.4 × 10−5 | 0.90 | 0.82 | 0.98 | 22.62 | 5.16 | 99.19 | 3.5 × 10−5 | 0.001 |
| B | 3414 | 13 | 5.47 | 28 | 7.77 | 2.30 | 0.560 | 0.560 | 0.56 | 0.37 | 0.74 | 1.81 | 0.47 | 6.97 | 0.390 | 0.410 | |
| C | 3414 | 50 | 2.67 | 29 | 5.29 | 2.62 | 0.035 | 0.049 | 0.64 | 0.52 | 0.77 | 3.38 | 1.29 | 8.81 | 0.013 | 0.020 | |
| L2 | A | 5530 | 22 | 3.97 | 32 | 11.88 | 7.92 | 0.005 | 0.011 | 0.73 | 0.58 | 0.88 | 9.52 | 2.71 | 33.51 | 4.5 × 10−4 | 0.003 |
| B | 5530 | 9 | 4.33 | 27 | 19.41 | 15.08 ** | 0.002 | 0.007 | 0.84 | 0.66 | 1.00 | 25.00 | 3.39 | 184.50 | 0.002 | 0.004 | |
| C | 5530 | 45 | 4.32 | 23 | 11.56 | 7.24 | 0.004 | 0.009 | 0.72 | 0.59 | 0.85 | 5.14 | 1.69 | 15.63 | 0.004 | 0.007 | |
| L2 | A | 5521 | 22 | 2.66 | 32 | 5.40 | 2.74 | 0.004 | 0.009 | 0.73 | 0.58 | 0.88 | 8.00 | 2.33 | 27.46 | 9.5 × 10−4 | 0.004 |
| B | 5521 | 9 | 3.65 | 27 | 6.62 | 2.97 | 0.096 | 0.110 | 0.69 | 0.46 | 0.91 | 5.50 | 1.07 | 28.20 | 0.041 | 0.056 | |
| C | 5521 | 45 | 3.10 | 23 | 6.26 | 3.16 ** | 6.0 × 10−5 | 0.002 | 0.80 | 0.69 | 0.91 | 7.20 | 2.24 | 23.17 | 9.3 × 10−4 | 0.004 | |
Footnote: Controls = Cervical intraepithelial neoplasia (CIN) grade 1 or lower; CIN3+ = CIN grade 3 and cancer; † = HPV31 genome position; ** = Sites that best distinguished cases from controls for each lineage; p * = Mann–Whitney (Wilcoxon Rank Sum) p-value; p # = Univariate regression p-value; p-FDR = Adjusted false discovery rate p-value; AUC = Area under the curve; OR = Odds ratio and 95% confidence intervals (CI) for the association between high methylation, dichotomized at the 2nd tertile based on controls for each site, and CIN3+, with controls as the referent.