Literature DB >> 32060290

Mutations in the HPV16 genome induced by APOBEC3 are associated with viral clearance.

Bin Zhu1, Yanzi Xiao1, Meredith Yeager1,2, Gary Clifford3, Nicolas Wentzensen1, Michael Cullen1,2, Joseph F Boland1,2, Sara Bass1,2, Mia K Steinberg1,2, Tina Raine-Bennett4, DongHyuk Lee1, Robert D Burk5,6, Maisa Pinheiro1, Lei Song1,2, Michael Dean1, Chase W Nelson7, Laurie Burdett1,2, Kai Yu1, David Roberson1,2, Thomas Lorey8, Silvia Franceschi9, Philip E Castle6, Joan Walker10, Rosemary Zuna10, Mark Schiffman1, Lisa Mirabello11.   

Abstract

HPV16 causes half of cervical cancers worldwide; for unknown reasons, most infections resolve within two years. Here, we analyze the viral genomes of 5,328 HPV16-positive case-control samples to investigate mutational signatures and the role of human APOBEC3-induced mutations in viral clearance and cervical carcinogenesis. We identify four de novo mutational signatures, one of which matches the COSMIC APOBEC-associated signature 2. The viral genomes of the precancer/cancer cases are less likely to contain within-host somatic HPV16 APOBEC3-induced mutations (Fisher's exact test, P = 6.2 x 10-14), and have a 30% lower nonsynonymous APOBEC3 mutation burden compared to controls. We replicate the low prevalence of HPV16 APOBEC3-induced mutations in 1,749 additional cases. APOBEC3 mutations also historically contribute to the evolution of HPV16 lineages. We demonstrate that cervical infections with a greater burden of somatic HPV16 APOBEC3-induced mutations are more likely to be benign or subsequently clear, suggesting they may reduce persistence, and thus progression, within the host.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32060290      PMCID: PMC7021686          DOI: 10.1038/s41467-020-14730-1

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


Introduction

High-risk human papillomaviruses (HR-HPVs) are small double-stranded DNA viruses that cause cervical cancer[1] and a large proportion of other anogenital and oropharyngeal cancers[2,3]. HPV16 is the most potent of the 12 HR-HPV types; it accounts for >50% of the >500,000 incident cervical cancer cases worldwide annually[4-6]. It is unknown why HPV16 is more carcinogenic than other HPV types, or why the majority of HPV16 infections “clear” (are either eliminated or controlled) while others persist and lead to cervical precancer and cancer[7-9]. Within HPV16, genetic variation partly predicts risk of precancer and cancer. For example, the sublineages (A1–A4, B1–B4, C1–C4, D1–D4) of HPV16, defined by genetic variation, have been associated with substantial differences in cervical carcinogenicity[10-22], and specific sublineages are linked to adenocarcinomas with an odds ratio (OR) of >100[21]. There is also much finer genetic variation among circulating HPV16 isolates[23,24], which varies by viral gene region and infection outcome, with more variation observed in women with low-grade or benign HPV16 infections than in those with cancers[24,25]. Most notably, the E7 oncogene lacks nonsynonymous (amino acid changing) variants in cervical cancers from around the world compared with controls, illustrating that rigid E7 conservation is necessary for carcinogenicity[24]. While cell-mediated immunity is thought to explain much of HPV clearance, innate immunity may also be important[26]. Specifically, the expression of human apolipoprotein B mRNA-editing, enzyme-catalytic, polypeptide-like 3 (APOBEC3) family of cytidine deaminases, APOBEC3A or APOBEC3B (hA3A/B), have been shown to be upregulated following HPV infection and act as a restriction factor, activated by the HPV16 E6 and E7 oncoproteins[27-32]. This elevated activity of hA3A/B enzymes can mediate mutations in both the host and viral genomes[32-34]. Large comprehensive human cancer genomic studies have characterized somatic mutations and discovered APOBEC3-mediated mutational signatures among multiple cancer types[35-41], particularly enriched in HPV-positive cervical[35] and HPV-positive head and neck cancers[42-44]. A recent study of oral squamous cell carcinomas (OSCCs) determined that the somatic APOBEC3 mutation burden was strongly linked to the total mutation burden in HPV-positive, but not HPV-negative OSCCs[44]. APOBEC3 mutagenesis is a source of oncogenic driver events[35,43-45] (e.g., PIK3CA E545K hotspot mutations in cervical cancers[35]) and contributes to clonal evolution and intratumor heterogeneity[46] (for more details, see refs. [32,47-49]). At the same time, APOBEC3 cytidine deaminases have been shown to induce mutations in HPV genomes and act as a restriction factor in the early stages of HPV infection[30,50,51]. A targeted sequencing study of 9 HPV16 precancers demonstrated that the HPV16 genome was hyperedited with G > A and C > T changes by human APOBEC3 cytidine deaminases[52]; subsequently, G > A and C > T hypermutations were verified in two small studies of the E2 gene and long control region (LCR) of the HPV16 genome[53,54]. A recent study showed that APOBEC3 cytidine deaminase was a driver of HPV mutations at the trinucleotide APOBEC3 target, TpCpN, across 151 HPV16/52/58 whole genomes[25], particularly in low-grade lesions. It appears that host–pathogen coevolution has selected for HPV16 genomes with fewer APOBEC3 attackable motifs[55]. Thus, the remaining motifs are probably necessary or important for the full infectious viral life cycle. There have been no large-scale genomic studies to comprehensively characterize APOBEC3 mutagenesis and other mutational signatures across the HPV16 genome in cervical precancer/cancer cases and controls. To investigate the viral genetic variation across the HPV16 genome potentially induced by human APOBEC3 cytidine deaminases and other mutational processes (signatures) and evaluate how these variants contribute to infection outcome (i.e., viral clearance or carcinogenesis), we analyze HPV16 whole-genome sequence data from 5328 case–control samples. Our primary analysis includes 3579 HPV16 genomes[21,24] from 1265 controls (women with a benign HPV16 infection defined as causing ≤ cervical intraepithelial neoplasia [CIN] grade 1 [CIN1], and/or “clearing”) and 1032 CIN2, 1170 CIN3 precancer, and 112 cancer cases in the prospective NCI-Kaiser Permanente Northern California (PaP) Cohort[56]. We replicate case findings in 1749 HPV16 genomes[24], including HPV16 from 444 CIN3 precancer and cancer cases (i.e., CIN3+) in a cross-sectional U.S. population[17,57-59] from the Study to Understand Cervical Cancer Early Endpoints and Determinants (SUCCEED), and 1305 invasive cervical cancers collected internationally by the International Agency for Research on Cancer (IARC)[60-63].

Results

Whole-genome sequencing of HPV16

Using a PCR based next-generation sequencing (NGS) assay[64], we performed HPV16 whole-genome sequencing of 5328 HPV16-positive cervical precancer/cancer cases and controls (Table 1). The mean number of sequencing reads aligned across the HPV16 genome per sample was 520,127 (standard error 76,629), with a mean of 3696 sequencing reads per gene region.
Table 1

Summary of HPV16-infected women from three studies by case status.

StudyStatusNo. of women
NCI-Kaiser PaPControl1265
CIN21032
CIN31170
Cancer112
SUCCEEDCIN3314
Cancer130
IARCCancer1305
Total5328

CIN2 cervical intraepithelial neoplasia (CIN) grade 2, CIN3 CIN grade 3.

Summary of HPV16-infected women from three studies by case status. CIN2 cervical intraepithelial neoplasia (CIN) grade 2, CIN3 CIN grade 3.

Mutational signature analysis identified four signatures

We first characterized the 96 trinucleotide mutation types taking into account the sequence context immediately 5′ and 3′ to each mutated base, and conducted a de novo mutational signature analysis of all variants across 3579 HPV16 genomes. We identified four mutational signatures present in the HPV16 genomes (Fig. 1). Signature A was characterized by C > T mutations enriched at the TCW motif, and it was highly similar to the known COSMIC (the Catalogue of Somatic Mutations in Cancer)[65] single-base substitution (SBS) signature 2 (cosine similarity = 0.963), which has been associated with the activity of the APOBEC cytidine deaminases. Signature B was represented by C > T mutations outside the TCW motif (excluding C > T mutations in signature A), with some similarity to COSMIC[65] SBS signature 32 (cosine similarity = 0.825), the etiology of this signature is less understood and suggested to be associated with damage to guanine and repair by transcription-coupled nucleotide excision repair. Signature C was characterized by T > C mutations, and highly similar to COSMIC[65] SBS signature 26 (cosine similarity = 0.966), which has been associated with defective mismatch repair. The fourth signature D illustrated a relatively flat signature across many different mutation types with little similarity to a known COSMIC[65] SBS signature.
Fig. 1

Four de novo mutational signatures identified using all variants across the HPV16 genome in women from the PaP cohort.

The x-axis indicates the 5′ and 3′ nucleotides for each of the top panel substitutions for the three base-pair motifs. The y-axis shows the single-base substitution (SBS) composition of each mutational signature by the 96 trinucleotide sequence motifs. For each identified signature, shown as A–D, the similarity was determined to the known COSMIC SBS signatures (https://cancer.sanger.ac.uk/cosmic/signatures/SBS/). The identified signature letter (A–D) and in parentheses the most similar COSMIC SBS signature number along with the cosine similarity are shown along the right y-axis. Cosine similarity ranges from 0 to 1, with a cosine of 1 indicating a perfect match.

Four de novo mutational signatures identified using all variants across the HPV16 genome in women from the PaP cohort.

The x-axis indicates the 5′ and 3′ nucleotides for each of the top panel substitutions for the three base-pair motifs. The y-axis shows the single-base substitution (SBS) composition of each mutational signature by the 96 trinucleotide sequence motifs. For each identified signature, shown as A–D, the similarity was determined to the known COSMIC SBS signatures (https://cancer.sanger.ac.uk/cosmic/signatures/SBS/). The identified signature letter (A–D) and in parentheses the most similar COSMIC SBS signature number along with the cosine similarity are shown along the right y-axis. Cosine similarity ranges from 0 to 1, with a cosine of 1 indicating a perfect match. We further evaluated the distribution of the mutations identified by each signature in our case–control samples and examined if they were linked to viral clearance or carcinogenicity with a primary focus on the HPV16 APOBEC3-induced mutations.

HPV16 APOBEC3 mutations are more prominent at a low variant allele fraction

As expected, cytosine-to-thymine (C > T) changes were the most frequent variants observed (48.7% of total mutations; Supplementary Fig. 1). We observed that the distribution of mutation types differed by variant allele fraction (VAF; Fig. 2a, b). VAF refers to the fraction or percent of total viral sequence reads per woman containing the mutation at a given genomic position (see “Methods” for more details). The VAF is expected to be ~1.0 if the variant was present in the virus at acquisition (i.e., constitutive variant in a haploid HPV genome), and in a lower fraction of the reads if the viral mutation occurred de novo during the infection period (i.e., a within-host somatic mutation). We evaluated constitutive HPV16 variants at a high VAF in each woman, defined based on the distribution of mutations as a variant occurring in >60% of the sequence reads at that locus, and within-host HPV16 somatic mutations at a low VAF, defined as occurring in 10–60% of the sequence reads (variants in <10% of the reads were excluded), separately.
Fig. 2

Frequency of the 96 trinucleotide mutation types for variants across the HPV16 genome in women from the PaP cohort.

Illustrated for (a) high variant allele fraction (VAF) and (b) low VAF variants. The x-axis indicates the 5′ and 3′ nucleotides for each of the top panel substitutions for the three base-pair motifs.

Frequency of the 96 trinucleotide mutation types for variants across the HPV16 genome in women from the PaP cohort.

Illustrated for (a) high variant allele fraction (VAF) and (b) low VAF variants. The x-axis indicates the 5′ and 3′ nucleotides for each of the top panel substitutions for the three base-pair motifs. The C > T substitutions were most prominent at low VAFs (Fig. 2a, b). The previously established APOBEC3 mutation types[39], characterized by a C > T substitution with thymine on its 5′ side and adenine or thymine on its 3′ side (TCW motif [W is A or T]), identified as mutational signature A, and also C > T substitutions outside the TCW motif (signature B), were specifically more prevalent at a low VAF compared with high VAF (APOBEC3 mutations: 10.2% of low VAF, and 4.8% of high VAF; proportion test, P-value < 0.001; C > T outside the TCW motif: 59.8% of low VAF, and 40.7% of high VAF; proportion test, P-value < 0.001). However, APOBEC3-associated[39] C > G substitutions at the TCW motif were rare across the HPV16 genome at both low and high VAFs (0.7% of low VAF, and 1.2% of high VAF). Supplementary Fig. 2 illustrates the distribution of APOBEC3-induced mutations across levels of VAF and shows that most mutations were in either very low or very high VAF levels. The within-host viral somatic mutations at low VAF were most pronounced in the controls, while the highest VAF constitutive variants, likely representing variants present in the HPV16 genome at acquisition, were equivalent among cases and controls. We also observed mutation types characterized by T > C (signature C) and T > G (signature D) mutations across the HPV16 genomes at high VAF (Fig. 2a). T > G mutations are expected to be rare DNA changes[66]; the reason for their higher frequency in HPV16 genomes is unknown.

Within-host HPV16 APOBEC3 mutations are more frequent in controls

To test if APOBEC3-induced mutations, and the other mutational signature substitutions, were associated with case–control status, we compared the HPV16 APOBEC3-induced mutations (Fig. 3a, b) in the precancer/cancer cases and controls stratified by low/high VAF (see “Methods” for more details). Within-host viral somatic APOBEC3-induced mutations were present in significantly fewer CIN3 + cases (11.9%) compared with controls (23.2%; OR 0.45, 95% CI 0.36–0.56, Fisher’s exact test, P-value = 6.2 × 10−14; Table 2). The results were similar for CIN2 + cases compared with controls (OR 0.48, 95% CI 0.40–0.57, Fisher’s exact test, P-value = 5.8 × 10−16; Supplementary Table 1). In a subset analysis, we compared only the incident cases (N = 333) that developed CIN3 + during the follow-up study period (i.e., after baseline or enrollment) to the controls (i.e., women that cleared their HPV16 or did not progress to CIN2+), and also showed that the incident CIN3+ cases had significantly fewer somatic APOBEC3-induced mutations than controls (OR 0.57, 95% CI 0.41–0.79, Fisher’s exact test, P-value = 0.0003). We further evaluated if somatic APOBEC3-induced mutations were associated with case/control status for each viral gene region. L1 and L2 gene regions had significantly more somatic APOBEC3-induced nonsynonymous mutations in the controls compared with CIN3+ cases (Wald test, P-value = 0.01 and 6.7 × 10−4, respectively; Supplementary Table 2). There was no apparent clustering of mutations in these gene regions by functional domain (Supplementary Fig. 3a, b).
Fig. 3

The number of APOBEC3-induced mutations across the HPV16 genome by gene region in women from the PaP cohort.

The plots show only the cases and controls with one or more APOBEC3-induced mutations at a (a) high variant allele fraction (VAF) and (b) low VAF. Each vertical line represents a sample with at least one APOBEC3-induced mutation, colored by the number of mutations observed, as 1–3 (see legend), per viral gene region. Samples with no APOBEC3-induced mutations are not illustrated; the size of the case and control panels correspond to the number of individuals with at least one APOBEC3-induced mutation. The samples are organized along the x-axis by status (case vs. control). Cases are cervical intraepithelial neoplasia grade 3 and cancer cases (CIN3+). The right y-axis represents viral gene regions with the overall frequency of APOBEC3 mutations summarized, taking into account the sample sizes of the cases/controls and potential APOBEC3-mutable sites, for CIN3+ cases in dark gray and controls in light gray. The top panel histogram summarizes the total APOBEC3-induced mutations for the cases and controls across the HPV16 genomes. URR upstream regulatory region, E6 early gene 6, E7 early gene 7, E1 early gene 1, E2 early gene 2, E4  early gene 4, E5  early gene 5, L2  late gene 2, L1  late gene 1.

Table 2

Cases and controls with and without APOBEC3-induced mutations by variant allele fraction in the NCI-Kaiser PaP cohort.

VAFStatusNo APOBEC3 mutationAPOBEC3 mutation%OR95% CIP-valuea
LowbCIN3+112915311.9%0.45(0.36–0.56)6.2 × 10−14
Control97129423.2%Ref.
HighcCIN3 + 99728522.2%1.14(0.94–1.39)0.17
Control101225320.0%Ref.

VAF variant allele fraction, CIN3+ cervical intraepithelial neoplasia grade 3 and cancer cases, OR odds ratio, CI confidence intervals, Ref. referent group.

aFisher’s exact test, two-sided, comparing the number of women with at least one APOBEC3-induced mutation and those without APOBEC3-induced mutations in the CIN3+ cases to controls.

bLow VAF is defined as VAF >10% and < = 60%.

cHigh VAF is defined as VAF >60%.

The number of APOBEC3-induced mutations across the HPV16 genome by gene region in women from the PaP cohort.

The plots show only the cases and controls with one or more APOBEC3-induced mutations at a (a) high variant allele fraction (VAF) and (b) low VAF. Each vertical line represents a sample with at least one APOBEC3-induced mutation, colored by the number of mutations observed, as 1–3 (see legend), per viral gene region. Samples with no APOBEC3-induced mutations are not illustrated; the size of the case and control panels correspond to the number of individuals with at least one APOBEC3-induced mutation. The samples are organized along the x-axis by status (case vs. control). Cases are cervical intraepithelial neoplasia grade 3 and cancer cases (CIN3+). The right y-axis represents viral gene regions with the overall frequency of APOBEC3 mutations summarized, taking into account the sample sizes of the cases/controls and potential APOBEC3-mutable sites, for CIN3+ cases in dark gray and controls in light gray. The top panel histogram summarizes the total APOBEC3-induced mutations for the cases and controls across the HPV16 genomes. URR upstream regulatory region, E6 early gene 6, E7 early gene 7, E1 early gene 1, E2 early gene 2, E4  early gene 4, E5  early gene 5, L2  late gene 2, L1  late gene 1. Cases and controls with and without APOBEC3-induced mutations by variant allele fraction in the NCI-Kaiser PaP cohort. VAF variant allele fraction, CIN3+ cervical intraepithelial neoplasia grade 3 and cancer cases, OR odds ratio, CI confidence intervals, Ref. referent group. aFisher’s exact test, two-sided, comparing the number of women with at least one APOBEC3-induced mutation and those without APOBEC3-induced mutations in the CIN3+ cases to controls. bLow VAF is defined as VAF >10% and < = 60%. cHigh VAF is defined as VAF >60%. In contrast, there was no significant difference between the high VAF, constitutive HPV16 APOBEC3-induced variants in the CIN3+ cases compared with controls (OR 1.14, 95% CI 0.94–1.39, Fisher’s exact test, P-value = 0.17; Table 2). By gene region, there were more constitutive APOBEC3 nonsynonymous variants in the cases compared with controls in E4 (Mutation burden ratio 7.99, 95% CI 1.85–34.4; Wald test, P-value = 0.01; Supplementary Table 2). We replicated the prevalence of APOBEC3-induced mutations throughout the HPV16 genome in precancer/cancer cases in two independent case study populations at both low and high VAFs (Supplementary Table 3). In particular, 12.4% and 12.5% of precancer/cancer cases had HPV16 somatic APOBEC3-induced mutations in SUCCEED and IARC, respectively, similar to the 11.9% observed in the PaP cohort cases and significantly fewer than the PaP controls (23.2%; OR 0.47, Fisher’s exact test, P-value = 5.1 × 10−7 and 1.1 × 10−12, respectively). The frequency of HPV16 APOBEC3-induced variants at high VAF among the SUCCEED and IARC CIN3 + cases was similar to the PaP CIN3 + cases, and slightly higher than the PaP controls (Supplementary Table 3). Within-host viral somatic C > T substitutions outside the TCW motif (signature B) and the T > C substitutions (signature C) were also present in significantly fewer CIN3 + cases (36.1% and 18.5%, respectively) compared with controls (43.2% and 32.9%, respectively; Fisher’s exact test P-value < 0.001) in the PaP cohort (Supplementary Table 4). However, the frequency of these specific substitutions did not replicate in the precancer/cancer cases in our two independent case study populations (SUCCEED and IARC), so we did not evaluate their relationship to case–control status further. The substitutions characterized by mutational signature D were not associated with case–control status at low VAF (Supplementary Table 4).

Characteristics of HPV16 mutations induced by APOBEC3 enzymes

We first estimated all possible DNA changes and resultant amino acid changes across the HPV16 reference genome. For the HPV16 A1 sublineage reference genome (see Methods), there were a total of 263 APOBEC3 targetable sites, and fewer targetable sites for the more carcinogenic HPV16 D2/D3 sublineage[10-22] genomes, 247/246 APOBEC3 targetable sites (Supplementary Table 5). Although HPV16 A1/A2 sublineages had only 14–17 more APOBEC3 targetable sites than the D2/D3 sublineages, the controls with A1/A2 sublineages had more APOBEC3 mutations than D2/D3 (31.2% vs. 15.8%; P = 0.04). Of the possible HPV16 APOBEC3 targetable sites (A1 reference genome), 96.2% of these APOBEC3 substitutions would result in a nonsynonymous change compared with 77.2% of non-APOBEC3 sites (proportion test, P-value < 0.001). Consequently, 95.2% of the APOBEC3-induced mutations we observed were nonsynonymous, which was significantly higher than the proportion observed for the non-APOBEC3 mutations, 71.2% (proportion test, P-value < 0.001; Supplementary Table 6). For the HPV16 infections with at least one APOBEC3-induced mutation, we compared the APOBEC3-induced mutation burden per virus in cases vs. controls and between nonsynonymous vs. synonymous mutations, adjusting for the number of possible HPV16 APOBEC3-mutable sites (see “Methods” for more details). For within-host viral somatic mutations, CIN3 + cases had a significantly lower APOBEC3-induced mutation burden compared with controls (mutation burden ratio 0.71, 95% CI 0.61–0.82; Wald test, P-value = 6.42 × 10−6) for all mutations and stratified by both nonsynonymous (mutation burden ratio 0.71; Wald test, P-value = 1.2 × 10−5) and synonymous mutations (ratio 0.68; Wald test, P-value = 0.28) (Table 3; Supplementary Table 7). There was no evidence of selection against nonsynonymous relative to synonymous mutations (ratio of nonsynonymous-to-synonymous mutation rate 1.0; Wald test, P-value = 0.99; Table 3). Here, the nonsynonymous and synonymous mutation rates were evaluated based on the number expected relative to the number of APOBEC3 targetable sites that could result in a nonsynonymous or synonymous mutation. Most of these somatic HPV16 APOBEC3-induced mutations at low VAF were rare in the population with a minor allele frequency (MAF) < 0.01, and rare somatic APOBEC3 mutations were significantly more frequent in the controls compared to the cases (proportion test, P-value = 1.5 × 10−4).
Table 3

Burden of APOBEC3-induced mutations in cases and controls from the NCI-Kaiser PaP cohort. Mutations are compared in CIN3+ cases vs. controls and for nonsynonymous (nonsyn) vs. synonymous (syn) mutations by variant allele fraction.

VAFParameterInterpretationMutation burdena95% CIP-valued
LowbrsynEnrichment of synonymous mutations in cases vs. controls0.68(0.34–1.36)0.28
rnonsynEnrichment of nonsynonymous mutations in cases vs. controls0.71(0.61–0.83)1.2 × 10−5
WSelection of nonsynonymous mutations vs. synonymous mutations in controls1.00(0.70–1.44)0.99
HighcrsynEnrichment of synonymous mutations in cases vs. controls1.27(0.73–2.21)0.40
rnonsynEnrichment of nonsynonymous mutations in cases vs. controls0.82(0.68–1.00)0.05
WSelection of nonsynonymous mutations vs. synonymous mutations in controls0.55(0.35–0.85)7.9 × 10−3

VAF variant allele fraction, CIN3+ cervical intraepithelial neoplasia grade 3 and cancer cases, CI confidence intervals.

aMutation burden ratio of APOBEC3-induced mutations was calculated using a Poisson regression model to compare the mutation burden or enrichment of APOBEC3-induced mutations per virus between cases and controls for nonsynonymous and synonymous mutations (r); selection of nonsynonymous mutations in the controls was estimated adjusting for the number of cases and controls and the potential APOBEC3-mutable bases that result in nonsynonymous and synonymous mutations (w).

bLow VAF is defined as VAF >10% and < = 60%.

cHigh VAF is defined as VAF >60%.

dP-values are generated by the Wald test of a Poisson regression model.

Burden of APOBEC3-induced mutations in cases and controls from the NCI-Kaiser PaP cohort. Mutations are compared in CIN3+ cases vs. controls and for nonsynonymous (nonsyn) vs. synonymous (syn) mutations by variant allele fraction. VAF variant allele fraction, CIN3+ cervical intraepithelial neoplasia grade 3 and cancer cases, CI confidence intervals. aMutation burden ratio of APOBEC3-induced mutations was calculated using a Poisson regression model to compare the mutation burden or enrichment of APOBEC3-induced mutations per virus between cases and controls for nonsynonymous and synonymous mutations (r); selection of nonsynonymous mutations in the controls was estimated adjusting for the number of cases and controls and the potential APOBEC3-mutable bases that result in nonsynonymous and synonymous mutations (w). bLow VAF is defined as VAF >10% and < = 60%. cHigh VAF is defined as VAF >60%. dP-values are generated by the Wald test of a Poisson regression model. In contrast, at high VAF, there was no significant difference in the mutation burden of APOBEC3-induced nonsynonymous and synonymous mutations in cases and controls, and there was evidence of negative selection against nonsynonymous mutations relative to synonymous mutations (mutation burden ratio 0.55; Wald test, P-value = 7.9 × 10−3; Table 3). Among the possible APOBEC3 targetable bases on each strand, the percent mutated on the positive strand (5.8/kb) was comparable with that estimated on the negative strand (6.2/kb, proportion test, P-value = 0.61; Supplementary Table 8). In addition, among the APOBEC3A (YTCA, Y is a pyrimidine base) and APOBEC3B (RTCA, R is a purine base) possible mutable bases[67], we compared the rate of having at least one APOEBC3A or APOBEC3B mutation and found no significant difference (0.35 vs. 0.34/kb, proportion test, P-value = 0.86; Supplementary Table 9). When further separating APOBEC3-induced mutations by high/low VAF, there was no significant difference between the APOBEC3 mutation rates on the positive or negative strand, or for APOBEC3A or APOBEC3B mutations (Supplementary Tables 8 and 9).

APOBEC mutations contributed to the evolution of HPV16 lineages

We determined that APOBEC3 editing may have contributed to the evolution of HPV16 lineages using 239 HPV16 non-A1/2 sublineage sequences. We specifically evaluated the HPV16 nucleotide positions that are known to “define” each of the HPV16 main lineages[68] (i.e., lineage/sublineage diagnostic SNPs; each SNP in the lineage haplotypes) compared with the derived ancestral sequence for each main lineage and sublineage at each node of the phylogenetic tree (Supplementary Fig. 4). We determined that there was a range of 6–41% of the lineage-defining SNPs for each of the different sublineages potentially induced by APOBEC3 (Supplementary Fig. 4). The D2/D3 sublineages, which are known to be the most carcinogenic of the HPV16 sublineages[21], had the greatest number of lineage-defining SNPs potentially induced by APOBEC3 (35 and 41%, respectively).

Discussion

We report the largest HPV16 whole-genome sequencing study to date evaluating viral genome mutational signatures and identify viral APOBEC3 mutations likely induced during a woman’s infection linked to benign or clearing infections. We[24] and others[23] have previously shown that there is high genetic diversity among HPV16 isolates circulating in the population; these studies were focused on viral SNPs presumed to be “inherited” variants present at HPV acquisition and detected in all or nearly all viral sequence reads at a given locus (high VAF). In contrast, here we use deep NGS to more finely evaluate “acquired” somatic mutations presumed to be induced recently during a woman’s infection and detected in a minority of sequence reads (low VAF, but with sufficient read depth >10×). We have discovered additional somatic viral genetic diversity that is likely driven by APOBEC3 activity and associated with benign infections or subsequent viral clearance in our large prospective cohort. Our data suggest that these APOBEC3-induced mutations may constrain the viability, and by extension the oncogenic potential, of HPV16. We further determined that mutations induced by APOBEC3 cytidine deaminases contributed to HPV16 genetic diversity that shaped viral evolution of the important HPV16 lineages. The combination of our large study population and deep NGS technology has revealed that HPV16 genomes, which have generally been considered stable during a persistent infection, can accumulate somatic mutations during infection, presenting in a fraction of the viruses within a host driven by APOBEC3 activity and other mutational processes. These somatic mutations, including APOBEC3-induced mutations, likely result in clearance or a reduced ability of the virus to persist. We observed four mutational signatures across HPV16 genomes, and the somatic substitutions related to the three main mutational signatures (A, B, and C) were all more prevalent in the controls compared with the precancer/cancer cases. However, the frequency of somatic substitutions related to signatures B and C did not replicate in two additional case populations; further follow-up is needed to determine the etiology and relevance of these substitutions. The APOBEC3-induced mutations (signature A) were enriched in the controls compared with cases in all three of our precancer/cancer case populations. This suggests that APOBEC3 is primarily inducing mutations during a woman’s infection when HPV16 is replicating, and ssDNA is exposed and targetable, during a productive infection[69] in benign or low-grade (HPV clearance and transmission likely additionally contributes to the high diversity of HPV16 in the population previously reported[23,24], and to viral evolution[25,55,70]. There was no disparity between the mutation burden of nonsynonymous (i.e., missense or nonsense) and synonymous somatic mutations in controls, suggesting that these mutations are likely recent somatic mutations arising during a woman’s infection which would not have had time to be selectively removed from the viral population (i.e., ineffective purifying selection). These mutations were also most often rare in our population (i.e., observed in <1% of women) or singletons (i.e., observed in only one woman among 3500+), further suggesting that most of them arose recently within the host. It is possible that a minority of the rare low VAF variants could be due to PCR error or artifacts, although given our stringent quality control including the requirement that all variants be present in >10 sequence reads, our low assay error rate, and the specific enrichment of APOBEC3 signature mutations in controls at a low VAF instead of random mutations across all samples (as would be expected from errors), this is likely minimal. We observed a significantly higher proportion of viral nonsynonymous APOBEC3-induced mutations compared with that for other non-APOBEC3 mutations. This is consistent with the TpC dinucleotide depletion at the third codon position observed in the viral open-reading frame[55,70], which would result in our observed viral APOBEC3-induced mutations primarily occurring at the first and second codon positions and thus causing the enrichment of nonsynonymous changes. Given that 95% of the APOBEC3-induced mutations were nonsynonymous and more frequent in the controls, we presume these mutations were deleterious to viral persistence and thus constrained the viability of HPV16. Alternatively, it is also possible that these APOBEC3-induced mutations are a biomarker of an innate immune response to the virus. Earlier targeted HPV16 sequencing studies[52-54] and an HPV whole-genome sequencing study of 151 HPV16, HPV52, or HPV58 samples[25] also identify APOBEC3-induced mutations in HPV genomes. Our within-host viral somatic mutations are consistent with the study by Hirose et al.[25], suggesting that the high levels of HPV genomic variation they observed, particularly in the low-grade CIN1 lesions, were likely the result of accumulating somatic mutations during infection. For a more direct comparison of our data with the HPV16 genome data from Hirose et al.[25], we downloaded their 45 HPV16 genomes (GenBank accession numbers: LC368952 to LC368996) and created a 96 trinucleotide mutation-type plot after exclusion of the common evolutionary-derived HPV16 lineage-defining substitutions[68,71], since inclusion of lineage-defining substitutions would skew the distribution for evaluations of recent or within-host mutations. The resulting distribution of mutation types looks similar to ours (Supplementary Fig. 5), including specific APOBEC3-associated variants and T > C and T > G substitutions. Although, mutational signature extraction was not possible for these 45 genomes[25] due to the overall small number of variants. Interestingly, we did not detect leading/lagging or transcribed DNA strand biases or APOBEC3-associated C > G changes at TCW motifs, which correspond to the APOBEC3 COSMIC[65] SBS signature 13, as observed in human somatic genomes related to APOBEC mutagenesis[36,39]. Our observations are consistent with previous HPV data[25,52] and suggest a difference between HPV and human genome APOBEC3 mutagenesis. It is possible that the antiviral APOBEC3 response to HPV infection is separate or slightly different from the role of APOBEC3 in host somatic genome mutagenesis. The distribution of APOBEC3-induced mutations across the viral genome was different between the controls and precancer/cancer cases, suggesting that sites in specific regions of the viral genome may be “hit” more often by APOBEC3, or mutations at specific sites may have more deleterious effects to the virus if absent from the cases, such as mutations in L1 and L2. Alternatively, since APOBEC3 mutations occur when the viral DNA is single stranded, the mutations observed in the L1 and L2 gene regions more frequently in the controls may reflect that these regions of the viral genome are more likely to be transcribed and single stranded in the controls with a productive infection. As noted, HPVs have evolved to limit the number of TpC dinucleotides in their genomes to avoid restriction[55,70], yet even with a limited number of APOBEC3 targetable sites, somatic APOBEC3-induced mutations were still observed and enriched in benign infections, suggesting an antiviral effect. We note that the majority of viruses did not have APOBEC3-induced mutations. We may be underestimating the level of APOBEC3-induced mutations due to our stringent quality control and low VAF cut point of 10%, instead of >0.5%[25], which would not detect the lowest VAF-induced mutations. It is also possible that some of the viruses with somatic APOBEC3-induced mutations were rapidly cleared, if they were less viable, and thus not part of our study. In contrast, the high VAF constitutive APOBEC3 variants, which included a lower burden of nonsynonymous variants, were likely present in the HPV16 population for a longer period of time and/or present at viral acquisition and reflect the natural variants circulating in the population. Purifying selection would have prevented a disproportionate number of nonsynonymous variants from reaching high frequencies in HPV16 populations, since they are more likely than synonymous variants to be disadvantageous (e.g., leading to a reduced ability to persist in the host), as we observed. These high VAF APOBEC3 variants were overall equivalent in cases and controls, and possibly neutral with respect to carcinogenesis. However, we did observe that there were specifically more high VAF E4 variants in the cases, suggesting that these genetic variants could be slightly advantageous to viral persistence. APOBEC3 upregulation occurs throughout disease progression to inhibit the HPV infection[30]. However, we still observed absent or reduced somatic APOBEC3-induced mutations in the precancer/cancer cases, which represent infections that have been successfully persisting for years, suggesting that these viruses may be evading host restriction by APOBEC3 and/or the more homogeneous viral genomes in the cases reflects outgrowth of a clone with a selective growth advantage. The somatic viral APOBEC3 mutations induced within the host that are not deleterious to the virus may instead aid in evasion of the host adaptive immune response by altering viral antigens, and this viral clone would then be selected for in that host. It is also possible that the precancer/cancer HPV16 viruses may be partially evading restriction through integration of HPV DNA into the host genome, where a portion of the viral genome and episomal HPV genomes are lost[69,72] and therefore there is less viral DNA present in these infections to be targeted by APOBEC3 enzymes. In addition, increased viral DNA methylation of the cases[73,74] may partially protect the viral genome from mutation by APOBEC3 enzymes. If APOBEC3 activity is upregulated and failed to clear HPV in the cases with advanced lesions, this may be contributing to the off-target host somatic mutations observed in cervical and other HPV-associated cancers. The difference in APOBEC3-induced mutations in the cases and controls may also reflect differences in activation and/or regulation of the innate immune response that leads to APOBEC3 expression and downstream viral mutations[32], partially related to the activity of IFN-α[75,76], inflammation and NF-κB signaling[77,78], TP53[31,79], or human genetic variation[44,80,81]. In summary (Fig. 4), we determined that APOBEC3 cytidine deaminases induce somatic mutations across the HPV16 genome, these mutations have contributed to the evolution of important viral lineages, and infections with somatic viral mutations induced during a woman’s infection were more likely to become benign infections or infections that subsequently cleared.
Fig. 4

Summary of the effects of mutations induced by the activity of APOBEC3.

Viral APOBEC3-induced mutations are illustrated in the circle plot by viral gene region in the inner ring for low variant allele fraction (VAF) somatic mutations in red, and the outer ring for high VAF constitutive variants in blue for all individuals in the PaP cohort (mutations in both cases and controls are illustrated). Modified from the Cancer Genome Atlas Research Network[35].

Summary of the effects of mutations induced by the activity of APOBEC3.

Viral APOBEC3-induced mutations are illustrated in the circle plot by viral gene region in the inner ring for low variant allele fraction (VAF) somatic mutations in red, and the outer ring for high VAF constitutive variants in blue for all individuals in the PaP cohort (mutations in both cases and controls are illustrated). Modified from the Cancer Genome Atlas Research Network[35].

Methods

Study populations

The cases and controls for the large discovery phase of our study were chosen from the Kaiser Permanente Northern California (KPNC)-NCI HPV Persistence and Progression (PaP) cohort[21]. This study population has been previously described[56]. The HPV Persistence and Progression (PaP) Cohort is a repository of residual cervical specimens stored in specimen transport medium (STM; Qiagen, Valencia, CA), from women who underwent cervical cancer screening from January 2007 to January 2011 at Kaiser Permanente Northern California (KPNC). Women could opt out of having their residual cervical specimens retained; only 8% of women with collected specimens opted out from having their specimen banked and tested. De-identified demographic and clinical information as well as all HPV and cytology test results and cervical histopathology were obtained on the cohort from electronic health records. This cross-sectional study included 3579 exfoliated cervical cell specimens collected at enrollment previously determined to contain HPV16 DNA[56], including 112 cancers, 1170 CIN3, 1032 CIN2, and 1265 controls (cancer cases were diagnosed at baseline (i.e., at enrollment; prevalent cases) or during the study follow-up period after the baseline specimens were collected (i.e., incident cases). The controls were defined as women having enrollment specimens with HPV16 DNA, and no histologic evidence of equivocal precancer or worse (CIN2+) during the follow-up study period according to the coded data obtained from electronic health records. Therefore, controls were the HPV16-positive women that either cleared their infections or had not progressed to CIN2+ during the follow-up study period. Women are followed as long as possible, and only censored if they received treatment for a CIN2+ lesion, or until the last documented follow-up cytology or histology. The study protocol was reviewed and approved yearly by Kaiser Permanente and the National Cancer Institute Institutional Review Boards. To confirm the findings from the PaP Cohort, we evaluated HPV16-positive women from the Study to Understand Cervical Cancer Early Endpoints and Determinants (SUCCEED). The details of the study design and specimen collection were previously described[57-59]. Briefly, a total of 2004 women were enrolled into SUCCEED between November 2003 and October 2009. We recruited women that were referred to colposcopy or treatment at the University of Oklahoma Dysplasia Clinic based at the University of Oklahoma Health Sciences Centre (OUHSC), with a recent abnormal Pap smear diagnosis or a biopsy diagnosis of CIN/cancer. Here, we included all CIN3+ exfoliated cervical cell specimens previously found to contain HPV16 DNA, including 444 women: 314 CIN3 and 130 cancer cases. Written informed consent was obtained from all women enrolled in the study, and Institutional Review Board approval was provided by OUHSC and the US National Cancer Institute. In total, 645 additional HPV16-positive cervical cell or tissue (frozen biopsy or formalin-fixed paraffin-embedded [FFPE]) specimens from cervical cancer cases were studied to assess the worldwide generalizability of our main finding, from the biobank at IARC. These samples were part of the IARC-coordinated cervical cancer case series, cervical cancer case–control studies and population-based HPV prevalence surveys from 39 countries worldwide[60-63]. Both local and IARC ethical committees approved all studies. We sequenced all HPV16-positive histologically confirmed cervical cancers with adequate DNA left in the IARC biobank.

HPV16 detection and DNA isolation

DNA was extracted from the banked STM specimens as previously described[82]. Typing methods varied for different subsets of the cohort, many of the enrollment PaP samples were tested by the Burk laboratory (Bronx, NY) using MY09/M11 L1 degenerate primer PCR (MY09/11 PCR) and type-specific dot-blot hybridization methods[82,83]. Other specimens were tested with either the Linear Array® HPV Genotyping System (Roche Molecular Diagnostics, Pleasanton, CA) or typed by BD using Onclarity (BD, Sparks, MD). Details of DNA isolation and HPV detection have been previously described[57,84]. Briefly, DNA was isolated from 1 mL aliquots of PreservCyt-fixed cells using the QIAamp DNA Blood Mini Kit (Qiagen) following a rinse in Hanks’ balanced salt solution (HBSS). The Linear Array® HPV Genotyping System (Roche Molecular Diagnostics) was used to detect HPV genotypes. Hybridization of PCR products to linear arrays and subsequent signal detection were performed using the Auto-LiPA automated staining system (Innogenetics N.V., Belgium). Hybridization to both β-globin probes was required to report genotyping results. A hybridization signal was called “positive” when an unambiguous, continuous band was observed on the array. DNA was extracted from frozen biopsy specimens, cervical cells, or FFPE at IARC, as previously described[85]. Samples were genotyped for 37 HPV types using a GP5+/6+-based PCR system[86] in one centralized laboratory (Department of Molecular Pathology, Vrije University, Amsterdam, The Netherlands).

Ion Torrent library preparation and sequencing

We used a custom Thermo Fisher Ion Torrent AmpliSeq HPV16 panel approach to amplify the entire 7906 bp HPV16 genome, as previously described[64]. In brief, the next-generation sequencing (NGS) assay used the Thermo Fisher Life Sciences’ Ion Torrent S5 and a custom HPV16 Ion Ampliseq panel of 47 multiplexed primer sets. Custom overlapping degenerate primers were designed to cover the entire viral genomes for all HPV16 variant lineages. After amplification, an Ion Torrent adapter-ligated library was generated following the manufacturer’s Ion AmpliSeq Library Preparation kit 2.0-96LV protocol with slight modifications (Life Technologies, Part #4480441). Raw sequencing reads were quality and adaptor trimmed using the Torrent Suite™ Software and aligned to the HPV16 reference sequence (7906 bp, NCBI accession number NC_001526) using the Torrent Mapping Alignment Program v5.0.13. SNP calls were made using the Torrent Variant Caller v.5.0.3, and variants were annotated with HPV gene/region using snpEff v.3.6c[87]. Pipeline settings and parameters can be found at https://github.com/cgrlab/cgrHPV16.

HPV16 variant lineage classification

HPV16 variant lineage assignment was based on the maximum likelihood (ML) tree topology constructed using RAxML MPI v7.2.8.27[88] that included 16 HPV16 European and non-European variant lineage reference sequences. We excluded samples with overall poor coverage, per individual nucleotide site per sample with low reads (<5), as previously described[24].

Statistical analyses

To identify the mutational processes generating the HPV variants, we carried out a de novo mutational signature analysis. There are a total of 12 possible single-nucleotide variants (SNV). The SNVs on the complementary DNA strands are considered the same, and we use pyrimidines (C and T) to annotate the SNV. Therefore, we have the following six basic types of SNVs: C > A, C > G, C > T, G > A, G > C, and G > T. We further considered the adjacent nucleotides in both 5′ and 3′ directions around the SNV as a three base-pair motif and obtained 96 (4 × 6 × 4) mutation types. We calculated the frequency of variants belonging to 96 mutation types for 3579 HPV16 genomes in a 96 × 3579 mutational catalog matrix. The mutational catalog matrix is regarded as a combination of mutational signatures induced by multiple mutational processes. To extract de novo mutational signatures, we applied the non-negative matrix factorization[89] and compared the similarity of resulting mutational signatures with the COSMIC mutational signatures v3 measured by the cosine similarity. We calculated the frequency of 96 mutation types, based on a three base-pair motif, across the HPV16 genome. Among these mutation types, APOBEC3-induced mutations are identified as C > T or C > G mutations specifically at the TCW motif (W is A or T), for which only T allowed in the 5′ end and G and C are excluded in the 3′ end[40]. This definition of APOBEC3-induced mutations has been well established and is more stringent than the motif defined (C > T mutations at motif YCN with Y a C or T and N being any nucleotide) by Vartanian et al.[52]. Both rare (minor allele frequency (MAF) < 0.01) and common (MAF > = 0.01) APOBEC3-induced mutations were examined, and variants occurring in <10% of the sequence reads and the lineage-diagnostic sites[68] were excluded. For each APOBEC3-induced mutation, we estimated the variant allele frequency (VAF) for HPV16 sequence reads per woman. Note, the difference between MAF and VAF: MAF quantifies the frequency of an APOBEC3-induced mutation across all samples in a given study or among women in a population (regardless of case and control status), while VAF measures the frequency of sequence reads containing the APOBEC3-induced mutation among all reads covering a specific APOBEC3 motif within a sample or per woman. Hence, the VAF reflects the percentage of HPV viruses, or sequence reads, with the specific APOBEC3-induced mutation in a sample. The VAF is expected to be ~1.0, if the APOBEC3-induced mutation was present in the virus at acquisition and being replicated subsequently; in contrast, the VAF would be much lower if the APOBEC3-induced mutation occurs de novo during the infection period, and the virus with this somatic mutation has not become dominant in the sample. We defined “high VAF” as a variant occurring in >60% of the sequence reads, which we have previously published performs well for calling the predominant HPV variant for a haploid HPV genome[24]. To evaluate lower level within-host somatic changes, we defined “low VAF” as variants occurring in 10% to < = 60% of sequence reads. To minimize false positive mutation calls, we used a VAF lower cut point of 10% and required at least ten sequence reads for each variant call. To count the number of APOBEC3 targetable sites, we first counted the TCW motifs across the HPV16 reference A1 genome, and for each main HPV16 sublineage. Since there are three possible changes at each nucleotide position (C > A, C > T and C > G), APOBEC3 targetable sites for the C > T and C > G changes only at TCW motif were counted as two-thirds of the number of TCW motifs. To examine the relative contribution of APOBEC3A- and APOBEC3B-induced mutations across the HPV genome, we compared the proportion of APOBEC3A mutations and APOBEC3B mutations separately by their specific motifs. APOBEC3A and APOBEC3B specific motifs have been reported as distinguishable in the yeast genome as YTCA (for APOBEC3A) and RTCA (for APOBEC3B) (where Y is a pyrimidine base and R is a purine base)[67]. We examined the association between the presence or absence of mutations with case and control status for all samples. Logistic regression was used to obtain the odds ratio (OR) and 95% confidence intervals (CI) for precancer/cancer risk for the specified exposure groups using the controls (i.e., women with HPV16 and women having an APOBEC3-induced mutation (per individual, coded as “yes” at least one APOBEC3 mutation or “no” APOBEC3 mutations) among HPV16 sublineages. For a subset of samples with at least one APOBEC3-induced mutation, we further compared the mutation burden of APOBEC3-induced mutations per virus between nonsynonymous and synonymous variants and between cases and controls, which adjusts the sample size of the cases and controls and the potential mutable bases of APOBEC3-induced nonsynonymous and synonymous mutations, respectively. Let Y represent the number of synonymous (denoted by j = 1) or nonsynonymous (j = 2) APOBEC3-induced mutations in cases (i = 2) and controls (i = 1) for each type of APOBEC3-induced mutation (k = 1, 2…, 8). The Y’s are modeled by a Poisson distribution with expected count E(Y): (1) with the number of potential mutable bases and t the mutation burden for synonymous mutations in controls; (2) with the mutation burden ratio of nonsynonymous and synonymous mutations in controls, where and and similar notations hold for E(Y11) and N11; (3) E(Y21) = N21 × rsyn with the enrichment of synonymous mutations in cases compared to controls, where and ; and (4) E(Y22) = N22 × t × w rnonsyn with mutation burden ratio or the enrichment of nonsynonymous mutations in cases compared to controls, where and and similar notations hold for E(Y12) and N12. w is essentially the dN/dS ratio measuring the selection of nonsynonymous mutations compared to synonymous mutations in controls; if w = 1, it suggests that the mutations are neutral, while w < 1 suggests that nonsynonymous mutations are under negative or purifying selection. A Poisson regression model was fit to obtain the maximum likelihood estimation (MLE) of t, w, rsyn, and rnonsyn. Statistical analyses were performed with R version 3.5.1; all statistical tests were two-sided. To determine if APOBEC3-induced mutations contributed to the evolution of HPV16 lineages, we conducted the following analyses: (1) ancestral HPV16 sequences and our current day HPV16 genome sequences were used to create a phylogenetic tree that represents HPV16 ancestral states (Supplementary Fig. 4). Ancestral HPV16 sequences included nine HPV16 lineage/sublineages sequences (All, A, A1, A4, B1, C1, D, D2, and D3) that were inferred using the Maximum Likelihood method[90] under the Tamura-Nei model[91]. The initial tree was inferred using a pre-computed tree file. The rates among sites were treated as being uniform among sites (Uniform rates option). The analysis included 63 nucleotide reference sequences from R.D.B. All positions containing gaps and missing data were eliminated. There were a total of 7697 HPV16 genome positions in the final data set. We also utilized 239 current day HPV16 sequences that represented A1, A4, B1, C1, D2, and D3 sublineages. All current day HPV16 sequences were controls from the PaP cohort. Evolutionary analyses were conducted in MEGA7[92]. (2) For non-A1 lineages, we selected common HPV16 variant positions that are known lineage-defining positions. For the A1 sublineage, which is the reference sublineage (i.e., there are no lineage-defining positions), we evaluated all common variants (MAF > 1%) occuring within the A1 sublineage viruses. (3) We aligned each ancestral sequence to an ancestral sequence from the previous node in the phylogenetic tree. For example, ancestral A1 was compared with the ancestral A sequence, and ancestral D2 was compared with the ancestral D sequence. Then we calculated the percentage of mutations that were potentially induced by APOBEC3 among lineage-defining positions of that particular lineage or sublineage. (4) When comparing current day HPV16 sequences to the ancestral sequences, we looped over all available current day sequences of that particular sublineage and calculated the average percentage of APOBEC3-induced mutation among lineage-defining positions.
  78 in total

1.  Human papillomavirus type 16 variants and risk of cervical cancer.

Authors:  A Hildesheim; M Schiffman; C Bromley; S Wacholder; R Herrero; A Rodriguez; M C Bratti; M E Sherman; U Scarpidis; Q Q Lin; M Terai; R L Bromley; K Buetow; R J Apple; R D Burk
Journal:  J Natl Cancer Inst       Date:  2001-02-21       Impact factor: 13.506

2.  Human papillomavirus is a necessary cause of invasive cervical cancer worldwide.

Authors:  J M Walboomers; M V Jacobs; M M Manos; F X Bosch; J A Kummer; K V Shah; P J Snijders; J Peto; C J Meijer; N Muñoz
Journal:  J Pathol       Date:  1999-09       Impact factor: 7.996

3.  [Effect of toothpaste on tooth abrasion].

Authors:  S Katada
Journal:  Shigaku       Date:  1972-02

4.  [Histologic examination of the appendix or not?].

Authors:  H Eck
Journal:  Zentralbl Chir       Date:  1965-11-13       Impact factor: 0.942

5.  HPV DNA, E6/E7 mRNA, and p16INK4a detection in head and neck cancers: a systematic review and meta-analysis.

Authors:  Cathy Ndiaye; Marisa Mena; Laia Alemany; Marc Arbyn; Xavier Castellsagué; Louise Laporte; F Xavier Bosch; Silvia de Sanjosé; Helen Trottier
Journal:  Lancet Oncol       Date:  2014-10-16       Impact factor: 41.316

6.  Long-term absolute risk of cervical intraepithelial neoplasia grade 3 or worse following human papillomavirus infection: role of persistence.

Authors:  Susanne K Kjær; Kirsten Frederiksen; Christian Munk; Thomas Iftner
Journal:  J Natl Cancer Inst       Date:  2010-09-14       Impact factor: 13.506

7.  Human papillomavirus genotype attribution in invasive cervical cancer: a retrospective cross-sectional worldwide study.

Authors:  Silvia de Sanjose; Wim Gv Quint; Laia Alemany; Daan T Geraets; Jo Ellen Klaustermeier; Belen Lloveras; Sara Tous; Ana Felix; Luis Eduardo Bravo; Hai-Rim Shin; Carlos S Vallejos; Patricia Alonso de Ruiz; Marcus Aurelho Lima; Nuria Guimera; Omar Clavero; Maria Alejo; Antonio Llombart-Bosch; Chou Cheng-Yang; Silvio Alejandro Tatti; Elena Kasamatsu; Ermina Iljazovic; Michael Odida; Rodrigo Prado; Muhieddine Seoud; Magdalena Grce; Alp Usubutun; Asha Jain; Gustavo Adolfo Hernandez Suarez; Luis Estuardo Lombardi; Aekunbiola Banjo; Clara Menéndez; Efrén Javier Domingo; Julio Velasco; Ashrafun Nessa; Saibua C Bunnag Chichareon; You Lin Qiao; Enrique Lerma; Suzanne M Garland; Toshiyuki Sasagawa; Annabelle Ferrera; Doudja Hammouda; Luciano Mariani; Adela Pelayo; Ivo Steiner; Esther Oliva; Chris Jlm Meijer; Waleed Fahad Al-Jassar; Eugenia Cruz; Thomas C Wright; Ana Puras; Cecilia Ladines Llave; Maria Tzardi; Theodoros Agorastos; Victoria Garcia-Barriola; Christine Clavel; Jaume Ordi; Miguel Andújar; Xavier Castellsagué; Gloria I Sánchez; Andrzej Marcin Nowakowski; Jacob Bornstein; Nubia Muñoz; F Xavier Bosch
Journal:  Lancet Oncol       Date:  2010-10-15       Impact factor: 41.316

8.  Human papillomavirus types in 115,789 HPV-positive women: a meta-analysis from cervical infection to cancer.

Authors:  Peng Guan; Rebecca Howell-Jones; Ni Li; Laia Bruni; Silvia de Sanjosé; Silvia Franceschi; Gary M Clifford
Journal:  Int J Cancer       Date:  2012-03-20       Impact factor: 7.396

9.  Natural history of cervical neoplasia and risk of invasive cancer in women with cervical intraepithelial neoplasia 3: a retrospective cohort study.

Authors:  Margaret R E McCredie; Katrina J Sharples; Charlotte Paul; Judith Baranyai; Gabriele Medley; Ronald W Jones; David C G Skegg
Journal:  Lancet Oncol       Date:  2008-04-11       Impact factor: 41.316

Review 10.  Carcinogenic human papillomavirus infection.

Authors:  Mark Schiffman; John Doorbar; Nicolas Wentzensen; Silvia de Sanjosé; Carole Fakhry; Bradley J Monk; Margaret A Stanley; Silvia Franceschi
Journal:  Nat Rev Dis Primers       Date:  2016-12-01       Impact factor: 52.329

View more
  21 in total

1.  Repurposing the Hybrid Capture 2 (HC2) screening test for whole-genome sequencing of human papillomaviruses.

Authors:  A Debernardi; W Jarassier; C Soret; C Mougin; S Alizon; I G Bravo; D Guenat; J L Prétet; Q Lepiller
Journal:  Arch Virol       Date:  2021-10-07       Impact factor: 2.574

Review 2.  Leveraging extrachromosomal DNA to fine-tune trials of targeted therapy for glioblastoma: opportunities and challenges.

Authors:  Imran Noorani; Paul S Mischel; Charles Swanton
Journal:  Nat Rev Clin Oncol       Date:  2022-09-21       Impact factor: 65.011

3.  The Improving Risk Informed HPV Screening (IRIS) Study: Design and Baseline Characteristics.

Authors:  Julia C Gage; Tina Raine-Bennett; Mark Schiffman; Megan A Clarke; Li C Cheung; Nancy E Poitras; Nicole E Varnado; Hormuzd A Katki; Philip E Castle; Brian Befano; Malini Chandra; Greg Rydzak; Thomas Lorey; Nicolas Wentzensen
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2021-11-17       Impact factor: 4.090

4.  Enhanced therapeutic efficacy of Listeria-based cancer vaccine with codon-optimized HPV16 E7.

Authors:  Feifei Duan; Jiaqi Chen; Hao Yao; Yuting Wang; Yanyan Jia; Zhiting Ling; Youwei Feng; Zhiming Pan; Yuelan Yin; Xin'An Jiao
Journal:  Hum Vaccin Immunother       Date:  2021-01-15       Impact factor: 3.452

Review 5.  APOBEC: A molecular driver in cervical cancer pathogenesis.

Authors:  Sundaramoorthy Revathidevi; Avaniyapuram Kannan Murugan; Hirofumi Nakaoka; Ituro Inoue; Arasambattu Kannan Munirajan
Journal:  Cancer Lett       Date:  2020-10-07       Impact factor: 8.679

6.  The interesting relationship between APOBEC3 deoxycytidine deaminases and cancer: a long road ahead.

Authors:  Milaid Granadillo Rodríguez; Ben Flath; Linda Chelico
Journal:  Open Biol       Date:  2020-12-09       Impact factor: 6.411

7.  High Levels of Within-Host Variations of Human Papillomavirus 16 E1/E2 Genes in Invasive Cervical Cancer.

Authors:  Yusuke Hirose; Mayuko Yamaguchi-Naka; Mamiko Onuki; Yuri Tenjimbayashi; Nobutaka Tasaka; Toyomi Satoh; Kohsei Tanaka; Takashi Iwata; Akihiko Sekizawa; Koji Matsumoto; Iwao Kukimoto
Journal:  Front Microbiol       Date:  2020-11-24       Impact factor: 5.640

8.  APOBEC Mutagenesis Is Concordant between Tumor and Viral Genomes in HPV-Positive Head and Neck Squamous Cell Carcinoma.

Authors:  Daniel L Faden; Krystle A Lang Kuhs; Maoxuan Lin; Adam Langenbucher; Maisa Pinheiro; Meredith Yeager; Michael Cullen; Joseph F Boland; Mia Steinberg; Sara Bass; James S Lewis; Michael S Lawrence; Robert L Ferris; Lisa Mirabello
Journal:  Viruses       Date:  2021-08-23       Impact factor: 5.048

Review 9.  Exploring the roles of HPV16 variants in head and neck squamous cell carcinoma: current challenges and opportunities.

Authors:  Daniela Cochicho; Rui Gil da Costa; Ana Felix
Journal:  Virol J       Date:  2021-11-08       Impact factor: 4.099

10.  Ancient Evolutionary History of Human Papillomavirus Type 16, 18 and 58 Variants Prevalent Exclusively in Japan.

Authors:  Kohsei Tanaka; Gota Kogure; Mamiko Onuki; Koji Matsumoto; Takashi Iwata; Daisuke Aoki; Iwao Kukimoto
Journal:  Viruses       Date:  2022-02-24       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.