| Literature DB >> 23977318 |
Zigui Chen1, Mark Schiffman, Rolando Herrero, Rob DeSalle, Kathryn Anastos, Michel Segondy, Vikrant V Sahasrabuddhe, Patti E Gravitt, Ann W Hsing, Robert D Burk.
Abstract
BACKGROUND: The species Alphapapillomavirus 7 (alpha-7) contains human papillomavirus genotypes that account for 15% of invasive cervical cancers and are disproportionately associated with adenocarcinoma of the cervix. Complete genome analyses enable identification and nomenclature of variant lineages and sublineages.Entities:
Mesh:
Year: 2013 PMID: 23977318 PMCID: PMC3745470 DOI: 10.1371/journal.pone.0072565
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of HPV isolates, genome sizes, variability and variant lineages.
| HPV type | Isolates | Genomes analyzed b | Genome size (nucleotide)c | Mean GC content (%) | Number of CpG site d | Variable nt positions e | Number of aa codons f | Variable aa positions g | Variant lineage / sublineage |
|---|---|---|---|---|---|---|---|---|---|
| HPV18 | 380 | 46 | 7857 (7824, 7857) | 40.2 | 230 (2.9%) | 422 (5.4%) | 2476 | 165 (6.7%) | A1-A5, B1-B3, C |
| HPV39 | 122 | 20 | 7912 (7833, 7885) | 40.2 | 183 (2.3%) | 186 (2.4%) | 2428 | 61 (2.5%) | A1-A2, B |
| HPV45 | 217 | 24 | 7858 (7841, 7858) | 39.7 | 181 (2.3%) | 239 (3.0%) | 2440 | 89 (3.6%) | A1-A3, B1-B2 |
| HPV59 | 45 | 8 | 7898 (7896, 7898) | 38.7 | 164 (2.1%) | 169 (2.1%) | 2441 | 66 (2.7%) | A1-A3, B |
| HPV68 | 88 | 21 | 7850 (7814, 7836) | 39.9 | 247 (3.1%) | 771 (9.8%) | 2419 | 232 (9.6%) | A1-A2, B, C1-C2, D1-D2, E, F1-F2 |
| HPV70 | 295 | 9 | 7922 (7905, 7922) | 40.3 | 172 (2.2%) | 192 (2.4%) | 2413 | 67 (2.8%) | A, B |
aNumber of isolates characterized by sequencing the URR ± E6 region;
bNumber of complete genomes analyzed (including sequenced for this report, the prototype and other complete genomes available in NCBI/GenBank), see Table S1 for complete list with accession numbers;
cNumber of nucleotides within the reference genome based on one genome size for each HPV type calculated from the global sequence alignments (see Materials and Methods). Minimum and maximum lengths of sequenced genomes for each type are shown and differ from the presence of insertions and deletions (indels);
dNumber of CpG sites is the cumulative number in each alignment;
eTotal number and percentage of variable nucleotide positions based on a reference genome for each HPV type as described above. Nucleotide variations include single nucleotide polymorphisms (SNPs) and indels, which are considered equivalent to one variation per indel, independent of indel size;
fNumber of amino acid codons (not including overlapping ORFs) based on the reference genome for each type as described above. Cumulative number of amino acid codons are taken from 7 ORFs (E6, E7, E1, E2, E5, L2 and L1), E4 is not counted separately nor are other overlapping ORFs;
gTotal number and percentage of variable amino acids based on the total number of amino acid positions derived from the reference genome for each HPV type.
Figure 1Alpha-7 phylogenetic tree showing representative types and variant lineages.
A maximum likelihood (ML) tree was constructed using RAxML MPI v 7.2.8.27 [42] and PhyML MPI v 1.4.3 [43] inferred from the global alignment of complete circular genome nucleotide sequences linearized at the first ATG of the E1 ORF. Numbers on or near branches indicate support indices < 100% by RAxML and PhyML, respectively. The shaded areas represent groupings of lineages and sublineages of HPV18, HPV39, HPV45, HPV59, HPV68 and HPV70; the prototype sequences of HPV85 and HPV97 are also included as indicated, although no variant lineages are distinguished due to the limited number of isolates of these types. The length of broken and solid lines represent distance between clades, although the number of changes is different for these two lines, as indicated in the upper left corner of the figure. HPV51, an alpha-5 type, was set as the outgroup.
Figure 2HPV18 tree topology and pairwise comparisons of individual complete genomes.
Phylogenetic trees were inferred from global alignment of complete genome nucleotide sequences (the other alpha-7 HPV reference prototypes were set as the outgroup). Numbers on or near branches indicate support indices in the following order: maximum likelihood (ML) bootstrap percentages using RAxML MPI v 7.2.8.27 [42] and PhyML MPI v 1.4.3 [43], and Bayesian credibility value percentage using MrBayes v 3.1.2 [44]. An asterisk (*) indicates 100% agreement between methods. “NA” indicates disagreement between a method and the reference RAxML tree at a given node. Thus, one tree is shown, but three different methods of tree construction were used to estimate the support of the provided tree, as explained above. Distinct variant lineages (i.e., termed A, B, and C) are classified according to the topology and nucleotide sequence differences from > 1% to < 10%; distinct sublineages (e.g., termed A1 and A2) were also inferred from the tree topology and nucleotide sequence differences in the > 0.5% to < 1% range. The percent nucleotide differences for each isolate compared to all other isolates (i.e., 1 x 1 comparisons) are shown in the panel to the right of the phylogeny. Values for each comparison of a given isolate are connected by lines and the comparison to self is indicated by the 0% difference point. Symbols and colored lines are used to distinguish each isolate.
Figure 3HPV39 variant tree topology and pairwise comparisons of individual complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Distinct variant lineages and sublineages were determined as described in Figure 2. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.
Figure 4HPV45 variant tree topology and pairwise comparisons of individual complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Distinct variant lineages and sublineages were determined as described in Figure 2. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.
Figure 5HPV59 variant tree topology and pairwise comparisons of individual complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Distinct variant lineages and sublineages were determined as described in Figure 2. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.
Figure 6HPV68 variant tree topology and pairwise comparisons of individual complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Distinct variant lineages and sublineages were determined as described in Figure 2. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.
Figure 7HPV70 variant tree topology and pairwise comparisons of individual complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Distinct variant lineages and sublineages were determined as described in Figure 2. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.
Figure 8Phylogenetic tree and pairwise comparisons of representative Alpha-7 lineages and sublineage complete genomes.
The phylogenetic tree was constructed as described in Figure 2. Representative genomes for each variant lineage and sublineage described in this report are used and the names of the isolates are shown to the right of the tree branches. The percent nucleotide sequence differences are shown in the panel to the right of the phylogenetic tree as described in Figure 2.