| Literature DB >> 35945577 |
Charles D Warden1, Preetam Cholli2, Hanjun Qin1, Chao Guo1, Yafan Wang3, Chetan Kancharla4, Angelique M Russell5, Sylvana Salvatierra6, Lorraine Z Mutsvunguma7, Kerin K Higa8, Xiwei Wu1, Sharon Wilczynski9, Raju Pillai3,9, Javier Gordon Ogembo10.
Abstract
BACKGROUND: Human papillomavirus (HPV) is the primary cause of invasive cervical cancer (ICC). The prevalence of various HPV genotypes, ranging from oncogenically low- to high-risk, may be influenced by geographic and demographic factors, which could have critical implications for the screening and prevention of HPV infection and ICC incidence. However, many technical factors may influence the identification of high-risk genotypes associated with ICC in different populations.Entities:
Keywords: Genotyping; High-throughput sequencing; Human papillomavirus; Invasive cervical cancer
Year: 2022 PMID: 35945577 PMCID: PMC9361560 DOI: 10.1186/s13027-022-00456-w
Source DB: PubMed Journal: Infect Agent Cancer ISSN: 1750-9378 Impact factor: 3.698
Summary of sample and patient characteristics
| All samplesa | Archived DNA | Frozen tissue | FFPE tissue | ||
|---|---|---|---|---|---|
| Cervical samplesb | 128 (incl. 7 frozen-FFPE and 3 DNA-frozen tumor-tumor pairs) | 28 | 44 (incl. 4 tumor-normal pairs) | 56 (incl. 2 tumor-tumor pairs) | |
| Additional samples | 7 | 6 (prostate) | 0 | 1 (vaginal tissue) | |
| Samples with L1 amplicon sequencing | 135 | 34 | 44 | 57 | |
| Number of samples with QC array data | 70 | 26 | 44 | 0 | |
| Collection year (Mean ± SD) | 1999 ± 11 | Not reported | 1995 ± 11 | 2003 ± 9 | |
| Age (Mean ± SD) | 51.4 ± 14.9 | Not reported | 51.6 ± 15.5 | 50.83 ± 14.7 | |
Histological subtype | Adenocarcinoma (AC) | 25 | 3 | 9 | 13 |
| Adenosquamous Carcinoma (ASC) | 6 | 1 | 0 | 5 | |
| Squamous Cell Carcinoma (SSC) | 87 | 23 | 27 | 37 | |
| Other | 3 | 1 | 1 | 1 | |
| Not reported/ not applicable | 14 | 6 | 7 | 1 | |
| Reported racec | Asian | 11 | Not reported | 1 | 10 |
| Black | 4 | 2 | 2 | ||
| Other | 4 | 2 | 2 | ||
| White/caucasian | 70 | 27 | 43 | ||
| ADMIXTURE-predicted super-population assignmentd | African (AFR) | 4 | 0 | 4 | Not processed |
Admixed American (AMR) | 23 | 3 | 20 | ||
| East Asian (EAS) | 4 | 2 | 2 | ||
| European (EUR) | 27 | 16 | 11 | ||
| South Asian (SAS) | 0 | 0 | 0 | ||
| AMR/EUR | 7 | 1 | 6 | ||
| AFR/EUR | 1 | 1 | 0 | ||
| SAS/EUR | 1 | 0 | 1 | ||
aData shown for all samples are the sum of or summarize data only from samples for which each characteristic was reported (or estimated)
bIncludes ICC samples, one archived DNA sample from vulval cancer, and one FFPE sample that includes a mix of ICC and endometrial cancer, as well as adjacent normal tissue
cHispanic ethnicity was not reported for any samples
dADMIXTURE-predicted super-population assignments were provided for samples with QC Array call rates > 75%. ADMIXTURE super-populations with contributions of > 20% to a patient’s genome are reported above
Fig. 1QC Array super-population assignments. A Super-population clusters of select 1000 Genomes reference samples and ICC samples with QC Array call rates > 85%, projected onto the first three principal components (PC1–3). 1000 Genomes individuals from AFR populations and QC Array samples from patients expected to have African ancestry were most clearly distinguished by the first two principal components. 1000 Genomes individuals from current EUR and EAS populations were separated along the third and, to a lesser extent, second principal components. B Similar to (A) but with the third principal component plotted against the fourth (PC4), demonstrating that AMR individuals become more distinct along the fourth principal component. No QC Array samples were predicted to have SAS ancestry. These principal components were not the primary method for assigning ethnicities but provide an effective way to visualize variation among samples. C Frequencies of reported race per archive type, as well as supervised AMR/EUR predictions for archived DNA and frozen tissues from reported “White/Caucasian” individuals. Counts are among HPV + tumor samples, counted once per patient per archive type. If race was not reported (or reported to be “Other”) or QC Array data were not available, the corresponding samples were not included in this plot
Fig. 2HPV58 read fractions and the percentage of off-target human reads vary by archive type. A Box-plots showing the read fractions of HPV16, HPV18, HPV58, and HPV45 in archived DNA, frozen tissue, and FFPE tissue samples. Although read fractions tended to be lower for HPV58 than for HPV16 and HPV18, HPV58 was detected more frequently (especially at read fractions between 20 and 80%) in FFPE tissue samples than in archived DNA and frozen tissue samples. B Associations between HPV16, HPV18, HPV58, and HPV45 read fractions and the percentages of off-target human reads in all samples. For HPV16 and HPV18, the HPV genotype-specific read fractions tended to be lower in samples with higher percentages of off-target human reads. In contrast, the frequency of HPV58 reads tended to be higher in frozen and FFPE tissue samples with higher percentages of human reads
Fig. 3Amplified DNA concentrations vary by archive type. Concentrations of DNA amplified from each sample, quantified by qPCR after gel extraction, plotted against the percentage of off-target human reads detected by L1 amplicon sequencing. Samples are color-coded by archive type: archived DNA, frozen tissue, or FFPE tissue. The gray line indicates the average DNA concentration for HPV− archived DNA samples. Samples with DNA concentrations < 2 nM, indicated by red shading on the plot, were excluded from our HPV genotype analyses
Summary of HPV Genotypes and Co-Infections Among ICC Patients and Samples
| Total patients | Total samples | Archived DNA samples | Frozen tissue samples | FFPE tissue samples | ||
|---|---|---|---|---|---|---|
| Number of ICC samplesa | 112 | 124 | 28 | 40 | 56 | |
| Samples passing qPCR filter | 102 (91%) | 110 (89%) | 28 (100%) | 40 (100%) | 42 (75%) | |
| HPV + patients/samples | 97 (95%) | 105 (95%) | 25 (89%) | 38 (95%) | 42 (100%) | |
| Overall “Unclear” samples | 3 (3%) | 3 (3%) | 1 (4%) | 2 (5%) | 0 (0%) | |
| HPV genotype-positive samplesb | HPV16 | 59 (58%) | 62 (56%) | 19 (68%) | 21 (53%) | 22 (52%) |
| HPV18 | 19 (19%) | 23 (21%) | 4 (14%) | 8 (20%) | 11 (26%) | |
| HPV31 | 2 (2%) | 2 (2%) | 0 (0%) | 0 (0%) | 2 (5%) | |
| HPV33 | 2 (2%) | 2 (2%) | 0 (0%) | 0 (0%) | 2 (5%) | |
| HPV45 | 6 (6%) | 6 (5%) | 0 (0%) | 2 (5%) | 4 (10%) | |
| HPV58 | 8 (8%) | 8 (7%) | 1 (4%) | 4 (10%) | 3 (7%) | |
| HPV59 | 2 (2%) | 3 (3%) | 2 (7%) | 1 (3%) | 0 (0%) | |
| HPV67 | 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 1 (2%) | |
| HPV73 | 2 (2%) | 2 (2%) | 0 (0%) | 0 (0%) | 2 (5%) | |
| Unclearc | 2 (2%) | 2 (2%) | 0 (0%) | 2 (5%) | 0 (0%) | |
| HPV genotypes per sample | 1 | 100 (91%) | 24 (86%) | 38 (95%) | 38 (90%) | |
| 2 | 4 (4%) | 1 (4%) | 0 (0%) | 3 (7%) | ||
| 3 | 1 (1%) | 0 (0%) | 0 (0%) | 1 (2%) | ||
| 4 | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | ||
aIncludes one archived DNA sample extracted from vulval cancer, as well as one FFPE sample that includes a mix of ICC and endometrial cancer; excludes negative control prostate samples, ICC-adjacent normal tissues, and non-malignant vaginal sample
bBecause of co-infections, one sample can contribute to the counts of multiple HPV genotypes
cSamples with an “unclear” genotype meet the read requirement for overall HPV reads but not for any particular HPV genotype. For patients, only one assignment in one sample was required to be designated positive for a given HPV genotype (i.e., “clear”)
Fig. 4Excluding FFPE samples based on amplified DNA concentrations likely reduces false-positive HPV58 genotype and HPV co-infection assignments. Concentrations of DNA amplified from all samples (A–B), as shown in Fig. 3, or FFPE samples only (C–D). Samples are color-coded by the number of HPV co-infections detected (A and C) or their assigned HPV58 genotype status (B and D). The gray lines indicate the average DNA concentration for HPV− archived DNA samples. The 2-nM threshold used to filter samples is indicated by red shading