| Literature DB >> 30597724 |
Atlas Khan1, Qian Liu2, Xuelian Chen3,4, Andres Stucky3,4, Parish P Sedghizadeh3,4, Daniel Adelpour3,4, Xi Zhang3,4,5, Kai Wang2, Jiang F Zhong3,4.
Abstract
Next-generation sequencing provides an opportunity to detect viral species from RNA-seq data of human tissues, but existing computational approaches do not perform optimally on clinical samples. We developed a bioinformatic method called VirTect for detecting viruses in neoplastic human tissues using RNA-seq data. Here, we used VirTect to analyze RNA-seq data from 363 head and neck squamous cell carcinoma (HNSCC) patients and identified 22 human papillomavirus (HPV)-induced HNSCCs. These predictions were validated by manual review of pathology reports on histopathologic specimens. VirTect showed better performance in recall and accuracy compared to the two existing prediction methods, VirusFinder and VirusSeq, in identifying viral sequences from RNA-seq data. The majority of HPV carcinogenesis studies thus far have been performed on cervical cancer and generalized to HNSCC. Our results suggest that carcinogenesis of HPV-induced HNSCC and other cases of HNSCC involve different genes, so understanding the underlying molecular mechanisms will have a significant impact on therapeutic approaches and outcomes. In summary, RNA-seq together with VirTect can be an effective solution for the detection of viruses from tumor samples and can facilitate the clinicopathologic characterization of various types of cancers with broad applications for oncology.Entities:
Keywords: RNA-seq; VirTect; carcinogenesis; head and neck squamous cell carcinoma; human papillomavirus; viral detection
Mesh:
Year: 2019 PMID: 30597724 PMCID: PMC6441885 DOI: 10.1002/1878-0261.12435
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 6.603
Figure 1Schematic overview of the VirTect pipeline for virus detection from NGS data.
Clinical confirmation of HPV16 infection in HNSCC cases
| Patient number (TCGA case number) | Sample location | Pathological evidence | P16 assay confirmation | Morphology confirmation | Sex |
|---|---|---|---|---|---|
| 1 (abbcc1d8‐ab74‐4459‐bf28‐cc627bef440e) | Oropharynx/tonsil | Oropharynx/tonsil | P16 positive (cytoplasm and nuclei) | Yes | Female |
| 2 (e37f479f‐ca05‐45e4‐ac6d‐a974bed6e7f8) | Left tonsil | Left tonsil | NA | Yes | Male |
| 3 (5ac57aee‐4be1‐4a29‐a53f‐343f5a3d2e86) | Pharynx, oropharynx | Pharynx, oropharynx | NA | Yes | Male |
| 4 (cac30b32‐03ef‐4ecb‐88d8‐d5577ab6b2a6) | Right base of tongue | Right base of tongue | NA | Yes | Male |
| 5 (59a7d695‐cb0c‐47b8‐ba9d‐ca52ca2cfa7d) | Right tonsil | Right tonsil | P16 positive | Yes | Male |
| 6 (cd032ddb‐55f2‐4c77‐8dcf‐e4e630f7de6f) | Right tonsil | Right tonsil | NA | Yes | Male |
| 7 (05f01280‐bf77‐4682‐a7a8‐20dd0eac77bd) | Left tonsil and soft palate | Left tonsil and soft palate | NA | Yes | Male |
| 8 (c7df3466‐b9a7‐4818‐883b‐d0cd08483570) | Right tonsil | Right tonsil | NA | Yes | Female |
| 9 (4bfbce2b‐9d0b‐4e8a‐950f‐fd8e0ba3e05a) | Right base of the tongue | Right base of tongue | NA | Yes | Female |
| 10 (387db1df‐ebaa‐41c2‐b036‐f46ad61e313a) | Base of the tongue | Base of the tongue | NA | Yes | Male |
| 11 (9d469689‐0413‐4898‐aa83‐c6756cbfe117) | Right soft palate | Right soft palate | P 16/18 | Yes | Male |
| 12 (54b295d4‐6315‐4416‐bde7‐a221859f965c) | Left glossotonsillar sulcus extending into lingual tonsil | Left glossotonsillar sulcus extending into lingual tonsil | NA | Yes | Male |
| 13 (b3631718‐9e0a‐454c‐bee1‐8f36ebc509d8) | Left tonsil | NA | Yes | Male | |
| 14 (27c28c89‐f4e7‐4aec‐a806‐c0da7756e47f) | Right floor of mouth | Right floor of mouth | NA | Yes | Male |
| 15 (03c3ae62‐d0aa‐412e‐bd3c‐4577fc9f919c) | Right tonsil | Right tonsil | Positive for p16 and HPV16. | Yes | Male |
| 16 (9f89510c‐ed07‐471f‐b35e‐7c87c237b9fe) | Left tonsil | Left tonsil | NA | Yes | Male |
| 17 (602f2512‐00b6‐44b6‐9ed6‐f8b010224f8c) | Pharynx, oropharynx | Pharynx, oropharynx | P16 IHC positive, P16 ISH positive | Yes | Male |
| 18 (180036a2‐3b56‐405e‐a1fe‐d5932517b6c7) | Right tonsil | Right tonsil | NA | Yes | Male |
| 19 (10f53522‐9ae7‐47b9‐80aa‐b4b481561465) | Right tonsil | Right tonsil | NA | Yes | Male |
| 20 (a0b136fb‐3a0a‐4411‐8907‐4ca775c7d04e) | Left tonsil | Left tonsil | P16 staining positive | Yes | Male |
Figure 2Histopathology confirms in silico results of HPV infection. (A–C) Histopathology of cancer specimens without in silico HPV detection. (A) Moderately differentiated invasive squamous cell carcinoma (20×). (B) Higher magnification showing invasive tumor islands with central keratin pearl formation and individual cell dyskeratosis (40×). (C) Cellular atypia, pleomorphism, nucleoli, and mitotic activity are observed regularly (40×). (D–F) Histopathology of in silico HPV‐positive specimens which have characteristic nonkeratinizing tumor morphology. (D) A typical HPV16‐positive specimen (20×) detected by VirTect and confirmed by serological test. (E) Higher magnification (40×) shows infiltrating tumor islands lacking squamous maturation and comprising a cell population of ovoid to spindle‐shaped cells with indistinct borders or gap junctions, and hyperchromatic nuclei that lack prominent nucleoli. (F) Overlying mucosa and significant koilocytosis indicating HPV infection. Viral cytopathic effect can be seen in the form of keratinocyte nuclear enlargement and hyperchromasia with perinuclear clearing or halo.
Comparison of virus detection results among VirTect, VirusSeq, and VirusFinder
| Method | Precision | Recall | Accuracy |
|---|---|---|---|
| VirTect | 1.00 | 1.00 | 1.00 |
| VirusSeq | 1.00 | 0.087 | 0.344 |
| VirusFinder | 1.00 | 0.913 | 0.938 |
There were 32 RNA‐seq data for analysis including 21 HPV16+ RNA‐seq data and 11 HPV16− RNA‐seq data. Precision is the number of HPV16+ RNA‐seq data where HPV16 was detected by a tool divided by the number of RNA‐seq data where HPV16 was detected, recall is the number of HPV16+ RNA‐seq data where HPV16 was detected by a tool divided by the number of HPV16+ RNA‐seq data, while accuracy is the number of correctly identified RNA‐seq data, no matter HPV16+ or HPV16−, divided by the total number of RNA‐seq data.
Figure 3The igv figures of the nonhuman reads of two head and neck cancer samples affected by HPV16 (A) and HPV33 (B) showing the coverage of a sample affected by HPV16/HPV33 and control.
Figure 4HPV16 viral genes detected in HNSCC patients. Among the 20 patients with detectable HPV16 viral sequence, the number of detectable HPV16 genes ranged from 3 (patients 9, 11, and 15) to 10 (patient 19). E2 and E7 were most common, detected in 85% (17/20) and 90% (18/20) of patients, respectively. Most patients (15/20) had both detectable E2 and E7. The alternatively spliced E8^E2 was detected in 90% (18/20) of patients. The two patients (patients 9 and 15) without E8^E2 only had 3 detectable viral genes (E1, E1^E4, and E7). E1^E4 and E4 were detected in 95% (19/20) and 85% (17/20) of patients, respectively. However, E6 (83–559) was only detected in 30% (6/20) of patients. Fragments of E6*(83–226) and E6*# (409–417) were detected in 25% (5/20) and 15% (3/20) of patients, respectively. Patients 16 and 17 had detectable E6 (83–559) and E6*(83–226), while patient 19 had all E6 fragments detected. This viral expression pattern suggests that E2 and E7 are the major players in HPV‐induced HNSCC.
Figure 5Carcinogenesis pathway of HPV‐induced HNSCC. (A) HNSCC patients with (n = 20) and without (n = 20) HPV infection had significantly different expression values for p16 (P = 4.88E‐7), p19 (P = 7.6E‐6), E2F1 (P = 3.63E‐5), and TP53 (P = 9.75E‐7). (B) Ingenuity pathway analysis (ipa) identified the molecules involved in the G1/S checkpoint to be the most significantly (P = 3.28E‐18) active molecular pathway, including 17 genes with significantly different expression values between HPV+ and HPV− HNSCC (P < 0.05). Red: upregulated in HPV+ HNSCC; green: downregulated in HPV+ HNSCC. While E6 plays a minor role, E7 plays a major role in HPV carcinogenesis in HNSCC.