Orla M Gannon, Annika Antonsson, Michael Milevskiy, Melissa A Brown, Nicholas A Saunders, Ian C Bennett.
Abstract
Infectious agents are thought to be responsible for approximately 16% of cancers worldwide; however, there are mixed reports in the literature as to the prevalence and potential pathogenicity of viruses in breast cancer. Furthermore, most studies to date have focused primarily on viral DNA rather than the expression of viral transcripts. We screened a large cohort of fresh frozen breast cancer and normal breast tissue specimens collected from patients in Australia for the presence of human papilloma virus (HPV) DNA and found an overall HPV prevalence of 16% and 10% in malignant and non-malignant tissue, respectively. Samples that were positive for HPV DNA by nested PCR were screened by RNA-sequencing for the presence of transcripts of viral origin, using three different bioinformatic pipelines. We did not find any evidence for HPV or other viral transcripts in HPV DNA-positive samples. We also screened publicly available breast RNA-seq data sets and did not find any evidence for the expression of viral transcripts (HPV or otherwise). These data suggest that transcription of viral genomes is unlikely to be a significant factor in breast cancer pathogenesis.
Year: 2015 PMID: 26658849 PMCID: PMC4677295 DOI: 10.1038/srep18081
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Frequency of HPV DNA in tissue samples by clinical and histological features.
| Characteristic | All samples N (%) | HPV-positive N (%) | HPV-negative N (%) | P value |
|---|---|---|---|---|
| Number of samples | 90 (100) | 14 (16) | 76 (84) | |
| <50 | 35 | 7 (50) | 27 (36) | 0.34 |
| >50 | 53 | 7 (50) | 47 (61) | |
| Unknown | 2 | 0 (0) | 2 (3) | |
| Infiltrating ductal carcinoma | 58 | 9 (64) | 49 (64) | 0.72 |
| Infiltrating lobular carcinoma | 6 | 1 (7) | 5 (7) | |
| DCIS | 3 | 0 (0) | 3 (4) | |
| Mixed | 11 | 3 (22) | 8 (11) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Unknown | 2 | 0 (0) | 2 (2) | |
| 1 | 7 | 1 (7) | 6 (8) | 0.84 |
| 2 | 33 | 5 (36) | 28 (37) | |
| 3 | 35 | 7 (50) | 28 (37) | |
| DCIS | 3 | 0 (0) | 3 (4) | |
| Unknown | 2 | 0 (0) | 2 (3) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Negative | 48 | 9 (64) | 39 (51) | 0.53 |
| Positive | 30 | 4 (53) | 26 (34) | |
| Unknown | 2 | 0 (0) | 2 (3) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Negative | 14 | 3 (21) | 11 (14) | 0.62 |
| Positive | 64 | 10 (72) | 53 (70) | |
| Unknown | 2 | 0 (0) | 3 (4) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Negative | 16 | 3 (21) | 13 (17) | 0.80 |
| Positive | 62 | 10 (72) | 52 (68) | |
| Unknown | 2 | 0 (0) | 2 (3) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Negative | 63 | 11 (79) | 52 (68) | 0.85 |
| Positive | 13 | 2 (14) | 11 (14) | |
| Unknown | 4 | 0 (0) | 4 (5) | |
| Benign | 10 | 1 (7) | 9 (12) | |
| Triple Negative | 6 | 1 (7) | 5 (7) | 1 |
| Non-triple negative | 70 | 12 (86) | 58 (76) | |
| Unknown | 4 | 0 (0) | 4 (5) | |
| Benign | 10 | 1 (7) | 9 (12) | |
HPV is human papilloma virus; a sample was scored positive if at least 1 of 3 MY09/MY11 nested PCR technical repeats was positive. P values were obtained by chi-squared test, and a P value of less than 0.05 was considered statistically significant. Unknown and benign samples were excluded from chi-squared testing for association of HPV DNA with clinicopathological characteristics.
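As an illustration of the chi-squared test named in the footnote, the sketch below computes a Pearson chi-squared statistic for a 2x2 table using the age rows (<50 and >50) and the HPV-positive and HPV-negative counts listed above. It assumes no continuity correction; the source does not say which software or correction was used, so this is not necessarily the authors' exact procedure. The function name `chi_squared_2x2` is hypothetical.

```python
import math


def chi_squared_2x2(a, b, c, d):
    """Pearson chi-squared test (no continuity correction) for [[a, b], [c, d]].

    Returns (statistic, p_value). Assumes every row and column total is non-zero.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # With 1 degree of freedom, the upper-tail probability of the
    # chi-squared distribution equals erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Age rows from the table: (<50: 7 HPV-positive, 27 HPV-negative),
# (>50: 7 HPV-positive, 47 HPV-negative). Unknown and benign rows are excluded.
stat, p = chi_squared_2x2(7, 27, 7, 47)
print(f"chi2 = {stat:.3f}, p = {p:.2f}")
```

Under these assumptions the p value comes out close to the 0.34 reported for age. Rows with more than two categories (for example histology or grade) would need a general contingency-table test with more degrees of freedom.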
Clinicopathological Features of tissue samples that were HPV positive.
| ID | RINS | Readscan | VF | Pathology | Age | Grade | Nodes | Size (mm) | ER | PR | HER2 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| HeLa | HPV18 | HPV18 | HPV18 | | | | | | | | |
| 43 | | | | IDC | 40 | 3 | POS | 22 | POS | POS | NEG |
| PS | | | | IDC | 51 | 2 | NEG | 25 | POS | POS | NEG |
| SM | | | | IDC + DCIS | 30 | 2 | POS | 18 | POS | POS | NEG |
| PH | | | | IDC | 44 | 3 | POS | 15 | NEG | NEG | NEG |
| MG | | | | IDC | 53 | 3 | NEG | 15 | NEG | NEG | POS |
| WB | n.a | n.a | n.a | IDC | 48 | 2 | NEG | 5 | POS | POS | NEG |
| 2 | n.a | n.a | n.a | IDC | 68 | 3 | POS | 30 | NEG | NEG | POS |
| 55 | n.a | n.a | n.a | IDC | 69 | 3 | NEG | 14 | POS | POS | NEG |
| 51 | n.a | n.a | n.a | IDC | 62 | 3 | NEG | 10 | POS | POS | NEG |
| KM | n.a | n.a | n.a | IDC + DCIS | 35 | 3 | NEG | 20 + 20 DCIS | POS | POS | NEG |
| 54 | n.a | n.a | n.a | IDC | 60 | 1 | NEG | 12 | POS | POS | NEG |
| TP | n.a | n.a | n.a | IDC + DCIS | 31 | 2 | NEG | 18 + 20 DCIS | POS | POS | NEG |
| 53 | n.a | n.a | n.a | ILC | 60 | 2 | NEG | 25 | POS | POS | NEG |
| 42 | n.a | n.a | n.a | FA | 40 | | | | | | |
Fourteen tissue samples were positive for HPV DNA by nested PCR. Samples in bold were analyzed by RNA-seq; samples below the dotted line were not analyzed by RNA-seq. RINS is the Rapid Identification of Non-human Sequences bioinformatic pipeline, Readscan is the Readscan bioinformatic pipeline, VF is the VirusFinder 2.0 bioinformatic pipeline, ER is estrogen receptor, PR is progesterone receptor, HER2 is human epidermal growth factor receptor 2, HPV18 is human papilloma virus 18, n.d. is no virus detected in bioinformatic analysis, POS is positive, NEG is negative, IDC is infiltrating ductal carcinoma, DCIS is ductal carcinoma in situ, ILC is infiltrating lobular carcinoma, FA is fibroadenoma, n.a. is not available.
Positive control RNA-seq data sets were analysed by RINS for the presence of viral transcripts.
| Reference | Sample | Library | Platform | Read | Virus | Viral reads | Total reads | ppm |
|---|---|---|---|---|---|---|---|---|
| SRR540252 | HeLa | Poly A | N.A | Paired | HPV18 | 21407 | 8000000 | 2675 |
| SRR702400 | HeLa | rRNA | GAII | Single | HPV18 | 26378 | 21000000 | 1256 |
| SRR629571 | HeLa | N.A | HiSeq | Paired | HPV18 | 44405 | 15000000 | 2960 |
| SRR073726 | Prostate | Poly A | N.A | Single | HPV18 | 25755 | 13000000 | 1981 |
| SRR069060 | Akata | N.A | GAII | Single | HHV4 | 34696 | 26000000 | 949 |
| SRR497704 | CD4 T-cells | N.A | GAII | Single | HIV | 64478 | 23000000 | 2803 |
| UNCID-1487840 | HNSCC | Poly A | HiSeq | Paired | HPV16 | 3470 | 182000000 | 19 |
| UNCID-1488824 | HNSCC – normal adjacent | Poly A | HiSeq | Paired | n.d | 0 | 169000000 | 0 |
| UNCID-1494200 | HNSCC | Poly A | HiSeq | Paired | HPV33 | 29131 | 112000000 | 260 |
| UNCID-1489199 | HNSCC – normal adjacent | Poly A | HiSeq | Paired | n.d | 0 | 157000000 | 0 |
Reference is the accession identifier used to download the data from NCBI (SRR) or TCGA (UNCID). Sample is the cell line or sample type used in the analysis, where HNSCC is head and neck squamous cell carcinoma. Poly A is a library prepared from poly A-selected mRNA; rRNA is a library prepared from rRNA-depleted RNA. GAII is Illumina Genome Analyzer II; HiSeq refers to the HiSeq 1000, 2000 or 2500 platforms. HPV is human papilloma virus, HHV4 is human herpes virus 4, HIV is human immunodeficiency virus. Total reads is the total number of next-generation sequencing reads; Viral reads is the number of reads aligning to viral sequence. N.A is not available, n.d is not detected, ppm is viral reads per million total reads.
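The ppm column can be read as viral reads divided by total reads, scaled to one million. The sketch below shows that arithmetic for two rows of the table. The helper name `reads_per_million` is hypothetical, and because the tabulated totals appear to be rounded to the nearest million, the result may differ slightly from the listed ppm for some rows.

```python
def reads_per_million(viral_reads, total_reads):
    """Return viral reads per million total sequencing reads."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return viral_reads / total_reads * 1_000_000


# (viral reads, total reads) as listed in the table above.
examples = {
    "SRR540252": (21407, 8_000_000),
    "UNCID-1487840": (3470, 182_000_000),
}

for accession, (viral, total) in examples.items():
    print(accession, round(reads_per_million(viral, total), 1))
```

A sample with no viral reads, such as the normal adjacent HNSCC controls, gives 0 ppm regardless of sequencing depth.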