Literature DB >> 26658849

No association between HPV positive breast cancer and expression of human papilloma viral transcripts.

Orla M Gannon1, Annika Antonsson2, Michael Milevskiy3, Melissa A Brown3,4, Nicholas A Saunders1, Ian C Bennett5,6.   

Abstract

Infectious agents are thought to be responsible for approximately 16% of cancers worldwide, however there are mixed reports in the literature as to the prevalence and potential pathogenicity of viruses in breast cancer. Furthermore, most studies to date have focused primarily on viral DNA rather than the expression of viral transcripts. We screened a large cohort of fresh frozen breast cancer and normal breast tissue specimens collected from patients in Australia for the presence of human papilloma virus (HPV) DNA, with an overall prevalence of HPV of 16% and 10% in malignant and non-malignant tissue respectively. Samples that were positive for HPV DNA by nested PCR were screened by RNA-sequencing for the presence of transcripts of viral origin, using three different bioinformatic pipelines. We did not find any evidence for HPV or other viral transcripts in HPV DNA positive samples. In addition, we also screened publicly available breast RNA-seq data sets for the presence of viral transcripts and did not find any evidence for the expression of viral transcripts (HPV or otherwise) in other data sets. This data suggests that transcription of viral genomes is unlikely to be a significant factor in breast cancer pathogenesis.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26658849      PMCID: PMC4677295          DOI: 10.1038/srep18081

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Worldwide, breast cancer is the most common cancer affecting women. There were 1.7 million new cases reported in 2012, representing approximately 12% of all new cancer cases and 25% of all cancers in women. Although incidence trends for older women have recently stabilized, younger women are experiencing a rising incidence1. Despite the efforts of large scale multinational consortia no clear genomic feature(s) can explain the high incidence of breast cancer. For example, even within familial breast cancer cohorts we can only identify evidence of genetic variants that predispose to breast cancer in 30 percent of cases2. Furthermore, there are significant population differences in the incidence of breast cancer that are not explained by ethnicity alone34. This means that the majority of breast cancers occur sporadically, without evidence for a heritable genetic, transcriptomic, or epigenetic cause. These factors, combined with evidence that a high proportion (estimated at 16%) of other cancers are caused by infectious agents5, led to the hypothesis that oncogenic viruses may be an etiologic factor in sporadic breast cancer. However, despite many attempts the association of viruses such as mouse mammary tumour virus (MMTV)6, herpes viruses7, Epstein-Barr virus (EBV)8, cytomegalovirus9 and human papilloma virus (HPV)10 with normal and malignant breast tissue remains in question. The lack of clarity about a viral association with breast cancer is, in part, attributable to the limits of the technologies used. Using next generation sequencing (NGS) techniques it has become possible to examine deep sequencing data for the presence of pathogenic nucleic acids. An early, successful example of this approach is the identification of a novel polyomavirus, Merkel Cell Polyomavirus11 in Merkel Cell carcinoma. Subsequently, similar studies have attempted to identify viruses in deep sequencing datasets in a variety of cancer types using different bioinformatic pipelines12131415. Using this approach it has been possible to identify a novel virus in organ transplant recipients16, HPV in squamous cell carcinoma of the head and neck17, hepatitis B virus in hepatocellular carcinoma18 and Epstein Barr virus in gastric carcinoma17. However, to date, NGS approaches have failed to detect the expression of viral RNA (transcripts) in breast adenocarcinoma171920. Moreover, only one study has examined whether a breast cancer sample that is positive for HPV DNA is also positive for HPV transcript19. This study failed to find evidence of HPV transcript in the two HPV positive breast cancer samples examined. However, the low HPV DNA prevalence reported by Fimereli et al.19 differs to the majority of published PCR-based studies in which HPV positivity is greater10. This highlights the “playoff” between sensitivity achievable with PCR-based strategies (high sensitivity) and NGS data (lower sensitivity with conventional coverage) against the potential for false positive results (PCR > NGS). In this study we have interrogated fresh breast cancer tissue using degenerate HPV primer pairs against HPV DNA. By using this approach we have biased our initial screen to detect even very low copy number HPV DNA. Samples which were positive for HPV DNA by nested PCR underwent massively parallel deep sequencing, followed by bioinformatic analysis to determine whether any viral transcripts (HPV or otherwise) were present. Three different bioinformatic pipelines which can identify pathogenic nucleic acids in next generation sequencing data were utilized. Using this highly sensitive approach, we failed to find any evidence for expression of HPV or other viral transcripts in breast cancer samples, even in samples which had detectable HPV DNA.

Results

HPV detection by nested PCR

We extracted genomic DNA from 80 breast cancer samples and 10 normal breast tissues and confirmed their integrity by PCR for a 260 bp region on chromosome 1 (S100A8). All samples had readily amplifiable DNA. Samples were then subjected to three repeats of nested PCR analysis using MY09/MY1121 and GP5+/6+ primers22. Genomic DNA isolated from HeLa cells (infected with HPV18) were used as a positive control, and genomic DNA isolated from HPV negative SCC25 squamous cell carcinoma cell line was a negative control. HPV positivity was declared when more than one PCR reaction was positive in the second nested PCR. Based on this criteria 1 of the 10 (10%) normal tissue specimens was positive for HPV in the GP5+/6+ PCR and 13 of 80 (16%) breast cancer specimens were positive for HPV. The rates of HPV DNA detected by PCR between benign and malignant tissue specimens was not statistically significant (p = 0.6072, Chi Squared test). There was no statistically significant association between HPV positivity and any clinical or histopathological features (p > 0.05; Table 1). Interestingly, one patient had two separate malignancies over the course of the study (2007 left breast, 2012 right breast), and only the 2007 specimen was positive for HPV DNA.
Table 1

Frequency of HPV DNA in tissue samples by clinical and histological features.

 All samples N (%)HPV-positive N (%)HPV-negative N (%)P-value
Number of samples90 (100)14 (16)76 (84) 
Age (years)
 <50357 (50)27 (36)0.34
 >50537 (50)47 (61)
 Unknown20 (0)2 (3)
Diagnosis
 Infiltrating ductal carcinoma589 (64)49 (64)0.72
 Infiltrating lobular carcinoma61 (7)5 (7)
 DCIS30 (0)3 (4)
 Mixed113 (22)8 (11)
 Benign101 (7)9 (12)
 Unknown20 (0)2 (2)
Grade
 171 (7)6 (8)0.84
 2335 (36)28 (37)
 3357 (50)28 (37)
 DCIS30 (0)3 (4)
 Unknown20 (0)2 (3)
 Benign101 (7)9 (12)
Nodes
 Negative489 (64)39 (51)0.53
 Positive304 (53)26 (34)
 Unknown20 (0)2 (3)
 Benign101 (7)9 (12)
Estrogen Receptor
 Negative143 (21)11 (14)0.62
 Positive6410 (72)53 (70)
 Unknown20 (0)3 (4)
 Benign101(7)9 (12)
Progesterone Receptor
 Negative163 (21)13 (17)0.80
 Positive6210 (72)52 (68)
 Unknown20 (0)2 (3)
 Benign101 (7)9 (12)
HER2-neu
 Negative6311 (79)52 (68)0.85
 Positive132 (14)11 (14)
 Unknown40 (0)4 (5)
 Benign101 (7)9 (12)
Triple negative status
 Triple Negative61 (7)5 (7)1
 Non-triple negative7012 (86)58 (76)
 Unknown40 (0)4 (5)
 Benign101 (7)9 (12)

HPV is Human papilloma virus, a positive sample is positive in 1 or greater of 3 MY09/MY11 nested PCR technical repeats. P-Values were obtained by Chi Squared test and a P value of less than 0.05 is statistically significant. Unknown and benign samples were not included in Chi-squared testing for association of HPV DNA with clinic-pathological characteristics.

HPV RNA detection in HPV DNA positive samples

Total RNA was isolated from five breast cancer samples which were positive for HPV DNA (Table 2, samples in bold), and from HeLa cells. The remaining breast cancer samples did not have RNA of suitable quality for RNA-sequencing due to degradation of RNA from long-term (6–9 years) fresh-frozen tissue storage. Total RNA was depleted of rRNA and tRNA and next generation sequencing libraries were generated and sequenced on the Illumina HiSeq 2500 platform.
Table 2

Clinicopathological Features of tissue samples that were HPV positive.

IDRINSReadscanVFPathologyAgeGradeNodesSize (mm)ERPRHer2
HeLaHPV18HPV18HPV18        
43n.dn.dn.dIDC403POS22POSPOSNEG
PSn.dn.dn.dIDC512NEG25POSPOSNEG
SMn.dn.dn.dIDC + DCIS302POS18POSPOSNEG
PHn.dn.dn.dIDC443POS15NEGNEGNEG
MGn.dn.dn.dIDC533NEG15NEGNEGPOS
WBn.an.an.aIDC482NEG5POSPOSNEG
2n.an.an.aIDC683POS30NEGNEGPOS
55n.an.an.aIDC693NEG14POSPOSNEG
51n.an.an.aIDC623NEG10POSPOSNEG
KMn.an.an.aIDC + DCIS353NEG20 + 20 DCISPOSPOSNEG
54n.an.an.aIDC601NEG12POSPOSNEG
TPn.an.an.aIDC + DCIS312NEG18 + 20 DCISPOSPOSNEG
53n.an.an.aILC602NEG25POSPOSNEG
42n.an.an.aFA40      

14 tissue samples were positive for HPV DNA by nested PCR. Samples in bold were analyzed by RNA seq, samples below the dotted line were not analyzed by RNA seq. RINS is Rapid Identification of non human sequences bioinformatic pipeline, READSCAN is readscan bioinformatic pipeline, VF is Virusfinder 2.0 bioinformatic pipeline, ER is estrogen receptor, PR is progesterone receptor, HER2 is human epidermal growth factor receptor 2, HPV18 is human papilloma virus 18, n.d. is no virus detected in bioinformatic analysis, POS is positive, NEG is negative, IDC is infiltrating ductal carcinoma, DCIS is ductal carcinoma in situ, ILC is infiltrating lobular carcinoma, FA is fibroadenoma, n.a. is not available.

The positive control library generated from HeLa cells detected HPV18 viral transcripts, with 34000 sequencing reads attributable to HPV E6 and E7 genes, for a read proportion per million (ppm) of 850, which is comparable to other reports20 (Table 2). We next analyzed the five breast cancer RNAseq datasets (Table 2), sequenced to an average of 40 million sequencing reads per sample, using three different bioinformatic pipelines: RINS13, READSCAN15 and VirusFinder 2.014. In no instance did we find any evidence of HPV, or other viral transcripts, in the samples that were positive for HPV DNA.

Bioinformatic Analysis for viral RNA in next generation sequencing datasets

It is known that there are geographical variations in breast cancer incidence and that the prevalence of viral transcripts may be low; thus we extended our study to include publicly available breast cancer RNA-seq datasets available from The Cancer Genome Atlas (TCGA) and a cohort of triple negative breast cancer (TNBC)23. This also served to validate our findings in an independent data set. We again validated the three different bioinformatic pipelines (RINS, Readscan and VirusFinder 2.0) using publicly available RNA sequencing data for tumour and normal matched tissue from HPV positive oral cavity SCC, EBV infected lymphoma cells, HPV infected HeLa cervical cancer cells and HIV infected T cells (Table 3). As anticipated, viral RNA was detected in these positive control samples (Table 3). We also screened 53 publicly available breast cancer and 10 normal breast tissue RNA seq datasets for the presence of any viral sequences. The cohort of RNA-seq datasets we examined were from triple negative breast cancers (TNBC) and hormone receptor positive breast cancers. In line with other studies171920, and the results presented in this manuscript, no viral transcripts were detected in any of these sequence sets. (Supplementary Table 1).
Table 3

Positive control RNA seq data sets were analysed by RINS for presence of viral transcripts.

ReferenceSampleLibraryPlatformReadVirusViral readsTotal readsppm
SRR540252HeLaPoly AN.APairedHPV182140780000002675
SRR702400HeLarRNAGAIISingleHPV1826378210000001256
SRR629571HeLaN.AHiSeqPairedHPV1844405150000002960
SRR073726ProstatePoly AN.ASingleHPV1825755130000001981
SRR069060AkataN.AGAIISingleHHV43469626000000949
SRR497704CD4 T-cellsN.AGAIISingleHIV64478230000002803
UNCID-1487840HNSCCPoly AHiSeqPairedHPV16347018200000019
UNCID-1488824HNSCC – normal adjacentPoly AHiseqPairedn.d01690000000
UNCID-1494200HNSCCPoly AHiSeqPairedHPV3329131112000000260
UNCID-1489199HNSCC – normal adjacentPoly AHiSeqPairedn.d01570000000

Reference describes the accession identification used for data download from NCBI (SRR) or TCGA (UNCID). Poly A is a library prepared from poly A isolated mRNA, rRNA is a library prepared from rRNA depleted library. Sample describes the cell line or sample type used in the analysis, where HNSCC is head and neck squamous cell carcinoma. GAII is Illumina Genome Analyzer II, HiSeq refers to HiSeq 1000, 2000 or 2500 platforms. HPV is human papilloma virus, HHV is human herpes virus 4, HIV is human immunodeficiency virus. Total reads are total number of next generation sequencing reads, Viral reads are number of viral reads aligning to viral sequence. N.A is not available, n.d. is not detected, ppm is read proportion per million.

Discussion

To our knowledge, this is the first study designed to address the question of whether viral DNA detected in breast tumours are associated with the expression of viral transcripts. To do this we have used a highly sensitive PCR approach to identify HPV DNA positive tissues even at very low copy number. This is used to stratify positive patients for RNA-seq analysis for HPV transcripts. In addition, we used rRNA depleted libraries to allow for the detection of both polyadenylated and non-polyadenylated viral transcripts and expanded our study to interrogate databases for any known human viral transcripts. Using this stringent approach we find no evidence of active viral (HPV and non-HPV) transcription within human breast tumour tissue. Our analysis of RNA sequencing data in HPV-DNA positive breast cancer extends other studies by examining greater numbers of breast cancer samples without prior knowledge of HPV DNA positivity171920 and reaches the same conclusion. Whilst we find that approximately 16% of breast tumour tissues have HPV DNA, we did not find evidence for viral transcription within those samples which were positive for HPV DNA. The most reasonable conclusion to draw from this is that the viral genomes are not being transcribed and hence are functionally inactive and not able to contribute to oncogenesis. It should be noted that the low copy number for HPV DNA detected could be attributable to an association with nonmalignant cells such as white blood cells since HPV can be detected in peripheral blood mononuclear cells, dendritic cells, B cells and neutrophils24, or alternatively the virus may have transited via the ducts and be a bystander but not active in carcinogenesis. Whilst nested PCR may detect viral DNA from a small proportion of cells, the sensitivity of RNAseq analysis is for 10 million sequencing reads, there is a 99.99% probability of detecting at least one viral read if every cell is infected and the viral transcript is present with a frequency of 0.0001% (i.e. 1 transcript per million reads)1725. Whilst we cannot deny that higher depth sequencing (i.e. the 100–150 million reads per sample as per TCGA) may yield a higher frequency of HPV genome detection this would also increase the likelihood that positivity was associated with the blood cell elements rather than the malignant tissue compartment. The literature describes a wide range of oncogenic HPV DNA positivity in breast cancer– from 0%26, to 86%27 (reviewed in10). Several factors may contribute to this variable prevalence, such as differences in sampling populations, different assay sensitivities or potential sample contamination. Interestingly, a recent publication highlighted the prevalence of sample contamination, even in a well run high throughput sequencing facility, with HPV-18 RNA from Hela cells detected in TCGA RNA sequencing data28. Indeed, when highly sensitive assays such as nested PCR are utilized, careful controls and experimental procedures must be utilized to ensure that samples do not become contaminated by extraneous HPV DNA. Whilst it could be argued that we only assayed for HPV transcripts in a small number of HPV DNA positive samples (n = 5) this is sufficient to exclude HPV transcription as a common event in HPV DNA positive specimens. This is also supported by a previous study which also failed to detect oncongenic HPV or other viral RNA in 810 breast cancer cases and 104 normal tissues sequenced at 2–3 fold greater depth than in this study20. Even if one assumes that only 10% of the 810 breast cancer samples studied by Tang et al. were HPV positive, this still indicates that the vast majority of breast cancer tissue that are positive for HPV DNA fail to make transcripts and thus we conclude that HPV does not play a role as an etiological factor in most breast tumours. Careful analysis of the supplementary data from Tang et al. (2014) shows that a very small proportion of the malignant and non-malignant tissue (1.2% and 0.96% respectively) had an extremely low level of HPV-18 (8 reads from 169 million) reads which is well below any reasonable threshold for disease association; furthermore the prevalence is the same in malignant and non-malignant tissues. However, it must also be conceded that no definitive ‘cut off’ for disease causation has been fully accepted for next generation sequencing data to date. In this regard we note that recent reports have shown that the APOCEB3 enzyme, which is highly expressed in breast cancer29 and can be regulated the HPV E6 oncogene3031, can induce genetic instability and increase breast cancer risk29. However, given that we do not find any evidence of HPV oncogene expression in the samples, it would seem unlikely that APOECB3 upregulation is HPV-mediated. Nonetheless, the effects of the recent introduction of HPV vaccination programmes will provide an interesting epidemiological perspective on the possible aetiology of HPV in breast cancer, although it may be decades until a cause and effect phenomenon can be identified. Similarly, a recent epidemiological study showed that individuals with a compromised immune system have an increased rate of virally-mediated cancers such as Kaposi’s sarcoma and cervical cancer; whereas the incidence of breast cancer is not increased32, again supporting our postulate that breast cancer is unlikely to have a viral aetiology. The only caveats to our conclusion that HPV transcripts do not contribute to breast tumour development are i) that HPV could contribute to rare breast cancer subtypes which were poorly represented in our sample set, ii) that our RNA-seq analysis was unable to detect viral transcripts with a sequence that is greater than 50% divergent from a virus in the reference database (e.g. for RINS13), or a virus which is present at very low levels (i.e. less than 0.1–1 copies per cell) or, iii) that the carcinogenic action of a virus acts only at the initiation stage before it is cleared; i.e. the “hit and run” phenomenon seen with bovine papillomavirus (BPV) in oesophageal cancer in cattle33. However, notwithstanding these caveats, our work strongly suggests that HPV, or other known viruses, are not expressed in human breast cancer at detectable levels and are unlikely to be a significant aetiological factor in breast carcinogenesis in humans.

Methods

Sample Collection

80 breast cancer tissue specimens and 10 non-malignant (from patients with benign breast disease) specimens were aseptically collected by one surgical team. The sample was placed into a sterile tube and transported to a tissue bank, snap frozen and stored at −80 °C. Clinicopathological features, including receptor positivity, were accessed from medical records. This study was approved by the institutional ethics committee and all studies were performed in accordance with the approved protocols. All patients provided written, informed consent for tissue collection for the purposes of research.

Tissue Culture

HeLa cells were a kind gift from Nigel McMillan (Griffith University, QLD, Australia), were used within 6 months of passaging from receipt from ATCC and were maintained in Dulbecco’s modified Eagle’s medium (Invitrogen, Scoresby, VIC) supplemented with 10% fetal calf serum (GIBCO, Scoresby, VIC), 100 units/mL penicillin G, 100 μg/mL streptomycin sulfate and 0.29 mg/mL L-Glutamine (Invitrogen). SCC25 cells were maintained as per34 and were verified by STR genotyping.

DNA isolation

Using aseptic techniques, a section of frozen breast cancer tissue was isolated using a sterile tissue culture dish (Corning, Murrarie, QLD, Australia) and a sterile scalpel. For cell lines, the cells were released from the tissue culture vessel with trypsin to isolate a cell pellet. DNA was isolated with the Isolate II DNA Isolation Kit (Bioline, Alexandria NSW, Australia) as per the manufacturer’s instructions. DNA concentration was determined using the NanoDrop spectrophotometer (Thermo Scientific, Scoresby, VIC, Australia) and stocks of DNA were made at 10 ng/μL for analysis by PCR.

RNA isolation

Tissue samples were homogenized using the gentleMACS Octo Dissociator (Miltenyi Biotec, Macquarie Park, NSW, Australia) with an M-Tube (Miltenyi Biotech). For cell lines, the cells were released from the tissue culture vessel with trypsin to isolate a cell pellet. RNA was isolated with Isolate II RNA Mini Kit (Bioline) with on-column DNAse digestion in accordance with the manufacturer’s instructions. Bioanalyzer RNA Nano chip (Agilent, Forrest Hill, Victoria, Australia) was used to assess RNA quality. All samples used for RNA-seq had a RIN (RNA integrity number) of 7 or higher.

rRNA depletion

5 μg total RNA was depleted of ribosomal and transfer RNA using RiboZero Magnetic Gold Kit (Human/Mouse/Rat)(Illumina, Scoresby, VIC, Australia). Depleted RNA was purified with the Isolate II RNA micro kit (Bioline) and assessed for quality using the Bioanalyzer (Agilent). Depleted RNA was used in library preparation using the NEBNext Ultra RNA Library Prep Kit for Illumina (New England Biolabs, Ipswich, MA #E7560) in accordance with the manufacturer’s instructions. NEBNext Indexed Primers for Illumina (New England Biolabs) were used to barcode samples. Ampure XP beads (Beckman Coulter, Mount Waverley, VIC, Australia) were used for size selection and all purification steps in accordance with the manufacturer’s instructions. 15 cycles of PCR were used for library amplification. Libraries were assessed for quality using a Bioanalyzer High Sensitivity DNA ChIP (Agilent).

PCR analysis

Genomic DNA was subjected to PCR for a positive control region, S100A8 (S100A8 F: 5′-GGG TCC CTC GGC ACT TCA-3′ and S100A8 R: 5′-AAA TCC TGG GGA ATT GGC-3′) to ensure that DNA was of sufficient quality for PCR analysis. PCR based detection of HPV DNA was performed using degenerate primers MY09/MY11 and GP5 + /6 + in a nested PCR reaction (MYO9: 5′-GCM CAG GGW CAT AAY AAT GG-3, ′ MY11: 5′-CGT CCM ARR GGA WAC TGA TC-3′, GP5+: 5′-TTT GTT ACT GTG GTA GAT ACT AC-3′, GP6+: 5′-GAA AAA TAA ACT GTA AAT CAT ATT C-3′). S100A8 and MY09/MY11 PCR was performed in a 50 uL reaction mix using 100 ng of genomic DNA as template, 1x ThermoPol reaction buffer (New England Biolabs), 200 μM dNTP (dATP, dCTP, dGTP, dTTP) (Bioline), 0.2 μM of each oligonucleotide primer and 1.25 U Taq DNA polymerase (New England Biolabs). PCR cycling conditions for S100A8 and MY09/MY11 PCR were 95 °C for 30 seconds, followed by 30 (S100A8) to 40 (MY09/MY11) cycles of 95 °C for 30 seconds, 52 °C for 30 seconds, 68 °C for 30 seconds for 30 cycles and a final extension of 68 °C for 5 minutes. Ten percent of the MY09/MY11 reaction volume was used as a template for the GP5 + /6 + nested PCR. GP5 + /6 + PCR was performed in a 50 mL reaction volume containing 1x PCR reaction buffer I, 0.2 mM dNTP, 2.5 mM magnesium chloride, 0.2 μM of each oligonucleotide primers and 1.25 U AmpliTaq Gold DNA Polymerase. GP5 + /6 + cycling conditions were as follows: 94 °C for 10 minutes, followed by 45 cycles of 94 °C 1 minute, 40 °C 2 minutes, 72 °C 1 minute with a final extension step of 72 °C for 7 minutes. For HPV testing, each PCR was repeated 3 times and a sample which showed positivity in 1 or more repeats of the GP5 + /6 + PCR was deemed positive. For S100A8 and MY09/MY11 primer sets Taq Polymerase with Thermopol Buffer (New England Biolabs) was used. After PCR, 10% of the PCR reaction was elotrophoresed on a 2% agarose gel stained with ethidium bromide (Sigma Aldrich) and visualized under UV transillumination.

Sequencing

Samples were sequenced on the HiSeq 2500 (Illumina) in rapid mode using 2×100bp paired end chemistry.

Data analysis

The human genome build hg19 genome was downloaded from UCSC. CASAVA (Illumina) was used to demultiplex sequencing reads. The analyses were performed on a high performance computing cluster using PBS Pro 12.01 running on Red Hat Enterprize Linux 6. Viral genomes were downloaded from NCBI on the 1st of April 2015 using search terms Viruses[PORG] and scrdb_refseq[PROP] and combined into a multifasta file. RINS13, Readscan15 and VirusFinder 2.014 were accessed as per their publications. Readscan was run using default parameters. RINS was run using default parameters. Bowtie and blast indexes were built using standard parameters. Software versions used were: Bowtie (version 0.12.6)35, Trinity (version 050811)36, blat (v34)37 and NCBI Blast Suite (version 2.2.27)38. VirusFinder2 was run using default parameters. Bowtie2 and blast indexes were built using standard parameters. Software versions used were: NCBI Blast Suite (version 2.2.27)38, Bowtie 2 (Version 2.2.5)35, BWA (Version 0.6.1)39, Trinity (version r2012-06-08)36, SVDetect (version 0.8)40, Blat (v34)37. For TCGA and TNBC data, bam files were obtained from respective data providers. To minimize data storage and analysis time, sam2fastq (Picard) was used to extract unaligned reads from bam files which were converted to fastq prior to analysis with virus finding software. This approach was first validated using RNA-seq data from HeLa files (SRR702400). For TCGA and TNBC samples, Enterophage PhiX DNA, which is used as a sequencing control on Illumina Hiseq platform, was detected by RINS.

Reference sequencing samples

RNA Seq datasets were accessed through The Cancer Genome Atlas (TCGA), NCBI Short Read Archive and from23. Accession and identification numbers for publicly available datasets are provided in Supplementary Table 1.

Statistical analysis

Statistical analysis of clinicopathological features of the fresh frozen breast cancer were examined by Chi-squared testing performed in GraphPad Prism (GraphPad Software, Treestar).

Additional Information

How to cite this article: Gannon, O. M. et al. No association between HPV positive breast cancer and expression of human papilloma viral transcripts. Sci. Rep. 5, 18081; doi: 10.1038/srep18081 (2015).
  39 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  Use of multiple PCR primer sets for optimal detection of human papillomavirus.

Authors:  F Karlsen; M Kalantari; A Jenkins; E Pettersen; G Kristensen; R Holm; B Johansson; B Hagmar
Journal:  J Clin Microbiol       Date:  1996-09       Impact factor: 5.948

3.  APOBEC3B is an enzymatic source of mutation in breast cancer.

Authors:  Michael B Burns; Lela Lackey; Michael A Carpenter; Anurag Rathore; Allison M Land; Brandon Leonard; Eric W Refsland; Delshanee Kotandeniya; Natalia Tretyakova; Jason B Nikas; Douglas Yee; Nuri A Temiz; Duncan E Donohue; Rebecca M McDougle; William L Brown; Emily K Law; Reuben S Harris
Journal:  Nature       Date:  2013-02-06       Impact factor: 49.962

4.  Clonal integration of a polyomavirus in human Merkel cell carcinoma.

Authors:  Huichen Feng; Masahiro Shuda; Yuan Chang; Patrick S Moore
Journal:  Science       Date:  2008-01-17       Impact factor: 47.728

5.  Dysregulation of the repressive H3K27 trimethylation mark in head and neck squamous cell carcinoma contributes to dysregulated squamous differentiation.

Authors:  Orla M Gannon; Lilia Merida de Long; Liliana Endo-Munoz; Mehlika Hazar-Rethinam; Nicholas A Saunders
Journal:  Clin Cancer Res       Date:  2012-11-27       Impact factor: 12.531

Review 6.  Global burden of cancers attributable to infections in 2008: a review and synthetic analysis.

Authors:  Catherine de Martel; Jacques Ferlay; Silvia Franceschi; Jérôme Vignat; Freddie Bray; David Forman; Martyn Plummer
Journal:  Lancet Oncol       Date:  2012-05-09       Impact factor: 41.316

7.  A new arenavirus in a cluster of fatal transplant-associated diseases.

Authors:  Gustavo Palacios; Julian Druce; Lei Du; Thomas Tran; Chris Birch; Thomas Briese; Sean Conlan; Phenix-Lan Quan; Jeffrey Hui; John Marshall; Jan Fredrik Simons; Michael Egholm; Christopher D Paddock; Wun-Ju Shieh; Cynthia S Goldsmith; Sherif R Zaki; Mike Catton; W Ian Lipkin
Journal:  N Engl J Med       Date:  2008-02-06       Impact factor: 91.245

8.  Human papillomavirus E6 triggers upregulation of the antiviral and cancer genomic DNA deaminase APOBEC3B.

Authors:  Valdimara C Vieira; Brandon Leonard; Elizabeth A White; Gabriel J Starrett; Nuri A Temiz; Laurel D Lorenz; Denis Lee; Marcelo A Soares; Paul F Lambert; Peter M Howley; Reuben S Harris
Journal:  mBio       Date:  2014-12-23       Impact factor: 7.867

Review 9.  Metagenomics and the molecular identification of novel viruses.

Authors:  Nicholas Bexfield; Paul Kellam
Journal:  Vet J       Date:  2010-12-15       Impact factor: 2.688

10.  Exploring the prevalence of ten polyomaviruses and two herpes viruses in breast cancer.

Authors:  Annika Antonsson; Seweryn Bialasiewicz; Rebecca J Rockett; Kevin Jacob; Ian C Bennett; Theo P Sloots
Journal:  PLoS One       Date:  2012-08-15       Impact factor: 3.240

View more
  6 in total

1.  Human papillomavirus infection increases the risk of breast carcinoma: a large-scale systemic review and meta-analysis of case-control studies.

Authors:  Chutong Ren; Kai Zeng; Chujun Wu; Lan Mu; Jiangsheng Huang; Mingming Wang
Journal:  Gland Surg       Date:  2019-10

Review 2.  Human Papilloma Viruses and Breast Cancer - Assessment of Causality.

Authors:  James Sutherland Lawson; Wendy K Glenn; Noel James Whitaker
Journal:  Front Oncol       Date:  2016-09-29       Impact factor: 6.244

Review 3.  Oncogenic Viruses and Breast Cancer: Mouse Mammary Tumor Virus (MMTV), Bovine Leukemia Virus (BLV), Human Papilloma Virus (HPV), and Epstein-Barr Virus (EBV).

Authors:  James S Lawson; Brian Salmons; Wendy K Glenn
Journal:  Front Oncol       Date:  2018-01-22       Impact factor: 6.244

4.  Detection of human papillomavirus genotypes, herpes simplex, varicella zoster and cytomegalovirus in breast cancer patients.

Authors:  Morvarid Golrokh Mofrad; Zohreh Azita Sadigh; Sanaz Ainechi; Ebrahim Faghihloo
Journal:  Virol J       Date:  2021-01-22       Impact factor: 4.099

5.  Presence of Human Papillomavirus DNA in Malignant Neoplasia and Non-Malignant Breast Disease.

Authors:  Erika Maldonado-Rodríguez; Marisa Hernández-Barrales; Adrián Reyes-López; Susana Godina-González; Perla I Gallegos-Flores; Edgar L Esparza-Ibarra; Irma E González-Curiel; Jesús Aguayo-Rojas; Adrián López-Saucedo; Gretel Mendoza-Almanza; Jorge L Ayala-Luján
Journal:  Curr Issues Mol Biol       Date:  2022-08-13       Impact factor: 2.976

Review 6.  Epstein-Barr Virus and Human Papillomaviruses Interactions and Their Roles in the Initiation of Epithelial-Mesenchymal Transition and Cancer Progression.

Authors:  Farhan S Cyprian; Halema F Al-Farsi; Semir Vranic; Saghir Akhtar; Ala-Eddin Al Moustafa
Journal:  Front Oncol       Date:  2018-05-01       Impact factor: 6.244

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.