Literature DB >> 27814725

Whole genome sequencing of 51 breast cancers reveals that tumors are devoid of bovine leukemia virus DNA.

Nicolas A Gillet1,2, Luc Willems3,4.   

Abstract

Controversy exists regarding the association of bovine leukemia virus (BLV) and breast cancer. PCR-based experimental evidence indicates that BLV DNA is present in breast tissue and that as many as 37% of cancer cases may be attributable to viral exposure. Since this association might have major consequences for human health, we evaluated 51 whole genomes of breast cancer samples for the presence of BLV DNA. Among 32 billion sequencing reads retrieved from the NCBI database of genotype and phenotype, none mapped on different strains of the BLV genome. Controls for sequence divergence and proviral loads further validated the approach. This unbiased analysis thus excludes a clonal insertion of BLV in breast tumor cells and strongly argues against an association between BLV and breast cancer.

Entities:  

Keywords:  BLV; Bovine leukemia virus; Breast cancer

Mesh:

Substances:

Year:  2016        PMID: 27814725      PMCID: PMC5095936          DOI: 10.1186/s12977-016-0308-3

Source DB:  PubMed          Journal:  Retrovirology        ISSN: 1742-4690            Impact factor:   4.602


Background

BLV naturally infects cattle, water buffalo, yak and zebu [1-4]. Sporadic infections with BLV have occasionally been reported in other species like alpaca [5]. Experimentally, BLV can also be transmitted to a number of species including sheep [6], goats [6], rats [7] and rabbits [8]. BLV infection causes B cell lymphocytosis, leukemia and/or lymphoma in natural and some experimental hosts [1]. There is also controversial evidence suggesting that BLV might infect humans: (1) antibodies against the BLV capsid were detected in 74% of human sera from the Berkeley Community, California [9], (2) BLV DNA was detected in breast tissues using PCR [10-12]. Based on a positive correlation between the rates of BLV infection and tumor frequencies (36–59% compared to 29–45% in normal tissue), as many as 37% of breast cancer cases may be attributable to BLV exposure [12]. Although these observations initiated some skepticism within the scientific community [13], the potential consequences for human health clearly require further investigation.

Results and discussion

To avoid potential experimental artifacts associated with DNA amplification techniques, we directly analyzed whole genomes of breast tumors and adjacent tissues. After retrieval of raw DNA sequences from the NCBI dbGaP [14, 15], paired-reads were probed for alignment on different BLV strains using Bowtie2. As a positive control, a nuclear DNA fragment (chr12: 53,959,600–53,964,000) devoid of repeated sequences that would lead to an overestimation of aligned reads and set to 4.4 kb to fit with the monoploid 8.8 kb BLV genome was selected from the human genome. Alignment of 51 breast tumors genomes on the nuclear control sequence identified between 283 and 1287 paired-reads (illustrated on Fig. 1 and summarized on Table 1). In contrast, no homology was found with 5 different BLV subtypes (highlighted in blue on the phylogenic tree of Fig. 2a). In 19 biopsies adjacent to the breast tumors, 386–1197 paired-reads aligned onto the nuclear DNA sequence whereas none mapped on BLV (Table 1). All DNA samples contained extranuclear DNA as indicated by alignment of a control mitochondrial sequence (NC_012920) (Table 1).
Fig. 1

Representative alignment of dbGaP sequencing reads to human and BLV DNA. Breast cancer patients were BRC3 from USA (study phs000472), MEX-BR-15 from Mexico and SX1A2 from Vietnam (study phs000369). Aligned reads were visualized using integrative genomics viewer (IGV)

Table 1

Absence of BLV DNA in 51 whole genomes of breast tumors

Subject IDCountryAgeDiagnosisSample typeGradeHER2 statusER statusPR statusTotal no of readsNo. of reads that align on
Control DNA (nuclear)Control DNA (mitochondrial)BLV_AF033818BLV_AF257515BLV_D00647BLV_K02021BLV_LC080667
MEX-BR-106Mexico42IDCTumorII++583,906,975669396,23900000
MEX-BR-116Mexico92IDCTumorIII+577,618,1967961,166,91600000
MEX-BR-15Mexico45IDCTumorII++571,043,2276521,167,67200000
MEX-BR-154Mexico52IDCTumorIII++700,630,351811400,38300000
MEX-BR-165Mexico42IDCTumorII++757,323,566737742,64600000
MEX-BR-198Mexico44IDCTumorII++745,509,52910191,264,55500000
MEX-BR-50Mexico47IDCTumorII++605,198,587653958,81200000
MEX-BR-82Mexico59IDCTumorII+681,881,066687547,86300000
BRC12USA81IDCTumorIIUU548,255,1697451,113,30600000
BRC13USA51IDCTumorIII7U587,461,4826861,106,78000000
BRC14USA86IDCTumorIII7U755,094,2078991,469,97600000
BRC15USA83IDCTumorII7U758,784,2629342,327,82400000
BRC16USA61IDCTumorIII7U821,134,04012872,084,78200000
BRC18USA85IDCTumorI8U568,355,4556771,395,82300000
BRC19USA75IDCTumorII8U596,337,8427471,648,87000000
BRC20USA61IDCTumorIII4U507,651,9005701,026,83000000
BRC21USA73IDCTumorI7U719,742,1228171,710,01000000
BRC22USA64ILCTumorI6U608,469,920708953,10000000
BRC23USA68IDCTumorI7U613,481,2156871,272,51900000
BRC24USA51IDCTumorII7U656,115,8007211,980,03000000
BRC25USA52IDCTumorII5U583,560,227712580,20300000
BRC28USA52IDCTumorI7U664,667,777781973,99000000
BRC29USA74IDCTumorIII6U785,019,5635962,085,48200000
BRC3USA62IDCTumorII8U695,174,96710263,134,34100000
BRC30USA60ILCTumorII5U663,769,7447941,442,01400000
BRC31USA66IDCTumorII6U734,384,35210281,415,99600000
BRC32USA54IDCTumorI7U643,884,1787031,404,43600000
BRC33USA83IDCTumorII8U660,668,8778191,284,59900000
BRC34USA79IDCTumorI7U572,861,9307041,499,41400000
BRC35USA76IDCTumorII6U543,480,4746971,709,94300000
BRC36USA68IDCTumorII7U706,448,3488041,501,76300000
BRC40USA66IDCTumorI8U600,847,5166901,686,11200000
BRC41USA55IDCTumorII8U689,312,2178123,735,59100000
BRC42USA74IDCTumorIIUU684,312,3026851,308,94800000
BRC44USA64IDCTumorII7U717,390,2518911,430,06400000
BRC47USA54IDCTumorIII5U580,674,755865960,94400000
BRC48USA66IDCTumorII6U782,262,3537831,236,10200000
BRC49USA56IDCTumorII8U577,656,003559881,80400000
BRC5USA72IDCTumorII7U762,026,86011552,462,81900000
BRC50USA78ILCTumorI4U661,525,693792357,91500000
BRC7USA78IDCTumorII8U455,727,994795580,48400000
BRC8USA87IDCTumorI8U518,548,2856281,394,43900000
BRC9USA65ILCTumorII8U516,702,8026971,759,44400000
9DDA1Vietnam60IDCTumorIIIUUU706,450,9507591,109,34000000
9P4X9Vietnam54IDCTumorIIIUUU610,913,537778619,06600000
9YBUFVietnam52IDCTumorIIIUUU595,959,881616788,05800000
CI5PDVietnam51IDCTumorIIIUUU572,612,309626786,78700000
FYGW6Vietnam38IDCTumorIIIUUU238,201,059282221,94200000
GT33 VVietnam52IDCTumorIIIUUU548,640,325604766,32000000
SX1A2Vietnam53IDCTumorIIIU++598,405,1436931,002,57700000
UQWDSVietnam35IDCTumorIIIU596,126,8256651,285,88400000
9DDA1Vietnam60IDCNormalIIIUUU691,060,6497971,122,13300000
9P4X9Vietnam54IDCNormalIIIUUU601,815,791664694,15300000
9YBUFVietnam52IDCNormalIIIUUU593,968,9226461,202,17500000
CI5PDVietnam51IDCNormalIIIUUU566,065,567595911,13300000
FYGW6Vietnam38IDCNormalIIIUUU337,274,647386361,06300000
GT33 VVietnam52IDCNormalIIIUUU581,403,7836521,189,00300000
SX1A2Vietnam53IDCNormalIIIU++608,739,604700878,36200000
UQWDSVietnam35IDCNormalIIIU590,387,671685829,84700000
MEX-BR-106Mexico42IDCNormalII++539,137,287526351,03400000
MEX-BR-116Mexico92IDCNormalIII+513,833,151520258,28700000
MEX-BR-123Mexico71IDCNormalIII+U668,026,494761515,50100000
MEX-BR-15Mexico45IDCNormalII++592,958,041670756,77800000
MEX-BR-154Mexico52IDCNormalIII++670,289,201929817,44600000
MEX-BR-165Mexico42IDCNormalII++712,308,425706537,51600000
MEX-BR-198Mexico44IDCNormalII++726,225,752831216,10900000
MEX-BR-200Mexico42IDCNormalII++767,097,5421197279,03100000
MEX-BR-28Mexico79MCNormalII++588,561,634607215,02200000
MEX-BR-50Mexico47IDCNormalII++551,537,695618394,84200000
MEX-BR-82Mexico59IDCNormalII+608,849,308719385,78900000

Whole genome sequencing data from 51 breast tumors and 19 normal adjacent breast tissues were downloaded from the NCBI dbGaP. Hundreds of millions of paired-reads per sample were probed for alignment on different BLV strains and on nuclear and mitochondrial human control sequences

IDC infiltrating ductal carcinoma, ILC infiltrating lobular carcinoma, MC mixed carcinoma, U unknown

Fig. 2

Analysis of sequence variation and proviral load in sequence alignments. a Neighbour-joining phylogenetic tree of BLV and HTLV-1 genomes. b Using the ART simulation tool (NIH), Illumina-like 100 bp paired-reads were generated in silico from the mutants. 880 simulated reads were probed for alignment on BLV AF033818 using Bowtie2 and visualized using IGV. c Correlation between proviral loads and predicted number of reads

Representative alignment of dbGaP sequencing reads to human and BLV DNA. Breast cancer patients were BRC3 from USA (study phs000472), MEX-BR-15 from Mexico and SX1A2 from Vietnam (study phs000369). Aligned reads were visualized using integrative genomics viewer (IGV) Absence of BLV DNA in 51 whole genomes of breast tumors Whole genome sequencing data from 51 breast tumors and 19 normal adjacent breast tissues were downloaded from the NCBI dbGaP. Hundreds of millions of paired-reads per sample were probed for alignment on different BLV strains and on nuclear and mitochondrial human control sequences IDC infiltrating ductal carcinoma, ILC infiltrating lobular carcinoma, MC mixed carcinoma, U unknown Analysis of sequence variation and proviral load in sequence alignments. a Neighbour-joining phylogenetic tree of BLV and HTLV-1 genomes. b Using the ART simulation tool (NIH), Illumina-like 100 bp paired-reads were generated in silico from the mutants. 880 simulated reads were probed for alignment on BLV AF033818 using Bowtie2 and visualized using IGV. c Correlation between proviral loads and predicted number of reads Although no paired-read corresponding to five different BLV variants could be identified, the possibility remains that extensive sequence variability impaired detection. On average, the whole genome sequencing procedure generated 660 million reads per sample. Given that the BLV provirus length is 8.8 kb and that a normal human diploid genome is 6.6 billion base pairs, the average number of reads that would be generated by a 8.8 kb-long monoploid sequence is 880 (660,000,000/6600,000,000 × 8800). Providing that the BLV provirus is integrated in a single copy per cell, the whole genome sequencing procedure would thus generate 880 reads on average. If the strain in the sample diverges from the five reference sequences, a fraction of the reads would not be retrieved. Therefore, BLV variants were artificially generated in silico by introducing 2, 3, 6, 10 and 20% nucleotide changes in reference AF033818 (mutants 0.02, 0.03, 0.06, 0.10 and 0.20, respectively). Phylogenetic analysis of Fig. 2a illustrates that in silico generated divergence far exceeds the maximal natural sequence variations observed worldwide [16]. 880 Illumina-like reads were then simulated from these in silico variants using ART simulation tool and mapped on BLV genome AF033818. Most reads (818 of 880) generated from mutant 0.02 aligned on reference sequence AF033818 (Fig. 2b). Even the highly divergent mutant 0.10 still aligned 41% of its 880 reads on the reference. Up to 20% divergence in mutant 0.20 was required to significantly impair detection, although BLV specific reads were still identified (Fig. 2b). Whole genome analysis thus excludes clonal integration of natural and highly divergent BLV strains in breast tumors. Since only a small proportion of cells may carry the provirus, the sensitivity of the analysis was correlated to the proviral loads. Any natural BLV variant that would infect 10% of the tumor cells is expected to generate about 100 reads (Fig. 2c, dotted blue line). The number of expected reads decreases along with the percentage of infected cells to reach approximately one read with a proviral load of 0.1% (Fig. 2c, dotted blue line). Considering a 59% prevalence of breast tumors positive for BLV [12], 30 samples out of our 51 should be positive. Even with an individual proviral load around 0.1%, this should make about 30 reads (on average one per patient) mapping on BLV, whereas none were found. Using whole genome analysis, we concluded that there is no evidence for a single BLV-specific or even related sequence. The discrepancies and limitations of this report and others pertain to: The origin of the samples It is indeed possible that tumor biopsies from previous studies originating from US [11, 12] and Colombia [10] significantly differ from those reported in the dbGaP NCBI database. Even if we restrict our observations on US originating samples (n = 35), the discrepancy remains highly significant. Indeed, Buehring reported 67 breast tumors positive for BLV over 114 cases [12] whereas we found none over 35 cases (the p value for fisher test is 1.12 × 10−6). The DNA extraction technique In situ PCR suggested that BLV proviral DNA is localized in the cytoplasm [11, 12]. Analysis of mitochondria-specific sequences (Table 1) shows that dbGaP NCBI database includes reads corresponding to 16 kb-long, circular and extranuclear mitochondrial DNA. The strain divergence Artificial in silico simulation of highly divergent mutants still identified BLV specific reads (Fig. 2b). Since nucleotide substitutions among BLV strains worldwide are limited to 2.3% [16], it remains questionable whether these mutants still belong to the same species. Further analysis show that breast tumor genomes do not map on HTLV-1 sequences (data not shown). Why BLV-conserved sequences were previously identified by PCR remains an enigma. Viral expression Although BLV is expressed at trace levels in the bovine species, the p24 viral capsid protein was detected in 5% of breast tumors [12]. This observation is inconsistent with RNASeq analysis of 154.7 billion of transcriptome sequencing reads from The Cancer Genome Atlas Research Network [17, 18]. Our present study based on whole genome analysis excludes a clonal insertion of BLV in tumor cells and does not support converging lines of evidence which previously suggested an association between BLV infection and breast cancer.

Methods

Raw DNA sequences from whole genomes of breast tumors and normal breast tissues adjacent the tumor were retrieved from the NCBI database of genotype and phenotype (dbGaP). These sequences were extracted from two studies: (1) estrogen receptor positive breast cancer: aromatase inhibitor response study (accession number phs000472) [14] and (2) sequence analysis of mutations and translocations across breast cancer subtypes (accession number phs000369) [15]. Archive files were downloaded with prefetch v2.5.7 and sequencing reads were extracted with fastdump v2.5.7 using “split-3” option to separate paired reads and single reads (NCBI SRA Toolkit). Paired reads were probed for alignment on different BLV variants (accession numbers: AF033818, AF275515, D00647, K02120, LC080667) and, as positive control, on human genomic sequences using Bowtie2 (version 2.2.5). We used the “very-sensitive” option of Bowtie2 to maximize the likelihood of viral detection. Analyses were performed on computing cluster running on Linux OS. BLV divergent sequences were created in silico by introducing substitutions, deletions or insertions with equal probabilities in 2, 3, 6, 10 and 20% of the reference AF033818 (mutants 0.02, 0.03, 0.06, 0.10 and 0.20, respectively). Neighbor-joining phylogenetic tree was elaborated using Clustal Omega (EMBL-EBI) and visualized by Dendroscope 3. Illumina-like paired-reads were generated from the BLV sequence using the ART simulation tool (version GreatSmokyMountains-04-17-2016, NIH).
  16 in total

1.  Induction of immune deficiency syndrome in rabbits by bovine leukaemia virus.

Authors:  V Altanerova; J Ban; C Altaner
Journal:  AIDS       Date:  1989-11       Impact factor: 4.177

2.  Infection of bovine immunodeficiency virus and bovine leukemia virus in water buffalo and cattle populations in Pakistan.

Authors:  S Meas; J Seto; C Sugimoto; M Bakhsh; M Riaz; T Sato; K Naeem; K Ohashi; M Onuma
Journal:  J Vet Med Sci       Date:  2000-03       Impact factor: 1.267

3.  Landscape of DNA virus associations across human malignant cancers: analysis of 3,775 cases using RNA-Seq.

Authors:  Joseph D Khoury; Nizar M Tannir; Michelle D Williams; Yunxin Chen; Hui Yao; Jianping Zhang; Erika J Thompson; Funda Meric-Bernstam; L Jeffrey Medeiros; John N Weinstein; Xiaoping Su
Journal:  J Virol       Date:  2013-06-05       Impact factor: 5.103

4.  Infection of rats with bovine leukaemia virus: establishment of a virus-producing rat cell line.

Authors:  V Altanerova; D Portetelle; R Kettmann; C Altaner
Journal:  J Gen Virol       Date:  1989-07       Impact factor: 3.891

5.  Experimental infection of sheep and goat with bovine leukemia virus: localization of proviral information on the target cells.

Authors:  R Kettmann; M Mammerickx; D Portetelle; D Grégoire; A Burny
Journal:  Leuk Res       Date:  1984       Impact factor: 3.156

6.  Bovine leukemia virus infection in a juvenile alpaca with multicentric lymphoma.

Authors:  Laura C Lee; William K Scarratt; Gertrude C Buehring; Geoffrey K Saunders
Journal:  Can Vet J       Date:  2012-03       Impact factor: 1.008

7.  Whole-genome analysis informs breast cancer response to aromatase inhibition.

Authors:  Matthew J Ellis; Li Ding; Dong Shen; Jingqin Luo; Vera J Suman; John W Wallis; Brian A Van Tine; Jeremy Hoog; Reece J Goiffon; Theodore C Goldstein; Sam Ng; Li Lin; Robert Crowder; Jacqueline Snider; Karla Ballman; Jason Weber; Ken Chen; Daniel C Koboldt; Cyriac Kandoth; William S Schierding; Joshua F McMichael; Christopher A Miller; Charles Lu; Christopher C Harris; Michael D McLellan; Michael C Wendl; Katherine DeSchryver; D Craig Allred; Laura Esserman; Gary Unzeitig; Julie Margenthaler; G V Babiera; P Kelly Marcom; J M Guenther; Marilyn Leitch; Kelly Hunt; John Olson; Yu Tao; Christopher A Maher; Lucinda L Fulton; Robert S Fulton; Michelle Harrison; Ben Oberkfell; Feiyu Du; Ryan Demeter; Tammi L Vickery; Adnan Elhammali; Helen Piwnica-Worms; Sandra McDonald; Mark Watson; David J Dooling; David Ota; Li-Wei Chang; Ron Bose; Timothy J Ley; David Piwnica-Worms; Joshua M Stuart; Richard K Wilson; Elaine R Mardis
Journal:  Nature       Date:  2012-06-10       Impact factor: 49.962

8.  A new genotype of bovine leukemia virus in South America identified by NGS-based whole genome sequencing and molecular evolutionary genetic analysis.

Authors:  Meripet Polat; Shin-Nosuke Takeshima; Kazuyoshi Hosomichi; Jiyun Kim; Taku Miyasaka; Kazunori Yamada; Mariluz Arainga; Tomoyuki Murakami; Yuki Matsumoto; Veronica de la Barra Diaz; Carlos Javier Panei; Ester Teresa González; Misao Kanemaki; Misao Onuma; Guillermo Giovambattista; Yoko Aida
Journal:  Retrovirology       Date:  2016-01-12       Impact factor: 4.602

9.  Exposure to Bovine Leukemia Virus Is Associated with Breast Cancer: A Case-Control Study.

Authors:  Gertrude Case Buehring; Hua Min Shen; Hanne M Jensen; Diana L Jin; Mark Hudes; Gladys Block
Journal:  PLoS One       Date:  2015-09-02       Impact factor: 3.240

10.  The landscape of viral expression and host gene fusion and adaptation in human cancer.

Authors:  Ka-Wei Tang; Babak Alaei-Mahabadi; Tore Samuelsson; Magnus Lindh; Erik Larsson
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

View more
  15 in total

1.  Solution Conformation of Bovine Leukemia Virus Gag Suggests an Elongated Structure.

Authors:  Dominic F Qualley; Sarah E Cooper; James L Ross; Erik D Olson; William A Cantara; Karin Musier-Forsyth
Journal:  J Mol Biol       Date:  2019-02-04       Impact factor: 5.469

Review 2.  Can Bovine Leukemia Virus Be Related to Human Breast Cancer? A Review of the Evidence.

Authors:  Lucia Martinez Cuesta; Pamela Anahi Lendez; Maria Victoria Nieto Farias; Guillermina Laura Dolcini; Maria Carolina Ceriani
Journal:  J Mammary Gland Biol Neoplasia       Date:  2018-05-18       Impact factor: 2.673

Review 3.  Oncogenic Viruses and Breast Cancer: Mouse Mammary Tumor Virus (MMTV), Bovine Leukemia Virus (BLV), Human Papilloma Virus (HPV), and Epstein-Barr Virus (EBV).

Authors:  James S Lawson; Brian Salmons; Wendy K Glenn
Journal:  Front Oncol       Date:  2018-01-22       Impact factor: 6.244

4.  Bovine leukemia virus DNA associated with breast cancer in women from South Brazil.

Authors:  Daniela Schwingel; Ana P Andreolla; Luana M S Erpen; Rafael Frandoloso; Luiz C Kreutz
Journal:  Sci Rep       Date:  2019-02-27       Impact factor: 4.379

Review 5.  Catching viral breast cancer.

Authors:  James S Lawson; Wendy K Glenn
Journal:  Infect Agent Cancer       Date:  2021-06-10       Impact factor: 2.965

6.  Multiple oncogenic viruses are present in human breast tissues before development of virus associated breast cancer.

Authors:  James S Lawson; Wendy K Glenn
Journal:  Infect Agent Cancer       Date:  2017-10-16       Impact factor: 2.965

7.  Bovine leukemia virus relation to human breast cancer: Meta-analysis.

Authors:  Andrew Gao; Valentina L Kouznetsova; Igor F Tsigelny
Journal:  Microb Pathog       Date:  2020-07-27       Impact factor: 3.738

8.  Development of an indirect ELISA based on recombinant capsid protein to detect antibodies to bovine leukemia virus.

Authors:  Ana Paula Andreolla; Luana Marina Scheer Erpen; Rafael Frandoloso; Luiz Carlos Kreutz
Journal:  Braz J Microbiol       Date:  2018-05-22       Impact factor: 2.476

Review 9.  Breast Cancer Gone Viral? Review of Possible Role of Bovine Leukemia Virus in Breast Cancer, and Related Opportunities for Cancer Prevention.

Authors:  Gertrude C Buehring; Hannah M Sans
Journal:  Int J Environ Res Public Health       Date:  2019-12-27       Impact factor: 3.390

Review 10.  Regulation of Expression and Latency in BLV and HTLV.

Authors:  Aneta Pluta; Juan P Jaworski; Renée N Douville
Journal:  Viruses       Date:  2020-09-25       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.