| Literature DB >> 22489537 |
Natalia Rego1, Sergio Bianchi, Pilar Moreno, Helena Persson, Anders Kvist, Alvaro Pena, Pablo Oppezzo, Hugo Naya, Carlos Rovira, Guillermo Dighiero, Otto Pritsch.
Abstract
As an approach to determining the aetiology of chronic lymphocytic leukaemia (CLL), we searched for a virus expressed in human CLL B-cells by combining high-throughput sequencing and digital subtraction. Pooled B-cell mRNA transcriptomes from five CLL patients and five healthy donors were sequenced with 454 Life Sciences technology. Human reads were excluded by BLAST (Basic Local Alignment Search Tool) and BLAT (BLAST-like alignment tool) searches. Remaining reads were screened with BLAST against viral databases. Purified B-cells from two CLL patients, with and without stimulation by phorbol-esters, were sequenced using Illumina technology to achieve depth of sequencing. Burrows-Wheeler Aligner mapping and BLAST searches were used for the Illumina data. Pyrosequencing resulted in about 400 000 reads per sample. No viral candidate could be found. Illumina single-end sequencing for 115 cycles yielded an average of 26 ± 2·5 million filtered reads per sample, of which 2·2 ± 0·6 million remained unmapped to human references. BLAST searches of these reads against viral and human databases assigned nine reads to an Epstein-Barr virus origin, in one sample following phorbol-ester stimulation. Other reads showing a putative viral origin were dismissed after further analysis. Despite an in-depth analysis of the CLL transcriptome reaching more than 100 million sequences, we have not found evidence for a putative viral candidate in CLL.Entities:
Mesh:
Year: 2012 PMID: 22489537 PMCID: PMC7161782 DOI: 10.1111/j.1365-2141.2012.09116.x
Source DB: PubMed Journal: Br J Haematol ISSN: 0007-1048 Impact factor: 6.998
Clinical and molecular characteristics of patients
| Patient | Sex | Age (years) | Binet | CD38 |
| MS | Outcome |
|---|---|---|---|---|---|---|---|
| 454 pool | |||||||
| CLL 046 | F | 45 | A | Neg | Pos | UM | Progressor |
| CLL 072 | M | 61 | C | Neg | Pos | UM | Indolent |
| CLL 080 | F | 63 | C | Neg | Pos | UM | Progressor |
| CLL 083 | M | 60 | B | Pos | ND | UM | Progressor |
| CLL 096 | M | 70 | C | Neg | Neg | UM | Progressor |
| Illumina | |||||||
| CLL 238 | M | 76 | C | Neg | Neg | M | Progressor |
| CLL 250 | M | 57 | B | ND | Pos | UM | Progressor |
Binet, Binet stage; LPL, lipoprotein lipase; MS, mutational status; F, female; M, male; Neg, negative; Pos, positive; ND, no data; UM, unmutatated; M, mutated.
Figure 1Computational subtraction approach developed to identify non‐human sequences in the chronic lymphocytic leukaemia (CLL) transcriptome. (A) Subtraction strategy for 454 data. (B) Subtraction strategy for Illumina data.
Illumina sequencing, filtering and mapping statistics
| Reads | CLL250 | CLL250_act | CLL238 | CLL238_act |
|---|---|---|---|---|
| Raw counts | 34 248 055 | 38 889 936 | 33 948 020 | 38 421 323 |
| Filtering steps | ||||
| Purity filter | 28 771 020 | 32 477 113 | 29 210 502 | 31 924 855 |
| Additional filter | 22 479 555 | 27 956 701 | 26 039 540 | 27 613 647 |
| Burrows‐Wheeler Aligner mapping steps | ||||
| Ribosomal | 5 050 513 | 5 022 371 | 3 851 214 | 4 627 416 |
| Genomic | 13 790 644 | 18 914 191 | 18 738 268 | 17 791 494 |
| Transcriptomic | 1 634 399 | 2 050 829 | 1 677 217 | 2 144 262 |
| Final unmapped | 2 004 005 | 1 969 310 | 1 772 841 | 3 050 475 |
BLASTN results of unmapped reads
| CLL250 | CLL250_act | CLL238 | CLL238_act | |
|---|---|---|---|---|
| Initial viral reads | 230 221 | 200 276 | 163 277 | 284 222 |
| Non‐human reads | 689 | 551 | 705 | 790 |
| Final viral reads | 0 | 0 | 0 | 9 |
All initial viral reads were BLASTN searched in human databases. Non‐human reads are those that were not aligned and, if aligned, those classified by the discriminant function either as ‘viral’ or ‘human’ with posteriors lower than 0·9.
After a careful inspection of BLASTN searches in the non‐redundant database, only nine reads were conclusively assigned to a viral origin.
Figure 2Taxonomy of BLASTN results of the final candidate viral reads in the non‐redundant database. The shown classes were defined arbitrarily (online Data S1 and Table SII). Only the ‘Epstein‐Barr virus’ class involves reads with a final viral assignment. MPMV, Mason‐Pfizer monkey virus.