| Literature DB >> 23872706 |
E Braggio1, J B Egan, R Fonseca, A K Stewart.
Abstract
Next-generation sequencing has led to a revolution in the study of hematological malignancies with a substantial number of publications and discoveries in the last few years. Significant discoveries associated with disease diagnosis, risk stratification, clonal evolution and therapeutic intervention have been generated by this powerful technology. As part of the post-genomic era, sequencing analysis will likely become part of routine clinical testing and the challenge will ultimately be successfully transitioning from gene discovery to preventive and therapeutic intervention as part of individualized medicine strategies. In this report, we review recent advances in the understanding of hematological malignancies derived through genome-wide sequence analysis.Entities:
Year: 2013 PMID: 23872706 PMCID: PMC3730204 DOI: 10.1038/bcj.2013.26
Source DB: PubMed Journal: Blood Cancer J ISSN: 2044-5385 Impact factor: 11.037
Figure 1Evolution of genetic detection methods and discoveries. Landmark findings from each method are indicated.
Summary of high-throughput sequencing methods
| Whole genome | 10 ng–1 μg genomic DNA | Small input DNA requirement | Lower DNA input may reduce library complexity and representation |
| Variant detection in all regions of the genome | PCR duplicates can impact accuracy of variant detection software | ||
| Computationally intensive analysis | |||
| Mate pair | 5–10 μg genomic DNA | Identification of large structural rearrangements | Large input DNA requirement |
| High false discovery rate | |||
| Whole exome | 1 μg genomic DNA | Deep coverage of exome enabling precise interrogation of coding regions | Non-coding regions excluded |
| Multiple samples can be pooled and run together reducing time and cost per sample | Standard capture kits do not capture all exons | ||
| mRNASeq | 100–400 ng total RNA | Dynamic range of expression detection can be much broader than using microarrays | RNA fragmentation methods can bias the resulting library |
| Detection of rare and hybrid transcripts | Artifacts from amplified cDNA libraries[ | ||
| Precise quantitation of highly expressed transcripts and multiple isoforms | Appropriate normal controls may be difficult to obtain for tumor/normal comparison | ||
| Investigation of 3'UTR and promoters | |||
| ChIPSeq | 10 ng ChIP enriched DNA | Detection of DNA–protein interactions | Quality of sequencing results dependent on the quality of ChIP assay |
| Discovery of new interactions in regions not represented on microarray chips | Library preparation can introduce GC-rich region bias | ||
| Avoids hybridization problems associated with array-based ChIP assays | |||
| Single molecule | 1 μg genomic DNA | No amplification step resulting in no PCR duplicates | High error rate |
| Long-read length (>1 kb) | Throughput not comparable to current platforms |
Abbreviations: ChIPSeq, chromatin immunoprecipitation sequencing; UTR, untranslated region.
Cost per sample is highly variable depending on the platform and on the amount of multiplexing utilized.
May vary by platform and approach.
May require polyA RNA- or rRNA-depleted total RNA.
Figure 2Schematic of a bioinformatics pipeline. Examples of the most commonly used publicly available software programs utilized at a particular step are in parentheses. The programs listed were the most commonly used in 2012 hematological malignancy sequencing analyses. These are only examples and are not intended to be an exhaustive list. The number of publicly available tools is rapidly expanding and review of these tools is beyond the scope of this report.
Figure 3Most frequent somatic genetic mutations per hematological malignancy. Only original data from massively parallel sequencing were included, excluding confirmation data in previously mutated genes (for example, ATM and TP53 in CLL, FLT3 in AML, RAS and TP53 in MM, MYD88 in DLBCL, mutations in the nuclear factor-kB pathways in DLBCL and MM). In cases where data were obtained from multiple studies, the data originating from the largest cohort were included.
Summary of high-throughput sequencing studies performed so far in hematological malignancies
| ALL | 1 | 24 | mRNASeq | GAII | NR | DPEP1 (4%), longitudinal detection of PLXNB2and CXorf21 | 2012 | [ |
| ALL(Phlike) | 15 | 231 | WGS/mRNASeq | GAIIx/HiSeq | NR | NUP214-ABL1 fusion (2%), IK2F1 (67%) | 2012 | [ |
| ALL (ETP) | 12 | 94 | WGS | GAII | 33 | RAS pathway (67%), hematopoiesis and lymphoid development (58%), histone modification (42%) | 2012 | [ |
| ALL (T) | 11 | — | WES/mRNASeq | HiSeq | 55/15 | DNMT3A (17%), JARID2 (8%), IDH2 (8%),EZH2(17%) | 2012 | [ |
| AML | 1 | — | WES | HiSeq2000 | NR | Leukemic transformation from SCN to AML | 2012 | [ |
| AML | 8 | — | WGS/deep sequencing | GAIIx | 25/590 | Clonal evolution | 2012 | [ |
| AML | 5 | 160 | WES | GAIIx | NR | GATA2 (39%) with biallelic CEBPA mutation | 2012 | [ |
| AML | 2 | 3 | ChIPSeq | GAII/HiSeq | NR | Differential H3K4me3/H3K27me3 gene enrichment of stem and progenitor cells | 2013 | [ |
| AML-CN | 1 | 95 | mRNASeq | GAIIx | NR | TLE4 (2%), SHKBP1 (2%) | 2011 | [ |
| AML-CN | 1 | 262 | WES | GAIIx | 69 | BCOR (4%), DNMT3A (13%) | 2011 | [ |
| AML-CN | 7 | 230 | mRNASeq | HiScanSQ | 36 | CBFA2T3-GLIS2 fusion (8%) | 2013 | [ |
| AML-M1 | 1 | — | WGS | GA | 33 | First genome sequenced | 2008 | [ |
| AML-M1 | 1 | 187 | WGS | GAII | 23 | IDH1 (8%) | 2009 | [ |
| AML-M1 | 1 | 281 | WGS | GAII | 39 | DNMT3A (22%) | 2010 | [ |
| AML-M5 | 14 | 98 | WES | GAIIx | 97 | DNMT3A (21%) | 2011 | [ |
| sAML | 7 | 200 | WGS/WES | GAIIx/HiSeq | 34 | UMODL1 (29%), SMC3 (14%), CDH23, ZSWIM4 (14%) | 2012 | [ |
| BL | 28 | 78 | mRNASeq | HiSeq2000 | NR | ID3 (59%), TCF3 (29%), CCND3 (15%) | 2012 | [ |
| BL | 4 | 97 | WGS/WES/mRNASeq/MethylSeq | GAII/HiSeq | 32/121 | ID3 (42%) | 2012 | [ |
| BL | 14 | 45 | WES | GAIIx/HiSeq | 47 | ID3 (34%) | 2012 | [ |
| CLL | 4 | 363 | WGS/WES | GAIIx | 40/119 | NOTCH1 (12%), MYD88 (3%) | 2011 | [ |
| CLL | 5 | 226 | WES | Genome Sequencer FLX | 10 | NOTCH1 (17%) | 2011 | [ |
| CLL | 3/88 | 101 | WGS/WES | GAII | 38/132 | SF3B1 (15%), MYD88 (10%) | 2011 | [ |
| CLL | 105 | 279 | WES | GAIIx | 62 | SF3B1 (10%), NOTCH1 (10%) | 2012 | [ |
| CLL | 7 | 103 | RNASeq/WGS | GAII | NR/12 | 2013 | [ | |
| CLL | 160 | — | WES | GAIIx/HiSeq | 112 | Patterns of clonal evolution | 2013 | [ |
| DLBCL | 6 | 105 | WES | Genome Sequencer FLX | 10 | MLL2 (24%), regulation of immune response (63% ABC, 31% GCB) | 2011 | [ |
| DLBCL | 13/83 | 37 | WGS/mRNASeq | GAIIx/HiSeq | 32/41 | MLL2 (32%) MEF2B (11%), histone modification (13%), lymphocyte activation | 2011 | [ |
| DLBCL | 49 | — | WES | HiSeq | 150 | Histone H1 proteins (69%), ACTB (10%),P2RY8 (12%), PCLO (35%) | 2012 | [ |
| DLBCL | 34 | 39 | WGS/WES | GAII/HiSeq | 29/47 | Signal transduction | 2013 | [ |
| FL | 1/12 | 35 | WGS/mRNASeq | GAIIx/HiSeq | 9/28 | MLL2 (89%), MEF2B (13%), histone modification (15%), lymphocyte activation | 2011 | [ |
| HCL | 1 | 47 | WES | GAIIx | 71 | BRAF V600E (100%) | 2011 | [ |
| MCL | 18 | 108 | mRNASeq | GAII | NR | NOTCH1 (12%), CCND1 (19%) | 2012 | [ |
| MM | 23/16 | 161 | WGS/WES | GAII | 33/104 | Protein translation (42%), HOX9 pathway (29%) | 2011 | [ |
| MM | 22 | 127 | WES | GAIIx | 61 | Distinct mutation patterns between t(4;14)and t(11;14) | 2012 | [ |
| MM | 1 | — | WGS | SOLiD/HiSeq | 30 | Genomic evolution and clonal tides over course of disease | 2012 | [ |
| MDS | 9 | 354 | WES | GAIIx | NR | SF3B1 (67%) | 2011 | [ |
| MDS | 29 | 582 | WES | GAIIx/HiSeq | 134 | RNA splicing (55%) | 2011 | [ |
| MDS | 1 | 150 | WGS | GAIIx/HiSeq | 39 | U2AF1 (9%) | 2012 | [ |
| MDS/MPN | 15 | 310 | WESmRNASeq | HiSeq | NR | RNA splicing, SRSF2 (24%), clinical outcomes associated with mutations | 2012 | [ |
| MPN | 40 | — | WES | HiSeq | NR | SUZ12 (3%) | 2011 | [ |
| NHL(B-cell) | 2 | 263 | mRNASeq | GAII | NR | CIITA translocations (16%) | 2011 | [ |
| PCNSL | 4 | 25 | WES | GAIIx | NR | MYD88 L265P (38%), TBL1XR1 (14%) | 2012 | [ |
| SMZL | 6 | 93 | WGS | Complete genomics | 80 | MLL2 (50%), NOTCH2 (25%) | 2012 | [ |
| SMZL | 8 | 109 | WES | HiSeq | 111 | NOTCH2 (21%), NOTCH1 (5%), SPEN (5%),DTX1 (2%) | 2012 | [ |
| WM | 30 | 54 | WGS | Complete genomics | 66 | MYD88 L265P (91%) | 2012 | [ |
Abbreviations: ALL, acute lymphocytic leukemia; AML, acute myelogenous leukemia; BL, Burkitt's lymphoma; ChIPSeq, chromatin immunoprecipitation sequencing; CLL, chronic lymphocytic leukemia; DLBCL, diffuse large B-cell lymphoma; ETP ALL, early T-cell precursor acute lymphoblastic leukemia; FL, follicular lymphoma; HCL, hairy cell leukemia; MCL, mantle cell lymphoma; MDS, myelodysplastic syndrome; MethylSeq, methylation sequencing; MM, multiple myeloma; MPN, myeloproliferative neoplasms; mRNASeq, messenger RNA sequencing; NHL, non-hodgkins lymphoma; NR not reported; PCNSL, primary central nervous system lymphoma; SMZL, splenic marginal zone lymphoma; WM, Waldenströms macroglobulinemia.
Platforms: GAII, GAIIx, HiScanSQ and HiSeq2000 are from Illumina, Genome Sequencer FLX from 454 Sequencing Roche and SOLiD from Life Technologies.
Indicates significant pathway enrichment.