| Literature DB >> 30918253 |
Erin E Heyer1, Ira W Deveson1,2, Danson Wooi1,2, Christina I Selinger3, Ruth J Lyons1, Vanessa M Hayes1,2,4,5,6, Sandra A O'Toole2,3,6,7,8, Mandy L Ballinger7, Devinder Gill9, David M Thomas7, Tim R Mercer10,11,12, James Blackburn13,14.
Abstract
Fusion genes are a major cause of cancer. Their rapid and accurate diagnosis can inform clinical action, but current molecular diagnostic assays are restricted in resolution and throughput. Here, we show that targeted RNA sequencing (RNAseq) can overcome these limitations. First, we establish that fusion gene detection with targeted RNAseq is both sensitive and quantitative by optimising laboratory and bioinformatic variables using spike-in standards and cell lines. Next, we analyse a clinical patient cohort and improve the overall fusion gene diagnostic rate from 63% with conventional approaches to 76% with targeted RNAseq while demonstrating high concordance for patient samples with previous diagnoses. Finally, we show that targeted RNAseq offers additional advantages by simultaneously measuring gene expression levels and profiling the immune-receptor repertoire. We anticipate that targeted RNAseq will improve clinical fusion gene detection, and its increasing use will provide a deeper understanding of fusion gene biology.Entities:
Mesh:
Year: 2019 PMID: 30918253 PMCID: PMC6437215 DOI: 10.1038/s41467-019-09374-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Overview of targeted RNAseq and panel validation. a Schematic of targeted RNAseq process. b Scatterplot of targeted RNAseq enrichment for ERCCs included on (blue) or excluded from (orange) the blood panel. c Abundance of captured ERCCs before and after targeted sequencing on blood panel. d Metagene plot of K562 targeted RNAseq read coverage across all genes on the blood panel. TS=Transcript Start site; TE=Transcript End site
Summary of cell line fusion genes and mapping statistics
| Panel | Cancer type | Sample | Detected fusion genes | Uniquely mapped reads (million) | On-target capture rate (%) |
|---|---|---|---|---|---|
| Blood | Bone marrow | K562 RNASeq | BCR-ABL1, NUP214-XKR3 | 46.0 | 3 |
| K562 | BCR-ABL1, NUP214-XKR3 | 10.7 | 98 | ||
| K562 1:10 | BCR-ABL1, NUP214-XKR3 | 49.7 | 72 | ||
| K562 1:100 | BCR-ABL1, NUP214-XKR3 | 4.9 | 91 | ||
| K562 1:1000 | BCR-ABL1, NUP214-XKR3a | 29.0 | 81 | ||
| K562 1:10,000 | BCR-ABL1a | 11.4 | 87 | ||
| T-cell | KARPAS45 | KMT2A-FOXO4 | 16.9 | 97 | |
| WT | GM12878 | – | 10.4 | 98 | |
| Solid | Sarcoma | 143B | EXOC2-MET, PAFAH1B2-FOXR1, ERG-LINC00240 | 27.5 | 92 |
| GOT3 | GPC6-WIF1, WNK1-ERC1, PPARD-IRF2BP2 | 27.1 | 93 | ||
| MLS1765-92 | FUS-DDIT3, CREB1-METTL21A | 20.1 | 93 | ||
| RDES RNAseq | EWSR1-FLI1 | 31.0 | 5 | ||
| RDES | EWSR1-FLI1, SMC04-EWSR1, FUS-DDIT3 | 30.4 | 88 |
aIndicates fusion gene identified by either STARfusion or FusionCatcher, but not both
Fig. 2Validation of targeted RNAseq for fusion gene detection. a Diagram of BCR-ABL1 fusion gene and transcript, depicting spanning and junction reads used to identify fusion genes. b Bar charts comparing abundance of fusion reads from targeted and canonical RNASeq libraries in K562 (top) and RDES (bottom) cell lines. c Scatterplot of observed (blue dots) and expected (red dots) BCR-ABL1 read counts in K562 dilution series. d Scatterplot of fusion sequin junction reads versus input concentration
Fig. 3Fusion identification in clinical cohort samples. a FISH identification of ROS1 rearrangement in lung cancer sample MO-16-000393. Positive signal is 1 fused set of red and green dots and ≥1 isolated green dots per cell. White arrows point to fused dots; grey arrows point to green dots. b FISH identification of ALK rearrangement in lung cancer sample SP-15-11000. Positive signal is 1 fused set of red and green dots, 1 isolated red and 1 isolated green dot per cell. White arrows point to fused dots; grey arrows point to isolated red and green dots. c RT-PCR analysis to diagnose TMPRSS2-ERG fusion genes in prostate samples. * indicates TMPRSS2-ERG bands. Source data are provided as a Source Data file. d Overview of fusion gene identification in all clinical cohort samples; each oval represents one patient. Other blood cancers includes chronic lymphocytic leukaemia, multiple myeloma and uncategorised blood cancer patients. BMA=bone marrow aspirate; PB=peripheral blood; FFPE=formalin-fixed paraffin-embedded. e Read coverage across EZR and ROS1 genes in lung cancer patient sample MO-16-000393. Dotted line marks fusion junction of EZR-ROS1 fusion gene
Fusion genes found within the clinical cohort
| Panel | Cancer type | Fusion genes detected with targeted RNAseq | FISH & RT-PCR | Targeted RNAseq |
|---|---|---|---|---|
| Blood | Acute lymphoblastic leukaemia | KMT2A-AFF1, AFF1-KMT2A, RUNX1-RUNX1T1, TCF3-PBX1, AFF1-MYC, TAF15-ZNF384, ZNF384-TAF15 | 2/4 | 5/5 |
| Acute myeloid leukaemia | CBFB-MYH11, NSD1-NUP98, RUNX1-RUNX1T1, RUNX1T1-RUNX1, KMT2A-MLLT3, DEK-NUP214, NUP214-DEK, MN1-ETV6, ETV6-MN1, DDX3X-MLLT10, KMT2A-SEPT9, SEPT9-KMT2A, PML-RARA, RARA-PML | 2/9 | 9/15 | |
| Chronic myeloid leukaemia | BCR-ABL1, RUNX1-RUNX1T1 | 3/4 | 5/5 | |
| Lymphoma | MYC-IGH, IGH-BCL6 | 3/4 | 3/4 | |
| Other blood cancers | FGFR1-ZMYM2 | 0/1 | 1/3 | |
| Solid | Lung | EZR-ROS1, EML4-ALK | 2/2 | 2/2 |
| Prostate | TMPRSS2-ERG, ACSL3-ETV1, SP3-CTU2, SLC45A3-SKIL | 10/20 | 14/20 | |
| Sarcoma | SS18-SSX1, SS18-SSX2/2B, FUS-DDIT3, DDIT3-FUS, EWSR1-ERG, EWSR1-FLI1, PATZ1-EWSR1 | 17/18 | 16/18 |
Columns on the right indicate the number of patient samples with a positive fusion gene diagnosis from prior clinical assessment or targeted RNAseq; discrepancies in total sample number reflect the lack of available clinical data
Fig. 4Fusion junction diversity and gene expression. a Schematic of BCR-ABL1 fusion isoforms +/−BCR exon 14. b TMPRSS2 and ERG gene structures and TMPRSS2-ERG fusion isoform prevalence. Bar charts on the right indicate the number of samples expressing each isoform. For simplicity, junctions beyond exon 1 are depicted utilising exon 1a. Black line represents retained intronic sequence. c, d Schematic of EZR-ROS1 and ACLS3-ETV1 fusions and quantification of read count expression across the endogenous genes in lung cancer sample MO-16-000393 and prostate cancer sample 12543, respectively. Horizontal lines indicate mean expression levels; coloured dots represent expression from the fused alleles plus nonrearranged alleles, while the grey dots represent expression of the canonical, nonrearranged alleles
Fig. 5Novel findings in transcriptomic analysis. a Schematic of immune receptor capture probe design across the T cell receptor β (TCRβ) locus and a transcript expressed post-V(D)J rearrangement. b Immune receptor clonotypes in cell lines and clinical patient samples quantified using MiXCR. Each colour represents a single clonotype. c Novel ETV6 exons shown underneath GENCODE v27 annotation. Red arrows indicate exons found in lymphoma samples, blue arrows indicate exons found in leukaemia samples