| Literature DB >> 34134471 |
Paul Kerbs1, Sebastian Vosberg1, Stefan Krebs2, Alexander Graf2, Helmut Blum2, Anja Swoboda3, Aarif M N Batcha4, Ulrich Mansmann4, Dirk Metzler5, Caroline A Heckman6, Tobias Herold1, Philipp A Greif7.
Abstract
Identification of fusion genes in clinical routine is mostly based on cytogenetics and targeted molecular genetics, such as metaphase karyotyping, fluorescence in situ hybridization and reverse-transcriptase polymerase chain reaction. However, sequencing technologies are becoming more important in clinical routine as processing time and costs per sample decrease. To evaluate the performance of fusion gene detection by RNAsequencing compared to standard diagnostic techniques, we analyzed 806 RNA-sequencing samples from patients with acute myeloid leukemia using two state-of-the-art software tools, namely Arriba and FusionCatcher. RNA-sequencing detected 90% of fusion events that were reported by routine with high evidence, while samples in which RNA-sequencing failed to detect fusion genes had overall lower and inhomogeneous sequence coverage. Based on properties of known and unknown fusion events, we developed a workflow with integrated filtering strategies for the identification of robust fusion gene candidates by RNA-sequencing. Thereby, we detected known recurrent fusion events in 26 cases that were not reported by routine and found discrepancies in evidence for known fusion events between routine and RNA-sequencing in three cases. Moreover, we identified 157 fusion genes as novel robust candidates and comparison to entries from ChimerDB or Mitelman Database showed novel recurrence of fusion genes in 14 cases. Finally, we detected the novel recurrent fusion gene NRIP1- MIR99AHG resulting from inv(21)(q11.2;q21.1) in nine patients (1.1%) and LTN1-MX1 resulting from inv(21)(q21.3;q22.3) in two patients (0.25%). We demonstrated that NRIP1-MIR99AHG results in overexpression of the 3' region of MIR99AHG and the disruption of the tricistronic miRNA cluster miR-99a/let-7c/miR-125b-2. Interestingly, upregulation of MIR99AHG and deregulation of the miRNA cluster, residing in the MIR99AHG locus, are known mechanisms of leukemogenesis in acute megakaryoblastic leukemia. Our findings demonstrate that RNA-sequencing has a strong potential to improve the systematic detection of fusion genes in clinical applications and provides a valuable tool for fusion discovery.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34134471 PMCID: PMC8719081 DOI: 10.3324/haematol.2021.278436
Source DB: PubMed Journal: Haematologica ISSN: 0390-6078 Impact factor: 9.941
Summary of the patients’ characteristics.
Statistics for RNA-sequencing, mapping and fusion calling.
Figure 1.Evidence for fusions by routine clinical diagnostics and RNA-sequencing. (A) True fusions detected by Karyotyping, molecular diagnostics (MDx) and RNAsequencing (RNA-seq) in the AMLCG, DKTK, Beat AML and FIMM cohorts. Dark green boxes indicate high evidence, light green boxes indicate low evidence. Gray boxes represent no evidence although the respective method was performed. White boxes indicate that the respective method was not performed, or information was missing. (B) Known fusions detected with high evidence by RNA-seq that were missed or detected with low evidence only by Karyotyping/MDx. (C) Venn diagram summarizing fusions detected with the different methods.
Figure 2.Detection workflow and filtering of fusion events. (A) Detection workflow and number of filtered fusion events by filtering strategies. (B) Ratios of fusion events excluded by Arriba and FusionCatcher in each filter step and cohort. AML: acute myeloid leukemia; FTS: Fusion Transcript Score; PS: Promiscuity Score; RS: Robustness Score.
Figure 3.Genomic origin of fusion events detected by RNA-sequencing. Circos plots of (A) known and (B) unknown fusion gene candidates found in the AMLCG, DKTK, Beat AML and FIMM cohorts, illustrating chromosomal origin of the fusion events. Lines connect the positions of fusion partners. Thickness of lines indicates recurrence. Recurrent fusions are labeled with gene symbols of the partner genes. Blue lines indicate known fusion events, red lines indicate recurrent novel and gray lines show non-recurrent novel fusion events.
Figure 4.Detection and validation of the novel Evidence for the NRIP1-MIR99AHG fusion gene in sample AM-0028-DX determined by various methods. (A) Schematic representation of the fusion transcript as predicted by RNA-sequencing. (B) Gel-electrophoresis of reverse transcriptase polymerase chain reaction analysis of fusion breakpoint and NRIP1 exon 4. Three samples from cytogenetically normal patients with acute myeloid leukemia were used as negative controls. (C) A trace from Sanger sequencing of the fusion breakpoint. (D) Mapping of long reads from Nanopore sequencing of genomic DNA. Each line represents one read, which can be divided at the breakpoints of the fusion. Single parts of the read can be mapped to the positive strand (blue) at one locus with the other part mapped to the negative strand (red) at the other locus. The consensus inversed region is indicated by orange. The mapping structure of a highlighted read at the bottom shows that one part of the read was inversely mapped to the NRIP1 locus, while the other part was mapped to the MIR99AHG locus.
Figure 5.Gene expression of genes involved in fusions. Gene expression of the 5' and 3' partner genes of the respective fusion. Red dots indicate samples positive for the respective fusion, gray dots represent samples negative for the respective fusion. TPM: transcripts per million.