| Literature DB >> 36009413 |
Elizaveta Rabushko1,2, Maxim Sorokin1,2,3, Maria Suntsova1,2, Alexander P Seryakov4, Denis V Kuzmin2, Elena Poddubskaya1, Anton A Buzdin1,2,5,6.
Abstract
Drugs targeting receptor tyrosine kinase (RTK) oncogenic fusion proteins demonstrate impressive anti-cancer activities. The fusion presence in the cancer is the respective drug prescription biomarker, but their identification is challenging as both the breakpoint and the exact fusion partners are unknown. RNAseq offers the advantage of finding both fusion parts by screening sequencing reads. Paraffin (FFPE) tissue blocks are the most common way of storing cancer biomaterials in biobanks. However, finding RTK fusions in FFPE samples is challenging as RNA fragments are short and their artifact ligation may appear in sequencing libraries. Here, we annotated RNAseq reads of 764 experimental FFPE solid cancer samples, 96 leukemia samples, and 2 cell lines, and identified 36 putative clinically relevant RTK fusions with junctions corresponding to exon borders of the fusion partners. Where possible, putative fusions were validated by RT-PCR (confirmed for 10/25 fusions tested). For the confirmed 3'RTK fusions, we observed the following distinguishing features. Both moieties were in-frame, and the tyrosine kinase domain was preserved. RTK exon coverage by RNAseq reads upstream of the junction site were lower than downstream. Finally, most of the true fusions were present by more than one RNAseq read. This provides the basis for automatic annotation of 3'RTK fusions using FFPE RNAseq profiles.Entities:
Keywords: FFPE; RNA sequencing; RNAseq; clinical oncology; fusion oncogene; receptor tyrosine kinase; tumor molecular diagnostics
Year: 2022 PMID: 36009413 PMCID: PMC9405289 DOI: 10.3390/biomedicines10081866
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Comparative analysis of confirmed and non-confirmed fusion structural and expression features.
| Sample ID | Fusion Type (5′-3′) | ORF Preserved (Yes/No) 1 | TK Domain Retained (Yes/No) 2 | RTK on 5′/3′ End | Presence in Fusion DBs 3 | Greater RTK Downstream Reads (Yes/No) 4 | Expr. of 5′/3′ Moieties 5 | N Reads 6 |
|---|---|---|---|---|---|---|---|---|
| Experimentally confirmed RTK fusions | ||||||||
| AL-98 |
| Yes | Yes | 3′ | Yes | Yes | 1.02/0.47 | 1 |
| FS-1 |
| Yes | Yes | 3′ | Yes | Yes | 0.4/0.35 | 3 |
| LuC-46 |
| Yes | Yes | 3′ | Yes | Yes | 24.53/5.29 | 1 |
| LuC-46 |
| Yes | Yes | 3′ | Yes | Yes | 23.29/6.09 | 5 |
| LuC-46 |
| Yes | Yes | 3′ | Yes | Yes | 23.29/5.87 | 4 |
| LuC-46 |
| No | Yes | 3′ | Yes | Yes | 23.29/5.93 | 1 |
| OC-11 |
| Yes | Yes | 5′ | No * | ND | 0.72/0.43 | 6 |
| OC-11 |
| Yes | No | 3′ | No | No | 5.74/0.29 | 5 |
| BC-105 |
| No | Yes | 3′ | Yes | Yes | 1.28/11.77 | 18 |
| BC-105 |
| Yes | Yes | 3′ | No | No | 0.41/12.41 | 8 |
| Experimentally non-confirmed RTK fusions | ||||||||
| GC-30 |
| Yes | No | 5′ | No | ND | 0.04/0.43 | 1 |
| LuC-71 |
| No | No | 5′ | No | ND | NA | 1 |
| LuC-11 |
| Yes | Yes | 3′ | No | No | 0.02/0.67 | 1 |
| LuC-19 |
| No | No | 5′ | No | ND | 0.83/0.24 | 1 |
| LuC-81 |
| No | No | 5′ | No | ND | 0.22/6.36 | 1 |
| XC-1 |
| No | No | 5′ | No | ND | 0.33/0.31 | 1 |
| OC-15 |
| Yes | No | 5′ | No | ND | 0.45/0.8 | 1 |
| AL-44 |
| No | No | 5′ | No | ND | 0.15/0.6 | 1 |
| AL-7 |
| No | No | 5′ | No | ND | 0.016/0.48 | 1 |
| J1 |
| No | No | 3′ | No | No | 1.94/1.9 | 1 |
| A549 |
| Yes | No | 5′ | No | ND | 1.21/0.52 | 1 |
| SkC-1 |
| No | No | 5′ | No | ND | 0.05/3.58 | 1 |
| BC-59 |
| No | No | 5′ | No | ND | 0.55/1.5 | 1 |
| EpS-1 |
| No | No | 3′ | No | No | 0.34/5.89 | 1 |
| PC-24 |
| Yes | Yes | 3′ | No | No | 0.12/0.023 | 1 |
|
| All confirmed vs. non-confirmed |
|
|
|
|
|
|
|
|
| All confirmed vs. non-confirmed |
|
|
|
|
|
|
|
| Putative RTK fusions that were not experimentally investigated by RT-PCR | ||||||||
| TC-123 |
| No | No | 5′ | No | ND | 0.97/0.22 | 1 |
| N-63 |
| No | No | 3′ | No | Yes | 0.88/0.24 | 8 |
| RAIR-4 |
| Yes | Yes | 3′ | Yes | Yes | 0.69/0.11 | 1 |
| TC-32 |
| Yes | No | 5′ | No | ND | 2.55/0.74 | 1 |
| TC-12 |
| No | Yes | 3′ | No | Yes | 0.46/0.02 | 1 |
| BT-24 |
| Yes | Yes | 5′ | No | ND | 0.23/0.006 | 1 |
| BC-47 |
| Yes | Yes | both | No | No | 0.06/1.36 | 1 |
| FFPE_4-2 |
| Yes | Yes | 3′ | No | No | 0.06/0.46 | 1 |
| AL-84 |
| No | Yes | 3′ | No | Yes | 0.82/0.17 | 1 |
| FFPE-5 |
| No | Yes | 3′ | No | No | NA | 1 |
| INI-1 |
| Yes | No | 3′ | No | Yes | 1.17/0.19 | 1 |
1 Presence of preserved open reading frame. 2 Presence of preserved tyrosine kinase domain. 3 Presence of the chimeric transcript with the same fusion partners in ChimerDB (http://www.kobic.re.kr/chimerdb/ accessed on 21 June 2022), ChiTaRS (http://chitars.md.biu.ac.il/ accessed on 21 June 2022), and TumorFusions (https://tumorfusions.org/ accessed on 21 June 2022) databases (searched 15 June 2022). 4 Statistically significantly greater RTK RNAseq exon coverage downstream to the fusion site (t-test, p-value < 0.05 and/or the coverage of the exons downstream to the fusion site in sample with fusion was higher than 95th percentile of the same exons’ mean coverage in the control group, while the coverage of upstream exons was fitting within the 95% confidence interval (CI) of exon coverage level in samples without fusions), assessed only for the 3′RTK fusions; “ND” stands for the 5′RTK fusions. 5 Averaged expression levels of 5′- and 3′ fusion moieties calculated as RNAseq read counts normalized on lengths of exons. 6 Number of chimeric RNAseq reads. * FGFR2–LGSN transcript is absent from ChimerDB, ChiTaRS, and TumorFusions databases but was previously published in [32].
Figure 1Schematic representation of the CCDC6-RET fusion transcript identified: (A) gene structures upstream and downstream of fusion site; (B) electropherogram of RT-PCR product obtained with primers complementary to the fusion moieties. The deduced PCR product size is 143 bp long; (C) Sanger sequencing of RT-PCR product confirms the fusion of exon 1 of CCDC6 with exon 12 of RET. Black arrows denote position of PCR primers. TK, tyrosine kinase domain within the structure of RET.
Figure 2Schematic representation of fusion transcripts identified in BC105 sample: (A) gene structures upstream and downstream of FBXL20-ERBB2 fusion site; (B) gene structures upstream and downstream of ATP2B1-ERBB2 fusion site. TK, ERBB2-encoded tyrosine kinase domain.
Figure 3Schematic representation of fusion transcripts identified in OC11 sample: (A) gene structures upstream and downstream of FGFR2-LGSN fusion site; (B) gene structures upstream and downstream of RPS24-FGFR2 fusion site. TK, FGFR2-encoded tyrosine kinase domain.
Figure 4Schematic representation of SLC34A2-ROS1 fusion transcripts identified: (A) gene structures upstream and downstream of fusion site for transcript variant 1 (in-frame); (B) gene structures upstream and downstream of fusion site for transcript variant 2 (in-frame); (C) gene structures upstream and downstream of fusion site for transcript variant 3 (in-frame); (D) gene structures upstream and downstream of fusion site for transcript variant 4 (out-frame). TK, ROS1-encoded tyrosine kinase domain.
Figure 5The 3′ fusion partner gene exon coverage, investigated for the 3′RTK fusion transcripts: (A) length-normalized RNAseq reads coverage of ABL1 gene in sample AL98 (BCR-ABL1 fusion); (B) length-normalized RNAseq reads coverage of RET gene in sample FS1 (CCDC6-RET fusion); (C) length-normalized RNAseq reads coverage of ROS1 gene in sample LuC46 (SLC34A2-ROS1 fusion, all variants); (D) length-normalized RNAseq reads coverage of FGFR2 gene in sample OC11 (RPS24-FGFR2 fusion); (E) length-normalized RNAseq reads coverage of ERBB2 gene in sample BC105 (ATP2B1-ERBB2 fusion); (F) length-normalized RNAseq reads coverage of ERBB2 gene in sample BC105 (FBXL20-ERBB2 fusion); (G) length-normalized RNAseq reads coverage of NTRK2 gene in sample LuC11 (KIF27-NTRK2 fusion); (H) length-normalized RNAseq reads coverage of ABL1 gene in sample J11 (RBM18-ABL1 fusion); (I) length-normalized RNAseq reads coverage of FGFR1 gene in sample EpS1 (HMBOX1-FGFR1 fusion); (J) length-normalized RNAseq reads coverage of FGFR2 gene in sample PC24 (DOCK1-FGFR2 fusion). (A–F), verified fusions (blue); (G–J), non-verified fusions (orange).
Figure 6Gene exon coverage comparison in sample with verified 3′RTK fusion and the other cancer samples of the same type. Double normalized exon coverage by RNA sequencing reads is shown: (i) by exon length and (ii) by overall gene expression level in a sample. (A) RET gene exon coverage in FS-1 sample (blue) and in all sarcoma samples (orange). (B) ROS1 gene exon coverage in LuC46 sample (blue) and in all lung cancer samples (orange).
Figure 7Schematic representation and RNAseq reads coverage of FGFR2-LGSN fusion transcript identified in OC11 sample: (A) fusion partner gene structures upstream and downstream of fusion site; (B) LGSN exon coverage by normalized RNA sequencing reads in OC-11 sample (blue) and in control ovarian cancer samples without fusion (orange). Double normalized exon coverage by RNA sequencing reads is shown: (i) by exon length and (ii) by overall gene expression level in a sample. *—the coverage of exon in OC-11 sample is higher than the 95th percentile of exon coverage in unrelated control ovarian cancer samples without fusion.