| Literature DB >> 32180803 |
Gavin Robert Oliver1,2, Garrett Jenkinson1,2, Eric W Klee1,2.
Abstract
Several recent studies have demonstrated the utility of RNA-Seq in the diagnosis of rare inherited disease. Diagnostic rates 35% higher than those previously achievable with DNA-Seq alone have been attained. These studies have primarily profiled gene expression and splicing defects, however, some have also shown that fusion transcripts are diagnostic or phenotypically relevant in patients with constitutional disorders. Fusion transcripts have traditionally been studied as oncogenic phenomena, with relevance only to cancer testing. Consequently, fusion detection algorithms were biased toward the detection of well-known oncogenic fusions, hindering their application to rare Mendelian genetic disease studies. A recent methodology published by the authors successfully tailored a traditional algorithm to the detection of pathogenic fusion events in inherited disease. A key mechanism of decreasing false positive or biologically benign events was comparison to a database of events detected in normal tissues. This approach is akin to population frequency-based filtering of genetic variants. It is predicated on the idea that pathogenic fusion transcripts are absent from normal tissue. We report on an analysis of RNA-Seq data from the genotype-tissue expression (GTEx) project in which known pathogenic fusions are computationally detected at low levels in normal tissues unassociated with the disease phenotype. Examples include archetypal cancer fusion transcripts, as well as fusions responsible for rare inherited disease. We consider potential explanations for the detectability of such transcripts and discuss the bearing such results have on the future profiling of genetic disease patients for pathogenic gene fusions.Entities:
Keywords: GTEx; RNA-Seq; fusion transcript; normal tissue; rare genetic disease
Year: 2020 PMID: 32180803 PMCID: PMC7059617 DOI: 10.3389/fgene.2020.00173
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Dot plots illustrating the number of observations of selected exon-exon fusion transcripts in the GTEx RNA-Seq data by tissue type. Fusion analysis was performed using RNA-Seq data from 8187 samples passing QC, representing 549 individuals and 52 tissue types, extracted from GTEx (version 6p). Fusion transcript identification was performed using STAR-Fusion (Haas et al., 2017) with default settings following STAR (v2.5.2b) two-pass alignment (Dobin et al., 2013). Similar to our previously described methods, preliminary fusion calls were used to maximize sensitivity by avoiding default filters encoded in the callers (Oliver et al., 2019b). Fusion-supporting junction and spanning reads identified by STAR Fusion were combined into a single supporting read count for each event. Fusions (A)–(F) are fusion candidates originating from a cohort analysis of rare disease patients previously published by the authors (Oliver et al., 2019b). Five fusions experimentally validated in the authors’ cohort analysis were not observed in the GTEx database and are not displayed in the figure. SAMD12-EXT1 (A) was detected in the authors’ cohort study and demonstrated to be a pathogenic event responsible for the rare condition of multiple exostoses. Candidate SAMD12-EXT1 fusions sharing the same exon-exon boundaries were later shown to be detectable with limited read support in a subset of tissues for five healthy individuals in GTEx. A selection of alternative exon-exon SAMD12-EXT1 fusions were observed in 10 further healthy individuals. The oncogenic BCR-ABL1 (G) was detectable in 22 healthy individuals, although with limited read support and within a small subset of tissues. Limited read support observed in healthy individuals contrasts strongly with the substantial read support visible in leukemia cell lines (red dots). KANSL1-ARL17B (H) and TFG-GPR128 (I) are previously described polymorphic fusion events, observed here in larger numbers of patients and tissues, with greater read support than the pathogenic or suspected pathogenic fusions originating from the authors’ cohort study. (J) shows the per-individual affected tissue count (PIC) for each healthy individual for which a fusion candidate was detectable. Each dot represents the number of tissues containing the relevant fusion in a single individual. Fusions in (J) are labeled (A–I) corresponding to the fusions appearing in plots (A–I). Pathogenic or potentially pathogenic fusions from the authors’ cohort study are detectable in small numbers of tissues per individual, similarly to the known pathogenic BCR-ABL1 fusion event. Polymorphic fusions are detectable in larger numbers of tissues per healthy individual.
Fusion candidates assessed for presence in the GTEx normal tissue fusion database.
| Yes – ddPCR and PCR of RNA, sequencing of DNA | Causative of severe combined immunodeficiency | No | ||
| Yes – ddPCR and PCR of RNA, aCGH and molecular inversion probe analysis of DNA | Causative of multiple exostoses | Yes | ||
| Yes – ddPCR and PCR of RNA | Potentially pathogenic | Yes | ||
| Yes – ddPCR of RNA | Potentially pathogenic | No | ||
| Yes – ddPCR of RNA | Potentially pathogenic | Yes | ||
| Yes – ddPCR of RNA | Potentially pathogenic | No | ||
| Yes – ddPCR and PCR of RNA, aCGH of DNA | Potentially pathogenic | No | ||
| No – negative ddPCR and PCR of RNA | Potentially pathogenic | Yes | ||
| No – negative ddPCR and PCR of RNA | Potentially pathogenic | Yes | ||
| No – negative ddPCR and PCR of RNA | Potentially pathogenic | Yes | ||
| Yes – extensively published | Oncogenic in several leukemias | Yes | ||
| Yes – extensively published | Oncogenic primarily in prostate cancer | No | ||
| Yes – extensively published | Oncogenic in cholangiocarcinoma and other solid tumors | No | ||
| Yes – extensively published | Oncogenic primarily in lung cancer | No | ||
| Yes – extensively published | Oncogenic primarily in prostate cancer | No | ||
| Yes – extensively published | Polymorphic | Yes | ||
| Yes – extensively published | Polymorphic | Yes |