Andrew E Bruno, Jeffrey C Miecznikowski, Maochun Qin, Jianmin Wang, Song Liu.
Abstract
BACKGROUND: Gene fusions are the result of chromosomal aberrations and encode chimeric RNA (fusion transcripts) that play an important role in cancer genesis. Recent advances in high-throughput transcriptome sequencing have given rise to computational methods for new fusion discovery. The ability to simulate fusion transcripts is essential for testing and improving those tools.
Year: 2013 PMID: 23323884 PMCID: PMC3637076 DOI: 10.1186/1471-2105-14-13
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Gene selection options in FUSIM
| Mode | Selection methods | Option description |
| --- | --- | --- |
| Random (default) | uniform | Limit all fusions to a specific geneId, transcriptId, or chrom |
| | | Filter for gene1 |
| | | Filter for gene2 |
| | | Filter for gene3 |
| Background | uniform, empirical, binned | Path to a BAM file containing background reads. Genes are selected for fusions according to the read profile of the background reads |
| | | RPKM cutoff when using a background BAM file. Genes below the cutoff are ignored |
| | | Method to use when selecting genes for fusions |
| | | Number of threads to spawn when processing the background BAM file |
Gene selection modes and corresponding options in FUSIM.
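The background mode in the table above can be illustrated with a minimal Python sketch. It is not FUSIM's implementation: the function names, the dictionary inputs, and the reading of "uniform" (equal chance among genes passing the cutoff) and "empirical" (chance proportional to RPKM) are assumptions made for illustration, and the "binned" method is omitted because its details are not given here. Read counts are assumed to have already been tallied per gene from the background BAM file.

```python
import random


def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


def select_genes(read_counts, gene_lengths, n, cutoff=1.0,
                 method="uniform", seed=None):
    """Pick n distinct genes from a background read profile.

    Hypothetical helper: read_counts and gene_lengths are dicts keyed by
    gene name. Genes with RPKM below `cutoff` (or with no reads) are ignored.
    """
    rng = random.Random(seed)
    total = sum(read_counts.values())
    if total == 0:
        raise ValueError("background profile contains no reads")

    scores = {g: rpkm(c, gene_lengths[g], total)
              for g, c in read_counts.items()}
    pool = [g for g, s in scores.items() if s >= cutoff and s > 0]
    if n > len(pool):
        raise ValueError("not enough genes pass the RPKM cutoff")

    if method == "uniform":
        # Every gene that passes the cutoff is equally likely.
        return rng.sample(pool, n)

    if method == "empirical":
        # Assumed interpretation: weight each gene by its RPKM and
        # draw without replacement.
        chosen = []
        for _ in range(n):
            weights = [scores[g] for g in pool]
            pick = rng.choices(pool, weights=weights, k=1)[0]
            chosen.append(pick)
            pool.remove(pick)
        return chosen

    raise ValueError(f"unsupported method: {method}")
```

With counts of 900, 99, and 1 reads for three 1 kb genes and a cutoff of 5000, only the first two genes remain eligible, so a request for two genes returns both under either method.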
Figure 1. Fusion transcript simulation. Example of fusion transcript simulation. (a) Original transcripts of three selected genes: NPS, PR44, ATP5I. Boxes represent exons and solid lines refer to introns. (b) Illustration of three basic types of fusion transcripts. Hybrid fusions use exons from two distinct genes, Self fusions join exons from a single gene, Complex fusions use exons from three distinct genes. (c) Example of the available options for controlling fusion transcript generation. Split exons randomly selects breakpoints in the exons involved. Keep exon boundary forces fusion breakpoints to fall on exon boundaries. CDS only creates fusions using exons within the coding sequence region. Foreign insertion inserts a randomly generated sequence between fusion breakpoints. Auto-correct orientation forces FUSIM to correct the orientation of exons.
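The hybrid fusion type and three of the options in the caption above (keep exon boundary, split exons, foreign insertion) can be sketched in Python as follows. This is an illustrative toy, not FUSIM's algorithm: the function `hybrid_fusion`, its parameters, and the convention that the first gene contributes its leading exons and the second gene its trailing exons are all assumptions. Exons are modeled as plain, non-empty sequence strings.

```python
import random


def hybrid_fusion(exons1, exons2, split_exons=False, foreign_len=0, seed=None):
    """Join leading exons of gene 1 to trailing exons of gene 2.

    Hypothetical sketch. Returns (fusion_sequence, junction), where
    `junction` is the length of the gene 1 portion of the sequence.
    """
    rng = random.Random(seed)

    # Keep exon boundary (default): breakpoints fall between exons.
    i = rng.randint(1, len(exons1))        # keep exons1[:i]
    j = rng.randint(0, len(exons2) - 1)    # keep exons2[j:]
    left = list(exons1[:i])
    right = list(exons2[j:])

    if split_exons:
        # Split exons: move each breakpoint to a random position inside
        # the exon adjacent to the junction, keeping at least one base.
        left[-1] = left[-1][:rng.randint(1, len(left[-1]))]
        right[0] = right[0][rng.randint(0, len(right[0]) - 1):]

    # Foreign insertion: random bases placed between the breakpoints.
    foreign = "".join(rng.choice("ACGT") for _ in range(foreign_len))

    head = "".join(left)
    return head + foreign + "".join(right), len(head)


if __name__ == "__main__":
    gene1 = ["AAAA", "CCCC"]
    gene2 = ["GGGG", "TTTT"]
    seq, junction = hybrid_fusion(gene1, gene2, foreign_len=3, seed=0)
    print(seq[:junction], seq[junction:junction + 3], seq[junction + 3:])
```

Returning the junction position alongside the sequence makes it straightforward to check afterwards that a fusion detection tool reports the breakpoint at the simulated location.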
Figure 2. Example FUSIM output. Example of simulated fusion transcripts generated by FUSIM. (a) Text output of two simulated fusion transcripts NPS-RPL38 (inter-chromosome hybrid fusion) and MMP14-LRP10 (readthrough fusion). (b) FASTA output of the raw sequence data showing fusion junctions. (c) Results of BLAT search using the FASTA sequences in (b) validating FUSIM output. The black square boxes represent the exons from RefSeq genes and the colored boxes represent the exons from the gene fusion generated by FUSIM.