| Literature DB >> 21247443 |
Henrik Edgren1, Astrid Murumagi, Sara Kangaspeska, Daniel Nicorici, Vesa Hongisto, Kristine Kleivi, Inga H Rye, Sandra Nyberg, Maija Wolf, Anne-Lise Borresen-Dale, Olli Kallioniemi.
Abstract
BACKGROUND: Until recently, chromosomal translocations and fusion genes have been an underappreciated class of mutations in solid tumors. Next-generation sequencing technologies provide an opportunity for systematic characterization of cancer cell transcriptomes, including the discovery of expressed fusion genes resulting from underlying genomic rearrangements.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21247443 PMCID: PMC3091304 DOI: 10.1186/gb-2011-12-1-r6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Fusion gene identification by paired-end RNA-sequencing. (a) Identification of fusion gene candidates through selection of paired-end reads, the ends of which align to two different and non-adjacent genes. (b) Identification of the exact fusion junction by aligning non-mapped short reads against a computer generated database of all possible exon-exon junctions between the two partner genes. Separation of true fusions (left) from false positives (right) by examining the pattern of short read alignments across exon-exon junctions. Genuine fusion junctions are characterized by a stacked/ladder-like pattern of short reads across the fusion point. False positives lack this pattern; instead, all junction matching short reads align to the exact same position or are shifted by one to two base pairs. Furthermore, this alignment is mostly to one of the exons.
Identified and validated fusion gene candidates
| Sample | 5' gene | 5' chromosome | 3' gene | 3' chromosome | Number of paired-end reads | Number of junction reads | In frame | Amplified | Genetic rearrangement validated |
|---|---|---|---|---|---|---|---|---|---|
| BT-474 | 17 | 17 | 57 | 72 | Yes | Yes | Yes | ||
| BT-474 | 17 | 17 | 43 | 68 | Yes | Yes | Yes | ||
| BT-474 | 20 | 17 | 41 | 26 | Yes | Yes | Yes | ||
| BT-474 | 20 | 20 | 35 | 14 | No | Yes | Yes | ||
| BT-474 | 20 | 19 | 9 | 12 | No | Yes | Yes | ||
| BT-474 | 17 | 17 | 8 | 7 | Yes | Yes | Yes | ||
| BT-474 | 20 | 20 | 8 | 1 | Yes | No | |||
| BT-474 | 17 | 20 | 4 | 6 | Yes | Yes | Yes | ||
| BT-474 | 13 | 13 | 5 | 3 | No | No | Yes | ||
| BT-474 | 3 | 3 | 6 | 2 | Yes | No | Yes | ||
| BT-474 | 20 | 20 | 4 | 2 | No | Yes | Yes | ||
| SK-BR-3 | 8 | 17 | 28 | 447 | Yes | Yes | Yes | ||
| SK-BR-3 | 20 | 20 | 10 | 20 | Yes | Yes | |||
| SK-BR-3 | 17 | 8 | 13 | 10 | Yes | Yes | Yes | ||
| SK-BR-3 | 5 | 5 | 12 | 6 | Yes | No | Yes | ||
| SK-BR-3 | 14 | 14 | 6 | 6 | Yes | No | Yes | ||
| SK-BR-3 | 3 | 3 | 14 | 5 | Yes | No | |||
| SK-BR-3 | 8 | 8 | 3 | 3 | Yes | Yes | Yes | ||
| SK-BR-3 | 17 | 8 | 38 | 2 | Yes | Yes | Yes | ||
| SK-BR-3 | 20 | 20 | 3 | 2 | Yes | No | Yes | ||
| SK-BR-3 | 20 | 20 | 5 | 9 | Yes | Yes | |||
| KPL-4 | 19 | 19 | 22 | 14 | Yes | No | Yes | ||
| KPL-4 | 12 | 2 | 2 | 6 | Yes | No | Yes | ||
| KPL-4 | 9 | 9 | 4 | 6 | Yes | No | Yes | ||
| MCF-7 | 20 | 17 | 133 | 142 | Yes | Yes | Previously reported | ||
| MCF-7 | 20 | 20 | 17 | 25 | Yes | Yes | Previously reported | ||
| MCF-7 | 17 | 17 | 2 | 7 | Yes | Yes | Previously reported |
A total of 24 novel fusion genes were identified in BT-474, SK-BR-3 and KPL-4. Three fusion genes detected in MCF-7 have been reported before and served as positive controls in our study. Two paired-end reads and two fusion junction spanning short reads were required for selecting a fusion candidate for further validation. In-frame prediction, copy number amplification (at least one of the fusion partner genes) and validation of the genomic rearrangement are indicated. Lower level copy number gains were excluded.
Figure 2Experimental validation of identified breast cancer fusion transcripts. RT-PCR validation of fusions found in MCF-7 and KPL-4 (upper), SK-BR-3 (middle), and BT-474 (lower). Also shown is the marker and the negative control.
Figure 3Genomic structure, validation and functional significance of . (a) Exonic expression of VAPB-IKZF3 is indicated by sequencing coverage (red). Copy number changes measured by array comparative genomic hybridization (aCGH; black dots) in reference to normal copy number (horizontal grey line) and fusion break points (vertical grey line) are indicated. Gene structures are shown below the aCGH data. Arrows below gene structures indicate which strand the genes lie on. Fusion transcript structure is pictured below wild-type (wt) gene structures. (b) Interphase FISH showing amplification of VAPB and IKZF3 and the VAPB-IKZF3 fusion in BT-474. White arrows indicate gene fusions. (c) Expression of the 5' and 3' partner genes and the fusion gene. RPKM denotes reads per kilobase per million sequenced short reads. (d) Quantitative RT-PCR validation of small interfering RNA (siRNA) knock-down efficiency of cells transfected either with a scramble siRNA or with gene-specific siRNAs. Error bars show standard deviation. (e) CTG cell viability analysis of cells transfected either with a scramble siRNA or with gene-specific siRNAs. Asterisks indicate the statistical significance of growth reduction: ***P < 0.001. Error bars show standard deviation.
Figure 4Genomic rearrangements in SK-BR-3 and BT-474. (a) Circos plots representing chromosomal translocations in SK-BR-3 (upper right) and BT-474 (lower left). Chromosomes are drawn to scale around the rim of the circle and data are plotted on these coordinates. Selected chromosomes involved in the fusion events are shown in higher magnification. Each intrachromosomal (red) and interchromosomal (blue) fusion is indicated by an arc. Copy number measured by aCGH is plotted in the inner circle where amplifications are shown in red and deletions in green. N denotes the number of fusion genes per cell line. (b) Fusion gene formation in the ERBB2-amplicon region. Fusion partner genes within and near the amplicon region are connected with black lines (both partners on chromosome 17), or location of the other partner is indicated (partner gene on different chromosomes). Smoothed aCGH profiles (log2) for SK-BR-3 (blue) and BT-474 (red) indicate copy number changes in reference to normal copy number (horizontal grey line). ERBB2, which is not fused (arrow), and chromosomal positions (bottom) are indicated.
Figure 5Exclusive expression of the exons of the 3' partner genes taking part in the fusions. Exonic expression of CEP250 in ZMYND8-CEP250 (upper), ENSG00000236127 in CSE1L-ENSG00000236127 (second from top), GSDMB in TATDN1-GSDMB (second from bottom) and BCAS4 in BCAS3-BCAS4 (lower) is indicated by sequencing coverage (red). Copy number changes measured by aCGH (black dots) in reference to normal copy number (horizontal grey line) and fusion break points (vertical grey line) are indicated. Chromosomal positions and transcript structures are shown below the aCGH data. Transcript structures above and below chromosome coordinates denote forward and reverse strand, respectively.