| Literature DB >> 19906715 |
Pora Kim1, Suhyeon Yoon, Namshin Kim, Sanghyun Lee, Minjeong Ko, Haeseung Lee, Hyunjung Kang, Jaesang Kim, Sanghyuk Lee.
Abstract
Chromosome translocations and gene fusions are frequent events in the human genome and have been found to cause diverse types of tumor. ChimerDB is a knowledgebase of fusion genes identified from bioinformatics analysis of transcript sequences in the GenBank and various other public resources such as the Sanger cancer genome project (CGP), OMIM, PubMed and the Mitelman's database. In this updated version, we significantly modified the algorithm of identifying fusion transcripts. Specifically, the new algorithm is more sensitive and has detected 2699 fusion transcripts with high confidence. Furthermore, it can identify interchromosomal translocations as well as the intrachromosomal deletions or inversions of large DNA segments. Importantly, results from the analysis of next-generation sequencing data in the short read archives are incorporated as well. We updated and integrated all contents (GenBank, Sanger CGP, OMIM, PubMed publications and the Mitelman's database), and the user-interface has been improved to support diverse types of searches and to enhance the user convenience especially in browsing PubMed articles. We also developed a new alignment viewer that should facilitate examining reliability of fusion transcripts and inferring functional significance. We expect ChimerDB 2.0, available at http://ercsb.ewha.ac.kr/fusiongene, to be a valuable tool in identifying biomarkers and drug targets.Entities:
Mesh:
Year: 2009 PMID: 19906715 PMCID: PMC2808913 DOI: 10.1093/nar/gkp982
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The number of gene pairs in the ChimerDB 2.0 according to the information source.
Statistics of transcriptome analysis in ChimerDB 2.0
| Class | Interchromosomal | Intrachromosomal | ||||
|---|---|---|---|---|---|---|
| A | A + B | A + B + C | A | A + B | A + B + C | |
| No. of transcripts | 1900 | 6073 | 8833 | 515 | 887 | 1065 |
| mRNA | 479 | 855 | 900 | 110 | 143 | 146 |
| EST | 1247 | 3972 | 5397 | 396 | 677 | 781 |
| NGS | 174 | 1246 | 2536 | 9 | 67 | 138 |
| No. of genes (9358 in total) | 2855 | 6976 | 8710 | 703 | 1276 | 1543 |
| No. of gene pairs | 2209 | 7362 | 10639 | 490 | 909 | 1108 |
| With multiple transcripts | 278 | 807 | 1137 | 144 | 220 | 246 |
| With Solexa evidence | 14 | 14 | 15 | 65 | 67 | 67 |
Figure 2.User interface of ChimerDB 2.0. The search page is designed to support diverse types of search. The ‘search result’ page shows the gene pairs and the disease-related information in OMIM, Sanger CGP and Mitelman’s database with the title and journal name of PubMed articles. Clicking ‘more info’ link shows the detailed information on fusion genes and transcripts as seen in the bottom panel. The ‘alignment view’ shows the hypothetical fusion gene (head gene in blue, tail gene in red) and the candidate fusion transcript (in magenta) along with the UCSC-annotated genes (exons in black, UTRs in grey). The repeat regions and the Pfam domains are indicated in green and orange colors, respectively. Clicking on the alignment picture opens a magnified view. The information contents in this figure are trimmed for brevity.