| Literature DB >> 17108355 |
Namshin Kim1, Alexander V Alekseyenko, Meenakshi Roy, Christopher Lee.
Abstract
We have greatly expanded the Alternative Splicing Annotation Project (ASAP) database: (i) its human alternative splicing data are expanded approximately 3-fold over the previous ASAP database, to nearly 90,000 distinct alternative splicing events; (ii) it now provides genome-wide alternative splicing analyses for 15 vertebrate, insect and other animal species; (iii) it provides comprehensive comparative genomics information for comparing alternative splicing and splice site conservation across 17 aligned genomes, based on UCSC multigenome alignments; (iv) it provides an approximately 2- to 3-fold expansion in detection of tissue-specific alternative splicing events, and of cancer versus normal specific alternative splicing events. We have also constructed a novel database linking orthologous exons and orthologous introns between genomes, based on multigenome alignment of 17 animal species. It can be a valuable resource for studies of gene structure evolution. ASAP II provides a new web interface enabling more detailed exploration of the data, and integrating comparative genomics information with alternative splicing data. We provide a set of tools for advanced data-mining of ASAP II with Pygr (the Python Graph Database Framework for Bioinformatics) including powerful features such as graph query, multigenome alignment query, etc. ASAP II is available at http://www.bioinformatics.ucla.edu/ASAP2.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17108355 PMCID: PMC1669709 DOI: 10.1093/nar/gkl884
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics for ASAP II database
| Organism | Genome assembly | UniGene clusters | Detected splices/Clusters | Isoforms | Alternative splicing | Alternatively spliced | |||
|---|---|---|---|---|---|---|---|---|---|
| Total | Mapped | Splices | Clusters | Relationships | Clusters | ||||
| Human | hg17 | 66 488 | 47 477 | 193 023 | 22 220 | 260 198 | 89 078 | 11 717 | 53 |
| Mouse | mm7 | 43 104 | 32 522 | 141 284 | 16 404 | 135 465 | 33 057 | 8711 | 53 |
| Rat | rn3 | 41 687 | 34 003 | 82 941 | 14 195 | 53 212 | 7210 | 3378 | 24 |
| Western clawed frog | xenTro1 | 33 132 | 24 617 | 65 633 | 10 880 | 34 293 | 4836 | 2349 | 22 |
| Chicken | galGal2 | 30 470 | 19 708 | 51 471 | 9671 | 26 557 | 4244 | 2154 | 22 |
| Cow | bosTau2 | 39 432 | 28 709 | 60 813 | 11 448 | 32 401 | 6692 | 3008 | 26 |
| Dog | canFam2 | 22 930 | 16 645 | 29 290 | 6834 | 11 424 | 1633 | 951 | 14 |
| ce2 | 20 621 | 15 546 | 54 395 | 12 580 | 23 393 | 1309 | 763 | 6 | |
| ci2 | 15 587 | 1373 | 5611 | 972 | 2161 | 150 | 98 | 10 | |
| Zebrafish | danRer3 | 32 400 | 22 297 | 67 598 | 12 136 | 27 547 | 2611 | 1577 | 13 |
| Fruit fly | dm2 | 16 635 | 14 568 | 37 469 | 9683 | 26 854 | 4850 | 1841 | 19 |
| fr1 | 2355 | 1980 | 3014 | 798 | 866 | 33 | 24 | 3 | |
| Yellow fever mosquito | AaegL 1 | 15 182 | 10 624 | 3594 | 1787 | 2529 | 120 | 87 | 5 |
| Honeybee | apiMel2 | 5900 | 5027 | 6270 | 2548 | 2990 | 90 | 57 | 2 |
| African malaria mosquito | anoGam1 | 15 609 | 14 173 | 17 278 | 8013 | 15 115 | 1070 | 605 | 8 |
aGenome assembly sequences were downloaded from UCSC genome browser except for Yellow fever mosquito, which was downloaded from Enesmbl genome browser.
bAlternative spliced genes (%) = No. of alternatively spliced clusters/No. of spliced clusters.
Fifty-three percent of human and mouse multi-exon genes are detected to contain alternative splicing. Focusing on genes with at least mRNA, 75 and 60% of human and mouse multi-exons genes were detected to contain alternative splicing (ASAP II website for details). Due to limited mRNA and EST coverage (Fugu and honeybee) and incomplete genome assembly (Fugu, Ciona and yellow fever mosquito), number of mapped clusters (Ciona, 9%; 1373 out of 15 587) or alternatively spliced clusers (24 for Fugu, 98 for Ciona, 57 for honeybee, 87 for and yellow fever mosquito) can be significantly lower than expected, these data cannot be considered comprehensive: 19–26% of fruit fly, western clawed frog, chicken, rat and cow multi-exon genes were detected to contain alternative splicing.
Statistics for orthologous exons and introns
| Multiple alignments | Exons with orthologous exons | Total internal exons | Introns with orthologous introns | Total canonical introns |
|---|---|---|---|---|
| hg17 referenced 17 species Multigenome alignments | 85 673 | 129 981 | 100 447 | 193 024 |
| mm7 referenced 17 species Multigenome alignments | 81 296 | 105 260 | 97 371 | 141 285 |
| galGal2 referenced 7 species Multigenome alignments | 20 471 | 36 865 | 24 973 | 51 472 |
| danRer3 referenced 5 species Multigenome alignments | 18 977 | 50 792 | 22 367 | 67 599 |
| xenTro1 referenced 5 species Multigenome alignments | 23 428 | 49 679 | 26 893 | 65 634 |
aOnly orthologous exons and introns that have two exact matches of both canonical splice sites (U1/U2 and U11/U12). List of species used in multigenome alignments is available at UCSC genome browser (34).
Sixty-six percent (85 673 out of 1 29 981) for human and 77% (81 296 out of 105 260) for mouse internal exons have at least one orthologous exons; 52% (1 00 447 out of 1 93 024) for human and 69% (97 371 out of 1 41 285) for mouse canonical introns have at least one orthologous introns. Most of orthologous exons and introns were from human and mouse orthologs due to larger number of mRNA and EST sequences than other genomes: 56, 37 and 47% of chicken (galGal2), zebrafish (danRer3), and western clawed frog (xenTro1) internal exons have at least one orthologous exons and 49, 33 and 41% for orthologous introns. Because a set of genome assemblies used for multigenome alignments is different from ASAP II calculation for chicken, zebrafish and western clawed frog (Table 1 for details), numbers of orthologous exons and intron can be decreased.
Figure 1Popup page for orthologous exons, introns and splice sites. (A) List of orthologous genes are described in UniGene summary section. (B) Orthologous Exons. ‘EXACT’ means both splice sites are exactly aligned in multigenome alignments. Change in protein modularity (remainder divided by three) is denoted as ‘(0 → 0)’. (C) Orthologous Introns. ‘(1157 → 2382)’ means human intron (ID 365585) size is increased in rat intron (ID 28953). (D) Multiple alignments of splice site sequences and its phylogenetic tree generated by UCSC Phylogenetic Tree Gif Maker (34) on the fly. Splice site consensus of this intron (ID 365585) is well-conserved within close genomes but not in distant genomes; dog (canFam2) and opossum (monDom2).
Comparison of alternative splicing analyses with other databases
| Database | Species | Genes | Internal exons | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Alternatively spliced | Spliced | Constitutive | Alternative | Total | |||||
| ASD | Human | 9929 | 61% | 16 293 | — | — | — | — | — |
| Mouse | 8211 | 50% | 16 391 | — | — | — | — | — | |
| ECgene | Human | 21 266 | 43% | 49 546 | — | — | — | — | — |
| Mouse | 17 706 | 40% | 43 932 | — | — | — | — | — | |
| Rat | 8699 | 32% | 27 406 | — | — | — | — | — | |
| DEDB | Fruit fly | 2646 | 20% | 13 222 | — | — | — | — | — |
| HOLLYWOOD | Human | — | — | — | 114 839 | 75% | 37 366 | 25% | 151 199 |
| Mouse | — | — | — | 79 217 | 87% | 11 673 | 13% | 90 885 | |
| ASAP II | Human | 11 717 | 53% | 22 220 | 83 193 | 64% | 46 788 | 36% | 129 981 |
| Mouse | 8711 | 53% | 16 404 | 82 839 | 79% | 22 421 | 21% | 105 260 | |
| Rat | 3378 | 24% | 14 195 | — | — | — | — | — | |
| Fruit fly | 1841 | 19% | 9683 | — | — | — | — | — | |
aAll numbers are taken from ASD (11), ECgene (14), DEDB (12) and HOLLYWOOD (10).