| Literature DB >> 19208117 |
Hao Zheng1, Xingyi Hang, Ji Zhu, Minping Qian, Wubin Qu, Chenggang Zhang, Minghua Deng.
Abstract
BACKGROUND: Alternative splicing (AS) is an important regulatory mechanism for gene expression and protein diversity in eukaryotes. Previous studies have demonstrated that it can be causative for, or specific to splicing-related diseases. Understanding the regulation of AS will be helpful for diagnostic efforts and drug discoveries on those splicing-related diseases. As a novel exon-centric microarray platform, exon array enables a comprehensive analysis of AS by investigating the expression of known and predicted exons. Identifying of AS events from exon array has raised much attention, however, new and powerful algorithms for exon array data analysis are still absent till now.Entities:
Mesh:
Year: 2009 PMID: 19208117 PMCID: PMC2648792 DOI: 10.1186/1471-2105-10-S1-S18
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Design of simulation 1 for evaluation
| Gene ID | Treatment Group | Control Group |
| 1 | Exon 5~N (0, 1) | All exons ~N (2.5,1), correlated |
| Exon 6~N (0, 1) | ||
| Others ~N (2.5, 1), correlated | ||
| 2 | Exon 5~N (0, 1) | All exons ~N (1.5,1), correlated |
| Exon 6~N (0, 1) | ||
| Others ~N (1.5, 1), correlated | ||
| 3 | Exon 5~N (2.5, 1) | Exon 5~N (0, 1) |
| Exon 6~N (0, 1) | Exon 6~N (2.5, 1) | |
| Others ~N (2.5, 1), correlated | Others ~N (2.5, 1), correlated | |
| 4 | Exon 5~N (1.5, 1) | Exon 5~N (0, 1) |
| Exon 6~N (0, 1) | Exon 6~N (1.5, 1) | |
| Others ~N (1.5, 1), correlated | Others ~N (1.5, 1), correlated |
Exon 5 and Exon 6 are simulated as cassette exons in alternatively spliced gene 1 and 2, while they are defined as mutually exclusive exons in alternatively spliced gene 3 and 4.
Figure 1Results of simulation 1. Panel 'a' shows frequencies of selection by REMAS for the 50 genes in simulation 1. Panel 'b' shows distribution of average values of θ for the first 50 exons in simulation 1. The total number of selection is 1000.
Figure 2Selection frequency in the first iteration in simulation 2. Frequencies of the 50 genes selected by REMAS in simulation 2 are illustrated. The total number of selection is 1000.
Figure 3Selection frequency in the second iteration in simulation 2. Frequencies of the 48 genes selected by REMAS in simulation 2 after removing alternatively spliced gene 1 and 3. The total number of selection is 1000.
Design of simulation 3 for evaluation
| Gene ID | Treatment Group | Control Group |
| 1 | Exon 5~N (0, 1) | All exons ~N (1.5,1), correlated |
| Exon 6~N (0, 1) | ||
| Others ~N (2.5, 1), correlated | ||
| 2 | Exon 5~N (0, 1) | All exons ~N (2.5,1), correlated |
| Exon 6~N (0, 1) | ||
| Others ~N (1.5, 1), correlated | ||
| 3 | Exon 5~N (2.5, 1) | Exon 5~N (0, 1) |
| Exon 6~N (0, 1) | Exon 6~N (1.5, 1) | |
| Others ~N (2.5, 1), correlated | Others ~N (1.5, 1), correlated | |
| 4 | Exon 5~N (1.5, 1) | Exon 5~N (0, 1) |
| Exon 6~N (0, 1) | Exon 6~N (2.5, 1) | |
| Others ~N (1.5, 1), correlated | Others ~N (2.5, 1), correlated |
The design of alternatively spliced exons is same with simulation 1, but the power of normal distribution is different between two sample groups to simulate the differential expression in gene level.
Figure 4Results of simulation 3. Panel 'a' shows frequencies of selection by REMAS for the 50 genes in simulation 3. Panel 'b' shows distribution of average values of θ for the first 50 exons in simulation 3. The total number of selection is 1000.
List of identified alternatively spliced genes by REMAS validated by RT-PCR or supported by the literature
| ID | Transcript cluster ID | Gene Symbol | Ranks |
| 1 | 2425756 | COL11A1 | 5 |
| 2 | 2727226 | FIP1L1 | 11 |
| 3 | 2605321 | COL6A3 | 18 |
| 4 | 3604147 | KIAA1199 | 24 |
| 5 | 2515933 | ZAK | 28 |
| 6 | 3049522 | TENS3 | 29 |
| 7 | 2907671 | PTK7 | 77 |
| 8 | 3569814 | ACTN1 | 120 |
| 9 | 2413203 | LRP8 | 134 |
| 10 | 3252071 | VCL | 148 |
| 11 | 2598261 | FN1 | 190 |
| 12 | 3939707 | CABIN1 | 193 |
| 13 | 2712236 | MUC4 | 212 |
| 14 | 3597338 | TPM1 | 231 |
| 15 | 3735151 | ITGB4 | 287 |
| 16 | 3025545 | CALD1 | 307 |
| 17 | 2375706 | ATP2B4 | 335 |
| 18 | 3304301 | PSD | 354 |
| 19 | 3694657 | CDH11 | 418 |
| 20 | 3252036 | PLAU | 480 |
Figure 5. These 12 genes are not only identified by REMAS but also by SI algorithm. Importantly, they have been validated by RT-PCR or supported by literatures. The Affymetrix probeset ID (7-digit numbers) indicates the position of alternatively spliced exon in the target gene.