| Literature DB >> 16274476 |
A D Neverov1, I I Artamonova, R N Nurtdinov, D Frishman, M S Gelfand, A A Mironov.
Abstract
BACKGROUND: Alternative splicing is a major mechanism of generating protein diversity in higher eukaryotes. Although at least half, and probably more, of mammalian genes are alternatively spliced, it was not clear, whether the frequency of alternative splicing is the same in different functional categories. The problem is obscured by uneven coverage of genes by ESTs and a large number of artifacts in the EST data.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16274476 PMCID: PMC1298288 DOI: 10.1186/1471-2105-6-266
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 3Blue columns: raw data. Red columns: normalized data (see Results). The difference between histograms before and after normalization is weak, because the fraction of highly expressed genes (>400 ESTs) is small (approximately 4%).
Figure 4Correlation between the isoform number and EST coverage. Blue: raw data (all ESTs). Red: normalized data (coverage-dependent filter on the number of clone libraries supporting exons, see Results). Each dot represents the average EST coverage for genes with the given number of isoforms. The peak in the normalized plot corresponds to the gene "eukaryotic translation elongation factor 1 alpha 1", represented by 18841 EST.
Figure 5Influence of normalization on the isoform number of proteins from the "Ribosome" GO category. Color code as in Fig. 1.
Figure 6Number of constitutive (blue) and alternative (red) regions in the longest isoform. The fraction of completely alternative genes is ~1%.
Figure 7Distribution of the isoform number in GO functional clusters.
Correlation between alternative splicing (AS) and protein-protein interactions (PPI). Expected numbers under independency assumption are given in parentheses. "EST-N" denotes isoforms with each exon supported by ESTs from at least N clone libraries.
| protein data, χ2 = 11.17 | No PPI | At least one PPI | TOTAL |
| No AS | 5434 (5408) | 122 (= 83% of 148) | 5556 |
| At least two AS-isoforms | 3692 (3718) | 127 (= 125% of 101) | 3819 |
| TOTAL | 9126 | 249 | 9375 |
| mRNA data, χ2 = 9.05 | No PPI | At least one PPI | TOTAL |
| No AS | 4879 (4856) | 107 (= 82% of 130) | 4986 |
| At least two AS-isoforms | 4360 (4383) | 141 (= 120% of 118) | 4501 |
| TOTAL | 9239 | 248 | 9487 |
| EST-5 data, χ2 = 6.57 | No PPI | At least one PPI | TOTAL |
| No AS | 4463 (4443) | 99 (= 83% of 119) | 4562 |
| At least two AS-isoforms | 4804 (4324) | 149 (= 115% of 129) | 4953 |
| TOTAL | 9267 | 248 | 9515 |
| EST-3 data, χ2 = 7.79 | No PPI | At least one PPI | TOTAL |
| No AS | 4001 (3979) | 85 (= 80% of 107) | 4086 |
| At least two AS-isoforms | 5302 (5324) | 164 (= 115% of 142) | 5466 |
| TOTAL | 9303 | 249 | 9552 |
| EST-2 data, χ2 = 7.57 | No PPI | At least one PPI | TOTAL |
| No AS | 3299 (3278) | 68 (= 77% of 89) | 3367 |
| At least two AS-isoforms | 6063 (6084) | 185 (= 113% of 164) | 6248 |
| TOTAL | 9362 | 253 | 9615 |
| normalized data, χ2 = 7.23 | No PPI | At least one PPI | TOTAL |
| No AS | 2275 (2257) | 42 (= 70% of 60) | 2317 |
| At least two AS-isoforms | 7066 (7084) | 206 (= 110% of 188) | 7272 |
| TOTAL | 9341 | 248 | 9589 |