| Literature DB >> 16620375 |
Ekaterina O Ermakova1, Ramil N Nurtdinov, Mikhail S Gelfand.
Abstract
BACKGROUND: At least half of mammalian genes are alternatively spliced. Alternative isoforms are often genome-specific and it has been suggested that alternative splicing is one of the major mechanisms for generating protein diversity in the course of evolution. Another way of looking at alternative splicing is to consider sequence evolution of constitutive and alternative regions of protein-coding genes. Indeed, it turns out that constitutive and alternative regions evolve in different ways.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16620375 PMCID: PMC1459143 DOI: 10.1186/1471-2164-7-84
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The data flow through the analysis pipeline.
Substitution rates in constitutive and alternative regions of human-mouse concatenated alignments
| Concatenate type | total alignment length, bp | amino-acid identity | dN | dS | ω |
| All genes | |||||
| C | 2822439 | 0.891 | 0.068 | 0.409 | 0.166 |
| A | 3081642 | 0.879 | 0.077 | 0.410 | 0.187 |
| AN | 2194521 | 0.880 | 0.075 | 0.404 | 0.186 |
| AI | 790026 | 0.884 | 0.074 | 0.421 | 0.176 |
| AC | 97095 | 0.813 | 0.133 | 0.446 | 0.297 |
| Slow-evolving genes | |||||
| C | 897471 | 0.964 | 0.020 | 0.327 | 0.061 |
| A | 920970 | 0.960 | 0.023 | 0.324 | 0.071 |
| AN | 670623 | 0.961 | 0.022 | 0.323 | 0.068 |
| AI | 230754 | 0.957 | 0.024 | 0.327 | 0.075 |
| AC | 19593 | 0.940 | 0.036 | 0.329 | 0.111 |
| Medium-speed evolving genes | |||||
| C | 978984 | 0.913 | 0.052 | 0.422 | 0.123 |
| A | 1092459 | 0.903 | 0.059 | 0.420 | 0.139 |
| AN | 751620 | 0.903 | 0.059 | 0.413 | 0.142 |
| AI | 313896 | 0.905 | 0.057 | 0.434 | 0.131 |
| AC | 26943 | 0.887 | 0.073 | 0.455 | 0.160 |
| Fast-evolving genes | |||||
| C | 945984 | 0.800 | 0.135 | 0.480 | 0.281 |
| A | 1068213 | 0.785 | 0.147 | 0.481 | 0.306 |
| AN | 772278 | 0.788 | 0.143 | 0.474 | 0.302 |
| AI | 245376 | 0.787 | 0.148 | 0.504 | 0.294 |
| AC | 50559 | 0.724 | 0.209 | 0.493 | 0.425 |
C – constitutive regions
A – alternative regions
AN – N-terminal alternative regions
AI – internal alternative regions
AC – C-terminal alternative regions
R – the transition to transvertion rate ratio
dN – the nonsynonymous substitution rate
dS – the synonymous substitution rate
ω = dN/dS
Figure 2Amino acid identity in alignments of constitutive and alternative coding regions of 3029 alternatively spliced genes classified by the overall evolution rate. C – constitutive regions. A – alternative regions. AN – N-terminal alternative regions. AI – internal alternative regions. AC – C-terminal alternative regions. Five columns on the left show results for three equal bins of slowly evolving genes, genes evolving with medium speed, and fast evolving genes. Five columns on the right correspond to the total sample of 3029 alternatively spliced genes. To estimate the average amino acid identity, we used global meta-alignments (see Methods). The boxplots represent the results for 2000 bootstrap replications.
Figure 3Nonsynonymous substitution rate (d. Notation and layout as in Figure 2.
Figure 4Synonymous substitution rate (d. Notation and layout as in Figure 2.
Figure 5Selection measure ω = d. Notation and layout as in Figure 2.
Figure 6Distributions of ω. a. The distribution of ωC-ωA for 2358 genes with the total length of human-mouse alignments of constitutive regions and of alternative regions both exceeding 80 bp shows that ω tends to be larger in alternative regions. In particular, there are 23 genes with ωC-ωA<-0.8 and no genes with ωC-ωA>0.8. b, c, d. The distributions of ωC-ωAN, ωC-ωAI, ωC-ωAC for genes with long N-terminal (1674 genes, top, ωC-ωAN), internal (976 genes, middle, ωC-ωAI), and C-terminal (110 genes, bottom, ωC-ωAC) alternative regions, respectively, show that ωA* tends to be larger than ωc in all types of alternative regions. the grey shadows are symmetrical to the left parts of the histograms.