| Literature DB >> 18254963 |
Florian Odronitz1, Martin Kollmar.
Abstract
BACKGROUND: Alternative splicing of mutually exclusive exons is an important mechanism for increasing protein diversity in eukaryotes. The insect Mhc (myosin heavy chain) gene produces all different muscle myosins as a result of alternative splicing in contrast to most other organisms of the Metazoa lineage, that have a family of muscle genes with each gene coding for a protein specialized for a functional niche.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18254963 PMCID: PMC2257972 DOI: 10.1186/1471-2199-9-21
Source DB: PubMed Journal: BMC Mol Biol ISSN: 1471-2199 Impact factor: 2.946
Figure 1Diagram of the arthropod . The gene structures of the arthropod muscle myosins genes are shown using the following color code: light-gray: intron sequences; dark-gray: common exons; colored: alternatively spliced exons. The Drosophila melanogaster Mhc1 gene is shown as representative for all Drosophila sp. Mhc1 genes, because their gene structures only differ in the length of the introns. The transcriptional and translational start sites, the stop codons and polyadenylation sites are shown if they have been determined. Some genes are spread on several contigs. The corresponding gap positions are shown in black, if further exons are not expected, and in red, if exons are definitively missing. The genes are drawn to scale except for the Aedes aegypti genes where the extremely long introns have been shortened. Gaps have been filled with 100 bp although their exact length is unknown.
Nucleotide ID's and number of combinations of alternative exons for the motor domains and the full-length proteins.
| Species | Species Abbr. | Nucleotide ID's GenBank: | Motor domain | Full-length protein |
| 1536 | > 3072 | |||
| 192 | 768 | |||
| 192 | > 384 | |||
| 144 | > 288 | |||
| 96 | 384 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 120 | 480 | |||
| 128 | 768 | |||
| 128 | 512 | |||
| 1 | 1 | |||
| 16 | 32 | |||
| 128 | 512 | |||
| 1 | 1 | |||
| 1 | 1 |
Figure 2Relationships between alternatively spliced exon. Sections of the Mhc1 genes of Figure 1 have been aligned showing the relationship between the exon-intron structures of the regions containing alternatively spliced exons. Continuous lines connect variants that are almost identical and thus expected to be derived from a common ancestor. Bold lines connecting alternative exons in regions containing multiple variants per Mhc1 gene highlight particularly conserved exons in these sets. Dotted lines represent putative connections between certain variants although their identity is not very strong on the protein level.
Figure 3Sequence conservation in the first set of the alternatively spliced exons. On top, the protein sequence alignment of the alternative exons is shown. The upper sequences, termed Mhc1, Mhc3, and Mhc4, respectively, represent the variant "a" exons. Below, the comparison of the sequence identity between each exon and variant "a" and "b" of every other Mhc1 protein is shown. The graphic has to be read in columns. The higher identity between an exon listed on top and variant "a" or "b" of a certain Mhc1 protein listed on the left side has been set to 1 (red color) while the difference of the lower identity to the value of the higher identity is plotted for the other combination of exons. Thus, in every column the higher identity of the named exon to one of the variants of the other Mhc1 proteins is visualized.
Figure 4Phylogenetic tree of the arthropod muscle myosin heavy chain proteins. The amino acid sequences of the full-length proteins were aligned manually. Because of their incompleteness the sequences of Drosophila persimilis and Drosophila yakuba have been omitted from the tree calculation. Support values for each internal branch were obtained by 1,000 bootstrap steps. The scale bar corresponds to 0.1 estimated amino acid substitutions per site.
Figure 5Diagram of the arthropod Mhc1 proteins. The exon-intron structure of the Mhc1 genes is shown based on the protein sequence. Exons are shown as boxes while introns are represented by spaces. The same colour scheme has been used as in Figure 1. Numbers on alternative exons denote the number of variants. The exons are drawn that the intron positions align between the different Mhc1 genes. Thus, the exon lengths are not drawn to scale (e.g. the exons encoding the variable loop-2 are different in lengths). On the right side, the protein sequence of Drosophila melanogaster Mhc1 is shown as reference. Dotted lines connect amino acids that are derived from split codons.
Figure 6Structure of the myosin motor domain. The structure of the motor domain of the class-II myosin of Dictyostelium discoideum has been used to highlight the regions encoded by alternatively spliced exons in arthropod Mhc1 genes. The color-coding is the same as in Figure 1 allowing the identification of corresponding regions.
Figure 7Model for the process of alternative splicing. The model describes the three different origins of pseudogenes. Non-processed pseudogenes are often found adjacent to their paralogous functional gene and retain the same exon-intron structure. Processed pseudogenes are marked by the absence of both 5' promotor sequence and introns, the presence of flanking direct repeats, and are randomly integrated into the genome. In the case of the arthropod Mhc genes, these get in the first step transcribed. In a second step, the alternative exons get spliced resulting in a certain combination of alternative exons and retaining the exon-intron structure. In the case of AeaMhc3, CpqMhc3, and CpqMhc4, these transcripts have been integrated into the genome. Normally in a third step, the introns get spliced revealing the final mRNA ready for translation. Dark grey bars represent constitutive and coloured bars alternatively spliced exons. Light grey bars represent non-coding sequence.