Federico Abascal, Michael L Tress, Alfonso Valencia.
Abstract
Alternative splicing and gene duplication are the two main processes responsible for expanding protein functional diversity. Although gene duplication can generate new genes and alternative splicing can introduce variation through alternative gene products, the interplay between the two processes is complex and poorly understood. Here, we have carried out a study of the evolution of alternatively spliced exons after gene duplication to better understand the interaction between the two processes. We created a manually curated set of 97 human genes with mutually exclusively spliced homologous exons and analyzed the evolution of these exons across five distantly related vertebrates (lamprey, spotted gar, zebrafish, fugu, and coelacanth). Most of these exons had an ancient origin (more than 400 Ma). We found examples supporting two extreme evolutionary models for the behaviour of homologous exons after gene duplication. We observed 11 events in which gene duplication was accompanied by splice isoform separation, that is, each paralog specifically conserved just one distinct ancestral homologous exon. At the other extreme, we identified genes in which the homologous exons were always conserved within paralogs, suggesting that the alternative splicing event cannot easily be separated from the function in these genes. That many homologous exons fall in between these two extremes highlights the diversity of biological systems and suggests that the subtle balance between alternative splicing and gene duplication is adjusted to the specific cellular context of each gene.
Keywords: alternative splicing; gene duplication; homologous exons; protein diversity; subfunctionalization
Year: 2015 PMID: 25931610 PMCID: PMC4494069 DOI: 10.1093/gbe/evv076
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
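The two extreme models described in the abstract (splice isoform separation versus conservation of both homologous exons in every paralog) can be illustrated with a small classification sketch. This is not the authors' pipeline: the function name `classify_duplication`, the exon labels, and the three category strings are all hypothetical, and the input is assumed to be a hand-made record of which ancestral exons each paralog retained.

```python
def classify_duplication(ancestral_exons, retained_by_paralog):
    """Classify the fate of a set of ancestral MEHEs after a gene duplication.

    ancestral_exons: labels of the ancestral homologous exons (hypothetical labels).
    retained_by_paralog: dict mapping paralog name -> set of ancestral exons it kept.
    Returns one of three illustrative categories (names are assumptions).
    """
    ancestral = frozenset(ancestral_exons)
    # Ignore any label that is not one of the ancestral exons.
    kept = [frozenset(exons) & ancestral for exons in retained_by_paralog.values()]
    if len(kept) < 2:
        raise ValueError("a duplication needs at least two paralogs")

    # Extreme 1: every paralog still carries all ancestral exons.
    if all(k == ancestral for k in kept):
        return "conserved in all paralogs"

    # Extreme 2: each paralog kept exactly one exon, and together
    # the paralogs cover every ancestral exon.
    each_kept_one = all(len(k) == 1 for k in kept)
    if each_kept_one and frozenset().union(*kept) == ancestral:
        return "splice isoform separation"

    # Everything else falls between the two extremes.
    return "intermediate"
```

For example, calling `classify_duplication({"a", "b"}, {"paralog1": {"a"}, "paralog2": {"b"}})` returns `"splice isoform separation"`, whereas two paralogs that both retain `{"a", "b"}` are reported as `"conserved in all paralogs"`.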
Figure: (A) The dates of origin and loss of the 97 human MEHE AS patterns shown against the phylogeny of human and five distant vertebrate species. Gains of AS events are shown in green, and the inferred numbers of AS losses in red. (B) The percentage of conservation of the 97 human AS events in each species.
The 11 Cases in Which Each Duplicated Gene Lost or Retained One of the Ancestral MEHEs in a Concerted Manner (Splice Isoform Separation) Are Shown, Indicating Which Genes and Lineages Are Affected
| Human Gene | Origin of MEHEs | Human Exons (GRCh38) | Differential Conservation of Ancestral MEHEs in Lineage (Genes) |
|---|---|---|---|
|  | Vertebrates | 12:2504435–2504539, 12:2504841–2504945; 12:2633628–2633712, 12:2634296–2634374 ( | Vertebrates ( |
|  | Jawed vertebrates | 7:128754261–128754455, 7:128754528–128754722 | Teleosts ( |
|  | Jawed vertebrates | 1:22091427–22091517, 1:22089942–22090032 | Zebrafish ( |
|  | Bilaterians | 7:101816011–102258233, 7:101816031–102249042; 7:101815904–102283957, 7:101816031–102283090 | Zebrafish ( |
|  | Chordates | 6:111700103–111700268, 6:111699514–111699670 | Vertebrates (many genes, e.g., |
|  | Vertebrates | 16:71640389–71641027, 16:71634192–71634803 | Lamprey, spotted gar, zebrafish, fugu (also other vertebrates) |
|  | Chordates (?) | 4:185508298–185508562, 4:185514702–185514890 | Platypus (ENSOANG00000006867, ENSOANG00000013438) |
|  | Vertebrates (?) | 1:63623460–63623760, 1:63593488–63593734 | Teleosts ( |
|  | Jawed vertebrates | X:106726913–106727397, X:106694002–106694408 | Zebrafish and cave fish (Otophysa) ( |
|  | Jawed vertebrates | 14:70060835–70060939, 14:70063822–70063929 | Spotted gar, fugu, coelacanth … ( |
|  | Jawed vertebrates | 21:6493043–6493110, 21:6492130–6492197 | Fugu, tilapia and stickleback (Percomorphaceae?) ( |
Note.—Genes in bold indicate cases undergoing complete splice isoform separation.
(a) CUX1 is not a case of homologous but of nonhomologous MEEs.
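The exon coordinates in the table above can be turned into exon lengths with a short script, which makes it easy to compare the two members of each MEHE pair. The sketch below assumes the coordinates are 1-based and inclusive (so length = end − start + 1) and that pairs within a cell are separated by semicolons; the function names `exon_length` and `pair_lengths` are hypothetical, and comparing lengths is only an illustrative check, not the method used in the study.

```python
import re

# Matches "chromosome:start-end", accepting an en dash or a plain hyphen.
COORD = re.compile(r"^\s*(\w+):(\d+)[\u2013-](\d+)\s*$")


def exon_length(coord):
    """Length in bp of one exon, assuming 1-based inclusive coordinates."""
    match = COORD.match(coord)
    if match is None:
        raise ValueError(f"unrecognized coordinate: {coord!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1


def pair_lengths(cell):
    """Parse a table cell such as 'a, b; c, d' into one tuple of lengths per pair."""
    pairs = []
    for pair in cell.split(";"):
        exons = [e for e in pair.split(",") if e.strip()]
        pairs.append(tuple(exon_length(e) for e in exons))
    return pairs


if __name__ == "__main__":
    # Coordinates copied from two rows of the table above.
    for cell in (
        "7:128754261–128754455, 7:128754528–128754722",
        "1:22091427–22091517, 1:22089942–22090032",
    ):
        print(cell, "->", pair_lengths(cell))
```

With these two rows, both exons of the first pair come out at 195 bp and both exons of the second at 91 bp. Cells that still carry extraction residue, such as a trailing open parenthesis, would need to be cleaned before parsing.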
Figure: Splice isoform separation of CALU in teleosts by differential retention of ancestral MEHEs (A) that code for the first EF-hand domain (B) is strongly supported by the position in the ML exon tree of two distinct teleost genes, CALUA and CALUB, each within the group of monophyly defined by each ancestral MEHE (C; with the best-fit evolutionary model LG+I+G). Numbers close to nodes indicate cases with more than 70% of bootstrap support based on 1,000 replicates. The multiple sequence alignment reveals some positions (blue arrows) with specific conservation patterns between MEHEs of human, spotted gar and coelacanth, and between duplicated genes in zebrafish and other teleosts (D).
Figure: The ML phylogenetic tree of MARVELD3 exons (LG+I+G+F evolutionary model), which shows the evolutionary relationship between equivalent homologous exons in different species. The exons exist either in the form of alternatively spliced exons or as constitutively spliced exons in separate genes. The numbers at each internal node indicate bootstrap support.
Groups of Human Paralogs with Homologous Patterns of AS along with the Date of the Corresponding Duplication Events and the Relative Position of MEHEs within the Gene
| Human Paralogs | Description | Duplication Ancestor | Region Affected and AS Role |
|---|---|---|---|
|  | Acyl-CoA synthetase long-chain | Jawed vertebrates | Internal |
|  | Alpha-actinin | Vertebrates. One AS conserved in fruitfly | Two pairs of internal MEHEs. Actin-binding domain ( |
|  | Acid-sensing ion channel | Vertebrates | 5 prime. N-terminus and first transmembrane helix of the channel |
|  | Voltage-dependent L-type calcium channel subunit alpha-1 | Vertebrates | Internal. Cytoplasmic C-terminal region. Fine tuning of channel properties ( |
|  | Voltage-dependent L-type calcium channel subunit alpha-1 | Vertebrates | Two pairs of internal MEHEs. End of first ion transport domain, beginning of last ion transport domain |
|  | Claudin | Vertebrates. MEHEs also found in | 5 prime. PMP22_Claudin domain. Permeability for anions or cations ( |
|  | Cytochrome P450, family 4, subfamily F | Catarrhini | Internal. Beginning of p450 domain |
|  | Beta-defensin | Amniotes | 3 prime. A signal peptide is shared between isoforms, while the extracellular domain, with many conserved Cys, is alternatively spliced |
|  | Dynamin | Vertebrates | Internal. Dynamin_M domain |
|  | Fibroblast growth factor receptor | Vertebrates, jawed vertebrates. MEHEs also found in tunicates | Internal. C-terminal half of the third Ig-like domain. Interaction with FGF and heparan sulfate proteoglycans ( |
|  | Guanine nucleotide-binding protein G(olf/s) subunit alpha | Jawed vertebrates | 5 prime. N-terminal region predicted disordered and beginning of G-alpha domain |
|  | AMPA glutamate receptor | Vertebrates | Internal. Ligand-gated ion channel domain. Channel-gating kinetics ( |
|  | Integrin alpha | Vertebrates | 3 prime. Cytoplasmic C-termini. Interaction with HPS5 ( |
|  | Mitogen-activated protein kinase/JNK | Vertebrates | Internal. Kinase domain. Different affinities for ATF-2, Elk-1 and Jun transcription factors ( |
|  | Myocyte-specific enhancer factor | Jawed vertebrates | Internal. Holliday junction regulator protein family C-terminal repeat |
|  | Pro-neuregulin | Jawed vertebrates | Internal. Tissue specificity, cell localization, etc. ( |
|  | PDZ and LIM domain protein 3 (ALP), LIM domain-binding protein 3 (Enigma) | Chordates? (not in the same Ensembl tree) | Tissue specific AS affecting the small ZM domain responsible for alpha-actinin-2 binding ( |
|  | Sodium channel protein subunit alpha | Amniotes, vertebrates | Internal. Beginning/middle of first ion transport domain. Developmental and tissue specificities ( |
|  | Choline transporter-like protein | Vertebrates | 3 prime. Cytoplasmic C-terminal tail |
|  | Sodium/calcium exchanger | Vertebrates | Internal in calx-beta motif. May modulate the dynamic properties of Ca2+ sensing ( |
|  | Tropomyosin alpha chain | Vertebrates | Several: 5 prime, internal, 3 prime. Developmental and tissue specificities (reviewed in |
Note.—Groups in bold indicate cases in which all the paralogs descending from the last GD event conserved the ancestral MEHEs.
Figure: (A) The 3D structure of human MAPK8 (PDB code 3O17), highlighting the region corresponding to the MEHEs (blue), the MEHE-encoded residues that differ between the alternative MAPK8 isoforms (purple), and the location of the active ATP-binding site (orange). (B) Direct comparison between the two alternative human MAPK8 isoforms (3O17 in blue, 1UKH in red), showing that most differences are found within the loop. (C) Multiple sequence alignment of MEHEs of JNKs (E6a and E6b in MAPK8), highlighting residues that are specifically conserved within each ancestral exon (blue dots) or that are conserved in one but variable in the other (orange dots).
Figure: The multiple sequence alignment (A) of a pair of MEHEs from different alpha actinins (corresponding to exons 8a and 8b in human ACTN2) reveals the ancient ancestry of this AS event (it first appeared in the ancestor of bilaterians) and how the original pattern has been conserved in multiple gene lineages despite several GD events. Alternatively spliced MEHEs are highlighted using the same colors. Human ACTN4 has two MEHE events, one conserved in ACTN2 (see above) and another that is found in ACTN1. The two events are spatially close in the 3D dimeric structure of alpha actinin, within the actin-binding regions shown in (B). The structure corresponds to the cryoEM model of chicken ACTN1 (pdb:1SJJ; Liu et al. 2004).
Figure: Multiple sequence alignments of two sets of homologous exons from human genes CACNA1C and CACNA1D, along with the equivalent exons from the CACNA1F and CACNA1S paralogs. After duplication CACNA1F retained one homologous exon from each pair of ancestral MEHEs and CACNA1S the other. CACNA1C also has a third pair of MEHEs at the beginning of the third ion transport domain (blue). Exon numbering is distinct in CACNA1C and CACNA1D.