| Literature DB >> 27358427 |
Elodie Marchadier1, Matt E Oates2, Hai Fang2, Philip C J Donoghue3, Alistair M Hetherington4, Julian Gough2.
Abstract
To progress our understanding of molecular evolution from a collection of well-studied genes toward the level of the cell, we must consider whole systems. Here, we reveal the evolution of an important intracellular signaling system. The calcium-signaling toolkit is made up of different multidomain proteins that have undergone duplication, recombination, sequence divergence, and selection. The picture of evolution, considering the repertoire of proteins in the toolkit of both extant organisms and ancestors, is radically different from that of other systems. In eukaryotes, the repertoire increased in both abundance and diversity at a far greater rate than general genomic expansion. We describe how calcium-based intracellular signaling evolution differs not only in rate but in nature, and how this correlates with the disparity of plants and animals.Entities:
Keywords: calcium signaling; diversification; evolution; protein architecture; specialization
Mesh:
Substances:
Year: 2016 PMID: 27358427 PMCID: PMC4987107 DOI: 10.1093/gbe/evw139
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
List of the 31 Calcium-Binding Domains with Their Correspondence to Prosite and the Link to Their Structure
| Superfamily ID | Superfamily | Prosite Domain |
|---|---|---|
| 47874 | Annexin | PDOC00195 |
| 47473 | EF-hand | PDOC50031;PDOC00018;PDOC00535 |
| 48619 | Phospholipase A2, PLA2 | PDOC00109 |
| 63446 | Type I dockerin domain | PDOC00416 |
| 101887 | Apyrase | |
| 63829 | Calcium-dependent phosphotriesterase | |
| 49562 | C2 domain (Calcium/lipid-binding domain, CaLB) | PDOC51210;PDOC00380 |
| 49899 | Concanavalin A-like lectins/glucanases | PDOC51117;PDOC51328;PDOC00636;PDOC50092;PDOC50188 |
| 81653 | Calcium ATPase, transduction domain A | PDOC00139 |
| 49313 | Cadherin-like | PDOC00205 |
| 63887 | P-domain of calnexin/calreticulin | PDOC00636 |
| 56784 | HAD-like | PDOC00139 |
| 51735 | NAD(P)-binding Rossmann-fold domains | PDOC51201;PDOC51201 |
| 56436 | C-type lectin-like | PDOC00537 |
| 81660 | Metal cation-transporting ATPase, ATP-binding domain N | PDOC00139 |
| 81665 | Calcium ATPase, transmembrane domain M | PDOC00139 |
| 100895 | Kazal-type serine protease inhibitors | PDOC00535 |
| 57196 | EGF/Laminin | PDOC00913;PDOC51117;PDOC00535 |
| 57424 | LDL receptor-like module | PDOC00929 |
| 103647 | TSP type-3 repeat | PDOC51236;PDOC50092;PDOC51234 |
| 82895 | TSP-1 type 1 repeat | PDOC50092 |
| 110083 | Peptidylarginine deiminase Pad4, middle domain | |
| 144270 | Eferin C-derminal domain-like | |
| 57630 | GLA Gamma-carboxyglutamic acid-rich-domain | |
| 48092 | Transcription factor STAT-4 N-domain | |
| 47668 | N-terminal domain of cbl (N-cbl) | |
| 101112 | Oxygen-evolving enhancer protein 3, | |
| 57581 | TB module/8-cys domain | |
| 103647 | TSP type-3 repeat | |
| 140570 | MukF C-terminal domain-like | |
| 82026 | Calcium-mediated lectin |
. 7.(a) Treemaps showing the reconstruction of the evolution of multipurpose architectures in calcium-signaling components from LECA to representative species of the eukaryotes. On the left, multipurpose architectures are expressed in numbers. On the right, multipurpose architectures are expressed in percentage of the architectures. (b) Treemaps showing the reconstruction of the evolution of the architecture diversity of calcium-signaling components from LECA to representative species of the eukaryotes. The percentages of the single purpose architectures dedicated to each component are indicated by the color-scale.
. 1.Phylogenetic tree of a subset of representative species of the diversity of organisms used in the study. Phylums are characterized by a color code that will be used in the next figures.
. 2.The occurences of building blocks of proteins (domains). Calcium-binding domains are labeled on the top horizontal axis. Both current and reconstructed ancestral organisms are labeled on the vertical axis. The entries of extint ancestral species represent their reconstructed proteome. The blue-scale of the matrix (on the left) stand for the number of proteins containing a calcium-binding domain that occur in each genome. The green-scale of the bar (middle) represents the diversity of proteins in the genome of each of the organisms (i.e., the size of the proteome). The red-scale of the bar (right side) represents the size of the genomes in number of genes. The scales of the color bar scale are indicated at the top right of the figure.
. 3.The number of domains correlating with the number of protein architectures. On the y axis is the number of different domain architectures containing at least one calcium-binding domain versus on the x axis the total number of proteins having these architectures. Each point on the graph represents a single proteome; the 44 proteomes used in figure 1 are plotted. Archaea are represented in brown, bacteria in gray, fungi in red, plants in green, chordates in dark blue, other metazoans in blue, and the rest of the eukaryotes in black. Shown in gray is a fitted regression line (Pearson correlation coefficient = 0.91).
. 4.Left (a). Calcium-binding architectures diversity: fraction of architectures that contain at least one calcium-binding domain. Right (b). Calcium-binding architecture redundancy: the number of proteins per architecture expressed as a ratio between architectures containing a calcium-binding domain and all architectures in the proteome. On the x axis the 1,558 organisms are sorted by taxonomy. Each point represents a genome (colored by superkingdom).
. 5.A schema of some components involved in calcium signaling. Boxes with dashed lines represent components able to bind calcium. Green boxes are calcium influx components, pink boxes are calcium efflux components, and blue boxes are signal decoding and relay components. The light red line encircles the minimal toolkit and includes boxes for which all organisms have at least one of the architecture of the component function.
. 6.A functional breakdown of calcium-signaling architectures in three representative eukaryote genomes. The left part (a) shows multipurpose architectures and their functional components. Architectures present in the organism in question are displayed in colors, and totals for all organisms in gray. The right part (b) shows single-purpose architectures as histograms of the number (y axis) for each functional component (x axis). Data for the organism of interest are shown in colors, and totals for the 1,558 genomes are shown in gray.
Examples of Architectures Identified In the “Relay-Muscle Contraction” Component
| Architectures Which Are Multi-Purpose and Present In Several Genomes | |
| 47473 | Homo sapiens: A2RRN2, calcyphosin Q02045, Myosin light chain 5 C9JV47, Grancalcin Arabidopsis thaliana: Q9LIK5, Calmodulin-like protein 11 Q9ZPX9, Calcium-binding protein KIC O81445, Calcineurin B-like protein 1 Colletotrichum graminicola: E3QMN8, Myosin regulatory light chain cdc4 Drosophila melanogaster: P42325, Neurocalcin homolog |
| Architectures Which Are Component-Specific (Single-Purpose) and Present in Several Genomes | |
| _gap_,56112 | Homo sapiens: J3KQG8, Epithelial discoidin domain-containing recept O96013, Serine/threonine-protein kinase PAK 4 Arabidopsis thaliana: P0C5E2, Probable serine/threonine-protein kinase At1g18390 Q93V58, Serine/threonine-protein kinase GRIK1 C. elegans: P34314, Serine/threonine-protein kinase Saccharamyces cerevisiae: P36004, Probable serine/threonine-protein kinase KKQ8 |
| Architectures Which Are Multipurpose and Genome-Specific (In Only One Genome) | |
| 89837,89837,56112 | Eimeria tenella: C8TE04, Myosin light chain kinase takes part of the “relay muscle contraction” and “relay calmodulin kinase” components |
| 47473,_gap_,50729,56112,_gap_ | Plasmodium yoelii yoelii: Q7RTG4, Myosin light chain kinase takes part of the “relay muscle contraction” and “decoding-CDPK” components |
| Architectures Which Are Component and Genome-Specific (Single-Purpose In Only One Genome) | |
| 48726,48726,_gap_,48726,48726,48726,_gap_, 48726,48726,49265,56112,48726 | Homo sapiens: D3DN97, Myosin light polypeptide kinase, isoform CRA_d |
| 47473,55729 | Harpegnathos saltator: E2BYA7, Myosin-2 essential light chain |