| Literature DB >> 26191481 |
Joana Projecto-Garcia1, Didier Jollivet1, Jean Mary1, François H Lallier1, Stephen W Schaeffer2, Stéphane Hourdez1.
Abstract
Multi-domain proteins form the majority of proteins in eukaryotes. During their formation by tandem duplication or gene fusion, new interactions between domains may arise as a result of the structurally-forced proximity of domains. The proper function of the formed proteins likely required the molecular adjustment of these stress zones by specific amino acid replacements, which should be detectable by the molecular signature of selection that governed their changes. We used multi-domain globins from three different invertebrate lineages to investigate the selective forces that acted throughout the evolution of these molecules. In the youngest of these molecules [Branchipolynoe scaleworm; original duplication ca. 60 million years (Ma)], we were able to detect some amino acids under positive selection corresponding to the initial duplication event. In older lineages (didomain globin from bivalve mollusks and nematodes), there was no evidence of amino acid positions under positive selection, possibly the result of accumulated non-adaptative mutations since the original duplication event (165 and 245 Ma, respectively). Some amino acids under positive selection were sometimes detected in later branches, either after speciation events, or after the initial duplication event. In Branchipolynoe, the position of the amino acids under positive selection on a 3D model suggests some of them are located at the interface between two domains; while others are locate in the heme pocket.Entities:
Keywords: Annelids; Hemoglobin; Molluscs; Nematodes; Positive selection
Year: 2015 PMID: 26191481 PMCID: PMC4503718 DOI: 10.1186/s40064-015-1124-2
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1Maximum likelihood trees of the multimeric globin genes from the three groups of metazoans analyzed. a Branchipolynoe hemoglobin phylogram; b NEMATODE globins’ phylogram; c Barbatia globins’ phylogram. In all cases GTR was used as the nucleotide substitution model. ω values are shown in bold over/under the branches, and N*dN/S*dN ratios are within brackets. Mg myoglobin, SD single-domain, D1–D4 domains 1 through 4. Insets (adapted from Weber and Vinogradov 2001) represent the quaternary structure of globins to which the correspondent tree is right bellow. Size of the represented assemblies is depicted next to each inset.
Results from the analyses of positive selection (PAML v 4.7, Yang 2007)
| Model | ln | κ | np | Model estimates | LRT (df) | Sites under positive selection (BEB >0.95) |
|---|---|---|---|---|---|---|
|
| ||||||
| Branch model | ||||||
| M0 | −2,061.60 | 1.755 | 19 | ω = 0.278 | ||
| M1 | −2,052.07 | 1.781 | 35 | 0.001 < ω < ∞ | 19.05NS (16) | |
| Site model | ||||||
| M1a ‘nearly neutral’ | −2,050.01 | 1.842 | 20 | ω0 = 0.177 (79%) | ||
| M2a ‘positive selection’ | −2,050.01 | 1.842 | 22 | ω0 = 0.177 (79%) | 0.00NS (2) | |
| Branch-site model | ||||||
| MA branch | −2,038.08 | 1.851 | 22 | ω0 = 0.154 (65.6%) | 23.86***** (2) | 27 V |
| MA branch | −2,049.37 | 1.845 | 22 | ω0 = 0.167 (0%) | 1.29NS | – |
| MA_branch | −2,048.87 | 1.856 | 22 | ω0 = 0.167 (0%) | 2.28NS | – |
| Nematodes | ||||||
| Branch model | ||||||
| M0 | −2,153.41 | 1.455 | 9 | ω = 0.037 | ||
| M1 | −2146.71 | 1.313 | 15 | 0.029 < ω < ∞ | 13.4* (6) | |
| Site model | ||||||
| M1a ‘nearly neutral’ | −2,141.70 | 1.476 | 10 | ω0 = 0.071 (87.3%) | ||
| M2a ‘positive selection’ | −2,141.70 | 1.476 | 12 | ω0 = 0.071 (87.3%) | 0NS | |
| Branch-site model | ||||||
| MA branch | −2,133.19 | 1.565 | 12 | ω0 = 0.074 (51.2%) | 17.02***** (2) | 16 sites |
| MA branch | −2,135.16 | 1.407 | 12 | ω0 = 0.063 (74.2%) | 13.09**** (2) | 11A |
| MA branch | −2,128.13 | 1.373 | 12 | ω0 = 0.061 (79.1%) | 27.14***** (2) | – |
|
| ||||||
| Branch model | ||||||
| M0 | −1,989.05 | 1.398 | 11 | ω = 0.332 | ||
| M1 | −1,980.29 | 1.393 | 19 | 0.136 < ω < 3.160 | 17.54** (8) | |
| Site model | ||||||
| M1a | −1,967.14 | 1.514 | 12 | ω0 = 0.128 (62%) | ||
| M2a | −1,956.20 | 1.649 | 14 | ω0 = 0.120 (54.9%) | 21.88***** (2) | |
| Branch-site model | ||||||
| MA branch | −1,961.52 | 1.572 | 14 | ω0 = 0.119 (59.9%) | 11.24*** (2) | – |
| MA branch | −1,962.64 | 1.510 | 14 | ω0 = 0.117 (62.3%) | 9.00** (2) | 31G |
In the 4 invertebrate groups, the likelihood (ln) values correspond to branches leading to various duplications (see Fig. 1a–c) in each phylogram. Selection models implemented in the codeml package, its parameters and associated results of likelihood ratio tests (LRT) are shown (significance threshold: p < 0.05).
κ transition/transversion ratio, np number of parameters estimated by the model, model estimates: ω estimated ω for the category of sites under purifying selection (ω < 1), ω estimated ω for sites for the category under neutral evolution (ω ~ 1), ω estimated ω for sites under positive selection in the foreground branches against background branches under purifying selection, ω estimated ω for sites under positive selection in the foreground branches against background branches under neutral evolution, df degrees of freedom, NS non significant.
* Significant at 0.05; ** significant at 0.025; *** significant at 0.01; **** significant at 0.005; ***** significant at 0.001; BEB >0.95: sites identified by Bayes Empirical Bayes analysis with a posterior probability greater than 95%.
Fig. 23D models of the globins found in the different species studied, with their heme group. a The B. symmytilida model was obtained based on L. terrestris Hb structure; residues under positive selection are labeled in white (27V, 30A, 52Q and 65C). b The B. reveeana model is based on the tertiary structure of S. inaequivalis Hb; the residue under positive selection is shown (31G). c Ascaris D2 model was built based on the di-domain, D1 crystal (1ash); residue under positive selection on D1 is depicted (11A), name of the residues with possible interactions with amino acid in position 11 are shown in grey. A comparative scenario is presented for Ascaris D1 and D2 with the interacting residues being in equivalent positions in the protein tertiary structure. Helices are represented in a color spectrum depicting helix A in blue all through helix H in red. Depicted residues are colored by element; OH: red; NH: blue: C: helix color. Heme group is shown in light grey. See “Methods” and “Results” sections for detailed information.
Fig. 3Theoretical 3D model of an annelid tetramer based on B. symmytilida D1 sequence and the crystal structure of Lumbricus terrestris (1X9F). Amino acids identified as potentially under positive selection are highlighted. 52Q appears to be in an area where it could participate to the formation of an oligomer. Depicted residues are colored by element; OH: red; NH: blue: C: helix color. Heme group is shown in grey. Model constructed with Modeller (Mod9v13, Eswar et al. 2006).