| Literature DB >> 25371435 |
Mathieu Groussin1, Joanne K Hobbs2, Gergely J Szöllősi3, Simonetta Gribaldo4, Vickery L Arcus2, Manolo Gouy5.
Abstract
The resurrection of ancestral proteins provides direct insight into how natural selection has shaped proteins found in nature. By tracing substitutions along a gene phylogeny, ancestral proteins can be reconstructed in silico and subsequently synthesized in vitro. This elegant strategy reveals the complex mechanisms responsible for the evolution of protein functions and structures. However, to date, all protein resurrection studies have used simplistic approaches for ancestral sequence reconstruction (ASR), including the assumption that a single sequence alignment alone is sufficient to accurately reconstruct the history of the gene family. The impact of such shortcuts on conclusions about ancestral functions has not been investigated. Here, we show with simulations that utilizing information on species history using a model that accounts for the duplication, horizontal transfer, and loss (DTL) of genes statistically increases ASR accuracy. This underscores the importance of the tree topology in the inference of putative ancestors. We validate our in silico predictions using in vitro resurrection of the LeuB enzyme for the ancestor of the Firmicutes, a major and ancient bacterial phylum. With this particular protein, our experimental results demonstrate that information on the species phylogeny results in a biochemically more realistic and kinetically more stable ancestral protein. Additional resurrection experiments with different proteins are necessary to statistically quantify the impact of using species tree-aware gene trees on ancestral protein phenotypes. Nonetheless, our results suggest the need for incorporating both sequence and DTL information in future studies of protein resurrections to accurately define the genotype-phenotype space in which proteins diversify.Entities:
Keywords: ancestral sequence reconstruction; gene tree reconciliation; lateral gene transfer; phylogeny; protein evolution; protein resurrection
Mesh:
Substances:
Year: 2014 PMID: 25371435 PMCID: PMC4271536 DOI: 10.1093/molbev/msu305
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FSchematic illustration of the impact of gene tree/species tree reconciliation on the topology of the gene tree. The gene family under consideration evolves along the species tree shown in (a). In this example, two copies of a gene are present in each of the species, genes in species A are denoted a1 and a2, genes in B by b1 and b2 etc. The gene tree reconstructed using the traditional S-unaware method that optimizes a sequence evolution-based score is shown in (b). It contains poorly supported phylogenetic relationships, including branches in red that conflict with the species tree. On the left part of the tree in (b), the conflict is strongly supported by the sequence information and conserved in the reconciled tree shown in (c), suggesting that a horizontal gene transfer likely occurred between species B and C. In the middle part of the tree in (b), the support for a possible transfer is low (30/100), indicating that a gene tree with a higher joint likelihood probably exists. In the reconciled S-aware tree (c) that optimizes a joint sequence evolution and gene family evolution score (here the joint likelihood that considers both the sequence substitutions and the duplication, transfer, and loss of genes), some of the conflicts are resolved (blue branches), because, while the sequence evolution component of joint likelihood is slightly lower, the gene family evolution component is significantly improved. Reconciling the gene tree (c) with the species tree (a) requires a horizontal transfer and a gene duplication predating the divergence of species A, B, and C. In contrast, the reconciliation of the gene tree (b) requires, beside the statistically supported gene transfer, at least two losses, another transfer, and a duplication.
FImpact of the phylogenetic tree on ASR. (a) Phylogenetic reconstruction accuracy. Robinson–Foulds distances were computed between S-unaware trees (LG or C60) or S-aware trees and the “true” tree. The exODT model is the reconciliation model described in Szöllősi, Tannier, et al. (2013) (b) ASR accuracy depending on the phylogenetic tree. Distances between inferred and true ancestral sequences were computed for nodes defining similar monophyletic clades between the S-unaware or S-aware tree and the true tree. ***P value < 0.001; NS, nonsignificant.
FReconstruction errors also affect slow-evolving sites with S-unaware trees. For a given gene tree reconstruction method, the pattern of reconstruction errors for all internal nodes over the 100 simulations was analyzed in the light of site-specific (posterior) evolutionary rates. Rates were computed a posteriori with the use of the Gamma distribution of site rates used during the ML reconstruction of ancestral sequences. Orange: Species tree-unaware gene trees. Blue: Species tree-aware gene trees. The two data series are overlapped.
Kinetic Constants, Thermoactivity, and Biophysical Parameters for the Ancestral LeuB Enzyme from the Firmicutes Ancestor.
| Enzyme | |||||
|---|---|---|---|---|---|
| BPSYC | 0.2 | 0.6 | 6.5 | 47 | 94.9 ± 0.2 |
| BSUB | 0.7 | 8.1 | 48.7 | 53 | 95.9 ± 0.5 |
| BCVX | 1.1 | 0.8 | 53.8 | 69 | 100.7 ± 0.2 |
| ANC1 | 1.3 | 0.5 | 141.8 | 73 | 100.9 ± 0.5 |
| ANC2 | 1.0 | 0.9 | 41.7 | 49 | 91.1 ± 0.4 |
| ANC3 | 2.7 | 1.0 | 102.3 | 60 | 95.6 ± 0.1 |
| ANC4 | 1.7 | 1.0 | 362.2 | 70 | 110.8 ± 0.4 |
Note.—Values obtained in this study for the ancestor of the Firmicutes (italic characters) were inferred using either the LeuB S-unaware tree or the LeuB S-aware reconciled tree, with the site-heterogeneous EX_EHO model. Data for contemporary (first three lines) and other ancestral LeuBs for Bacillus (ANC1-4) characterized in Hobbs et al. (2012) are shown for comparison. Errors for are the standard error in the calculation of from ≥5 measurements of the unfolding rate in different urea concentrations.
FResurrection of ancestral LeuBs and impact of the phylogenetic tree on biological interpretations. (a) Thermoactivity profiles for ancestral LeuB enzymes. Blue curve: LeuB–unaw. Green curve: LeuB–aw. (b) Unfolding rates of ancestral LeuB enzymes. Unfolding rates are shown in 8 M urea as a decrease in intrinsic protein fluorescence with time. Colors are the same as in (a).