| Literature DB >> 24710290 |
Jessica Siltberg-Liberles1, Johan A Grahnen2, David A Liberles3.
Abstract
Protein sequence, structure, and function are inherently linked through evolution and population genetics. Our knowledge of protein structure comes from solved structures in the Protein Data Bank (PDB), our knowledge of sequence through sequences found in the NCBI sequence databases (http://www.ncbi.nlm.nih.gov/), and our knowledge of function through a limited set of in-vitro biochemical studies. How these intersect through evolution is described in the first part of the review. In the second part, our understanding of a series of questions is addressed. This includes how sequences evolve within structures, how evolutionary processes enable structural transitions, how the folding process can change through evolution and what the fitness impacts of this might be. Moving beyond static structures, the evolution of protein kinetics (including normal modes) is discussed, as is the evolution of conformational ensembles and structurally disordered proteins. This ties back to a question of the role of neostructuralization and how it relates to selection on sequences for functions. The relationship between metastability, the fitness landscape, sequence divergence, and organismal effective population size is explored. Lastly, a brief discussion of modeling the evolution of sequences of ordered and disordered proteins is entertained.Entities:
Year: 2011 PMID: 24710290 PMCID: PMC3927589 DOI: 10.3390/genes2040748
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1A possible conformational energy landscape for a typical structured protein. The protein has two alternative folding pathways (top), proceeding from the unfolded state (U) to the native state (N) through one (I2) or two (I1A, I1B) intermediate conformations. The funnel-shaped landscape guarantees rapid folding to the native state, passing various metastable states with different rates of interconversion on the way. The shaded area near the native state indicates the magnitude of change in folding energy that is selectively neutral (dependent upon to population size Ne and selective pressure s).
Figure 2Evolution of an energy landscape and its conformational ensemble after gene duplication. At the root, the gene giving rise to the protein with the blue energy landscape resulting in conformations A to G is duplicated. At the next speciation event we can see that the two different gene copies have evolved along different trajectories. The blue copy at the speciation node has evolved under negative selection and resembles the ancient blue. The green copy at the speciation node has evolved under positive selection and of the original conformational ensemble, conformations F and G are no longer forming, but a new conformation, H, is forming. In addition, the equilibrium of the conformations is different in the blue vs. green energy landscapes. From the speciation node down to the extant sequences, blue is much conserved, while green although under negative selection, will lose conformation D, in one lineage. Analysis of the extant sequences would show that blue and green are structurally disordered homologs. However, although all these proteins are structurally disordered, the conformational ensembles differ between blue and green (while being the same within the blue copies, and very similar within the green copies.)