| Literature DB >> 35588054 |
Grzegorz Kudla1, Marcin Plech1.
Abstract
Using a neural network to predict how green fluorescent proteins respond to genetic mutations illuminates properties that could help design new proteins.Entities:
Keywords: E. coli; GFP; computational biology; evolutionary biology; fitness landscape; machine learning; molecular evolution; protein engineering; systems biology
Mesh:
Substances:
Year: 2022 PMID: 35588054 PMCID: PMC9119673 DOI: 10.7554/eLife.79310
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.713
Figure 1.The fitness landscape of green fluorescent proteins.
Fitness landscapes provide a graphical representation of how a protein’s genetic sequence relates to its performance, leading to a multidimensional surface made up of peaks, ridges, and valleys. In the fitness landscape shown, horizontal distance represents the number of mutations that separate variants of a protein, while vertical elevation represented by contour lines indicates the fluorescence of each mutant. Two naturally occurring green fluorescent proteins (GFPs; dots outlined in black) reside on different peaks of the landscape (top left and top right) and are connected by a narrow ridge (area of high fitness). Mutant proteins at the peaks and ridges are all functional and able to fluoresce (green dots), whereas those in the valleys are non-functioning (grey dots). Application of a machine learning algorithm expanded the fitness landscape (right; blue contour lines) by including mutations that are not generated by evolution. This led to the creation of functional, synthetic variants (green dot, bottom right) that reside on different fitness peaks to variants that are naturally occurring.