| Literature DB >> 35709156 |
Sophia M Hartley1, Kelly A Tiernan1, Gjina Ahmetaj1, Adriana Cretu1, Yan Zhuang2, Marc Zimmer1.
Abstract
AlphaFold2 and RoseTTAfold are able to predict, based solely on their sequence whether GFP-like proteins will post-translationally form a chromophore (the part of the protein responsible for fluorescence) or not. Their training has not only taught them protein structure and folding, but also chemistry. The structures of 21 sequences of GFP-like fluorescent proteins that will post-translationally form a chromophore and of 23 GFP-like non-fluorescent proteins that do not have the residues required to form a chromophore were determined by AlphaFold2 and RoseTTAfold. The resultant structures were mined for a series of geometric measurements that are crucial to chromophore formation. Statistical analysis of these measurements showed that both programs conclusively distinguished between chromophore forming and non-chromophore forming proteins. A clear distinction between sequences capable of forming a chromophore and those that do not have the residues required for chromophore formation can be obtained by examining a single measurement-the RMSD of the overlap of the central alpha helices of the crystal structure of S65T GFP and the AlphaFold2 determined structure. Only 10 of the 578 GFP-like proteins in the pdb have no chromophore, yet when AlphaFold2 and RoseTTAFold are presented with the sequences of 44 GFP-like proteins that are not in the pdb they fold the proteins in such a way that one can unequivocally distinguish between those that can and cannot form a chromophore.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35709156 PMCID: PMC9202861 DOI: 10.1371/journal.pone.0267560
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1The cyclization-dehydration-oxidation pathway for chromophore formation.
[26] Structure I is known as the immature precyclized form of the protein. The tight-turn distance depicted in structure I was measured for all sequences and is presented in S1 Data. Structure V shows the fully formed mature chromophore (green).
Fig 2RMSD overlap of the α-helix of the 1EMA-crystal structure with the α-helix of 1EMA as determined by AlphaFold2 for GFP-like proteins that will form a chromophore (SYG) and those that do not (no SYG).