Literature DB >> 35709156

AlphaFold2 and RoseTTAFold predict posttranslational modifications. Chromophore formation in GFP-like proteins.

Sophia M Hartley1, Kelly A Tiernan1, Gjina Ahmetaj1, Adriana Cretu1, Yan Zhuang2, Marc Zimmer1.   

Abstract

AlphaFold2 and RoseTTAfold are able to predict, based solely on their sequence whether GFP-like proteins will post-translationally form a chromophore (the part of the protein responsible for fluorescence) or not. Their training has not only taught them protein structure and folding, but also chemistry. The structures of 21 sequences of GFP-like fluorescent proteins that will post-translationally form a chromophore and of 23 GFP-like non-fluorescent proteins that do not have the residues required to form a chromophore were determined by AlphaFold2 and RoseTTAfold. The resultant structures were mined for a series of geometric measurements that are crucial to chromophore formation. Statistical analysis of these measurements showed that both programs conclusively distinguished between chromophore forming and non-chromophore forming proteins. A clear distinction between sequences capable of forming a chromophore and those that do not have the residues required for chromophore formation can be obtained by examining a single measurement-the RMSD of the overlap of the central alpha helices of the crystal structure of S65T GFP and the AlphaFold2 determined structure. Only 10 of the 578 GFP-like proteins in the pdb have no chromophore, yet when AlphaFold2 and RoseTTAFold are presented with the sequences of 44 GFP-like proteins that are not in the pdb they fold the proteins in such a way that one can unequivocally distinguish between those that can and cannot form a chromophore.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35709156      PMCID: PMC9202861          DOI: 10.1371/journal.pone.0267560

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


Introduction

AlphaFold2 [1, 2] and RoseTTAfold [3] are two freely available programs that can predict three-dimensional protein structures from their amino acid sequence with atomic accuracy. Both programs were created by machine learning and the ~180,000 structures in the protein data bank (pdb) [4, 5] were used as an important training set. The three-dimensional structures of many of the protein structures in the pdb have been influenced by the proteins binding ligands and post-translation modifications. Agirre and co-workers have shown that AlphaFold2 predicts the folding of glycosylated proteins solely from the amino acid sequence of the proteins because the program was trained on the pdb in which the glycosylated proteins are found in their full or partially glycanated forms [6]. In the absence of its heme cofactor AlphaFold2 will fold a hemoglobin subunit so that it has a cavity complementary with a heme subunit [6, 7] and it will identify iron-sulfur cluster and zinc binding sites [8]. Finally, AlphaFold2 has been shown to generate protein-peptide complex structures without multiple alignments of the peptide fragment [9]. Fluorescent Proteins (FPs) are commonly used molecular tracer molecules. A Nobel Prize was awarded in 2008 for the development of GFP as a “tagging tool in bioscience” [10] and, in 2014, for the use of fluorescent proteins in “the development of super-resolved fluorescence microscopy” [11]. Four books [12-15] and numerous reviews [16-21] have been written about fluorescent proteins. It is their post-translational autocatalytic chromophore formation that makes these genetic tracer molecules so useful. They are all folded into a β-barrel composed of eleven β sheets surrounding the chromophore that is located in a central alpha helix. Some Green Fluorescent Proteins-like (GFP-like) proteins fold into the characteristic β-barrel shape but do not form a chromophore. The training set (pdb) is heavily weighted towards structures with a chromophore, less than 2% of the GFP-like structures in the pdb have no chromophore. There are 578 GFP-like proteins in the pdb, 568 of these structures have a fully formed chromophore, they are the GFP-like fluorescent proteins (GFP-FPs) and 10 have no chromophore. The commonly accepted mechanism for chromophore formation in GFP-like proteins is shown in Fig 1. In order for the autocatalytic chromphore formation to occur the immature GFP-like protein has to be in the tight-turn conformation (Fig 1) [22]. This tight turn breaks the canonical i to i+4 hydrogen bonding arrangement found in alpha helices [23], forming a kink that removes the intra main-chain hydrogen bonds that are commonly associated with an α-helix (S1 Fig). It is presumed that this aids in chromophore formation because the hydrogen bonds would otherwise have to be broken during maturation. Tyr66, Gly65, Arg96 and Glu222 (GFP numbering) are involved in chromophore formation and are highly conserved in all naturally occurring GFP-like FPs [24, 25]. The central tyrosine of the chromophore, Tyr66, is conserved in all naturally-occurring GFP-like FPs, although any aromatic residue in that position will auto-catalytically form a chromophore [24]. Thus, we have chemical knowledge that can be used to predict whether a GFP-like protein will form a chromophore or not. A GFP-like protein has to have a glycine65, an aromatic residue in position 66 and an arginine96 to form a chromophore. This is chemical knowledge that was not directly provided to AlphaFold2 or RoseTTAFold.
Fig 1

The cyclization-dehydration-oxidation pathway for chromophore formation.

[26] Structure I is known as the immature precyclized form of the protein. The tight-turn distance depicted in structure I was measured for all sequences and is presented in S1 Data. Structure V shows the fully formed mature chromophore (green).

The cyclization-dehydration-oxidation pathway for chromophore formation.

[26] Structure I is known as the immature precyclized form of the protein. The tight-turn distance depicted in structure I was measured for all sequences and is presented in S1 Data. Structure V shows the fully formed mature chromophore (green). We have used AlphaFold2 and RoseTTAFold to generate the structures of two series of GFP-like sequences; one that can form a chromophore and the other that can’t. And have shown that both programs fold the chromophore forming GFP-like sequences into a distinctly different conformation compared to the sequences that cannot form a chromophore. The structures in the training set taught the AI programs chemistry to distinguish which proteins will be post-translationally modified from those that will not. This was highly unexpected.

Materials and methods

Sequence selection

Standard structures

Three GFP-like proteins with known solid-state structures were selected: 1EMA, 2AWJ, 1H4U (pdb codes). 1EMA is the crystal structure of the S65T mutant of GFP. Together with 1GFL [27] it was the first crystal structure of GFP to be solved [28]. It is the structure of a fluorescent protein that has a mature chromophore formed from the 65TYG67 triad of residues (Fig 1). 2AWJ is an immature precyclized GFP-like protein structure. It is the R96M mutant that takes about 3 months to form the chromophore [29]. Arginine and methionine have similar sizes, allowing us to safely assume that the conformation of immature wild-type GFP and its R96M mutant would be the same. The crystal structure of the R96M mutant is the closest solid-state structure to the immature GFP precyclized structure (Fig 1). 1H4U is the structure of the G2 domain of mouse nidogen. It is a GFP-like structure with a central alpha helix in an 11 stranded beta barrel. It has no chromophore and cannot form a chromophore [30].

Sequences of chromophore forming GFP-like proteins

Amino acid sequences of GFP-like fluorescent proteins that do not appear in the pdb were obtained from FPbase [31]. The chromophore forming GFP-like FP sequences were chosen from the FPBase in a way that they spanned as much phylogenetic space possible and had not been crystalized i.e. they did not appear in the pdb. Protein sequences were selected from each FP lineage, starting from the parent protein. If the parent protein’s crystal structure was already solved, then the next protein sequence in the lineage closest to the parent was selected. No sequence was used from lineages in which all structures have been solved (DrCBD) and far-red FPs with a tetrapyrroloid chromphores as these FPs are not GFP-like β-barrels.

Sequences of GFP-like proteins that cannot form a chromophore

Sequences of GFP-like non-fluorescent proteins were taken from Haddock’s “Non-excitable fluorescent protein orthologs found in ctenophores” paper [32] and a series of nidogen-G2 domains were obtained from a standard protein BLAST search (BLASTP) using the 1H4U FASTA sequence as a protein query (The KAB1270852.1 Nidogen-1 Camelus dromedarius sequence was chosen to honor the fact that the Connecticut College mascot is the camel). Only the G2 fragment region of the nidogen sequences were used in the AI alignments and the subsequent analyses. See S1 Table for sequences studied and their sequence similarity to S65T GFP (1EMA).

AI predictions

RoseTTAFold was accessed via the RosettaCommons web-interface server Robetta [33] and simplified AlphaFold(v2.1.0) was accessed through AlphaFold’s Colab notebook [34].

Alignments

MAFFT DASH [35] was used for the multiple alignment and Clustal Omega [36] to find the percent identity matrix of all of the proteins relative to 1EMA, see S1 Table. All residues aligned with residues 60 to 74 of 1EMA were considered part of the central alpha-helix (S2 Fig).

Measurements

Maestro Version 12.9.123 [37] was used to perform hydrogen bond measurements of the structures obtained from pdb [5], Alphafold2 and RoseTTAFold. RMSD values for the alpha helical overlaps were obtained by determining the pairwise distances between structures, using the rms displacement after optimal rigid-body superposition between pairs of non-hydrogen backbone atoms of residues highlighted in the alignment shown in S2 Fig of the supplementary material [38, 39]. All the measurements are presented in S1 Data.

Results and discussion

Structures examined

AlphaFold2 and RoseTTAFold were used to predict the 3-dimensional structure of a series of GFP-like proteins—a set of 3 standard structures with known solid state structures; a group of 21 GFP-like fluorescent proteins that will post-translationally form a chromophore and whose solid state structures have not been determined; as well as a group of 23 GFP-like non-fluorescent proteins that do not have the residues required to form a chromophore and whose solid state structures have not been determined. S1 Table lists all the sequences and how they were obtained.

How does AI deal with the chromophore?

Since the chromophore formation involves post-translational modification of the 65TYG67 triad we can’t expect the AI programs to model the structure of GFP-like FPs with a fully formed chromophore (Structure V in Fig 1). The central alpha-helical strand (residues 60 to 74) of the structure obtained from the AI predicted folding of the S65T GFP sequence (as found in 1EMA) was overlapped with the pdb structure of 1EMA. Both AlphaFold2 and RoseTTAFold predict a structure closest to the immature precyclized form (structure I in Fig 1, a structure similar to the one obtained if one was to graphically mutate the chromophore back to 65TYG67 and then minimize), a conformation most closely represented by crystal structure of the R96A mutant 2AWJ (RMSD for AF2 = 0.32Å and RF = 0.40Å). The alpha helical overlaps with the regular alpha helix observed in the nidogen G2 domains (RMSD vs 1H4U for AF2 = 1.65Å and RF = 1.68Å) or even vs. its own crystal structure (RMSD vs 1EMA for AF2 = 0.78Å and RF = 0.80Å) are much higher, see S1 Data.

Can AI folding programs predict whether GFP-like proteins will form a chromophore or not?

AlphaFold2 and RoseTTAFold were used to predict the 3-dimensional structures of 21 GFP-like fluorescent proteins that will post-translationally form a chromophore as well as a group of 23 GFP-like non-fluorescent proteins that do not have the residues required to form a chromophore. All sequences folded into GFP-like 11 stranded β-barrels with a central alpha helix. The resultant structures were analyzed to establish whether according to the chromophore forming mechanism discussed in the introduction and shown in Fig 1, they were in conformations geometrically primed for chromophore formation or whether they could not form a chromophore. We overlapped their central alpha helical strands with the strands obtained from the crystal structures of 1EMA (has chromophore), 2AWJ (has no chromophore but is in the right conformation to form one) and 1H4U (has no chromophore and is in the wrong conformation to form chromophore); their tight-turns were measured, as were the i to i+4 hydrogen bonding distances in their alpha helices.

Statistical analysis of the AlphaFold2 predictions

From the boxplot, Fig 2, as well as the data presented in S1 Data and S2 Table, it is apparent that the RMSD for α-helix overlap of the AlphaFold2 predicted structures with 1EMA-crystal is much higher for those that will not form a chromophore than those that do. Moreover, the Welch two sample t-test shows there is a significant difference in the mean values of α-helix overlap with 1EMA-crystal between GFP-like proteins that will form a chromophore and those that do not (t-value = 10.646 and p-value = 7.77 x 10−12). The average RMSD for GFP-like proteins that will not form a chromophore is 1.372Å while the average value for GFP-like proteins that will form a chromophore is 0.799Å.
Fig 2

RMSD overlap of the α-helix of the 1EMA-crystal structure with the α-helix of 1EMA as determined by AlphaFold2 for GFP-like proteins that will form a chromophore (SYG) and those that do not (no SYG).

For the structures predicted by AlphaFold2 one only needs to compare the predicted alpha helical structure with that of the S65T GFP pdb structure to know whether the GFP-like protein will form a chromophore or not. A LASSO regression model was built using “can form chromophore” (= 1) and “cannot form chromophore” (= 0) as the response variable with all the measurements including Alpha helix overlap, tight turn distance, as well as H-Bond distances in angstrom. The final model includes the following predictors—the alpha helix overlap with 1EMA-crystal as well as H-bond distances in Angstrom between residues 61–65 (HD2), 62–66 (HD3), 70–74 (HD11). The model shows there is a clear distinction between the sequences capable of chromophore forming and those that don’t have the residues required for chromophore formation, see S3 Fig and S3 Table. A model built on just the measurements of the hydrogen bonding distances is also able to distinguish between chromophore and non-chromophore forming structures.

Statistical analysis of the RoseTTAFold predictions

The RMSD for the α-helix overlap of the RoseTTAFold predicted structures with 1EMA-crystal structures is generally higher for those structures that will not form a chromophore than those that do. However the appropriate boxplot (S4 Fig) as well as the data presented in S1 Data and S4 Table, show that while RoseTTAFold can distinguish between posttranslationally modified and non-posttranslationally structures it is not as distinctive as AlphaFold2. The Welch two sample t-test shows that there is a significant difference in the mean values for alpha helix overlap with 1EMA-crystal between GFP-like proteins that will form a chromophore and those that do not (t-value = 4.699 and p-value = 3.526 x 10−5). The average value for GFP-like proteins that will not form a chromophore is 1.317 while the average value for GFP-like proteins that will form a chromophore is 1.003. For data collected using RoseTTaFold, we built a LASSO regression model using “can form chromophore” (= 1) and “cannot form chromophore” (= 0) as the response variable. The selected predictors of final model are: the RMSD of the alpha helix overlap with 1EMA-crystal and 2AWJ-crystal, the tight turn distances, as well as H-bond distances in Angstrom between residues 61 and 65 (HD2), 62–66 (HD3), 63–67 (HD4), 64–68 (HD5), see S5 Table and S5 Fig. From the prediction results, we can see there is a clear distinction between the sequences capable of forming a chromophore and those that can’t. A model built on just the measurements of hydrogen bonding distances is also able to distinguish between chromophore and non-chromophore forming structures.

Conclusions

Only 10 of the 578 GFP-like proteins in the pdb have no chromophore, yet when AlphaFold2 and RoseTTAFold are presented with the sequences of 44 GFP-like proteins that are not in the pdb they fold the proteins in such a way that one can unequivocally distinguish between those that can and cannot form a chromophore. They predict the conformation of the immature protein with a kink in the α-helix (as expected from machine learning vs. memorization) and have used their training set to learn some chemistry, they can distinguish between GFP-like proteins that will form a chromophore and those that do not. We suspect that the pdb training set and multiple sequence alignments enable AlphaFold2 and RoseTTAFold to “think” like chemists and look for the presence of residues equivalent to Arg96, Gly65 and an aromatic residue at position 66 in GFP–the residues required for chromophore formation–and use those to fold the sequence in a chromophore forming or non-chromophore forming conformations.

Main chain i to i+4 “hydrogen bonding” alpha helical interactions measured and presented in S1 Data.

(DOCX) Click here for additional data file.

Alignment of all sequences used.

(DOCX) Click here for additional data file.

Predicted possibility that structure forms a chromophore or cannot form a chromophore using LASSO model from S2 Table on AlphaFold2 data.

(DOCX) Click here for additional data file.

RMSD overlap of the α-helix of the 1EMA-crystal with the α-helix of 1EMA as determined by RoseTTAFold for GFP-like proteins that will form a chromophore and those that do not.

(DOCX) Click here for additional data file.

Predicted possibility that structure forms a chromophore or cannot form a chromophore using LASSO model from S4 Table, which is based on RoseTTAFold data.

(DOCX) Click here for additional data file.

RMSD overlap of the α-helix of the 1EMA, 2AWJ, 1H4U crystal structures with the α-helix of the RoseTTAFold and AlphaFold determined structures, tight turn distances of RoseTTAFold and AlphaFold determined structures and the main chain i to i+4 “hydrogen bonding” alpha helical interactions.

These are the geometric measurements that are crucial to chromophore formation. The structures are divided into two groups, GFP-like proteins that will form a chromophore and those that do not. (XLSX) Click here for additional data file.

Description of all sequences used in this study [28–30, 32, 40–62].

(DOCX) Click here for additional data file.

Summary statistics of the RMSD overlap (in Angstrom) of the α-helix of the 1EMA-crystal with the α-helix of 1EMA as determined by AlphaFold2 for GFP-like proteins that will form a chromophore and those that do not.

(DOCX) Click here for additional data file.

LASSO model results for the alpha helix overlap with 1EMA-crystal as well as H-bond distances in angstrom between residues 61–65 (HD2), 62–66 (HD3), 70–74 (HD11) collected using AlphaFold2.

(DOCX) Click here for additional data file.

Summary statistics of the RMSD overlap (in Angstrom) of the α-helix of the 1EMA-crystal with the α-helix of 1EMA as determined by RoseTTAFold for GFP-like proteins that will form a chromophore and those that do not.

(DOCX) Click here for additional data file.

LASSO model results for the RMSD of the alpha helix overlap with 1EMA and 2AWJ-crystal structures, the tight turn distances, as well as H-bond distances in Angstrom between residues 61 and 65 (HD2), 62–66 (HD3), 63–67 (HD4), 64–68 (HD5) collected using RoseTTAFold simulations.

(DOCX) Click here for additional data file. 31 Mar 2022
PONE-D-22-04782
AlphaFold2 and RoseTTAFold Predict Posttranslational modifications. Chromophore Formation in GFP-like Proteins.
PLOS ONE Dear Dr. Marc Zimmer, Thank you for submitting your manuscript to PLOS ONE. Apologies for the delay in getting back to you. Seven reviewers were contacted and out of them only one answered. After careful consideration of the review, we feel that your manuscript has merit but needs the minor amendements denoted by the reviewer to fully meet PLOS ONE’s publication criteria . Therefore, we invite you to submit a revised version of the manuscript that convincingly addresses the points raised during the review process. Please submit your revised manuscript by May 15  2022. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Maria Gasset, Ph.D. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf. 2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ. 3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: I Don't Know ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors report on a very interesting computational approach to figure out whether green fluorescent protein-like proteins fluoresce, based on their amino acid sequence. They use the AlphaFold2 and RoseTTAFold artificial intelligence software which was trained on 180,000 protein structures in the protein data bank to predict the folded protein with atomic resolution. This data base contains 578 GFP-like proteins of which 10 have no chromophore, i.e. do not fluoresce. (It wasn’t clear to me whether they can still absorb, though, like a dark acceptor in FRET?) The present manuscript presents the amino acid sequences of 44 GFP-like proteins that are not in the protein data bank to AlphaFold2 and RoseTTAFold, and the software calculates the 3-dimensional structure of these proteins to atomic resolution. While the precise solid-state structure of these 44 proteins has not been determined experimentally, importantly, it is known from experiment whether they fluoresce or not. The software correctly folds the amino acid sequences into a fluorophore for the fluorescent proteins, and no fluorophore for the ones that do not fluoresce, if I understand this correctly. I think is a phenomenal result, and the link to experiment via fluorescence is ingenious – it requires no details apart from “yes” or “no” of fluorescence, and no details of spectra or lifetime, which essentially makes it a powerful binary approach (0 or 1, so to speak). The manuscript is very well written and easy to follow. The work will be of interest to the research community developing (and also using) fluorescent proteins, and those predicting protein structure from amino acid sequences in general. It will also be of interest for researchers in artificial intelligence, machine learning and other computational and computer-simulation approaches. I only have some minor comments: 1) Maybe a figure with a representative structure of a GFP-like protein with fluorophore investigated in this study would be useful, as well as such a protein without a fluorophore. 2) By chromophore, the authors mean a structure that fluoresces, not one that just absorbs light, is that correct? ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.
6 Apr 2022 We were pleased to see that the reviewer was so positive about the manuscript and only had two minor suggestions. Here are our responses to the suggestions. 1) Maybe a figure with a representative structure of a GFP-like protein with fluorophore investigated in this study would be useful, as well as such a protein without a fluorophore. I don’t think a figure such as the one suggested above will contribute much to the understanding of the work presented. They are essentially the same because the deep learning programs show the conformation prior to post-translational cyclization and do not show the mature chromophore. However, we would be happy to insert a figure if desired. (See response to reviewers .doc file) 2) By chromophore, the authors mean a structure that fluoresces, not one that just absorbs light, is that correct? Yes, we have clarified this by adding the following definition at the first occurrence of the word chromophore. “chromophore (the part of the protein responsible for fluorescence)”. Submitted filename: Response to Reviewers.doc Click here for additional data file. 12 Apr 2022 AlphaFold2 and RoseTTAFold Predict Posttranslational modifications. Chromophore Formation in GFP-like Proteins. PONE-D-22-04782R1 Dear Dr. Marc Zimmer, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Maria Gasset, Ph.D. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: I Don't Know ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Now I agree that a figure of a fluorescent and non-fluorescent GFP would be of limited use, as it is dominated by the barrel, not the all-important fluorophore. I leave it to the authors to decide whether to include this figure or not, it may be helpful for non-experts in the field. The chromophore definition is definitely useful, and I look forward to seeing this nice work published. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 27 May 2022 PONE-D-22-04782R1 AlphaFold2 and RoseTTAFold predict posttranslational modifications. Chromophore formation in GFP-like proteins. Dear Dr. Zimmer: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Maria Gasset Academic Editor PLOS ONE
  49 in total

1.  Fluorescent proteins from nonbioluminescent Anthozoa species.

Authors:  M V Matz; A F Fradkov; Y A Labas; A P Savitsky; A G Zaraisky; M L Markelov; S A Lukyanov
Journal:  Nat Biotechnol       Date:  1999-10       Impact factor: 54.908

2.  Optimized and far-red-emitting variants of fluorescent protein eqFP611.

Authors:  Simone Kredel; Karin Nienhaus; Franz Oswald; Michael Wolff; Sergey Ivanchenko; Florian Cymer; Andreas Jeromin; Francois J Michel; Klaus-Dieter Spindler; Ralf Heilker; G Ulrich Nienhaus; Jörg Wiedenmann
Journal:  Chem Biol       Date:  2008-03

Review 3.  The Growing and Glowing Toolbox of Fluorescent and Photoactive Proteins.

Authors:  Erik A Rodriguez; Robert E Campbell; John Y Lin; Michael Z Lin; Atsushi Miyawaki; Amy E Palmer; Xiaokun Shu; Jin Zhang; Roger Y Tsien
Journal:  Trends Biochem Sci       Date:  2016-11-01       Impact factor: 13.807

4.  Crystal structure of the Aequorea victoria green fluorescent protein.

Authors:  M Ormö; A B Cubitt; K Kallio; L A Gross; R Y Tsien; S J Remington
Journal:  Science       Date:  1996-09-06       Impact factor: 47.728

5.  Understanding GFP chromophore biosynthesis: controlling backbone cyclization and modifying post-translational chemistry.

Authors:  David P Barondeau; Carey J Kassmann; John A Tainer; Elizabeth D Getzoff
Journal:  Biochemistry       Date:  2005-02-15       Impact factor: 3.162

6.  The Role of the Tight-Turn, Broken Hydrogen Bonding, Glu222 and Arg96 in the Post-translational Green Fluorescent Protein Chromophore Formation.

Authors:  Nathan P Lemay; Alicia L Morgan; Elizabeth J Archer; Luisa A Dickson; Colleen M Megley; Marc Zimmer
Journal:  Chem Phys       Date:  2008-06-02       Impact factor: 2.348

Review 7.  The fluorescent protein palette: tools for cellular imaging.

Authors:  Richard N Day; Michael W Davidson
Journal:  Chem Soc Rev       Date:  2009-08-04       Impact factor: 54.564

8.  Molecular evolution of versatile derivatives from a GFP-like protein in the marine copepod Chiridius poppei.

Authors:  Akihisa Shimizu; Ikuo Shiratori; Katsunori Horii; Iwao Waga
Journal:  PLoS One       Date:  2017-07-11       Impact factor: 3.240

9.  Highly accurate protein structure prediction for the human proteome.

Authors:  John Jumper; Demis Hassabis; Kathryn Tunyasuvunakool; Jonas Adler; Zachary Wu; Tim Green; Michal Zielinski; Augustin Žídek; Alex Bridgland; Andrew Cowie; Clemens Meyer; Agata Laydon; Sameer Velankar; Gerard J Kleywegt; Alex Bateman; Richard Evans; Alexander Pritzel; Michael Figurnov; Olaf Ronneberger; Russ Bates; Simon A A Kohl; Anna Potapenko; Andrew J Ballard; Bernardino Romera-Paredes; Stanislav Nikolov; Rishub Jain; Ellen Clancy; David Reiman; Stig Petersen; Andrew W Senior; Koray Kavukcuoglu; Ewan Birney; Pushmeet Kohli
Journal:  Nature       Date:  2021-07-22       Impact factor: 69.504

10.  Highly accurate protein structure prediction with AlphaFold.

Authors:  John Jumper; Richard Evans; Alexander Pritzel; Tim Green; Michael Figurnov; Olaf Ronneberger; Kathryn Tunyasuvunakool; Russ Bates; Augustin Žídek; Anna Potapenko; Alex Bridgland; Clemens Meyer; Simon A A Kohl; Andrew J Ballard; Andrew Cowie; Bernardino Romera-Paredes; Stanislav Nikolov; Rishub Jain; Demis Hassabis; Jonas Adler; Trevor Back; Stig Petersen; David Reiman; Ellen Clancy; Michal Zielinski; Martin Steinegger; Michalina Pacholska; Tamas Berghammer; Sebastian Bodenstein; David Silver; Oriol Vinyals; Andrew W Senior; Koray Kavukcuoglu; Pushmeet Kohli
Journal:  Nature       Date:  2021-07-15       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.