Literature DB >> 35622853

Whole-proteome structures shed new light on posttranslational modifications.

Robbie P Joosten1, Jon Agirre2.   

Abstract

Accurate but protein-only AlphaFold models may show structural fingerprints of likely posttranslational modifications (PTMs). In this issue of PLOS Biology, Bludau and colleagues add a functional context to models by combining them with readily available proteomics results.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35622853      PMCID: PMC9182549          DOI: 10.1371/journal.pbio.3001673

Source DB:  PubMed          Journal:  PLoS Biol        ISSN: 1544-9173            Impact factor:   9.593


The recent artificial intelligence revolution in protein structure prediction, spearheaded by DeepMind’s AlphaFold [1] and swiftly seized upon by RoseTTAFold [2], is allowing scientists to arrive at an accurate structural model of a protein, or at least parts of it, in a matter of hours. This already diminutive lead time can be further compressed to mere seconds if the protein of interest is found in the complete set of proteins expressed by an organism (proteome) in the list of the ever expanding set of organisms covered by the AlphaFold Protein Structure Database (AFDB). The AFDB, released in 2021 and subsequently updated [3], is expected to cover the 100 million set of sequences in the proteomes available at UniRef90 [4]. It offers immediate access to predicted models of human proteins, alongside reliable estimates of their accuracy in the form of 2 metrics: pLDDT (per-residue confidence) and PAE (positional alignment error of each residue with respect to the rest). Structures with a consistently high pLDDT and very low PAE are expected to show an accuracy on par with experimentally determined protein models. Human proteins are obvious targets for therapeutics; however, their function and structure are, more often than not, modulated or regulated by co- and posttranslational (covalent) modifications, plus ligands and cofactors (noncovalent). Those important moieties, not currently targeted by the AlphaFold algorithm, are conspicuously absent from predicted structures [5]: As an example, many more than half of all human proteins are expected to include either protein glycosylation [6], phosphorylation [7], or both. Thus, the analysis of AlphaFold structures of modified proteins can produce misleading results [5]. Recent studies have suggested that most predicted models are accurate enough to include space for the absent modifications, ligands, and cofactors to be added postprediction [5,8]. Importantly, these endeavours can only be as successful as our ability to pinpoint their occurrence and location on a protein’s structure. In a slightly different case, transplanting likely ligands (e.g., a heme group onto hemoglobin or a polysaccharide onto a glycoside hydrolase) onto AlphaFold models by homology with experimental structure models becomes increasingly error-prone when the homology becomes more distant. In the case that homology is absent altogether, transferable knowledge from experimental structure models is absent as well, and this process becomes a speculative docking experiment. In the absence of experimental structural models, the extensive proteomics datasets available today can provide information on co- and posttranslational modifications (PTMs) on the respective target proteins [9]. Furthermore, the covalent transference of modifications onto protein often follows a consensus sequence—e.g., N-glycosylation on Asn-X-Ser/Thr where X is any amino acid other than proline; these consensus sequences are variably well studied across modifications. Crucially, mapping proteomics and bioinformatics information onto AlphaFold 3D models may allow us to not just complete models, but to learn more about the structural fingerprints left by modifications: the structure of their protein scaffold and their environment. In this issue, Bludau and colleagues [10] discuss the first results from the implementation of such an approach, targeting different modification types including phosphorylation, ubiquitination, and more. Not all PTMs are made equal: They may play different roles depending on whether they are buried or exposed to solvent (Fig 1), added to a correctly folded region, a misfolded region, or to an intrinsically disordered one. On that last point, the synergy with AlphaFold brings another important contribution to the table: Because AlphaFold has been trained on data from the structured parts of ordered proteins—a precondition for atomic positions to be well resolved in both X-ray crystallography and electron cryo-microscopy, the 2 main techniques contributing structures to the Protein Data Bank (PDB)—there is a good correlation between intrinsic disorder and low prediction confidence as measured by AlphaFold;s pLDDT [1]. Bludau and colleagues [10] use this knowledge to select PTMs that are enriched for having regulatory functions. These regulatory modification sites show a preference for short intrinsically disordered regions such as the activation loops in protein kinases. In addition, the authors use AlphaFold models to show that different regulatory modification sites have a strong tendency to flock together in 3D and not just in sequence space, hinting at coregulation or even cross talk between different types of PTMs [10].
Fig 1

Different sequence profiles in exposed and solvent-excluded phosphosites (STY) as recognised by different kinases.

The availability of accurate 3D models now allows for this direct mapping of sequence profiles onto structures and allows estimating their solvent accessibility. Extracted from Fig 3d of Bludau and colleagues [10].

Different sequence profiles in exposed and solvent-excluded phosphosites (STY) as recognised by different kinases.

The availability of accurate 3D models now allows for this direct mapping of sequence profiles onto structures and allows estimating their solvent accessibility. Extracted from Fig 3d of Bludau and colleagues [10]. The work, as one of the first systematic analyses of the functional importance of PTMs, lays an important foundation for new experimental studies targeting PTMs in specific proteins. The authors provide software tools to shortlist the modification sites of regulatory importance, thereby allowing more focused experimental studies. Importantly, the software—for which source code is available from the “structuremap” and “alphamap” repositories at https://github.com/MannLabs—will also enable richer annotation of PTMs on AlphaFold entries. To this end, we think the results from Bludau and colleagues [10] would make a worthy contribution to the recently introduced 3D-Beacons database, which aims to become a reference point for structural knowledge (https://www.ebi.ac.uk/pdbe/pdbe-kb/3dbeacons).
  9 in total

1.  Ultradeep human phosphoproteome reveals a distinct regulatory nature of Tyr and Ser/Thr-based signaling.

Authors:  Kirti Sharma; Rochelle C J D'Souza; Stefka Tyanova; Christoph Schaab; Jacek R Wiśniewski; Jürgen Cox; Matthias Mann
Journal:  Cell Rep       Date:  2014-08-21       Impact factor: 9.423

2.  The case for post-predictional modifications in the AlphaFold Protein Structure Database.

Authors:  Haroldas Bagdonas; Carl A Fogarty; Elisa Fadda; Jon Agirre
Journal:  Nat Struct Mol Biol       Date:  2021-11       Impact factor: 15.369

3.  The structural context of posttranslational modifications at a proteome-wide scale.

Authors:  Isabell Bludau; Sander Willems; Wen-Feng Zeng; Maximilian T Strauss; Fynn M Hansen; Maria C Tanzer; Ozge Karayel; Brenda A Schulman; Matthias Mann
Journal:  PLoS Biol       Date:  2022-05-16       Impact factor: 9.593

Review 4.  Determination of glycosylation sites and site-specific heterogeneity in glycoproteins.

Authors:  Hyun Joo An; John W Froehlich; Carlito B Lebrilla
Journal:  Curr Opin Chem Biol       Date:  2009-08-21       Impact factor: 8.822

5.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.

Authors:  Baris E Suzek; Yuqi Wang; Hongzhan Huang; Peter B McGarvey; Cathy H Wu
Journal:  Bioinformatics       Date:  2014-11-13       Impact factor: 6.937

6.  The functional landscape of the human phosphoproteome.

Authors:  David Ochoa; Andrew F Jarnuczak; Cristina Viéitez; Maja Gehre; Margaret Soucheray; André Mateus; Askar A Kleefeldt; Anthony Hill; Luz Garcia-Alonso; Frank Stein; Nevan J Krogan; Mikhail M Savitski; Danielle L Swaney; Juan A Vizcaíno; Kyung-Min Noh; Pedro Beltrao
Journal:  Nat Biotechnol       Date:  2019-12-09       Impact factor: 54.908

7.  Accurate prediction of protein structures and interactions using a three-track neural network.

Authors:  Minkyung Baek; Frank DiMaio; Ivan Anishchenko; Justas Dauparas; Sergey Ovchinnikov; Gyu Rie Lee; Jue Wang; Qian Cong; Lisa N Kinch; R Dustin Schaeffer; Claudia Millán; Hahnbeom Park; Carson Adams; Caleb R Glassman; Andy DeGiovanni; Jose H Pereira; Andria V Rodrigues; Alberdina A van Dijk; Ana C Ebrecht; Diederik J Opperman; Theo Sagmeister; Christoph Buhlheller; Tea Pavkov-Keller; Manoj K Rathinaswamy; Udit Dalwadi; Calvin K Yip; John E Burke; K Christopher Garcia; Nick V Grishin; Paul D Adams; Randy J Read; David Baker
Journal:  Science       Date:  2021-07-15       Impact factor: 47.728

8.  Highly accurate protein structure prediction for the human proteome.

Authors:  John Jumper; Demis Hassabis; Kathryn Tunyasuvunakool; Jonas Adler; Zachary Wu; Tim Green; Michal Zielinski; Augustin Žídek; Alex Bridgland; Andrew Cowie; Clemens Meyer; Agata Laydon; Sameer Velankar; Gerard J Kleywegt; Alex Bateman; Richard Evans; Alexander Pritzel; Michael Figurnov; Olaf Ronneberger; Russ Bates; Simon A A Kohl; Anna Potapenko; Andrew J Ballard; Bernardino Romera-Paredes; Stanislav Nikolov; Rishub Jain; Ellen Clancy; David Reiman; Stig Petersen; Andrew W Senior; Koray Kavukcuoglu; Ewan Birney; Pushmeet Kohli
Journal:  Nature       Date:  2021-07-22       Impact factor: 69.504

9.  Highly accurate protein structure prediction with AlphaFold.

Authors:  John Jumper; Richard Evans; Alexander Pritzel; Tim Green; Michael Figurnov; Olaf Ronneberger; Kathryn Tunyasuvunakool; Russ Bates; Augustin Žídek; Anna Potapenko; Alex Bridgland; Clemens Meyer; Simon A A Kohl; Andrew J Ballard; Andrew Cowie; Bernardino Romera-Paredes; Stanislav Nikolov; Rishub Jain; Demis Hassabis; Jonas Adler; Trevor Back; Stig Petersen; David Reiman; Ellen Clancy; Michal Zielinski; Martin Steinegger; Michalina Pacholska; Tamas Berghammer; Sebastian Bodenstein; David Silver; Oriol Vinyals; Andrew W Senior; Koray Kavukcuoglu; Pushmeet Kohli
Journal:  Nature       Date:  2021-07-15       Impact factor: 49.962

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.