| Literature DB >> 34599769 |
John Jumper1, Richard Evans1, Alexander Pritzel1, Tim Green1, Michael Figurnov1, Olaf Ronneberger1, Kathryn Tunyasuvunakool1, Russ Bates1, Augustin Žídek1, Anna Potapenko1, Alex Bridgland1, Clemens Meyer1, Simon A A Kohl1, Andrew J Ballard1, Andrew Cowie1, Bernardino Romera-Paredes1, Stanislav Nikolov1, Rishub Jain1, Jonas Adler1, Trevor Back1, Stig Petersen1, David Reiman1, Ellen Clancy1, Michal Zielinski1, Martin Steinegger2,3, Michalina Pacholska1, Tamas Berghammer1, David Silver1, Oriol Vinyals1, Andrew W Senior1, Koray Kavukcuoglu1, Pushmeet Kohli1, Demis Hassabis1.
Abstract
We describe the operation and improvement of AlphaFold, the system that was entered by the team AlphaFold2 to the "human" category in the 14th Critical Assessment of Protein Structure Prediction (CASP14). The AlphaFold system entered in CASP14 is entirely different to the one entered in CASP13. It used a novel end-to-end deep neural network trained to produce protein structures from amino acid sequence, multiple sequence alignments, and homologous proteins. In the assessors' ranking by summed z scores (>2.0), AlphaFold scored 244.0 compared to 90.8 by the next best group. The predictions made by AlphaFold had a median domain GDT_TS of 92.4; this is the first time that this level of average accuracy has been achieved during CASP, especially on the more difficult Free Modeling targets, and represents a significant improvement in the state of the art in protein structure prediction. We reported how AlphaFold was run as a human team during CASP14 and improved such that it now achieves an equivalent level of performance without intervention, opening the door to highly accurate large-scale structure prediction.Entities:
Keywords: AlphaFold; CASP; deep learning; machine learning; protein structure prediction
Mesh:
Substances:
Year: 2021 PMID: 34599769 PMCID: PMC9299164 DOI: 10.1002/prot.26257
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
FIGURE 1Examples of visualizations used in the prediction checking Colaboratory notebook, shown with CASP target T1101
FIGURE 2T1024: (A) Per‐residue lDDT‐Cα and pLDDT of T1024. Vertical gray shading indicates residues missing in the experimental structure, and colored shading indicates minimum and maximum values over five predictions. The pLDDT shows low confidence in the linker region indicating possible flexibility and qualitatively agrees with the true per‐residue lDDT‐Cα. (B) Unrealized distances in the expected distances of T1024 indicating possible alternate relative conformations of the two domains
FIGURE 3T1044: Comparison of the number of effective alignments (Neff) per residue for each MSA, derived both from domain sequences and from cropping the full sequence MSA. Four domains (T1033, T1039, T1040, and T1043) substantially benefit from using the full sequence MSA. The dashed green line shows the approximate 30 alignment threshold considered sufficient for a good prediction with AlphaFold
T1044: (A) Confidence scores (pLDDT) for different prediction systems that were considered and (B) accuracy (GDT_TS) for predictions of domains in T1044
| (A) Confidence (pLDDT) | |||||
|---|---|---|---|---|---|
| Original domain | Original full sequence | Crop‐then‐fold domain | Submitted full sequence | Final system full sequence | |
| T1031 | 80.6 | 64.4 | 84.1 | 71.9 | 69.4 |
| T1033 | 54.7 | 69.0 | 85.0 | 73.9 | 77.2 |
| T1035 | 81.7 | 78.8 | 83.2 | 77.9 | 82.6 |
| T1037 | 83.9 | 78.7 | 89.7 | 77.7 | 82.8 |
| T1039 | 75.0 | 68.2 | 89.3 | 67.9 | 71.6 |
| T1040 | 73.1 | 51.6 | 87.2 | 82.0 | 74.7 |
| T1041 | 83.7 | 78.4 | 86.6 | 77.8 | 80.0 |
| T1042 | 79.7 | 68.1 | 81.5 | 80.6 | 73.5 |
| T1043 | 37.5 | 47.4 | 79.7 | 82.8 | 64.1 |
| Average | 72.2 | 67.2 | 85.2 | 76.9 | 75.1 |
| Full sequence | N/A | 71.3 | N/A | 77.1 | 77.4 |
T1044: Confidence scores (pLDDT) for different prediction systems that were considered. The mean full‐sequence pLDDT over a given domain cannot be directly compared to the mean pLDDT found just by folding that domain, as pLDDT will consider the effect of mispredicting inter‐domain distances as well as intra‐domain distances, which penalizes longer predictions. However, it can be seen that using “crop‐then‐fold” led to an improvement, often substantial, in confidence across all domains. The full sequence confidences of predictions made with the submitted (template‐patched) system were also superior to the original system. The final improved system gives an equivalent level of confidence to the submitted prediction.
Accuracy (GDT_TS) for predictions of domains in T1044. It can be seen that, using the original system, T1033 was predicted more accurately as part of the full chain, but T1040 was predicted more accurately when folded as an independent domain. Both crop‐then‐fold domains and submitted (template‐patched) full‐sequence predictions get the best of both worlds and give better mean domain accuracy. The final system gets equivalent performance with no complex interventions, and better chain‐level TM score.
FIGURE 4Comparison of three different prediction methods for the targets with significant interventions: “Original system” is the automated prediction system as it existed at target release. “Submitted prediction” is the submitted structure prediction. “Final system” is the automated system as it existed at the end of the CASP14 assessment, improved by experience
FIGURE 5All models produced for T1064–pLDDT versus final lDDT‐Cα. The strong correlation indicates that ranking many predictions by pLDDT was a successful strategy for this target