| Literature DB >> 34199677 |
Donghyuk Suh1, Jai Woo Lee1, Sun Choi1, Yoonji Lee2.
Abstract
The new advances in deep learning methods have influenced many aspects of scientific research, including the study of the protein system. The prediction of proteins' 3D structural components is now heavily dependent on machine learning techniques that interpret how protein sequences and their homology govern the inter-residue contacts and structural organization. Especially, methods employing deep neural networks have had a significant impact on recent CASP13 and CASP14 competition. Here, we explore the recent applications of deep learning methods in the protein structure prediction area. We also look at the potential opportunities for deep learning methods to identify unknown protein structures and functions to be discovered and help guide drug-target interactions. Although significant problems still need to be addressed, we expect these techniques in the near future to play crucial roles in protein structural bioinformatics as well as in drug discovery.Entities:
Keywords: 3D structure of proteins; deep learning; drug discovery; protein sequence homology; structural bioinformatics
Mesh:
Year: 2021 PMID: 34199677 PMCID: PMC8199773 DOI: 10.3390/ijms22116032
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Overview of deep learning (DL) architectures frequently used for protein structure prediction.
List of methods for protein structure predictions. The entries consist of the most recent versions of the series.
| Method/Server | Target a | Topology | Evolutionary | Site |
|---|---|---|---|---|
| JPred | 1D—SS, SA | FFNN | PSI-BLAST |
|
| SSpro | 1D—SS, SA(ACCpro) | BRNN–CNN | PSI-BLAST |
|
| DISSPred | 1D—SS, TA | SVM | PSI-BLAST |
|
| SPIDER3 | 1D—SS, SA, TA, CN | BLSTM | PSI-BLAST |
|
| ProteinUnet | 1D—SS, SA, TA, CN | CNN | None |
|
| NetSurfP-2.0 | 1D—SS, SA, TA, DR | BLSTM | HHBlits |
|
| IUPred | 1D—DR | Regression | None |
|
| PSIPRED | 1D—SS(PSIPRED), DR(DISOPRED3) | FFNN | PSI-BLAST |
|
| SPOT | 1D—SS, SA, TA, CN(SPOT-1D), DR(SPOT-Disorder) | Residual CNN BLSTM | PSI-BLAST |
|
| Distill(Brewery) | 1D—SS(Porter), LM(Porter+), SA(PaleAle), CN(BrownAle) | BRNN–CNN | PSI-BLAST |
|
| RaptorX | 1D—SS, SA, DR(RaptorX-Property) | CNF | PSI-BLAST |
|
| MULTICOM | 2D—CM(DNCON2) | CNN | PSI-BLAST |
|
| TripletRes | 2D—CM | Residual CNN | HHblits |
|
| DeepContact | 2D—CM | Residual CNN | HHblits |
|
| DeepCov | 2D—CM | CNN | HHblits |
|
| Pconsc4 | 2D—CM | CNN | HHblits |
|
| DeepCDPred | 2D—MCM | FFNN | HHblits |
|
| Alphafold2 | 2D—MCM | Residual CNN | PSI-BLAST | Alphafold: |
| Rosetta Suite | 2D—MCM(trRosetta) | Residual CNN | PSI-BLAST |
|
| EVfold | 3D—TS | FFNN | HHblits |
|
| DESTINI | 3D—TS | Residual CNN |
| |
| ThreaderAI | 3D—TS | Residual CNN | HHblits |
|
| NEST | 3D—TS | FFNN | PSI-BLAST |
|
| C-I-TASSER | 3D—TS | Residual CNN | PSI-BLAST |
|
a The following abbreviations are used for targets: secondary structure (SS), solvent accessibility (SA), torsional angle (TA), contact number/density (CN), disordered region (DR), contact map (CM), multi-state contact map (MCM), and tertiary structure (TS).
Figure 2Contact map of the exemplary small protein Crambin (UniProt id: P01542). (left) Part of the multiple sequence alignment of the homologous proteins of Crambin. (middle, lower half) Contact map and the corresponding 3D model predicted by RaptorX. A contact is defined by Cβ-Cβ distance ≤ 8 Å. Darker color indicates a higher probability. (middle, upper half) Distance map based on the 3D experimental structure of the protein. The map was visualized using the VMD plugin. The Cα–Cα distance for each pair is plotted and colored black at 0.0 Å distance, to a linear grayscale between 0.0 and 10.0 Å, and white when equal to or greater than 10.0 Å. (right) X-ray crystal structure of Crambin (PDB id: 4fc1) overlaid with the contact (Cα-Cα, cutoff distance 8 Å) marked in gray dashed lines.