| Literature DB >> 31878333 |
Eman Alnabati1, Daisuke Kihara2,1.
Abstract
Cryo-electron microscopy (cryo-EM) has now become a widely used technique for structure determination of macromolecular complexes. For modeling molecular structures from density maps of different resolutions, many algorithms have been developed. These algorithms can be categorized into rigid fitting, flexible fitting, and de novo modeling methods. It is also observed that machine learning (ML) techniques have been increasingly applied following the rapid progress of the ML field. Here, we review these different categories of macromolecule structure modeling methods and discuss their advances over time.Entities:
Keywords: cryo-EM; cryo-electron microscopy; de novo modeling; density map; machine learning methods; protein modeling; structure fitting algorithms
Mesh:
Year: 2019 PMID: 31878333 PMCID: PMC6982917 DOI: 10.3390/molecules25010082
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1The number of rigid fitting, flexible fitting, and de novo modeling software published per year. The statistics are based on publication. The plot shows 28 rigid fitting methods [9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36], 33 flexible fitting methods [37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69], and 8 de novo modeling methods [70,71,72,73,74,75,76,77].
Strengths and limitations of de novo methods.
| Methods | Strengths | Limitations |
|---|---|---|
| EM-Fold [ |
Able to build 3-D structure models of α-helical proteins in intermediate resolution up to 9 Å One of the pioneers in de novo methods |
Models only α-helical proteins Density rods in density map are identified manually Uses external software (Rosetta) for building loops and side chains Code is not available |
| Gorgon [ |
Interactive software with visualization Tools for multiple steps for model structure building are provided Intended to work on maps of resolution up to 10 Å |
Generates Cα-only models Human interaction is needed Due to the intended resolution, no atom level refinement provided |
| Rosetta [ |
Part of the Rosetta package, which has many tools for structure modeling Good local structure quality Able to handle both α-helices and β-strands Generates full-atom models |
Depends on fragments retrieved from a database Has difficulty to model β-sheets Model quality deteriorates for maps at 4.5–5 Å or worse |
| Pathwalking [ |
Part of the EMAN2 cryo-EM modeling package Able to trace the backbone of multi-subunit complexes |
Does not assign sequence to the Cα backbone models Generated models are not ranked |
| Phenix [ |
Part of the Phenix structure modeling package Models proteins, RNA, and DNA Generates full-atom models Tested on 476 EM maps in their paper |
Model quality deteriorates for maps at 4.5–5 Å or worse |
| MAINMAST [ |
Generates full-atom models Does not depend on any reference structures or fragments Provides many models with a confidence score |
Uses external software (MDFF, Rosetta, Phenix) to refine models Model quality deteriorates for maps at 4.5–5 Å or worse |
Figure 2Schematic flow diagram of the MAINMAST algorithm. The cryo-EM density map shown on the right is of structural protein 5 of cytoplasmic polyhedrosis virus solved at a 2.9 Å resolution (EMD-6374). This figure was adapted from the MAINMAST paper [76].
Figure 3Emap2sec applied on two experimental maps. Density maps and their fitted protein structures are shown on the left and the secondary structure detection by Emap2sec is shown on the right. Spheres in magenta, yellow, and green show detected α-helices, β-strands, and other structures, respectively. (a) Archaeal 20S proteasome (EMD-1733 of resolution 6.8 Å; PDB 3C91). (b) Eschirichia coli replicative DNA polymerase complex (EMD-3201 of resolution 8.34 Å; PDB 5FKU). This figure was adapted from the EMap2sec paper [101].
Strengths and limitations of SSE detection methods.
| Methods | Strengths | Limitations |
|---|---|---|
| RENNSH [ |
Tested on simulated maps at 6, 8, and 10 Å, as well as experimental maps of resolutions 3.8, 6.8, and 8 Å (Algorithm: nested K-nearest neighbors classifiers) |
Detects only α-helices Does not build an atomic model of predicted α-helices Limited testing on experimental maps |
| SSELearner [ |
Identifies both α-helices and β-strands (Algorithm: Support Vector Machines) Tested on simulated maps at 8 Å and experimental maps of resolutions (3.8–9 Å) |
Does not place secondary structure elements in the density map Does not detect loops |
| CNN by Li. et al. [ |
Identifies both α-helices and β-strands (Algorithm: pioneer in using CNN in SSE detection) |
Not tested on experimental maps Does not place SSE structures in the density map |
| Emap2sec [ |
Identifies three structure classes: α-helices, β-strands, and loops (Algorithm: CNN) Tested on both simulated and experimental maps of resolution up to 10 Å Code is available |
Does not place SSE structures in the density map |