| Literature DB >> 32240155 |
Yazan Haddad1,2, Vojtech Adam1,2, Zbynek Heger1,2.
Abstract
The purpose of this quick guide is to help new modelers who have little or no background in comparative modeling yet are keen to produce high-resolution protein 3D structures for their study by following systematic good modeling practices, using affordable personal computers or online computational resources. Through the available experimental 3D-structure repositories, the modeler should be able to access and use the atomic coordinates for building homology models. We also aim to provide the modeler with a rationale behind making a simple list of atomic coordinates suitable for computational analysis abiding to principles of physics (e.g., molecular mechanics). Keeping that objective in mind, these quick tips cover the process of homology modeling and some postmodeling computations such as molecular docking and molecular dynamics (MD). A brief section was left for modeling nonprotein molecules, and a short case study of homology modeling is discussed.Entities:
Year: 2020 PMID: 32240155 PMCID: PMC7117658 DOI: 10.1371/journal.pcbi.1007449
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Homology-modeling workflow.
Numbers show the steps where tips are applicable.
The protonation states of polar and charged amino acids.
| Amino acid | Charge | Three-letter code | Comment | Common choice for physiological pH |
|---|---|---|---|---|
| Arginine (ARG) | Positive | ARG | Protonated | Yes |
| Neutral | ARN | |||
| Aspartic acid (ASP) | Negative | ASP | Yes | |
| Neutral | ASH | Protonated | ||
| Cysteine (CYS) | Neutral | CYS | Protonated | Yes |
| Bridge | CYS or CYS2 | In disulfide bridge | ||
| Glutamic acid (GLU) | Negative | GLU | Yes | |
| Neutral | GLH | Protonated | ||
| Histidine (HIS) | Neutral | HIE | Protonated on Nε | Yes |
| Neutral | HID | Protonated on Nδ | ||
| Positive | HIP | Protonated on Nδ and Nε | ||
| Lysine (LYS) | Positive | LYS | Protonated | Yes |
| Neutral | LYN |
Common biomolecular FFs.
| Force Fields | Example | Reference |
|---|---|---|
| AMBER | FF99SB: Primarily for proteins | [ |
| CHARMM | C36: Proteins, nucleic acids, and lipids | [ |
| GROMOS | 54A7: Proteins and nucleic acids | [ |
| OPLS | OPLS-AA: for organic molecules and peptides | [ |
| Open FF | Smirnoff99Frosst: Open force field compatible to AMBER | [ |
AMBER, Assisted Model Building with Energy Refinement; CHARMM, Chemistry at Harvard Macromolecular Mechanics; FF, force field; GROMOS, GROningen Molecular Simulation; OPLS, Optimized Potentials for Liquid Simulations
Fig 2Case study: Homology modeling of the MAM1 domain in ALK receptor.
(A) Alignment of MAM1 target sequence with potential templates. (B) Homology model built using SwissModel server using the 2v5y.1.A template 3D structure. QMEAN local estimates are shown in colored ribbon (worst in orange to best in blue). (C) Sequence–structure alignment of target template, showing secondary structures calculated via DSSP method and the quality QMEAN local scores in colored bars. (D) Ramachandran outliers for general, glycine, and proline residues. Representative Pro15 and Pro31 residues are shown. The Ramachandran plot shows the values of dihedral angles Φ and Ψ while green contours highlight the optimal expected values. (E) Ab initio prediction via the PEP-FOLD3 server for the region 35 to 52 (ASQMDLLDGPGAERSKEM) showing two α-helices with a GPGA linker. Since this region is relatively large and loop modeling often predicts loop secondary structures, we recommend using an ab initio approach as an additional source for homology modeling (as a template) for more accurate prediction. (F) Local QMEAN estimates before and after manual refinement. The major drops in quality before the refinements were corrected with selective optimization. The lowest QMEAN achieved was −2.44. At this stage, three remaining drops in regions 17 to 19, 37 to 40, and 106 to 112 were not correctable by optimization. ALK, anaplastic lymphoma kinase; DSSP, Dictionary of Secondary Structure of Proteins; QMEAN, Qualitative Model Energy Analysis.