| Literature DB >> 34222200 |
Mingyuan Xu1, Tong Zhu1,2, John Z H Zhang1,2,3,4.
Abstract
The development of accurate and efficient potential energy functions for the molecular dynamics simulation of metalloproteins has long been a great challenge for the theoretical chemistry community. An artificial neural network provides the possibility to develop potential energy functions with both the efficiency of the classical force fields and the accuracy of the quantum chemical methods. In this work, neural network potentials were automatically constructed by using the ESOINN-DP method for typical zinc proteins. For the four most common zinc coordination modes in proteins, the potential energy, atomic forces, and atomic charges predicted by neural network models show great agreement with quantum mechanics calculations and the neural network potential can maintain the coordination geometry correctly. In addition, MD simulation and energy optimization with the neural network potential can be readily used for structural refinement. The neural network potential is not limited by the function form and complex parameterization process, and important quantum effects such as polarization and charge transfer can be accurately considered. The algorithm proposed in this work can also be directly applied to proteins containing other metal ions.Entities:
Keywords: force field; metalloproteins; molecular dynamic simulation; neural network; zinc protein
Year: 2021 PMID: 34222200 PMCID: PMC8249736 DOI: 10.3389/fchem.2021.692200
Source DB: PubMed Journal: Front Chem ISSN: 2296-2646 Impact factor: 5.221
FIGURE 1Definition of the metal binding group in a CCCH-type zinc finger protein.
FIGURE 2Framework of the ESOINN-DP method.
FIGURE 3The training process of the NN potential.
The performance of NN potentials on four zinc-containing proteins with different coordination modes.
| MBG type | PDB ID | Number of subsets in the ESOINN layer | Training set/test set | |||
|---|---|---|---|---|---|---|
| Size | RMSE of E (kcal/mol) | RMSE of F (kcal/(mol·Å)) | RMSE of Q (e) | |||
| CCCC | 1ZIN | 7 | 11,900/1,200 | 1.43/1.29 | 1.53/1.43 | 0.04/0.05 |
| CCCH | 2L30 | 12 | 28,156/3,200 | 1.38/1.34 | 1.68/1.75 | 0.03/0.04 |
| CCHH | 1AAY | 14 | 45,328/5,100 | 1.78/1.64 | 1.41/1.52 | 0.02/0.03 |
| HHHO | 1HFS | 11 | 27,100/3,000 | 1.30/1.26 | 1.63/1.72 | 0.04/0.03 |
FIGURE 4The distribution of error indicator in the four representative systems. The unit of is
The averaged RESP charge of zinc ion and its ligated atoms in the MD simulation. The unit of charge is e.
| 1ZIN (CCCC) | Coordinated atoms | Zn2+ | Sγ@C5 | Sγ@C8 | Sγ@C25 | Sγ@C28 |
| NN/MM-RESP-MBG | 1.25 | −0.85 | −0.87 | −0.86 | −0.84 | |
| Amber | 2 | −0.88 | −0.88 | −0.88 | −0.88 | |
| 2L30 (CCCH) | Coordinated atoms | Zn2+ | Sγ@C5 | Sγ@C8 | Nδ@H37 | Sγ@C40 |
| NN/MM-RESP-MBG | 0.98 | −0.87 | −0.95 | −0.41 | −0.81 | |
| Amber | 2 | −0.88 | −0.88 | −0.57 | −0.88 | |
| 1AAY (CCHH) | Coordinated atoms | Zn2+ | Sγ@C5 | Sγ@C10 | Nε@H23 | Nε@H27 |
| NN/MM-RESP-MBG | 0.83 | −0.82 | −0.81 | −0.37 | −0.42 | |
| Amber | 2 | −0.88 | −0.88 | −0.57 | −0.57 | |
| 1HFS (HHHO) | Coordinated atoms | Zn2+ | Nε@H64 | Oδ1/Oδ2@D66 | Nε@H78 | Nδ@H92 |
| NN/MM-RESP-MBG | 0.98 | −0.51 | −0.64/−0.87 | −0.43 | −0.38 | |
| Amber | 2 | −0.57 | −0.88 | −0.57 | −0.57 |
FIGURE 5Typical structures of four different proteins with different MBGs after the MD simulations with NN/MM-RESP-MBG models. (A) CCCC type (PDB ID: 1ZIN), (B) CCHH type (PDB ID: 1AAY), (C) CCCH type (PDB ID: 2L30), and (D) HHHO (PDB ID: 1HFS).
Comparison of computed average distances and angles between zinc and its ligated atoms of different zinc proteins with experimental measurements, statistical values (Alberts et al., 1998), and results from MD simulations with the QPCT (Zhu et al., 2013) force field and QM/MM13 (all the bond lengths are in Angstroms and bond angles in degrees).
| PDB ID | Zinc-ligand geometry | PDB survey | NN/MM-RESP-MBG | X-ray | QPCT | QM/MM (50 ps) |
|---|---|---|---|---|---|---|
| 1ZIN (CCCC) | Zn-Sγ@C5 | 2.35 ± 0.09 | 2.41 | 2.33 | 2.37 ± 0.06 | 2.39 ± 0.08 |
| Zn-Sγ@C8 | 2.35 ± 0.09 | 2.39 | 2.3 | 2.36 ± 0.07 | 2.39 ± 0.09 | |
| Zn-Sγ@C25 | 2.35 ± 0.09 | 2.4 | 2.32 | 2.37 ± 0.06 | 2.42 ± 0.09 | |
| Zn-Sγ@C28 | 2.35 ± 0.09 | 2.41 | 2.33 | 2.36 ± 0.06 | 2.42 ± 0.08 | |
| ∠Sγ@C5-Zn-Sγ@C8 | 111 ± 8 | 107 | 114 | N/A | N/A | |
| ∠Sγ@C5-Zn-Sγ@C25 | 111 ± 8 | 116 | 106 | 114 ± 11 | 109 ± 6 | |
| ∠Sγ@C25-Zn-Sγ@C28 | 111 ± 8 | 111 | 112 | N/A | N/A | |
| 1AAY (CCHH) | Zn-Sγ@C5 | 2.35 ± 0.09 | 2.35 | 2.29 | 2.29 ± 0.07 | 2.32 ± 0.06 |
| Zn-Sγ@C10 | 2.35 ± 0.09 | 2.34 | 2.29 | 2.30 ± 0.08 | 2.34 ± 0.07 | |
| Zn-Nε@H23 | 2.05 ± 0.12 | 2.07 | 2.04 | 2.07 ± 0.12 | 2.12 ± 0.07 | |
| Zn-Nε@H27 | 2.05 ± 0.12 | 2.09 | 2.04 | 2.08 ± 0.12 | 2.13 ± 0.07 | |
| ∠Nε@H23-Zn-Nε@H27 | 107 ± 8 | 97 | 105 | 101 ± 13 | 99 ± 7 | |
| ∠Nε@H23-Zn-Sγ@C5 | 109 ± 8 | 108 | 109 | 110 ± 13 | 108 ± 7 | |
| ∠Sγ@C5-Zn-Sγ@C10 | 111 ± 8 | 116 | 113 | 114 ± 10 | 114 ± 6 | |
| 2L30 (CCCH) | Zn-Sγ@C5 | 2.35 ± 0.09 | 2.4 | 2.34 | 2.34 ± 0.07 | 2.34 ± 0.07 |
| Zn-Sγ@C8 | 2.35 ± 0.09 | 2.39 | 2.34 | 2.34 ± 0.08 | 2.34 ± 0.08 | |
| Zn-Nδ@H37 | 2.14 ± 0.09 | 2.15 | 2.01 | 2.17 ± 0.08 | 2.17 ± 0.08 | |
| Zn-Sγ@C40 | 2.35 ± 0.09 | 2.35 | 2.34 | 2.34 ± 0.07 | 2.31 ± 0.08 | |
| ∠Nδ@H37-Zn-Sγ@C40 | 109 ± 8 | 113 | 114 | 112 ± 7 | 108 ± 5 | |
| ∠Sγ@C5-Zn-Sγ@C8 | 111 ± 8 | 108 | 109 | N/A | N/A | |
| ∠Sγ@C8-Zn-Sγ@C40 | 111 ± 8 | 114 | 113 | 112 ± 7 | 114 ± 4 | |
| 1HFS (HHHO) | Zn-Nε@H64 | 2.05 ± 0.12 | 2.02 | 1.83 | 1.94 ± 0.05 | 2.01 ± 0.08 |
| Zn-Oδ2@D66 | 1.95 ± 0.08 | 1.97 | 2 | 1.97 ± 0.11 | 2.07 ± 0.09 | |
| Zn-Nε@H79 | 2.05 ± 0.12 | 2.01 | 1.78 | 1.93 ± 0.09 | 1.98 ± 0.09 | |
| Zn-Nδ@H93 | 2.14 ± 0.09 | 2.06 | 2.01 | 1.95 ± 0.10 | 2.03 ± 0.09 | |
| ∠Nε@H64-Zn-Oδ2@D66 | 107 ± 12 | 111 | 105 | 113 ± 6 | 107 ± 6 | |
| ∠Nε@H64-Zn-Nδ@H79 | 112 ± 7 | 116 | 119 | 108 ± 7 | 111 ± 7 | |
| ∠Nε@H79-Zn-Nδ@H93 | 112 ± 7 | 112 | 113 | N/A | N/A |
FIGURE 6Structure refinement with NN potential for an ill structure of the 2L30 protein. (A) The ill structure, (B) the refined structure, and (C) the time evolution of the Zn-S distance.
FIGURE 7The RMSD of the MBGs in ill and optimized structures of the 1ZIN protein. The MBG of the X-ray structure was taken as the reference.