Literature DB >> 29307140

Structural Analysis of Recombinant Human Preproinsulins by Structure Prediction, Molecular Dynamics, and Protein-Protein Docking.

Sung Hun Jung1,2, Chang-Kyu Kim3, Gunhee Lee4, Jonghwan Yoon4, Minho Lee5.   

Abstract

More effective production of human insulin is important, because insulin is the main medication that is used to treat multiple types of diabetes and because many people are suffering from diabetes. The current system of insulin production is based on recombinant DNA technology, and the expression vector is composed of a preproinsulin sequence that is a fused form of an artificial leader peptide and the native proinsulin. It has been reported that the sequence of the leader peptide affects the production of insulin. To analyze how the leader peptide affects the maturation of insulin structurally, we adapted several in silico simulations using 13 artificial proinsulin sequences. Three-dimensional structures of models were predicted and compared. Although their sequences had few differences, the predicted structures were somewhat different. The structures were refined by molecular dynamics simulation, and the energy of each model was estimated. Then, protein-protein docking between the models and trypsin was carried out to compare how efficiently the protease could access the cleavage sites of the proinsulin models. The results showed some concordance with experimental results that have been reported; so, we expect our analysis will be used to predict the optimized sequence of artificial proinsulin for more effective production.

Entities:  

Keywords:  leader peptide; molecular dynamics; preproinsulin; protein docking; structure prediction

Year:  2017        PMID: 29307140      PMCID: PMC5769858          DOI: 10.5808/GI.2017.15.4.142

Source DB:  PubMed          Journal:  Genomics Inform        ISSN: 1598-866X


Introduction

Human insulin, produced by beta-cells of the pancreatic islets, plays a critical role in the regulation of the metabolism of glucose [1]. Dysfunction of the synthesis or release of insulin may lead to diabetes mellitus [2]. Millions of people suffer from diabetes mellitus worldwide [3], and the most common medication to treat diabetes is insulin; so, a large number of studies on insulin have been done [4]. Insulin is first produced as an inactive protein, called preproinsulin. Preproinsulin, including a signal peptide, is a single, long protein. The chain evolves into proinsulin by cutting out the signal peptide. Then, proinsulin needs to be cleaved into insulin (an A chain and B chain) by removing the C-peptide linking the two chains [5]. Nowadays, recombinant DNA techniques enable us to produce insulin through biochemical processes using Escherichia coli [6]. However, the production of proinsulin by E. coli strains has to several drawbacks, such as low expression, difficulty in solubilizing the inclusion body, short half-life in the host cell, high proteolysis, and inefficient translation of the underlying coding sequences [7]. A new fusion protein system, which fuses an artificial leader peptide to the N-terminus of proinsulin, has been invented as a solution to these problems [8]. The specificity of cleavage and refolding rates are known to be dependent on the sequences of leader peptides, and different kinds of sequences have been proposed [6, 7, 9–11]. Thus, how the sequence of the artificial leader peptide affects the production of proinsulin and its modification into mature insulin needs to be investigated. In this work, we used several kinds of computational approaches to find a better leader peptide. First, we predicted the three-dimensional structures of fused proinsulins while changing the leader peptides. Although the 3D structure of proinsulin was determined recently [12] and although that of active insulin was determined long ago [13], the structure of fused proinsulin should be analyzed by prediction. Afterwards, the stabilities of the predicted structures were calculated by molecular dynamics (MD) simulation. MD is a method that simulates the movements of atoms and calculates their potential energies. Finally, the interaction of proinsulins with protease was evaluated by protein-protein docking. By comparing these structural analyses and experiments, we demonstrate that these structural analyses may contribute to determining whether a leader peptide results in efficient production of insulin.

Methods

Selection of artificial leader peptides

In previous work (patent WO 2004/044206 A1), a formula for constructing a leader peptide was proposed as follows: Met-Thr-Met-Ile-Thr···Lys(Arg). We selected 13 models, as shown in Table 1. Out of these 13 models, 5 have been used to generate experimental results in a previous work [7], and 7 are being introduced in this work.
Table 1

Sequences of 13 artificial leader peptides used in the structural simulation and refolding yields

ModelSequence of leader peptideRefolding yield (%)
Model 1MTMITNSPEISHHHHHHHHHHQLISEAR68.7
Model 2MTMITNSPEISHHHHHHHHHHQLISEAKNA
Model 3MTMITDSLAVVLQGSLQRNA
Model 4MTMITDSLAVVLQGSLQKNA
Model 5MTMITDSLAVVLQR62.7
Model 6MTMITDSLAVVLQKNA
Model 7MTMITDSLAR57.9
Model 8MTMITDSLAKNA
Model 9MTMITK54.0
Model 10MTMITR42.2
Model 11MKNA
Model 12MRNA
Model 13MNA

Refolding yields were retrieved from our previous data [7]. The values of 5 models (models 1, 5, 7, 9, and 11) are reported.

NA, not available.

Protein structure prediction

The full sequence (leader peptide of the model plus the native sequence of proinsulin) of each fused protein was used as an input for the structure prediction. Among various methods of 3D protein structure prediction, we used I-TASSER (Iterative Threading ASSEmbly Refinement) [14], which ranked as first in performance in recent community-based competitions [15, 16].

MD simulation

The predicted structures were located in a cubic box of water; then, 3-ns MD simulations were carried out using GROMACS, ver. 5.1.2 [17]. Amber99sb was chosen as the force field [18]. Energies of the models were minimized using the steepest descent algorithm for 3 ns. The step size was 0.002 ps. The properties and stabilities of the models were evaluated based on the potential energy, total energy, and root-mean-square deviation of the atomic positions.

Protein-protein docking

Protein-protein docking between trypsin (PDB ID: 2PTN) and recombinant proinsulins was performed using the InterEvDock server [19]. The InterEvDock server provides three kinds of scores: InterEvDock, FRODOCK [20], and SOAP_PP [21]. We chose the SOAP_PP score to evaluate the binding affinities, and docked poses were used to check whether two proteins were bound at the right position.

Results and Discussion

We predicted the 3D structures of 13 artificial models of proinsulin using I-TASSER (Fig. 1). As shown in Fig. 1, different leader peptides affected the overall structure of proinsulin. Especially, although the sequences of models 1 and 2 had only a 1-residue difference, the predicted positions of the leader peptides were quite different.
Fig. 1

Predicted structures of proinsulin models. Structures of leader peptide regions and proinsulins are red and green, respectively.

To refine the predicted structures, we observed the changes in structures by MD simulation at 3 ns to minimize energy. The structures after the MD simulation were superimposed onto the original structures by I-TASSER (Fig. 2). All potential energies of models were kept relatively stable (Fig. 3). The predicted structures were little refined; thus, they were naturally favorable. The total energy and potential energy of models 3 and 8 were the lowest. Among the models tested before (Table 1), no correlation between refolding yields and energies was observed.
Fig. 2

Structural comparison of before (cyan) and after (green) the molecular dynamics simulation.

Fig. 3

Energy changes during molecular dynamics simulation of proinsulin models. (A) Potential energy. (B) Total energy.

Preproinsulin should be refolded and cleaved to become mature insulin. A previous work showed that trypsin could be used as an efficient protease in the maturation of insulin [7]. To investigate how the leader peptide affects the binding mode between proinsulin and trypsin, in silico docking was carried out by InterEVDock [19]. Interestingly, in the docked structure (Fig. 4), the active sites (histidine 57, aspartate 102, and serine 195) of trypsin were located near lysine 64 and arginine 65 of proinsulin, which is the exact position where the first cleavage of proinsulin occurs to release chain A. Binding affinities were also predicted by InterEVDock (Table 2). Among all docked structures, models 1 and 2 were predicted to have the strongest binding affinities.
Fig. 4

Structures by docking between proinsulin models (red) and trypsin (green). Cleavage sites are in blue.

Table 2

Protein-protein docking energies

Model numberSOAP_PP score (kJ/mol)
Round 1Round 2
Model 1−12,021.42−10,875.04
Model 2−12,239.53−11,204.59
Model 3−11,580.68−10,881.19
Model 4−11,394.95−10,484.51
Model 5−11,280.89−10,628.80
Model 6−11,348.51−10,292.90
Model 7−11,596.85−10,383.14
Model 8−11,194.78−10,107.16
Model 9−10,933.44−10,304.09
Model 10−11,037.88−10,266.31
Model 11−10,690.47−9,966.84
Model 12−10,872.28−9,782.31
Model 13−10,614.70−9,815.33
Because multiple steps of cleavage are needed for maturation, we performed additional protein docking of trypsin to model structures whose C-terminal chain after residue 64 (A chain) was cleaved out. As a result of this step, the preferable position for the active sites of trypsin were located near arginine 31 and arginine 32, which are known to form a cleavage site between the B and C chains. The binding affinities of the models were predicted to be strong (in descending order): model 2, 3, and 1. The two docking steps for artificial proinsulin models into trypsin revealed leader peptides that did not affect the order of cleavage sites, and models 1 and 2 are likely to be best accessed by trypsin in producing mature insulin. This coincides with previously tested refolding yields [7]. In summary, using 13 artificial models of leader peptides of proinsulin, we predicted the structures of fused proinsulins, the structures were refined by MD simulation, and protein-protein docking revealed binding modes between the artificial models and trypsin. The energies of the predicted structures of the models were not related to refolding yields, but the docking energies between the protease and the models showed some relation. We expect these analyses to provide basic information in a structural context for more effective production of insulin.
  18 in total

1.  Comparison of multiple Amber force fields and development of improved protein backbone parameters.

Authors:  Viktor Hornak; Robert Abel; Asim Okur; Bentley Strockbine; Adrian Roitberg; Carlos Simmerling
Journal:  Proteins       Date:  2006-11-15

2.  Optimized atomic statistical potentials: assessment of protein interfaces and loops.

Authors:  Guang Qiang Dong; Hao Fan; Dina Schneidman-Duhovny; Ben Webb; Andrej Sali
Journal:  Bioinformatics       Date:  2013-09-27       Impact factor: 6.937

3.  Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.

Authors:  Chengxin Zhang; S M Mortuza; Baoji He; Yanting Wang; Yang Zhang
Journal:  Proteins       Date:  2017-11-14

4.  The global implications of diabetes and cancer.

Authors:  Yuankai Shi; Frank B Hu
Journal:  Lancet       Date:  2014-06-07       Impact factor: 79.321

5.  Discordant trends in microvascular complications in adolescents with type 1 diabetes from 1990 to 2002.

Authors:  Fauzia Mohsin; Maria E Craig; Janine Cusumano; Albert K F Chan; Stephen Hing; Jennifer W Lee; Martin Silink; Neville J Howard; Kim C Donaghue
Journal:  Diabetes Care       Date:  2005-08       Impact factor: 19.112

6.  I-TASSER: a unified platform for automated protein structure and function prediction.

Authors:  Ambrish Roy; Alper Kucukural; Yang Zhang
Journal:  Nat Protoc       Date:  2010-03-25       Impact factor: 13.491

7.  The structure of 2Zn pig insulin crystals at 1.5 A resolution.

Authors:  E N Baker; T L Blundell; J F Cutfield; S M Cutfield; E J Dodson; G G Dodson; D M Hodgkin; R E Hubbard; N W Isaacs; C D Reynolds
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1988-07-06       Impact factor: 6.237

8.  Expression and folding of an interleukin-2-proinsulin fusion protein and its conversion into insulin by a single step enzymatic removal of the C-peptide and the N-terminal fused sequence.

Authors:  L R Castellanos-Serra; E Hardy; R Ubieta; N S Vispo; C Fernandez; V Besada; V Falcon; M Gonzalez; A Santos; G Perez; A Silva; L Herrera
Journal:  FEBS Lett       Date:  1996-01-08       Impact factor: 4.124

9.  Renaturation of human proinsulin--a study on refolding and conversion to insulin.

Authors:  Jeannette Winter; Hauke Lilie; Rainer Rudolph
Journal:  Anal Biochem       Date:  2002-11-15       Impact factor: 3.365

10.  Solution structure of proinsulin: connecting domain flexibility and prohormone processing.

Authors:  Yanwu Yang; Qing-Xin Hua; Jin Liu; Eri H Shimizu; Meredith H Choquette; Robert B Mackin; Michael A Weiss
Journal:  J Biol Chem       Date:  2010-01-27       Impact factor: 5.157

View more
  1 in total

1.  Structural insights and molecular dynamics into the inhibitory mechanism of a Kunitz-type trypsin inhibitor from Tamarindus indica L.

Authors:  Amanda Fernandes de Medeiros; Beatriz Blenda Pinheiro de Souza; Lucas Pinheiro Coutinho; Aline Melro Murad; Paula Ivani Medeiros Dos Santos; Norberto de Kássio Vieira Monteiro; Elizeu Antunes Dos Santos; Bruna Leal Lima Maciel; Ana Heloneida de Araújo Morais
Journal:  J Enzyme Inhib Med Chem       Date:  2021-12       Impact factor: 5.051

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.