Literature DB >> 21769194

Structure prediction and functional characterization of secondary metabolite proteins of Ocimum.

Sudeep Roy1, Nidhi Maheshwari, Rashi Chauhan, Naresh Kumar Sen, Ashok Sharma.   

Abstract

Various species of Ocimum have acquired special attention due to their medicinal properties. Different parts of the plant (root, stem, flower, leaves) are used in the treatment of a wide range of disorders from centuries. Experimental structures (X-ray and NMR) of proteins from different Ocimum species, are not yet available in the Protein Databank (PDB). These proteins play a key role in various metabolic pathways in Ocimum. 3D structures of the proteins are essential to determine most of their functions. Homology modeling approach was employed in order to derive structures for these proteins. A program meant for comparative modeling- Modeller 9v7 was utilized for the purpose. The modeled proteins were further validated by Prochek and Verify-3d and Errat servers. Amino acid composition and polarity of these proteins was determined by CLC-Protein Workbench tool. Expasy's Prot-param server and Cys_rec tool were used for physico-chemical and functional characterization of these proteins. Studies of secondary structure of these proteins were carried out by computational program, Profunc. Swiss-pdb viewer was used to visualize and analyze these homology derived structures. The structures are finally submitted in Protein Model Database, PMDB so that they become accessible to other users for further studies.

Entities:  

Keywords:  CLC protein work bench; Homology modeling; Ocimum; Secondary structure prediction; Swiss-PDB Viewer

Year:  2011        PMID: 21769194      PMCID: PMC3134781          DOI: 10.6026/97320630006315

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Botanically, basil belongs to the genus Ocimum of the family Lamiaceae. More than 160 species of Ocimum are reported from different parts of the world. Different parts (roots, stem, leaves, seeds and flowers) of Ocimum have been used for treatment of variety of diseases such as bronchitis, malaria, diarrhea, dysentery, skin diseases, arthritis etc. Ocimum sp. contains monoterpene derivatives such as camphor, limonene, thymol, citral, geraniol and linalool. A detailed analysis of protein sequences from Ocimum, their probable structures and mode of action are yet to be accomplished. Plants synthesize chemicals in their leaves in order to protect themselves from herbivores. One such class of defense compounds that has been used extensively by humans are members of phenylpropenoid class namely eugenol, chavicol and their derivatives. It has been reported that in basil glands, two closely related (90% identical) enzymes chavicol o-methyltransferase (CVOMT) and eugenol o-methyltransferase (EOMT) catalyze the formation of methylchavicol and methyleugenol from chavicol and eugenol respectively [1]. The enzymes are involved in aroma production in basil. From an evolutionary perspective plant and microbial PALs (phenylalanine ammonia lyase) are part of superfamily of enzymes from plants, fungi and bacteria, and are likely derived from a precursor of the widespread histidine ammonia lyase (HAL) family in the histidine degradation pathway [2]. PAL catalyses the non-oxidative deamination of phenylalanine to trans-cinnamate and directs the carbon flow from the shikimate pathway to the various branches of the general phenylpropanoid metabolism. Lipooxygenase (Fatty-acid metabolism) is one of the most widely studied enzymes found in more than 60 species of plant and animal kingdom. The enzyme catalyses the biooxygenation of polyunsaturated fatty acids (PUFA) containing a cis, cis-1, 4-pentadiene unit to form conjugated hydroperoxydienoic acids. Lipoxygenase has considerable application in food related products such as in bread making. The enzyme plays a significant role in formation of secondary metabolites in sweet basil. Enzymatic browning of fruits and vegetables is caused mainly by the conversion of native phenolic compounds to quinones which are then polymerized to brown, red or black pigments imparting colour to various plant parts. The enzymes responsible for catalyzing this sequence of reactions are termed as polyphenol oxidases, but are also known as tyrosinases, catecholases, cresolases and phenolases [3]. Because of deleterious effect of enzymatic browning on fruits and vegetables much work is devoted as to retard or at least delay the browning process. Polyphenol oxidase being the causative agent responsible for browning is exploited for the purpose. The enzyme is involved in iso-quinoline alkaloid biosynthesis and in biosynthesis of other secondary metabolites. In order to understand biochemical function and interaction properties of the protein at molecular level, three dimensional structure of protein is foremost requirement. However, the number of available protein sequences exceeds far behind the available three dimensional protein structures. In order to compensate this, homology modeling approach came into being. These methods are believed to be cost-effective and time-effective when compared to X-rays crystallography and NMR techniques. Computational methods make use of hidden information inside amino acid sequences in order to predict protein structure and function. In the present study, In silico analysis and homology modeling studies on uncharacterized proteins in different species of Ocimum like O. basilicum, O. tenuiflorum, O. citriodorum, O. seloi, O. gratissimum and O. americanum whose structures are not yet available in PDB have been accomplished.

Materials and Methodology

The amino acid sequences of secondary metabolite proteins of Ocimum whose structures are not yet available in RCSB Protein Databank (PDB) were retrieved from SWISSPROT, a public resource of curated protein sequences [4] and subjected to NCBI BLAST [5]. Based on high score, lower e-value and maximum sequence identity, the best template was selected which was then used as reference structure to build a 3D model. Template and target proteins considered for the study have been shown in (see Table 1).

Model building and evaluation

The three dimensional structures of proteins were modeled using Modeler 9v8 [6]. Quality of generated models was evaluated with PROCHECK [7] by Ramachandran plot analysis [8]. Stereochemical quality and accuracy of the selected models was further improved by subjecting it to energy minimization with the GROMOS 96 43B1 parameters set, implementation of Swiss-PDB Viewer [9]. Validation of generated models was further performed by VERIFY 3D [10] and ERRAT [11] programs. ProSA [12] was used for the analysis of Zscores and energy plots. The three dimensional structures of modeled proteins were analyzed using Deep View Swiss PDB viewer. Root Mean Square Deviation (RMSD) values were calculated between the set of targets and template protein to see how much modeled protein deviates from the template protein structure.

Computation of amino acid composition

Amino acid composition (see Table 2) of Ocimum proteins under study was calculated using CLC protein workbench tool (www.clcbio.com/protein). The tool also provides estimation of percentage of hydrophobic and hydrophilic residues present in the protein (see Table 3).

Physiochemical characterization

For physiochemical characterization, theoretical pI (isoelectric point), molecular weight, ­R and +R (total number of positive and negative residues), EI (extinction coefficient) [13], II (instability index [14]) [15], AI (aliphatic index) and GRAVY (grand average hydropathy) [16] were computed using the Expasy's ProtParam server [17] for set of proteins (http://us.expasy.org/tools/protparam.html). The results are shown in (see Table 4)

Functional characterization

CYS_REC (http://sunl.softberry.com/berry.phtml? topic) was used to locate “SS bond” between the pair of cystein residues, if present. The tool yields position of cysteins, total number of cysteins present and pattern, if present, of pairs in the protein sequence as output. All the Ocimum proteins under study showed absence of disulphide bonds. The results are presented in Table 5 (see Supplementary material).

Secondary structure prediction

Profunc [18] was employed for calculating the secondary structural features of Ocimum protein sequences. The results are presented in Table 6(see Supplementary material).

Submission of the modeled proteins in protein model database (PMDB)

The models generated for various Ocimum proteins were successfully submitted in Protein model database, PMDB [19] without any stereochemical errors. The submitted models can be accessed via their PMIDs (see Table 7 Supplementary material).

Results and Discussion

As experimental structures of some of the important secondary metabolite proteins of Ocimum are not available, homology modeling approach was used in order to derive their structures.

Model building, refinement and evaluation

PROCHECK analysis

Ramachandran plot for Chavicol O-methyltransferase (D3KYA1) has been illustrated in Figure 1. Altogether more than 90% of the residues were found to be in favoured and allowed regions, which validate the quality of homology models. The overall G-factor for D3KYA1 was ­0.19. As the value is greater than the acceptable value ­0.50, this suggests that the modeled structure is acceptable. The modeled structures were also validated by other structure verification servers such as Verify 3D and Errat. Verify 3D assigned a 3D-1D score of >0.2 for all the modeled proteins. This implies that the models are compatible with its sequence. ERRAT showed overall quality factor of 49.62 for D3KYA1. The plot generated by Verify-3D and Errat for Chavicol omethyltransferase has been illustrated in Figure 2A & 2B.
Figure 1

Ramachandran plot of chavicol obtained by PROCHECK. 91.2% residues in favourable regions; 8.0% residues in additional residue regions; 0.0% residues in generously regions; 0.9% residues in disallowed regions; Over all G-factor: 0.00.

Figure 2

(A) Verify-3D plot, (B) Errat plot

PROSA analysis

The z-score for all the modeled proteins was found to be within the range of scores typically found for native proteins of similar size showing good quality of the model. Energy Plot for chavicol o-methyltransferase (D3KYA1) with chain length (257 AA) and z-score (­7.26) is presented in Figure 3A & 3B.
Figure 3

(A) Prosa-web z-score plot, (B) Prosa-web plot of residue scores

Swiss-PDB viewer analysis of predicted model

Visualization and analysis of the model using Swiss-PDB reveals that there are no steric hindrances between the residues and thus modeled structures are stable. Structure-structure superimposition was done in order to calculate Root Mean Square Deviation (RMSD) between the target and template sequence. RMSD values for D3KYA1 were found to be 0.94. This implies good quality of the modeled structures. Figure 4 represents modeled structure of Chavicol o-methyltransferase.
Figure 4

Modeled structure of Chavicol o-methyltransferase as viewed by Swiss-PDB viewer

Physiochemical characterization

The physiochemical parameters viz., theoretical isoelectric point (Ip), molecular weight, total number of positive and negative residues, extinction coefficient, half-life, instability index, aliphatic index and grand average hydropathy (GRAVY) were computed using the Expasy's ProtParam tool (Table 4). The computed pI value for A8D7D8, B2ZA17, B6VQV5, B6VQV6, D3KYA1 (pI<7) indicated their acidic nature, whereas pI for A8D6D7, B2ZA12, B2ZA16 (pI>7) revealed there basic behaviour. The computed isoelectric point (pI) will be useful for developing buffer system for purification by isoelectric focusing method. Extinction coefficient values for Ocimum proteins at 280 nm ranged from 1490 to 50795 M-1cm-1 for B6VQV6 and D3KYA1 indicating the presence of higher concentration of Tyr and Trp. Cys was very low in concentration in all the eight Ocimum proteins studied. This indicates that these proteins cannot be analyzed using UV spectral methods. On the basis of instability index Expasy's ProtParam classified the B2ZA17 (Eugenol o-methyltransferase), A8D7D8 (Lipoxygenase), B2ZA12 (Eugenol o-methyltransferase) and B2ZA16 (Eugenol o-methyltransferase) proteins as unstable (Instability index>40) and other Ocimum proteins as stable (Instability index<40). The aliphatic index (AI) which is defined as the relative volume of a protein occupied by aliphatic side chain is regarded as the positive factor for the increase of thermal stability of globular proteins. The very high aliphatic index of all Ocimum proteins infers that these proteins may be stable for a wide range of temperature. The very low GRAVY index of proteins B6VQV6 and D3KYA1 infers that these proteins could result in a better interaction with water.

Functional characterization

The result of primary analysis suggests that all the Ocimum proteins under study were hydrophobic in nature due to the presence of high non-polar residues content (Table 2 & 3). As percentage of Cysteine(C) is very low in all the Ocimum proteins under study (Table 2), none of these proteins have disulphide bond linkages, as indicated by CYS_REC result (Table 4). The extensive hydrogen bonding may provide stability to these proteins in absence of disulphide bonds. Proteins B2ZA12, A8D6D7 and B6VQV5 have high percentage of methionine(M), alanine(A), leucine(L) and lysine(K). As these amino acids have high helix-forming propensities, alpha helix are dominant in these proteins. This is also evident from analysis of PROFUNC result (Table 6). Rest of the Ocimum proteins had mixed secondary structures i.e. alphahelices, beta-strands and coils. All the proteins showed high percentage of glycine and proline (Table 2). As these amino acids are common in turns, other secondary structures such as Beta turns and Gamma turns are dominant in these proteins (Table 6).

Submission of modeled proteins in PMDB

The modeled structures of proteins from various species of Ocimum were successfully deposited in Protein Model Database (PMDB). The PMDB ID for the submitted structures has been presented in (see Table 7). These 3D structures may be further used in characterizing the protein experimentally.

Conclusion

In this study proteins from various species of Ocimum were modeled using homology modeling approach. Different parameters such as isoelectric point, molecular weight, total number of positive and negative residues, extinction coefficient, instability index, aliphatic index and grand average hydropathy (GRAVY) were computed for these proteins in order to determine their physiochemical characteristics. All the proteins were found to be deficient in amino acid cystein, and therefore lack presence of disulphide linkages as also inferred from analysis of cys_rec result. In the absence of disulphide bond, extensive hydrogen bonding is believed to be responsible for stability of these proteins. Polarity studies using CLC protein work bench tool confirmed all the studied proteins to be hydrophobic in nature. This may be due to the presence of a large number of non-polar residues. Secondary structure studies showed that all the studied proteins contain high proportion of other secondary structures ie. Beta-turns and Gamma-turns. This is attributed to the presence of higher concentration of proline and glycine residues. The modeled structures can be accessed through protein model database PMDB via there PMID's. Homology derived models are extensively used in wide range of applications such as virtual screening, site-directed mutagenesis experiments or in rationalizing the effects of sequence variation. These structures will serve as cornerstone for functional analysis of experimentally derived crystal structures.
  17 in total

Review 1.  Protein identification and analysis tools in the ExPASy server.

Authors:  M R Wilkins; E Gasteiger; A Bairoch; J C Sanchez; K L Williams; R D Appel; D F Hochstrasser
Journal:  Methods Mol Biol       Date:  1999

2.  Swiss-PDB Viewer (Deep View).

Authors:  W Kaplan; T G Littlejohn
Journal:  Brief Bioinform       Date:  2001-05       Impact factor: 11.622

3.  An active site homology model of phenylalanine ammonia-lyase from Petroselinum crispum.

Authors:  Dagmar Röther; László Poppe; Gaby Morlock; Sandra Viergutz; János Rétey
Journal:  Eur J Biochem       Date:  2002-06

4.  Stereochemistry of polypeptide chain configurations.

Authors:  G N RAMACHANDRAN; C RAMAKRISHNAN; V SASISEKHARAN
Journal:  J Mol Biol       Date:  1963-07       Impact factor: 5.469

5.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

6.  Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence.

Authors:  K Guruprasad; B V Reddy; M W Pandit
Journal:  Protein Eng       Date:  1990-12

7.  VERIFY3D: assessment of protein models with three-dimensional profiles.

Authors:  D Eisenberg; R Lüthy; J U Bowie
Journal:  Methods Enzymol       Date:  1997       Impact factor: 1.600

8.  Biosynthesis of t-anethole in anise: characterization of t-anol/isoeugenol synthase and an O-methyltransferase specific for a C7-C8 propenyl side chain.

Authors:  Takao Koeduka; Thomas J Baiga; Joseph P Noel; Eran Pichersky
Journal:  Plant Physiol       Date:  2008-11-05       Impact factor: 8.340

9.  The PMDB Protein Model Database.

Authors:  Tiziana Castrignanò; Paolo D'Onorio De Meo; Domenico Cozzetto; Ivano Giuseppe Talamo; Anna Tramontano
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  ProFunc: a server for predicting protein function from 3D structure.

Authors:  Roman A Laskowski; James D Watson; Janet M Thornton
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  13 in total

1.  Molecular Cloning, Purification and Characterization of Mce1R of Mycobacterium tuberculosis.

Authors:  Dipanwita Maity; Rajasekhara Reddy Katreddy; Amitava Bandhu
Journal:  Mol Biotechnol       Date:  2021-01-09       Impact factor: 2.695

2.  In silico functional and structural characterization of hepatitis B virus PreS/S-gene in Iranian patients infected with chronic hepatitis B virus genotype D.

Authors:  Nastaran Khodadad; Seyed Saeed Seyedian; Afagh Moattari; Somayeh Biparva Haghighi; Roya Pirmoradi; Samaneh Abbasi; Manoochehr Makvandi
Journal:  Heliyon       Date:  2020-07-15

3.  Transcriptome mining and in silico structural and functional analysis of ascorbic acid and tartaric acid biosynthesis pathway enzymes in rose-scanted geranium.

Authors:  Lokesh K Narnoliya; Rajender S Sangwan; Sudhir P Singh
Journal:  Mol Biol Rep       Date:  2018-03-15       Impact factor: 2.316

4.  Bioinformatics approaches for structural and functional analysis of proteins in secondary metabolism in Withania somnifera.

Authors:  Swati Singh; Ashok Sharma
Journal:  Mol Biol Rep       Date:  2014-08-02       Impact factor: 2.316

5.  Comprehensive assessment of the genes involved in withanolide biosynthesis from Withania somnifera: chemotype-specific and elicitor-responsive expression.

Authors:  Aditya Vikram Agarwal; Parul Gupta; Deeksha Singh; Yogeshwar Vikram Dhar; Deepak Chandra; Prabodh Kumar Trivedi
Journal:  Funct Integr Genomics       Date:  2017-03-11       Impact factor: 3.410

6.  Insights using the molecular model of Lipoxygenase from Finger millet (Eleusine coracana (L.)).

Authors:  Apoorv Tiwari; Himanshu Avashthi; Richa Jha; Ambuj Srivastava; Vijay Kumar Garg; Pramod Wasudev Ramteke; Anil Kumar
Journal:  Bioinformation       Date:  2016-06-15

7.  Computational genome-wide identification of heat shock protein genes in the bovine genome.

Authors:  Oyeyemi O Ajayi; Sunday O Peters; Marcos De Donato; Sunday O Sowande; Fidalis D N Mujibi; Olanrewaju B Morenikeji; Bolaji N Thomas; Matthew A Adeleke; Ikhide G Imumorin
Journal:  F1000Res       Date:  2018-09-20

8.  Comparative analysis of zinc finger proteins involved in plant disease resistance.

Authors:  Santosh Kumar Gupta; Amit Kumar Rai; Shamsher Singh Kanwar; Tilak R Sharma
Journal:  PLoS One       Date:  2012-08-15       Impact factor: 3.240

9.  Three-Dimensional Molecular Modeling of a Diverse Range of SC Clan Serine Proteases.

Authors:  Aparna Laskar; Aniruddha Chatterjee; Somnath Chatterjee; Euan J Rodger
Journal:  Mol Biol Int       Date:  2012-11-19

10.  Structure predictions of two Bauhinia variegata lectins reveal patterns of C-terminal properties in single chain legume lectins.

Authors:  Gustavo M S G Moreira; Fabricio R Conceição; Alan J A McBride; Luciano da S Pinto
Journal:  PLoS One       Date:  2013-11-19       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.