| Literature DB >> 29700357 |
Hans Denis Bamal1, Wanping Chen2, Samson Sitheni Mashele1, David R Nelson3, Abidemi Paul Kappo4, Rebamang Anthony Mosa4, Jae-Hyuk Yu5, Jack A Tuszynski6,7, Khajamohiddin Syed8.
Abstract
Phylogenetic and structural analysis of P450 proteins fused to peroxidase/dioxygenase has not been reported yet. We present phylogenetic and in silico structural analysis of the novel P450 fusion family CYP5619 from the deadliest fish pathogenic oomycete, Saprolegnia diclina. Data-mining and annotation of CYP5619 members revealed their unique presence in oomycetes. CYP5619 members have the highest number of conserved amino acids among eukaryotic P450s. The highest number of conserved amino acids (78%) occurred in the peroxidase/dioxygenase domain compared to the P450 domain (22%). In silico structural analysis using a high-quality CYP5619A1 model revealed that CYP5619A1 has characteristic P450 structural motifs including EXXR and CXG. However, the heme-binding domain (CXG) in CYP5619 members was found to be highly degenerated. The in silico substrate binding pattern revealed that CYP5619A1 have a high affinity to medium chain fatty acids. Interestingly, the controlling agent of S. diclina malachite green was predicted to have the highest binding affinity, along with linoleic acid. However, unlike fatty acids, none of the active site amino acids formed hydrogen bonds with malachite green. The study's results will pave the way for assessing CYP5619A1's role in S. diclina physiology, including the nature of malachite green binding.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29700357 PMCID: PMC5919972 DOI: 10.1038/s41598-018-25044-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Evolutionary analysis of CYP5619 family members and their homolog P450s. Thirty P450s were used in the analysis. The P450 motif sequences used for phylogenetic analysis are presented in Table S3. The S. cerevisiae CYP51 P450 sequence was used as an out-group.
Comparative analysis of CYP5619 family members and their homolog P450s.
| Species name | Taxonomic group | No. of CYP5619 P450s | CYP5619 subfamilies | Homolog P450 families | |||||
|---|---|---|---|---|---|---|---|---|---|
| A | B | C | D | F | G | ||||
|
|
| 8 | 1 | 2 | 2 | 3 | CYP5852A1 | ||
|
|
| 1 | 1 | CYP5851A1 and CYP5851A2 | |||||
|
|
| 2 | 1 | 1 | |||||
|
|
| 1 | 1 | ||||||
|
| 5 | 1 | 2 | 1 | 1 | CYP5852B1 | |||
|
| 6 | 1 | 2 | 1 | 2 | ||||
|
|
| 0 | CYP5853A1v1 and CYP5853A1v2 | ||||||
Comparative amino acid conservation analysis of CYP5619 family with top 10 ranked P450 families[12].
| P450 family | Number of member P450s | Kingdom | PROMALS3D conservation index | Rank (highest to lowest conservation) | ||||
|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | ||||
| CYP141 | 29 | Bacteria | 0 | 0 | 0 | 0 | 389 | 1 |
| CYP51 | 50 | Bacteria | 11 | 102 | 0 | 0 | 264 | 2 |
| CYP137 | 38 | Bacteria | 145 | 0 | 0 | 0 | 251 | 3 |
| CYP121 | 34 | Bacteria | 0 | 0 | 0 | 0 | 233 | 4 |
| CYP132 | 39 | Bacteria | 175 | 0 | 0 | 0 | 217 | 5 |
|
|
|
|
|
|
|
|
|
|
| CYP124 | 71 | Bacteria | 52 | 35 | 59 | 0 | 170 | 7 (formerly 6) |
| CYP188 | 67 | Bacteria | 62 | 0 | 100 | 0 | 141 | 8 (formerly 7) |
| CYP123 | 74 | Bacteria | 62 | 0 | 82 | 0 | 137 | 9 (formerly 8) |
| CYP108 | 67 | Bacteria | 52 | 12 | 92 | 0 | 134 | 10 (formerly 9) |
| CYP126 | 78 | Bacteria | 65 | 16 | 98 | 0 | 132 | 11 (formerly 10) |
The conservation index score is obtained as described in the section on methods, following the procedure documented in the literature[28]. The conservation score (5–9) obtained via PROMALS3D is shown in the table where the number “9” indicates conserved amino acids in P450 members. P450 families were arranged in order of the highest to the lowest number of amino acids conserved.
Figure 2Analysis of amino acid patterns at EXXR and CXG motif in CYP5619 P450 family. Twenty-three CYP5619 P450 sequences were used to generate WebLogos. The EXXR and CXG sequences used to generate WebLogos are presented in Table S4.
Figure 3Sequence alignment of CYP5619A1 with template CYP120A1 (PDB ID: 2VE3). Helices are represented by coils and β-sheets are shown as arrows. The P450 consensus motifs EXXR and CXG are highlighted in yellow. Columns with residues that are more than 70% similar according to physico-chemical properties (threshold set to 0.7) are framed in red. The figure was rendered by ESPript 3.0[57].
Figure 4In silico structural analysis of CYP5619A1. (A) 3D model of CYP5619A1 with heme cofactor. Secondary structures are displayed in red (helices), yellow (sheets) and blue (coils and turns). (B) Comparative structural analysis of CYP5619A1 model with CYP120A1. Superimposed image of CYP5619A1 model (blue) with CYP102A1 crystal structure (orange) is shown in the Figure. The CYP5619A1 structure is shown in blue and the template CYP120A1 structure is shown in orange.
Figure 5Active site analysis of CYP5619A1. (A) Active site cavity of CYP5619A1. The active site cavity is shown with the substrate access channel in grey (hydrophobic site) and red (hydrophilic site) surface. (B & C) Active site view of the binding pocket of CYP5619A1. (B) The pocket is displayed with MOE ActiveLP color coding (Blue: Mild polar; Green: Hydrophobic; Pink: H-Bonding) and shows a pattern of high hydrophobicity. (C) Residues forming the pocket are labelled. The amino acids lining the active site cavity are shown in Table S5. Secondary structures are displayed in red (helices), yellow (sheets) and blue (coils and turns). The heme prosthetic group appearing at the center of the active site is shown along with iron atom in a ball shape.
Figure 6Analysis of fatty acids (A–K) and malachite green (L) binding with CYP5619A1 from S. diclina. Fatty acids used in this study are (A) myristic acid, (B) palmitic acid, (C) stearic acid, (D) icosanoic acid, (E) myristoleic acid, (F) palmitoleic acid (G) oleic acid, (H) linoleic acid (I) alpha-linolenic acid, (J) arachidonic acid and (K) eicosapentaenoic acid. The heme prosthetic group is displayed in red at the center of the active site. The ligands are displayed in blue sticks. Secondary structures surrounding the active site are shown in white and the receptor’s surface is displayed as a white mesh.
Figure 7Graphic representation of the free binding energies of the docked possible substrates and malachite green. Abbreviations: MYR: myristic acid; PLM: palmitic acid; STE: stearic acid; DCR: icosanoic acid; MYZ: myristoleic acid; PAM: palmitoleic acid; OLA: oleic acid; EIC: linoleic acid; LNL: alpha-linolenic acid; ACD: arachidonic acid; EPA: eicosapentaenoic acid; MGR: malachite green.
Amino acid residues interacting with the different ligands.
| Ligand code | Interacting residues |
|---|---|
| MYR | Arg162 (2HB), Cys228, Met229, Gly232, Pro297, His299, Met300, HEM413 |
| PLM | Leu61, Leu65, Met158, Arg162 (2HB), Cys228, His225, Met229, Pro297, His299, Tyr301, HEM413 |
| STE | Leu61, Leu65, Arg162 (2HB), Cys228, Met229, Gly232, Gly233,Trp237, Pro297, Tyr301, HEM413 |
| DCR | Leu61, Arg162 (1HB), Met229, Gly232, Gly233, Trp237, Pro297, His299, Thr407, HEM413 |
| MYZ | Pro74, Arg162 (2HB), His225, Cys228, Met229, Gly232, Gly233, Pro297, HEM413 |
| PAM | Leu69, Pro74, Arg162 (2HB), His225, Cys228, Met229, Gly232, Gly233, Pro297, HEM413 |
| OLA | Leu61, Leu65, Arg162 (2HB), His225, Cys228, Met229, Gly232, Gly233, Pro297, His299, Met300, Tyr301, HEM413 |
| EIC | Leu69, Leu161, Arg162 (2HB), His225, Cys228, Met229, Gly232, Gly233, Trp237, Pro297, Leu408, HEM413 |
| LNL | Leu61, Arg162 (2HB), Cys228, Met229, Gly232, Gly233, Trp237, Pro297, His299, Met300, Tyr301, HEM413 |
| ACD | Tyr9, Arg14 (1HB), Leu61, Met229, Gly233, Trp237, Pro297, His299, HEM413 |
| EPA | Leu61, Leu65, Pro74, Arg162 (1HB), His225, Cys228, Met229, Gly232, Gly233, Pro297, Thr407, HEM413 |
| MGR | Leu61, Leu65, Pro74, Met229, Gly232, Gly233, Trp237, Pro297, His299, Tyr301, HEM413 |