| Literature DB >> 24743800 |
Khajamohiddin Syed1, Samson Sitheni Mashele1.
Abstract
Cytochrome P450 monooxygenases (P450s) are heme-thiolate proteins distributed across the biological kingdoms. P450s are catalytically versatile and play key roles in organisms primary and secondary metabolism. Identification of P450s across the biological kingdoms depends largely on the identification of two P450 signature motifs, EXXR and CXG, in the protein sequence. Once a putative protein has been identified as P450, it will be assigned to a family and subfamily based on the criteria that P450s within a family share more than 40% homology and members of subfamilies share more than 55% homology. However, to date, no evidence has been presented that can distinguish members of a P450 family. Here, for the first time we report the identification of EXXR- and CXG-motifs-based amino acid patterns that are characteristic of the P450 family. Analysis of P450 signature motifs in the under-explored fungal P450s from four different phyla, ascomycota, basidiomycota, zygomycota and chytridiomycota, indicated that the EXXR motif is highly variable and the CXG motif is somewhat variable. The amino acids threonine and leucine are preferred as second and third amino acids in the EXXR motif and proline and glycine are preferred as second and third amino acids in the CXG motif in fungal P450s. Analysis of 67 P450 families from biological kingdoms such as plants, animals, bacteria and fungi showed conservation of a set of amino acid patterns characteristic of a particular P450 family in EXXR and CXG motifs. This suggests that during the divergence of P450 families from a common ancestor these amino acids patterns evolve and are retained in each P450 family as a signature of that family. The role of amino acid patterns characteristic of a P450 family in the structural and/or functional aspects of members of the P450 family is a topic for future research.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24743800 PMCID: PMC3990721 DOI: 10.1371/journal.pone.0095616
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparative analysis of member P450s in selected P450 families across 21 basidiomycete species.
| CYP63 | CYP512 | CYP5035 | CYP5037 | CYP5136 | CYP5141 | CYP5144 | CYP5150 | CYP5139 | CYP5152 | References | |
|
| 7 | 14 | 13 | 5 | 5 | 8 | 35 | 7 | 1 | 2 | 34, 56 |
|
| 5 | 15 | 3 | 14 | 0 | 4 | 4 | 32 | 8 | 2 | 36 |
|
| 9 | 14 | 14 | 5 | 5 | 6 | 48 | 10 | 8 | 4 | 37 |
|
| 5 | 16 | 5 | 5 | 6 | 8 | 54 | 15 | 2 | 3 | 35 |
|
| 13 | 10 | 9 | 15 | 12 | 2 | 48 | 8 | 14 | 6 | 40 |
|
| 5 | 18 | 13 | 5 | 9 | 2 | 4 | 32 | 5 | 1 | 35 |
|
| 13 | 21 | 15 | 6 | 7 | 2 | 3 | 36 | 7 | 1 | 38 |
|
| 6 | 5 | 10 | 9 | 2 | 5 | 8 | 18 | 4 | 2 | 35 |
|
| 6 | 12 | 0 | 3 | 0 | 7 | 14 | 12 | 4 | 0 | 24 and this work |
|
| 7 | 11 | 0 | 0 | 5 | 5 | 25 | 2 | 6 | 11 | 24 and this work |
|
| 10 | 10 | 3 | 30 | 4 | 2 | 38 | 24 | 13 | 1 | This work |
|
| 5 | 8 | 6 | 10 | 3 | 3 | 42 | 39 | 12 | 2 | This work |
|
| 8 | 19 | 2 | 37 | 2 | 3 | 9 | 28 | 10 | 1 | This work |
|
| 7 | 14 | 4 | 27 | 0 | 15 | 55 | 6 | 20 | 0 | This work |
|
| 7 | 6 | 1 | 6 | 1 | 1 | 57 | none | 22 | 23 | This work |
|
| 0 | 6 | 1 | 10 | 1 | 2 | 13 | 0 | 12 | 2 | This work |
|
| 6 | 14 | 12 | 7 | 5 | 3 | 41 | 21 | 8 | 1 | This work |
|
| 3 | 13 | 9 | 14 | 2 | 4 | 11 | 9 | 5 | 1 | This work |
|
| 5 | 14 | 3 | 30 | 3 | 2 | 7 | 28 | 11 | 2 | This work |
|
| 3 | 8 | 1 | 17 | 0 | 0 | 14 | 6 | 10 | 1 | This work |
|
| 5 | 4 | 4 | 12 | 0 | 2 | 21 | 15 | 13 | 3 | This work |
Member P450s belonging to different P450 families in basidiomycetes were collected using three methods. (i) P450s belonging to different families were retrieved from published data bases (indicated with reference number in the table). (ii) P450 members belonging to different families were identified by being annotated and assigned to families and subfamilies as described in the “Materials and methods” section (indicated by “this work” in the table). (iii) Some basidiomycete P450s were refined from the published literature (indicated with reference and “this work” in the table). Members of P450 families that are absent in different basidiomycete species were indicated with “0″ in the table.
Figure 1Comparative analysis of amino acid patterns in P450 signature motif EXXR in fungal P450s.
A total of 4304 P450s from the fungal phyla ascomycota (1336 P450s), basidiomycota (2859 P450s), zygomycota (102 P450s) and chytridiomycota (7 P450s) were used for the analysis. A total of 128 amino acid patterns for the EXXR motif were identified. The predominant amino acid patterns (>2%) are shown in the pie charts. Amino acid patterns are shown in the figure along with the number of amino acids representing the pattern and overall percentage of that pattern. A detailed list of amino acid patterns occurring at P450 signature motifs is shown in Table S3.
Analysis of P450 signature motifs EXXR and CXG in fungal P450s.
| EXXR | CXG | ||||||||||
| Second position | Third position | Second position | Third position | ||||||||
| Appear | No. ofP450s | percentage | Appear | No. ofP450s | percentage | Appear | No. ofP450s | percentage | Appear | No. ofP450s | percentage |
| T | 1658 | 38.52 | L | 2811 | 65.31 | P | 1678 | 38.99 | G | 4199 | 97.56 |
| V | 767 | 17.82 | M | 416 | 9.67 | I | 1238 | 28.76 | A | 97 | 2.25 |
| S | 628 | 14.59 | Q | 239 | 5.55 | L | 719 | 16.71 | D | 2 | 0.05 |
| A | 490 | 11.38 | Y | 187 | 4.34 | V | 325 | 7.55 | K | 2 | 0.05 |
| G | 244 | 5.67 | F | 181 | 4.21 | A | 131 | 3.04 | E | 1 | 0.02 |
| L | 161 | 3.74 | I | 128 | 2.97 | S | 57 | 1.32 | P | 1 | 0.02 |
| C | 153 | 3.55 | H | 94 | 2.18 | T | 33 | 0.77 | Q | 1 | 0.02 |
| I | 134 | 3.11 | V | 60 | 1.39 | Q | 32 | 0.74 | S | 1 | 0.02 |
| N | 35 | 0.81 | S | 42 | 0.98 | M | 28 | 0.65 | R | 0 | 0 |
| M | 21 | 0.49 | A | 36 | 0.84 | G | 21 | 0.49 | I | 0 | 0 |
| F | 6 | 0.14 | E | 25 | 0.58 | K | 11 | 0.26 | T | 0 | 0 |
| E | 4 | 0.09 | T | 24 | 0.56 | F | 9 | 0.21 | V | 0 | 0 |
| P | 1 | 0.02 | G | 22 | 0.51 | E | 6 | 0.14 | C | 0 | 0 |
| Q | 1 | 0.02 | C | 15 | 0.35 | W | 4 | 0.09 | F | 0 | 0 |
| W | 1 | 0.02 | N | 9 | 0.21 | Y | 4 | 0.09 | H | 0 | 0 |
| D | 0 | 0 | W | 6 | 0.14 | H | 3 | 0.07 | L | 0 | 0 |
| Y | 0 | 0 | R | 5 | 0.12 | R | 3 | 0.07 | M | 0 | 0 |
| H | 0 | 0 | P | 2 | 0.05 | C | 1 | 0.02 | N | 0 | 0 |
| R | 0 | 0 | D | 1 | 0.02 | N | 1 | 0.02 | W | 0 | 0 |
| K | 0 | 0 | K | 1 | 0.02 | D | 0 | 0 | Y | 0 | 0 |
Comparative analysis of 4304 P450s from fungal phyla ascomycota, basidiomycota, zygomycota and chytridiomycota revealed amino acids that are part of the fungal P450 signature motifs. Amino acids appearing in the P450 signature motifs were represented from top to bottom with their frequencies at each position.
Figure 2Comparative analysis of amino acid patterns at the P450 signature motif CXG in fungal P450s.
In total 4304 P450s from the fungal phyla ascomycota (1336 P450s), basidiomycota (2859 P450s), zygomycota (102 P450s) and chytridiomycota (seven P450s) were used for the analysis. As shown in the figures, 32 amino acid patterns were identified in fungal P450 for the CXG motif. The numbers on the bars represent the number of amino acids representing the pattern. The overall percentage of that pattern is shown in parenthesis.
Figure 3Analysis of amino acid patterns at EXXR motif in 12 fungal P450 families.
A sequence logo for the EXXR motif using the amino acids from 12 fungal P450 families was constructed, as described in the “Materials and methods” section. The number of P450s used for the construction of the sequence logo is shown in the parenthesis right next to the name of the P450 family.
Figure 4Analysis of amino acid patterns at CXG motif in 12 fungal P450 families.
Sequence logo for the CXG motif (FXXGXRXCXG) using the amino acids from 12 fungal P450 families were constructed as described in the “Materials and methods” section. The number of P450s used for construction of the sequence logo is shown in the parenthesis right next to the name of the P450 family.
Comparative quantitative analysis of amino acid patterns at EXXR and CXG (FXXGXRXCXG) motifs in 12 fungal P450 families.
| P450 family | Number ofMember P450s | E-X-X-R | F-X-X-G-X-R-X-C-X-G |
| CYP52 | 93 | E-T(42)/V(24)-L(99)-R | F-N(85)-G(83)-G-P-R-I(79)-C-L(90)-G |
| CYP61 | 70 | E-T(26)/S(24)-L(90)-R | F(96)-G-S(34)/T(29)-G-P(91)-V(53)-C-L(61)-G(93) |
| CYP63 | 133 | E-T(74)-L(92)-R | F-N(78)-A(89)-G-P(98)-R(99)-I(87)-C-L(86)-G |
| CYP512 | 247 | E-S(62)-Q(53)-R | F(94)-G(99)-H(69)-G(99)-K(47)-H(87)-A(88)-C-P(95)-G |
| CYP5035 | 129 | E-T(54)-L(78)-R | F(98)-S(93)-F(31)/Y(29)-G(99)-P(94)-A(39)/M(23)-N(85)-V(81)-G(99) |
| CYP5037 | 261 | E-V(56)-L(48)/Y(20)-R | F(96)-G(99)-F(79)-G(99)-R(99.6)-I(38)/V(28)-C-P(90)-G(98) |
| CYP5136 | 68 | E-T(63)-L(61)-R | F-S(74)-A(76)-G-V(44)/L(19)-R(79)-G(49)/A(41)-C-I(79)-G |
| CYP5139 | 181 | E-T(62)-L(84)-R | F-L(40)/I(25)-G(62)-G-P(83)-R(83)-A(49)/S(16)-C-I(73)-G |
| CYP5141 | 86 | E-T(92)-L(95)-R | F-S(89)-A(35)/G(33)-G-P(47)/A(37)-R(93)-A(36)/G(30)-C-L(47)/I(45)-G |
| CYP5144 | 514 | E-V(29)/T(26)-L(58)-R(99.6) | F(97)-G(99)-F(68)-G(99)-R(99)-R(99.8)-I(65)-C-P(88)-G(99.8) |
| CYP5150 | 336 | E-T(94)-L(97)-R(99) | F(99)-L(49)/I(18)-G(77)-G(99)-G(40)/K(19)-R(85)-A(68)-C-I(90)-G |
| CYP5152 | 66 | E-V(30)/A(14)-L(38)/F(30)-R | F(59)-G-F-G-R(98)-R-V(66)-C-P(77)-G |
The percentage predominance of amino acids at particular positions is calculated considering the total number of amino acids as 100%. Amino acids or patterns of amino acids contributing more than 50% at the specific position are shown in the table. Amino acids conserved (100%) at the specific position(s) are represented by their symbol. The numerical values in the table are percentage values.
Comparative quantitative analysis of amino acid patterns at EXXR and CXG (FXXGXRXCXG) motifs in 54 P450 families.
| P450family | Number ofmemberP450s | E-X-X-R | F-X-X-G-X-R-X-C-X-G |
| CYP1 | 288 | E-T(36)/I(24)-F(66)-R | F(99.6)-G(72)-L(38)/M(38)-G(98)-K(89)-R-R(75)-C-I(89)-G |
| CYP2 | 1244 | E-I(54)/V(40)-Q(93)-R | F(99)-S(93)-A(47)/I(19)/L(14)-G(99.7)-K(68)/R(24)-R(98)-I(29)/V(24)/A(18)-C-L(52)/V(16)-G(99.7) |
| CYP3 | 239 | E-T(74)-L(92)-R | F(99.6)-G(99.6)-A(21)/T(23)/N(19)-G-P-R(98)-N(98)-C-I(91)-G(97) |
| CYP4 | 1034 | E-S(48)/T(27)/A(20)-L(86)-R | F-S(98)-A(89)-G(99.9)-P(77)-R(98)-N(90)-C-I(92)-G(98) |
| CYP5 | 52 | E-T-L-R | F-G-A-G-P-R-S-C-L(96)-G |
| CYP6 | 893 | E-T(78)-L(87)-R | F(99.9)-G(98)-E(40)/D(34)-G(99.8)-P(94)-R(91)-N(50)/I(19)-C-I(89)-G(91) |
| CYP7 | 88 | E-A(42)/S(41)-L(79)-R | F-G-S(60)-G-A(45)/T(24)-T(51)/S(40)-K(52)-C-P-G |
| CYP8 | 91 | E-T(60)-L-R | W(96)-G-A(59)-G(97)-H(37)/V(37)-S(46)/N(45)-I(32)/H(19)/Q(16)-C-P(55)-G |
| CYP9 | 305 | E-T(30)/S(25)/A(18)-L(93)-R | F-G(99)-V(30)/L(23)/I(22)-G-P(90)-R(99)-N(61)-C-I(97)-G(93) |
| CYP11 | 156 | E-T(95)-L(99)-R | F-G-F(75)-G-V(49)/M(22)/P(15)-R(99)-Q-C-L(81)-G |
| CYP12 | 119 | E-S(44)/G(34)-L(51)/Q(26)-R | F-G-F(96)-G-P(85)-R-M(54)/T(20)-C-I(47)/V(40)-G(98) |
| CYP17 | 99 | E-V(87)-L(91)-R | F-G-A(97)-G(99)-P(61)-R-V(50)/S(48)-C-L(39)/I(32)-G |
| CYP19 | 171 | E-S(72)-L(57)-R | F(99)-G(99)-S(39)/F(35)-G-P-R(99)-S(46)/A(36)-C-V(66)-G |
| CYP26 | 125 | E-V(56)/T(43)-L(74)-R | F-G-G(98)-V(42)/L(36)-R-S(59)/T(28)-C-L(46) |
| CYP27 | 113 | E-T(69)-L-R | F-G-Y(52)/F(33)-G-V(56)/K(28)-R-S(66)/A(29)-C-I(39)/L(34)-G |
| CYP53 | 92 | E-T(52)/A(26)-L(67)/M(30)-R | F-S-H(25)/F(17)/T(17)-G-P(98)-R-A(73)-C-V(86)-G |
| CYP58 | 97 | E-G(50)/S(29)/A(18)-L(59)/F(36)-R | F-S(64)-K(44)/R(39)-G-S(83)-R-Q(58)-C-I(62)-G |
| CYP65 | 203 | E-A(55)/S(20)-L(65)-R | F(99)-S(77)-I(26)/V(24)/F(13)-G-P(94)-R(99)-N(80)-C-I(62)-G(98) |
| CYP71 | 767 | E-T(73)-L(73)-R | F(99)-G(99)-A(57)-G(99.7)-R(96)-R(99)-I(44)/M(34)-C-P(96)-G(93) |
| CYP72 | 207 | E-V(90)-L(99)-R | F-G(80)-W(79)-G-P(94)-R(99)-I(72)-C-I(72)-G(97) |
| CYP73 | 153 | E-T(95)-L(95)-R | F(99)-G-V(93)-G-R-R-S(96)-C-P-G |
| CYP74 | 155 | E-A(39)/T(27)/V(15)-L(88)-R | P(92)-T(53)/S(32)-V(46)/E(13)-G(32)/D(19)/S(19)-N(88)-K-Q(97)-C-A(62)-G(77) |
| CYP75 | 248 | E-T(72)-F(87)-R | F-G(99.6)-A(96)-G-R(99.6)-R-I(96)-C-A(85)-G(99.6) |
| CYP76 | 201 | E-T(79)-F(39)/L(37)-R | F(98)-G(97)-A(60)-G-R(93)-R(99)-I(62)-C-P(81)-G(92) |
| CYP78 | 112 | E-T(51)/V(34)-L(99)-R | F-G-S(53)/A(46)-G-R(95)-R-V(68)-C-P-G |
| CYP79 | 95 | E-A(78)-F(87)-R | F(99)-S(77)-T(80)-G-R(82)-R-G(96)-C-P(39)/V(17)/I(15)-G(64) |
| CYP81 | 229 | E-T(89)-L(87)-R | F(99.6)-G-M(39)/L(27)-G(99.6)-R(99)-R(99.6)-A(43)/R(20)-C-P-G(94) |
| CYP82 | 166 | E-T(78)-L(84)-R | F(99)-G(92)-S(74)-G(99)-R(99)-R(99)-S(46)/I(14)-C-P(96)-G(91) |
| CYP86 | 139 | E-T(59)-L(86)-R | F-N(96)-A(67)-G-P-R-L(40)/T(32)-C-L(88)-G |
| CYP89 | 131 | E-G(89)-L(98)-R | F-G(98)-A(71)-G(98)-R(98)-R-I(76)-C-P(75)-G(85) |
| CYP90 | 112 | E-T(99)-L-R | F(89)-G-G(99)-G-P(55)/Q(36)-R-L(95)-C-P(73)-G |
| CYP92 | 163 | E-T(75)-M(50)/L(36)-R | F(99)-G(99)-S(65)-G-R(99)-R-M(54)/G(26)-C-P(96)-G(91) |
| CYP93 | 150 | E-T(97)-F(61)-R | F-G(99)-S(80)-G-R-R-M(51)/G(16)-C-P-G |
| CYP94 | 162 | E-S(65)-M(95)-R | F-Q(62)-A(85)-G-P(74)-R-V(48)/M(22)-C-L(84)-G |
| CYP97 | 99 | E-S(61)-L(64)-R | F-G(69)-G(88)-G-P(66)-R-K(91)-C-V(70)-G |
| CYP102 | 330 | E-S(52)/A(27)-R | F(98)-G(99)-N(80)-G-Q(73)-R-A(94)-C-I(99)-G |
| CYP105 | 328 | E-L(90)-L(81)-R | F(98)-G(93)-F(36)/Y(31)/H(28)-G-V(40)/I(28)/R(13)-H(99.7)-Q(85)-C-L(76)-G |
| CYP106 | 92 | E-V(84)-L(99)-R | F-G-K(83)-G-P(85)-H-F-C-L(99)-G |
| CYP107 | 217 | E-L(55)/M(15)-L(74)-R | F(92)-G(99)-H(68)-G-I(50)/V(18)-H-Y(40)/H(30)-C-L(72)-G(99) |
| CYP108 | 118 | E-M(52)-I(74)-R | F-G-Y(72)-V(50)/A(26)-H-F(49)/M(19)-C-L(74)-G(99) |
| CYP110 | 113 | E-T(83)-L-R | F-G-G(97)-G(88)- S(36)/N(27)/A(22)-R(99)-R(76)-C-I(62)-G(99) |
| CYP116 | 82 | E-C-L-R | F-G-Y-G-S(80)-H-Q-C-M(91)-G |
| CYP136 | 137 | E-S(44)/A(31)-L(86)-R | F-G-G(96)-G-A(60)/V(37)-H-K(97)-C-I(85)-G |
| CYP152 | 90 | E-V(99)-R-R | D(91)-P(35)/H(20)/Y(15)-A(24)/Y(17)/E(15)-K(39)-G(63)-H-R-C-P(75)-G |
| CYP153 | 163 | E-I(67)-I(94)-R | F-G-F(74)-G-I(61)-H-R-C-M(51)/V(44)-G |
| CYP154 | 104 | E-T(76)-L(96)-R | F-G-H(84)-G-V(41)/P(40)-H(97)-F(22)/V(21)/H(15)-C-L(54)/P(32)-G |
| CYP176 | 102 | E-V(50)/T(24)-M(60)-R | F-G(99)-G(96)-G-P(81)-R(90)-M(71)-C-P(82)-G(99) |
| CYP501 | 106 | E-T(97)-L-R | F-G(97)-G(98)-G-R(98)-H-R-C-I(99)-G |
| CYP584 | 88 | E-S(54)/T(22)-L(97)-R | F-N(94)-G(99)-G-P-R-I(84)-C-L(48)/I(27)-G |
| CYP620 | 167 | E-V(48)/T(18)-L(68)-R | F(73)-G-F(73)-G(99)-R(99)-R-I(57)-C-P(96)-G(99) |
| CYP704 | 103 | E-T(97)-L(95)-R | F-Q(76)-A(99)-G-P-R-I(81)-C-L(93)-G |
| CYP707 | 97 | E-T(57)-L(89)-R | F-G(99)-N(53)-G-V(52)-T(22)-H(99)-A(51)/S(45)-C-P(99)-G(99) |
| CYP709 | 112 | E-T(83)-L(96)-R | F-S(96)-L(24)/S(20)/F(16)-G(99)-P(92)-R-S(43)/V(27)-C-I(64)-G(98) |
| CYP716 | 101 | E-V(49)/T(25)-M(60)-R | F-G(99)-G(96)-G-P(81)-R(90)-M(71)-C-P(83)-G(99) |
The percentage predominance of amino acids at particular positions is calculated considering the total number of amino acids as 100%. Amino acids or patterns of amino acids contributing more than 50% at the specific position are shown in the table. Amino acids conserved (100%) at the specific position(s) are represented by their symbol. The numerical values in the table are percentage values.
Figure 5Analysis of amino acid patterns at the EXXR and CXG motif in CYP51 P450 family.
In total 407 CYP51 P450 sequences representing all biological kingdoms were analyzed for EXXR and CXG signature sequences. Member P450 sequences were retrieved from the CYPED website (26) and used for analysis, as described in the “Materials and methods” section.