| Literature DB >> 34073951 |
Nomfundo Nzuza1, Tiara Padayachee1, Puleng Rosinah Syed2, Justyna Dorota Kryś3, Wanping Chen4, Dominik Gront3, David R Nelson5, Khajamohiddin Syed1.
Abstract
Cytochrome P450 monooxygenases (CYPs/P450s), heme-thiolate proteins, are well-known players in the generation of chemicals valuable to humans and as a drug target against pathogens. Understanding the evolution of P450s in a bacterial population is gaining momentum. In this study, we report comprehensive analysis of P450s in the ancient group of the bacterial class Alphaproteobacteria. Genome data mining and annotation of P450s in 599 alphaproteobacterial species belonging to 164 genera revealed the presence of P450s in only 241 species belonging to 82 genera that are grouped into 143 P450 families and 214 P450 subfamilies, including 77 new P450 families. Alphaproteobacterial species have the highest average number of P450s compared to Firmicutes species and cyanobacterial species. The lowest percentage of alphaproteobacterial species P450s (2.4%) was found to be part of secondary metabolite biosynthetic gene clusters (BGCs), compared other bacterial species, indicating that during evolution large numbers of P450s became part of BGCs in other bacterial species. Our study identified that some of the P450 families found in alphaproteobacterial species were passed to other bacterial species. This is the first study to report on the identification of CYP125 P450, cholesterol and cholest-4-en-3-one hydroxylase in alphaproteobacterial species (Phenylobacterium zucineum) and to predict cholesterol side-chain oxidation capability (based on homolog proteins) by P. zucineum.Entities:
Keywords: Alphaproteobacteria; CYP125; annotation; biosynthetic gene clusters; cholesterol oxidation; cytochrome P450 monooxygenase; genome data mining
Mesh:
Substances:
Year: 2021 PMID: 34073951 PMCID: PMC8197338 DOI: 10.3390/ijms22115542
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Some of the secondary metabolites produced by alphaproteobacterial species and their biological functions.
| Secondary Metabolite | Species (Source) | Biological Function | Reference |
|---|---|---|---|
| Didemnin B | Anti-viral and anti-tumor properties | [ | |
| Thalassospiramide | Immunosuppression | [ | |
| Tropodithietic acid | Antibiotic, biocontrol and anti-cancer | [ | |
| Phytohormone | Improve plant growth | [ | |
| 6-gingerol and curcumin | Antitumor activity | [ | |
| Gibberellins, salicylic acid, indole-3-acetic acid, zeatin, and abscisic acid | Promote plant germination and growth | [ | |
| Cyclodipeptides | Bioactive agents | [ |
Figure 1Comparative analysis of P450 statistics in the bacterial class Alphaproteobacteria. Detailed information is presented in Table S1.
Comparative analysis of main characteristics of P450s in different bacterial species.
| Category | Alphaproteobacterial Species | Mycobacterial Species | Cyanobacterial Species | ||
|---|---|---|---|---|---|
| Total no of species analyzed | 599 | 972 | 203 | 60 | 114 |
| No of species that have P450s | 241 | 229 | 203 | 60 | 114 |
| No of P450s | 874 | 712 | 5460 | 1784 | 341 |
| No of families | 143 | 14 | 253 | 77 | 36 |
| No of subfamilies | 214 | 53 | 698 | 132 | 79 |
| Dominant P450 family | CYP202 | CYP107 | CYP107 | CYP125 | CYP110 |
| Average no. of P450s | 4 | 1 | 27 | 30 | 3 |
| P450 diversity percentage | 0.07 | 0.008 | 0.02 | 0.07 | 0.09 |
| No of P450s part of BGCs | 21 | 126 | 1231 | 204 | 27 |
| Percentage of P450s part of BGCs | 2.4 | 18 | 23 | 11 | 8 |
| Reference(s) | This work | [ | [ | [ | [ |
Figure 2Phylogenetic analysis of alphaproteobacterial P450s. P450 families that are expanded in these species were highlighted in different colors and indicated in the figure. Alphaproteobacterial P450 protein sequences used to construct the phylogenetic tree are presented in Supplementary Dataset 1.
Comparative analysis of P450 families and subfamilies in alphaproteobacterial species.
| P450 Family | P450 Count | Percentage Count | Subfamily | P450 Count |
|---|---|---|---|---|
| CYP1010 | 1 | 0.11% | A | 1 |
| CYP1017 | 1 | 0.11% | A | 1 |
| CYP101 | 10 | 1.14% | B | 1 |
| C | 3 | |||
| D | 2 | |||
| E | 2 | |||
| Q | 1 | |||
| R | 1 | |||
| CYP102 | 12 | 1.37% | A | 10 |
| K | 1 | |||
| AC | 1 | |||
| CYP1036 | 2 | 0.23% | B | 1 |
| C | 1 | |||
| CYP103 | 4 | 0.46% | A | 4 |
| CYP104 | 13 | 1.49% | A | 13 |
| CYP105 | 4 | 0.46% | BF | 3 |
| CX | 1 | |||
| CYP1061 | 2 | 0.23% | B | 2 |
| CYP1068 | 4 | 0.46% | A | 4 |
| CYP1075 | 1 | 0.11% | A | 1 |
| CYP1076 | 1 | 0.11% | A | 1 |
| CYP1077 | 1 | 0.11% | A | 1 |
| CYP1078 | 8 | 0.92% | A | 7 |
| C | 1 | |||
| CYP107 | 10 | 1.14% | AN | 8 |
| _ | 1 | |||
| JE | 1 | |||
| CYP1081 | 2 | 0.23% | B | 2 |
| CYP1082 | 1 | 0.11% | A | 1 |
| CYP1083 | 5 | 0.57% | A | 3 |
| C | 2 | |||
| CYP1086 | 1 | 0.11% | B | 1 |
| CYP1088 | 2 | 0.23% | B | 2 |
| CYP1089 | 1 | 0.11% | A | 1 |
| CYP108 | 37 | 4.23% | A | 2 |
| D | 6 | |||
| G | 18 | |||
| L | 1 | |||
| P | 2 | |||
| U | 1 | |||
| V | 1 | |||
| W | 1 | |||
| X | 5 | |||
| CYP1090 | 2 | 0.23% | C | 2 |
| CYP1091 | 3 | 0.34% | A | 3 |
| CYP1094 | 4 | 0.46% | A | 3 |
| B | 1 | |||
| CYP1098 | 6 | 0.69% | A | 6 |
| CYP1101 | 29 | 3.32% | A | 29 |
| CYP1104 | 10 | 1.14% | A | 4 |
| C | 3 | |||
| E | 3 | |||
| CYP1107 | 16 | 1.83% | A | 12 |
| B | 3 | |||
| C | 1 | |||
| CYP1111 | 7 | 0.80% | A | 5 |
| B | 1 | |||
| C | 1 | |||
| CYP1116 | 3 | 0.34% | B | 3 |
| CYP1118 | 1 | 0.11% | B | 1 |
| CYP111 | 2 | 0.23% | A | 1 |
| B | 1 | |||
| CYP112 | 23 | 2.63% | A | 23 |
| CYP1135 | 1 | 0.11% | A | 1 |
| CYP1137 | 1 | 0.11% | A | 1 |
| CYP1138 | 2 | 0.23% | B | 2 |
| CYP1141 | 1 | 0.11% | A | 1 |
| CYP1145 | 1 | 0.11% | A | 1 |
| CYP114 | 22 | 2.52% | A | 22 |
| CYP1155 | 1 | 0.11% | B | 1 |
| CYP115 | 1 | 0.11% | A | 1 |
| CYP116 | 1 | 0.11% | B | 1 |
| CYP1170 | 7 | 0.80% | A | 7 |
| CYP1171 | 1 | 0.11% | A | 1 |
| CYP1173 | 1 | 0.11% | A | 1 |
| CYP1174 | 2 | 0.23% | A | 2 |
| CYP1175 | 1 | 0.11% | A | 1 |
| CYP117 | 22 | 2.52% | A | 22 |
| CYP1181 | 1 | 0.11% | A | 1 |
| CYP1182 | 1 | 0.11% | A | 1 |
| CYP1186 | 2 | 0.23% | A | 2 |
| CYP1187 | 1 | 0.11% | A | 1 |
| CYP1204 | 2 | 0.23% | A | 2 |
| CYP1206 | 1 | 0.11% | B | 1 |
| CYP1221 | 2 | 0.23% | B | 2 |
| CYP1229 | 4 | 0.46% | A | 1 |
| B | 2 | |||
| C | 1 | |||
| CYP1246 | 2 | 0.23% | A | 2 |
| CYP1250 | 1 | 0.11% | A | 1 |
| CYP1258 | 1 | 0.11% | A | 1 |
| CYP125 | 1 | 0.11% | P | 1 |
| CYP1275 | 1 | 0.11% | B | 1 |
| CYP127 | 19 | 2.17% | A | 18 |
| C | 1 | |||
| CYP1281 | 1 | 0.11% | A | 1 |
| CYP1291 | 5 | 0.57% | B | 3 |
| C | 2 | |||
| CYP1302 | 2 | 0.23% | A | 2 |
| CYP1311 | 2 | 0.23% | A | 2 |
| CYP1312 | 1 | 0.11% | A | 1 |
| CYP1326 | 2 | 0.23% | B | 2 |
| CYP1330 | 1 | 0.11% | A | 1 |
| CYP1337 | 1 | 0.11% | A | 1 |
| CYP133 | 1 | 0.11% | F | 1 |
| CYP1349 | 1 | 0.11% | A | 1 |
| CYP1350 | 1 | 0.11% | A | 1 |
| CYP1371 | 1 | 0.11% | B | 1 |
| CYP1376 | 2 | 0.23% | B | 1 |
| C | 1 | |||
| CYP1384 | 1 | 0.11% | A | 1 |
| CYP1396 | 1 | 0.11% | A | 1 |
| CYP1405 | 1 | 0.11% | B | 1 |
| CYP1406 | 3 | 0.34% | B | 2 |
| C | 1 | |||
| CYP1460 | 1 | 0.11% | C | 1 |
| CYP147 | 16 | 1.83% | D | 16 |
| CYP1515 | 2 | 0.23% | A | 2 |
| CYP151 | 2 | 0.23% | C | 2 |
| CYP152 | 13 | 1.49% | B | 4 |
| C | 3 | |||
| E | 2 | |||
| P | 1 | |||
| AA | 1 | |||
| AB | 2 | |||
| CYP153 | 60 | 6.86% | A | 44 |
| C | 2 | |||
| D | 14 | |||
| CYP155 | 4 | 0.46% | J | 3 |
| K | 1 | |||
| CYP1591 | 1 | 0.11% | A | 1 |
| CYP1597 | 1 | 0.11% | A | 1 |
| CYP1732 | 3 | 0.34% | A | 3 |
| CYP1733 | 2 | 0.23% | A | 1 |
| B | 1 | |||
| CYP1734 | 1 | 0.11% | A | 1 |
| CYP1735 | 1 | 0.11% | A | 1 |
| CYP1736 | 1 | 0.11% | A | 1 |
| CYP1737 | 2 | 0.23% | A | 2 |
| CYP1738 | 1 | 0.11% | A | 1 |
| CYP1739 | 2 | 0.23% | A | 2 |
| CYP173 | 60 | 6.86% | A | 45 |
| B | 8 | |||
| C | 3 | |||
| G | 1 | |||
| H | 1 | |||
| J | 2 | |||
| CYP1740 | 1 | 0.11% | A | 1 |
| CYP1741 | 1 | 0.11% | A | 1 |
| CYP1742 | 1 | 0.11% | A | 1 |
| CYP1743 | 1 | 0.11% | A | 1 |
| CYP1744 | 1 | 0.11% | A | 1 |
| CYP1745 | 1 | 0.11% | A | 1 |
| CYP1746 | 1 | 0.11% | A | 1 |
| CYP1747 | 1 | 0.11% | A | 1 |
| CYP1748 | 1 | 0.11% | A | 1 |
| CYP1749 | 2 | 0.23% | A | 1 |
| B | 1 | |||
| CYP1750 | 1 | 0.11% | A | 1 |
| CYP1751 | 1 | 0.11% | A | 1 |
| CYP1752 | 4 | A | 4 | |
| CYP1753 | 1 | 0.11% | A | 1 |
| CYP1754 | 1 | 0.11% | A | 1 |
| CYP1755 | 1 | 0.11% | A | 1 |
| CYP180 | 1 | 0.11% | D | 1 |
| CYP186 | 7 | 0.80% | K | 7 |
| CYP191 | 1 | 0.11% | B | 1 |
| CYP192 | 3 | 0.34% | A | 3 |
| CYP193 | 17 | 1.95% | A | 17 |
| CYP194 | 10 | 1.14% | A | 10 |
| CYP195 | 21 | 2.40% | A | 18 |
| D | 1 | |||
| E | 2 | |||
| CYP196 | 37 | 4.23% | A | 34 |
| B | 2 | |||
| C | 1 | |||
| CYP197 | 2 | 0.23% | R | 2 |
| CYP199 | 16 | 1.83% | A | 11 |
| B | 1 | |||
| J | 1 | |||
| K | 1 | |||
| L | 1 | |||
| M | 1 | |||
| CYP200 | 8 | 0.92% | A | 5 |
| B | 2 | |||
| C | 1 | |||
| CYP201 | 31 | 3.55% | A | 25 |
| B | 3 | |||
| C | 3 | |||
| CYP202 | 70 | 8.01% | A | 46 |
| B | 24 | |||
| CYP203 | 13 | 1.49% | A | 12 |
| B | 1 | |||
| CYP204 | 2 | 0.23% | A | 2 |
| CYP206 | 19 | 2.17% | A | 9 |
| B | 1 | |||
| C | 3 | |||
| D | 6 | |||
| CYP219 | 2 | 0.23% | A | 2 |
| CYP223 | 3 | 0.34% | A | 2 |
| E | 1 | |||
| CYP224 | 1 | 0.11% | A | 1 |
| CYP225 | 7 | 0.80% | A | 7 |
| CYP226 | 2 | 0.23% | C | 1 |
| D | 1 | |||
| CYP278 | 2 | 0.23% | A | 1 |
| C | 1 | |||
| CYP286 | 1 | 0.11% | C | 1 |
| CYP288 | 2 | 0.23% | B | 2 |
| CYP289 | 33 | 3.78% | A | 33 |
| CYP290 | 6 | 0.69% | A | 5 |
| B | 1 | |||
| CYP2140 | 1 | 0.11% | A | 1 |
| CYP1236 | 1 | 0.11% | A | 1 |
| CYP2334 | 2 | 0.23% | A | 2 |
Figure 3Heat map figure representing the presence or absence of cytochrome P450 families in 599 alphaproteobacterial species. The data have been represented as −3 for family absence (green) and 3 for family presence (red). One hundred and forty-three alphaproteobacterial species form the vertical axis and 214 P450 families form the horizontal axis. The respective data used in the generation of this figure are presented in Supplementary Dataset 2.
Figure 4Comparative analysis of types of secondary metabolite BGCs in alphaproteobacterial species. BGCs that are populated in alphaproteobacterial species were presented in the figure. The number at the top of each bar represents the total number of clusters. Detailed information is presented in Supplementary Dataset 3.
Identification of P450s that are part of the secondary metabolite BGCs in alphaproteobacterial species. Reference cluster information was obtained by performing BLAST at the anti-SMASH database [46], as indicated in the materials and methods section. The cluster type and most similar known cluster names available at the anti-SMASH database [46] were listed in the table.
| Species Name | P450 | Reference Cluster Information | ||
|---|---|---|---|---|
| Cluster Type | Most Similar Known Cluster | Similarity | ||
| Novosphingobium aromaticivorans | CYP153C1 | Terpene | Astaxanthin dideoxyglycoside | 75% |
| CYP127A3v1 | Hserlactone | - | - | |
| Agrobacterium fabrum | CYP206A1 | Terpene | - | - |
| CYP206A4 | Terpene | - | - | |
|
| CYP206A4 | Terpene | - | - |
|
| CYP206A2 | Terpene | - | - |
| CYP206A3 | Terpene | - | - | |
| CYP107JE1 | NAGGN | - | - | |
| CYP199A26 | Hserlactone, t2pks | Colabomycin | 4% | |
|
| CYP108L2 | NRPS | - | - |
|
| CYP1101A30 | T1PKS | S56-p1, NRPS | 3% |
|
| CYP173J1 | NRPS, T1PKS | - | - |
|
| CYP1104E2 | T1PKS | Sphingan polysaccharide, saccharide | 13% |
| CYP152C2 | hserlactone | Conglobatin, NRPS | 10% | |
|
| CYP1101A27 | Ectoine | Ectoine, other | 100% |
| CYP1246A4 | Terpene | - | - | |
| CYP1302A1 | NRPS | - | - | |
|
| CYP1138B1 | T1PKS | - | - |
| CYP1326A2 | acyl_amino_acids | - | - | |
| CYP195A21 | bacteriocin | - | - | |
| CYP2334A1 | NRPS-like | - | - | |
Figure 5Comparative analysis of P450 families among different bacterial species. The number of P450 families indicated with a number (bold case) and P450 families that are common between alphaproteobacterial species and other bacterial species were listed in the figure.
Identification of homolog proteins involved in cholesterol side-chain oxidation in Phenylobacterium zucineum. Mycobacterium tuberculosis H37Rv cholesterol side-chain degrading proteins from the published literature [51] were used to identify homolog proteins in P. zucineum. Gene and protein IDs and enzyme annotations were from KEGG [52].
| Proteins Involved in Cholesterol Side-Chain Oxidation in | Homolog Proteins in | |||||
|---|---|---|---|---|---|---|
| Gene Name | Gene ID | Enzyme | Protein ID | % Identity | % Homology | Enzyme |
|
|
| Acyl-CoA synthetase | PHZ_c1345 (fadD) | 29 | 41 | Long-chain acyl-CoA synthetase |
| pzu:PHZ_c1155 | 38 | 52 | Malonyl-CoA/methylmalonyl-CoA synthetase | |||
|
|
| Probable fatty-acid-CoA ligase | pzu:PHZ_c2065 | 39 | 57 | Fatty-acid-CoA ligase |
|
|
| Acyl-CoA synthetase (AMP forming) | pzu:PHZ_c1909 | 35 | 47 | Long-chain-fatty-acid—CoA ligase |
|
|
| Possible fatty-acid-CoA ligase | pzu:PHZ_c0597 | 30 | 46 | Fatty-acyl-CoA synthase |
|
|
| Probable fatty-acid-CoA ligase | pzu:PHZ_c2065 | 39 | 57 | Long-chain fatty acid:CoA ligase |
|
|
| Fatty acid-CoA synthase | pzu:PHZ_c2512 | 29 | 43 | Long-chain-fatty-acid--CoA ligase |
|
|
| Fatty acid-CoA synthase | pzu:PHZ_c2123 | 25 | 38 | Long-chain acyl-CoA synthetase |
|
|
| Possible fatty-acid-CoA ligase | pzu:PHZ_c2065 | 55 | 73 | CoA-synthetase, long-chain fatty acid:CoA ligase |
|
|
| Acyl-CoA dehydrogenase | pzu:PHZ_c2678 | 40 | 54 | Dehydrogenase family protein |
|
|
| Acyl-CoA dehydrogenase | pzu:PHZ_c0890 | 33 | 51 | Acyl-CoA dehydrogenase |
|
|
| Acyl-CoA dehydrogenase | pzu:PHZ_c2365 | 32 | 48 | Acyl-CoA dehydrogenase |
|
|
| Acyl-CoA dehydrogenase FADE25 | pzu:PHZ_c1680 | 40 | 60 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2336 | 33 | 50 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2406 | 28 | 45 | Isovaleryl CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2498 | 26 | 43 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2541 | 32 | 51 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2541 | 41 | 57 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2678 | 36 | 53 | Dehydrogenase family protein |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2498 | 32 | 45 | Acyl-CoA dehydrogenase |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2679 | 33 | 48 | Acyl-CoA dehydrogenase family protein |
|
|
| Probable acyl-CoA dehydrogenase | pzu:PHZ_c2678 | 38 | 56 | Dehydrogenase family protein |
|
|
| Acyl-CoA dehydrogenase | pzu:PHZ_c3354 | 36 | 51 | Acyl-CoA dehydrogenase |
|
|
| Acyl-CoA dehydrogenase | pzu:PHZ_c0388 | 30 | 48 | Acyl-CoA dehydrogenase |
|
|
| 3-Hydroxyisobutyryl-CoA hydrolase | pzu:PHZ_c1679 | 39 | 57 | Enoyl-CoA hydratase/isomerase family protein |
|
|
| Possible enoyl-CoA hydratase | pzu:PHZ_c2535 | 38 | 54 | Enoyl-CoA hydratase/carnithine racemase |
|
|
| Possible enoyl-CoA hydratase | pzu:PHZ_c3429 | 33 | 51 | Enoyl-CoA hydratase/isomerase family protein |
|
|
| hydroxybutyryl-CoA dehydrogenase | pzu:PHZ_c3152 | 43 | 60 | 3-Hydroxyacyl-CoA dehydrogenase |
|
|
| Hydroxybutyryl-CoA dehydrogenase | pzu:PHZ_c3152 | 35 | 49 | 3-Hydroxyacyl-CoA dehydrogenase |
|
|
| Acetoacetyl-CoA thiolase | pzu:PHZ_c2504 | 40 | 60 | Acetyl-CoA C-acetyltransferase |
|
|
| 17β-Hydroxysteroid dehydrogenase (17β-HSD) | pzu:PHZ_c2008 | 36 | 49 | Short-chain dehydrogenase/reductase SDR |
|
|
| Probable ketoacyl-CoA thiolase | pzu:PHZ_c3245 | 36 | 49 | Acetyl-CoA C-acetyltransferase |
Functional analysis of alphaproteobacterial P450s.
| P450 | Function | Reference |
|---|---|---|
| CYP101B1 | α-ionone, β-damascone, phenylcyclohexane and | [ |
| CYP101C1 | Ionone derivative hydroxylation | [ |
| CYP101D1 | Terpenoid (camphor) hydroxylase | [ |
| CYP101D2 | Camphor 5-exo hydroxylase | [ |
| CYP108D1 | Aromatic hydrocarbon hydroxylase | [ |
| CYP153C1, CYP153D1 | Alkane hydroxylase | [ |
| CYP111A2 | Oxidizes linalool to 8-hydroxylinalool | [ |
| CYP111B1 | β-Ionone hydroxylation | [ |
| CYP112A2 | Oxidation of Rapamycin | [ |
| CYP127A3 | Hydroxylation of testosterone | [ |
| CYP195A2 | Degradation of 4-fluoro-, 4-chloro- and 4-methylsalicylic acid, and 3-chloro- and 3-methylsalicylic acid | [ |
| CYP199A1 | Hydroxylation of 2-naphthoic acid | [ |
| CYP199A2 | Hydroxylation of 2-naphthoic acid, para-substituted benzoic acids and involved in the degradation of ligninolic compounds | [ |
| CYP199A4 | Catalyzes heteroatom dealkylations, sulfoxidation, and amide and cyclic hemiacetal formation | [ |
| CYP200A1 | Hydroxylation of testosterone | [ |
| CYP201A2 | Involved in the biodegradation of tributyl phosphate | [ |