Literature DB >> 34073951

Ancient Bacterial Class Alphaproteobacteria Cytochrome P450 Monooxygenases Can Be Found in Other Bacterial Species.

Nomfundo Nzuza1, Tiara Padayachee1, Puleng Rosinah Syed2, Justyna Dorota Kryś3, Wanping Chen4, Dominik Gront3, David R Nelson5, Khajamohiddin Syed1.   

Abstract

Cytochrome P450 monooxygenases (CYPs/P450s), heme-thiolate proteins, are well-known players in the generation of chemicals valuable to humans and as a drug target against pathogens. Understanding the evolution of P450s in a bacterial population is gaining momentum. In this study, we report comprehensive analysis of P450s in the ancient group of the bacterial class Alphaproteobacteria. Genome data mining and annotation of P450s in 599 alphaproteobacterial species belonging to 164 genera revealed the presence of P450s in only 241 species belonging to 82 genera that are grouped into 143 P450 families and 214 P450 subfamilies, including 77 new P450 families. Alphaproteobacterial species have the highest average number of P450s compared to Firmicutes species and cyanobacterial species. The lowest percentage of alphaproteobacterial species P450s (2.4%) was found to be part of secondary metabolite biosynthetic gene clusters (BGCs), compared other bacterial species, indicating that during evolution large numbers of P450s became part of BGCs in other bacterial species. Our study identified that some of the P450 families found in alphaproteobacterial species were passed to other bacterial species. This is the first study to report on the identification of CYP125 P450, cholesterol and cholest-4-en-3-one hydroxylase in alphaproteobacterial species (Phenylobacterium zucineum) and to predict cholesterol side-chain oxidation capability (based on homolog proteins) by P. zucineum.

Entities:  

Keywords:  Alphaproteobacteria; CYP125; annotation; biosynthetic gene clusters; cholesterol oxidation; cytochrome P450 monooxygenase; genome data mining

Mesh:

Substances:

Year:  2021        PMID: 34073951      PMCID: PMC8197338          DOI: 10.3390/ijms22115542

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   5.923


1. Introduction

Cytochrome P450 monooxygenases (CYPs/P450s) are heme-thiolate proteins known to humans for more than five and a half decades [1]. P450s perform enzymatic reactions with stereo- and regio-specific oxidation and because of this capability these enzymes are subjected to various applications in all forms of biology [2,3]. Most living organisms and some non-living entities such as viruses have been found to have P450s in their genomes [4,5], indicating their important role in primary and secondary metabolism. P450 reactions are critical in determining the drug toxicity of prodrugs and xenobiotic compounds [6]. P450s are well-known drug targets for some of the microbial pathogens [7,8]. Some of the P450s from microbes have been characterized and their application in the production of chemicals that are valuable to humans, such as secondary metabolites (compounds produced by the organisms that have no role in their growth, development or reproduction) has been explored [3,9,10,11,12,13]. Well-known examples of P450s’ involvement are antibiotic production, such as in the biosynthesis of the macrolide antibiotic erythromycin and glycopeptide antibiotics [14,15], the production of the anticancer drugs Taxol and perillyl alcohol [16,17] and the production of pharmaceutical compounds such as pravastatin [13]. P450s’ critical role in the synthesis of secondary metabolites has been thoroughly reviewed [13,18,19]. It was found that the coding sequences (genes) of enzymes involved in the synthesis of different secondary metabolites, including P450s, were part of gene clusters named biosynthetic gene clusters (BGCs) [20], where P450s were found to be critical in contributing to the diversity of the secondary metabolites [13,18,19]. In the pre-genome sequencing era, identification of such gene clusters and P450s that formed part of the clusters required tedious and laborious work. The current genome sequencing era has revolutionized the process and bioinformatics programs are available for the identification of BGCs in organisms [20]. However, to date, identification and annotation of P450s still involve manual sorting and naming as per the International P450 Nomenclature Committee rule, i.e., sequences with >40% identity are assigned to the same family as the named homolog P450 and sequences with >55% identity are assigned to the same subfamily as the named homolog P450 [21,22,23]. Proteins with <40% identity are assigned to a new P450 family. The current genome sequencing era has offered the opportunity to understand P450s’ profiles and thus evolution patterns, particularly in bacterial species where the impact of lifestyle has been found to affect the P450 profiles in an organism [24] profoundly. The bacterial class Alphaproteobacteria contains the most abundant, extraordinarily diverse and ancient group of bacteria [25,26]. Alphaproteobacteria consist of species with a diversity of lifestyles, geographical distribution and genome size [26]. They inhabit diverse ecological niches, from water to soil, and form both extra- and intracellular associations with eukaryotes, including unicellular organisms, and multicellular plants and mammals [25,26]. Some species are in symbiotic relationships with plants such as species belonging to the genus Agrobacterium; species belonging to Rickettsiaceae, Brucellaceae and Bartonellaceae are human pathogens [27], of which a few are of ecological importance, playing a huge role in carbon, nitrogen and sulfur cycles [28]. These bacteria are found in large amounts in the upper layer of the ocean [29]. In spite of their differences, all alphaproteobacterial species are oligotrophs; therefore, they can survive in an environment with low nutrients [30]. Alphaproteobacteria contain species that have biotechnological significance, especially some of the species found to produce secondary metabolites, compounds that play no role in the growth or reproduction of organisms, but give the organism a selective advantage [31,32]. The biological properties of these secondary metabolites in the field of drug discovery are invaluable, and strenuous efforts to find new secondary metabolites with biotechnological potentials are being made around the world. Some of the secondary metabolites produced by alphaproteobacterial species and their biological function are presented in Table 1.
Table 1

Some of the secondary metabolites produced by alphaproteobacterial species and their biological functions.

Secondary MetaboliteSpecies (Source)Biological FunctionReference
Didemnin BTistrella mobilis YIT 12409Anti-viral and anti-tumor properties[33]
ThalassospiramideThalassospira sp. CNJ-328Immunosuppression[34]
Tropodithietic acidPhaeobacter gallaeciensis and P. porticolaAntibiotic, biocontrol and anti-cancer[31]
PhytohormoneMethylobacterium sp. NC4Improve plant growth[35]
6-gingerol and curcuminBradyrhizobium japonicum CPAC 15 and Bradyrhizobium diazoefficiens CPAC 7Antitumor activity[36]
Gibberellins, salicylic acid, indole-3-acetic acid, zeatin, and abscisic acidSphingomonas speciesPromote plant germination and growth[37]
CyclodipeptidesSulfitobacter speciesBioactive agents[38]
It is a well-known fact that P450s are involved in the production of secondary metabolites per se; they contribute to the diversity of secondary metabolites produced by an organism [13,18,19]. Recent genome data mining, annotation of P450s and analysis of P450s’ association with secondary metabolism in some bacterial species such as mycobacterial species [39], Streptomyces species [40], cyanobacterial species [41] and Firmicutes species [24] revealed that quite a large number of P450s are involved in the production of secondary metabolites. Apart from these bacterial species, none of the other bacterial species’ P450 repertoire has been analyzed. In addition to this, to date, the P450 repertoire of alphaproteobacterial species and their association with secondary metabolism has not been reported despite the fact that P450s, in general, are involved in the production of secondary metabolites and these species do produce secondary metabolites. Thus, in this study, we address these research gaps by performing genome-wide data mining, annotation and phylogenetic analysis of P450s and identification of P450s involved in secondary metabolism in the bacterial class Alphaproteobacteria.

2. Results and Discussion

2.1. Only 40% of Alphaproteobacterial Species Have P450s

Genome data mining and annotation of P450s in 599 alphaproteobacterial species belonging to 164 genera (Table S1) revealed the presence of P450s in only 241 species belonging to 82 genera (Figure 1). This indicates that only 40% of alphaproteobacterial species and 50% of Alphaproteobacteria genera have P450s (Figure 1). All the species analyzed in the study belonging to genera such as Agrobacterium, Rhodopseudomonas, Hyphomicrobium, Ketogulonicigenium, Filomicrobium, Phenylobacterium, Roseobacter, Gluconacetobacter and Nitrobacter, and most of the species belonging to genera such as Sinorhizobium, Rhizobium, Bradyrhizobium and Sphingobium, have P450s in their genomes (Table S1). A significant number of species belonging to genera such as Rickettsia, Bartonella, Ehrlichia, Wolbachia and Anaplasma were analyzed in this study; no P450s were found in these species, suggesting that species in these genera probably do not have P450s (Table S1). Detailed information on genera, species and P450 information is presented in Table S1. In total, 874 P450s apart from 52 short-P450s were identified in 241 alphaproteobacterial species (Table S2 and Supplementary Dataset 1). On average, four P450s were found in 241 alphaproteobacterial species; 65 species had a single P450 in their genome (Table S1). Of these 65 species, 30 were from the genus Brucella alone (Table S1). Comparative analysis of P450s in alphaproteobacterial species revealed that Bradyrhizobium oligotrophicum has the highest number of P450s (17 P450s) in its genome, followed by 16 P450s in the species Bradyrhizobium japonicum E109, Novosphingobium aromaticivorans and Sphingomonas wittichii and 15 P450s in Bradyrhizobium diazoefficiens USDA 110 (Table S2). Comparative analysis with other bacterial species revealed that alphaproteobacterial species have a higher average number of P450s compared to Firmicutes species [24] and cyanobacterial species [41], but a lower number compared to Streptomyces species [39,40] and mycobacterial species [42] (Table 2). A list of P450s and their sequences along with short-P450s is presented in Supplementary Dataset 1.
Figure 1

Comparative analysis of P450 statistics in the bacterial class Alphaproteobacteria. Detailed information is presented in Table S1.

Table 2

Comparative analysis of main characteristics of P450s in different bacterial species.

CategoryAlphaproteobacterial SpeciesFirmicutes SpeciesStreptomyces SpeciesMycobacterial SpeciesCyanobacterial Species
Total no of species analyzed59997220360114
No of species that have P450s24122920360114
No of P450s87471254601784341
No of families143142537736
No of subfamilies2145369813279
Dominant P450 familyCYP202CYP107CYP107CYP125CYP110
Average no. of P450s4127303
P450 diversity percentage0.070.0080.020.070.09
No of P450s part of BGCs21126123120427
Percentage of P450s part of BGCs2.41823118
Reference(s)This work[24,43][39,40][39,42][41]

2.2. Alphaproteobacterial Species Have the Highest Number of P450 Families Next to Streptomyces

Following the International P450 Nomenclature Committee rules, 874 alphaproteobacterial species’ P450s were grouped into 143 P450 families and 214 P450 subfamilies (grouped in the same family when they share >40% and the same subfamily when they share >55%; with less than 40% identity species are assigned to a new P450 family) [21,22], based on the phylogenetic analysis of P450s (Figure 2) [22,44,45]. A list of P450 families and subfamilies, their count and percentage contribution to the total number of P450s is presented in Table 3.
Figure 2

Phylogenetic analysis of alphaproteobacterial P450s. P450 families that are expanded in these species were highlighted in different colors and indicated in the figure. Alphaproteobacterial P450 protein sequences used to construct the phylogenetic tree are presented in Supplementary Dataset 1.

Table 3

Comparative analysis of P450 families and subfamilies in alphaproteobacterial species.

P450 FamilyP450 CountPercentage CountSubfamilyP450 Count
CYP101010.11%A1
CYP101710.11%A1
CYP101101.14%B1
C3
D2
E2
Q1
R1
CYP102121.37%A10
K1
AC1
CYP103620.23%B1
C1
CYP10340.46%A4
CYP104131.49%A13
CYP10540.46%BF3
CX1
CYP106120.23%B2
CYP106840.46%A4
CYP107510.11%A1
CYP107610.11%A1
CYP107710.11%A1
CYP107880.92%A7
C1
CYP107101.14%AN8
_1
JE1
CYP108120.23%B2
CYP108210.11%A1
CYP108350.57%A3
C2
CYP108610.11%B1
CYP108820.23%B2
CYP108910.11%A1
CYP108374.23%A2
D6
G18
L1
P2
U1
V1
W1
X5
CYP109020.23%C2
CYP109130.34%A3
CYP109440.46%A3
B1
CYP109860.69%A6
CYP1101293.32%A29
CYP1104101.14%A4
C3
E3
CYP1107161.83%A12
B3
C1
CYP111170.80%A5
B1
C1
CYP111630.34%B3
CYP111810.11%B1
CYP11120.23%A1
B1
CYP112232.63%A23
CYP113510.11%A1
CYP113710.11%A1
CYP113820.23%B2
CYP114110.11%A1
CYP114510.11%A1
CYP114222.52%A22
CYP115510.11%B1
CYP11510.11%A1
CYP11610.11%B1
CYP117070.80%A7
CYP117110.11%A1
CYP117310.11%A1
CYP117420.23%A2
CYP117510.11%A1
CYP117222.52%A22
CYP118110.11%A1
CYP118210.11%A1
CYP118620.23%A2
CYP118710.11%A1
CYP120420.23%A2
CYP120610.11%B1
CYP122120.23%B2
CYP122940.46%A1
B2
C1
CYP124620.23%A2
CYP125010.11%A1
CYP125810.11%A1
CYP12510.11%P1
CYP127510.11%B1
CYP127192.17%A18
C1
CYP128110.11%A1
CYP129150.57%B3
C2
CYP130220.23%A2
CYP131120.23%A2
CYP131210.11%A1
CYP132620.23%B2
CYP133010.11%A1
CYP133710.11%A1
CYP13310.11%F1
CYP134910.11%A1
CYP135010.11%A1
CYP137110.11%B1
CYP137620.23%B1
C1
CYP138410.11%A1
CYP139610.11%A1
CYP140510.11%B1
CYP140630.34%B2
C1
CYP146010.11%C1
CYP147161.83%D16
CYP151520.23%A2
CYP15120.23%C2
CYP152131.49%B4
C3
E2
P1
AA1
AB2
CYP153606.86%A44
C2
D14
CYP15540.46%J3
K1
CYP159110.11%A1
CYP159710.11%A1
CYP173230.34%A3
CYP173320.23%A1
B1
CYP173410.11%A1
CYP173510.11%A1
CYP173610.11%A1
CYP173720.23%A2
CYP173810.11%A1
CYP173920.23%A2
CYP173606.86%A45
B8
C3
G1
H1
J2
CYP174010.11%A1
CYP174110.11%A1
CYP174210.11%A1
CYP174310.11%A1
CYP174410.11%A1
CYP174510.11%A1
CYP174610.11%A1
CYP174710.11%A1
CYP174810.11%A1
CYP174920.23%A1
B1
CYP175010.11%A1
CYP175110.11%A1
CYP17524 A4
CYP175310.11%A1
CYP175410.11%A1
CYP175510.11%A1
CYP18010.11%D1
CYP18670.80%K7
CYP19110.11%B1
CYP19230.34%A3
CYP193171.95%A17
CYP194101.14%A10
CYP195212.40%A18
D1
E2
CYP196374.23%A34
B2
C1
CYP19720.23%R2
CYP199161.83%A11
B1
J1
K1
L1
M1
CYP20080.92%A5
B2
C1
CYP201313.55%A25
B3
C3
CYP202708.01%A46
B24
CYP203131.49%A12
B1
CYP20420.23%A2
CYP206192.17%A9
B1
C3
D6
CYP21920.23%A2
CYP22330.34%A2
E1
CYP22410.11%A1
CYP22570.80%A7
CYP22620.23%C1
D1
CYP27820.23%A1
C1
CYP28610.11%C1
CYP28820.23%B2
CYP289333.78%A33
CYP29060.69%A5
B1
CYP214010.11%A1
CYP123610.11%A1
CYP233420.23%A2
P450 family and subfamily-level comparative analysis with other bacterial species revealed that alphaproteobacterial species have more P450 families and subfamilies compared to Firmicutes species, mycobacterial species and cyanobacterial species but fewer than Streptomyces species (Table 2). The highest diversity of P450 families and subfamilies observed in alphaproteobacterial species is possibly due to their diverse lifestyle that led to the generation of diverse P450 families and subfamilies, as the lifestyle of an organism is known to have an impact on the P450 repertoire in the genomes [24]. Thus, for this reason, 77 new P450 families were observed in these species, contributing to the P450 family diversity (Table S3). In aligning with high P450 family and subfamily diversity, alphaproteobacterial species also showed the highest P450 diversity percentage compared to Firmicutes species and Streptomyces species, the same as mycobacterial species but lower than cyanobacterial species (Table 2). A point to be noted is that the number of cyanobacterial species analyzed is almost half of alphaproteobacterial species and thus cyanobacterial species have a slightly higher P450 diversity percentage (Table 2). P450 family-level analysis revealed that some P450 families are expanded in alphaproteobacterial species (Table 3). Among the P450 families, the CYP202 family has the highest number of P450s (70 P450s), followed by CYP153 and CYP173 (each 60 P450s), CYP108 and CYP196 (each 37 P450s) (Table 3). Sixty-five P450 families have a single member and 29 P450 families have two members, indicating the diversity of P450 families in alphaproteobacterial species (Table 3). Analysis of P450 families and subfamilies in alphaproteobacterial species revealed that the P450 family with the highest number of subfamilies was CYP108 (nine subfamilies) followed by CYP101, CYP152, CYP199, CYP173 (each with six subfamilies) and CYP206 (fivesubfamilies). Interestingly, some of the subfamilies are expanded in the dominant P450 families such as CYP202, CYP173, CYP153, CYP196, CYP289, CYP201, CYP1101, CYP112, CYP114 and CYP117, where subfamily “A” is dominant (Table 3). A detailed analysis of the subfamilies and their member count is presented in Table 3. Heat map analysis of the presence and absence of P450 families in different alphaproteobacterial species revealed that none of the P450 family is conserved in these species (Figure 3). Non-conservation of P450 families in bacterial species is not common; it was also observed in cyanobacterial species [41]. However, the co-presence of quite a number of P450 families was found in a large number of species, such as CYP173 and CYP202 in 32 species, CYP196 and CYP201 in 16 species, CYP195 and CYP199 in 10 species, CYP195 and CYP196 in 11 species, CY173, CYP117, CYP127, CYP112 and CYP114 in 10 species and CYP173, CYP147, CYP201 and CYP206 in 9 species (Figure 3). A point to be noted is that the co-presence of some P450 families was also observed in other bacterial species [24]. When compared to other bacterial species, the CYP202 family is dominant in alphaproteobacterial species, while CYP110 is dominant in cyanobacterial species, CYP125 in mycobacterial species, and CYP107 in both the Firmicutes species and Streptomyces species (Table 2).
Figure 3

Heat map figure representing the presence or absence of cytochrome P450 families in 599 alphaproteobacterial species. The data have been represented as −3 for family absence (green) and 3 for family presence (red). One hundred and forty-three alphaproteobacterial species form the vertical axis and 214 P450 families form the horizontal axis. The respective data used in the generation of this figure are presented in Supplementary Dataset 2.

2.3. Only a Few P450s Are Involved in Secondary Metabolism in Alphaproteobacterial Species

Analysis of metabolite BGCs revealed that 504 species of 599 alphaproteobacterial species have secondary metabolite BGCs in their genomes (Supplementary Dataset 3). In total, 2262 secondary metabolite BGCs belonging to 93 types were found in 504 species (Supplementary Dataset 3). Among the 93 cluster types, terpene was the dominant cluster (270 clusters), followed by Homoserine lactone (hserlactone) (180 clusters), bacteriocin (118 clusters), betalactone (82 clusters) and nonribosomal peptides (NRPS) (79 clusters) in these species (Figure 4). Most similar known cluster analysis revealed that of 2262 BGCs, 642 BGCs showed similarity to 132 known clusters and 96 of these clusters showed 100% identity to known clusters, indicating their involvement in the specific secondary metabolite (Supplementary Dataset 3).
Figure 4

Comparative analysis of types of secondary metabolite BGCs in alphaproteobacterial species. BGCs that are populated in alphaproteobacterial species were presented in the figure. The number at the top of each bar represents the total number of clusters. Detailed information is presented in Supplementary Dataset 3.

Analysis of P450s that are part of BGCs revealed that only 2.4% of alphaproteobacterial species P450s are involved in the production of secondary metabolites (Table 4). The percentage of P450s involved in the production of secondary metabolites in alphaproteobacterial species was found to be lowest compared to other bacterial species such as Cyanobacteria (8%), Firmicutes species (18%), mycobacterial species (11%) and Streptomyces species (23%) (Table 2). Twenty-one P450s from 19 alphaproteobacterial species were found to be part of BGCs (Table 4). Of the 16 P450 families that are part of BGCs, CYP206 is the dominant family (5 P450s-24%), followed by CYP1101 (2 P450s-10%) and the remaining 14 P450 families (CYP195, CYP1101, CYP2334, CYP199, CYP173, CYP153, CYP152, CYP1302, CYP127, CYP1246, CYP1138, CYP1104, CYP108, CYP107, CYP1326) have a single P450 (Table 4). Dominant P450 families such as CYP173, CYP153 and CYP108 have only one representative as part of BGCs, indicating no correlation between the dominant P450 family vs BGCs (Table 4). A point to be noted is that three P450s, CYP1326A2, CYP2334A1 and CYP195A21 from Sulfitobacter sp. AM1-D1, were found to be part of BGCs (Table 4). Most similar known cluster analysis revealed that CYP1101A27 from Celeribacter indicus was certainly involved in the production of ectoine, as the percentage identity with the most similar known cluster was 100% (Table 4).
Table 4

Identification of P450s that are part of the secondary metabolite BGCs in alphaproteobacterial species. Reference cluster information was obtained by performing BLAST at the anti-SMASH database [46], as indicated in the materials and methods section. The cluster type and most similar known cluster names available at the anti-SMASH database [46] were listed in the table.

Species NameP450Reference Cluster Information
Cluster TypeMost Similar Known ClusterSimilarity
Novosphingobium aromaticivoransCYP153C1TerpeneAstaxanthin dideoxyglycoside75%
Mesorhizobium japonicum MAFF 303099CYP127A3v1Hserlactone--
Agrobacterium fabrumCYP206A1Terpene--
Agrobacterium sp. H13-3CYP206A4Terpene--
Agrobacterium tumefaciens CYP206A4Terpene--
Agrobacterium rhizogenes CYP206A2Terpene--
Rhizobium sp. IRBG74CYP206A3Terpene--
Rhizobium sp. NT-26CYP107JE1NAGGN--
Bradyrhizobium sp. S23321CYP199A26Hserlactone, t2pksColabomycin4%
Bradyrhizobium oligotrophicum CYP108L2NRPS--
Bosea vaviloviae CYP1101A30T1PKSS56-p1, NRPS3%
Beijerinckia indica CYP173J1NRPS, T1PKS--
Pseudorhodoplanes sinuspersici CYP1104E2T1PKSSphingan polysaccharide, saccharide13%
Rhodobacter sphaeroides ATCC 17025CYP152C2hserlactoneConglobatin, NRPS10%
Celeribacter indicus CYP1101A27EctoineEctoine, other100%
Hyphomonadaceae bacterium UKL13-1CYP1246A4Terpene--
Sphingopyxis macrogoltabida 203CYP1302A1NRPS--
Xanthobacter autotrophicus CYP1138B1T1PKS--
Sulfitobacter sp.AM1-D1CYP1326A2acyl_amino_acids--
CYP195A21bacteriocin--
CYP2334A1NRPS-like--

2.4. Alphaproteobacterial P450 Families Can Be Found in Other Bacterial Species

It is a well-known fact that bacterial species in Alphaproteobacteria are regarded as an ancient group of bacteria [25,26]. A comparison of P450 families with other bacterial species will provide important P450 family evolutionary distribution patterns. To understand this aspect, we performed comprehensive comparative analysis of P450 families from different bacterial species (Figure 5). As shown in Figure 5, four P450 families were commonly found among alphaproteobacterial species, Firmicutes species and cyanobacterial species. Quite a large number of P450 families were found to be common among alphaproteobacterial species and Streptomyces species (22 P450 families), and mycobacterial species (14 P450 families) (Figure 5). Only the CYP107 family was found to be conserved among all bacterial species (Figure 5). CYP152 and CYP197 were commonly found in alphaproteobacterial species, Firmicutes species and Streptomyces species. Neither of these families is present in mycobacterial species. This indicates that the P450 families that are commonly found among alphaproteobacterial species and other bacterial species are indeed passed from alphaproteobacterial species and retained by other bacterial species throughout the speciation, suggesting the important role of these P450 families. One interesting observation is that the CYP125 P450 family, a cholesterol and cholest-4-en-3-one hydroxylase [47,48] and potential drug target against tuberculosis-causing bacteria Mycobacterium tuberculosis H37Rv [49], is commonly found in Alphaproteobacteria and mycobacterial species, and Streptomyces species (Figure 5), indicating that a cholesterol hydroxylation capability already existed in ancient bacteria as described elsewhere [50] and that these bacterial species retained this ability to survive better in the host environment, as mentioned elsewhere [39].
Figure 5

Comparative analysis of P450 families among different bacterial species. The number of P450 families indicated with a number (bold case) and P450 families that are common between alphaproteobacterial species and other bacterial species were listed in the figure.

A point to be noted is that only one alphaproteobacterial species, namely Phenylobacterium zucineum, was found to have CYP125 P450 among 599 species used in the study, indicating two scenarios, i.e., loss of this P450 in other alphaproteobacterial species or gain by P. zucineum. This phenomenon needs further investigation. However, the presence of CYP125 P450 indicates cholesterol side-chain degradation and its subsequent utilization as carbon source via the beta-oxidation pathway [51] in P. zucineum. In order to identify the cholesterol side-chain oxidizing enzymes in P. zucineum, if any, we used M. tuberculosis H37Rv cholesterol side-chain oxidizing enzymes as listed in the literature [51]. Based on the homology percentage and annotation of enzymes at KEGG, we identify all possible cholesterol side-chain oxidizing enzymes (31 enzymes) in P. zucineum, indicating that this alphaproteobacterial species is indeed capable of oxidizing the cholesterol side-chain, where the CYP125 reaction will be critical in removing the side-chain from cholesterol (Table 5). As shown in Table 5, two homologs (one named fadD) were found for fadD36 (Rv1193) acyl-CoA synthetase. Our prediction on cholesterol side-chain degradation by P. zucineum is solely based on homolog proteins’ functions, including CYP125, and further validation by experimentation is required.
Table 5

Identification of homolog proteins involved in cholesterol side-chain oxidation in Phenylobacterium zucineum. Mycobacterium tuberculosis H37Rv cholesterol side-chain degrading proteins from the published literature [51] were used to identify homolog proteins in P. zucineum. Gene and protein IDs and enzyme annotations were from KEGG [52].

Proteins Involved in Cholesterol Side-Chain Oxidation in Mycobacterium tuberculosis H37RvHomolog Proteins in Phenylobacterium zucineum
Gene NameGene IDEnzymeProtein ID% Identity% HomologyEnzyme
fadD36 Rv1193 Acyl-CoA synthetasePHZ_c1345 (fadD)2941Long-chain acyl-CoA synthetase
pzu:PHZ_c11553852Malonyl-CoA/methylmalonyl-CoA synthetase
fadD19 Rv3515c Probable fatty-acid-CoA ligasepzu:PHZ_c20653957Fatty-acid-CoA ligase
fadD3 Rv3561 Acyl-CoA synthetase (AMP forming)pzu:PHZ_c19093547Long-chain-fatty-acid—CoA ligase
fadD17 Rv3506 Possible fatty-acid-CoA ligasepzu:PHZ_c05973046Fatty-acyl-CoA synthase
fadD19 Rv3515c Probable fatty-acid-CoA ligasepzu:PHZ_c20653957Long-chain fatty acid:CoA ligase
fadD10 Rv0099 Fatty acid-CoA synthasepzu:PHZ_c25122943Long-chain-fatty-acid--CoA ligase
fadD9 Rv2590 Fatty acid-CoA synthasepzu:PHZ_c21232538Long-chain acyl-CoA synthetase
fadD18 Rv3513c Possible fatty-acid-CoA ligasepzu:PHZ_c20655573CoA-synthetase, long-chain fatty acid:CoA ligase
fadE22 Rv3061c Acyl-CoA dehydrogenasepzu:PHZ_c26784054Dehydrogenase family protein
fadE23 Rv3140 Acyl-CoA dehydrogenasepzu:PHZ_c08903351Acyl-CoA dehydrogenase
fadE24 Rv3139 Acyl-CoA dehydrogenasepzu:PHZ_c23653248Acyl-CoA dehydrogenase
fadE25 Rv3274c Acyl-CoA dehydrogenase FADE25pzu:PHZ_c16804060Acyl-CoA dehydrogenase
fadE26 Rv3504 Probable acyl-CoA dehydrogenasepzu:PHZ_c23363350Acyl-CoA dehydrogenase
fadE27 Rv3505 Probable acyl-CoA dehydrogenasepzu:PHZ_c24062845Isovaleryl CoA dehydrogenase
fadE28 Rv3544c Probable acyl-CoA dehydrogenasepzu:PHZ_c24982643Acyl-CoA dehydrogenase
fadE29 Rv3543c Probable acyl-CoA dehydrogenasepzu:PHZ_c25413251Acyl-CoA dehydrogenase
fadE30 Rv3560c Probable acyl-CoA dehydrogenasepzu:PHZ_c25414157Acyl-CoA dehydrogenase
fadE31 Rv3562 Probable acyl-CoA dehydrogenasepzu:PHZ_c26783653Dehydrogenase family protein
fadE32 Rv3563 Probable acyl-CoA dehydrogenasepzu:PHZ_c24983245Acyl-CoA dehydrogenase
fadE33 Rv3564 Probable acyl-CoA dehydrogenasepzu:PHZ_c26793348Acyl-CoA dehydrogenase family protein
fadE34 Rv3573c Probable acyl-CoA dehydrogenasepzu:PHZ_c26783856Dehydrogenase family protein
fadE5 Rv0244c Acyl-CoA dehydrogenasepzu:PHZ_c33543651Acyl-CoA dehydrogenase
mbtN (fadE14) Rv1346 Acyl-CoA dehydrogenasepzu:PHZ_c03883048Acyl-CoA dehydrogenase
echA9 Rv1071c 3-Hydroxyisobutyryl-CoA hydrolasepzu:PHZ_c16793957Enoyl-CoA hydratase/isomerase family protein
echA19 Rv3516 Possible enoyl-CoA hydratasepzu:PHZ_c25353854Enoyl-CoA hydratase/carnithine racemase
echA20 Rv3550 Possible enoyl-CoA hydratasepzu:PHZ_c34293351Enoyl-CoA hydratase/isomerase family protein
fadB2 Rv0468 hydroxybutyryl-CoA dehydrogenasepzu:PHZ_c315243603-Hydroxyacyl-CoA dehydrogenase
fadB3 Rv1715 Hydroxybutyryl-CoA dehydrogenasepzu:PHZ_c315235493-Hydroxyacyl-CoA dehydrogenase
fadA5 Rv3546 Acetoacetyl-CoA thiolasepzu:PHZ_c25044060Acetyl-CoA C-acetyltransferase
hsd4A Rv3502c 17β-Hydroxysteroid dehydrogenase (17β-HSD)pzu:PHZ_c20083649Short-chain dehydrogenase/reductase SDR
ltp2 Rv3540c Probable ketoacyl-CoA thiolasepzu:PHZ_c32453649Acetyl-CoA C-acetyltransferase

2.5. Functional Analysis of Alphaproteobacterial P450s

A literature survey on functional analysis of alphaproteobacterial P450s revealed that several P450s are functionally characterized from these species and are involved in oxidation of xenobiotic compounds (Table 6). These study results revealed that 21 P450s were found to be involved in synthesis, so secondary metabolites were CYP1101A27 from C. indicus, involved in the production of ectoine (Table 4). However, the physiological functional relevance of P450s in these species needs to be investigated.
Table 6

Functional analysis of alphaproteobacterial P450s.

P450FunctionReference
CYP101B1α-ionone, β-damascone, phenylcyclohexane and p-cymene hydroxylation[53]
CYP101C1Ionone derivative hydroxylation[54]
CYP101D1Terpenoid (camphor) hydroxylase[55,56]
CYP101D2Camphor 5-exo hydroxylase[57]
CYP108D1Aromatic hydrocarbon hydroxylase[55,58]
CYP153C1, CYP153D1Alkane hydroxylase[59]
CYP111A2Oxidizes linalool to 8-hydroxylinalool[55]
CYP111B1β-Ionone hydroxylation[60]
CYP112A2Oxidation of Rapamycin[61]
CYP127A3Hydroxylation of testosterone[62]
CYP195A2Degradation of 4-fluoro-, 4-chloro- and 4-methylsalicylic acid, and 3-chloro- and 3-methylsalicylic acid[63]
CYP199A1Hydroxylation of 2-naphthoic acid[64]
CYP199A2Hydroxylation of 2-naphthoic acid, para-substituted benzoic acids and involved in the degradation of ligninolic compounds[64,65]
CYP199A4Catalyzes heteroatom dealkylations, sulfoxidation, and amide and cyclic hemiacetal formation[66]
CYP200A1Hydroxylation of testosterone[62]
CYP201A2Involved in the biodegradation of tributyl phosphate[67]

3. Materials and Methods

3.1. Species and Database

In this study, 599 alphaproteobacterial species’ genomes that are available for public use at Kyoto Encyclopedia of Genes and Genomes (KEGG) database [52] were used (Table S1). Detailed information on species, species codes, genera and GenBank accession codes is presented in Table S1.

3.2. Genome Data Mining and Annotation of P450s

P450 mining in alphaproteobacterial species was carried out using the methods recently described by our laboratory [24,41]. Briefly, the complete proteome of alphaproteobacterial species was downloaded from KEGG and subjected to the NCBI Batch Web CD-Search Tool [68]. Proteins that belong to the P450 superfamily were selected and annotated, as per the International P450 Nomenclature Committee rule, i.e., proteins with >40% identity are grouped under the same family and proteins with >55% identity are grouped under the same subfamily [21,22,44]. Proteins with <40% identity with named P450s are assigned to a new P450 family.

3.3. Phylogenetic Analysis of P450s

Phylogenetic analysis of P450s was carried out following the procedure described recently by our laboratory [24,41]. The phylogenetic tree of P450s was constructed using alphaproteobacterial species P450 protein sequences. Firstly, the MAFFT v6.864 [69] was used to align the protein sequences that are part of the Trex web server [70]. The alignments were then be subjected to interpret the best tree by the Trex web server [70]. Lastly, a web-based tool, iTol, was used to create, visualize and color the tree [71].

3.4. Generation of P450 Profile Heat Maps

P450 profile heat maps were generated following the procedure described recently by our laboratory [24,41]. The heat map was generated using the P450 family data to show the presence or absence of P450s in alphaproteobacterial species. The data were represented as (−3) for family absence (green) and (3) for family presence (red). A tab-delimited file was imported into Mev (Multi-experiment viewer) [72]. Hierarchical clustering using a Euclidean distance metric was used to cluster the data. alphaproteobacterial species with P450s in their genome formed the vertical axis and P450 families formed the horizontal axis.

3.5. Secondary Metabolite BGC Analysis and Identification of P450s That Are Part of BGCs

Secondary metabolite BGC analysis and identification of P450s that are part of BGCs in alphaproteobacterial species was carried out following the procedure described recently by our laboratory [24,41]. Briefly, alphaproteobacterial species’ individual genome ID (Table S1) was submitted to anti-SMASH [46] for identification of secondary metabolite BGCs. Results were downloaded both in the form of Excel spreadsheets representing species-wise cluster information and gene cluster sequences in a Word file. P450s that are part of a specific gene cluster were identified by manually going through the BGCs sequence. Standard gene cluster abbreviation terminology present at the anti-SMASH database [46] was sustained in this study.

3.6. Identification of Cholesterol Side-Chain Oxidizing Genes/Proteins

Thirty-one cholesterol side-chain oxidizing genes/proteins from M. tuberculosis H37Rv [51] were used in the study to identify homologs in P. zucineum. Reference proteins were blasted individually against the P. zucineum genome and based on the percentage identity, homology, coverage and annotation at KEGG, the homolog proteins were identified in P. zucineum. A point to be noted is that M. tuberculosis and P. zucineum are distantly related and thus the lowest percentage identity between these two organisms’ proteins is expected, as previously observed [50].

3.7. Data Analysis

All calculations were carried out following the procedure described previously [24,41,73]. The average number of P450s was calculated using the formula: Average number of P450s = Number of P450s/Number of species. The P450 diversity percentage was calculated using the formula: 100 × Total number of P450 families/(Total number of P450s × Number of species with P450s). The percentage of P450s that formed part of BGCs was calculated using the formula: Percentage of P450s part of BGCs = 100 × Number of P450s part of BGCs/Total number of P450s present in species.

4. Conclusions

Cytochrome P450 monooxygenases (CYPs/P450s) have been well-known proteins for the last six decades. The stereo- and regio-specific oxidation of a variety of compounds by these enzymes led to their applications in quite large areas of biological research. Understanding the evolution of P450s in the bacterial population is now gaining momentum owing to the availability of a large number of bacterial genomes. This study is the first of its kind on the analysis of P450s in an ancient group of bacteria belonging to the class Alphaproteobacteria. Comparative analysis of P450s between different bacterial species revealed that during speciation a large number of P450s became part of secondary metabolite gene clusters (as observed in Streptomyces species and mycobacterial species) and some P450s were passed all the way from Alphaproteobacteria to other bacterial species. This study also reports the first identification of CYP125 P450 in alphaproteobacterial species and predicts that P. zucineum is capable of utilizing the cholesterol side-chain as carbon source. Future study should include thorough profiling of cholesterol-degrading genes/proteins and experimental validation of cholesterol degrading ability. Furthermore, comparative analysis of alphaproteobacterial species and more bacterial species should be carried out to deduce the evolution pattern of P450 families and to identify the loss/gain of new P450 families compared to Alphaproteobacteria with respect to habitat or geographical distribution. The results of this study will serve as reference for future genome data mining and annotation of P450s in species of Alphaproteobacteria.
  66 in total

1.  The structure of CYP101D2 unveils a potential path for substrate entry into the active site.

Authors:  Wen Yang; Stephen G Bell; Hui Wang; Weihong Zhou; Mark Bartlam; Luet-Lok Wong; Zihe Rao
Journal:  Biochem J       Date:  2011-01-01       Impact factor: 3.857

2.  Hydroxylation of testosterone by bacterial cytochromes P450 using the Escherichia coli expression system.

Authors:  Hitosi Agematu; Naoki Matsumoto; Yoshikazu Fujii; Hiroki Kabumoto; Satoru Doi; Kazuhiro Machida; Jun Ishikawa; Akira Arisawa
Journal:  Biosci Biotechnol Biochem       Date:  2006-01       Impact factor: 2.043

3.  Discovery of 2-naphthoic acid monooxygenases by genome mining and their use as biocatalysts.

Authors:  Toshiki Furuya; Kuniki Kino
Journal:  ChemSusChem       Date:  2009-06-27       Impact factor: 8.928

4.  Epiphytic pink-pigmented methylotrophic bacteria enhance germination and seedling growth of wheat (Triticum aestivum) by producing phytohormone.

Authors:  Kamlesh K Meena; Manish Kumar; Marina G Kalyuzhnaya; Mahesh S Yandigeri; Dhananjaya P Singh; Anil K Saxena; Dilip K Arora
Journal:  Antonie Van Leeuwenhoek       Date:  2011-12-27       Impact factor: 2.271

Review 5.  Natural Compounds as Pharmaceuticals: The Key Role of Cytochromes P450 Reactivity.

Authors:  Giovanna Di Nardo; Gianfranco Gilardi
Journal:  Trends Biochem Sci       Date:  2020-04-05       Impact factor: 13.807

6.  Mycobacterium tuberculosis CYP125A1, a steroid C27 monooxygenase that detoxifies intracellularly generated cholest-4-en-3-one.

Authors:  Hugues Ouellet; Shenheng Guan; Jonathan B Johnston; Eric D Chow; Petrea M Kells; Alma L Burlingame; Jeffery S Cox; Larissa M Podust; Paul R Ortiz de Montellano
Journal:  Mol Microbiol       Date:  2010-06-10       Impact factor: 3.501

Review 7.  Protein signatures distinctive of alpha proteobacteria and its subgroups and a model for alpha-proteobacterial evolution.

Authors:  Radhey S Gupta
Journal:  Crit Rev Microbiol       Date:  2005       Impact factor: 7.624

8.  Substrate specificity of 6-deoxyerythronolide B hydroxylase, a bacterial cytochrome P450 of erythromycin A biosynthesis.

Authors:  J F Andersen; K Tatsuta; H Gunji; T Ishiyama; C R Hutchinson
Journal:  Biochemistry       Date:  1993-03-02       Impact factor: 3.162

9.  Description of a sulfitobacter strain and its extracellular cyclodipeptides.

Authors:  Cong Long; Xiao-Ling Lu; Yun Gao; Bing-Hua Jiao; Xiao-Yu Liu
Journal:  Evid Based Complement Alternat Med       Date:  2011-07-11       Impact factor: 2.629

10.  Impact of lifestyle on cytochrome P450 monooxygenase repertoire is clearly evident in the bacterial phylum Firmicutes.

Authors:  Tiara Padayachee; Nomfundo Nzuza; Wanping Chen; David R Nelson; Khajamohiddin Syed
Journal:  Sci Rep       Date:  2020-08-19       Impact factor: 4.379

View more
  4 in total

1.  An Unprecedented Number of Cytochrome P450s Are Involved in Secondary Metabolism in Salinispora Species.

Authors:  Nsikelelo Allison Malinga; Nomfundo Nzuza; Tiara Padayachee; Puleng Rosinah Syed; Rajshekhar Karpoormath; Dominik Gront; David R Nelson; Khajamohiddin Syed
Journal:  Microorganisms       Date:  2022-04-21

2.  Contrasting Health Effects of Bacteroidetes and Firmicutes Lies in Their Genomes: Analysis of P450s, Ferredoxins, and Secondary Metabolite Clusters.

Authors:  Bridget Valeria Zinhle Nkosi; Tiara Padayachee; Dominik Gront; David R Nelson; Khajamohiddin Syed
Journal:  Int J Mol Sci       Date:  2022-05-02       Impact factor: 6.208

3.  Lifestyles Shape the Cytochrome P450 Repertoire of the Bacterial Phylum Proteobacteria.

Authors:  Siphesihle Msweli; Andiswa Chonco; Lihle Msweli; Puleng Rosinah Syed; Rajshekhar Karpoormath; Wanping Chen; Dominik Gront; Bridget Valeria Zinhle Nkosi; David R Nelson; Khajamohiddin Syed
Journal:  Int J Mol Sci       Date:  2022-05-22       Impact factor: 6.208

4.  Diversification of Ferredoxins across Living Organisms.

Authors:  Nomfundo Nzuza; Tiara Padayachee; Wanping Chen; Dominik Gront; David R Nelson; Khajamohiddin Syed
Journal:  Curr Issues Mol Biol       Date:  2021-09-30       Impact factor: 2.976

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.