Literature DB >> 29784821

Characterization of gossypol biosynthetic pathway.

Xiu Tian1,2, Ju-Xin Ruan1, Jin-Quan Huang1, Chang-Qing Yang1, Xin Fang1, Zhi-Wen Chen1, Hui Hong1, Ling-Jian Wang1, Ying-Bo Mao1, Shan Lu2, Tian-Zhen Zhang3, Xiao-Ya Chen4,5.   

Abstract

Gossypol and related sesquiterpene aldehydes in cotton function as defense compounds but are antinutritional in cottonseed products. By transcriptome comparison and coexpression analyses, we identified 146 candidates linked to gossypol biosynthesis. Analysis of metabolites accumulated in plants subjected to virus-induced gene silencing (VIGS) led to the identification of four enzymes and their supposed substrates. In vitro enzymatic assay and reconstitution in tobacco leaves elucidated a series of oxidative reactions of the gossypol biosynthesis pathway. The four functionally characterized enzymes, together with (+)-δ-cadinene synthase and the P450 involved in 7-hydroxy-(+)-δ-cadinene formation, convert farnesyl diphosphate (FPP) to hemigossypol, with two gaps left that each involves aromatization. Of six intermediates identified from the VIGS-treated leaves, 8-hydroxy-7-keto-δ-cadinene exerted a deleterious effect in dampening plant disease resistance if accumulated. Notably, CYP71BE79, the enzyme responsible for converting this phytotoxic intermediate, exhibited the highest catalytic activity among the five enzymes of the pathway assayed. In addition, despite their dispersed distribution in the cotton genome, all of the enzyme genes identified show a tight correlation of expression. Our data suggest that the enzymatic steps in the gossypol pathway are highly coordinated to ensure efficient substrate conversion.
Copyright © 2018 the Author(s). Published by PNAS.

Entities:  

Keywords:  P450; cotton; gossypol biosynthesis; secondary metabolism; sesquiterpene

Mesh:

Substances:

Year:  2018        PMID: 29784821      PMCID: PMC6003316          DOI: 10.1073/pnas.1805085115

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Humans have domesticated wild plants to develop them as a safe food source. Most plants produce specialized (secondary) metabolites that confer resistance to pathogens (1) and herbivores (2) (including insects and mammals). In addition to their toxicity, specialized metabolites possess undesirable antinutritional properties that have been reduced or removed from human and domestic-animal foods during domestication. For example, potato (Solanum tuberosum) (3) and tomato (S. lycopersicum) (4, 5) have been bred for low levels of toxic steroidal glycoalkaloids, and cucumber (Cucumis sativus) cultivars contain low levels of bitter cucurbitacins (6, 7). In the case of cotton species that have been cultivated mainly for spinnable fiber to produce clothing, their specialized metabolites may not have been under the negative selection pressure in the course of domestication, compared with food crops. Plants of cotton synthesize a group of cadinene-type sesquiterpene aldehydes as defense compounds (phytoalexins), represented by gossypol (8–10). Cottonseeds are valuable since they are good sources of protein (∼23%) and oil (∼21%). Cottonseed meal is widely used as animal feed, and cotton oil is still the major cooking oil in some developing countries, such as Pakistan (11, 12). As a result, high gossypol content in cottonseeds poses a health concern (13) for both domestic-animal and human uses. Elucidation of the gossypol biosynthetic pathway started decades ago. Early 14C tracing experiments proved that (+)-δ-cadinene is a precursor to all cadinene-type sesquiterpenoids in cotton, including both 7- and 8-hydroxylated derivatives (14, 15). Sesquiterpene synthases convert farnesyl diphosphate (FPP) into differently structured products. The (+)-δ-cadinene synthase (CDN) activity in cotton (15, 16) and the cDNAs encoding two subfamilies of CDNs (CDNA and CDNC) were then reported (17, 18). Later, a cytochrome P450 monooxygenase (CYP706B1) was demonstrated to catalyze the hydroxylation of (+)-δ-cadinene, presumably at the 8- position (19). In addition, a desoxyhemigossypol methyltransferase was characterized (20). Gossypol is formed through dimerization of hemigossypol (21–23). Comparison of (+)-δ-cadinene and hemigossypol structures suggests several hydroxylation, desaturation, and cyclic ether formation steps in the pathway. However, until now, neither the enzymes nor the reactions downstream of (+)-δ-cadinene have been characterized, except a tentative identification of CYP706B1, and even the biosynthetic intermediates remain largely unknown. All cotton species bear the lysigenous glands located in the subepidermal layer of aerial organs, in which sesquiterpene aldehydes (such as gossypol and hemigossypolone) are stored. There are also glandless cultivars which do not produce these phytoalexins in aerial parts (17, 24, 25) (Fig. 1 ). Recently, the gene responsible for gland formation, GoPGF, was cloned, which encodes a basic helix–loop–helix transcription factor (25). By transcriptome-based comparison of the glandular and the glandless cultivars and coexpression analyses, in combination with virus-induced gene silencing (VIGS) and partial reconstitutions of the pathway in heterologous system, we isolated four enzymes and identified five steps of the pathway, covering the first four consecutive steps and most of the hydroxylation reactions of gossypol biosynthesis.
Fig. 1.

Transcriptomics-based mining of gossypol pathway genes. (A) View of the seed and leaf of glandular (G) and glandless (GL) cultivars of G. hirsutum. (B) Structure of (+)-δ-cadinene, gossypol, and hemigossypolone. (C and D) Venn diagram (C) showing the numbers of genes identified by correlation analysis using CDNC as a bait (correlation ≥0.5) and by differential analysis (down-regulated in glandless cotton leaf). In total, 146 genes were retrieved by both methods, and their numbers in each category are shown in D. (E) Global heatmap of transcript abundances of indicated genes in different organs or in ovule (seed) at different stages. In the expression cluster, DH1, CYP82D113, CYP71BE79, and 2-ODD-1 are most correlated to the reported gossypol pathway genes CDNC and CYP706B1. The heatmap was drawn by the R pheatmap package. Hours (h) postgermination and days postanthesis (dpa) are indicated.

Transcriptomics-based mining of gossypol pathway genes. (A) View of the seed and leaf of glandular (G) and glandless (GL) cultivars of G. hirsutum. (B) Structure of (+)-δ-cadinene, gossypol, and hemigossypolone. (C and D) Venn diagram (C) showing the numbers of genes identified by correlation analysis using CDNC as a bait (correlation ≥0.5) and by differential analysis (down-regulated in glandless cotton leaf). In total, 146 genes were retrieved by both methods, and their numbers in each category are shown in D. (E) Global heatmap of transcript abundances of indicated genes in different organs or in ovule (seed) at different stages. In the expression cluster, DH1, CYP82D113, CYP71BE79, and 2-ODD-1 are most correlated to the reported gossypol pathway genes CDNC and CYP706B1. The heatmap was drawn by the R pheatmap package. Hours (h) postgermination and days postanthesis (dpa) are indicated.

Results

Isolation of Gossypol Pathway Genes.

Upland cotton, Gossypium hirsutum, is an allotetraploid species widely cultivated around the world (26). Analyses by HPLC detected a high level of sesquiterpene aldehydes in the leaf, seed (cotyledon), and floral organs of G. hirsutum cv. CCRI12, but not the glandless mutant CCRI12gl (). Although the sesquiterpenes are widely distributed throughout the glandular cotton plant, their level and composition in different organs vary: while gossypol is predominant in seed and root, hemigossypolone is abundant in leaf (). In cotton CDN, a sesquiterpene cyclase and the cytochrome P450 monooxygenase CYP706B1 catalyze the first two steps of gossypol biosynthesis (17, 19). To further characterize the pathway, we adopted an integrative approach combining two-stage transcriptome analyses and VIGS to isolate genes encoding the downstream enzymes. Comparison of the transcript abundances in the leaves of glandular and glandless cotton uncovered 902 genes significantly down-regulated in the latter (Fig. 1). Next, correlation analysis using the correlation value of ≥0.5 grouped 5,912 transcripts with the bait CDNC of the CDN family (Fig. 1). Combination of these two datasets disclosed 146 genes in total that were potentially linked to gossypol biosynthesis, among which 82 encode enzymes, including the previously reported CDNC and CYP706B1, and the mevalonate (MVA) pathway genes (Fig. 1). Subsequent analysis of spatial expression patterns using the R pheatmap package identified seven enzymes that form the most likely gene expression cluster related to gossypol biosynthesis (Fig. 1 and ), of which four have not been investigated before. Real-time quantitative PCR confirmed the RNA-sequencing data: the four enzyme genes were tightly coexpressed with CDNC and CYP706B1, with their transcript levels high in glandular leaves but low or undetectable in glandless leaves (Fig. 2). During development, young ovules (seeds) do not produce gossypol until 20 d postanthesis (), when CDNC and CYP706B1 as well as the four candidate genes were coordinately activated, concomitant with gossypol accumulation (Fig. 2).
Fig. 2.

Relative gene expressions of the enzymes in relation to accumulation of gossypol and hemigossypolone (HGQ). Six enzymes were analyzed, including the previously reported CDNC and CYP706B1, and the four isolated in this study: DH1, CYP82D113, CYP71BE79, and 2-ODD-1. (A) Down-regulation of the genes in leaves of the glandless cotton cultivar CCRI12gl (GL) compared with the glanded cultivar CCRI12 (G) (means ± SD, n = 3). (B) Relative gene expressions in developing ovule (seed) collected at different days postanthesis (dpa) (means ± SD, n = 3). (C and D) Induced gene expression in GL (C) and G (D) cotyledons after treatment with fungal elicitor VdNEP (means ± SD, n = 3). (E) Decreased gene expression level and gossypol/HGQ content in leaves after VIGS of the gene as indicated. Value of the empty tobacco rattle virus (TRV) vector control (CK) was set to 1 (means ± SD, n = 6 independent experiments). See also .

Relative gene expressions of the enzymes in relation to accumulation of gossypol and hemigossypolone (HGQ). Six enzymes were analyzed, including the previously reported CDNC and CYP706B1, and the four isolated in this study: DH1, CYP82D113, CYP71BE79, and 2-ODD-1. (A) Down-regulation of the genes in leaves of the glandless cotton cultivar CCRI12gl (GL) compared with the glanded cultivar CCRI12 (G) (means ± SD, n = 3). (B) Relative gene expressions in developing ovule (seed) collected at different days postanthesis (dpa) (means ± SD, n = 3). (C and D) Induced gene expression in GL (C) and G (D) cotyledons after treatment with fungal elicitor VdNEP (means ± SD, n = 3). (E) Decreased gene expression level and gossypol/HGQ content in leaves after VIGS of the gene as indicated. Value of the empty tobacco rattle virus (TRV) vector control (CK) was set to 1 (means ± SD, n = 6 independent experiments). See also . Previous investigations demonstrated that biosynthesis of sesquiterpene phytoalexins in cotton cells can be induced by the pathogenic fungus Verticillium dahliae (17, 20). HPLC analysis showed that treatment of cotton cotyledons by the V. dahliae elicitor VdNEP (27) led to increased production of gossypol and hemigossypolone, whereas in glandless cotyledons, in which the sesquiterpene aldehydes were undetectable before elicitation, hemigossypolone was induced to accumulate (). Consistently, the six enzyme genes were all up-regulated by elicitation (Fig. 2 ). Selected candidate genes were submitted to VIGS, and silenced genes were then monitored by metabolite analysis of cotton leaves (28). Silencing of CDNC decreased hemigossypolone and gossypol levels by 95.1% and 96.7%, respectively, and silencing of CYP706B1 decreased the sesquiterpene levels by 59.4% and 61.2%, respectively, compared with empty vector controls (Fig. 2). An extended assay showed that silencing of four enzymes, including two cytochromes P450 (CYP82D113 and CYP71BE79), one alcohol dehydrogenase (DH1), and one 2-oxoglutarate/Fe(II)-dependent dioxygenase (2-ODD-1), each reduced the level of gossypol and hemigossypolone by more than 50% (Fig. 2). These data strongly suggested the involvement of the candidate genes in gossypol biosynthesis.

Identification of Biosynthetic Intermediates.

As silencing of CYP706B1 resulted in an accumulation of its substrate (+)-δ-cadinene in cotton leaves (Fig. 3), we further analyzed the leaf extracts of the VIGS-treated plant by GC-MS and LC-MS to explore clues to the enzyme activity. We found that the CYP706B1 product, which has an m/z of 220, accumulated in the VIGS-DH1, but not the control leaves, suggesting that DH1 may be functional in reducing the CYP706B1 product (Fig. 3). Silencing of CYP82D113 led to the accumulation of a compound that has an m/z of 218 (Fig. 3); thus, this P450 may act immediately after DH1.
Fig. 3.

Identification of enzyme genes of gossypol biosynthesis by VIGS. Silencing of the candidate enzyme genes by VIGS led to accumulation of the putative substrates in leaf. (A–C and E) GC-MS profiles of the extracts prepared from the cotton leaves harboring TRV2:CYP706B1 (A), TRV2:DH1 (B), TRV2:CYP82D113 (C), TRV2:2-ODD-1 (E), or empty vector (TRV2:00). The peaks of the substrates, indicated by arrows, are shown (electron ionization in positive-ion mode). Total-ion chromatograms (TIC) and extracted-ion chromatogram (EIC) of the substrate of the enzyme, as indicated, at m/z 204 (A), m/z 220 (B), m/z 218 (C), and m/z 228 (E). (D) LC-MS analysis of the extracts from the cotton leaves harboring TRV2:CYP71BE79 or empty vector (TRV2:00). The peak of the CYP71BE79 substrate [with UV and EIC of the parent ions at m/z 257 [M + Na]+ on positive mode] is shown.

Identification of enzyme genes of gossypol biosynthesis by VIGS. Silencing of the candidate enzyme genes by VIGS led to accumulation of the putative substrates in leaf. (A–C and E) GC-MS profiles of the extracts prepared from the cotton leaves harboring TRV2:CYP706B1 (A), TRV2:DH1 (B), TRV2:CYP82D113 (C), TRV2:2-ODD-1 (E), or empty vector (TRV2:00). The peaks of the substrates, indicated by arrows, are shown (electron ionization in positive-ion mode). Total-ion chromatograms (TIC) and extracted-ion chromatogram (EIC) of the substrate of the enzyme, as indicated, at m/z 204 (A), m/z 220 (B), m/z 218 (C), and m/z 228 (E). (D) LC-MS analysis of the extracts from the cotton leaves harboring TRV2:CYP71BE79 or empty vector (TRV2:00). The peak of the CYP71BE79 substrate [with UV and EIC of the parent ions at m/z 257 [M + Na]+ on positive mode] is shown. By LC-MS, we found that a peak with m/z (+) 257 [M + Na]+ appeared in the extract of the CYP71BE79-silenced leaves, which could be the substrate of CYP71BE79 (Fig. 3). In addition, GC-MS identified that silencing of 2-ODD-1 resulted in accumulation of an upstream intermediate with an m/z of 228 (Fig. 3). We also noted that the VIGS-CYP71BE79 plants grown in the greenhouse frequently developed disease phenotypes (brown sunken lesions covering the hypocotyl–root junction) (), similar to the symptoms caused by the soilborne necrotrophic fungus Rhizoctonia solani (29), whereas the control and other VIGS-treated plants did not. As PGF silencing blocked the whole gossypol biosynthesis pathway (25), the decreased amount of sesquiterpene phytoalexins in VIGS-CYP71BE79 plants was unlikely responsible for the enhanced susceptibility. Determination by LC-MS revealed that the substrate of CYP71BE79 accumulated in the hypocotyl–root junction after the gene silencing ().

Functional Characterization of Enzymes.

To obtain intermediate standards for structure elucidation and to perform enzyme assays in vitro, we expressed the three cytochromes P450 in Saccharomyces cerevisiae and other enzymes in Escherichia coli. As determined by GC-MS, incubation of the starting substrate FPP with CDNC produced (+)-δ-cadinene, and further reaction with CYP706B1 gave rise to a hydroxylated product (Fig. 4) that was previously proposed to be 8-hydroxy-(+)-δ-cadinene (19). Subsequent incubation revealed that DH1 converted the CYP706B1 product into a compound of Mr 218 (Fig. 4), suggesting a dehydrogenation reaction. NMR spectroscopy detected a ketonic group at the C-7 position; thus, the product is 7-keto-δ-cadinene (Fig. 4).
Fig. 4.

Functional characterization of enzymes by in vitro assays and determination of the products. (+)-δ-Cadinene, 7-hydroxy-(+)-δ-cadinene, and 7-keto-δ-cadinene were detected by GC-MS, and metabolite profiles were monitored as total-ion chromatograms (TIC), whereas 8-hydroxy-7-keto-δ-cadinene, 8,11-dihydroxy-7-keto-δ-cadinene, furocalamen-2-one, and 3-hydroxy-furocalamen-2-one were detected by LC-MS with UV, as indicated. The sample without the relevant protein served as negative control. Structures of all compounds, except (+)-δ-cadinene, were further determined by MS/MS and NMR spectroscopy (). The purified recombinant proteins of DH1 and 2-ODD-1 expressed in E. coli and the microsomes of yeast cells expressing the respective cytochromes P450 were assayed.

Functional characterization of enzymes by in vitro assays and determination of the products. (+)-δ-Cadinene, 7-hydroxy-(+)-δ-cadinene, and 7-keto-δ-cadinene were detected by GC-MS, and metabolite profiles were monitored as total-ion chromatograms (TIC), whereas 8-hydroxy-7-keto-δ-cadinene, 8,11-dihydroxy-7-keto-δ-cadinene, furocalamen-2-one, and 3-hydroxy-furocalamen-2-one were detected by LC-MS with UV, as indicated. The sample without the relevant protein served as negative control. Structures of all compounds, except (+)-δ-cadinene, were further determined by MS/MS and NMR spectroscopy (). The purified recombinant proteins of DH1 and 2-ODD-1 expressed in E. coli and the microsomes of yeast cells expressing the respective cytochromes P450 were assayed. Formation of 7-keto-δ-cadinene cast doubt on the previous identification of the CYP706B1 product as 8-hydroxy-(+)-δ-cadinene based on 1H-NMR spectroscopy (19). Indeed, both 13C NMR and heteronuclear multiple-bond correlation spectra revealed the compound as 7-hydroxy-(+)-δ-cadinene (). Thus, CYP706B1 is reassigned as (+)-δ-cadinene-7-hydroxylase, and DH1 is 7-keto-δ-cadinene synthase (Fig. 4). The compound 7-keto-δ-cadinene was first identified from G. hirsutum plants engineered to express an RNAi construct targeting CYP82D109, which was named (4aR, 5S)-δ-cadinen-2-one (24), but the activity of CYP82D109 has remained unknown. CYP82D113 is 92% identical to CYP82D109. To determine the enzyme activity of CYP82D113, yeast microsomes enriched with CYP82D113 were incubated with 7-keto-δ-cadinene. LC-MS identified an expected peak of the product having an m/z of (+) 257. MS and NMR analyses indicated that, in the presence of NADPH, CYP82D113 transferred a hydroxyl group to C-8 of 7-keto-δ-cadinene, generating 8-hydroxy-7-keto-δ-cadinene (Fig. 4 and ). The CYP82D113 product has an MS spectrum identical to that of the proposed substrate of CYP71BE79 (Fig. 3). To test whether CYP71BE79 is involved in further decoration of the (+)-δ-cadinene backbone, we incubated it with 8-hydroxy-7-keto-δ-cadinene, which was then efficiently converted into a product with an m/z of (+) 273 [M + Na]+ (Fig. 4). NMR analysis identified that CYP71BE79 transferred a new hydroxyl group to C-11 to form 8,11-dihydroxy-7-keto-δ-cadinene (). Lastly, the metabolite accumulated in the 2-ODD-1–silenced leaves (Fig. 3) was identified to be furocalamen-2-one (). As expected, incubation with 2-ODD-1 converted it to a new compound, 3-hydroxy-furocalamen-2-one (Fig. 4 and ). We next measured the kinetic parameters of the five enzymes (Table 1). Notably, CYP71BE79 exhibited a much higher maximum activity (Vmax) than other enzymes tested, including two upstream cytochromes P450 (CYP706B1 and CYP82D113), and its catalytic efficiency (Vmax/Km) was also clearly higher. To test substrate specificity, the five enzymes were assayed with available intermediates possessing similar structures. Most enzymes showed little activity toward alternative substrates under identical assay conditions (). However, in addition to 7-hydroxy-(+)-δ-cadinene, DH1 also accepted 8-hydroxy-7-keto-δ-cadinene and 8,11-dihydroxy-7-keto-δ-cadinene as substrates, although with lower efficiency (). Thus, DH1 is, to some extent, promiscuous in dehydrogenation of the hydroxyl group-containing metabolites.
Table 1.

Kinetic analyses of the enzymes determined in vitro

EnzymeSubstrateKm, μMVmax, nmol·min−1·mg−1
CYP706B1(+)-δ-Cadinene7.57 ± 1.1431.26 ± 1.56
DH17-Hydroxy-(+)-δ-cadinene0.48 ± 0.0410.42 ± 0.21
CYP82D1137-Keto-δ-cadinene1.02 ± 0.1322.00 ± 0.73
CYP71BE798-Hydroxy-7-keto-δ-cadinene9.67 ± 1.34304.90 ± 10.88
2-ODD-1Furocalamen-2-one1.81 ± 0.2149.54 ± 1.11

Each dataset represents means ± SD (n = 3 independent experiments).

Kinetic analyses of the enzymes determined in vitro Each dataset represents means ± SD (n = 3 independent experiments).

Partial Reconstitution of Gossypol Pathway in Tobacco Leaf.

Along with in vitro assays of enzyme activities, we utilized the Agrobacterium-mediated transient expression system to reconstitute the gossypol pathway reactions in Nicotiana benthamiana leaves. The 35S promoter was used to express each of the six enzymes, including an FPP synthase (AtFPS2) from Arabidopsis thaliana (AT4G17190), as well as CDNC, CYP706B1, DH1, CYP82D113, and CYP71BE79 from cotton, which catalyze the six consecutive steps of gossypol biosynthesis starting from isopentenyl diphosphate/dimethylallyl diphosphate. Four metabolic intermediates, (+)-δ-cadinene, 7-hydroxy-(+)-δ-cadinene, 7-keto-δ-cadinene, and 8-hydroxy-7-keto-δ-cadinene, were detected in the leaves expressing the respective enzymes (). Following CYP71BE79 expression with the upstream enzymes, a glycosylated product, rather than 8,11-dihydroxy-7-keto-δ-cadinene itself, was formed (). Together, data from VIGS and in vitro and tobacco leaf transient expression assays suggest that CYP706B1, DH1, CYP82D113, and CYP71BE79 catalyze four consecutive oxidative reactions on (+)-δ-cadinene, and 2-ODD-1 is responsible for a later hydroxylation step in the biosynthetic pathway leading to sesquiterpene aldehydes (Fig. 5).
Fig. 5.

Genes of gossypol pathway enzymes and their expressions. Genes of the enzymes catalyzing the defined steps in MVA and gossypol pathways and their homologs are shown. The expressions are indicated by heatmap, estimated using Cuffdiff by computing the FPKM value (fragments per kilobase of transcript per million reads sequenced) for each transcript. Genes encoding the identified enzymes or showing an expression pattern correlated to gossypol biosynthesis are on the TOP. Dashed arrows indicate unidentified reaction(s). 0Ov, 0-dpa ovule; 25Ov, 25-dpa ovule; 0Sd, 0-h postgermination seed; 5Sd, 5-h postgermination seed; ACAT, acyl CoA-cholesterol acyltransferase; Ca, calyx; DMAPP, dimethylallyl diphosphate; FPS, FPP synthase; HMGR, HMG-CoA reductase; HMGS, 3-hydroxy-3-methylglutaryl-coenzyme-A (HMG-CoA) synthase; IPP, isopentenyl diphosphate; IPPI, IPP isomerase; Lf, leaf; MVK, mevalonate kinase; MVP, phosphomevalonate kinase; Pe, petal; Pi, pistil; PMD, diphosphomevalonate decarboxylase; Rt, root; Sm, stamen; St, stem; To, torus. Distributions of the genes in G. hirsutum genome are indicated by their accession numbers and also shown in the genome atlas ().

Genes of gossypol pathway enzymes and their expressions. Genes of the enzymes catalyzing the defined steps in MVA and gossypol pathways and their homologs are shown. The expressions are indicated by heatmap, estimated using Cuffdiff by computing the FPKM value (fragments per kilobase of transcript per million reads sequenced) for each transcript. Genes encoding the identified enzymes or showing an expression pattern correlated to gossypol biosynthesis are on the TOP. Dashed arrows indicate unidentified reaction(s). 0Ov, 0-dpa ovule; 25Ov, 25-dpa ovule; 0Sd, 0-h postgermination seed; 5Sd, 5-h postgermination seed; ACAT, acyl CoA-cholesterol acyltransferase; Ca, calyx; DMAPP, dimethylallyl diphosphate; FPS, FPP synthase; HMGR, HMG-CoA reductase; HMGS, 3-hydroxy-3-methylglutaryl-coenzyme-A (HMG-CoA) synthase; IPP, isopentenyl diphosphate; IPPI, IPP isomerase; Lf, leaf; MVK, mevalonate kinase; MVP, phosphomevalonate kinase; Pe, petal; Pi, pistil; PMD, diphosphomevalonate decarboxylase; Rt, root; Sm, stamen; St, stem; To, torus. Distributions of the genes in G. hirsutum genome are indicated by their accession numbers and also shown in the genome atlas ().

Gossypol Pathway Genes Are Dispersed in the Cotton Genome.

Several examples exist where genes encoding biosynthetic pathway enzymes of specialized metabolites, including terpenoids and alkaloids, tend to be clustered together in the plant genome (3, 6, 30, 31). In cotton, however, the gossypol pathway genes are dispersed among different chromosomes (Fig. 5 and ). On the other hand, the gene families of the gossypol as well as the core MVA pathways are often extensively expanded with tandem duplications (Fig. 5 and ). Most of the gossypol pathway enzymes identified, including CDN, DH1, CYP82D113, and 2-ODD-1, appear to have arisen from local duplications in the cotton genome. For example, in the allotetraploid genome of G. hirsutum, there are 11 genes encoding the alcohol dehydrogenase DH1 and homologs, all of which are tandemly arranged, with four genes (Gh_A01G1736, Gh_A01G1737, Gh_A01G1739, and Gh_A01G1740) on chromosome A1 (chromosome 1 of A subgenome) and seven (Gh_D01G1983 to Gh_D01G1989) on chromosome D1 (Fig. 5 and ). Among the five enzymes catalyzing oxidative steps in the gossypol biosynthetic pathway, three are cytochromes P450 of different families. Members of CYP71 and CYP82 families are commonly involved in biosynthesis of specialized metabolites such as noscapine (32), podophyllotoxin (33), and artemisinin (34). As cotton CYP71BE79 is distinct in its high activity (Table 1), we analyzed it further. Using CYP71BE79 as query, we performed a bioinformatic blast search of CYP71 family proteins from publicly available genomes of nine plant species, including three species from the family Malvaceae: G. hirsutum, Durio zibethinus, and Theobroma cacao. In total, 312 CYP71 proteins were retrieved (). We found that the CYP71BE proteins form a Malvaceae-specific subfamily (green in Fig. 6), which contained 37 members clustered into five clades. Clade II was composed of six CYP71BEs, including the two CYP71BE79 homologs of G. hirsutum (Gh_A13G1133 and Gh_D13G1407). Notably, CYP71BE genes have been maintained as a truly single copy in diploid genomes or subgenomes (Fig. 6).
Fig. 6.

Maximum-likelihood phylogenetic trees of the CYP71 family. (A) Members of the CYP71 family from nine land plants with sequence identity >40% are included (CYP71BE79 as a seed query). CYP71BE79 is located in the green branch. Plants analyzed are G. hirsutum, D. zibethinus, T. cacao, Aquilegia coerulea, A. thaliana, Oryza sativa subsp. Japonica, Amborella trichopoda, Selaginella moellendorffii, and Physcomitrella patens. The National Center for Biotechnology Information and Phytozome databases (51) were searched. (B) Members of the CYP71BE subfamily from five species of Malvaceae: G. hirsutum, G. raimondii, G. arboreum, T. cacao, and D. zibethinus. CYP71BEs are divided into five clades, and each diploid genome or subgenome harbors a single copy.

Maximum-likelihood phylogenetic trees of the CYP71 family. (A) Members of the CYP71 family from nine land plants with sequence identity >40% are included (CYP71BE79 as a seed query). CYP71BE79 is located in the green branch. Plants analyzed are G. hirsutum, D. zibethinus, T. cacao, Aquilegia coerulea, A. thaliana, Oryza sativa subsp. Japonica, Amborella trichopoda, Selaginella moellendorffii, and Physcomitrella patens. The National Center for Biotechnology Information and Phytozome databases (51) were searched. (B) Members of the CYP71BE subfamily from five species of Malvaceae: G. hirsutum, G. raimondii, G. arboreum, T. cacao, and D. zibethinus. CYP71BEs are divided into five clades, and each diploid genome or subgenome harbors a single copy. The nonsynonymous (Ka) and synonymous substitution rates (Ks) of three gossypol pathway cytochromes P450 (CYP706B1, CYP82D113, and CYP71BE79) in G. hirsutum were compared with their homologs in D. zibethinus (Table 2). The higher Ks values and the lower Ka/Ks ratios of CYP71BE79 indicate that this P450 has undergone less relaxed selection. Moreover, CYP71BE79 has a high Vmax value compared with other, identified cytochromes P450 of the gossypol pathway (Table 1), which supports an efficient transformation of its substrate (8-hydroxy-7-keto-δ-cadinene) that affects plant resistance to pathogens if accumulated (). We propose that CYP71BE79 is functionally more conserved in Gossypium and in closely related genera in order to catalyze a highly controlled step to prevent the accumulation of the phytotoxic metabolite, along with gossypol pathway evolution.
Table 2.

The evolution rates and Ka/Ks values of three homologous P450 gene pairs between G. hirsutum and D. zibethinus

Gene nameGenes in G. hirsutumHomologs in D. zibethinusKaKsKa/Ks
CYP706B1_DGh_D03G1513XM_022882367.10.12710.45140.2816
CYP706B1_AGh_A03G2006XM_022882367.10.12530.43420.2886
CYP82D113_DGh_D05G1894XM_022910758.10.10930.54050.2022
CYP82D113_AGh_A05G1705XM_022910758.10.1050.53820.1951
CYP71BE79_DGh_D13G1407XM_022861030.10.12010.95990.1251
CYP71BE79_AGh_A13G1133XM_022861030.10.11650.93980.124
The evolution rates and Ka/Ks values of three homologous P450 gene pairs between G. hirsutum and D. zibethinus

Discussion

Recent achievements in sequencing cotton genomes (26, 35–37) have facilitated the isolation and characterization of gossypol pathway enzymes through transcriptome mining. It is striking that the first oxidation reaction of (+)-δ-cadinene catalyzed by CYP706B1 toward gossypol biosynthesis occurs at the C-7 position, instead of C-8 as proposed previously. Besides gossypol and related sesquiterpene aldehydes that have a characteristic 8-hydroxyl group, there are other cadinene derivatives featuring oxidation at C-7 in cotton, such as 2-hydroxy-7-methoxycadalene (24). An earlier study showing that the tritiated CYP706B1 product was incorporated into gossypol (38) supported the involvement of this cytochrome P450 in gossypol biosynthesis. Here, we provide evidence that CYP706B1 produces 7-hydroxy-(+)-δ-cadinene, which is an upstream intermediate in the gossypol pathway. Interestingly, 7-hydroxy-(+)-δ-cadinene is subjected to C-8 oxidation following C-7 carbonylation, and the C-7 carbonyl group seems indispensable for C-8 hydroxylation. The cadinene-type sesquiterpenes oxidized at both C-7 and C-8 have not been found before; subsequent oxidation at C-11 by CYP71BE79 presumes to react with a C-8 hydroxyl group to form a C-8–C-11 ether bridge in the structure of gossypol (Fig. 4). The fate of the C-7 carbonyl group awaits determination but could be deduced from structural comparison of 8,11-dihydroxy-7-keto-δ-cadinene and furocalamen-2-one, because the two intermediates leave a biosynthesis gap that may involve isomerization of carbonyl functionality to an enol group and the successive dehydration to form a benzene ring (ring B). Isomerization and dehydration are not uncommon in aromatization, such as the shikimate pathway rearrangement of chorismate to prephenate by chorismate mutase and the dehydration of arogenate to phenylalanine by arogenate dehydratase (39). Furthermore, ring B is also aromatized during desoxyhemigossypol formation from 3-hydroxy-furocalamen-2-one (Fig. 4). The present investigation resolves most of the oxidation reactions involved, leaving two remaining gaps that each involves similar aromatization reactions. Notably, the reaction steps of gossypol formation are not randomly cascaded but rather accurately cascaded, from an energy point of view. The oxidation always occurs in the position much easier to take place, and the introduced oxidized group reduces the energy barrier of the next oxidation. For example, the first hydroxylation proceeds in the active C-7 allylic position, and then the newly formed carbonylation leaves its α position more active for subsequent hydroxylation; such is also the case of hydroxylations at positions 3 and 8, where there are preexisting carbonyl groups. Lastly, aromatizing provides the most stable napthalene ring. Thus, the gossypol pathway has evolved and been optimized through several low-energy intermediates. The clear order and the strict substrate specificity of these biosynthetic reactions imply that the gossypol biosynthetic pathway may have evolved step by step, which might be a reason for discrete distributions of enzyme genes in the genome. We anticipate that in some plants of Malvaceae, such as cacao, okra, and roselle, the biosynthetic pathways of cadinene-type sesquiterpenes are not necessarily destined to be gossypol; the short-cut or diversified routes may result in a rich array of specialized metabolites. Comparative analyses of these pathways will enrich our knowledge on evolution of sesquiterpene biosynthetic pathways and provide valuable data for safe use and further exploration of food, oil, and vegetable crops in the Malvaceae and related families. There are two lines of evidence that support a tight regulation of the gossypol biosynthetic pathway. First, although not clustered in the genome as frequently observed with other specialized pathways (3, 6, 18, 30), genes of all six enzymes characterized show highly similar expression patterns. This raises the possibility that all these genes are regulated by a common transcription factor complex, as seen from the MYB-bHLH-WD40 complex in the anthocyanin biosynthetic pathway (40, 41). Second, products of these gossypol pathway enzymes are mostly undetectable in plant tissues unless the downstream enzyme genes are silenced, suggesting a highly efficient conversion, which could be a result of substrate channeling (42). For example, the monoterpene indole alkaloid pathway in Catharanthus roseus involves a complex and highly regulated biosynthesis in which the upstream pathway enzymes are separated in different cellular compartments to prevent inappropriate accumulation of highly reactive strictosidine aglycone (43). In addition to their function as phytoalexins in plants, gossypol and related sesquiterpene aldehydes also show anticancer (44, 45), antimicrobial (46, 47), and spermicidal (48) activities. We wonder whether the six intermediates identified here have similar or novel biological activities. In particular, the structure of 8-hydroxy-7-keto-δ-cadinene features an α, β-unsaturated ketone and an α-hydroxyl group next to the carbonyl, which may act as a Michael acceptor for biological nucleophiles; the similar enone group has been suggested as a general structural requirement for optimal cytotoxicity of quassinoids, a group of degraded triterpenes with promising antitumor and cytotoxic activity (49, 50), suggesting that this intermediate may harbor interesting biological activities. Cloning of the enzymes makes it possible to obtain these hidden natural products in large quantity for drug or agrochemical screening.

Methods

Details about plant materials and growth conditions are described in . Gene expression, elicitation, plant transformation, heterologous expression and purification of proteins, pathway reconstitution in N. benthamiana leaves, pathogen infection, enzymes assays, metabolites detection, and analysis were carried out according to protocols described in .
  45 in total

1.  Six enzymes from mayapple that complete the biosynthetic pathway to the etoposide aglycone.

Authors:  Warren Lau; Elizabeth S Sattely
Journal:  Science       Date:  2015-09-11       Impact factor: 47.728

Review 2.  MYB-bHLH-WD40 protein complex and the evolution of cellular diversity.

Authors:  Nicola A Ramsay; Beverley J Glover
Journal:  Trends Plant Sci       Date:  2005-02       Impact factor: 18.313

Review 3.  Functional aspects of cell patterning in aerial epidermis.

Authors:  Cathie Martin; Beverley J Glover
Journal:  Curr Opin Plant Biol       Date:  2006-11-30       Impact factor: 7.834

4.  A chemical genetic roadmap to improved tomato flavor.

Authors:  Denise Tieman; Guangtao Zhu; Marcio F R Resende; Tao Lin; Cuong Nguyen; Dawn Bies; Jose Luis Rambla; Kristty Stephanie Ortiz Beltran; Mark Taylor; Bo Zhang; Hiroki Ikeda; Zhongyuan Liu; Josef Fisher; Itay Zemach; Antonio Monforte; Dani Zamir; Antonio Granell; Matias Kirst; Sanwen Huang; Harry Klee
Journal:  Science       Date:  2017-01-27       Impact factor: 47.728

5.  Dirigent Proteins from Cotton (Gossypium sp.) for the Atropselective Synthesis of Gossypol.

Authors:  Isabelle Effenberger; Bin Zhang; Ling Li; Qiang Wang; Yuxiu Liu; Iris Klaiber; Jens Pfannstiel; Qingmin Wang; Andreas Schaller
Journal:  Angew Chem Int Ed Engl       Date:  2015-10-13       Impact factor: 15.336

6.  Molecular cloning and functional identification of (+)-delta-cadinene-8-hydroxylase, a cytochrome P450 mono-oxygenase (CYP706B1) of cotton sesquiterpene biosynthesis.

Authors:  P Luo; Y H Wang; G D Wang; M Essenberg; X Y Chen
Journal:  Plant J       Date:  2001-10       Impact factor: 6.417

7.  GhZFP1, a novel CCCH-type zinc finger protein from cotton, enhances salt stress tolerance and fungal disease resistance in transgenic tobacco by interacting with GZIRD21A and GZIPR5.

Authors:  Ying-Hui Guo; Yue-Ping Yu; Dong Wang; Chang-Ai Wu; Guo-Dong Yang; Jin-Guang Huang; Cheng-Chao Zheng
Journal:  New Phytol       Date:  2009-04-15       Impact factor: 10.151

8.  THE SHIKIMATE PATHWAY.

Authors:  Klaus M. Herrmann; Lisa M. Weaver
Journal:  Annu Rev Plant Physiol Plant Mol Biol       Date:  1999-06

9.  Genetic basis for glandular trichome formation in cotton.

Authors:  Dan Ma; Yan Hu; Changqing Yang; Bingliang Liu; Lei Fang; Qun Wan; Wenhua Liang; Gaofu Mei; Lingjian Wang; Haiping Wang; Linyun Ding; Chenguang Dong; Mengqiao Pan; Jiedan Chen; Sen Wang; Shuqi Chen; Caiping Cai; Xiefei Zhu; Xueying Guan; Baoliang Zhou; Shuijin Zhu; Jiawei Wang; Wangzhen Guo; Xiaoya Chen; Tianzhen Zhang
Journal:  Nat Commun       Date:  2016-01-22       Impact factor: 14.919

10.  An NPF transporter exports a central monoterpene indole alkaloid intermediate from the vacuole.

Authors:  Richard M E Payne; Deyang Xu; Emilien Foureau; Marta Ines Soares Teto Carqueijeiro; Audrey Oudin; Thomas Dugé de Bernonville; Vlastimil Novak; Meike Burow; Carl-Erik Olsen; D Marc Jones; Evangelos C Tatsis; Ali Pendle; Barbara Ann Halkier; Fernando Geu-Flores; Vincent Courdavault; Hussam Hassan Nour-Eldin; Sarah E O'Connor
Journal:  Nat Plants       Date:  2017-01-13       Impact factor: 17.352

View more
  26 in total

1.  Pyrethrin Biosynthesis: The Cytochrome P450 Oxidoreductase CYP82Q3 Converts Jasmolone To Pyrethrolone.

Authors:  Wei Li; Daniel B Lybrand; Fei Zhou; Robert L Last; Eran Pichersky
Journal:  Plant Physiol       Date:  2019-08-26       Impact factor: 8.340

2.  GhMYC2 activates cytochrome P450 gene CYP71BE79 to regulate gossypol biosynthesis in cotton.

Authors:  Xinpei Han; Yadi Xing; Yaqian Zhu; Lei Luo; Lulu Liu; Yaohua Zhai; Wenjing Wang; Ruixing Shao; Maozhi Ren; Fuguang Li; Qinghua Yang
Journal:  Planta       Date:  2022-08-23       Impact factor: 4.540

Review 3.  RES transformation for biosynthesis and detoxification.

Authors:  Jin-Quan Huang; Jia-Ling Lin; Xiao-Xiang Guo; Xiu Tian; Ye Tian; Xiao-Xia Shangguan; Ling-Jian Wang; Xin Fang; Xiao-Ya Chen
Journal:  Sci China Life Sci       Date:  2020-06-08       Impact factor: 6.038

4.  Comparative transcriptome analysis of sesquiterpene biosynthesis and functional characterization of sesquiterpene synthases in Leonurus sibiricus L.

Authors:  Xiaoguang Yan; Weiguo Li; Dongmei Liang; Guangrong Zhao; Qinggele Caiyin; Jianjun Qiao
Journal:  Planta       Date:  2021-02-18       Impact factor: 4.116

5.  Aromatization of natural products by a specialized detoxification enzyme.

Authors:  Jin-Quan Huang; Xin Fang; Xiu Tian; Ping Chen; Jia-Ling Lin; Xiao-Xiang Guo; Jian-Xu Li; Zhen Fan; Wei-Meng Song; Fang-Yan Chen; Ruzha Ahati; Ling-Jian Wang; Qing Zhao; Cathie Martin; Xiao-Ya Chen
Journal:  Nat Chem Biol       Date:  2020-01-13       Impact factor: 16.174

6.  Elucidation of the biosynthesis pathway and heterologous construction of a sustainable route for producing umbelliferone.

Authors:  Yucheng Zhao; Xiangyun Jian; Jialin Wu; Wanchun Huang; Chuanlong Huang; Jun Luo; Lingyi Kong
Journal:  J Biol Eng       Date:  2019-05-22       Impact factor: 4.355

7.  Reconstruction of the full-length transcriptome atlas using PacBio Iso-Seq provides insight into the alternative splicing in Gossypium australe.

Authors:  Shouli Feng; Min Xu; Fujie Liu; Changjiang Cui; Baoliang Zhou
Journal:  BMC Plant Biol       Date:  2019-08-19       Impact factor: 4.215

8.  Gene Expression Correlation Analysis Reveals MYC-NAC Regulatory Network in Cotton Pigment Gland Development.

Authors:  Hailiang Cheng; Xiaoxu Feng; Dongyun Zuo; Youping Zhang; Qiaolian Wang; Limin Lv; Chaofeng Wu; Shuyan Li; Yuanli Dai; Da Qu; Man He; Shang Liu; Guoli Song
Journal:  Int J Mol Sci       Date:  2021-05-08       Impact factor: 5.923

9.  The gland localized CGP1 controls gland pigmentation and gossypol accumulation in cotton.

Authors:  Wei Gao; Fu-Chun Xu; Lu Long; Yang Li; Jun-Li Zhang; Leelyn Chong; Jose Ramon Botella; Chun-Peng Song
Journal:  Plant Biotechnol J       Date:  2020-01-21       Impact factor: 9.803

10.  Genome-wide characterization of the GRF family and their roles in response to salt stress in Gossypium.

Authors:  Jun-Feng Cao; Jin-Quan Huang; Xia Liu; Chao-Chen Huang; Zi-Shou Zheng; Xiu-Fang Zhang; Xiao-Xia Shangguan; Ling-Jian Wang; Yu-Gao Zhang; Jonathan F Wendel; Corrinne E Grover; Zhi-Wen Chen
Journal:  BMC Genomics       Date:  2020-08-24       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.