| Literature DB >> 23834441 |
Hongli Cui1, Xiaona Yu, Yan Wang, Yulin Cui, Xueqin Li, Zhaopu Liu, Song Qin.
Abstract
BACKGROUND: Xanthophylls, oxygenated derivatives of carotenes, play critical roles in photosynthetic apparatus of cyanobacteria, algae, and higher plants. Although the xanthophylls biosynthetic pathway of algae is largely unknown, it is of particular interest because they have a very complicated evolutionary history. Carotenoid hydroxylase (CHY) is an important protein that plays essential roles in xanthophylls biosynthesis. With the availability of 18 sequenced algal genomes, we performed a comprehensive comparative analysis of chy genes and explored their distribution, structure, evolution, origins, and expression.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23834441 PMCID: PMC3728230 DOI: 10.1186/1471-2164-14-457
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Putative carotenoid hydroxylase genes identified in 18 algae genomes
| jgi|Chlre4|309780|kg.chromosome_8_#_293_#_ABQ59243.1 | 577 | CYP97C |
| jgi|Chlre4|196742|DNE_DNE_gwH.55.10.1 | 652 | CYP97A |
| jgi|Chlre4|196744|DNE_DNE_e_gwW.42.59.1 | 655 | CYP97A |
| jgi|Chlre4|283001|au.g1522_t1 | 502 | CYP97B |
| jgi|Chlre4|164400|fgenesh2_kg.C_scaffold_40000013 | 298 | BCH |
| jgi|Volca1|83281|estExt_Genewise1Plus. C_520019 | 576 | CYP97C |
| jgi|Volca1|100143|fgenesh4_pg.C_scaffold_106000021 | 672 | CYP97A |
| jgi|Volca1|65884|e_gw1.51.4.1 | 642 | CYP97B |
| jgi|Volca1|44641|gw1.75.40.1 | 294 | BCH |
| jgi|ChlNC64A_1|51247|fgenesh3_pg. C_scaffold_6000061 | 578 | CYP97C |
| jgi|ChlNC64A_1|138471|IGS.gm_20_00220 | 615 | CYP97B |
| jgi|ChlNC64A_1|24463|e_gw1.13.95.1 | 238 | BCH |
| jgi|Chlvu1|60845|GG.C169_S07_00141 | 541 | CYP97C |
| jgi|Chlvu1|24726|e_gw1.1.89.1 | 535 | CYP97A |
| jgi|Chlvu1|26454|e_gw1.3.111.1 | 534 | CYP97B |
| jgi|Chlvu1|44442|estExt_Genewise1Plus.C_160017 | 347 | BCH |
| jgi|Coc_C169_1|63212|Genemark1.3995_g | 541 | CYP97C |
| jgi|Coc_C169_1|52277|estExt_fgenesh1_pm.C_10317 | 432 | CYP97A |
| jgi|Coc_C169_1|12656|e_gw1.3.115.1 | 534 | CYP97B |
| jgi|Coc_C169_1|30875|estExt_Genewise1Plus.C_160317 | 347 | BCH |
| jgi|Ost9901_3|33533|eugene.0900010237 | 545 | CYP97C |
| jgi|Ost9901_3|47300|estExt_GenewiseEukaryote.C_Chr_130084 | 495 | CYP97A |
| jgi|Ost9901_3|1824|gwEuk.21.62.1 | 461 | CYP97A |
| jgi|Ost9901_3|29177|eugene.0100010571 | 564 | CYP97B |
| jgi|Ost9901_3|18007|fgenesh1_pg.C_Chr_14000053 | 561 | CYP97B |
| jgi|Ost9901_3|9013|gwEuk.10.453.1 | 153 | BCH |
| jgi|Ostta4|1830|gw1.09.00.100.1 | 490 | CYP97C |
| jgi|Ostta4|23029|estExt_fgenesh1_pm.C_Chr_13.00010043 | 484 | CYP97A |
| jgi|Ostta4|27418|estExt_gwp_GeneWisePlus.C_Chr_01.00010469 | 577 | CYP97B |
| jgi|Ostta4|23060|estExt_fgenesh1_pm.C_Chr_15.00010014 | 485 | CYP97B |
| jgi|Ostta4|18835|e_gw1.08.00.85.1 | 407 | OTHER |
| jgi|Ostta4|5089|gw1.10.00.289.1 | 216 | BCH |
| jgi|OstRCC809_1|1644|gw1.3.120.1 | 490 | CYP97C |
| jgi|OstRCC809_1|38666|fgenesh1_pg.C_scaffold_13000165 | 542 | CYP97A |
| jgi|OstRCC809_1|53931|estExt_Genewise1.C_21287 | 525 | CYP97B |
| jgi|OstRCC809_1|87721|eugene1.0000120100 | 576 | CYP97B |
| jgi|OstRCC809_1|16226|gw1.3.1299.1 | 153 | BCH |
| | | |
| jgi|MicpuC2|32152 | 550 | CYP97C |
| jgi|MicpuC2|26780 | 580 | CYP97A |
| jgi|MicpuC2|22138 | 530 | CYP97B |
| jgi|MicpuC2|57732 | 526 | OTHER |
| jgi|MicpuC2|11104 | 230 | BCH |
| jgi|MicpuN2|95887|estExt_Genewise2Plus.C_Chr_140254 | 542 | CYP97C |
| jgi|MicpuN2|83128|e_gw2.06.152.1 | 525 | CYP97A |
| jgi|MicpuN2|96121|estExt_Genewise2Plus.C_Chr_160324 | 539 | CYP97B |
| jgi|MicpuN2|88940|e_gw2.16.55.1 | 574 | CYP97B |
| jgi|MicpuN2|76186|gw2.07.573.1 | 232 | BCH |
| gnl|CMER|CMV041C [pt] beta-carotene hydroxylase | 259 | CRTR |
| jgi|Thaps3|36235|e_gw1.9.19.1 | 667 | CYP97B |
| jgi|Thaps3|264039|thaps1_ua_kg.chr_13000087 | 547 | CYP97B |
| jgi|Phatr2|26422|estExt_Genewise1.C_chr_50056 | 770 | CYP97B |
| jgi|Phatr2|16586|e_gw1.27.30.1 | 539 | CYP97B |
| jgi|Fracy1|169705|estExt_Genewise1.C_61231 | 528 | CYP97B |
| jgi|Fracy1|170430|estExt_Genewise1.C_71165 | 614 | CYP97B |
| jgi|Emihu1|463287|estExtDG_fgeneshEH_pg.C_230139 | 618 | CYP97B |
| jgi|Guith1|114743|au.78_g15845 | 582 | CYP97B |
| jgi|Guith1|88554|estExt_Genewise1Plus.C_630014 | 492 | CYP97B |
| jgi|Guith1|158065|fgenesh2_pm.48_#_5 | 499 | CYP97B |
| jgi|Bigna1|39488|e_gw1.33.13.1 | 474 | CYP97C |
| jgi|Bigna1|52980|estExt_Genewise1Plus.C_140020 | 545 | CYP97B |
| jgi|Auran1|19592 | 528 | CYP97B |
| jgi|Auran1|34662 | 432 | CYP97B |
OTHER indicates two genes belonged to no one subfamily of CYP97.
Figure 1Distribution of putative genes encoding CHYs and hypothesized xanthophylls biosynthetic pathway in algae. A] The distribution of genes encoding BCH, CrtR and CYP97 homologs across 18 algal genome sequences. The number of putative homologs or paralogs in each corresponding genome is indicated by "+" with color codes, respectively. B] Hypothesized xanthophylls biosynthetic pathway in algae. The genes encoding putative BCH-, CrtR- or CYP97-homologs identified from different algae genomes were indicated with color arrows. Lutein and α-carotene are absent in red algae and Chromalveolates according to our results. Question mark indicates enzymes involved in xanthophylls biosynthesis are unclear.
Figure 2Domain structure of BCH from green algae and CrtR from cyanobacteria and red algae. Ten BCH-type CHYs from green algae, one CrtR-type CHY from red algae and five CrtRs from cyanobacteria are included. A partial protein sequence (position: 110–400) has been selected for domain structure analysis. The predicted trans-membrane segments are shaded in red. The histidine boxes are shaded in green and Black stars indicate the positions of conserved histidine residues. Ten BCH-type CHYs from green algae includes Prasionphyceae [Micromonas pusilla, Micromonas sp. RCC299, Ostreococcus sp. RCC809, Ostreococcus tauri, and Ostreococcus lucimarinus]. One CrtR-type CHY from the red alga is Cyanidioschyzon merolae. The information of BCH genes from algae is as in Table 1. Five CrtRs from cyanobacteria includes Synechococcus sp. JA-2-3B'a (2–13) [Cyanobase: CYB_0102], Synechococcus sp. JA-3-3Ab [Cyanobase: CYA_1931], Cyanothece sp. PCC 7425 [Cyanobase: Cyan7425_1008], Acaryochloris marina MBIC11017 [Cyanobase: AM1_3637] and Thermosynechococcus elongatus BP-1 [Cyanobase: tlr1900].
Figure 3A un-rooted maximum likelihood tree of our BCH database and some other BCHs from bacteria, higher plants and cyanobacteria. The sequences information of BCH from cyanobacteria, bacteria and higher plants was downloaded from Cyanobase or NCBI database respectively and summarized in an additional file [see Additional file 3: Table S2]. A partial protein sequence (position: 110–400) has been selected for phylogenetic analysis. A maximum likelihood phylogenetic tree (loglk = −12176.58485) as inferred from amino acid sequences (291 amino acid characters) of BCH and CrtR proteins was computed using LG model for amino acid substitution (selected by PROTTEST) with discrete gamma distribution in four categories. All parameters (gamma shape = 1.963; proportion of invariants = 0.010; number of categories: 4) were estimated from the dataset. Numbers above branches indicate ML bootstrap supports. ML bootstraps were computed using the above mentioned model in 300 replicates. Stars indicate where later gene duplications led to creation of paralogs genes found within one species. Major groups of organisms are labeled to allow comparison between the phylogeny of BCH and algae evolution.
Figure 4Domain structure of CYP97 from algae. A partial protein sequence (position: 272–926) has been selected for domain structure analysis. The red shades indicated conserved amino acid residues in CYP97A and CYP97C homologs from green algae. The green shades indicated conserved amino acid residues in all CYP97B homologs from all algae. The P450s active site components were found in the amino acid sequences of all CYP97A/B/C, including I-helix involved in oxygen binding (CD6 in CYP97B and CD5 in CYP97A and CYP97C), ERR triad (CD7 in CYP97A/B/C) involved in locking the heme pockets into position and to assure stabilization of the conserved core structure, and CD10 involved in heme binding and a conserved cysteine (the circle with blue color). The abbreviations used are: Chl-C, CYP97C from Chlorophyta C. reinhardtii, V. carteri, M. sp. RCC299, O. RCC809, and Cercozoa B. natans CCMP2755; Chl-A, CYP97A from Chlorophyta C. reinhardtii, V. carteri, M. sp. RCC299, and O. RCC809; Chl-B, CYP97B from Chlorophyta M. sp. RCC299, O. RCC809, and V. carteri; Cry-B, CYP97B from Cryptophyta G. theta; Bac-B, CYP97B from Bacillariophyta T. pseudonana, P. tricornutum, F. cylindrus, and Stramenopiles A. anophagefferens; Hap-B, CYP97B from E. huxleyi. The information of BCH genes from algae is as in Table 1.
Figure 5A maximum likelihood tree of our CYP97 database and some other CYP97s from higher plants. The sequences information of CYP97s from higher plants was downloaded from NCBI database and summarized as follow: Arabidopsis thaliana [GenBank: CYP97A3, gb|AEE31394.1, CYP97B3, gb|AEE83557.1, CYP97C1, sp|Q6TBX7.1 and CYP86A, AED97111.1], Zea mays [GenBank: CYP97A16, ACG28871.1], Glycine max [GenBank: carotene epsilon-monooxygenase, XP_003537025.1], Solanum lycopersicum [GenBank: CYP97C11, NP_001234058.1] and Oryza sativa Japonica Group [GenBank: carotene epsilon-monooxygenase, AAK20054.1]. A partial protein sequence (position: 272–926) has been selected for phylogenetic analysis. A maximum likelihood phylogenetic tree (loglk = −27808.68723) as inferred from amino acid sequences (655 amino acid characters) of CYP97 proteins was computed using LG model for amino acid substitution (selected by PROTTEST) with discrete gamma distribution in four categories. All parameters (gamma shape = 1.924; proportion of invariants = 0.011; number of categories: 4) were estimated from the dataset. Numbers above branches indicate ML bootstrap supports. ML bootstraps were computed using the above mentioned model in 300 replicates. The arrow indicates an ancient gene duplication event creating CYP97A/C, respectively. Stars indicate where later gene duplications led to creation of paralogs genes found within one species. Black circle indicate two genes belonged to no one subfamily of CYP97. Major groups of organisms are labeled to allow comparison between the phylogeny of CYP97A/B/C and algae evolution.
Listed of three full-length of HaeCYP97A/B/C from
| HaeCYP97A | 1872 | 1-159 | 160-1752 | 1753-1872 | 530 | 59.03 | 7.81 | - | chl |
| HaeCYP97B | 1871 | 1-2 | 3-1622 | 1623-1871 | 539 | 58.72 | 6.26 | - | nd |
| HaeCYP97C | 1995 | 1-46 | 47-1666 | 1667-1995 | 539 | 58.71 | 7.94 | - | chl |
Abbreviations: 5’UTR 5’-untranslated region, 3’UTR 3’-untranslated region, CDS coding sequences, Mw molecular weight, pI isoelectric point, TM Transmembrane regions, SL subcellular localization, bp base pair, chl chloroplast.
The degree of identity and similar between the predicted amino acid sequences for each isolated CYP97 genes and corresponding CYP97 from other eukaryotes
| HaeCYP97A | 100/100 | 51/66 | 46/64 | 57/72 | 48/64 | 50/70 |
| HaeCYP97B | | 100/100 | 43/59 | 48/63 | 61/76 | 44/60 |
| HaeCYP97C | | | 100/100 | 46/62 | 39/56 | 61/72 |
| AthCYP97A3 | | | | 100/100 | 40/65 | 53/72 |
| AthCYP97B3 | | | | | 100/100 | 43/60 |
| AthCYP97C1 | 100/100 |
Note: AthCYP97A3 [GenBank: NP_564384.1], AthCYP97B3 [GenBank: NP_193247.2] and AthCYP97C1 [GenBank: NP_190881.2] were downloaded from National Center for Biotechnology Information GenBank database, respectively. The data was present by the form of identity/similarity.
Figure 6mRNA levels of xanthophylls biosynthesis-related genes upon white or blue high light stimulation. The exponentially growing cultures (cell density approximately 5 × 107 cells ml–1) were harvested and transferred cells to 500-ml erlenmeyer flasks (named, 1–9), each containing 250-ml BBM (fresh medium) under continuous white light (390–770 nm) or blue light (420–500 nm) with light intensity of 1,000 μmol photons m–2 s–1 without a day/night cycle, respectively. Collected algal cells (20-mL sample at some selected time) were rinsed with PBS, stored at −80°C if not immediately used. The relative transcript levels of bch[A], Haecyp97a[B], Haecyp97b[C] and Haecyp97c[D] were determined after 2, 4, 6, 10, 13, 24, 28, 34, 48, 54 and 72 h by qRT-PCR using actin as a reference gene. The values were normalized to the transcript levels in the normal light condition. Data are averages of triplicate measurements. The error bars represent standard deviation. Length of the distance in x-axis did not correspond to length of induced time (hours).