| Literature DB >> 31395025 |
Yong Jia1,2, Bo Li3, Yujuan Zhang1, Xiaoqi Zhang1,2, Yanhao Xu4, Chengdao Li5,6,7.
Abstract
BACKGROUND: Flavonoid 3'-hydroxlase (F3'H) is an important enzyme in determining the B-ring hydroxylation pattern of flavonoids. In monocots, previous studies indicated the presence of two groups of F3'Hs with different enzyme activities. One F3'H in rice was found to display novel chrysoeriol-specific 5'-hydroxylase activity. However, the evolutionary history of monocot F3'Hs and the molecular basis for the observed catalytic difference remained elusive.Entities:
Keywords: Anthocyanin; Cereal crops; Evolutionary dynamics; Flavonoid 3′-hydroxylase; Flavonoids; Gene evolution; Plant environmental interactions; Positive natural selection
Mesh:
Substances:
Year: 2019 PMID: 31395025 PMCID: PMC6686259 DOI: 10.1186/s12870-019-1947-z
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Identification of putative F3’H genes in cereal plants. The F3’H class was classified based on the phylogeny analyses. The gene duplication pattern was determined using the MCScanX tool. NA stands for “not applicable”
| Species | Chr | F3’H gene ID | F3’H class | Duplication pattern | Start position | End position | |
|---|---|---|---|---|---|---|---|
|
|
| 1H | HORVU1Hr1G094880 | Class I | Dispersed duplication | 556,691,949 | 556,693,904 |
| 6H | HORVU6Hr1G002400 | Class II | Dispersed duplication | 6,328,532 | 6,330,799 | ||
| 7H | HORVU7Hr1G095900.47 | Class II | Dispersed duplication | 585,061,615 | 585,071,468 | ||
|
| Aet1 | AET1Gv21041800 | Class I | Dispersed duplication | 499,122,364 | 499,126,756 | |
| Aet6 | AET6Gv20027200 | Class II | Dispersed duplication | 5,657,246 | 5,659,238 | ||
| Aet7 | AET7Gv20999800 | Class II | Dispersed duplication | 523,485,446 | 523,487,617 | ||
|
| Tt1A | TRIDC1AG064860 | Class I | NA | 590,437,663 | 590,440,408 | |
| Tt1B | TRIDC1BG074020 | Class I | 685,939,174 | 685,944,027 | |||
| Tt2B | TRIDC2BG088640 | Class I | 792,785,227 | 792,787,184 | |||
| Tt6A | TRIDC6AG001340 | Class II | 4,624,395 | 4,644,086 | |||
| Tt6B | TRIDC6BG002010 | Class II | 11,399,578 | 11,402,254 | |||
| Tt7A | TRIDC7AG057400 | Class II | 599,327,799 | 599,328,465 | |||
| Tt7B | TRIDC7BG049820 | Class II | 562,791,603 | 562,792,604 | |||
|
| Ta1A | TraesCS1A01G442300.1 | Class I | NA | 590,995,642 | 590,997,413 | |
| Ta1B | TraesCS1B01G476400.1 | Class I | 685,231,562 | 685,233,491 | |||
| Ta1D | TraesCS1D01G450100.1 | Class I | 492,534,241 | 492,538,011 | |||
| Ta2B | TraesCS2B01G613200.1 | Class I | 792,677,476 | 792,679,409 | |||
| Ta6A | TraesCS6A01G012600.1 | Class II | 5,861,572 | 5,863,417 | |||
| Ta6B | TraesCS6B01G018800.1 | Class I | 11,574,703 | 11,578,097 | |||
| Ta6D | TraesCS6D01G015200.1 | Class II | 6,319,419 | 6,321,226 | |||
| Ta7A | TraesCS7A01G411700.1 | Class II | 602,804,667 | 602,806,415 | |||
| Ta7D | TraesCS7D01G404900.1 | Class II | 522,502,518 | 522,504,208 | |||
|
| Sc1 | Sc1Loc01465431 | Class I | NA | Lo7_v2_contig_2871825 | ||
| Sc2 | Sc2Loc01684522 | Class I | Lo7_v2_contig_326626 | ||||
| Sc7 | Sc7Loc01952123 | Class II | Lo7_v2_contig_61986 | ||||
|
| Tu1 | TuG1812G0100004862 | Class I | Dispersed duplication | 581,402,997 | 581,404,997 | |
| Tu7 | TuG1812G0700004460 | Class II | Dispersed duplication | 590,301,744 | 590,303,739 | ||
|
| Bd1 | Bradi1g17180 | Class I | Dispersed duplication | 13,787,434 | 13,789,806 | |
| Bd1 | Bradi1g24840 | Class II | Dispersed duplication | 20,108,563 | 20,112,348 | ||
| Bd3 | Bradi3g04750 | Class I | Dispersed duplication | 3,260,666 | 3,262,706 | ||
| Bd4 | Bradi4g16560 | Class II | Dispersed duplication | 17,368,956 | 17,372,786 | ||
|
| Os10 | LOC_Os10g17260 | Class I | Proximal duplication | 8,679,309 | 8,681,284 | |
| Os10 | LOC_Os10g16974 | Class II | Proximal duplication | 8,494,247 | 8,504,329 | ||
|
|
| Zm4 | Zm00008a016611 | Class I | WGD/Segmental | 131,908,959 | 131,910,431 |
| Zm5 | Zm00008a022212 | Class I | WGD/Segmental | 177,289,973 | 177,292,210 | ||
| Zm8 | Zm00008a031477 | Class II | Dispersed duplication | 116,235,840 | 116,239,418 | ||
|
| Sb4 | Sobic.004G200800 | Class I | Tandem duplication | 55,221,098 | 55,224,686 | |
| Sb4 | Sobic.004G200833 | Class I | Tandem duplication | 55,225,513 | 55,227,179 | ||
| Sb4 | Sobic.004G200900 | Class I | Proximal duplication | 55,233,582 | 55,236,702 | ||
| Sb4 | Sobic.004G201100 | Class I | Proximal duplication | 55,261,682 | 55,264,545 | ||
| Sb9 | Sobic.009G162500 | Class II | Dispersed duplication | 51,943,204 | 51,948,939 | ||
|
| Si9 | Seita.9G244600 | Class II | Dispersed duplication | 19,091,837 | 19,094,929 | |
| Si9 | Seita.9G242900 | Class I | Dispersed duplication | 18,990,913 | 18,992,801 | ||
|
| Ph8 | Pahal.H01052 | Class II | Dispersed duplication | 31,682,928 | 31,687,178 | |
| Ph9 | Pahal.I03232 | Class I | Dispersed duplication | 16,748,267 | 16,750,212 | ||
Fig. 1Phylogenetic tree displaying the evolutionary history of plant F3’Hs. The phylogeny was developed using Bayesian method. F3’H homologs from P. patens was included as outgroup. Posterior support was displayed above each branch. The deduced duplication event was indicated by solid black dot symbol. Yellow vertical lines indicated the division of sub-branches of F3’Hs in Triticeae
Fig. 2Genomic structures of monocot F3’Hs. The F3’H genes were clustered based on the developed phylogeny in the present study. Two features: CDS and Intron were displayed for each F3’H based data extracted from public database
Fig. 3Displays the collinear F3’H gene pairs across monocot plants. The circle plot was created by MCScanX tool. Identified colinear genes were linked by red curved lines. Hv, Bd, Aet, Tu, Os, Zm, SI, Sb and Ph represent H. vulgare, B. distachyon, A. tauschii, T. urartu, O. sativa, Z. mays, S. italica, S. bicolor and P. hallii, respectively. Only the chromosomes containing F3’H genes were included for this analysis. Class I and Class II F3’Hs were labelled in blue and pink colours, respectively
Natural selection tests on plant F3’Hs. “np” stands for the number of parameters. ln(Likelihood) refers the log value of the likelihood
| Model | np | Estimates of parametersa | Positively selected sitesb | |
|---|---|---|---|---|
| One-ratio | ||||
| ω[eudi] = ω[mono1] = ω[mono2] | 1 | −29,872.87 | ω[eudi] = ω[mono1] = ω[mono2] = 0.12094 | Not Allowed (NA) |
| Branch-specific models | ||||
| ω[eudi] = ω[mono1] ≠ ω[mono2] | 2 | −29,862.42 | ω[eudi] = ω[mono1] = 0.11857, ω[mono2] = 0.82808 | NA |
| ω[eudi] = ω[mono2] ≠ ω[mono1] | 2 | −29,870.85 | ω[eudi] = ω[mono2] = 0.11990, ω[mono1] = 0.43835 | NA |
| ω[eudi] ≠ ω[mono1] ≠ ω[mono2] | 3 | − 29,862.42 | ω[eudi] = 0.11853, ω[mono2] = 0.78501, ω[mono1] = 0.12479 | NA |
| Site-specific models | ||||
| Neutral M1 (2 site classes) | 2 | −29,413.91 | P0 = 0.87805 (P1 = 1-P0 = 0.12195); ω0 = 0.09334 (ω1 = 1- ω0 = 1.0) | NA |
| Selection M1 (3 site classes | 3 | −29,413.91 | P0 = 0.87805, P1 = 0.01421, P2 = 1-P0-P1 = 0.10774; ω0 = 0.09334 (ω1 = 1.0), ω2 = 1.0 | NA |
| Branch-site models | ||||
| Model A Null (Class I F3’H) | 3 | −29,408.41 | P0 = 0, P1 = 0.0, P2 + P3 = 1; ω0 = 0.09152, ω1 = 1.0, ω2 = 1.0 | |
| Model A (Class I F3’H) | 4 | −29,408.41 | P0 = 0.00005, P1 = 0.00001, P2 + P3 = 0.99994; ω0 = 0.09152, ω1 = 1.0, ω2 = 1.0 | None |
| Model A Null (Class II F3’H) | 3 | −29,394.83 | P0 = 0.51238, P1 = 0.07209, P2 + P3 = 0.41553; ω0 = 0.09046, ω1 = 1.0, ω2 = 1.0 | NA |
| Model A (Class II F3’H) | 4 | −29,392.04 | P0 = 0.68799, P1 = 0.09608, P2 + P3 = 0.21593; ω0 = 0.09035, ω1 = 1.0, ω2 = 8.19737 | 108R,222A, 265 V, 274 T, 355Q, 447S, 449 L ( |
aIn the site-specific model M1, two site classes were specified: highly conserved sites (ω0) and neutral sites (ω1 = 1). For the site-specific model M2, there were three site classes: highly conserved sites (ω0), neutral sites (ω1 = 1) and positively selected sites (ω2). In Model A, four site classes were specified. The first two classes had ω ratios of ω0 and ω1 respectively, corresponding to highly conserved sites and neutral sites across all lineages. In the other two site classes, the background lineages had ω0 or ω1 while the foreground lineages had ω2. bPositively selected amino acids at P-value ≤0.05 are numbered according to HORVU6Hr1G002400.1, excluding the first 34 amino acids predicted as membrane targeting signal
Fig. 4Sequence alignment and protein modelling analyses. a Amino acid sequence alignment of the selected sites and SRS6 in plant F3’Hs. b The overall superimposition of the LOC_Os10g16974 (orange) and LOC_Os10g17260 (green) models with CYP76AH1 (grey; PDB: 5YM3); c The spatial location of the positively selected sites in LOC_Os10g16974; d The spatial location of the positively selected sites in LOC_Os10g17260; e The potential interaction of the positively selected sites with the superimposed substrates; f The hydrophobicity changes between LOC_Os10g16974 and LOC_Os10g17260 at the positively selected amino acid residues. Red and white colours represent the most hydrophobic and the most hydrophilic, respectively. HEM stands for heme. PIN refers enzyme inhibitor in PDB 5YM3
Fig. 5Transcriptional profiles of monocot F3’Hs. a LOC_Os10g16974 (Class II) and LOC_Os10g17260 (Class I) in rice; b HORVU6Hr1G002400 (Class II) and HORVU1Hr1G094880 (Class I) in barley; c Sobic.004G200800, Sobic.004G200833, Sobic.004G200900, Sobic.004G201100 (Class I) and Sobic.009G162500 (Class II) in Sorghum; d Zm00008a016611, Zm00008a022212 (Class I) and Zm00008a031477 (Class II) in maize