| Literature DB >> 27966614 |
Bing-Hong Huang1, Yi-Wen Chen2, Chia-Lung Huang1, Jian Gao3, Pei-Chun Liao1.
Abstract
Gene duplication could be beneficial by functional division but might increase the risk of genetic load. The dynamics of duplicated paralogs number could involve recombination, positive selection, and functional divergence. Duplication of DIHYDROFLAVONOL 4-REDUCTASE (DFR) has been reported in several organisms and may have been retained by escape from adaptive conflict (EAC). In this study, we screened the angiosperm DFR gene focusing on a diversified genus Scutellaria to investigate how these duplicated genes are retained. We deduced that gene duplication involved multiple independent events in angiosperms, but the duplication of DFR was before the divergence of Scutellaria. Asymmetric positive selective pressures resulted in different evolutionary rates between the duplicates. Different numbers of regulatory elements, differential codon usages, radical amino acid changes, and differential gene expressions provide evidences of functional divergence between the two DFR duplicates in Scutellaria, implying adaptive subfunctionalization between duplicates. The discovery of pseudogenes accompanying a reduced replacement rate in one DFR paralogous gene suggested possibly leading to "loss of function" due to dosage imbalance after the transient adaptive subfunctionalization in the early stage of duplication. Notwithstanding, episodic gene duplication and functional divergence may be relevant to the diversification of ecological function of DFR gene in Scutellaria.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27966614 PMCID: PMC5155217 DOI: 10.1038/srep39031
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The neighbor-joining tree of DFR gene.
Detailed evolutionary relationships of the lineages are shown in Supplementary Fig. 1. Lineages with potential duplications or duplications identified previously have been labeled with star (*).
Recombination analyses estimated by R, Rm, and ZZ statistic. Both observed and coalescent simulations are shown.
| Taxa | Tajima’s | Average | 95% CI, lower limit | 95% CI, upper limit | Average | 95% CI, lower limit | 95% CI, upper limit | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Angiosperm | 3.74047 | <0.001 | 101 | 0.1322 | 37 | — | — | — | — | 0.0334 | — | — | — | — |
| −0.64289 | >0.1 | 2.9 | 0.0038 | 25 | 80.805 | 64 | 99 | 1 | 0.1477 | 0.0006 | −0.0375 | 0.0378 | 0 | |
| −0.51051 | >0.1 | 0.001 | 0 | 24 | 100.601 | 83 | 121 | 1 | 0.0492 | −0.0005 | −0.0367 | 0.0407 | 0.013 | |
| −0.1985 | >0.1 | 41.3 | 0.0538 | 8 | 28.942 | 19 | 39 | 1 | 0.0265 | −0.0006 | −0.0734 | 0.0779 | 0.224 | |
| −0.85068 | >0.1 | 14.1 | 0.0183 | 8 | 34.079 | 24 | 45 | 1 | 0.1629 | 0.0015 | −0.0576 | 0.073 | 0.001 | |
| 0.13892 | >0.1 | 89.3 | 0.1168 | 1 | 2.297 | 0 | 6 | 0.658 | 0.1731 | 0.0056 | −0.1829 | 0.2098 | 0.057 | |
| 0.68813 | >0.1 | 16.7 | 0.0219 | 1 | 7.581 | 3 | 13 | 0.999 | 0.0279 | −0.0028 | −0.1357 | 0.1497 | 0.27 | |
| 2.10125 | <0.05 | 0.001 | 0 | 5 | 112.14 | 93 | 133 | 1 | 0.0236 | −0.0003 | −0.0377 | 0.0369 | 0.1 | |
| −1.19213 | >0.1 | 41.2 | 0.0541 | 0 | 11.124 | 5 | 18 | 1 | 0.0464 | −0.0003 | −0.1086 | 0.1172 | 0.195 | |
| 0.59302 | >0.1 | 12.2 | 0.016 | 1 | 25.493 | 17 | 35 | 1 | 0.0308 | 0 | −0.0817 | 0.0773 | 0.21 | |
| 1.23996 | >0.1 | 4 | 0.0052 | 1 | 14.923 | 8 | 23 | 1 | 0.0673 | −0.0002 | −0.1016 | 0.1071 | 0.089 | |
| −0.53469 | >0.1 | 45.1 | 0.059 | 1 | 6.887 | 2 | 13 | 0.995 | −0.0177 | −0.002 | −0.1402 | 0.1532 | 0.577 | |
| −0.03248 | >0.1 | 30 | 0.0393 | 1 | 14.161 | 8 | 22 | 1 | −0.0185 | −0.0002 | −0.1047 | 0.1266 | 0.631 |
aObserved value.
bCoalescent simulation.
cProbability of obtaining values of the Rm statistic equal to or greater than the observed value; the probabilities were obtained from coalescent simulations with free recombination.
dProbability of obtaining values of the ZZ statistic equal to or greater than the observed value; the probabilities were obtained from coalescent simulations with no recombination.
Figure 2The neighbor-joining trees of DFR gene in Scutellaria constructed with exon (a) and intron (b) nucleotide sequences, and amino acid sequences (c). Branch support value (including bootstrap of NJ and ML, and posterior probability of BI, respectively) > 50% are shown adjacent to the nodes. Sequences of different species are indicated by different colors. Species with no identified duplicated DFR are indicated in black. Dup1 and Dup2 denote two putative duplications.
Results of branch-site model analysis and the likelihood ratio test for the foreground branches of Scutellaria DFR duplicates.
| Foreground branch | np | lnL | 2ΔL | Proportion | Background ω | Foreground ω | PS site (Pr ω > 1) | ||
|---|---|---|---|---|---|---|---|---|---|
| #1 | Dup1 fixed ω = 1 | 136 | −2629.741 | 0.4763 | 0.4556 | p0 = 0 | ω0 = 0.0355 | ω0 = 0.0355 | None |
| p1 = 0 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.8073 | ω2a = 0.0355 | ω2a = 1 | |||||||
| p2b = 0.1927 | ω2b = 1 | ω2b = 1 | |||||||
| #1 | Dup1 | 137 | −2629.503 | p0 = 0 | ω0 = 0.0358 | ω0 = 0.0358 | |||
| p1 = 0 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.8081 | ω2a = 0.0358 | ω2a = 64.0267 | |||||||
| p2b = 0.1919 | ω2b = 1 | ω2b = 64.0267 | |||||||
| #1 | Dup2 fixed ω = 1 | 136 | −2629.61 | −0.2294 | #NUM! | p0 = 0.0165 | ω0 = 0.0356 | ω0 = 0.0356 | None |
| p1 = 0.0039 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.7907 | ω2a = 0.0356 | ω2a = 1 | |||||||
| p2b = 0.1889 | ω2b = 1 | ω2b = 1 | |||||||
| #1 | Dup2 | 137 | −2629.725 | p0 = 0 | ω0 = 0.0359 | ω0 = 0.0359 | |||
| p1 = 0 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.8079 | ω2a = 0.0359 | ω2a = 48.6756 | |||||||
| p2b = 0.1921 | ω2b = 1 | ω2b = 48.6756 | |||||||
| $1 | Dup1 fixed ω = 1 | 136 | −2628.349 | 18.0661 | 1.12E–05 | p0 = 0.7837 | ω0 = 0.0337 | ω0 = 0.0337 | |
| p1 = 0.1114 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.0919 | ω2a = 0.0337 | ω2a = 1 | |||||||
| p2b = 0.0131 | ω2b = 1 | ω2b = 1 | |||||||
| $1 | Dup1 | 137 | −2619.314 | p0 = 0.7921 | ω0 = 0.0412 | ω0 = 0.0412 | 253D (1.000) | ||
| p1 = 0.1929 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.0121 | ω2a = 0.0412 | ω2a = 6.4153 | |||||||
| p2b = 0.0029 | ω2b = 1 | ω2b = 6.4153 | |||||||
| $1 | Dup2 fixed ω = 1 | 136 | −2629.786 | 8.845 | 0.0016 | p0 = 0.7754 | ω0 = 0.0327 | ω0 = 0.0327 | |
| p1 = 0.1800 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.0362 | ω2a = 0.0327 | ω2a = 1 | |||||||
| p2b = 0.0084 | ω2b = 1 | ω2b = 1 | |||||||
| $1 | Dup2 | 137 | −2625.364 | p0 = 0.7966 | ω0 = 0.0396 | ω0 = 0.0396 | 253 H (0.978) | ||
| p1 = 0.1954 | ω1 = 1 | ω1 = 1 | |||||||
| p2a = 0.0064 | ω2a = 0.0396 | ω2a = 11.6666 | |||||||
| p2b = 0.0016 | ω2b = 1 | ω2b = 11.6666 |
Designations #1 and $1 indicate that the foreground branch is the lineage of entire clades or all lineages of the clade, respectively.
Type-I and type-II functional divergence estimated by Gu’s statistics.
| Type-I Divergence | Type-II Divergence | ||
|---|---|---|---|
| 0.039 ± 0.074 | 1.675 ± 0.380 | ||
| 0.074 (0.296) | 4.412 (<0.00001) | ||
| 0.462 ± 0.240 | 4.453 | ||
| LRT | 3.694 (0.055) | 1.456 | |
| 230.9/8.3/14.8 | |||
| 0.810/0.021/0.009 | |||
aThe estimate of functional divergence coefficient θ with standard error by a model-free method.
bMaximum likelihood estimate of θ and standard error.
cThe log score for the likelihood ratio test against the null θ = 0.
dThe estimate of functional divergence coefficient θ with standard error.
eThe ratio of radical change under functional divergence (a) versus nonfunctional divergence (π).
fThe ratio of proportion of radical change (G) versus conserved change (G);
gThe numbers of sites indicate no difference (N), conserved difference (C), and radical differences (R).
hProportion of no change (F00,N), radical change (F00,R), and conserved change (F00,C) of amino acids between clusters but “no change” within clusters.
Figure 3Site-specific profile for type-II functional divergence of DFR genes in Scutellaria.
Figure 4RT-PCR results of Scutellaria playfairii DFR Dup 1, Dup 2 and internal control (Actin).
(a) The light intensity (gray value) of amplified RT-PCR products analyzed using ImageJ. The error bar represented the standard error. (b) The amplified RT-PCR products were visualized in the agarose gel. L: leaf; F: mature flower; S: shoot apex; I: inflorescence buds; FB: flower buds; N: No-RT negative control.
Figure 5Lineage-through-time (LTT) plots inferred from trees of nonsynonymous and synonymous substitutions of DFR gene.
(a,b) represent topologies of nonsynonymous and synonymous trees, respectively, with the branch lengths corresponding to the relative times (denoted as the proportion of substitutions) in figures (c,d). (c,d) represent LTT plots estimated by reversible-jump Markov chain Monte Carlo (rjMCMC) method (black lines) and a constant birth-death stochastic branching process (SBP, blue lines) based on tree topologies of (A) and (C), respectively. Significant positives of γ-statistic for the SBP-LTT plots denote the late increase of diversification rate in both trees. The x-axes indicate relative time scale since DFR duplication.
γ-statistic of nonsynonymous and synonymous trees of DFR in Scutellaria.
| γ | P | |
|---|---|---|
| Nonsyn | 8.619 | 6.76E–18 |
| Nonsyn Dup1 | 5.690 | 1.27E–08 |
| Nonsyn Dup2 | 1.851 | 0.064 |
| Synonym | 8.017 | 1.08E–15 |
| Synonym Dup1 | 5.120 | 3.05E–07 |
| Synonym Dup2 | 2.142 | 0.032 |
Nonsyn: nonsynonymous tree of DFR.
Synonym: synonymous tree of DFR.