| Literature DB >> 24962214 |
Shanshan Chu, Jiao Wang, Hao Cheng, Qing Yang1, Deyue Yu.
Abstract
BACKGROUND: Previous studies suggest that the metabolic pathway structure influences the selection and evolution rates of involved genes. However, most of these studies have exclusively considered a single gene copy encoding each enzyme in the metabolic pathway. Considering multiple-copy encoding enzymes could provide direct evidence of gene evolution and duplication patterns in metabolic pathways. We conducted a detailed analysis of the phylogeny, synteny, evolutionary rate and selection pressure of the genes in the isoflavonoid metabolic pathway of soybeans.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24962214 PMCID: PMC4076065 DOI: 10.1186/1471-2156-15-76
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1The isoflavonoid phytoalexin synthesis pathway in soybeans. Terminal productions of the phenylpropanoid pathway are in brackets. The most upstream and downstream enzymes are indicated in blue. The branch-point enzymes are indicated in red. Metabolic node substances are labeled in bold. Dotted arrows represent multiple or unclear steps.
The number of genes of each gene family in the isoflavonoid pathway in various plants and the number of ancestral genes in the most recent common ancestor (MRCA) of different lineages
| 8 | 6 | 6 | 5 | 4 | 11 | 9 | 7 | 4 | 1 | 1 | |
| 4 | 3 | 2 | 3 | 1 | 2 | 4 | 3 | 3 | 2 | 2 | |
| 9 | 5 | 4 | 6 | 6 | 4 | 7 | 6 | 5 | 4 | 3 | |
| 13 | 11 | 21 | 4 | 4 | 14 | 15 | 12 | 4 | 3 | 2 | |
| 8 | 7 | 9 | 5 | 5 | 3 | 4 | 6 | 5 | 3 | 3 | |
| 2 | 2 | 4 | 6 | 0 | 0 | 0 | 4 | 1 | 0 | 0 | |
| 2 | 3 | 3 | 2 | 0 | 0 | 0 | 3 | 1 | 0 | 0 | |
| 17 | 6 | 12 | 4 | 0 | 0 | 0 | 10 | 3 | 0 | 0 | |
| 8 | 7 | 4 | 6 | 0 | 0 | 0 | 6 | 5 | 0 | 0 |
aAncestral genes in the MRCA of legumes.
bAncestral genes in the MRCA of dicots.
cAncestral genes in the MRCA of dicots and monocots.
Figure 2Phylogenic relationship of the gene family members from 7 species. Each species was labeled with different shapes as presented in the figure. Genes reported from the NCBI are highlighted with a blue triangle. The genes sequenced in our study are highlighted with a red dot. The red circles at the nodes represent ancestral genes in the MRCA of legumes. The red triangles and rectangles represent ancestral genes in the MRCA of dicots and those of dicots and monocots, respectively. The nodes with bootstrapping lower than 50% are not shown. Each red square bracket represents one recent segmental duplication; each blue square bracket represents one old segmental duplication.
Date calculations for segmental duplication events in soybeans
| Segments containing | | | |
| Chr 3 & Chr 19 | 10 | 0.17 ± 0.04 | 14 |
| Chr 10 & Chr 13 | 10 | 0.14 ± 0.03 | 11 |
| Chr 3 & Chr 10 | 5 | 0.60 ± 0.15 | 49 |
| Chr 3 & Chr 13 | 5 | 0.62 ± 0.25 | 51 |
| Chr 19 & Chr 10 | 4 | 0.58 ± 0.05 | 48 |
| Chr 19 & Chr 13 | 4 | 0.66 ± 0.22 | 54 |
| Chr 10–2 & Chr 20 | 10 | 0.12 ± 0.02 | 10 |
| Segments containing | | | |
| Chr 2 & Chr 14 | 10 | 0.14 ± 0.02 | 11 |
| Chr 10 & Chr 20 | 10 | 0.15 ± 0.03 | 12 |
| Segments containing | | | |
| Chr 17 & Chr 13 | 10 | 0.16 ± 0.08 | 13 |
| Chr 13–2 & Chr 15 | 10 | 0.12 ± 0.04 | 10 |
| Chr 1 & Chr 11 | 4 | 0.10 ± 0.04 | 8 |
| Segments containing | | | |
| Chr 8 & Chr 5 | 10 | 0.14 ± 0.05 | 11 |
| Chr 1 & Chr 2 | 8 | 0.17 ± 0.07 | 14 |
| Chr 1–2 &Chr 11 | 10 | 0.10 ± 0.01 | 8 |
| Segments containing | | | |
| Chr 14 &Chr 2 | 10 | 0.12 ± 0.04 | 10 |
| Segments containing | | | |
| Chr 10 & Chr 20 | 10 | 0.18 ± 0.13 | 15 |
| Chr 4 & Chr 6 | 10 | 0.17 ± 0.05 | 14 |
| Chr 13 & Chr 15 | 7 | 0.21 ± 0.10 | 17 |
| Segments containing | | | |
| Chr 7 & Chr13 | 10 | 0.18 ± 0.11 | 15 |
| Segments containing | | | |
| Chr 18 & Chr 8 | 10 | 0.22 ± 0.11 | 18 |
| Chr 10 & Chr 20 | 10 | 0.14 ± 0.03 | 11 |
| Segments containing | | | |
| Chr 1 & Chr11 | 10 | 0.18 ± 0.13 | 15 |
Abbreviation: mya, million years ago.
Gene duplication patterns of gene families in the isoflavonoid biosynthesis pathway
| Segment duplicationa | Old | 1 | | | | | | | | |
| | Recent | 3(6)c | 2(4) | 3(6) | 4(7) | 2(3) | 3(6) | 1(2) | 2(4) | 2(3) |
| Tandem duplicationb | Old | | | 1 | | | 2 | | | 1 |
| Total number of duplicationsd | Recent | 1 | 4 | 2 | 5 | 4 | 8 | 2 | 5 | 2 |
| 8 | 9 | 12 | 14 | 9 |
aThe total number of segment duplications.
bThe total number of tandem duplications.
cThe number in the parentheses is the number of loci retainedafter recent segment duplication; Tandemly duplicated genes were considered one locus.
dThe total number of gene copies originating from segmental and tandem duplications.
Figure 3Synteny analyses for the gene family. Green solid and dotted lines represent soybean and M. truncatula chromosomes, respectively. Target genes are indicated by red arrows.
Differential evolutionary pattern between upstream and downstream genes
| Upstream | PAL | Glyma19g36620( | 0.16 | 0.07 | 0.04* | 0.01 | 0.01 | 0.01** | 0.63 | 0.28 | 0.46 | 0.02 | 0.00 |
| | | Glyma03g33890( | 0.03 | | | 0.02 | | | 0.08 | | | 0.28 | 0.00 |
| | | Glyma10g06600( | 0.07 | | | 0.00 | | | 0.30 | | | 0.00 | 0.00 |
| | C4H | Glyma14g38580 | 0.09 | | | 0.00 | | | 0.41 | | | 0.00 | 0.00 |
| | | Glyma02g40290 | 0.00 | | | 0.00 | | | 0.00 | | | 0/0 | 0.00 |
| Downstream | IOMT | Glyma13g24210 | 0.22 | 0.28 | | 0.18 | 0.29 | | 0.35 | 0.26 | | 0.51 | 0.00 |
| | | Glyma18g50290 | 0.44 | | | 0.37 | | | 0.68 | | | 0.54 | 0.00 |
| | IFR | Glyma01g37840 | 0.20 | | | 0.26 | | | 0.00 | | | 0.26e-2/0 | 1.13 |
| Glyma04g01380 | 0.27 | 0.35 | 0.00 | 0.35e-2/0 | 25.15** |
aT-test of the π values between upstream and downstream genes.
bT-test of thedN values between upstream and downstream genes.
cT-test of thedS values between upstream and downstream genes.
LRT statistics are twice the log-likelihood differences between M7 and M8.
*0.01 < P < 0.05; **0.001 < P < 0.01.
Evolutionary pattern of genes encoding for branch-point enzymes
| Higher | 4CL | Glyma17g07190( | 0.18 | 0.14 | 0.30 | 0.48 | 0.00 |
| | | 0.40 | 0.39 | 0.40 | 0.96 | 28.98** | |
| | | 0.09 | 0.12 | 0.00 | 0.12e-2/0 | 66.60** | |
| | CHS | 0.17 | 0.18 | 0.12 | 1.49 | 112.46** | |
| | | Glyma01g43880( | 0.03 | 0.04 | 0.00 | 0.04e-2/0 | 0.00 |
| | | Glyma11g01350( | 0.07 | 0.00 | 0.29 | 0.00 | 0.00 |
| | CHI | Glyma10g43850( | 0.00 | 0.00 | 0.00 | 0/0 | 0.00 |
| | | 0.40 | 0.23 | 0.97 | 0.24 | 60.41** | |
| | | Glyma06g14820( | 0.00 | 0.00 | 0.00 | 0/0 | 0.00 |
| | | Glyma13g33730( | 0.79 | 0.58 | 1.49 | 0.39 | 0.40 |
| Lower | CHR | Glyma02g47750 | 0.27 | 0.00 | 1.29 | 0.00 | 0.00 |
| | | Glyma14g00870 | 0.11 | 0.07 | 0.25 | 0.30 | 0.00 |
| | IFS | Glyma07g32330( | 0.22 | 0.12 | 0.56 | 0.21 | 0.00 |
| Glyma13g24200( | 0.17 | 0.12 | 0.32 | 0.38 | 0.00 |
LRT statistics are twice the log-likelihood differences between M7 and M8.
Significant positive sites are labeled in bold.
*0.01 < P < 0.05; **0.001 < P < 0.01.