| Literature DB >> 24171911 |
Johannes A Hofberger1, Eric Lyons, Patrick P Edger, J Chris Pires, M Eric Schranz.
Abstract
Plants share a common history of successive whole-genome duplication (WGD) events retaining genomic patterns of duplicate gene copies (ohnologs) organized in conserved syntenic blocks. Duplication was often proposed to affect the origin of novel traits during evolution. However, genetic evidence linking WGD to pathway diversification is scarce. We show that WGD and tandem duplication (TD) accelerated genetic versatility of plant secondary metabolism, exemplified with the glucosinolate (GS) pathway in the mustard family. GS biosynthesis is a well-studied trait, employing at least 52 biosynthetic and regulatory genes in the model plant Arabidopsis. In a phylogenomics approach, we identified 67 GS loci in Aethionema arabicum of the tribe Aethionemae, sister group to all mustard family members. All but one of the Arabidopsis GS gene families evolved orthologs in Aethionema and all but one of the orthologous sequence pairs exhibit synteny. The 45% fraction of duplicates among all protein-coding genes in Arabidopsis was increased to 95% and 97% for Arabidopsis and Aethionema GS pathway inventory, respectively. Compared with the 22% average for all protein-coding genes in Arabidopsis, 52% and 56% of Aethionema and Arabidopsis GS loci align to ohnolog copies dating back to the last common WGD event. Although 15% of all Arabidopsis genes are organized in tandem arrays, 45% and 48% of GS loci in Arabidopsis and Aethionema descend from TD, respectively. We describe a sequential combination of TD and WGD events driving gene family extension, thereby expanding the evolutionary playground for functional diversification and thus potential novelty and success.Entities:
Keywords: Brassicaceae; comparative genomics; functional diversification; systems biology; whole-genome duplication
Mesh:
Substances:
Year: 2013 PMID: 24171911 PMCID: PMC3845643 DOI: 10.1093/gbe/evt162
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Retained At-α Ohnolog Duplicate Gene Pairs in Arabidopsis and Aethionema GS Pathway Inventory
| Protein Name | AGI | α-Block | Evident SSD | AabID | Syntelog | % Identity | Col-0 → Aab |
|---|---|---|---|---|---|---|---|
| Core-structure formation | |||||||
| UGT74C1 | AT2G31790 | A02N051 | TD | Aab37175 | Yes | 79.44 | 6 → 10 |
| [UGT like] | AT1G05670 | A02N051 | TD | Aab31930 | Yes | 81.11 | 6 → 10 |
| FMO-GSOX-2 | AT1G62540 | A03N117 | TD | Aab10869 | Yes | 76.6 | 11 → 8 |
| [FMO like] | AT1G12130 | A03N117 | TD | Aab13543 | Yes | 65.09 | 11 → 8 |
| CYP79F2 | AT1G16400 | A05N062 | TD | — | — | — | 8 → 9 |
| CYP79C1 | AT1G79370 | A05N062 | GTD | Aab34143 | Yes | 76.8 | 8 → 9 |
| SOT16 | AT1G74100 | A05N186 | TD | Aab14278 | Yes | 91.07 | 3 → 3 |
| SOT17 | AT1G18590 | A05N186 | Aab19675 | Yes | 86.05 | 3 → 3 | |
| SUR1 | AT2G20610 | A10N194 | Aab30136 | Yes | 89.15 | 3 → 2 | |
| [SUR like] | AT4G28420 | A10N194 | TD | Aab31155 | Yes | 57.93 | 3 → 2 |
| CYP79B2 | AT4G39950 | A10N257 | GTD | Aab17805 | Yes | 81.38 | 8 → 9 |
| CYP79B3 | AT2G22330 | A10N257 | GTD | Aab19477 | Yes | 81.4 | 8 → 9 |
| GGP1 | AT4G30530 | A10N314 | TD | Aab24374 | Yes | 87.6 | 5 → 5 |
| [GGP like] | AT2G23960 | A10N314 | TD | Aab11021 | Yes | 61.38 | 5 → 5 |
| GSTF11 | AT3G03190 | A12N102 | Aab14996 | Yes | 77.1 | 4 → 4 | |
| [GSTF12] | AT5G17220 | A12N102 | Aab14791 | Yes | 77.84 | 4 → 4 | |
| Cosubstrate pathways | |||||||
| [AAO3] | AT2G27150 | A02NOA1 | GTD | Aab27016 | 77.67 | 2 → 2 | |
| AAO4 | AT1G04580 | A02NOA1 | Aab24896 | Yes | 79.58 | 2 → 2 | |
| APK1 | AT2G14750 | A10NOA2 | Aab32150 | Yes | 83.75 | 2 → 2 | |
| APK2 | AT4G39940 | A10NOA2 | GTD | Aab17804 | Yes | 86.05 | 2 → 2 |
| Side-chain elongation | |||||||
| BCAT4 | AT3G19710 | A08N074 | Aab21007 | Yes | 75 | 6 → 6 | |
| [BCAT7] | AT1G50090 | A08N074 | TD | Aab22548 | Yes | 76.9 | 6 → 6 |
| IPMI1 | AT3G58990 | A11N226 | Aab13092 | Yes | 83 | 3 → 3 | |
| [IPMI like] | AT2G43090 | A11N226 | TD | Aab19619 | Yes | 85.99 | 3 → 3 |
| BCAT3 | AT3G49680 | A19N002 | Aab33782 | Yes | 76.02 | 6 → 6 | |
| [BCAT5] | AT5G65780 | A19N002 | TD | Aab23605 | Yes | 75.3 | 6 → 6 |
| BAT5 | AT4G12030 | A20N095 | Aab32285 | Yes | 76.21 | 2 → 2 | |
| [BAT like] | AT4G22840 | A20N095 | Aab23321 | Yes | 91.82 | 2 → 2 | |
| TF regulation | |||||||
| OBP2 | AT1G07640 | A02N142 | Aab18330 | Yes | 80.28 | 2 → 2 | |
| [OBP like] | AT2G28810 | A02N142 | Aab24559 | Yes | 70.03 | 2 → 2 | |
| MYB122 | AT1G74080 | A05N185 | Aab14276 | Yes | 57.56 | 6 → 4 | |
| MYB51 | AT1G18570 | A05N185 | Aab19683 | Yes | 59.89 | 6 → 4 | |
| IQD1 | AT3G09710 | A14N046 | Aab18852 | Yes | 65.59 | 2 → 2 | |
| [IQD2] | AT5G03040 | A14N046 | Aab18368 | Yes | 77.39 | 2 → 2 | |
| MYB28 | AT5G61420 | A26N034 | Aab12163 | Yes | 67.39 | 6 → 4 | |
| MYB29 | AT5G07690 | A26N034 | TD | Aab33585 | Yes | 65.13 | 6 → 4 |
| 36% TD (13/36) | 29% TD (10/35) | Ø 76.58 | |||||
| 14% GTD (5/36) | 14% GTD (5/35) | ||||||
aSquared brackets indicate ohnolog copies of GS biosynthetic genes without GO!-annotation to GS biosynthetic process.
bTD refers to members of TARs and GTD refers to a history of transposition in Arabidopsis.
cPredicted Aethionema CDS.
dChange of gene family locus count in Arabidospsis → Aethionema order.
GTDs in Arabidopsis and Aethionema GS Pathway Inventory
| Protein Name | AGI | α-Block | AabID | Syntelog | % Identity | Lineage Specific? | Col-0 → Aab |
|---|---|---|---|---|---|---|---|
| GS genes with retained α-ohnolog | |||||||
| [AAO3] | AT2G27150 | A02NOA1 | Aab27016 | Yes | 77.67 | No | 2 → 2 |
| CYP79C1 | AT1G79370 | A05N062 | Aab34143 | Yes | 76.8 | No | 8 → 9 |
| APK2 | AT4G39940 | A10NOA2 | Aab17804 | Yes | 86.05 | No | 2 → 2 |
| CYP79B2 | AT4G39950 | A10N257 | Aab17805 | Yes | 81.38 | No | 8 → 9 |
| CYP79B3 | AT2G22330 | A10N257 | Aab19477 | Yes | 81.4 | No | 8 → 9 |
| GS genes with tandem duplicate copy | |||||||
| AOP1 | AT4G03070 | A01 | Aab37231 | Yes | 70.03 | No | 2 → 1 |
| AOP3 | AT4G03050 | A01 | — | — | — | 2 → 1 | |
| CYP79F1 | AT1G16410 | A05 | Aab27579 | Yes | 72.79 | 8 → 9 | |
| CYP79C2 | AT1G58260 | A03 | Aab17711 | Yes | 71.85 | 8 → 9 | |
| A11 | Aab22600 | No | 61.8 | 8 → 9 | |||
| CYP83A1 | AT4G13770 | A15 | Aab32506 | Yes | 69.67 | No | 2 → 3 |
| — | Aab30975 | No | 82.8 | 2 → 3 | |||
| CYP81F2 | AT5G57220 | A22 | — | — | — | 2 → 1 | |
| GS genes without α-ohnolog or tandem duplicate copy | |||||||
| UGT74B1 | AT1G24100* | A05 | Aab07826 | Yes | 70.35 | No | 6 → 10 |
| A05 | Aab07827 | Yes | 80.65 | 6 → 10 | |||
| IMD3 | AT1G31180 | A06 | — | — | — | 2 → 1 | |
| CYP83B1 | AT4G31500 | A10 | Aab12019 | No | 92.73 | No | 2 → 3 |
| GSL-OH | AT2G25450 | A10 | — | — | — | 1 → 0 | |
| IMD1 | AT5G14200 | A12 | Aab14760 | Yes | 89.5 | No | 2 → 1 |
| CYP79A2 | AT5G05260* | A14 | Aab36760 | Yes | 73.37 | No | 8 → 9 |
| CHY1 | AT5G65940* | A19 | Aab05851 | Yes | 80.75 | No | 2 → 2 |
| FMO-GSOX-1 | AT1G65860* | A25 | Aab30109 | Yes (minimum) | 58.45 | No | 11 → 8 |
| 24% TD (4/17) | 18% TD (3/17) | Ø 76.78% | |||||
| 29% retained α-ohnolog (5/17) | 29% retained α-ohnolog (5/17) | ||||||
aAsterisks mark GTDs inferred by flanking TE-like sequences using GEVo.
bPredicted Aethionema CDS.
cChange of gene family locus count in Arabidospsis → Aethionema order.
Genes with Nonretained At-α Ohnolog Duplicate Gene Copy in Arabidopsis and Aethionema GS Pathway Inventory
| Protein Name | AGI | α-Block | Evident SSD | AabID | Syntelog | % Identity | Col-0 → Aab |
|---|---|---|---|---|---|---|---|
| Core-structure formation | |||||||
| AOP1 | AT4G03070 | A01 | TD/GTD | Aab37231 | Yes | 70.03 | 2 → 1 |
| AOP3 | AT4G03050 | A01 | TD/GTD | — | — | — | 2 → 1 |
| UGT74-like | Aab specific | A02 | TD | Aab37178 | Yes | 82.05 | 6 → 10 |
| UGT74-like | Aab specific | A02 | TD | Aab37179 | Yes | 77.63 | 6 → 10 |
| UGT74-like | Aab specific | A02 | TD | Aab37180 | Yes | 78.33 | 6 → 10 |
| GSTF10 | AT2G30870 | A02 | TD | Aab28612 | Yes | 91.59 | 4 → 4 |
| FMO-GSOX-3 | AT1G62560 | A03 | TD | Aab10867 | Yes | 71.9 | 11 → 8 |
| FMO-GSOX-4 | AT1G62570 | A03 | TD | Aab10866 | Yes | 55.2 | 11 → 8 |
| FMO-GSOX-5 | AT1G12140 | A03 | TD | Aab13546 | Yes | 71.9 | 11 → 8 |
| CYP79C2 | AT1G58260 | A03 | TD | Aab17711 | Yes | 71.85 | 8 → 9 |
| Aab specific | A11 | Aab22600 | No | 61.8 | 8 → 9 | ||
| CYP79F1 | AT1G16410 | A05 | TD | Aab27579 | Yes | 72.79 | 8 → 9 |
| SOT18 | AT1G74090 | A05 | TD | Aab14277 | Yes | 83.9 | 3 → 3 |
| GSTU20 | AT1G78370 | A05 | TD | Aab7000 | Yes | 67.29 | 5 → 6 |
| Aab specific | Aab6995 | Yes | 48.86 | 5 → 6 | |||
| UGT74B1 | AT1G24100 | A05 | GTD | Aab07826 | Yes | 70.35 | 6 → 10 |
| Aab specific | Aab07827 | Yes | 80.65 | 6 → 10 | |||
| CYP83B1 | AT4G31500 | A10 | GTD | Aab12019 | No | 92.73 | 2 → 3 |
| GSL-OH | AT2G25450 | A10 | GTD | — | — | — | 1 → 0 |
| CYP79A2 | AT5G05260 | A14 | GTD | Aab36760 | Yes | 73.37 | 8 → 9 |
| CYP83A1 | AT4G13770 | A15 | GTD | Aab32506 | Yes | 69.67 | 2 → 3 |
| Aab specific | Aab30975 | No | 82.8 | 2 → 3 | |||
| CYP81F2 | AT5G57220 | A22 | TD/GTD | — | — | — | 2 → 1 |
| FMO-GSOX-1 | AT1G65860 | A25 | GTD | Aab30109 | Yes | 58.45 | 11 → 8 |
| Cosubstrate pathways | |||||||
| CHY1 | AT5G65940 | A19 | GTD | Aab05851 | Yes | 80.75 | 2 → 2 |
| GSH1 | AT4G23100 | A20 | NA | Aab22781 | Yes | 91.81 | 2 → 2 |
| BZO1 | AT1G65880 | A25 | TD | Aab31601 | Yes | 70.04 | 2 → 4 |
| TD | Aab31602 | Yes | 69.4 | 2 → 4 | |||
| Side-chain elongation | |||||||
| IMD3 | AT1G31180 | A06 | GTD | — | — | — | 2 → 1 |
| IPMI2 | AT2G43100 | A11 | TD | Aab19630 | Yes | 78.71 | 3 → 3 |
| IMD1 | AT5G14200 | A12 | GTD | Aab14760 | Yes | 89.5 | 2 → 1 |
| IIL1 | AT4G13430 | A15 | NA | Aab18132 | Yes | 93.9 | 1 → 1 |
| TF regulation | |||||||
| MYB76 | AT5G07700 | A26 | TD | — | — | — | 6 → 4 |
| 60% TD (15/25) | 57% TD (16/28) | Ø 76.46% | |||||
| 48% GTD (12/25) | 39% GTD (11/28) | ||||||
Note.—NA, not applicable.
aTD (Tandem Dupicate) refers to members of tandem arrays and GTD (Gene Transposition Duplication) refers to a history of transposition in Arabidopsis.
bPredicted Aethionema CDS
cChange of gene family locus count in Arabidospsis → Aethionema order
Genes Not Covered by α-Blocks in Arabidopsis and Aethionema GS Pathway Inventory
| Protein Name | AGI | α-Block | Evident SSD | AabID | Syntelog | % Identity | Col-0 → Aab |
|---|---|---|---|---|---|---|---|
| Side-chain elongation | |||||||
| MAM1 | AT5G23010 | — | TD | Aab12229 | Yes | 72.31 | 2 → 4 |
| Aab12230 | Yes | 71.5 | 2 → 4 | ||||
| MAM-L | AT5G23020 | — | TD | Aab12225 | Yes | 70.67 | 2 → 4 |
| Aab12226 | Yes | 68.36 | 2 → 4 | ||||
| TF regulation | |||||||
| MYB34 | AT5G60890 | — | NA | — | — | — | 6 → 4 |
| 66% TD (2/3) | 100% TD (4/4) | Ø 70.71% | |||||
| 0% GTD | 0% GTD | ||||||
aSquared brackets indicate ohnolog copies of GS biosynthetic genes without GO!-annotation to GS biosynthetic process.
bTD refers to members of TARs and GTD refers to a history of transposition in Arabidopsis.
cPredicted Aethionema CDS.
dChange of gene family locus count in Arabidospsis → Aethionema order.
FDistribution of GS pathway inventory relative to At-α WGD event. AtGS genes are shown before (left) and after (right) interpolation of ohnolog duplicate copies. We hypothesize functional redundancy of 12 additional ohnologs to canonical GS biosynthetic genes.
FDuplicate distribution among (A, B) Arabidopsis protein-coding genes compared with (C, D) AtGS and (E, F) Aethionema GS loci. Shown are retained ohnologs (green), tandem duplicates (blue), and gene transposition duplicates (orange). GS metabolic versatility resulted from a combination of increased ohnolog retention and TD rates.
FIdeogram of Arabidopsis thaliana chromosomes with GS biosynthetic genes. Circos plot visualizing the evolutionary contribution of different duplication types to GS pathway inventory in Arabidopsis and Aethionema. (A) Inner chromosome scale (Mb). (B) Arabidopsis thaliana GS biosynthetic genes. Gray text indicates genomic location outside ohnolog blocks. Black text indicates genomic location within ohnolog blocks but nonretained ohnolog copy. Green text indicates retained pairs of ohnolog copies with missing GO annotation to GS biosynthetic process shown in edged brackets. Orange text indicates single copy genes without clear paralogs in both species. (C) Blue circles indicate genes organized in TARs (i). Red circles indicate genes with transpositional history (ii). Purple circles indicate loci sharing (i) and (ii). (D) Number of rectangles indicates number of homologs present in the Aethionema arabicum draft genome (0–4). Color of rectangles indicates presence (black) or absence (red) of synteny between A. thaliana and Aet. arabicum in the genomic context of the target gene. (E) Arabidopsis thaliana chromosomes with labels showing GS biosynthetic genes. Bands for genes retained in ohnolog pairs are connected with colors of corresponding ohnolog blocks, as defined by Bowers et al. (2003). (F) Genomic location of ohnolog block copies harboring GS biosnthetic genes in A. thaliana, connected by gray bands. All ranges are in scale.
Intraspecies Protein Similarities for At-α Ohnolog Pairs Sharing GS Annotation, Shown with Differential Expression in Arabidopsis Following MeJA Treatment
Note.—MeJA, methyljasmonic acid; NA, not applicable.
aSquared brackets indicate ohnolog copies of GS biosynthetic genes without GO! annotation to GS biosynthetic process.
bGO! column indicates if gene is part of canonical GS pathway inventory set (Sonderby et al. 2010).
cPredicted Aethionema CDS.
dWhole WT plant averages of log-transferred expression change in Arabidopsis.
Intraspecies Protein Similarities for At-α Ohnolog Pairs Not Sharing GS Annotation, Shown with Differential Expression in Arabidopsis Following MeJA Treatment
Note.—MeJA, methyljasmonic acid.
aSquared brackets indicate ohnolog copies of GS biosynthetic genes without GO! annotation to GS biosynthetic process.
bGO! column indicates if gene is part of canonical GS pathway inventory set (Sonderby et al. 2010).
cPredicted Aethionema CDS.
dWhole WT plant averages of log-transferred expression change in Arabidopsis.
Putative Single-Copy Genes in Aethionema and Arabidopsis GS Pathway Inventory
| AGI | Name | α-Block | Retained At-ß/-γ Ohnolog | Most Ancient Syntelog | Closest Paralog | BlastP | Name | α-Block | Retained At-ß/-γ Ohnolog |
|---|---|---|---|---|---|---|---|---|---|
| AT4G13430 | A15 | No | AT4G26970 | 1.00E−16 | A22N121 | No | |||
| AT4G23100 | A20 | No | AT1G19220 | 0.19 | A05 | No | |||
| AT5G60890 | — | B20N001 | AT1G74080 | 4.00E−62 | A05N185 | B20N004 |
aGene transpositional duplicate copy.
bAbsent in Aethionema.
Tandem Duplicate Genes in Arabidopsis and Aethionema GS Pathway Inventory
| Protein Name | AGI | α-Block | AabID | Syntelog | % Identity | Lineage Specific? | Col-0 → Aab |
|---|---|---|---|---|---|---|---|
| Tandem duplicates of syntenic anchor genes retaining an At-α ohnolog | |||||||
| UGT74C1 | AT2G31790 | Aab37175 | Yes | 79.44 | No | 6 → 10 | |
| Aab37178 | Yes | 82.05 | 6 → 10 | ||||
| Aab37179 | Yes | 77.63 | 6 → 10 | ||||
| Aab37180 | Yes | 78.95 | 6 → 10 | ||||
| UGT74D1_oa | AT2G31750 | Aab37181 | Yes | 78.33 | (no GS gene) | 6 → 10 | |
| [UGT-like] | AT1G05670 | Aab31930 | Yes | 81.11 | No | 6 → 10 | |
| UGT-like_oa | AT1G05675 | A02 | Aab31932 | Yes | 71.4 | (no GS gene) | 6 → 10 |
| UGT74E2_oa | AT1G05680 | Aab31933 | Yes | 59.8 | (no GS gene) | 6 → 10 | |
| FMO-GSOX-2 | AT1G62540 | A03N117 | Aab10869 | Yes | 76.6 | No | 11 → 8 |
| FMO-GSOX-3 | AT1G62560 | A03 | Aab10867 | Yes | 71.9 | No | 11 → 8 |
| FMO-GSOX-4 | AT1G62570 | A03 | Aab10866 | Yes | 55.2 | No | 11 → 8 |
| FMO-like_oa | AT1G62580 | A03 | — | — | — | (no GS gene) | 10 → 7 |
| FMO-like_oa | AT1G62600 | A03 | — | — | — | (no GS gene) | 10 → 7 |
| FMO-like_oa | AT1G62620 | A03 | — | — | — | (no GS gene) | 10 → 7 |
| [FMO-like] | AT1G12130 | A03N117 | Aab13543 | Yes | 65.09 | No | 11 → 8 |
| FMO-GSOX-5 | AT1G12140 | A03 | Aab13546 | Yes | 71.9 | No | 11 → 8 |
| FMO-like_oa | AT1G12200 | A03 | Aab13549 | Yes | 66.66 | (no GS gene) | 10 → 7 |
| CYP79F2 | AT1G16400 | A05N062 | — | — | — | 8 → 9 | |
| CYP79F1 | AT1G16410 | A05 | Aab27579 | Yes | 72.79 | 8 → 9 | |
| SOT16 | AT1G74100 | A05N186 | Aab14278 | Yes | 91.07 | No | 3 → 3 |
| SOT18 | AT1G74090 | A05 | Aab14277 | Yes | 83.9 | No | 3 → 3 |
| GSTU20 | AT1G78370 | A05 | Aab07000 | Yes | 67.29 | No | 5 → 6 |
| GSTU23_oa | AT1G78320 | A05 | Aab06994 | Yes | 81.74 | (no GS gene) | 5 → 6 |
| GSTU22_oa | AT1G78340 | A05 | Aab06997 | Yes | 71.1 | (no GS gene) | 5 → 6 |
| GSTU21_oa | AT1G78360 | A05 | Aab06998 | Yes | 76.71 | (no GS gene) | 5 → 6 |
| GSTU19_oa | AT1G78380 | A05N104 | Aab06999 | Yes | 83.41 | (no GS gene) | 5 → 6 |
| [BCAT7] | AT1G50090 | A08N074 | Aab22548 | Yes | 69 | No | 6 → 6 |
| BCAT-like_oa | AT1G50110 | A08 | Aab22550 | Yes | 78.12 | (no GS gene) | 6 → 6 |
| [SUR-like] | AT4G28420 | A10N194 | Aab31155 | Yes | 47.42 | No | 3 → 2 |
| A10 | Aab31154 | Yes | |||||
| SUR-like_oa | AT4G28410 | A10 | Aab31153 | Yes | 63.96 | (no GS gene) | 3 → 2 |
| GGP1 | AT4G30530 | A10N314 | Aab24374 | Yes | 87.6 | No | 5 → 5 |
| GGP-like_oa | AT4G30540 | A10 | Aab24373 | Yes | 75 | (no GS gene) | 5 → 5 |
| GGP3_oa | AT4G30550 | A10 | Aab24372 | Yes | 82 | (no GS gene) | 5 → 5 |
| [GGP-like] | AT2G23960 | A10N314 | Aab11021 | Yes | 69.67 | No | 5 → 5 |
| GGP-like _oa | AT2G23970 | A10 | Aab11018 | Yes | 83.6 | (no GS gene) | 5 → 5 |
| [IPMI-like] | AT2G43090 | A11N226 | Aab19619 | Yes | 85.99 | No | 3 → 3 |
| IPMI2 | AT2G43100 | A11 | Aab19630 | Yes | 78.71 | No | 3 → 3 |
| [BCAT5] | AT5G65780 | A19N002 | Aab23605 | Yes | 75.3 | No | 6 → 6 |
| LINC4_oa | AT5G65770 | A19 | Aab23607 | Yes | 70.08 | (no GS gene) | 6 → 6 |
| MYB29 | AT5G07690 | A26N034 | Aab33585 | Yes | 65.13 | 6 → 4 | |
| MYB76 | AT5G07700 | A26 | — | — | — | 6 → 4 | |
| Tandem duplicates of genes inside the boundaries of α-blocks with nonretained At-αohnolog | |||||||
| AOP1 | AT4G03070 | A01 | Aab37231 | Yes | 70.03 | 2 → 1 | |
| AOP3 | AT4G03050 | A01 | — | — | — | 2 → 1 | |
| GSTF10 | AT2G30870 | A02 | Aab28612 | Yes | 91.59 | No | 4 → 4 |
| GSTF9_oa | AT2G30860 | A02 | Aab28613 | Yes | 89.76 | (no GS gene) | 4 → 4 |
| CYP79C2 | AT1G58260 | A03 | Aab17711 | Yes | 71.85 | No | 8 → 9 |
| CYP-like_oa | AT1G58265 | A03 | Aab17712 | Yes | 60.71 | (no GS gene) | 8 → 9 |
| UGT74B1 | AT1G24100 | A05 | Aab07827 | Yes | 80.65 | 6 → 10 | |
| A05 | Aab07826 | Yes | 70.35 | 6 → 10 | |||
| CYP81F2 | AT5G57220 | A22 | — | — | — | 2 → 1 | |
| CYP71B10 _oa | AT5G57260 | A22 | Aab25774 | Yes | 73.21 | (no GS gene) | 2 → 1 |
| BZO1 | AT1G65880 | A25 | Aab31601 | Yes | 70.04 | No | 2 → 4 |
| Aab31602 | Yes | 69.4 | 2 → 4 | ||||
| BZO-like_oa | AT1G65890 | A25 | Aab31603 | Yes | 68.85 | (no GS gene) | 2 → 4 |
| Aab31604 | Yes | 67.83 | (no GS gene) | 2 → 4 | |||
| Tandem duplicates of genes outside the boundaries of α-blocks | |||||||
| MAM1 | AT5G23010 | — | Aab12229 | Yes | 72.31 | No | 2 → 4 |
| Aab12230 | Yes | 71.5 | 2 → 4 | ||||
| MAM-L | AT5G23020 | — | Aab12225 | Yes | 70.67 | No | 2 → 4 |
| Aab12226 | Yes | 68.36 | 2 → 4 | ||||
| AtGS genes: 45% TD (29/64) | AabGS genes: 46% TD (31/67) | Ø 73.43% | |||||
aSquared brackets indicate ohnolog copies of GS biosynthetic genes without GO! annotation to GS biosynthetic process.
bThe “_oa” suffix indicates tandem duplicate copies without GO! annotation to GS biosynthetic process. These genes have not been considered for the tandem duplicate count of GS loci in both organisms.
cUnderlined items refer to offspring of one pre-α TD event.
dPredicted Aethionema CDS.
eIn case of Aethionema-specific TAR expansion, the corresponding Arabidopsis sequence for identity comparison was determined based on both genomic location and homology criteria.
fChange of gene family locus count in Arabidospsis → Aethionema order.
FPhylogenetic relationships among FMO proteins. Col-0, Aab, and papaya refer to Arabidopsis thaliana (circles), Aethionema arabicum (triangles), and Carica papaya (colorless), respectively. Boxes indicate annotation to GS metabolic activity in Arabidopsis. Tarenaya hasslerania (diamonds) represents the closest-related outgroup of Brassicaceae, including the sister clade Aethionemae. Stars: At-α WGD event. Blue: proteins encoded by members of TARs. Red: protein encoded by GTD locus. The At-α WGD leads to duplication of a FMO locus, resulting in two clades comprising all FMO-like sequences of both Aethionema and Arabidopsis. Thus, FMO versatility has been promoted by a combination of increased degree of ohnolog retention and TD events.
Statistical Test on Duplicate Fractions in Arabidopsis and Aethionema GS Pathway Inventory Compared with Genome-Wide Average in Arabidopsis
| Protein-coding genes | 27,206 | 64 | 67 |
| Retained At-α ohnologs | 6,038/22% | 36/56% | 35/52% |
| 3.87E−09 | 1.26E−07 | ||
| Tandem duplicates | 4,022/15% | 29/45% | 32/48% |
| 5.71E−09 | 9.14E−10 | ||
| GTDs | 3,879/14% | 17/27% | 18/27% |
| 0.01066 | 0.07462 | ||
| Sum duplicates | 12,132/45% | 61/95% | 65/97% |
| 2.20E−16 | 2.20E−16 |
aFisher's exact test on count data.
bExtended set (fig. 1).