| Literature DB >> 34290729 |
Seongmin Hong1,2, Yong Pyo Lim2, Suk-Yoon Kwon3, Ah-Young Shin3, Yong-Min Kim1.
Abstract
In polyploids, whole genome duplication (WGD) played a significant role in genome expansion, evolution and diversification. Many gene families are expanded following polyploidization, with the duplicated genes functionally diversified by neofunctionalization or subfunctionalization. These mechanisms may support adaptation and have likely contributed plant survival during evolution. Flowering time is an important trait in plants, which affects critical features, such as crop yields. The flowering-time gene family is one of the largest expanded gene families in plants, with its members playing various roles in plant development. Here, we performed genome-wide identification and comparative analysis of flowering-time genes in three palnt families i.e., Malvaceae, Brassicaceae, and Solanaceae, which indicate these genes were expanded following the event/s of polyploidization. Duplicated genes have been retained during evolution, although genome reorganization occurred in their flanking regions. Further investigation of sequence conservation and similarity network analyses provide evidence for functional diversification of duplicated genes during evolution. These functionally diversified genes play important roles in plant development and provide advantages to plants for adaptation and survival in response to environmental changes encountered during evolution. Collectively, we show that flowering-time genes were expanded following polyploidization and retained as large gene family by providing advantages from functional diversification during evolution.Entities:
Keywords: copy number variation; flowering-time gene; functional diversification; polyploidization; whole genome duplication
Year: 2021 PMID: 34290729 PMCID: PMC8288248 DOI: 10.3389/fpls.2021.702243
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Classification of flowering genes by their functions in the FLOR-ID database.
| Aging | 4 | RAP2.7, SPL9, TOE2, and TPL | |
| Ambient temperature | 2 | TPL, FCA | |
| Circadian Clock | 5 | ELF4, LHY, PRR7, SRR1, and ZTL | |
| General process and autonomous pathway | Biotic stress | 1 | PUB13 |
| Cell cycle and DNA replication | 2 | GIS5, TIL1 | |
| Chromatin modifications | 37 | AHL22, ASHH1, ATX1, ATX2, BRM, CHC1, CLF, EFS, EMF2, FCA, FLD, FVE, FWA, HAM1, HDA05, HMG, HTA11, HUB2, INO80, JMJ14, JMJD5, LDL2, MBD9, MOS1, MRG1, MRG2, MSI1, PIE1, REF6, SDG25, SEF, SUVR5, TFL2, TRO, VEL1, VIM1, and VIM2 | |
| Control of transcription | 7 | CCT, CDKC2, GCT, NDX, SPT16, VIP3, and VIP5 | |
| mRNA & microRNA processing | 18 | ABH1, CBP20, CSTF64, CSTF77, DCL1, DCL3, DCL4, FLK, FPA, FY, GRP2, HLP1, HUA2, HULK2, LIF2, RRP6L1, RRP6L2, and SUS2 | |
| Multiple processes | 1 | RCD1 | |
| Protein stability control | 7 | CUL4, J3, SIZ1, UBC1, UBP12, UBP26, and ULP1D | |
| Unknown processes | 1 | LD | |
| Hormones | 5 | ATH1, GA1, GA20OX3, GID1B, and RGA1 | |
| Photoperiodism, light perception, and signaling | 34 | AFR2, AGL15, AGL18, AS1, AT-STUbl4, BBX19, CDF2, CIB2, CO, COP1, CPK6, CRY1, CRY2, EBS, ELF4, FAR1, FD, FTIP1, HB16, HOS1, LHY, MYR1, NF-YC4, PHYB, PHYC, PHYD, RAP2.7, SPA1, SRR1, STO, TEM1, TOE2, TPL, and ZTL | |
| Sugar | 8 | ADG1, HXK1, KIN10, NUC, PGM, SUC9, SUS4, and TPS1 | |
| Vernalization | 10 | ASHH3, CLF, FES1, LRB1, MAF1, MAF5, PHP, VEL1, VRN1, and WRKY34 |
FIGURE 1Identification of flowering-time genes. Flowering-time genes were identified by their domain architectures, and domain coverage for each gene was calculated. Functions of flowering-time genes were predicted from the FLOR-ID database.
Flowering-time genes identified in this study.
| Brassicaceae | 757 | 27,655 | 2.74 | |
| Brassicaceae | 2,094 | 101,040 | 2.07 | |
| Brassicaceae | 717 | 35,400 | 2.03 | |
| Brassicaceae | 1,071 | 46,221 | 2.32 | |
| Malvaceae | 537 | 29,452 | 1.82 | |
| Malvaceae | 521 | 18,848 | 2.76 | |
| Malvaceae | 368 | 29,356 | 1.25 | |
| Malvaceae | 1,096 | 35,643 | 3.07 | |
| Malvaceae | 2,133 | 87,603 | 2.43 | |
| Malvaceae | 983 | 37,505 | 2.62 | |
| Malvaceae | 968 | 40,960 | 2.36 | |
| Malvaceae | 795 | 38,871 | 2.05 | |
| Malvaceae | 1,896 | 74,561 | 2.54 | |
| Malvaceae | 1,898 | 78,303 | 2.42 | |
| Malvaceae | 1,904 | 75,376 | 2.53 | |
| Malvaceae | 1,946 | 74,699 | 2.61 | |
| Malvaceae | 1,916 | 78,338 | 2.45 | |
| Solanaceae | 514 | 34,899 | 1.47 | |
| Solanaceae | 684 | 34,688 | 1.97 | |
| Total | - | 22,798 | 979,418 | - |
FIGURE 2Conservation scores of conserved and variable proteins. (A) and (B) The top 20 (A) conserved or (B) variable proteins were selected by their domain architectures and filtered based on comparison of protein length with Arabidopsis templates. Conservation scores were calculated using methods described in a previous study (Koo et al., 2020).
FIGURE 3Duplication pattern analysis of flowering-time genes. Flowering-time genes were analyzed using MCScanX, and duplication patterns were classified as singleton, dispersed, proximal, tandem, segmental/whole genome duplication (WGD), or unassigned.
Classification of duplicated genes.
| 78 | 307 | 20 | 57 | 295 | 0 | 757 | |
| 0 | 283 | 10 | 18 | 1,414 | 369 | 2,094 | |
| 8 | 220 | 15 | 15 | 459 | 0 | 717 | |
| 11 | 257 | 13 | 28 | 761 | 1 | 1,071 | |
| 19 | 269 | 40 | 38 | 170 | 1 | 537 | |
| 14 | 380 | 32 | 60 | 491 | 6 | 983 | |
| 44 | 389 | 28 | 56 | 410 | 41 | 968 | |
| 17 | 330 | 24 | 47 | 377 | 0 | 795 | |
| 1 | 100 | 18 | 16 | 1,732 | 29 | 1,896 | |
| 0 | 91 | 20 | 10 | 1,772 | 5 | 1,898 | |
| 0 | 90 | 17 | 22 | 1,764 | 11 | 1,904 | |
| 0 | 89 | 27 | 21 | 1,804 | 5 | 1,946 | |
| 0 | 89 | 13 | 23 | 1,788 | 3 | 1,916 | |
| 15 | 360 | 34 | 40 | 65 | 0 | 514 | |
| 18 | 381 | 41 | 57 | 187 | 0 | 684 |
FIGURE 4Comparison of genomic structures near flowering-time genes in diploid and polyploid genomes. (A) Analysis of gene collinearity and genomic alterations between A and D genomes. (B) Investigation of microsynteny near the FCA and VIP5 genes. (C) Analysis of genomic variations in flowering-time genes and their flanking regions in diploid and polyploid genomes.
FIGURE 5Sequence similarity network analysis for the FCA and VIP5 gene products in the Brassicaceae, Malvaceae, and Solanaceae families. (A) and (D) Sequence similarity networks of the (A) FCA and (D) VIP5 gene products. (B) and (E) Average protein lengths of (B) FCA and (E) VIP5 within individual groups. (C) and (F) Distribution of duplication patterns for (C) FCA and (F) VIP5 within individual groups.