Literature DB >> 35371153

CONSTANS Polymorphism Modulates Flowering Time and Maturity in Soybean.

Mohammad Abdul Awal Khan1, Shouwei Zhang1, Reza Mohammad Emon1,2, Fulu Chen1, Wenwen Song1, Tingting Wu1, Shan Yuan1, Cunxiang Wu1, Wensheng Hou1, Shi Sun1, Yongfu Fu1, Bingjun Jiang1, Tianfu Han1.   

Abstract

CONSTANS (CO) plays a critical role in the photoperiodic flowering pathway. However, the function of soybean CO orthologs and the molecular mechanisms in regulating flowering remain largely unknown. This study characterized the natural variations in CO family genes and their association with flowering time and maturity in soybeans. A total of 21 soybean CO family genes (GmCOLs) were cloned and sequenced in 128 varieties covering 14 known maturity groups (MG 0000-MG X from earliest to latest maturity). Regarding the whole genomic region involving these genes, GmCOL1, GmCOL3, GmCOL8, GmCOL9, GmCOL10, and GmCOL13 were conserved, and the remaining 15 genes showed genetic variation that was brought about by mutation, namely, all single-nucleotide polymorphisms (SNPs) and insertions-deletions (InDels). In addition, a few genes showed some strong linkage disequilibrium. Point mutations were found in 15 GmCOL genes, which can lead to changes in the potential protein structure. Early flowering and maturation were related to eight genes (GmCOL1/3/4/8/13/15/16/19). For flowering and maturation, 11 genes (GmCOL2/5/6/14/20/22/23/24/25/26/28) expressed divergent physiognomy. Haplotype analysis indicated that the haplotypes of GmCOL5-Hap2, GmCOL13-Hap2/3, and GmCOL28-Hap2 were associated with flowering dates and soybean maturity. This study helps address the role of GmCOL family genes in adapting to diverse environments, particularly when it is necessary to regulate soybean flowering dates and maturity.
Copyright © 2022 Awal Khan, Zhang, Emon, Chen, Song, Wu, Yuan, Wu, Hou, Sun, Fu, Jiang and Han.

Entities:  

Keywords:  GmCOL orthologue; flowering time; maturity group; natural variation; soybean

Year:  2022        PMID: 35371153      PMCID: PMC8969907          DOI: 10.3389/fpls.2022.817544

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Plants can adapt to different environmental conditions in response to various day lengths (photoperiods). In the photoperiodic flowering pathway, CONSTANS (CO), which is a B-box-containing transcription factor (Robson et al., 2001; Gangappa and Botto, 2014), plays a key role (Putterill et al., 1995; Suarez-Lopez et al., 2001). CO also possesses a CONSTANS, CONSTANS-LIKE, TIMING OF CAB1 (CCT) domain at its C-terminus involved in DNA binding (Strayer et al., 2000; Robson et al., 2001). CYCLING DOF FACTOR (CDF) family proteins bind to the CO promoter to repress its transcription in the morning (Imaizumi et al., 2005; Fornara et al., 2009). FLAVIN-BINDING, KELCH REPEAT, and F-BOX1 (FKF1) interact with GIGANTEA (GI) to degrade the CDF1 protein, which results in the elevation of CO mRNA (Sawa et al., 2007). CO acts in the phloem of leaf vascular tissues to activate FLOWERING LOCUS T (FT) expression, which causes flowering under linkage disequilibrium (LD) conditions (Takada and Goto, 2003; An et al., 2004). SUPPRESSOR OF PHYTOCHROME A-105 (SPA) proteins interact with the CCT domain of CO (Laubinger et al., 2006). CO is ubiquitinated by CONSTITUTIVE PHOTOMORPHOGENIC 1 (COP1) and degraded by the 26S proteasome (Jang et al., 2008; Liu L. J. et al., 2008). HIGH EXPRESSION OF OSMOTICALLY RESPONSIVE GENES1 (HOS1) interacts with CO and participates in the degradation of CO mediated by red light (Lazaro et al., 2012, 2015). Nucleoporin96 (Nup96) interacts with HOS1 to gate CO protein levels (Cheng et al., 2020). CO protein is stable under the light in the evening but degraded in the morning or in the dark under LD conditions (Valverde et al., 2004). All of the abovementioned comments are based on studies in Arabidopsis. As a short-day plant (SDP) distributed over a vast range of latitudes, soybean is characterized as having 14 maturity groups from MG 0000 to MG X (Liu et al., 2017; Jiang et al., 2019). There are hundreds of genetic loci associated with flowering time and maturity in soybean, among which E1 (Xia et al., 2012), E2 (Watanabe et al., 2011), E3 (Watanabe et al., 2009), E4 (Liu B. et al., 2008), E6 (Fang C. et al., 2021), E9 (Kong et al., 2014), E10 (Samanfar et al., 2017), E11 (Wang et al., 2019), J (Lu et al., 2017; Yue et al., 2017) and Tof11/GmPRR37/GmPRR3b (Li et al., 2020; Lu et al., 2020; Wang et al., 2020), GmTof16 (Dong et al., 2021b), GmFUL (Dong et al., 2021a; Sun et al., 2021; Yue et al., 2021), and GmLUX (Bu et al., 2021; Fang X. et al., 2021) are molecularly identified. There are 28 CO orthologs in soybean (Fan et al., 2014), but the functions of most of the CO orthologs remain uncharacterized. It has been shown that GmCOL1a and GmCOL1b can fully complement the late-flowering phenotype of the co1 mutant in Arabidopsis (Wu et al., 2014). In contrast, GmCOL1a and GmCOL1b repressed flowering in soybean under long-day conditions (Cao et al., 2015; Wu et al., 2019). Natural variation is associated with photoperiodic flowering and adaptation in different species (Alonso-Blanco et al., 2005; Balasubramanian et al., 2006; Rosas et al., 2014; Li et al., 2017; Lu et al., 2017; Bao et al., 2019; Jiang et al., 2019; Wu et al., 2020). In this study, 128 soybean varieties were selected and planted, covering all 14 maturity groups from MG 0000 to MG X with a continuous distribution in maturity groups (Jiang et al., 2019). Due to the short duration of the project, 21 of the 28 soybean COL genes (Fan et al., 2014) were sequenced, and their sequence polymorphisms were analyzed. Furthermore, we studied the haplotypes of these soybean GmCOL genes to discover their natural variations in association with flowering time and maturity. These results suggested that some natural variations in 21 soybean GmCOL genes were associated with flowering date and maturity.

Materials and Methods

Plant Materials

Soybean varieties covering all 14 maturity groups (MG 0000-MG X) were assessed in this study (Jiang et al., 2019). In this study, 64 Chinese and Russian soybean varieties and 64 North American maturity group standard varieties were included, covering MG 0000-MG X, for a total of 128 accessions analyzed (Table 1).
TABLE 1

Soyabean varieties and maturity groups.

MGR (maturity group reference) varieties

Maturity groupNorth American varietiesChinese-Russian varieties
MG 0000Star4/75, Hujiao07-2479, Hujiao07-2123, Dongnong 36, Paula, R-4, Dongnong 41, Lingbei 8
MG 000Maple Presto, OAC Vision, Rassvet, Jug 30, MagevaR-2, Heihe 35
MG 00Canatto, Maple Ridge, Daksoy, McCall, AgassizMengdou 32, Beidou 16, Dongnong 44, Mengdou 11
MG 0Traill, Chico, Barnes, Norpro, DawsonJiangmodou 1, Heihe 18, Heihe 43, Heihe 27, Beidou 37, Dengke 1, Fengshou 12, Dongnong 4, Hefeng 25
MG IHaroson, Kato, Parker, Granite, NE1900Heinong 16, Taixingheidou, Heinong 26, Suinong 14
MG IIHolt, Olympus, Century 84, IL1, LN92-7369Jilin 20, Yongchengzihuadou, Xiangchundou 24, Tiefeng 19
MG IIIAthow, LN89-5699, KS3494, IL2, Williams 82Zhonghuang 13, Zhonghuang 30, Tiefeng 33, Zhongdou 39, Xudou 9, Tiefeng 31, Jindou 19, Huachun 6, Huaidou 9
MG IVFlyer, Omaha, Calhoun, CF461, UA-4805Zheng 92116, Guandou 2, Jindou 39, Shanning 16, Houzimao
MG VNathan, Hollady, Hutcheson, R01-3474F, TN04-5321Shangdou 14, Dian 86-4, Diandou 7
MG VIDesha, Musen, D95-6271, G01-PR16, BoggsZhongdou 38, Wuhuasiyuehuang, Suxiandou 19, Nannong 493-1
MG VIIStonewall, Benning, Santee, HagoodHuangfengwo, Tongshanbopihuang, Hengyangbayueqing, Nanxiadou 25
MG VIIIMotte, Dowling, Crockett, Prichard, IAC-8, CIGRAS-51, CIGRAS-06Aijiaoqing, Pingguohuangdou, Nandou 12, Shangraodaqingsi, Lanxidaqingdou, Qiudou 1, Nandou 17, Jiangledaqingdou, Guixia 3
MG IXJupiter, Alamo, FT-15, UFV-3
MG XI.C. 192Zigongdongdou
Soyabean varieties and maturity groups.

Investigation of Flowering and Maturity Dates

The soybeans were planted in soil in 10-L pots and grown under natural conditions in Haidian District, Beijing (39.95°N, 116.32°E) on May 27, 2015, and May 18, 2016 (Jiang et al., 2019). After emergence (VE), seedlings of similar size were selected so that each pot contained five uniform plants. Each variety was planted in three pots. The four developmental stages of soybean, namely, VE, R1, R7, and R8 (Fehr and Caviness, 1977), were investigated, and the average value of three pots for each variety was used for statistical analysis.

DNA Isolation, PCR, and Sequencing

Genomic DNA was extracted from fresh trifoliolate leaves using the standard cetyltrimethylammonium bromide (CTAB) method (Jiang et al., 2019). To amplify the 21 soybean GmCOL genes, 36 PCR primer pairs were used. GmCOL9/15/16/20/28 was amplified with 2 primer pairs: GmCOL4 with 11 primer pairs and GmCOL1/2/3/5/6/8/10/13/14/19/22/23/24/25/26 with 1 primer pair. The sequences of these primers and the template designation are listed in Supplementary Table 1. Target regions were amplified with the high-fidelity polymerase of KOD-Plus-Neo and KOD-FX. Their reaction conditions were 94°C for 2 min (98°C for 10 s, 68°C for 3 min), 35 cycles, and 68°C for 10 min. The PCR products were directly sequenced using the Sanger method (TSINGKE Biological Technology Company, BGI and Omega Genetics, Beijing, China). The polymorphic site information of the 21 soybean GmCOL family genes is listed in Supplementary Table 2.

Data Mining and Sequence Analysis

The annotated soybean COL gene sequence was downloaded from the Joint Genome Institute (JGI) Phytozome website[1]. The protein sequences of the annotated Arabidopsis genes (TAIR9 release) were downloaded from the Arabidopsis Information Resource (TAIR) website[2]. The CO-like gene sequence data were also collected from the USDA-ARS Soybean Genetics and Genomics Database (SoyBase Database[3]). The reference sequence of the soybean variety Williams 82 was obtained from the Phytozome (version 12.0) database[4] as a reference design for the amplification and sequencing primers of the COL genes. Sequencing was performed using an ABI3730 sequencer. Multiple sequence alignment, editing, and stitching were carried out using ClustalW in MEGA5 with default parameters. Single-nucleotide polymorphism (SNP) analysis was carried out using TASSEL5 software. The haplotype analysis of the sequencing results was conducted by DNAsp to determine whether the protein coding was affected. LD was also evaluated using TASSEL5 software. The statistical analysis was performed using “R” software.

Results

Diversity of Soybean Varieties in Flowering Time and Maturity

The results of the growth traits are presented in Table 2. The varieties utilized in this experiment exhibited significant diversity in flowering time and maturity (Table 2). In 2015, 19 varieties, namely, MC119, MC63, MC69, MC70, MC54, MC121, MC122, MC123, MC72, MC124, MC125, MC126, MC48, MC64, MC65, MC66, MC67, MC68, and MC71, flowered but failed to reach R7. In 2016, 50 varieties did not reach R7 (Table 2). In 2015, the days to R1 from emergence (VE) ranged from 21 (MC82, MG 0) to 121 days (MC72, MG VIII) with a span of 100 days. In 2016, the span time was 96 days between MG 0000 (MC05, 13 days) and MG VIII (MC129, 109 days). In 2016, the days to R7 ranged from 68 (MC05 and MC74, MG 0000) to 144 days (MC42, MG IV), and the days to R8 ranged from 76 (MC05, MG 0000) to 109 days (MC37, MG III). The combination of maturity group information for the days of flowering time from the emergence and the days of beginning maturity from emergence in 2016 is plotted in a box diagram (Figure 1).
TABLE 2

Phenotypic variation of soybean varieties at Beijing.

CodeVariety nameMGPhenotype in 2016

VE-R1VE-R7VE-R8
MC01Star 4/75000015.4 ± 1.270.6 ± 1.380.6 ± 0.61
MC02Hujiao 07-2479000016.4 ± 0.8272.06 ± 1.2781.26 ± 0.45
MC03Hujiao 07-2123000015.6 ± 0.5071.13 ± 0.3580.8 ± 0.41
MC04Dongnong 36000014.13 ± 0.5169.13 ± 0.5179.66 ± 0.97
MC05Paula000013.66 ± 0.4868.4 ± 0.8276.66 ± 0.48
MC06R-4000025.86 ± 0.7471.46 ± 0.9182.6 ± 0.82
MC07Dongnong 41000014.33 ± 0.4869.46 ± 0.5179.53 ± 0.91
MC74Lingbei 8000018.9 ± 1.068.4 ± 1.280.2 ± 1.5
MC75Dongnong 41-C000025.7 ± 1.470.5 ± 0.980.9 ± 1.0
MC08Maple-Presto00015.53 ± 0.5169.2 ± 1.0177.66 ± 0.97
MC09OAC-Vision00016.73 ± 0.7969.8 ± 1.5277.53 ± 0.91
MC10Rassvet00017.33 ± 0.4871.53 ± 0.9179.8 ± 0.41
MC11Jug-3000024.33 ± 0.4871.2 ± 0.4179.8 ± 0.67
MC12Mageva00025.73 ± 0.4575.66 ± 0.9783.8 ± 0.86
MC73R200014.6 ± 0.970.6 ± 1.480.8 ± 2.1
MC76Heihe 3500025.4 ± 1.573.5 ± o.587.5 ± 0.5
MC13Canatto0018.2 ± 0.472.8 ± 1.885.2 ± 0.4
MC14Maple-Ridge0018.4 ± 0.572.0 ± 1.084.7 ± 0.4
MC15Daksoy0024.3 ± 0.471.3 ± 0.480.8 ± 0.3
MC16McCall0023.6 ± 0.873.4 ± 1.285.8 ± 1.5
MC17Agassiz0025.2 ± 0.471.6 ± 0.881.2 ± 0.4
MC77Mengdou 320022.6 ± 2.473.2 ± 1.086.3 ± 0.9
MC78Beidou 160019.9 ± 0.773.4 ± 1.585.4 ± 0.5
MC80Dongnong 440024.8 ± 1.071.3 ± 1.481.3 ± 0.9
MC81Mengdou 110024.3 ± 0.471.2 ± 1.381.6 ± 1.5
MC26Traill027.3 ± 0.473.3 ± 0.982.26 ± 0.7
MC27Chico026.4 ± 0.970.8 ± 1.581.4 ± 1.5
MC28Barnes026.5 ± 0.768.8 ± 1.378.4 ± 0.5
MC29Norpro025.3 ± 0.971.4 ± 0.5082.1 ± 1.2
MC30Dawson023.6 ± 0.470.2 ± 1.3780.4 ± 1.2
MC31Jiangmodou 1023.4 ± 0.969.7 ± 1.180.5 ± 0.9
MC82Heihe 18025.2 ± 1.572.0 ± 1.080.8 ± 1.3
MC83Heihe 43026.4 ± 0.571.9 ± 1.682.6 ± 0.7
MC84Heihe 27023.6 ± 0.470.8 ± 1.081.6 ± 0.5
MC85Beidou 37021.5 ± 0.969.5 ± 0.980.2 ± 1.2
MC86Dengke1023.8 ± 0.469.6 ± 0.480.4 ± 0.5
MC97Fengshou 12024.6 ± 0.971.0 ± 1.080.8 ± 1.0
MC88Dongnong-4032.2 ± 1.074.1 ± 0.885.8 ± 0.6
MC89Hefeng 25032.7 ± 0.475.5 ± 0.985.7 ± 0.7
MC23HarosonI25.2 ± 0.4571.2 ± 0.482.2 ± 0.4
MC24KatoI27.5 ± 0.972.2 ± 0.481.5 ± 0.9
MC25ParkerI25.4 ± 0.971.4 ± 0.980.3 ± 0.9
MC26GraniteI25.3 ± 0.971.3 ± 0.978.4 ± 0.9
MC27NE1900I35.2 ± 0.470.7 ± 1.080.1 ± 0.3
MC90Heinong 16I28.5 ± 0.971.6 ± 0.979.4 ± 0.5
MC91TaixingheidouI26.6 ± 0.972.73 ± 1.679.3 ± 1.7
MC92Heinong 26I29.4 ± 0.570.8 ± 0.678.8 ± 0.9
MC93Suinong 14I27.2 ± 1.569.4 ± 1.378.2 ± 1.3
MC28HoltII30.3 ± 2.470.8 ± 0.881.8 ± 1.5
MC29OlympusII27.7 ± 0.972.3 ± 0.482.0 ± 1.0
MC30Century-84II26.6 ± 0.471.6 ± 0.881.0 ± 1.2
MC31IL1II30.4 ± 5.570.2 ± 1.779.6 ± 2.5
MC32LN92-7369II30.4 ± 1.476.2 ± 1.0107.9 ± 1.6
MC94Jilin 20II25.2 ± 1.070.3 ± 0.978.2 ± 1.0
MC95YongchengzihuadouII41.3 ± 1.272.1 ± 1.287.7 ± 1.2
MC96Xiangchundou 24II43.8 ± 1.077.4 ± 0.586.6 ± 1.2
MC98Tiefeng 19II24.4 ± 0.571.2 ± 1.281.2 ± 1.7
MC33AthowIII30.3 ± 0.976.3 ± 0.9106.3 ± 0.9
MC34Zhonghuang 13III43.3 ± 0.478.4 ± 0.9106.4 ± 0.9
MC35Zhonghuang 30III37.8 ± 0.978.3 ± 0.9105.5 ± 1.4
MC36LN89-5699III32.3 ± 0.980.2 ± 1.5108.2 ± 0.4
MC37KS3494III37.4 ± 0.579.8 ± 0.9109.4 ± 1.5
MC38IL2III42.1 ± 0.8105.2 ± 2.3105.3 ± 5.1
MC39Williams 82III36.8 ± 0.3107.3 ± 0.8108.1 ± 2.3
MC97Tiefeng 33III29.5 ± 0.578.7 ± 0.4106.8 ± 1.2
MC99Zhongdou 39III44.5 ± 0.577.4 ± 0.5105.2 ± 1.8
MC100Xudou 9III37.9 ± 0.778.0 ± 1.0106.4 ± 1.5
MC101Tiefeng 31III24.2 ± 1.177.2 ± 1.3105.2 ± 1.1
MC102Jindou 19III25.8 ± 1.377.4 ± 1.5104.8 ± 1.5
MC103Huachun 6III31.4 ± 0.577.3 ± 0.4105.6 ± 0.9
MC104Huaidou 9III34.3 ± 0.4NDND
MC40FlyerIV35.5 ± 1.4143.0 ± 1.4ND
MC41OmahaIV38.0 ± 1.0143.4 ± 1.4ND
MC42CalhounIV37.3 ± 0.4144.4 ± 0.5ND
MC43CF461IV31.8 ± 1.0143.7 ± 0.4ND
MC44UA-4805IV61.2 ± 0.4NDND
MC105Zheng 92116IV37.2 ± 0.4NDND
MC106Guandou 2IV35.8 ± 1.0NDND
MC107Jindou 39IV36.2 ± 1.3142.9 ± 1.0ND
MC108Shanning 16IV43.4 ± 0.8142.3 ± 0.4ND
MC109HouzimaoIV43.7 ± 1.4143.3 ± 1.1ND
MC45NathanV60.2 ± 1.0NDND
MC46HolladayV62.2 ± 1.5NDND
MC47HutchesonV63.2 ± 1.0NDND
MC48R01-3474FV62.2 ± 1.5NDND
MC49TN04-5321V63.6 ± 1.5NDND
MC110Shangdou 14V44.7 ± 0.4NDND
MC111Dian 86-4V43.6 ± 1.5NDND
MC113Diandou 7V88.3 ± 0.9NDND
MC50DeshaVI48.0 ± 0.8NDND
MC51MusenVI80.0 ± 1.0NDND
MC52D95-6271VI82.3 ± 1.5NDND
MC53G01-PR16VI81.6 ± 0.9NDND
MC54BoggsVI79.8 ± 1.5NDND
MC112Zhongdou 38VI83.6 ± 2.5NDND
MC114WuhuasiyuehuangVI86.3 ± 0.9NDND
MC108Suxiandou 19VI85.8 ± 1.2NDND
MC116Nannong 493-1VI91.2 ± 0.4NDND
MC55StonewallVII81.4 ± 0.5NDND
MC57SanteeVII82.6 ± 1.8NDND
MC58HagoodVII94.6 ± 1.2NDND
MC56BenningVII93.3 ± 0.9NDND
MC117HunagfengwuVII93.7 ± 0.4NDND
MC118TongshanbopihuangVII93.2 ± 1.2NDND
MC60MotteVIII93.3 ± 0.9NDND
MC61DowlingVIII95.8 ± 1.5NDND
MC62CrockettVIII65.4 ± 0.2142.2 ± 0.5158.0 ± 1.0
MC128HengyangbayueqingVII96.9 ± 1.0NDND
MC119Nanxiadou 25VII92.6 ± 0.9NDND
MC63PrichardVIII94.8 ± 1.0NDND
MC69CIGRAS-51VIII81.4 ± 0.5NDND
MC70CIGRAS-06VIII81.5 ± 0.9NDND
MC54AijiaoqingVIII94.3 ± 0.4NDND
MC121PingguohuangdouVIII92.9 ± 1.0NDND
MC122Nandou 12VIII91.3 ± 0.4NDND
MC123ShangraodaqingsiVIII92.8 ± 1.0NDND
MC72ZigongdongdouVIII73.4 ± 1.5NDND
MC124LanxidaqingdouVIII91.6 ± 0.5NDND
MC125Qiudou 1VIII94.3 ± 0.4NDND
MC126Nandou 17VIII91.6 ± 0.4NDND
MC48JiangledaqingdouVIII99.5 ± 0.9NDND
MC129Guixia 3VIII109.2 ± 1.0118.4 ± 0.2ND
MC64JupiterIX96.3 ± 0.9NDND
MC65AlamoIX96.0 ± 1.3NDND
MC66FT-15IX94.2 ± 1.3NDND
MC67UFV-3IX96.2 ± 1.0NDND
MC68IAC-8IX90.0 ± 1.0NDND
MC71I.C.-192X80.4 ± 1.5NDND

ND, Not available data; VE, Variation after emergence.

FIGURE 1

(A,B) Flowering time and maturity of soybean varieties from different maturity groups. Flowering time and maturity data were collected in 2016 from 128 soybean varieties covering 14 maturity groups. DAE, days after emergence.

Phenotypic variation of soybean varieties at Beijing. ND, Not available data; VE, Variation after emergence. (A,B) Flowering time and maturity of soybean varieties from different maturity groups. Flowering time and maturity data were collected in 2016 from 128 soybean varieties covering 14 maturity groups. DAE, days after emergence.

GmCOL Gene Sequencing and Mutation Study

A large number of mutated sites were found in GmCOL2, GmCOL5, GmCOL14, GmCOL16, GmCOL20, and GmCOL28 (Figures 2B1,D1 and Supplementary Figures 1G1,I1, 2B1,H1). A majority of the mutation sites were in gene intron regions, but a few were located in protein-coding regions caused by insertions-deletions (InDels) and substitutions. In addition, GmCOL6, GmCOL22, GmCOL23, GmCOL24, GmCOL25, and GmCOL26 showed high mutation frequencies in the protein-coding regions (Supplementary Figures 1, 2). In contrast, GmCOL4, GmCOL15, and GmCOL19 expressed lower mutational occurrences (Supplementary Figures 1A1,H1, 2A1). The exception was GmCOL3, which did not show mutations (Figure 2C) in either the intron or exon region of the entire gene. GmCOL3 is a highly conserved gene with no change in sequence. However, the amino acid sequences encoded by GmCOL1, GmCOL8, GmCOL9, GmCOL10, and GmCOL13 were not affected (Figure 2A and Supplementary Figures 1C1-F1).
FIGURE 2

Haplotype analysis, distribution in different maturity groups, and flowering time (VE-R1) in the main haplotypes of approximately four GmCOL family genes. (A) Haplotype of GmCOL1. (B,B1) Haplotypes of GmCOL2. The haplotype is shown as a linear combination of alleles. The site combination labeled with the yellow outline represents the tagging haplotype (the composition of the haplotype depends on fewer and critical variation sites). (B2) Haplotype distribution of GmCOL2 in different maturity groups. (B3) Flowering time (VE-R1) of the main haplotypes of GmCOL2 in 2015. (B4) Flowering time (VE-R1) of the main haplotypes of GmCOL2 in 2016. (C) Gene structure of GmCOL3. (D,D1) Haplotypes of GmCOL5 (1) and GmCOL5 (2). (D2) Haplotype distribution of GmCOL5 in different maturity groups. (D3) Flowering time (VE-R1) of the main haplotypes of GmCOL5 in 2015. (D4) Flowering time (VE-R1) of the main haplotypes of GmCOL5 in 2016.

Haplotype analysis, distribution in different maturity groups, and flowering time (VE-R1) in the main haplotypes of approximately four GmCOL family genes. (A) Haplotype of GmCOL1. (B,B1) Haplotypes of GmCOL2. The haplotype is shown as a linear combination of alleles. The site combination labeled with the yellow outline represents the tagging haplotype (the composition of the haplotype depends on fewer and critical variation sites). (B2) Haplotype distribution of GmCOL2 in different maturity groups. (B3) Flowering time (VE-R1) of the main haplotypes of GmCOL2 in 2015. (B4) Flowering time (VE-R1) of the main haplotypes of GmCOL2 in 2016. (C) Gene structure of GmCOL3. (D,D1) Haplotypes of GmCOL5 (1) and GmCOL5 (2). (D2) Haplotype distribution of GmCOL5 in different maturity groups. (D3) Flowering time (VE-R1) of the main haplotypes of GmCOL5 in 2015. (D4) Flowering time (VE-R1) of the main haplotypes of GmCOL5 in 2016.

Soybean COL Gene Family Exhibited Different Linkage Disequilibrium

Linkage and association mapping were drawn among the SNPs of the gene via TASSEL5. GmCOL5 across the region almost from the starting to the end site presented a strong LD by completing the haplotype block (Figure 3D). GmCOL20, GmCOL24, and GmCOL25 across the region expressed strong LD as a haplotype block (Figures 3N,Q,R), and the polymorphic sites of GmCOL6, GmCOL8, GmCOL10, GmCOL13, and GmCOL19 showed a strong LD, with almost the entire region being a haplotype block (Figures 3E,F,H,I,M). Although GmCOL2, GmCOL16, GmCOL22, GmCOL26, and GmCOL28 had a high-sequence polymorphism, the entire region distributed a few LDs (Figures 3B,L,O,S,T) with the same high-sequence polymorphism of GmCOL4, GmCOL9, and GmCOL23, and quite a few strong LDs were distributed broadly (Figures 3C,G,P). GmCOL1, GmCOL14, and GmCOL15 did not reveal any LD (Figures 3A,J,K). GmCOL3 was the most conserved gene, showing no polymorphic site(s).
FIGURE 3

Linkage disequilibrium of GmCOL family genes in soybean. (A) Linkage disequilibrium (LD) of GmCOL1. (B) LD of GmCOL2. (C) LD of GmCOL4. (D) LD of GmCOL5. (E) LD of GmCOL6. (F) LD of GmCOL8. (G) LD of GmCOL9. (H) LD of GmCOL10. (I) LD of GmCOL13. (J) LD of GmCOL14. (K) LD of GmCOL15. (L) LD of GmCOL16. (M) LD of GmCOL19. (N) LD of GmCOL20. (O) LD of GmCOL22. (P) LD of GmCOL23. (Q) LD of GmCOL24. (R) LD of GmCOL25. (S) LD of GmCOL26. (T) LD of GmCOL28.

Linkage disequilibrium of GmCOL family genes in soybean. (A) Linkage disequilibrium (LD) of GmCOL1. (B) LD of GmCOL2. (C) LD of GmCOL4. (D) LD of GmCOL5. (E) LD of GmCOL6. (F) LD of GmCOL8. (G) LD of GmCOL9. (H) LD of GmCOL10. (I) LD of GmCOL13. (J) LD of GmCOL14. (K) LD of GmCOL15. (L) LD of GmCOL16. (M) LD of GmCOL19. (N) LD of GmCOL20. (O) LD of GmCOL22. (P) LD of GmCOL23. (Q) LD of GmCOL24. (R) LD of GmCOL25. (S) LD of GmCOL26. (T) LD of GmCOL28.

Haplotype Analysis and Its Association With Maturity Groups of Soybean Varieties

Based on LD and sequence clustering, haplotype definition was analyzed for the 21 soybean GmCOL gene families in the 128 varieties investigated. A total of 21 haplotypes of the GmCOL gene family variation types and polymorphic sites used for composing haplotypes are listed in Figure 2 and Supplementary Tables 3, 4 (Supplementary Figures 1, 2), respectively. Supplementary Table 5 shows 21 GmCOL protein groups associated with haplotypes. GmCOL2-Hap2 was dominant in the 41 accessions covering all maturity groups, except MG X. GmCOL2-Hap4 showed early flowering compared with the other haplotype groups (Figures 2B3,B4, Table 2, and Supplementary Table 3). GmCOL4-Hap1 accounted for 99 varieties, with all maturity groups excluding MG X (Supplementary Figure 1A2). GmCOL4-Hap1 had an earlier flowering date in 2016. GmCOL5-Hap2 was present in 56 accessions, covering all maturity groups (Figure 2D1). GmCOL6-Hap1 was the most abundant in 70 varieties from different maturity groups, including MG 0000-MG V and MG VIII-MG X (Supplementary Figure 1B1). Among the 11 haplotypes, GmCOL9-Hap1 was dominant in 67 varieties from maturity groups MG 0000-MG X (Supplementary Figures 1D1,D2). GmCOL9-Hap3 and GmCOL9-Hap4 were distributed in a few varieties, and the flowering time for both haplotypes was early. GmCOL10-Hap1 was the most abundantly expressed in 91 varieties from all maturity groups except MG X (Supplementary Figures 1E1,E2). GmCOL16-Hap1 was rich in accessions distributed in the maturity groups MG 0000-MG VIII (Supplementary Figures 1I1,I2). GmCOL20-Hap2 was present in 52 varieties from all maturity groups (MG 0000-MG X) (Supplementary Figures 2B1,B2). Among all the haplotypes of GmCOL28, GmCOL28-Hap1 was the most common, accounting for 59 accessions, and was distributed in the maturity groups MG 0000, MG 00, MG 0, MG I-MG VIII, and MG X (Supplementary Figures 2H1,H2). In this experiment, no haplotype variants were found in GmCOL1 and GmCOL3 (Figures 2A,C). The Hap1 series of haplotypes of GmCOL8/13/14/15/19/22/23/24/25/26 was the most abundant in the varieties and was distributed in all 14 maturity groups (Supplementary Figures 1C1,F1,G1,H1, 2A1,C1,D1,E1,F1,G1).

Haplotypes Associated With Flowering Time

The haplotype groups of 21 gene families are presented in Table 3. An analysis of variance (ANOVA) was performed to elucidate the natural variations in flowering time (VE-R1) of the GmCOL gene families based on haplotype groups. In this analysis, eight genes (GmCOL2/5/9/13/15/16/25/28) showed significant results from the first emergence to the first flowering date (VE-R1) in both years, and three genes [GmCOL8 (2015), GmCOL22 (2015), and GmCOL26 (2016)] exhibited significant results in a single year (Supplementary Table 6). The remaining genes (GmCOL4/6/10/14/19/20/23/24) did not show significant results in these years. Notably, GmCOL1 and GmCOL3 were the most conserved, and no polymorphisms in the coding region were observed.
TABLE 3

Haplotype frequency of GmCOL family genes.

Haplotype
GmCOL2 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6 Hap7 Hap8 Hap9 Hap10 Hap11
Accessions184120173333111
GmCOL4 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions997441
GmCOL5 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions5156111
GmCOL6 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions7046511
GmCOL8 Hap1 Hap2
Accessions8811
GmCOL9 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6 Hap7 Hap8 Hap9 Hap10 Hap11
Accessions6725443111111
GmCOL10 Hap1 Hap2 Hap3 Hap4
Accessions911611
GmCOL13 Hap1 Hap2 Hap3
Accessions12251
GmCOL14 Hap1 Hap2 Hap3
Accessions11411
GmCOL15 Hap1 Hap2 Hap3 Hap4
Accessions107521
GmCOL16 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6 Hap7 Hap8 Hap9
Accessions542524654221
GmCOL19 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions9917221
GmCOL20 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6 Hap7 Hap8 Hap9 Hap10
Accessions1952278221111
GmCOL22 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6
Accessions11074321
GmCOL23 Hap1 Hap2 Hap3 Hap4 Hap5 Hap6 Hap7 Hap8 Hap9 Hap10
Accessions80432111111
GmCOL24 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions885311
GmCOL25 Hap1 Hap2 Hap3 Hap4 Hap5
Accessions899661
GmCOL26 Hap1 Hap2 Hap3 Hap4
Accessions615672
GmCOL28 Hap1 Hap2 Hap3
Accessions592922
Haplotype frequency of GmCOL family genes. Among the 11 haplotype groups of GmCOL2, 4 were analyzed, and GmCOL2-Hap3 was significantly different from every other haplotype group (Figure 4A). The GmCOL2-Hap1/2/4 haplotype groups, which appeared in the above haplotype group, were not significantly different from each other. Analysis of GmCOL5 showed that in both years two major haplotypes showed significant differences from each other, and Hap2 was related to late flowering compared with Hap1 (Figure 4B). In terms of GmCOL5 Hap2 (S999), the original nucleotide (A499) was mutated to G499 in the coding sequence, resulting in a missense mutation in the amino acid sequence (K167 to E167). K167 was located between the B-box and the CCT domain. In comparison, GmCOL9, Hap1, and Hap2 (Figure 4C) did not show significant differences from each other, and both were significantly different from GmCOL9-Hap3 and GmCOL9-Hap4 only in 2015. Likewise, GmCOL13-Hap2/3 (Figure 4D) was closely associated with flowering time because both were significantly different from GmCOL13-Hap1 in both years.
FIGURE 4

Haplotypes of GmCOL genes in relation to flowering time (2015–2016). The relationship between flowering time and haplotype of GmCOL2 (A), GmCOL5 (B), GmCOL9 (C), GmCOL13 (D), GmCOL15 (E), GmCOL16 (F), GmCOL25 (G), and GmCOL28 (H).

Haplotypes of GmCOL genes in relation to flowering time (2015–2016). The relationship between flowering time and haplotype of GmCOL2 (A), GmCOL5 (B), GmCOL9 (C), GmCOL13 (D), GmCOL15 (E), GmCOL16 (F), GmCOL25 (G), and GmCOL28 (H). For GmCOL13 Hap2/3 (S584), the original nucleotide (G84) was mutated to A84 in the coding sequence, resulting in a silent mutation in the no. 28 amino acid sequence, which was located in the first B-box. In both years, GmCOL15-Hap3/4 (Figure 4E) showed a significant difference compared with GmCOL15-Hap1/2. For GmCOL16, the haplotypes showed non-significant results among the four haplotypes (Hap1, Hap2, Hap3, and Hap4) (Figure 4F). In the first year, five haplotypes of GmCOL25 (Hap1, Hap2, Hap3, Hap4, and Hap5) were not significantly different from each other, whereas Hap3 and Hap4 were significantly different from Hap1, Hap2, and Hap5 in the second year (Figure 4G). Hap2 of GmCOL28 was significantly different from Hap1 and Hap3 in both years (Figure 4H). For GmCOL28 Hap2 (S1066), the original nucleotide (T263) was mutated to C263 in the coding sequence, resulting in a missense mutation in the amino acid sequence (V88 to A88), which was located in the first B-box, and may affect the protein and protein interaction. Thus, the results revealed that Hap2 of GmCOL28 was closely associated with flowering time.

Discussion

Selected Soybean Varieties Showed Diversity in Flowering Time and Maturity

The varieties analyzed in this study showed the diversity of flowering time and maturity, which indicated that the flowering time (VE-R1) and the days from emergence to physiological maturity (VE-R7) were related to the variety trait and natural environment (Jiang et al., 2019). According to the photoperiod responses, many soybean varieties have evolved to adapt to a broad range of growing areas in China (Wang et al., 2016). The flowering time of the selected varieties ranged from 21 to 121 days and 13.66 to 109.2 days, and the maturity time ranged from 61.2 to 150.7 days and 68.4 to 144.4 days, respectively, for two consecutive years (2015 and 2016) (Jiang et al., 2019), indicating high diversity (Table 2). However, Paula showed the shortest VE-R1 in the 2nd year, with the maturity group MG 0000. Paula is recognized as a high-latitude cold region (HCR) soybean variety and HCR soybean, which typically matures early (Jia et al., 2014). This observation suggested that the maturity group MG 0000 was relatively stable for days to start flowering from emergence.

GmCOL Orthologs Showed Divergence in Sequence Polymorphism

It has been shown that GmCOL1a (Glyma08g28370) and GmCOL1b (Glyma18g51320) in soybean are the closest Arabidopsis COL2 orthologs to CO (Thakare et al., 2010). The transgenic soybean line overexpressing GmCOL1a flowered late under long-day or natural conditions (Cao et al., 2015). There is a single-nucleotide substitution at the GmCOL1b CCT domain in the gmcol1b mutant, leading to the mutagenesis of conserved threonine at amino acid 314 into isoleucine, which results in early flowering of the gmcol1b mutant (Cao et al., 2015). In this study, the varieties that were taken for the observation of natural variations regarding flowering time and maturation showed divergent characteristics in flowering time (Jiang et al., 2019). Allelic variation can have an effect on flowering time (Irwin et al., 2016). In this study, the insertion and single-nucleotide substitution (InDel918, InDel919, InDel920, s958, s1138, s1139, s1218, and s1260) found in the first exon in GmCOL2 led to missense mutations, which might have an effect on flowering time and maturity. The other genes in the GmCOL family showed various types of point mutations (Figure 2 and Supplementary Figures 1–5). The loss of functions may have an impact on gene function, and the presence of these mutations suggested that polymorphism could be the main cause of the diversity of soybean flowering time.

Polymorphism of GmCOL Family Genes Was Associated With Flowering Time and Maturity

Haplotype-based analyses have been successfully carried out in different crops, such as maize (Zea mays L.) (Weber et al., 2009; Van Inghelandt et al., 2012; Lipka et al., 2013), rice (Oryza sativa L.) (Lestari et al., 2011; Yonemaru et al., 2012, 2014; Shao et al., 2013; Wu et al., 2020), and soybean (Choi et al., 2007; Li et al., 2009; Langewisch et al., 2014; Patil et al., 2016; Jiang et al., 2019). In this study, some polymorphic sites were identified and utilized by conducting a haplotype analysis of 21 soybean GmCOL gene families. Based on LD analysis and polymorphic sites, tagging haplotypes composed of some SNPs and InDels indicated that there were 7 polymorphic sites in GmCOL2 and just 3 polymorphic sites in GmCOL10 and 46 and 10 polymorphic sites in GmCOL2 and GmCOL10, respectively (Supplementary Table 4). In addition, some haplotypes associated with flowering date and maturity (such as GmCOL2-Hap1, Hap2, and Hap4; GmCOL9-Hap1, Hap2, Hap3, and Hap4; GmCOL13-Hap1, Hap2, and Hap3; GmCOL15-Hap1, Hap2, Hap3, and Hap4; GmCOL16-Hap1, Hap2, Hap3, Hap4, and Hap5; and GmCOL25-Hap1, Hap2, Hap3, Hap4, and Hap5) were distributed in varieties with different maturity groups. The LD differences among the 21 soybean COL gene families may reflect selection pressure and, to some extent, the process of natural selection and domestication.

Conclusion

In summary, 21 GmCOL genes exhibited natural divergence in association with flowering and growth periods. Significant changes were found in 21 COL genome sequences, among which GmCOL1, GmCOL3, GmCOL8, GmCOL9, GmCOL10, and GmCOL13 were conserved, but the sequences of the remaining COL genes showed a wide range of changes that could alter their function. In this study, it was noticed that additional polymorphisms linked to the 21 GmCOL gene families in trait-controlling regions might have a significant impact on flowering time and maturity.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

MA conducted the major phenotypic and genotypic works. SZ, RE, WS, TW, and BJ joined the experiments. SY, CW, WH, SS, and YF provided the materials and technical supports. MA, RE, and BJ prepared the initial draft of the manuscript. FC rewrote the manuscript. FC, BJ, MA, RE, and TH revised the manuscript. TH coordinated and supervised the project. All authors read and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  59 in total

1.  The Arabidopsis E3 ubiquitin ligase HOS1 negatively regulates CONSTANS abundance in the photoperiodic control of flowering.

Authors:  Ana Lazaro; Federico Valverde; Manuel Piñeiro; Jose A Jarillo
Journal:  Plant Cell       Date:  2012-03-09       Impact factor: 11.277

2.  CONSTANS mediates between the circadian clock and the control of flowering in Arabidopsis.

Authors:  P Suárez-López; K Wheatley; F Robson; H Onouchi; F Valverde; G Coupland
Journal:  Nature       Date:  2001-04-26       Impact factor: 49.962

3.  FKF1 and GIGANTEA complex formation is required for day-length measurement in Arabidopsis.

Authors:  Mariko Sawa; Dmitri A Nusinow; Steve A Kay; Takato Imaizumi
Journal:  Science       Date:  2007-09-13       Impact factor: 47.728

4.  Haplotype variation at Badh2, the gene determining fragrance in rice.

Authors:  Gaoneng Shao; Shaoqing Tang; Mingliang Chen; Xiangjin Wei; Jiwai He; Ju Luo; Guiai Jiao; Yichao Hu; Lihong Xie; Peisong Hu
Journal:  Genomics       Date:  2012-12-06       Impact factor: 5.736

5.  Stepwise selection on homeologous PRR genes controlling flowering and maturity during soybean domestication.

Authors:  Sijia Lu; Lidong Dong; Chao Fang; Shulin Liu; Lingping Kong; Qun Cheng; Liyu Chen; Tong Su; Haiyang Nan; Dan Zhang; Lei Zhang; Zhijuan Wang; Yongqing Yang; Deyue Yu; Xiaolei Liu; Qingyong Yang; Xiaoya Lin; Yang Tang; Xiaohui Zhao; Xinquan Yang; Changen Tian; Qiguang Xie; Xia Li; Xiaohui Yuan; Zhixi Tian; Baohui Liu; James L Weller; Fanjiang Kong
Journal:  Nat Genet       Date:  2020-03-30       Impact factor: 38.330

6.  Conserved CO-FT regulons contribute to the photoperiod flowering control in soybean.

Authors:  Chengming Fan; Ruibo Hu; Xiaomei Zhang; Xu Wang; Wenjing Zhang; Qingzhe Zhang; Jinhua Ma; Yong-Fu Fu
Journal:  BMC Plant Biol       Date:  2014-01-07       Impact factor: 4.215

7.  Genetic variation of maturity groups and four E genes in the Chinese soybean mini core collection.

Authors:  Jicun Li; Xiaobo Wang; Wenwen Song; Xinyang Huang; Jing Zhou; Haiyan Zeng; Shi Sun; Hongchang Jia; Wenbin Li; Xinan Zhou; Suzhen Li; Pengyin Chen; Cunxiang Wu; Yong Guo; Tianfu Han; Lijuan Qiu
Journal:  PLoS One       Date:  2017-02-16       Impact factor: 3.240

8.  Natural variation and CRISPR/Cas9-mediated mutation in GmPRR37 affect photoperiodic flowering and contribute to regional adaptation of soybean.

Authors:  Liwei Wang; Shi Sun; Tingting Wu; Luping Liu; Xuegang Sun; Yupeng Cai; Jicun Li; Hongchang Jia; Shan Yuan; Li Chen; Bingjun Jiang; Cunxiang Wu; Wensheng Hou; Tianfu Han
Journal:  Plant Biotechnol J       Date:  2020-01-25       Impact factor: 9.803

9.  Maturity group classification and maturity locus genotyping of early-maturing soybean varieties from high-latitude cold regions.

Authors:  Hongchang Jia; Bingjun Jiang; Cunxiang Wu; Wencheng Lu; Wensheng Hou; Shi Sun; Hongrui Yan; Tianfu Han
Journal:  PLoS One       Date:  2014-04-16       Impact factor: 3.240

10.  Functional and evolutionary characterization of the CONSTANS gene family in short-day photoperiodic flowering in soybean.

Authors:  Faqiang Wu; Brian William Price; Waseem Haider; Gabriela Seufferheld; Randall Nelson; Yoshie Hanzawa
Journal:  PLoS One       Date:  2014-01-21       Impact factor: 3.240

View more
  1 in total

1.  GmFT3a fine-tunes flowering time and improves adaptation of soybean to higher latitudes.

Authors:  Shan Yuan; Yining Wang; Junya Wang; Chunlei Zhang; Lixin Zhang; Bingjun Jiang; Tingting Wu; Li Chen; Xin Xu; Yupeng Cai; Shi Sun; Fulu Chen; Wenwen Song; Cunxiang Wu; Wensheng Hou; Lijie Yu; Tianfu Han
Journal:  Front Plant Sci       Date:  2022-07-25       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.