| Literature DB >> 25710777 |
Rui Zhang1, Jian Ding1, Chunxiao Liu1, Caiping Cai1, Baoliang Zhou1, Tianzhen Zhang1, Wangzhen Guo1.
Abstract
Flowering time is an important ecological trait that determines the transition from vegetative to reproductive growth. Flowering time in cotton is controlled by short-day photoperiods, with strict photoperiod sensitivity. As the CO-FT (CONSTANS-FLOWER LOCUS T) module regulates photoperiodic flowering in several plants, we selected eight CONSTANS genes (COL) in group I to detect their expression patterns in long-day and short-day conditions. Further, we individually cloned and sequenced their homologs from 25 different cotton accessions and one outgroup. Finally, we studied their structures, phylogenetic relationship, and molecular evolution in both coding region and three characteristic domains. All the eight COLs in group I show diurnal expression. In the orthologous and homeologous loci, each gene structure in different cotton species is highly conserved, while length variation has occurred due to insertions/deletions in intron and/or exon regions. Six genes, COL2 to COL5, COL7 and COL8, exhibit higher nucleotide diversity in the D-subgenome than in the A-subgenome. The Ks values of 98.37% in all allotetraploid cotton species examined were higher in the A-D and At-Dt comparison than in the A-At and D-Dt comparisons, and the Pearson's correlation coefficient (r) of Ks between A vs. D and At vs. Dt also showed positive, high correlations, with a correlation coefficient of at least 0.797. The nucleotide polymorphism in wild species is significantly higher compared to G. hirsutum and G. barbadense, indicating a genetic bottleneck associated with the domesticated cotton species. Three characteristic domains in eight COLs exhibit different evolutionary rates, with the CCT domain highly conserved, while the B-box and Var domain much more variable in allotetraploid species. Taken together, COL1, COL2 and COL8 endured greater selective pressures during the domestication process. The study improves our understanding of the domestication-related genes/traits during cotton evolutionary process.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25710777 PMCID: PMC4339614 DOI: 10.1371/journal.pone.0118669
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Cotton accessions used in the study with their geographic distribution in Gossypium.
| No. | Species | Genome | Geographic distribution |
|---|---|---|---|
| 1 |
| A1 | Africa |
| 2 |
| D5 | Peru |
| 3 |
| AD1 | America |
| 4 |
| AD1 | China |
| 5 |
| AD1 | China |
| 6 |
| AD1 | China |
| 7 |
| AD1 | Mexico |
| 8 |
| AD1 | Mexico |
| 9 |
| AD1 | Mexico |
| 10 |
| AD1 | Mexico |
| 11 |
| AD1 | Mexico |
| 12 |
| AD1 | Mexico |
| 13 |
| AD1 | Mexico |
| 14 |
| AD2 | China |
| 15 |
| AD2 | America-Egypt |
| 16 |
| AD2 | China |
| 17 |
| AD2 | Egypt |
| 18 |
| AD2 | America-Egypt |
| 19 |
| AD2 | Egypt |
| 20 |
| AD2 | Peru |
| 21 |
| AD2 | Peru |
| 22 |
| AD2 | Brazil |
| 23 |
| AD3 | Hawaii |
| 24 |
| AD4 | Brazil |
| 25 |
| AD5 | Galapagos Islands |
Fig 1Maximum likelihood tree of CONSTANS-like proteins in Arabidopsis, rice and cotton.
CO-like proteins in Arabidopsis and rice are based on Griffiths et al. (2003) and the Plant Transcription Factor Database (http://plntfdb.bio.uni-potsdam.de/v3.0/), and their amino acid sequences were obtained from the Plant Transcription Factor Database.
Structural analysis of eight COL genes in 25 cotton accessions.
| Gene | Full length/length of ORF(bp)/number of AA/length of extron1/length of intron/length of extron2 | Accession numbers |
|---|---|---|
|
| 1611/1125/374/774/486/351 | KM201660-KM201709 |
|
| 1 | KM201710-KM201759 |
|
| 1094/1017/338/699/77/318 | KM201760-KM201809 |
|
| 23 | KM201810-KM201859 |
|
| 1103/1008/335/693/95/315 | KM201860-KM201909 |
|
| At | KM201910-KM201959 |
|
| At | KM202010-KM202059 |
|
| 14–19 | KM201960-KM202009 |
a Code designations are the same as in Table 1.
b At = A-subgenome from tetraploid cotton species.
c Dt = D-subgenome from tetraploid cotton species.
Fig 2Diurnal expression patterns of eight COL genes in TM-1 leaves under LD or SD conditions.
The X-axin represent time point (hours) and Y-axin indicates relative expression levels with cotton histone3 (AF024716) as the control. Mean values ± SD were obtained from three biological repeats. The gray bars over each chart represent dark periods.
Fig 3Phylogenetic tree based on the combined sequence of eight CO-like genes by the maximum likelihood method.
Bootstrap values (%) based on 1000 replicates are indicated beside the nodes.
Correlation analysis between A-D and At-Dt comparisons for each allotetraploid accession.
| Pairwise comparison | r of Ks |
|---|---|
| A-D vs 3 At-Dt | 0.919 |
| A-D vs 4 At-Dt | 0.919 |
| A-D vs 5 At-Dt | 0.919 |
| A-D vs 6 At-Dt | 0.894 |
| A-D vs 7 At-Dt | 0.868 |
| A-D vs 8 At-Dt | 0.922 |
| A-D vs 9 At-Dt | 0.917 |
| A-D vs 10 At-Dt | 0.842 |
| A-D vs 11 At-Dt | 0.902 |
| A-D vs 12 At-Dt | 0.939 |
| A-D vs 13 At-Dt | 0.869 |
| A-D vs 14 At-Dt | 0.890 |
| A-D vs 15 At-Dt | 0.900 |
| A-D vs 16 At-Dt | 0.888 |
| A-D vs 17 At-Dt | 0.888 |
| A-D vs 18 At-Dt | 0.889 |
| A-D vs 19 At-Dt | 0.889 |
| A-D vs 20 At-Dt | 0.905 |
| A-D vs 21 At-Dt | 0.896 |
| A-D vs 22 At-Dt | 0.903 |
| A-D vs 23 At-Dt | 0.797 |
| A-D vs 24 At-Dt | 0.873 |
| A-D vs 25 At-Dt | 0.871 |
Code designations are the same as in Table 1.
r: Correlation coefficient.
** Correlation is significant at P<0.01 (2-tailed).
* Correlation is significant at P<0.05 (2-tailed).
Estimates of nucleotide diversity of A vs At and D vs Dt for eight COL genes in cotton according to their genomic sequences.
| Gene | A1-At | D5-Dt |
|---|---|---|
| combined sequence | 0.00586 | 0.01051 |
|
| 0.00605 | 0.00532 |
|
| 0.00624 | 0.00848 |
|
| 0.0039 | 0.00827 |
|
| 0.00678 | 0.01201 |
|
| 0.00161 | 0.00584 |
|
| 0.01124 | 0.00536 |
|
| 0.0048 | 0.00575 |
|
| 0.00631 | 0.02956 |
| average | 0.00587 | 0.01007 |
Taxa include 23 allotetraploids accessions and the two genome donors to the allotetraploid accessions. At = A genome from the allotetraploid cotton species; Dt = D genome from the allotetraploid cotton species; D5 = G. raimondii; A1 = G. herbaceum.
** P<0.01
Nucleotide polymorphism and neutrality tests of eight COL genes in cotton according to their ORF sequences.
| Gene | Accession | πtotal | πs | πa | TD | FD | FF |
|---|---|---|---|---|---|---|---|
|
| total | 0.00226 | 0.00479 | 0.00147 | −2.21 | −3.62 | −3.73 |
|
| 0.00304 | 0.00734 | 0.0017 | −2.00 | −2.33 | −2.54 | |
|
| 0.00059 | 0.00083 | 0.00052 | −1.51 | −1.68 | −1.82 | |
| wild | 0.00178 | 0.00748 | 0 | nd | Nd | nd | |
|
| total | 0.00385 | 0.00752 | 0.00272 | −1.01 | 0.06 | −0.31 |
|
| 0.00252 | 0.00477 | 0.00183 | −1.34 | −1.47 | −1.63 | |
|
| 0.00198 | 0.00396 | 0.00136 | 0.03 | −0.29 | −0.23 | |
| wild | 0.0077 | 0.01998 | 0.0039 | nd | Nd | nd | |
|
| total | 0.00335 | 0.00698 | 0.00232 | −0.47 | −1.52 | −1.41 |
|
| 0.00045 | 0 | 0.00059 | −0.78 | −0.33 | −0.49 | |
|
| 0.00089 | 0 | 0.00115 | 1.23 | 1.06 | 1.22 | |
| wild | 0.00681 | 0.01451 | 0.00462 | nd | Nd | nd | |
|
| total | 0.00232 | 0.00481 | 0.00161 | −0.13 | −1.57 | −1.33 |
|
| 0.00081 | 0.00144 | 0.00063 | −1.79 | −2.13 | −2.30 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00533 | 0.01055 | 0.00385 | nd | Nd | nd | |
|
| total | 0.00274 | 0.00214 | 0.00294 | 0.10 | −0.58 | −0.44 |
|
| 0.00097 | 0 | 0.00127 | 1.34 | 1.00 | 1.21 | |
|
| 0.00055 | 0 | 0.00072 | 1.40 | 0.84 | 1.07 | |
| wild | 0.00393 | 0.00277 | 0.00431 | nd | Nd | nd | |
|
| total | 0.00282 | 0.00559 | 0.00198 | 0.20 | −0.58 | −0.41 |
|
| 0.00136 | 0.00366 | 0.00066 | −0.73 | −0.76 | −0.85 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00459 | 0.00278 | 0.00517 | nd | Nd | nd | |
|
| total | 0.00198 | 0.00114 | 0.00224 | −1.40 | −2.41 | −2.46 |
|
| 0.00089 | 0.00158 | 0.00068 | −1.32 | −1.21 | −1.39 | |
|
| 0.00113 | 0 | 0.00148 | 0.03 | 0.23 | 0.20 | |
| wild | 0.00544 | 0.00291 | 0.00623 | nd | Nd | nd | |
|
| total | 0.00017 | 0 | 0.00023 | −1.51 | −2.13 | −2.26 |
|
| 0.00018 | 0 | 0.00024 | −1.13 | −1.29 | −1.40 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00067 | 0 | 0.00087 | nd | Nd | nd | |
|
| total | 0.00172 | 0.00287 | 0.0013 | 0.20 | −0.95 | −0.71 |
|
| 0.00069 | 0 | 0.0009 | 0.04 | −0.33 | −0.27 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00265 | 0.00833 | 0.00087 | nd | Nd | nd | |
|
| total | 0.00225 | 0.00434 | 0.00161 | −1.05 | −1.55 | −1.63 |
|
| 0.00152 | 0.00076 | 0.00176 | −0.40 | −0.08 | −0.18 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00463 | 0.01674 | 0.00087 | nd | Nd | nd | |
|
| total | 0.00236 | 0.00669 | 0.00106 | −0.87 | −2.25 | −2.15 |
|
| 0.00166 | 0.00399 | 0.00096 | −1.40 | −1.23 | −1.44 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00353 | 0.01209 | 0.00092 | nd | Nd | nd | |
|
| total | 0.00081 | 0.00308 | 0.00012 | −0.83 | −1.80 | −1.76 |
|
| 0.00019 | 0 | 0.00025 | −1.13 | −1.29 | −1.40 | |
|
| 0 | 0 | 0 | nd | Nd | nd | |
| wild | 0.00142 | 0.00612 | 0 | nd | Nd | nd | |
|
| total | 0.00191 | 0.00618 | 0.00066 | 0.37 | −0.66 | −0.42 |
|
| 0.00033 | 0 | 0.00043 | −1.43 | −1.66 | −1.80 | |
|
| 0.0005 | 0 | 0.00065 | 1.40 | 0.84 | 1.07 | |
| wild | 0.0018 | 0.00529 | 0.00078 | nd | nd | nd | |
|
| total | 0.00274 | 0.00564 | 0.00188 | −1.08 | −2.20 | −2.18 |
|
| 0.00312 | 0.0046 | 0.00267 | −1.44 | 1.71 | −1.86 | |
|
| 0 | 0 | 0 | nd | nd | nd | |
| wild | 0.0012 | 0.00264 | 0.00078 | nd | nd | nd | |
|
| total | 0.00341 | 0.00671 | 0.00241 | 0.37 | −1.08 | −0.76 |
|
| 0.00119 | 0.00203 | 0.00093 | −1.46 | −1.44 | −1.63 | |
|
| 0 | 0 | 0 | nd | nd | nd | |
| wild | 0.00378 | 0.00647 | 0.00299 | nd | nd | nd | |
|
| total | 0.00394 | 0.00366 | 0.00405 | 0.57 | 0.51 | 0.62 |
|
| 0.00324 | 0.00637 | 0.00227 | 1.59 | 1.33 | 1.57 | |
|
| 0 | 0 | 0 | nd | nd | nd | |
| wild | 0.00378 | 0 | 0.00498 | nd | nd | nd | |
| average | total | 0.00241 | 0.00451 | 0.00179 | −0.54725 | −1.39592 | −1.33363 |
|
| 0.00139 | 0.00228 | 0.00111 | −0.83652 | −0.72036 | −1.02529 | |
|
| 0.00035 | 0.00030 | 0.00037 | 0.16132 | 0.06295 | 0.09409 | |
| wild | 0.00369 | 0.00742 | 0.00257 | nd | nd | nd |
TD, the Tajima’s D; FD, Fu and Li’s D; FF, Fu and Li’s F.
nd, not determined (not implemented yet).
# P<0.1
* P<0.05
** P<0.01.
Nucleotide polymorphism of B-box, Var and CCT domains for eight COL genes in allotetraploid cotton.
| Gene | B-box | Var | CCT |
|---|---|---|---|
|
| 0.00104 | 0.00213 | 0.00251 |
|
| 0.0016 | 0.00493 | 0.00576 |
|
| 0.00696 | 0.00314 | 0 |
|
| 0.00311 | 0.00269 | 0 |
|
| 0.00406 | 0.00145 | 0.00067 |
|
| 0.00357 | 0.002 | 0.00135 |
|
| 0.00435 | 0.0014 | 0.00067 |
|
| 0.00035 | 0.00016 | 0 |
|
| 0.00069 | 0.00285 | 0 |
|
| 0.0027 | 0.00149 | 0.00202 |
|
| 0.00073 | 0.00306 | 0.00533 |
|
| 0.00073 | 0.00151 | 0 |
|
| 0.00315 | 0.00236 | 0 |
|
| 0.00248 | 0.00376 | 0 |
|
| 0.0072 | 0.00022 | 0.00067 |
|
| 0.00267 | 0.00579 | 0 |
| average | 0.00284 | 0.00243 | 0.00119 |