| Literature DB >> 22666315 |
Li Lu1, Wenhao Yan, Weiya Xue, Di Shao, Yongzhong Xing.
Abstract
Plant height, heading date, and yield are the main targets for rice genetic improvement. Ghd7 is a pleiotropic gene that controls the aforementioned traits simultaneously. In this study, a rice germplasm collection of 104 accessions (Oryza sativa) and 3 wild rice varieties (O.rufipogon) was used to analyze the evolution and association of Ghd7 with plant height, heading date, and yield. Among the 104 accessions, 76 single nucleotide polymorphisms (SNPs) and six insertions and deletions were found within a 3932-bp DNA fragment of Ghd7. A higher pairwise π and θ in the promoter indicated a highly diversified promoter of Ghd7. Sixteen haplotypes and 8 types of Ghd7 protein were detected. SNP changes between haplotypes indicated that Ghd7 evolved from two distinct ancestral gene pools, and independent domestication processes were detected in indica and japonica varietals respectively. In addition to the previously reported premature stop mutation in the first exon of Ghd7, which caused phenotypic changes of multiple traits, we found another functional C/T mutation (SNP S_555) by structure-based association analysis. SNP S_555 is located in the promoter and was related to plant height probably by altering gene expression. Moreover, another seven SNP mutations in complete linkage were found to be associated with the number of spikelets per panicle, regardless of the photoperiod. These associations provide the potential for flexibility of Ghd7 application in rice breeding programs.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22666315 PMCID: PMC3364234 DOI: 10.1371/journal.pone.0034021
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Population structure for 104 accessions.
Three colors indicate the populations; red, green, and blue indicate the subpopulations 1, 2, and 3, respectively. Every accession is represented by a single vertical line with the lengths proportional to each of the subpopulations. The figure is created by STRUCTURE.
Summary of DNA polymorphic sites of Ghd7 genome.
| Parameter | Entire region | Promoter | 5′UTR | Exon 1 | Intron | Exon 2 | 3′UTR | |
| Length, bp | 3923 | 1263 | 210 | 444 | 1646 | 330 | 30 | |
| SNP sites | 76 | 45 | 0 | 8 | 19 | 4 | 0 | |
| InDels | 6 | 4 | 0 | 0 | 2 | 0 | 0 | |
| Whole population | ||||||||
| Π | 0.0062 | 0.0124 | – | 0.0039 | 0.0035 | 0.0038 | – | |
| θ | 0.0037 | 0.0069 | – | 0.0035 | 0.0022 | 0.0023 | – | |
| Tajima’s | 2.1730 | 2.5443 | – | 0.2990 | 1.6448 | 1.2658 | – | |
|
| ||||||||
| Π | 0.0028 | 0.0050 | – | 0.0027 | 0.0015 | 0.0030 | – | |
| θ | 0.0036 | 0.0068 | – | 0.0034 | 0.0019 | 0.0026 | – | |
| Tajima’s | −0.7306 | −0.8374 | – | −0.4747 | −0.6505 | 0.3128 | – | |
|
| ||||||||
| Π | 0.0013 | 0.0029 | – | 0.0006 | 0.0006 | 0.0004 | – | |
| θ | 0.0033 | 0.0066 | – | 0.0016 | 0.0021 | 0.0014 | – | |
| Tajima’s | −2.1742 | −1.9697 | – | −1.2926 | −2.2250 | −1.3005 | – | |
π, average number of nucleotide differences per site between two sequences; θ, Watterson estimator; Tajima’s D, test for neutral selection.
Significant at P<0.05;
significant at P<0.01.
Figure 2Haplotype analysis of the Ghd7 gene region in the 104 cultivars.
(a) The Ghd7 containing two exons (indicated in gray) and the entire length of the 3923-bp genome is shown in graphics on the top. The position of every SNP is shown in the first row (SNP frequency>5%). Twelve haplotypes (H0–H11) were detected in the 104 cultivars of O. sativa, which can be divided into an indica group (ind-G) and a japonica group (jap-G) based on the population structure analysis. The number of cultivars (cvs) in every subpopulation is shown in the right columns: Q1 indicates the indica population and Q2 and Q3 indicate the japonica population. Yellow represents polymorphisms characteristic of the indica haplogroup, light blue shows the japonica haplogroup polymorphisms. Red indicates the new mutation. WR1–3 indicates the three wild rice varieties of O.rufipogon. (b) Phylogenetic tree of the twelve haplotypes (H0–H11).
Figure 3Protein diversity of Ghd7.
The two exons (indicated in black rectangle) and the 5′ and 3′ UTRs (indicated in white rectangle) of Ghd7 are shown in graphics on the top. The first row indicates the position of the SNPs, the last row reveals the amino acid change. Gray indicates synonymous SNP. Eight types of Ghd7 protein were identified. Ghd7-0 was a permutation type. Hap indicates the haplotypes that share the same protein type. The numbers in the right column are the numbers of cultivars (cvs) represented in every protein type.
Comparison of means of three traits among the major 4 protein types.
| Ghd7-0 | Ghd7-1 | Ghd7-2 | Ghd7-4 | |||||||
| Means±SD | N | Means±SD | N | Means±SD | N | Means±SD | N |
|
| |
| PH(2007LD) | 77.7±10.1a | 2 | 130.4±31.8b | 40 | 121.8±29.6b | 39 | 125.5±27.3b | 17 | 2.2243 | 0.0904 |
| HD(2007LD) | 48.0±1.9a | 2 | 89.6±13.9b | 40 | 86.5±11.5b | 39 | 81.4±3.9b | 17 | 9.1804 | 0.0000 |
| SPP(2007LD) | 49.2±17.4a | 2 | 162.7±55.4b | 39 | 146.2±44.7b | 33 | 200.0±31.3b | 10 | 6.5769 | 0.0005 |
| PH(2010LD) | − | − | 118.8±32.7a | 31 | 119.5±33.0a | 31 | 111.9±26.9a | 17 | 0.3501 | 0.7057 |
| HD(2010LD) | − | − | 83.6±14.4a | 31 | 84.7±14.0a | 31 | 75.4±3.9b | 17 | 3.2020 | 0.0462 |
| SPP(2010LD) | − | − | 173.3±48.9a | 25 | 143.8±42.2b | 23 | 189.5±42.3a | 17 | 5.4496 | 0.0066 |
| PH(2010SD) | 73.5±8.6a | 2 | 112.5±27.3b | 38 | 114.1±29.9b | 36 | 114.3±31.0b | 17 | 1.2695 | 0.2897 |
| HD(2010SD) | 57.1±1.9a | 2 | 83.8±9.2b | 38 | 80.7±9.4b | 36 | 79.9±3.7b | 17 | 6.6881 | 0.0004 |
| SPP(2010SD) | 55.5±11.3a | 2 | 157.3±45.7b | 28 | 139.6±57.8b | 33 | 178.3±26.6b | 16 | 5.1143 | 0.0028 |
The first line indicates the main 4 protein types. SD, standard deviation; N, number of cultivars tested. Means followed by different letters each row are significantly different at P = 0.05 within one environment. F ratio and probability based on one-way analysis of variance.
Results of GLM association of SNP traits.
| Trait | PH | HD | SPP | |||
| Site |
|
|
|
|
|
|
|
| ||||||
| S_555 | 0.0019 | 0.0998 | – | – | 0.0235 | 0.0555 |
| 7 SNP in LD | – | – | 0.0125 | 0.0657 | 0.0178 | 0.0605 |
| 10 SNP in LD | – | – | – | – | 0.0097 | 0.0716 |
|
| ||||||
| S_555 | 0.0367 | 0.0582 | – | – | – | – |
| 7 SNP in LD | – | – | 0.0329 | 0.0581 | 0.0141 | 0.0664 |
|
| ||||||
| S_555 | 0.0033 | 0.0975 | – | – | – | – |
| 7 SNP in LD | – | – | – | – | 0.0068 | 0.0833 |
| 10 SNP in LD | – | – | – | – | 0.0183 | 0.0639 |
Result of structure-based association mapping (P<0.05) of haplotypes H1–H5, by GLM analysis of TASSEL. R, the total variation explained by the SNP.
7 SNPs in LD:S_194, S_278, S_968, S_1804, S_1808, S_3207, and S_3635. They were in complete linkage disequilibrium and gathered in haplotype H4.
10 SNP in LD:S_30, S_58, S_207, S_392, S_857, S_876, S_2652, S_3252, S_3346, and S_3815. They were in complete linkage disequilibrium and they were the introgressed SNPs transferred from japonica to indica.
Comparison of means between different haplotypes in the three traits.
| Trait | PH | HD | SPP | |||
| Hap. | Means±SD | N | Means±SD | N | Means±SD | N |
|
| ||||||
| H2(2007LD) | 111.8±16.1a | 21 | 85.9±9.4a | 21 | 169.2±62.4ab | 21 |
| H3(2007LD) | 149.2±33.8b | 17 | 94.4±18.0b | 17 | 147.7±43.3a | 16 |
| H4(2007LD) | 125.5±27.3a | 17 | 81.4±3.9a | 17 | 200.0±31.3b | 10 |
| H2(2010LD) | 103.0±20.5a | 16 | 81.4±11.9ab | 16 | 160.0±38.9a | 15 |
| H3(2010LD) | 135.4±36.5b | 14 | 85.8±17.4a | 14 | 189.0±59.1a | 9 |
| H4(2010LD) | 111.9±26.9a | 17 | 75.4±3.9b | 17 | 189.5±42.3a | 17 |
| H2(2010SD) | 97.8±14.8a | 21 | 83.6±6.6a | 21 | 152.9±34.4a | 15 |
| H3(2010SD) | 127.9±29.1b | 15 | 84.2±12.7a | 15 | 154.3±58.4a | 11 |
| H4(2010SD) | 114.3±31.0ab | 17 | 80.0±3.7a | 17 | 178.3±26.6a | 16 |
|
| ||||||
| H1(2007LD) | 122.9±32.4a | 28 | 88.3±12.8a | 28 | 143.8±51.4a | 23 |
| H5(2007LD) | 119.1±21.9a | 11 | 81.9±5.2a | 11 | 151.7±24.7a | 10 |
| H1(2010LD) | 120.5±37.5a | 22 | 86.8±15.6a | 22 | 128.0±31.4a | 14 |
| H5(2010LD) | 117.0±19.6a | 9 | 79.3±7.6a | 9 | 168.4±46.6b | 9 |
| H1(2010SD) | 112.5±33.0a | 26 | 79.9±10.6a | 26 | 141.3±65.6a | 24 |
| H5(2010SD) | 118.2±20.5a | 10 | 82.8±5.0a | 10 | 135.1±31.1a | 9 |
Hap, haplotype; SD, standard deviation; N, number of cultivars tested. Within an environment, means followed by different letters are significantly different at P = 0.05.
Comparison of expression levels in H1–H5.
| Hap. | Protein type | S_555 | Mean ± SD | N |
| H2 | Ghd7-1 | T | 1.45±1.64a | 21 |
| H3 | Ghd7-1 | C | 2.86±1.76b | 16 |
| H4 | Ghd7-4 | C | 1.49±0.76a | 10 |
| H1 | Ghd7-2 | C | 1.63±0.98a | 24 |
| H5 | Ghd7-2 | C | 1.48±0.79a | 10 |
Hap, haplotype; SD, standard deviation; N, number of cultivars tested; characters not connected by the same letter are significantly different at P = 0.05.
Spearman correlation analysis between expression level and three traits in three planting tests.
| PH | HD | SPP | |||||||
| N | R |
| N | R |
| N | R |
| |
| 2007LD | 37 | 0.49 | 0.002 | 37 | 0.07 | 0.70 | 37 | −0.26 | 0.12 |
| 2010LD | 29 | 0.37 | 0.050 | 29 | −0.07 | 0.74 | 23 | −0.03 | 0.88 |
| 2010SD | 35 | 0.47 | 0.005 | 35 | −0.06 | 0.73 | 25 | −0.25 | 0.23 |
N, number of varieties tested; R, correlation coefficient.