| Literature DB >> 22606277 |
Ghayas Lakis1, Miguel Navascués, Samah Rekima, Mathieu Simon, Marie-Stanislas Remigereau, Magalie Leveugle, Najat Takvorian, Françoise Lamy, Frantz Depaulis, Thierry Robert.
Abstract
BACKGROUND: Pearl millet landraces display an important variation in their cycle duration. This diversity contributes to the stability of crop production in the Sahel despite inter-annual rainfall fluctuation. Conservation of phenological diversity is important for the future of pearl millet improvement and sustainable use. Identification of genes contributing to flowering time variation is therefore relevant. In this study we focused on three flowering candidate genes, PgHd3a, PgDwarf8 and PgPHYC. We tested for signatures of past selective events within polymorphism patterns of these three genes that could have been associated with pearl millet domestication and/or landraces differentiation. In order to implement ad hoc neutrality tests, a plausible demographic history of pearl millet domestication was inferred through Approximate Bayesian Computation by using eight neutral STS loci.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22606277 PMCID: PMC3351476 DOI: 10.1371/journal.pone.0036642
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genetic diversity and tests for strict neutrality for the eight STS loci.
| Sample size | Length | S | Sinb | Specific sitesc | πd (10−3) | θd (10−3) | LSine | Dom/Wild π ratio | Tajima’s Df | L groupg | ||
| STS 306 | Dom. | 51 | 563 | 5 | 2 | 1 | 2.13 | 1.97 | 2 | 0.72 | 0.20 ns | 4 |
| Wild | 17 | 563 | 5 | 1 | 1 | 2.95 | 2.63 | 1 | 0.40 ns | |||
| STS 344 | Dom. | 43 | 710 | 33 | 15 | 20 | 10.1 | 11 | 12 | 1.06 | −0.27 ns | 4 |
| Wild | 12 | 710 | 20 | 6 | 6 | 9.5 | 9.5 | 2 | −0.02 ns | |||
| STS 359 | Dom. | 49 | 440 | 15 | 6 | 8 | 7.43 | 8.86 | 6 | 0.78 | −0.20 ns | 2 |
| Wild | 18 | 440 | 15 | 6 | 6 | 9.53 | 10.72 | 4 | −0.46 ns | |||
| STS 476 | Dom. | 61 | 752 | 10 | 5 | 6 | 2.09 | 2.86 | 5 | 0.52 | −0.74 ns | 3 |
| Wild | 15 | 752 | 14 | 8 | 9 | 4.03 | 5.92 | 4 | −1.26 ns | |||
| STS 521 | Dom. | 46 | 617 | 29 | 16 | 16 | 5.34 | 11.56 | 11 | 0.86 | −1.80* | 5 |
| Wild | 19 | 617 | 25 | 21 | 10 | 6.18 | 12.35 | 10 | −1.95* | |||
| STS 713 | Dom. | 51 | 1065 | 30 | 18 | 20 | 2.87 | 6.62 | 12 | 0.44 | −1.87* | 6 |
| Wild | 22 | 1065 | 32 | 16 | 19 | 6.45 | 8.25 | 9 | −0.83 ns | |||
| STS 738 | Dom. | 29 | 1995 | 64 | 47 | 48 | 4.93 | 8.32 | 21 | 0.92 | −1.54 ns | 2 |
| Wild | 12 | 1995 | 42 | 26 | 24 | 5.33 | 7.07 | 10 | −1.12 ns | |||
| STS 870 | Dom. | 49 | 744 | 36 | 18 | 28 | 8 | 12.87 | 13 | 1.4 | −1.19 ns | 6 |
| Wild | 20 | 744 | 23 | 14 | 9 | 5.78 | 8.94 | 7 | −1.36 ns | |||
| Average | Dom. | 47 | 860 | 28 | 16 | 18 | 5.36 | 8 | 10 | 0.84 | −0.93 | |
| Wild | 17 | 860 | 22 | 12 | 11 | 6.22 | 8.17 | 6 | −0.83 |
S: number of segregating sites. bSin:number of singletons. cSpecific sites: sites polymorphic in one population but monomorphic in the other. dθ: Watterson estimator of sequence diversity per site, π: average number of differences per site between two sequences. eNumber of lineages with at least one singleton. fD values were calculated using Fabsim [50], P-values of D were calculated for the standard coalescent model (*: p<0.05; ns: not significant); P-values were computed without correction for multiple tests. glinkage group of the pearl millet genetic map on which the locus is located.
Figure 1Genetic structure of wild and domesticated populations.
A. Values of ΔK calculated by using the Evanno et al. (2005) method according to the number K of clusters B. Representation of the individual assignment probabilities to each of the 2 inferred clusters; individuals were arranged according to the estimated proportion of admixture in their genome.
Figure 2Posterior probability density distribution of the parameters of the demographic model inferred by ABC estimation.
Data shown was obtained for the scenario “with migration”. The prior (black line) is used as a reference for the posterior distribution (blue lines) obtained from the rejection followed by the non linear regression. Parameters T, M and rho are scaled to the wild population effective size New.
Estimates of demographic and genetic population parameters obtained from the demographic model “with migration.”
| Parameter | median | 95% HPD intervalb |
| θW | 0.0082 | 0.0042–0.0142 |
| θD0 | 9.79×10−5 | 6.05×10−7–0.0017 |
| θD1 | 0.1409 | 0.0060–4.5243 |
| T | 0.0704 | 0.0133–0.4524 |
| µ | 6.50×10−8 | 1.39×10−8–3.41×10−7 |
θW is the population mutation rate of the wild population, θD0 is the population mutation rate of the domestic population at domestication time, θD1 is the population mutation rate of the domestic population at present time, T is the time of domestication in units of 4NeW generations, and µ is the mutation rate per bp per generation. bHPD interval is the interval of parameter values with the highest posterior density.
Figure 3Structure of the pearl millet Hd3a orthologous and sequence similarity with PEBP family members.
A. structure of the PgHd3a gene, E:exon and I: intron; the 918 bp fragment used in the population genetics analysis is shown between the two dotted red lines; the cDNA sequence was used to complete the 3′ region of the original sequence and to obtain the protein sequence. B. Multiple sequence alignment of the proteins of PgHd3a, the orthologous of FT (wheat (TaFT), barley (HvFT) and rice (Hd3a)) and TFL1; in the black boxes: the Tyr/His(88) and the Gln/Asp(145) residues which are essential to distinguish FT and TFL1. C. A neighborhood joining tree of the PEBP protein family in pearl millet (PgHd3a), maize (ZCN), rice (Os, RFT1, Hd3a, Hd3b), wheat (Ta), barley (Hv) and Arabidopsis (FT). The tree was built with the Poisson corrective distance [96] model. Robustness of nodes was tested on the basis of 1000 Bootstrap iterations.
Genetic diversity and selective neutrality tests for PgHd3a.
| Sample | Length | S | Sinb | πc (10−3) | θc (10−3) | LSind | Dom/Wild | Tajima’s De | Fu & Li’s F*e | ||
| size | π ratio |
|
| ||||||||
|
| Dom. | 59 | 918 | 47 | 42 | 2.48 | 11.2 | 23 | 0.72 | −2.62** | −5.92** |
| nc:24 | nc:3.08 | nc:11.43 | |||||||||
| syn:8 | syn:3.01 | syn:15.41 | |||||||||
| nsyn:14 | nsyn:1.43 | nsyn:9.07 | |||||||||
| Early | 26 | 918 | 22 | 19 | 2.4 | 6.38 | 9 | −2.26* | −3.69* | ||
| nc:10 | nc:2.59 | nc:5.8 | |||||||||
| syn:5 | syn:4.07 | syn:11.72 | |||||||||
| nsyn:7 | nsyn:1.62 | nsyn:5.52 | |||||||||
| Late | 28 | 918 | 25 | 22 | 2.59 | 7.07 | 14 | −2.3* | −4.12* | ||
| nc:15 | nc:3.6 | nc:8.47 | |||||||||
| syn:3 | syn:1.87 | syn:6.74 | |||||||||
| nsyn:6 | nsyn:1.27 | nsyn:4.55 | |||||||||
| Wild | 14 | 918 | 17 | 11 | 3.47 | 5.88 | 7 | −1.69 ns | −1.69 ns | ||
| nc:14 | nc:5.99 | nc:9.68 | |||||||||
| syn:0 | syn:0 | syn:0 | |||||||||
| nsyn:3 | nsyn:1.27 | nsyn:2.79 |
S: number of segregating sites. bSin: number of singletons. cθ: Watterson estimator of sequence diversity per site, π: average number of differences per site between two sequences, the diversity of the candidate genes was estimated for the non-coding (nc), synonymous (syn) and non-synonymous (nsyn) sites. dNumber of lineages with at least one singleton. eD and F* were calculated for the simulation using parameters randomly drawn from the posterior distribution. f P-values were computed for unilateral tests as P< or > than the observed value according to the sign of the statistics without correction for multiple tests.
Genetic diversity and tests for selective neutrality for the two candidate genes: PgDwarf8 and PgPHYC.
| Sample | Length | S | Sinb | πc (10−3) | θc (10−3) | LSind | Dom/Wild | Tajima’s De | Fu & Li’s F*e | ||
| size | π ratio |
|
| ||||||||
|
| Dom. | 33 | 1233 | 46 | 40 | 3.15 | 9.73 | 15 | 0.28 | −2.47* | −4.48* |
| syn:19 | syn:4.97 | syn:16.23 | |||||||||
| nsyn:23 | nsyn:1.98 | nsyn:7.19 | |||||||||
| Early | 14 | 1233 | 14 | 11 | 2.4 | 3.7 | 7 | −1.39 ns | −2.08 ns | ||
| syn:2 | syn:1.9 | syn:2 | |||||||||
| nsyn:11 | nsyn:2.24 | nsyn:4.15 | |||||||||
| Late | 16 | 1233 | 32 | 29 | 3.81 | 8.02 | 7 | −2.18* | −3.17* | ||
| syn:15 | syn:6.69 | syn:14.59 | |||||||||
| nsyn:15 | nsyn:2.67 | nsyn:5.35 | |||||||||
| Wild | 12 | 1233 | 56 | 26 | 11.22 | 15.26 | 10 | −1.22 ns | −0.76 ns | ||
| syn:32 | syn:24.74 | syn:32.64 | |||||||||
| nsyn:24 | nsyn:6.38 | nsyn:9.05 | |||||||||
|
| Dom. | 38 | 823 | 24 | 13 | 5.67 | 6.96 | 6 | 1.45 | −0.63 ns | −1.89 ns |
| Early | 18 | 823 | 20 | 10 | 6.39 | 7.07 | 4 | −0.37 ns | −0.93 ns | ||
| Late | 17 | 823 | 10 | 2 | 4.78 | 3.6 | 1 | 1.2 ns | 0.81 ns | ||
| Wild | 15 | 823 | 13 | 6 | 3.91 | 4.86 | 3 | −0.77 ns | −0.73 ns |
S: number of segregating sites. bSin: number of singletons. cθ: Watterson estimator of sequence diversity per site, π: average number of differences per site between two sequences, the diversity of the candidate genes was estimated for the non-coding (nc), synonymous (syn) and non-synonymous (nsyn) sites. dNumber of lineages with at least one singleton. eD and F* were calculated for the simulation using parameters randomly drawn from the posterior distribution. f P-values were computed for unilateral tests as P< or > than the observed value according to the sign of the statistics without correction for multiple tests.