| Literature DB >> 20354847 |
Marie-Claire Namroud1, Carine Guillet-Claude, John Mackay, Nathalie Isabel, Jean Bousquet.
Abstract
Genes involved in transcription regulation may represent valuable targets in association genetics studies because of their key roles in plant development and potential selection at the molecular level. Selection and demographic signatures at the sequence level were investigated for five regulatory genes belonging to the knox-I family (KN1, KN2, KN3, KN4) and the HD-Zip III family (HB-3) in three Picea species affected by post-glacial recolonization in North America and Europe. To disentangle neutral and selective forces and estimate linkage disequilibrium (LD) on a gene basis, complete or nearly complete gene sequences were analysed. Nucleotide variation within species, haplotype structure, LD, and neutrality tests, in addition to coalescent simulations based on Tajima's D and Fay and Wu's H, were estimated. Nucleotide diversity was generally low in all species (average pi = 0.002-0.003) and much heterogeneity was seen in LD and selection signatures among genes and species. Most of the genes harboured an excess of both rare and frequent alleles in the three species. Simulations showed that this excess was significantly higher than that expected under neutrality and a bottleneck during the Last Glacial Maximum followed by population expansion at the Pleistocene/Holocene boundary or shortly after best explains the correlated sequence patterns. These results indicate that despite recent large demographic changes in the three boreal species from two continents, species-specific selection signatures could still be detected from the analysis of nearly complete regulatory gene sequences. Such different signatures indicate differential subfunctionalization of gene family members in the three congeneric species.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20354847 PMCID: PMC2874021 DOI: 10.1007/s00239-010-9335-1
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 2.395
Estimates of nucleotide and indel diversity for the knox-I and HB-3 genes of P. glauca
| Gene | Coding region | Noncoding region | Silent sitesa | Total | |
|---|---|---|---|---|---|
| Synonymous | Nonsynonymous | ||||
|
| |||||
| No of sitesc | 244.2 | 889.8 | 1129 | 1373.2 | 2263 |
| SNPs (singletons)d | 7 | 16 (4) | 29 (8) | 36 (8) | 52 (12) |
| Indels (size)e | 0 | 0 | 6 (Σ = 66 bp: 41, 19, 3, 1, 1, 1) | 6 (Σ = 66 bp: 41, 19, 3, 1, 1, 1) | 6 (Σ = 66 bp: 41, 19, 3, 1, 1, 1) |
| π | 0.00150 | 0.00060 | 0.00461 | 0.00403 | 0.00264 |
| θ | 0.00492 | 0.00308 | 0.00468 | 0.00472 | 0.00406 |
|
| |||||
| No of sites | 247.7 | 889.3 | 2363 | 2610.7 | 3500 |
| SNPs (singletons) | 13 | 8 (5) | 67 (13) | 80 (13) | 88 (18) |
| Indels (size) | 0 | 1 (3 bp) | 14 (Σ = 26 bp: 7, 5 3, 2, 8 × 1, 1) | 14 (Σ = 26 bp: 7, 5, 3, 2, 8 × 1, 1) | 15 (Σ = 29 bp: 7, 5, 2 × 3, 2, 8 × 1, 1) |
| π | 0.00909 | 0.00060 | 0.00382 | 0.00433 | 0.00338 |
| θ | 0.00900 | 0.00155 | 0.00492 | 0.00531 | 0.00435 |
|
| |||||
| No of sites | 273.7 | 989.3 | 841 | 1114.7 | 2104 |
| SNPs (singletons) | 11 (3) | 3 (1) | 24 (9) | 35 (12) | 38 (13*) |
| Indels (size) | 0 | 3 (Σ = 15 bp: 6, 6, 3) | 4 (Σ = 41 bp: 25, 8, 7, 1) | 4 (Σ = 41 bp:25, 8, 7, 1) | 7 (Σ = 56 bp: 25, 8, 7, 6, 6, 3, 1) |
| π | 0.00349 | 0.00006 | 0.00366 | 0.00361 | 0.00192 |
| θ | 0.00689 | 0.00053 | 0.00514 | 0.00559 | 0.00318 |
|
| |||||
| No of sites | 245.6 | 942.4 | 1400 | 1645.6 | 2588 |
| SNPs (singletons) | 7 (2) | 5 (2) | 52 (10) | 59 (12) | 64 (14) |
| Indels (size) | 0 | 2 (Σ = 9 bp: 3, 6) | 4 (Σ = 32 bp: 28, 2, 2 × 1) | 4 (Σ = 32 bp: 28, 2, 2 × 1) | 6 (Σ = 41 bp: 28, 9, 2, 2 × 1) |
| π | 0.00349 | 0.00016 | 0.00543 | 0.00513 | 0.00331 |
| θ | 0.00489 | 0.00092 | 0.00652 | 0.00626 | 0.00431 |
|
| |||||
| No of sites | 611.2 | 1914.8 | 3354 | 3965.2 | 5880 |
| SNP (singletons) | 13 | 2 (2) | 80 (21) | 93 (21) | 95 (23) |
| Indel (size) | 0 | 0 | 9 (Σ = 18 bp: 6, 4, 2, 6 × 1) | 9 (Σ = 18 bp: 6, 4, 2, 6 × 1) | 9 (Σ = 18 bp: 6, 4, 2, 6 × 1) |
| π | 0.00456 | 0.00002 | 0.00351 | 0.00368 | 0.00248 |
| θ | 0.00397 | 0.00019 | 0.00447 | 0.00440 | 0.00302 |
| Total ( | |||||
| No of sites | 1622.4 | 5625.6 | 9087 | 10709.4 | 16335 |
| SNPs (singletons) | 51 (5) | 34 (14) | 252 (61) | 303 (66) | 337 (80) |
| Indels (total size) | 0 | 6 (27 bp) | 37 (183 bp) | 37 (183 bp) | 43 (210 bp) |
| Weighted average | |||||
| π | 0.00445 | 0.00023 | 0.00404 | 0.00410 | 0.00275 |
| θ | 0.00551 | 0.00104 | 0.00499 | 0.00507 | 0.00367 |
aSilent sites include synonymous and noncoding sites
b n is the number of sequences in the sample
cThe number of sequenced sites includes gaps induced by indels
dThe total number of substitutions with the number of singletons indicated in parentheses
eThe total number of indels with the size of indels indicated in parentheses and indels corresponding to singletons in italics
* P < 0.05 corresponding to the P value for obtaining a number of SNP singletons (under the neutral coalescent process) equal or higher than that observed
Estimates of nucleotide and indel diversity for the knox-I and HB-3 genes of P. mariana
| Gene | Coding region | Noncoding region | Silent sitesa | Total | |
|---|---|---|---|---|---|
| Synonymous | Nonsynonymous | ||||
|
| |||||
| No of sitesc | 243.8 | 890.2 | 1068 | 1311.8 | 2202 |
| SNPs (singletons)d | 6 (1) | 8 (6) | 20 (8) | 26 (9) | 34 (15*) |
| Indels (size)e | 0 | 0 | 2 (Σ = 4 bp: 3, 1) | 2 (Σ = 4 bp: 1, 3) | 2 (Σ = 4 bp: 1, 3) |
| π | 0.00654 | 0.00093 | 0.00229 | 0.00308 | 0.00221 |
| θ | 0.00545 | 0.00199 | 0.00416 | 0.00440 | 0.00342 |
|
| |||||
| No of sites | 248.0 | 889.0 | 2350 | 2598.0 | 3487 |
| SNPs (singletons) | 6 (1) | 4 (4) | 32 (12) | 38 (13) | 42 (17*) |
| Indels (size) | 0 | 0 | 5 (Σ = 8 bp: 2 × 1, 3, 2, 1) | 5 (Σ = 8 bp: 2 × 1, 3, 2, 1) | 5 (Σ = 8 bp: 2 × 1, 3, 2, 1) |
| π | 0.00351 | 0.00017 | 0.00280 | 0.00286 | 0.00218 |
| θ | 0.00535 | 0.00100 | 0.00302 | 0.00325 | 0.00267 |
|
| |||||
| No of sites | 275.0 | 985.0 | 841 | 1116.0 | 2101 |
| SNPs (singletons) | 8 (1) | 2 (1) | 14 (4) | 22 (5) | 24 (6) |
| Indels (size) | 0 | 1 (6 bp) | 1 (2 bp) | 1 (2 bp) | 2 (8 bp: 6, 2) |
| π | 0.00580 | 0.00012 | 0.00344 | 0.00402 | 0.00219 |
| θ | 0.00644 | 0.00045 | 0.00370 | 0.00437 | 0.00254 |
|
| |||||
| No of sites | 245.5 | 942.5 | 1400 | 1645.5 | 2588 |
| SNPs (singletons) | 10 (2) | 4 (2) | 30 (8) | 40 (10) | 44 (12) |
| Indels (size) | 0 | 2 (Σ = 6 bp: 2 × 3) | 3 (Σ = 4 bp: 2, 1, 1) | 3 (Σ = 4 bp: 2, 1, 1) | 5 (Σ = 10 bp: 2 × 3, 2, 1, 1) |
| π | 0.00836 | 0.00036 | 0.00333 | 0.00408 | 0.00273 |
| θ | 0.00901 | 0.00095 | 0.00476 | 0.00539 | 0.00377 |
|
| |||||
| No of sites | 611.2 | 1914.8 | 3352 | 3963.2 | 5878 |
| SNP (singletons) | 9 (1) | 1 | 64 (15) | 73 (16) | 74 (16) |
| Indel (size) | 0 | 0 | 1 (6 bp) | 1 (6 bp) | 1 (6 bp) |
| π | 0.00155 | 0.00014 | 0.00295 | 0.00251 | 0.00174 |
| θ | 0.00326 | 0.00012 | 0.00443 | 0.00408 | 0.00279 |
| Total ( | |||||
| No of sites | 1623.4 | 5621.6 | 9011 | 10634.4 | 16256 |
| SNPs (singletons) | 39 (6) | 19 (13) | 160 (47) | 199 (53) | 218 (66) |
| Indels (total size) | 0 | 3 (12 bp) | 12 (24 bp) | 12 (24 bp) | 15 (36 bp) |
| Weighted average | |||||
| π | 0.00435 | 0.00030 | 0.00294 | 0.00307 | 0.00211 |
| θ | 0.00532 | 0.00075 | 0.00401 | 0.00415 | 0.00297 |
aSilent sites include synonymous and noncoding sites
b n is the number of sequences in the sample
cThe number of sequenced sites includes gaps induced by indels
dThe total number of substitutions with the number of singletons indicated in parentheses
eThe total number of indels with the size of indels indicated in parentheses and indels corresponding to singletons in italics
* P < 0.05 corresponding to the P value for obtaining a number of SNP singletons (under the neutral coalescent process) equal or higher than that observed
Estimates of nucleotide and indel diversity for the knox-I and HB-3 genes of P. abies
| Gene | Coding region | Noncoding region | Silent sitesa | Total | |
|---|---|---|---|---|---|
| Synonymous | Nonsynonymous | ||||
|
| |||||
| No of sitesc | 244.2 | 889.8 | 1121 | 1365.2 | 2255 |
| SNPs (singletons)d | 10 (7) | 3 (2) | 35 (14) | 45 (21) | 48 (23*) |
| Indels (size)e | 0 | 0 | 3 (Σ = 56 bp: 54, 1, 1) | 3 (Σ = 56 bp: 54, 1, 1) | 3 (Σ = 56 bp: 54, 1, 1) |
| π | 0.00395 | 0.00019 | 0.00438 | 0.00430 | 0.00264 |
| θ | 0.00932 | 0.00077 | 0.00748 | 0.00782 | 0.00496 |
|
| |||||
| No of sites | 247.9 | 889.1 | 2351 | 2598.9 | 3488 |
| SNPs (singletons) | 5 (2) | 5 (2) | 34 (18) | 39 (20) | 44 (22***) |
| Indels (size) | 0 | 0 | 9 (Σ = 49 bp: 25, 11, 5, 2 × 2, 4 × 1) | 9 (Σ = 49 bp: 25, 11, 5, 2 × 2, 4 × 1) | 9 (Σ = 49 bp: 25, 11, 5, 2 × 2, 4 × 1) |
| π | 0.00536 | 0.00088 | 0.00193 | 0.00226 | 0.00191 |
| θ | 0.00459 | 0.00128 | 0.00336 | 0.00348 | 0.00291 |
|
| |||||
| No of sites | 275.0 | 982.0 | 842 | 1117.0 | 2099 |
| SNPs (singletons) | 6 (4) | 3 (3) | 16 (8) | 22 (12) | 25 (15**) |
| Indels (size) | 0 | 1 (3 bp) | 5 (Σ = 7 bp: 2, 2, 3 × 1) | 5 (Σ = 7 bp: 2, 2, 3 × 1) | 6 (Σ = 10 bp: 3, 2, 2, 3 × 1) |
| π | 0.00140 | 0.00013 | 0.00150 | 0.00148 | 0.00085 |
| θ | 0.00496 | 0.00070 | 0.00436 | 0.00451 | 0.00272 |
|
| |||||
| No of sites | 246.5 | 938.5 | 1402 | 1648.5 | 2587 |
| SNPs (singletons) | 2 | 5 (2) | 39 (10) | 41 (10) | 46 (12) |
| Indels (size) | 0 | 0 | 5 (Σ = 13 bp: 5, 4, 2, 2 × 1) | 5 (Σ = 13 bp: 5, 4, 2, 2 × 1) | 5 (Σ = 13 bp: 5, 4, 2, 2 × 1) |
| π | 0.00069 | 0.00100 | 0.00587 | 0.00509 | 0.00359 |
| θ | 0.00185 | 0.00121 | 0.00639 | 0.00570 | 0.00406 |
|
| |||||
| No of sites | 611.2 | 1914.8 | 3360 | 3971.2 | 5886 |
| SNP (singletons) | 8 (5) | 1 (1) | 63 (28) | 71 (33) | 72 (34*) |
| Indel (size) | 0 | 0 | 7 (Σ = 11 bp: 4, 2,1, 1, 3 × 1) | 7 (Σ = 11 bp: 4, 2,1, 1, 3 × 1) | 7 (Σ = 11 bp: 4, 2,1, 1, 3 × 1) |
| π | 0.00102 | 0.00002 | 0.00182 | 0.00170 | 0.00115 |
| θ | 0.00298 | 0.00012 | 0.00428 | 0.00408 | 0.00279 |
| Total ( | |||||
| No of sites | 1624.7 | 5614.3 | 9076 | 10700.7 | 16315 |
| SNPs (singletons) | 31 (18) | 17 (10) | 187 (78) | 218 (96) | 235 (106) |
| Indels (total size) | 0 | 1 (3 bp) | 29 (136 bp) | 29 (136 bp) | 30 (139 bp) |
| Weighted average | |||||
| π | 0.00214 | 0.00037 | 0.00276 | 0.00267 | 0.00186 |
| θ | 0.00434 | 0.00069 | 0.00477 | 0.00470 | 0.00331 |
aSilent sites include synonymous and noncoding sites
b n is the number of sequences in the sample
cThe number of sequenced sites includes gaps induced by indels
dThe total number of substitutions with the number of singletons indicated in parentheses
eThe total number of indels with the size of indels indicated in parentheses and indels corresponding to singletons in italics
* P < 0.05, ** P < 0.01, *** P < 0.001 corresponding to the P value for obtaining a number of SNP singletons (under the neutral coalescent process) equal or higher than that observed
Fig. 1Linkage disequilibrium (LD) decay over distance measured by the squared correlations of allele frequencies (r 2) between polymorphic sites plotted against the distance in base pairs in the five regulatory genes KN1 to KN4 and HB-3 of: a P. glauca, b P. mariana, and c P. abies. In P. glauca, LD decay of KN1 overlaps to a great extent with that of KN4, while in P. mariana it overlaps with that of KN3
Recombination, haplotype structure, and neutrality tests based on haplotype frequency distributions
|
| ρ | Haplotype number ( | Haplotype diversity ( | Tajima’s | Fay and Wu’s | |
|---|---|---|---|---|---|---|
|
| ||||||
| | 0.18 | 0.0049 | 63† | 0.95 | −1.05 | −4.19 |
| | 0.20 | 0.0034 | 123† | 0.99† | −0.69 | −2.49 |
| | 0.11 | 0.0228 | 53 | 0.93** | −1.15* | −4.70 |
| | 0.17 | 0.0046 | 103† | 0.97 | −0.70 | −1.81 |
| | 0.12 | 0.0017 | 47 | 0.92 | −0.83 | −2.80 |
|
| ||||||
| | 0.09 | 0.0064 | 30† | 0.97 | −1.18 | −4.83 |
| | 0.10 | 0.0034 | 33† | 0.97 | −0.63 | 1.43 |
| | 0.13 | 0.0252 | 27 | 0.96 | −0.44 | −0.97 |
| | 0.18 | 0.0386 | 43 | 0.99 | −0.95** | −3.41 |
| | 0.08 | 0.0005 | 22 | 0.78*** | −1.32 | −4.84 |
|
| ||||||
| | 0.15 | 0.0266 | 25** | 0.90*** | −1.64*** | −6.28 |
| | 0.14 | 0.0169 | 36 | 0.99 | −1.14** | −8.50* |
| | 0.04 | 0.0467 | 22*** | 0.79*** | −2.28*** | −3.13 |
| | 0.16 | 0.0251 | 37 | 0.99 | −0.40 | −5.96 |
| | 0.10 | 0.0024 | 31 | 0.98 | −2.08*** | −6.85 |
a R M is the minimum number of recombination events (Hudson and Kaplan 1985) divided by the number of informative sites
bρ is the maximum-likelihood estimator of the recombination parameter based on independent linked pairs of sites (Hudson 2001)
* P < 0.05, ** P < 0.01, *** P < 0.001 that observed values are lower than expected based on 1,000 coalescent simulations using the DnaSP program
† P < 0.05 that observed values are higher than expected based on 1,000 coalescent simulations using the DnaSP program
Mean values of Tajima’s D and Fay and Wu’s H for the three Picea species observed in the analysed data and expected under a standard neutral model (SNM) and a bottleneck model followed by population expansion (BottExp)
|
|
|
| ||||
|---|---|---|---|---|---|---|
| Mean Tajima’s | Mean Fay and Wu’s | Mean Tajima’s | Mean Fay and Wu’s | Mean Tajima’s | Mean Fay and Wu’s | |
| Observed | −0.88 | −3.20 | −0.90 | −2.52 | −1.49 | −6.14 |
| SNM | −0.02 ( | 0.01 ( | −0.02 ( | 0.04 ( | −0.01 ( | 0.04 ( |
| BottExpa | −0.92 ( | −6.89 ( | −0.76 ( | −5.50 ( | −0.75 ( | −5.25 ( |
A model is considered to fit empirical data if the P values of all the parameters considered (mean and variances of D and H) are higher than 0.05 (no significant difference)
aThe bottleneck model that best fit the observed data for each of the three species occurred during the LGM around 25,000 years ago and ended about 17,000 years ago when populations started expanding. Other near-best models suggested that the expansion phase might have been delayed to the Holocene for P. glauca and P. abies (see “Results”)