| Literature DB >> 23144662 |
Elena Mosca1, Andrew J Eckert, John D Liechty, Jill L Wegrzyn, Nicola La Porta, Giovanni G Vendramin, David B Neale.
Abstract
A candidate gene approach was used to identify levels of nucleotide diversity and to identify genes departing from neutral expectations in coniferous species of the Alpine European forest. Twelve samples were collected from four species that dominate montane and subalpine forests throughout Europe: Abies alba Mill, Larix decidua Mill, Pinus cembra L., and Pinus mugo Turra. A total of 800 genes, originally resequenced in Pinus taeda L., were resequenced across 12 independent trees for each of the four species. Genes were assigned to two categories, candidate and control, defined through homology-based searches to Arabidopsis. Estimates of nucleotide diversity per site varied greatly between polymorphic candidate genes (range: 0.0004-0.1295) and among species (range: 0.0024-0.0082), but were within the previously established ranges for conifers. Tests of neutrality using stringent significance thresholds, performed under the standard neutral model, revealed one to seven outlier loci for each species. Some of these outliers encode proteins that are involved with plant stress responses and form the basis for further evolutionary enquiries.Entities:
Keywords: candidate gene; neutrality tests; nucleotide diversity; single nucleotide polymorphisms
Year: 2012 PMID: 23144662 PMCID: PMC3492901 DOI: 10.1111/j.1752-4571.2012.00256.x
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Figure 1Sampling locations across species distribution in Europe.
Sequencing summary statistics for the candidate genes and the control genes for each species.
| Candidate | Control | |||||||
|---|---|---|---|---|---|---|---|---|
| Total genes | 70 | 61 | 171 | 190 | 32 | 35 | 109 | 120 |
| Length mean (bp) | ||||||||
| No. of samples | ||||||||
| Polymorphic genes (%) | 54.29 | 50.82 | 45.03 | 73.16 | 56.25 | 37.14 | 34.26 | 73.33 |
| Total no. of SNPs | 197 | 219 | 284 | 900 | 131 | 88 | 90 | 487 |
| No. of SNPs per gene | 3 ± 6 | 4 ± 9 | 2 ± 4 | 5 ± 9 | 4 ± 9 | 3 ± 7 | 1 ± 2 | 4 ± 7 |
| SNP frequency | 135 | 108 | 239 | 81 | 95 | 159 | 459 | 94 |
| Watterson’s θ | ||||||||
| Nucleotide diversity (π) | ||||||||
| Divergence | ||||||||
Mean values are reported in boldface type with their standard deviation.
Figure 2Mean values of (A) number of SNPs and (B) nucleotide diversity (θw, θπ) with the bootstrapped 95% confidence intervals and (C) distribution of nucleotide diversity in candidate genes for the different species. The box shows the upper and lower quartiles, while the line represents the median of the sample.
Levels of nucleotide polymorphism in the annotated candidate genes
| Species | Parameters | All | N-coding | N-Syn | Syn | All silent |
|---|---|---|---|---|---|---|
| Sites | ||||||
| 18.72% | 62.88% | 17.19% | 35.91% | |||
| Segregating sites | ||||||
| 15.75% | 43.15% | 41.10% | 56.85% | |||
| Watterson’s θ | ||||||
| 0.0133 | 0.0129 | 0.0097 | 0.0299 | |||
| Nucleotide diversity (π) | ||||||
| 0.0133 | 0.0013 | 0.0097 | 0.0294 | |||
| Sites | ||||||
| 17.62% | 63.47% | 17.56% | 35.18% | |||
| Segregating sites | ||||||
| 17.50% | 32.50% | 50.00% | 67.50% | |||
| Watterson’s θ | ||||||
| 0.0075 | 0.0021 | 0.0058 | 0.0206 | |||
| Nucleotide diversity (π) | ||||||
| 0.0075 | 0.0023 | 0.0058 | 0.0207 | |||
| Sites | ||||||
| 23.26% | 59.40% | 16.35% | 39.60% | |||
| Segregating sites | ||||||
| 31.08% | 28.83% | 40.09% | 71.17% | |||
| Watterson’s θ | ||||||
| 0.0057 | 0.0057 | 0.0035 | 0.0131 | |||
| Nucleotide diversity (π) | ||||||
| 0.0057 | 0.0057 | 0.0034 | 0.0131 | |||
| Sites | ||||||
| 20.14% | 61.75% | 17.17% | 37.32% | |||
| Segregating sites | ||||||
| 21.43% | 40.77% | 37.80% | 59.23% | |||
| Watterson’s θ | ||||||
| 0.0129 | 0.0068 | 0.0102 | 0.0261 | |||
| Nucleotide diversity (π) | ||||||
| 0.0128 | 0.0068 | 0.0101 | 0.0258 |
All, all sites; N-coding, noncoding sites; N-Syn, nonsynonymous sites; Syn, synonymous sites; All silent, all silent sites.
Numbers are the total number of sites and the total number of segregating size.
Bold numbers are the average across loci with their standard deviations in regular type.
Figure 3Mean values of (A) π for all sites for the annotated genes (θπ), at noncoding sites (θπ-nc), at nonsynonymous sites (θπa) (B) divergence for all sites (Dxy), divergence at nonsynonymous sites (Ka), divergence at synonymous sites (Ks), with the bootstrapped 95% confidence intervals and (C) distribution of divergence from loblolly pine across site categories in candidate genes. The box shows the upper and lower quartiles, while the line represents the median of the sample.
List of the outliers from the standard neutral model (SNM) across the candidate genes
| Species | Gene | Putative protein | DHEW- | Reference | |||||
|---|---|---|---|---|---|---|---|---|---|
| 0_18619 | Protein kinase family protein | 6 | 0.0207 | 0.0257 | 0.8835 | 0.8812 | 0.0622 | ||
| 0_2775 | spx domain-containing protein | 2 | 0.118 | 0.0773 | 0.8062 | 1 | 0.2949 | ||
| 0_8111 | 6-phosphogluconate dehydrogenase | 3 | 0.0798 | 0.0331 | 0.9009 | 1 | 0.1807 | ||
| 2_1528 | Reduced epidermal fluorescence 4 | 3 | 0.0491 | 0.0829 | 0.7472 | 0.5466 | 0.2256 | ||
| 2_6731 | E3 ubiquitin complex protein | 3 | 0.0451 | 0.0169 | 0.746 | 0.5456 | 0.2276 | ||
| CL1659Contig1 | Chloride channel-like protein | 4 | 0.0786 | 0.0533 | 0.9346 | 0.9552 | 0.1050 | ||
| CL1661Contig1 | Acetyl-CoA carboxylase 2 | 3 | 0.0453 | 0.0851 | 0.9167 | 0.8924 | 0.2272 | ||
| 0_13913 | Exocyst subunit EXO70 family protein | 4 | 0.1146 | 0.0803 | 0.9236 | 0.8508 | 0.0985 | ||
| 2_8627 | Carbon–sulfur lyase | 10 | 0.1131 | <0.0001 | 0.9846 | 1 | 0.0176 | ||
| 2_8852 | Galactokinase | 8 | 0.0155 | <0.0001 | 0.9919 | 1 | 0.0291 |
The P value of Kelly Zns (PZn) and the number of SNPs per locus (S).
Results of each test (D = Tajima’s D; H = Fay and Wu’s H and EW = Ewens–Watterson’s F) are presented as P-value.
The critical P-values calculated with the compound DHEW test. (Tajima‘s D, Fay and Wu‘s H and Ewens-Watterson's F tests).