| Literature DB >> 20181249 |
Hanna M Heidel-Fischer1, Heiko Vogel, David G Heckel, Christopher W Wheat.
Abstract
BACKGROUND: A molecular population genetics understanding is central to the study of ecological and evolutionary functional genomics. Population genetics identifies genetic variation and its distribution within and among populations, it reveals the demographic history of the populations studied, and can provide indirect insights into historical selection dynamics. Here we use this approach to examine the demographic and selective dynamics acting of a candidate gene involved in plant-insect interactions. Previous work documents the macroevolutionary and historical ecological importance of the nitrile-specifier protein (Nsp), which facilitated the host shift of Pieridae butterflies onto Brassicales host plants approximately 80 Myr ago.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20181249 PMCID: PMC2841170 DOI: 10.1186/1471-2148-10-60
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Structural overview of . Shaded rectangles and lines respectively represent gene exons and introns to scale. The signal peptide region is indicated by a blank box while the three domains are shaded to different degrees. Depicted are also the approximate annealing sites of the primer pairs used to amplify ~1 kb large segments of the gene and the corresponding names of the fragments. The two segments studied were Pra Nsp-D2 (in domain2) and Pra Nsp-D3 (in domain3). Dashes show approximate sites of amino acid substitutions located in the amplified coding regions as listed in Fig. 2. The approximate position of selected codons, both positive (black arrow heads with circles on their stems) and negative (open arrow heads), are indicated on exons, with codons listed in Table 4.
Alternative hypotheses for the microevolution of Pra Nsp.
| Hypothesis | Assumption | Expected pattern of variation |
|---|---|---|
| H0-demography | No adaptive role | Reflects demographic history |
| H1-local adaptation | Unique local host plant adaptation = directional selection within populations | Variation within populations < variation among populations |
| H2- diversifying/balancing selection | Generalist response to diverse host plant assemblages = diversifying/balancing selection | Variation within populations ≥ variation among populations |
| H3-directional selection | Purifying selection upon optimal genotype = directional selection across populations | Little variation within and among populations |
Summary statistics of Pieris rapae genes for individual and grouped populations.
| 20 | 0.0108 | 0.00902 | 19 | 0.02182 | 0.00769 | 0.347378 | 0.02013 | 0.01823 | 0.0405 | 0.03938 | |||||
| 20 | 0.0093 | 0.00795 | 17 | 0.02121 | 0.00593 | 0.275416 | 0.0164 | 0.01326 | 0.02925 | 0.02542 | |||||
| 20 | 0.01042 | 0.00723 | 15 | 0.02307 | 0.00684 | 0.292393 | 0.01689 | 0.01209 | 0.02858 | 0.02087 | |||||
| 20 | 0.01145 | 0.00854 | 18 | 0.02783 | 0.00683 | 0.240799 | 0.01783 | 0.01319 | 0.02951 | 0.02342 | |||||
| 80 | 594 | 0.01093 | 0.01054 | 31 | 0.02473 | 0.00703 | 0.27995 | 943 | 0.01722 | 0.01797 | 349 | 0.03159 | 0.03494 | ||
| 20 | 0.00739 | 0.00732 | 12 | 0.01476 | 0.00531 | 0.356045 | 0.01337 | 0.01335 | 0.02735 | 0.02759 | |||||
| 20 | 0.0062 | 0.00671 | 11 | 0.01262 | 0.00437 | 0.342969 | 0.01241 | 0.01335 | 0.02665 | 0.02759 | |||||
| 20 | 0.00562 | 0.00549 | 9 | 0.01159 | 0.00392 | 0.335038 | 0.01247 | 0.01092 | 0.02809 | 0.02519 | |||||
| 20 | 0.00538 | 0.00549 | 9 | 0.0153 | 0.00256 | 0.16538 | 0.01114 | 0.0105 | 0.02444 | 0.02269 | |||||
| 80 | 462 | 0.00642 | 0.00918 | 21 | 0.01419 | 0.00422 | 0.294649 | 706 | 0.01255 | 0.01537 | 244 | 0.02681 | 0.03279 | ||
| 20 | 0.00459 | 0.00613 | 9 | 0.01723 | 0.00084 | 0.048752176 | |||||||||
| 18 | 0.00422 | 0.00492 | 7 | 0.01062 | 0.00233 | 0.219397363 | |||||||||
| 18 | 0.00155 | 0.00281 | 4 | 0.00562 | 0.00035 | 0.06227758 | |||||||||
| 18 | 0.00394 | 0.00524 | 7 | 0.01144 | 0.00174 | 0.152097902 | |||||||||
| 74 | 291 | 0.00365 | 0.00755 | 15 | 0.01163 | 0.00131 | 0.112639725 | ||||||||
| 18 | 0.0042 | 0.00496 | 6 | 0.01564 | 0.00042 | 0.02685422 | |||||||||
| 20 | 0.00302 | 0.0016 | 2 | 0.01217 | 0 | 0 | |||||||||
| 18 | 0.00743 | 0.00743 | 9 | 0.01781 | 0.00042 | 0.023582257 | |||||||||
| 18 | 0.00418 | 0.00248 | 3 | 0.01684 | 0 | 0 | |||||||||
| 74 | 352 | 0.00444 | 0.00583 | 10 | 0.0173 | 0.00021 | 0.012138728 | ||||||||
| 8 | 0.0079 | 0.00715 | 14 | 0.03026 | 0.00075 | 0.024785195 | |||||||||
| 8 | 0.01003 | 0.01022 | 20 | 0.03865 | 0.00088 | 0.022768435 | |||||||||
| 8 | 0.00629 | 0.00715 | 14 | 0.02595 | 0 | 0 | |||||||||
| 7 | 0.00164 | 0.00162 | 3 | 0.03026 | 0.00075 | 0.024785195 | |||||||||
| 31 | 755 | 0.00691 | 0.00963 | 28 | 0.02611 | 0.00076 | 0.029107622 | ||||||||
| 12 | 0.00321 | 0.00637 | 5 | 0.01246 | 0.00081 | 0.064669 | |||||||||
| 20 | 0.00077 | 0.00217 | 2 | 0.00187 | 0.00081 | 0.259259 | |||||||||
| 20 | 0.00219 | 0.00434 | 4 | 0.00876 | 0.00049 | 0.054933 | |||||||||
| 14 | 0.00156 | 0.00242 | 2 | 0.0076 | 0 | 0 | |||||||||
| 66 | 260 | 0.00182 | 0.00566 | 7 | 0.00719 | 0.00043 | 0.058823 | ||||||||
| 20 | 0.01778 | 0.01603 | 12 | 0.06822 | 0.00127 | 0.017609 | |||||||||
| 18 | 0.01208 | 0.01378 | 10 | 0.04898 | 0 | 0 | |||||||||
| 18 | 0.01344 | 0.01102 | 8 | 0.05456 | 0 | 0 | |||||||||
| 18 | 0.01159 | 0.01102 | 8 | 0.04487 | 0.0007 | 0.015164 | |||||||||
| 74 | 211 | 0.01395 | 0.01556 | 16 | 0.05503 | 0.00051 | 0.008987 | ||||||||
Shown are the number of sequences (n), the number of base pairs (bp), the average pairwise differences (π), the pairwise differences for synonymous and nonsynonymous sites (πss and πns), θw, the number of segregating sites (S) and the rate of nonsynonymous to synonymous substitutions for the coding part of the genes. If introns are included in the sequence, the number of base pairs, the average pairwise differences and θw is given separately for the whole gene and the non coding part.
Figure 2Overview of the amino acid haplotypes of . Amino acid variation with reference to first sequence is depicted for each unique allele, with shared alleles across populations highlighted with same color. Bars across the top of the sequences indicates the exon location (Fig. 1).
AMOVA results for an estimate of genetic structure.
| gene | between continents | among populations | within populations |
|---|---|---|---|
| 1.95 | 5.56 | 92.49** | |
| 7.93 | 1.43 | 90.65** | |
| -3.95 | 10.98** | 92.97** | |
| -10.77 | 18.66** | 92.11** | |
| 10.84 | -0.54 | 89.7 | |
| 1.14 | 0.64 | 98.22 | |
| -1.61 | 1.85 | 99.76 | |
| ** P value < 0.01 | |||
Shown is the percentage of variation for each gene at each level of analysis and whether these levels showed significant levels of genetic variation. Comparisons are made within each population and within and between two groups, Europe and the USA.
Estimates of population differentiation.
| Analysis | pop | IDH | Ga3pdh | COI | ArgKinase | Wingless | ||
|---|---|---|---|---|---|---|---|---|
| DE-FR | 0.03 | -0,02* | -0.01 | 0.14* | -0.05 | -0.01 | 0.00 | |
| DE-IT | 0.00 | 0.02 | 0.03 | 0.05 | -0.03 | 0.01 | 0.01 | |
| DE-US | 0.01 | 0,09* | -0.02 | 0.04 | 0.20* | -0.04 | 0.00 | |
| FR-IT | 0.03 | 0.04 | 0.10 | 0.29** | 0.00 | 0.01 | 0.03 | |
| FR-US | 0.06 | 0.13 | -0.04 | 0.17* | 0.14 | 0.04 | 0.00 | |
| IT-US | 0.03 | 0.07 | 0.07 | 0.02 | 0.11 | 0.05 | 0.02 | |
| DE-FR | 0.0009* | 0.0082* | 0.4361 | 0.0279 | 0.4868 | 0.0867 | 0.7204 | |
| DE-IT | 0.0321 | 0.1859 | 0.3974 | 0.1286 | 0.5966 | 0.4368 | 0.3702 | |
| DE-US | 0.0373 | 0.0001* | 0.9271 | 0.0177 | 0.0507 | 0.6965 | 0.1252 | |
| FR-IT | 0.1009 | 0.0911 | 0.0060 | 0.0022* | 0.1747 | 0.6628 | 0.4554 | |
| FR-US | 0.0007* | 0.1322 | 0.7671 | 0.0525 | 0.3480 | 0.1460 | 0.2003 | |
| IT-US | 0.0039 | 0.1107 | 0.0842 | 0.5326 | 0.2481 | 0.2489 | 0.0310 | |
| DE-FR | 15.48 | 3.01 | ||||||
| DE-IT | 14.32 | 10.31 | ||||||
| DE-US | 4.92 | 12.93 | 1.93 | |||||
| FR-IT | 16.88 | 11.76 | 4.65 | 1.22 | 16.44 | |||
| FR-US | 8.20 | 3.33 | 2.40 | 3.04 | 10.70 | |||
| IT-US | 15.79 | 6.49 | 6.50 | 4.23 | 10.06 | 20.90 | ||
Fst values, p-values for the exact test and the estimated absolute number of migrants between two populations M as determined in the Arlequin program are given for every population comparison for every sequenced gene. For easier comprehension high migration values (above 30) are in bold. Analysis always includes the whole sequenced fragments, thus both exons and intron in Pra Nsp-D2 and -D3. For both the Fst and Exact Tests, P-values significant after Bonferroni corrections (< 0.05) within gene are marked with an asterisk. If the p-value is < 0.001 (after Bonferroni) two asterisks are used.
Selected sites in P. rapae NSP domains 2 and 3 identified by at least two of the methods
| Analysis Method | ||||||
|---|---|---|---|---|---|---|
| Codon | Domain | SLAC | FEL | IFEL | REL | |
| 276 | 2, exon 4 | -37.08 (0.246) | -24.60 (0.236) | -24.60 (0.328) | ||
| 277 | 2, exon 4 | |||||
| 284 | 2, exon 5 | -30.24 (0.302) | -22.12 (0.195) | -22.12 (0.280) | ||
| 287 | 2, exon 5 | -27.37 (0.333) | ||||
| 299 | 2, exon 5 | 12.69 (0.745) | 13.36 (0.358) | 23.00 (0.257) | ||
| 303 | 2, exon 5 | -37.08 (0.246) | -26.42 (0.224) | -26.41 (0.313) | ||
| 316 | 2, exon 5 | |||||
| 329 | 2, exon 5 | -46.91 (0.242) | -81.89 (0.136) | |||
| 334 | 2, exon 5 | |||||
| 338 | 2, exon 5 | -27.37 (0.333) | -20.84 (0.181) | -20.83 (0.263) | ||
| 355 | 2, exon 5 | 22.27 (0.672) | 29.94 (0.259) | 50.70 (0.163) | ||
| 379 | 2, exon 5 | 28.03 (0.488) | ||||
| 389 | 2, exon 5 | -29.36 (0.333) | -33.44 (0.161) | -33.44 (0.300) | ||
| 390 | 2, exon 5 | -29.36 (0.333) | -92.95 (0.186) | |||
| 393 | 2, exon 5 | -16.89 (0.579) | -35.67 (0.201) | -35.68 (0.355) | ||
| 395 | 2, exon 5 | -29.36 (0.333) | -32.64 (0.181) | -32.64 (0.329) | ||
| 405 | 2, exon 5 | -29.36 (0.333) | -32.54 (0.213) | -32.54 (0.369) | ||
| 430 | 3, exon 5 | -39.78 (0.246) | -40.08 (0.244) | |||
| 434 | 3, exon 5 | -32.44 (0.302) | -35.73 (0.192) | -35.73 (0.342) | ||
| 437 | 3, exon 5 | |||||
| 441 | 3, exon 5 | |||||
| 467 | 3, exon 5 | 22.31 (0.508) | 30.17 (0.245) | 36.57 (0.244) | ||
| 480 | 3, exon 5 | |||||
| 483 | 3, exon 6 | -19.22 (0.333) | -22.54 (0.2689) | |||
| 493 | 3, exon 6 | |||||
| 505 | 3, exon 7 | 19.00 (0.455) | 19.49 (0.204) | 24.46 (0.203) | ||
| 508 | 3, exon 7 | |||||
| 523 | 3, exon 7 | 28.57 (0.306) | n/a | |||
| 532 | 3, exon 7 | -19.22 (0.333) | -26.85 (0.2402) | |||
| 548 | 3, exon 7 | 16.83 (0.635) | 24.43 (0.372) | 63.24 (0.181) | ||
| 573 | 3, exon 7 | 16.84 (0.636) | 24.32 (0.388) | 30.54 (0.363) | ||
| 580 | 3, exon 7 | |||||
| 594 | 3, exon 7 | -19.22 (0.333) | -27.67 (0.2337) | |||
Note. - For every method, first value is scaled dN - dS and number in parentheses shows P value, except for the REL method, where first value is Bayes factor value followed by P value based on the Bayes factor posterior probability). Significant values (P < 0.15) shown in bold and when 2 of the four methods are significant, the Selection column indicates "N" for negative or "P" for positive selection. Codon numbering is relative to start codon of the signal peptide.
Figure 3Frequency distribution of dN/dS ratios (x-axis) from interspecific comparisons among 70 random genes between . The number of genes in each bin of a given ratio range are shown on the y-axis. The range of dN/dS across all pairwise comparisons of Pra Nsp domain 2 & 3 combined with Pbr Nsp domain 2 & 3 sequence is shown right of the histogram as a grey box. Analyses of domains 2 & 3 separately showed little difference from the combined analysis, and thus the latter was presented for greater figure clarity.