| Literature DB >> 22726891 |
Marius Roesti1, Walter Salzburger, Daniel Berner.
Abstract
BACKGROUND: With the establishment of high-throughput sequencing technologies and new methods for rapid and extensive single nucleotide (SNP) discovery, marker-based genome scans in search of signatures of divergent selection between populations occupying ecologically distinct environments are becoming increasingly popular. METHODS ANDEntities:
Mesh:
Substances:
Year: 2012 PMID: 22726891 PMCID: PMC3426483 DOI: 10.1186/1471-2148-12-94
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Differentiation between two populations, as quantified by Weir and Cockerham’s FST estimator theta [[19]]
| Informative polymorphism | TT | TC | CC | TT | TC | CC | |
| | 5 | 10 | 5 | 5 | 10 | 5 | -0.026 |
| Uninformative polymorphism | TT | TC | CC | TT | TC | CC | |
| 20 | 0 | 0 | 19 | 1 | 0 | 0.000 | |
Other FST estimators produce qualitatively similar results), given informative and uninformative single nucleotide polymorphism at a marker locus (two alleles are present, T and C).
Figure 1Informative and uninformative markers in genome scans. Two populations derived from the same ancestral population occupy ecologically distinct environments (white and gray boxes) at t0,. Circles represent an ecologically important QTL with two alleles under divergent selection; white and gray alleles are favored in the white and gray environment. Squares represent a neutral marker with two alleles (yellow and blue). The marker is tightly physically linked to the QTL. In A), both initial (t0) populations display a very low frequency for the blue marker allele. A novel adaptive QTL allele arising in the gray habitat will therefore likely be associated with the frequent yellow marker allele. When sampling the populations at t1, after a period of selection that has increased the frequency of the gray QTL allele in the gray environment, no signature of selection is visible at the marker locus because hitchhiking along with the favored QTL allele has not materially changed the allele frequency distribution at the marker (FST[22] approximates zero at both t0 and t1). In B), the initial conditions (t0) are as in A), except that the novel adaptive QTL allele happens to be linked to the rare blue marker allele. At t1, selection at the QTL will be visible at the marker (FST = 0.22) because the blue allele has hitchhiked to high frequency. In C), the initial (t0) allele frequency distribution at the marker is relatively even in both populations (FST = 0). At t1, the marker exhibits a clear signature of selection (FST = 0.13) because the yellow allele has increased in frequency by hitchhiking. In both B) and C) but not in A), we would consider the marker locus informative at t1 based on its minor allele frequency across both samples, and consider the marker for a genome scan for the signature of selection (see text).
Figure 2The number of polymorphic loci (x 10) (A), and mean F across all loci (B), for different minor allele count thresholds () in the Constance (black) and Boot (gray) lake-stream stickleback system. This threshold specifies the minimum number of times the minor SNP allele at a locus had to occur in the pooled lake and stream sample for a polymorphic locus to remain in the data set.
Figure 3Differentiation along a segment of chromosome seven between the lake and stream stickleback population from the Constance (black) and Boot (gray) system. Sliding window analysis was performed by local polynomial fitting of FST values for data sets with the allele frequency threshold n set to one (all SNPs in the data sets considered; dotted lines), and n set to four (at least four copies of the minor allele required across the pooled lake and stream sample in each system; solid lines). Note the relatively flat differentiation profiles with n = 1.