| Literature DB >> 25269380 |
C Schlötterer1, R Kofler1, E Versace2, R Tobler3, S U Franssen1.
Abstract
Evolve and resequence (E&R) is a new approach to investigate the genomic responses to selection during experimental evolution. By using whole genome sequencing of pools of individuals (Pool-Seq), this method can identify selected variants in controlled and replicable experimental settings. Reviewing the current state of the field, we show that E&R can be powerful enough to identify causative genes and possibly even single-nucleotide polymorphisms. We also discuss how the experimental design and the complexity of the trait could result in a large number of false positive candidates. We suggest experimental and analytical strategies to maximize the power of E&R to uncover the genotype-phenotype link and serve as an important research tool for a broad range of evolutionary questions.Entities:
Mesh:
Year: 2014 PMID: 25269380 PMCID: PMC4815507 DOI: 10.1038/hdy.2014.86
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Figure 1Overview of E&R studies. (a) A population of flies is exposed for 60 generations to ultraviolet (UV) radiation (purple arrows). We assume here, for the sake of illustration, that darker pigmentation is beneficial in high UV environments, whereby darker flies will increase in frequency. (b) At the genotypic level, the allele frequency of the causative allele (dark brown) will increase, more so than hitchhiking variants (dark gray background) that will be recombined onto other backgrounds (breaks between dark and light gray background). (c) The allele frequencies of the starting population and the selected population are measured with Pool-Seq. (d) Causative variants can be identified by contrasting the allele frequencies between base and selected population and visualized with Manhattan plots. A full color version of this figure is available at the Heredity journal online.
Features of different approaches aiming to link genotype and phenotype
| Analysis of heterozygous individuals | + | + | − | + | + |
| Repeated phenotyping | Every generation | − | + | − | − |
| Sensitivity to environmental noise | Low because of repeated phenotyping in every generation and replication | High | Low because of repeated phenotyping of identical genotypes | Moderate because of replication | High |
| Well-established analysis strategies | − | + | + | − | + |
| Mapping resolution | High | High | High | High | Moderate (cost effective), high (expensive) |
| Genetic diversity analyzed | High | High | Moderate–high | High | Limited to parental genotypes |
| Inference of effect size | Selection, coefficient | + | + | − | + |
| Randomized genetic background | +, in starting population and repeated mixing by sexual reproduction during the experiment | +, but sensitive to population structure that can be accounted for in analysis | +, but sensitive to population structure that can be accounted for in analysis given a sufficient sample size | +, but sensitive to population structure that can be accounted for in analysis | + |
| Genotyping/sequencing costs | Low because of Pool-Seq | High for establishment, no costs for follow-up experiments | High for establishment, no costs for follow-up experiments | Low because of Pool-Seq | Low because of the use of genetic markers (Rad-Tag sequencing) |
| Sampling effort | High because of maintenance of experimental populations | Depends on species | High for establishment, low later on | Moderate | Moderate |
| Analysis of multiple traits from the same genotypes | − | + | + | − | + |
| Replication | Yes, is common practice | Only across different populations | Requires an independent reference panel | Yes, it is common practice. Easy to expand to multiple populations | Requires independent mapping families |
| Influence of allele frequency (conditional on presence in the sample) | Low power for high-frequency alleles, low-frequency alleles are often lost | Yes | Yes | Yes | No |
| Trajectories of selected variants | + | − | − | − | − |
| Identification of adaptive variants in a defined environment | + | − | − | − | − |
Abbreviations: E&R, Evolve and resequence; GWAS, genome-wide association study; Pool-Seq, Pool-sequencing.
Figure 2Performance of different test statistics used in E&R studies. Receiver operator characteristic (ROC) curves that contrast the true positive rate with the false positive rate. We extended the results of Kofler and Schlötterer (2014) by including the pooled Hs/D test, the pooled Hs/Hc test and the pooled FST test (Remolina ). Briefly, Kofler and Schlötterer (2014) simulated E&R with a base population that captures the pattern of polymorphism in a natural D. melanogaster population. They simulated 60 generations of selection with a population size of 1000 and 3 replicates. Results are shown for SNP-based analysis (top graphs) and for a window-based (bottom graphs) analyses using either 150 strongly (left graphs) or 150 weakly (right graphs) selected loci. The behavior of the Cochran–Mantel–Haenszel (CMH) and pooled FST tests are very similar, resulting in largely overlapping curves. We note that this comparison is mainly for illustrative purpose and it may be that different evolutionary scenarios change the behavior of the test statistics. A full color version of this figure is available at the Heredity journal online.