| Literature DB >> 20480016 |
Ryan C Garrick1, Adalgisa Caccone, Paul Sunnucks.
Abstract
Understanding the nature, timing and geographic context of historical events and population processes that shaped the spatial distribution of genetic diversity is critical for addressing questions relating to speciation, selection, and applied conservation management. Cladistic analysis of gene trees has been central to phylogeography, but when coupled with approaches that make use of different components of the information carried by DNA sequences and their frequencies, the strength and resolution of these inferences can be improved. However, assessing concordance of inferences drawn using different analytical methods or genetic datasets, and integrating their outcomes, can be challenging. Here we overview the strengths and limitations of different types of genetic data, analysis methods, and approaches to historical inference. We then turn our attention to the potentially synergistic interactions among widely-used and emerging phylogeographic analyses, and discuss some of the ways that spatial and temporal concordance among inferences can be assessed. We close this review with a brief summary and outlook on future research directions.Entities:
Keywords: cladistic analysis; landscape history; molecular markers; population structure; statistical phylogeography; temporal contrasts
Mesh:
Year: 2010 PMID: 20480016 PMCID: PMC2871112 DOI: 10.3390/ijms11041190
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1.Three hierarchical levels of genetic information that can be obtained from diploid, co-dominant nuclear loci. Taken together, they cover a broad temporal spectrum, and the use of complementary analyses that focus on different ‘time slices’ of population history potentially allow these components to be separated.
Characteristics of two major classes of phylogeographic analysis. Although ‘exploratory’ and ‘model-driven’ analyses are not mutually exclusive, the dichotomy can serve as a conceptual framework.
| Low | High | |
| Low | High | |
| No | Yes | |
| Broad | Narrow | |
| Yes | Limited | |
| Limited | Yes | |
| Limited | Yes | |
| Qualitative | Quantitative | |
NCPA is considered ‘exploratory’ here and is unique in its ability to separate multiple temporally overlying events and processes. Conversely, several model-driven methods explicitly consider temporally sequential events or processes (e.g., IM, simulations within population trees; Sections 4.3 and 4.4).
Figure 2.Example of a three-population migration matrix, showing parameters estimated by MIGRATE [36] when the full island model is implemented. There are three population mutation rate parameters (Θ = 4Nμ for diploid autosomal genes; one for each extant population) and six migration parameters (M = immigration rate divided by μ). Each population pair has two migration parameters to accommodate asymmetrical gene flow.
Figure 3.The six-parameter isolation-with-migration model implemented in IM and IMa [38,129]. There are three population mutation rate parameters (Θ), two migration parameters (M), and the time since population divergence (Tdiv). In IM, an additional parameter—s, the proportion of the ancestral population that founds a descendant population—can be included to allow for population size changes (not shown).
Figure 4.Hypothetical population tree containing a gene tree (dashed lines) that has been simulated via neutral coalescence. Even this relatively simple four-population model with zero post-divergence gene flow has many parameters that must be specified during construction of the population tree. Here, these include N-values of four extant and three ancestral populations, two successive splitting times (Tdiv), and tree topology. The population tree branch lengths are measured in organismal generations (scale not shown).
Complementarity of phylogeographic analyses. Assumptions enforced by some coalescent methods can be validated or tested using other methods. This table is intended only as an example of some of the analytical methods that can be used to complement one another. Additional analytical resources are overviewed by Excoffier and Heckel [55], and Kuhner [56]. All of the software listed in this table is freely-available, and the associated references and websites are given in Supplementary Material (Table S1).
| Natural clusters exist | Genotypic clustering | STRUCTURE, GENELAND | Spatially cohesive clades | GARLI, MR BAYES, BEAST | |
| Random mating | Hardy-Weinberg and Linkage Equilibrium | ARLEQUIN, GENEPOP, FSTAT | |||
| No geographic structure | Spatial autocorrelation | GENALEX | NCPA | TCS and GEODIS | |
| No family structure | Relatedness analysis | KINGROUP | |||
| Constant size | Recent growth or decline | BOTTLENECK, MSVAR | DNASP | ||
| Θ estimation | IM, MIGRATE | Θ estimation | IM, MIGRATE, FLUCTUATE | ||
| No migration | Isolation-with-migration analysis | IM | Isolation-with-migration analysis | IM | |
| Sister relationship | Distance-based clustering | PHYLIP | Species tree estimation | AUGIST, BEST, STEM | |
| Old divergences | Microsatellite dating: (δµ)2 or | Relaxed-clock molecular dating | BEAST, R8S | ||
Analysis requires geo-referenced genetic data;
Converting Θ to N requires an estimate of per-locus mutation rate, and organismal generation time;
Requires multiple unlinked, recombination-free, selectively neutral DNA sequence loci;
Refer to Goldstein et al. [191] and Zhivotovsky [192], respectively.