| Literature DB >> 28791133 |
Jane Melville1, Margaret L Haines1, Katja Boysen1, Luke Hodkinson1, Andrzej Kilian2, Katie L Smith Date1, Dominique A Potvin1, Kirsten M Parris3.
Abstract
Next-generation sequencing (NGS) approaches are increasingly being used to generate multi-locus data for phylogeographic and evolutionary genetics research. We detail the applicability of a restriction enzyme-mediated genome complexity reduction approach with subsequent NGS (DArTseq) in vertebrate study systems at different evolutionary and geographical scales. We present two case studies using SNP data from the DArTseq molecular marker platform. First, we used DArTseq in a large phylogeographic study of the agamid lizard Ctenophorus caudicinctus, including 91 individuals and spanning the geographical range of this species across arid Australia. A low-density DArTseq assay resulted in 28 960 SNPs, with low density referring to a comparably reduced set of identified and sequenced markers as a cost-effective approach. Second, we applied this approach to an evolutionary genetics study of a classic frog hybrid zone (Litoria ewingii-Litoria paraewingi) across 93 individuals, which resulted in 48 117 and 67 060 SNPs for a low- and high-density assay, respectively. We provide a docker-based workflow to facilitate data preparation and analysis, then analyse SNP data using multiple methods including Bayesian model-based clustering and conditional likelihood approaches. Based on comparison of results from the DArTseq platform and traditional molecular approaches, we conclude that DArTseq can be used successfully in vertebrates and will be of particular interest to researchers working at the interface between population genetics and phylogenetics, exploring species boundaries, gene exchange and hybridization.Entities:
Keywords: DArTseq; SNPs; genomics; hybridization; phylogeography; population genetics
Year: 2017 PMID: 28791133 PMCID: PMC5541528 DOI: 10.1098/rsos.161061
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.Previously published maximum-likelihood phylogenetic tree for C. ornatus and the six subspecies of C. caudicinctus based on approximately 1400 bp mtDNA [16]. ML bootstraps greater than 70% (above) and Bayesian posterior probabilities greater than 90% (below) are provided on branches. Colours designate clades, which are mapped.
Figure 2.Map of the previously studied hybrid zone between the frogs L. ewingii and L. paraewingi in Victoria, southeastern Australia. Bold black line indicates the ‘Glenburn transect’, including the locations of 11 sites sampled 2007–2013 [17]. Shaded areas indicate forested regions versus cleared land (unshaded).
Figure 3.Population genetic analyses of 1485 SNPs for C. ornatus and the six subspecies of C. caudicinctus: (a) a fastStructure plot (with the simple prior) at K = 7; and (b) a distance-based (PCoA) plot of genetic structure, based on genetic distance between samples. Each of the C. caudicinctus subspecies and C. ornatus are coloured to match those in figure 1. Individuals identified as admixed in the fastSTRUCTURE analysis are indicated by a number 1–11 in both plots. Percentage of genetic distance explained by each of the PCoA axes (PC) are provided in parentheses.
Figure 4.Conditional maximum-likelihood phylogenetic tree for C. ornatus and the six subspecies of C. caudicinctus based on 1485 SNPs. ML bootstraps are provided about the branches. Colours designate clades identified in the fastStructure analysis (figure 3). Admixed individuals are coloured blue and numbered to correspond to those identified in figure 1.
Figure 5.Population genetic analyses of 5278 SNPs resulting from a low-density array for L. ewingii and L. paraewingi: (a) a fastStructure plot (with the simple prior) of samples including L. verreauxii at K = 3; and (b) a fastStructure plot (with the simple prior) of samples excluding L. verreauxii at K = 2.
Figure 6.Multi-variate plots of genetic structure L. ewingii and L. paraewingi: (a) PCoA analysis based on genetic distance between samples of 5278 SNPs, resulting from a low-density array; and (b) a plot of a previously published multiple correspondence analysis (MCA) based on microsatellite data [17]. Dashed lines indicate 90% confidence ellipses for each parental species. Purple triangles indicate hybrid individuals. Percentage of genetic distance explained by each of the PCoA axes (PC) are provided in parentheses.