| Literature DB >> 32467710 |
Evanthia Thanou1,2, Panagiotis Kornilios1,2,3, Petros Lymberakis4, Adam D Leaché1.
Abstract
Comparing mitochondrial and genomic phylogenies is an essential tool for investigating speciation processes, because each genome carries different inheritance properties and evolutionary characteristics. Furthermore, mitonuclear discordance may arise from ecological adaptation, historic isolation, population size changes, and sex-biased dispersal. Closely related taxa are expected to experience gene flow; however, this may not be true for insular populations or populations isolated in refugia. The four-lined snake Elaphe quatuorlineata has a fragmented distribution, separating populations of the Italian and Balkan Peninsulas, whereas several insular Aegean populations of significantly smaller body size (Cyclades island group and Skyros Island, Greece) are currently considered distinct subspecies. We constructed the species-tree phylogeny of this species utilizing genome-wide single nucleotide polymorphisms and a gene-tree based on complete cytochrome b sequences, aiming to detect convergence and discrepancies between biparentally and maternally inherited genomes. Population structuring, phylogenetic patterns and migration events among geographically defined lineages supported our hypothesis of isolation in multiple sub-refugia. Where biogeographical barriers did not restrict subsequent dispersal, extensive genetic exchange occurred between mainland Balkan populations. This process has led to the mitochondrial sweep of an ancestral mitolineage that survived only in peripheral (East Greece) and insular populations (North Cyclades and Skyros). The Central Cyclades represent an ancient lineage for both molecular markers that emerged almost 3.3 Mya. Considering their distinct morphology, insular E. quatuorlineata populations should be the future focus of an extensive sampling, especially since the mitonuclear discordance observed in this species could be related to ecological adaptations, such as the island-dwarfism phenomenon.Entities:
Keywords: Aegean Islands; Elaphe quatuorlineata species-tree; genome-wide SNPs; mitochondrial introgression; phylogeography; refugia within refugia
Year: 2019 PMID: 32467710 PMCID: PMC7245017 DOI: 10.1093/cz/zoz018
Source DB: PubMed Journal: Curr Zool ISSN: 1674-5507 Impact factor: 2.624
Figure 1.(A) Map of the Aegean region with sampling localities and codes of E. quatuorlineata specimens (see also Supplementary Table S1). Black points represent specimens for which both mtDNA sequences of the complete cytb gene and genome-wide SNPs were obtained, whereas for specimens with white points only cytb was available. (B) Map of the Cyclades island group with the respective sampling localities and codes, and the distribution of the currently described subspecies (E. quatuorlineata quatuorlineata, E. q. muenteri, E. q. parensis, and E. q. scyrensis). The approximate geographical position of the Pindos Mt. Range and the outline of the Cyclades Plateau (CP) during the late Pleistocene (modified by Kapsimalis et al. 2009) are shown.
Figure 2.ML phylogenetic tree based on the complete cytb dataset. Coding in terminal nodes refer to samples in Figure 1 and Supplementary Table S1. Numbers in branches give the respective nodal support estimated as standard bootstrap values/ultrafast bootstrap values/SH-like approximate likelihood ratio test/approximate Bayes posterior probabilities. The time of divergence (mean and 95% HPD intervals values in millions of years, Mya) are also given for nodes that were represented in the molecular-clock analysis (Supplementary Table S1, Figure S1). Inset: Typical morphotype corresponding to E. quatuorlineata quatuorlineata (Photo by P.K.).
Summary of the ddRAD data matrices as resulted from the iPyRAD pipeline
| iPyRAD parameters | Minimum % of individuals for a given locus (total number of individuals included during filtering) | 100 (28) | 90 (30) | 75 (30) | 50 (30) |
|---|---|---|---|---|---|
| Filtering Statistics | Number of prefiltered loci | 45,915 | 47,670 | 47,670 | 47,670 |
| Number of filtered loci | 2,321 | 7,543 | 11,931 | 17,640 | |
| Total number of base pairs | 93,032 | 302,558 | 478,587 | 707,588 | |
| Number of SNPs | 725 | 3,572 | 5,649 | 8,339 | |
| Number of uSNPs | 531 | 2,518 | 3,929 | 5,602 |
This includes information on the number of prefiltered and filtered loci, the total number of base pairs, and the number of SNPs and unlinked uSNPs, given for 4 different datasets. Each dataset was built with different percentages of missing data, expressed as % of individuals for a given locus: 100% (0% missing data, i.e., all loci present for all samples), 90%, 75%, and 50% (respectively 10%, 25%, and 50% missing data, i.e., all loci present for at least 90%, 75%, and 50% of the samples). The number of samples included during the filtering for each dataset is given in parentheses (E. quatuorlineata samples with or without E. sauromates).
The ddRAD datasets (size and number of included individuals shown in parentheses) used in the respective final analyses
| Analyses | Type of dataset | Size of dataset (individuals included) |
|---|---|---|
| DAPC | uSNPS | 531 (28) |
| STRUCTURE | uSNPS | 531 (28) |
| 646 (24) | ||
| ML after concatenation | total base pairs (bp) | 477,705 (21) |
| SVDquartets | uSNPS | 2,430 (21) |
| SNAPP | biallelic uSNPS | 2,612 (16) |
| TREEMIX | uSNPS | 2,628 (26) |
DAPC analysis was performed to investigate population structure and STRUCTURE analysis to further identify individuals with genetic admixture within E. quatuorlineata (including or excluding the samples from the Cyclades islands). Phylogenomic trees were constructed with the concatenated ML and SVDquartets methods after removing admixed individuals and with E. sauromates as the outgroup. Species-tree inferred with SNAPP used a reduced dataset of non-admixed E. quatuorlineata individuals and migration events were inferred with TREEMIX, after removing only 2 admixed E. quatuorlineata individuals. Further information on which individuals were removed or retained in every case is presented in the text.
Figure 3.Results of the DAPC analysis for the population clustering of E. quatuorlineata based on the uSNPs dataset. (A) Plot of the ln likelihood values estimated under the BIC to the number of clusters (up to K = 12) found. The plot shows an almost equal support for K = 4 and K = 5. (B) Map showing the approximate geographical distribution of 5 clusters and (C) scatterplot of the two first components inferred for K = 5. (D) and (E) Respective scatterplots inferred for K = 4. Insets in each scatterplot represent the number of principal component analysis (PCA) and discriminant analysis (DA) eigenvalues retained in each analysis.
Figure 4.(A) Hierarchical population clustering with STRUCTURE using 2 uSNPs datasets; including all E. quatuorlineata and excluding individuals from Cyclades islands (Naxos, Andros and Tinos). Q represents the membership probability of each individual to a given cluster. (B) ML phylogenomic tree based on the concatenated ddRAD loci dataset, including non-admixed E. quatuorlineata individuals (see text for details) and E. sauromates as outgroup. Numbers in branches give the respective nodal support estimated as standard bootstrap values/SH-like approximate likelihood ratio/bootstrap values estimated for the same nodes inferred by the SVDquartets analysis.
Figure 5.(A) Coalescent species-tree for E. quatuorlineata produced by SNAPP analysis on the uSNPs dataset, transformed into biallelic format. The MCC tree is over-imposed on the posterior distribution of the 95% HPD set of tree-topologies graph. Numbers in branches give the respective support estimated as posterior probability values for the MCC tree/bootstrap values estimated for the same nodes inferred by the SVDquartets analysis. (B) TREEMIX admixture graph with 4 migration events (i–iv). The strength (migration weight as the fraction of ancestry derived from the migration source) and directionality of the inferred introgression is indicated by the colored arrows. The plot of the ln likelihood values to the number of migration edges (m) up to m = 8 is also given.