| Literature DB >> 36104671 |
Victor Julio Piñeros1, Carmen Del R Pedraza-Marrón2, Isaí Betancourt-Resendes3, Nancy Calderón-Cortés4, Ricardo Betancur-R2, Omar Domínguez-Domínguez5,6.
Abstract
BACKGROUND: Delimiting species across a speciation continuum is a complex task, as the process of species origin is not generally instantaneous. The use of genome-wide data provides unprecedented resolution to address convoluted species delimitation cases, often unraveling cryptic diversity. However, because genome-wide approaches based on the multispecies coalescent model are known to confound population structure with species boundaries, often resulting in taxonomic over-splitting, it has become increasingly evident that species delimitation research must consider multiple lines of evidence. In this study, we used phylogenomic, population genomic, and coalescent-based species delimitation approaches, and examined those in light of morphological and ecological information, to investigate species numbers and boundaries comprising the Chirostoma "humboltianum group" (family Atherinidae). The humboltianum group is a taxonomically controversial species complex where previous morphological and mitochondrial studies produced conflicting species delimitation outcomes. We generated ddRADseq data for 77 individuals representing the nine nominal species in the group, spanning their distribution range in the central Mexican plateau.Entities:
Keywords: Chirostoma humboldtianum; Freshwater fishes; Genome-wide data; Genomic structure; SNP loci; Species delimitation
Mesh:
Year: 2022 PMID: 36104671 PMCID: PMC9472351 DOI: 10.1186/s12862-022-02063-0
Source DB: PubMed Journal: BMC Ecol Evol ISSN: 2730-7182
Fig. 1a Sampling localities of the humboldtianum group across the central Mexico plateau: Lake Chapala, Lake Zacapu, Lake Zirahuén, Lake Pátzcuaro, Tepuxtepec Dam, Trinidad Fabela Dam Basins CHA, Chapala; ZIR, Zirahuén; PAT, Pátzcuaro; ZAC, Zacapu; ALE, Alto Lerma; BLE, Bajo Lerma; MLE, Medio Lerma [126]. b Location of central Mexico plateau in Mexico. Artistic fish illustration credit: Sergio Godínez-Marrón
Fig. 2Multivariate and admixture results based on all, neutral, and outlier SNP loci from the matrix D (3564-snps loci), which explain the highest percentage of explained variation in the analyses. a Principal component analyses (PCAs), and (b) Discriminant analyses of principal components (DAPCs) consistently recovered four genomic groups (k = 4) with all and neutral loci that are in agreement with geographic patterns but not with the previously recognized morphospecies: humboldtianum sensu stricto group (blue), from Lake Zacapu; estor group (green) from Lakes Patzcuaro and Zirahuén; chapalae group (red), from Lake Chapala; and C. sphyraena group (yellow), also from Lake Chapala. Outlier loci resolved three groups (k = 3) where chapalae and sphyraena groups clustered together. Morphospecies are color-coded according to the genomic groups observed. In PCAs scatterplots the point clustering groups are delimited by convex hulls. In DAPC scatterplots, the point clustering groups are inside their 95% inertia ellipses, and the lines connect points to the mean value for each group. The eigenvalue bar plots are showing in the upper right of each figure. c Admixture assignment analyses estimated using all, neutral, and outlier SNPs consistently identified three well-differentiated clusters (k = 3). Each bar represents the probability of assignment to each cluster. Genomic clusters are color-coded as blue, humboldtianum sensu stricto, green, estor group; orange, chapalae-sphyraena group. CHA, Lake Chapala; TEP, Tepuxtepec Dam; TRI, Trinidad Fabela Dam; PAT, Lake Pátzcuaro; ZIR, Lake Zirahuén
Fig. 3The ML phylogenetic tree based on 3564 SNP loci (a) recovered three well-differentiated clades: clade I formed by species in Lake Chapala, clade II represented by species from Lake Zacapu and dams, and clade III composed by species in Lakes Pátzcuaro and Zirahuén. We observed C. sphyraena as a monophyletic group within clade I, and the subspecies C. e. copandaro from Lake Zirahuén as a differentiated cluster within clade III. Although the multi-coalescent species tree considering circa 3400 neutral SNPs (b) recovered the main genomic clusters as the rest of the genetic structure analyses and the ML phylogenetic inferences, it did not present any intra-lake divergences. c The phylogenetic inference based on mtDNA (Cytb and D-loop) failed to delineate reciprocal monophyletic groups as the ones estimated using the ddRADseq data (3564 SNP loci). The analyses of the genome-wide datasets clearly recovered well-differentiated clusters that are in agreement with the geography of the central Mexico plateau. However, none of our phylogenetic inferences showed concordance with the morphospecies nor ecotypes recognized within the humboltianum group. Numbers on branches of the main clades indicate bootstrap values. Artistic fish illustration credit: Sergio Godínez-Marrón
Results of bayes factor delimitation (BFD*) analyses for the humboldtianum group using three SNPs subsets (ranging from 39 to 59 individuals, and 411–1102 SNP loci)
| Model | Species number | Subset 1: 39ind_411 SNP loci | Subset 2: 39ind_1102 SNP loci | Subset 3: 59ind_548 SNP loci | |||
|---|---|---|---|---|---|---|---|
| MLE | BF | MLE | BF | MLE | BF | ||
| Current morpho-species | 9 | − 12,058.03 | 295.56 | − 30,993.31 | 850.28 | − 21,279.57 | 507.42 |
| Mitonuclear groups | 5 | NA | − 30,600.16 | 63.98 | NA | ||
| DAPC groups | 4 | − 11,927.68 | 34.85 | NA | − 21,161.43 | 271.14 | |
| Admixture groups | 3 | − 11,989.52 | 158.53 | − 30,898.37 | 660.41 | − 21,278.3 | 504.88 |
Bayes factor (BF) calculations are estimated against the model with the best marginal-likelihood estimate (Model 1). Positive BF values indicate support for model 1 (the best model). Overall, these analyses support a species delimitation scenario between four and five species, rejecting the nine morphospecies scheme
MLE: marginal likelihood estimates. Bold values correspond to the best model selected by the analysis. NA correspond to not applied, and its associated marginal likelihood is in bold. 0 < BF < 2, non-significant; 2 < BF < 6, positive evidence; 6 < BF < 10, strong support; BF > 10, decisive support
Genomic diversity of 3482 neutral SNPs loci estimates for the humboldtianum group, under each species delimitation hypothesis examined in this study (i–iv)
| N | |||||||
|---|---|---|---|---|---|---|---|
| 9 morphospecies (i; Barbour, 1973) | |||||||
| | 1 | – | – | – | – | – | – |
| | 4 | 0.138 (0.004) | 0.110 (0.003) | − 0.225 (0.008) | Infinite (infinite–infinite) | 8.9 (6.4–14.9) | 10.9 (8.1) |
| | 7 | 0.134 (0.004) | 0.115 (0.003) | − 0.144 (0.007) | Infinite (infinite–infinite) | 12.6 (8.4–25.6) | 6 (4.9) |
| | 5 | 0.123 (0.004) | 0.099 (0.003) | − 0.222 (0.009) | Infinite (infinite–infinite) | Infinite (20.1–infinite) | 31 (19.3) |
| | 24 | 0.136 (0.004) | 0.121 (0.003) | − 0.083 (0.005) | Infinite (infinite-infinite) | 8.9 (7.2–11.6) | 2.2 (1.9) |
| | 7 | 0.126 (0.004) | 0.117 (0.003) | − 0.062 (0.007) | Infinite (infinite–infinite) | Infinite (108.1–infinite) | 3 (2.7) |
| | 15 | 0.103 (0.004) | 0.096 (0.003) | − 0.041 (0.007) | Infinite (infinite–infinite) | 244 (19.9–infinite) | 3.6 (3.2) |
| | 6 | 0.100 (0.004) | 0.084 (0.003) | − 0.170 (0.008) | Infinite (infinite–infinite) | 10.6 (6.9–24.6) | 7.2 (5.4) |
| | 3 | 0.112 (0.004) | 0.085 (0.003) | − 0.301 (0.009) | Infinite (infinite–infinite) | 7.2 (5.3–11.8) | Infinite (infinite) |
| 5 mitonuclear groups (ii, Betancourt-Resendes et al | |||||||
| | 17 | 0.130 (0.003) | 0.139 (0.003) | 0.079 (0.007) | Infinite (infinite–infinite) | 24.7 (13.5–179.5) | 4.5 (3.9) |
| | 24 | 0.136 (0.004) | 0.121 (0.003) | − 0.083 (0.005) | Infinite (infinite–infinite) | 8.9 (7.2–11.6) | 2.2 (1.9) |
| | 7 | 0.126 (0.004) | 0.117 (0.003) | − 0.062 (0.007) | Infinite (infinite–infinite) | Infinite (108.1–infinite) | 3 (2.7) |
| | 16 | 0.103 (0.004) | 0.096 (0.003) | − 0.037 (0.007) | Infinite (infinite–infinite) | 10.8 (7.6–18.9) | 4.4 (3.6) |
| | 8 | 0.097 (0.004) | 0.105 (0.003) | 0.082 (0.010) | Infinite (infinite–infinite) | 8.9 (6.2–16.4) | 5.4 (4.2) |
| 4 multivariate DAPC clusters (iii, | |||||||
| | 17 | 0.130 (0.003) | 0.139 (0.002) | 0.079 (0.007) | Infinite (infinite–infinite) | 24.7 (13.5–179.6) | 4.5 (3.9) |
| | 24 | 0.136 (0.004) | 0.121 (0.003) | − 0.083 (0.005) | Infinite (infinite–infinite) | 8.9 (7.2–11.6) | 2.2 (1.9) |
| | 7 | 0.126 (0.003) | 0.117 (0.003) | − 0.062 (0.007) | Infinite (infinite–infinite) | Infinite (108.1–infinite) | 3 (2.7) |
| | 24 | 0.101 (0.003) | 0.104 (0.003) | 0.103 (0.008) | Infinite (infinite–infinite) | 106.4 (20.2–infinite) | 4.4 (3.8) |
| Admixture clusters K = 3 (iv, | |||||||
| C. | 41 | 0.134 (0.003) | 0.138 (0.003) | 0.085 (0.006) | 86.9 (84.8–89.1) | 317 (40–infinite) | 4.2 (3.8) |
| | 7 | 0.126 (0.003) | 0.117 (0.003) | − 0.062 (0.007) | Infinite (infinite–infinite) | Infinite (108.1–infinite) | 3 (2.7) |
| C. | 24 | 0.101 (0.003) | 0.104 (0.003) | 0.103 (0.008) | Infinite (infinite–infinite) | 106.4 (20.2–infinite) | 4.4 (3.8) |
N number of individuals analyzed per group, observed (H) and expected (H) heterozygosity, fixation index coefficient (F), linkage disequilibrium effective (LDNe), heterozygote-excess (HetExcessNe), and molecular coancestry (CoancestryNe) population size estimators. SD = standard deviation; CI = confidence intervals. The grey shading in the cells indicates the highest values of genetic diversity
aC. chapalae group conformed by C. chapalae, C. consocium, C. lucius, and C. promelas
bC. estor group represented by C. e. estor, C. e. copandaro, C. grandocule, and C. patzcuaro