| Literature DB >> 30804976 |
Erich P Hofmann1,2, Kirsten E Nicholson3, Ileana R Luque-Montes1, Gunther Köhler4, César A Cerrato-Mendoza5, Melissa Medina-Flores5, Larry David Wilson6, Josiah H Townsend1.
Abstract
Single-locus molecular barcoding is a useful method for identifying overlooked and undescribed biodiversity, providing the groundwork for further systematic study and taxonomic investigation. A variety of methods for delimiting species from barcoding libraries have been developed and applied, allowing for rapid estimates of species diversity in a broad range of taxa. However, tree-based and distance-based analyses can infer different group assignments, potentially over- or underestimating the number of putative species groups. Here, we explore diversity of mainland species of anole lizards from the Chortís Block biogeographical province of northern Central America using a DNA barcoding approach, generating and analyzing cytochrome oxidase subunit I (COI) sequences for over 400 samples assignable to 33 of 38 (86.8%) native and one introduced mainland species. We subsequently tested the effects different models of nucleotide substitution, different species-delimitation algorithms, and reducing our dataset had on species delimitation estimates. We performed of two distance-based (ABGD, RESL) and three tree-based (bPTP, mPTP, GMYC) analyses on both the full dataset and a dataset consisting only of unique halotypes. From 34 nominal taxa, analyses of the full dataset recovered between 34 and 64 operational taxonomic units (OTUs), while analyses of the reduced dataset inferred between 36 and 59. Reassigning individuals to either mPTP-inferred or ABGD clustered (7.2% threshold) groups improved the detection of a barcoding gap across three different models of nucleotide substitution, removing overlap between intra- and interspecific distances. Our results highlight the underestimated diversity of mainland Chortís Block anoles, but the lack of congruence between analyses demonstrates the importance of considering multiple analytical methods when dealing with single-locus datasets. We recommend future studies consider the effects of different models of nucleotide substitution on proposed barcoding gaps, as well as the effect reducing a dataset to unique haplotypes may have on proposed diversity estimates.Entities:
Keywords: ABGD; Anolis; DNA barcoding; GMYC; Norops; PTP; RESL; cytochrome c oxidase subunit I (COI)
Year: 2019 PMID: 30804976 PMCID: PMC6378269 DOI: 10.3389/fgene.2019.00011
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Map of the Chortís Block Biogeographical Province; boundaries are denoted by a dashed line.
Summary of the mainland anoles of the Chortís Block: Species, number of samples (N) and localities used in this paper, and conservation scores.
| Species | Sampling localities (C | EVS Category ( | ||||
|---|---|---|---|---|---|---|
| 2 | H | – | LC | LC | M | |
| 6 | H | E | E | CE | H | |
| - | – | – | – | LC | M | |
| 4 | H | – | LC | LC | L | |
| 21 | H | – | LC1 | LC1 | L1 | |
| 8 | H | – | LC | LC | M | |
| - | – | – | – | LC | H | |
| 1 | E | – | LC | LC | L | |
| 16 | H | – | LC | LC | M | |
| 5 | H | E | E | E | H | |
| 36 | E | – | E | LC | H | |
| 7 | H | – | E | NT | H | |
| 3 | H | – | CE | E | H | |
| 19 | H | – | LC | LC | L | |
| 22 | H | – | LC | LC | L | |
| 11 | H | – | LC | LC | H | |
| 7 | H | E | E | NT | H | |
| - | – | – | LC2 | – | H | |
| 40 | H | – | LC | LC | M | |
| 54 | H | – | CE | V | H | |
| - | – | V | CE | V | H | |
| 1 | H | – | E | NT | H | |
| 3 | N | – | LC | LC | M4 | |
| 1 | H | – | V | LC | L | |
| 14 | H | – | E | V | H | |
| 1 | H | – | E | V | H | |
| 3 | N | – | LC | LC | H | |
| 7 | H | – | LC | LC | M | |
| 7 | H | – | E | V | H | |
| 1 | E | – | LC2 | – | M | |
| 14 | H | DD | E | LC | H | |
| 5 | M | – | LC | LC | M | |
| 3 | H | – | – | LC | L | |
| - | – | – | E | CE | H | |
| 10 | H | – | – | LC | H | |
| 2 | N | – | V3 | – | H | |
| 4 | H | – | – | – | – | |
| 59 | H | – | E | NT | H | |
| 15 | H | – | E | LC | H | |
Summaries of uncorrected pairwise distances, K2P-corrected distances, and TrN+G-corrected distances for the dataset, including averages based on a priori identification (morphospecies), ABGD clustering (at 7.2% threshold), and mPTP delimitations.
| Whole dataset | Averages within groups (Intraspecific) | Averages between groups (Interspecific) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Model used | Range | Mean | StDev | Range | Mean | StDev | Range | Mean | StDev |
| 0.0–0.247 | 0.171 | 0.047 | |||||||
| 0.0–0.062 | 0.014 | 0.016 | 0.049–0.242 | 0.181 | 0.024 | ||||
| ABGD | 0.0–0.037 | 0.011 | 0.010 | 0.066–0.242 | 0.181 | 0.025 | |||
| mPTP | 0.0–0.030 | 0.006 | 0.006 | 0.031–0.242 | 0.180 | 0.028 | |||
| 0.0–0.313 | 0.200 | 0.058 | |||||||
| 0.0–0.068 | 0.015 | 0.017 | 0.051–0.300 | 0.212 | 0.032 | ||||
| ABGD | 0.0–0.039 | 0.011 | 0.011 | 0.070–0.301 | 0.212 | 0.033 | |||
| mPTP | 0.0–0.032 | 0.006 | 0.006 | 0.032–0.305 | 0.211 | 0.037 | |||
| 0.0–0.336 | 0.211 | 0.062 | |||||||
| 0.0–0.070 | 0.015 | 0.018 | 0.052–0.323 | 0.224 | 0.035 | ||||
| ABGD | 0.0–0.039 | 0.011 | 0.011 | 0.071–0.323 | 0.223 | 0.036 | |||
| mPTP | 0.0–0.033 | 0.006 | 0.007 | 0.032–0.326 | 0.222 | 0.040 | |||
FIGURE 2Histograms of pairwise distances under different models of nucleotide substitution: uncorrected (top), K2P (middle), and TrN+G (bottom).
Number of putative species (OTUs) inferred by each delimitation method from input datasets (“Full”: 412 sequences; “Reduced”: 290 unique sequences) of 34 nominal taxa, with exact matches (N) and match ratios.
| ABGD 10.0% | ABGD 7.2% | ABGD 5.2% | BOLD RESL | mPTP | bPTP | GMYC | |
|---|---|---|---|---|---|---|---|
| 28 | 26 | 25 | 23 | 24 | 21 | 23 | |
| 0.824 | 0.743 | 0.658 | 0.541 | 0.600 | 0.429 | 0.548 | |
| – | 25 | 24 | 23 | 25 | 21 | 24 | |
| – | 0.714 | 0.667 | 0.541 | 0.625 | 0.452 | 0.565 | |
FIGURE 3Maximum likelihood phylogeny of the full COI sequence dataset, with lineage assignments from the three tree-based (mPTP, bPTP, GMYC) and two distance-based (RESL, ABGD at three thresholds) methods. ML bootstrap support and Bayesian posterior probabilities shown when ≥ 50 and 0.50, respectively. Gray bars span all samples assigned a priori to named taxa (morphospecies). Note (∗): JMS71 was assigned to its own lineage by GMYC, apart from all other samples assigned to N. cupreus (i.e., there were two inferred lineages within N. cupreus, not three).
FIGURE 4Maximum likelihood phylogeny of the full COI sequence dataset, with lineage assignments from the three tree-based (mPTP, bPTP, GMYC) and two distance-based (RESL, ABGD at three thresholds) methods (continued from ). ML bootstrap support and Bayesian posterior probabilities shown when ≥ 50 and 0.50, respectively. Gray bars span all samples assigned a priori to named taxa (morphospecies).