Literature DB >> 29134018

A gene-tree test of the traditional taxonomy of American deer: the importance of voucher specimens, geographic data, and dense sampling.

Eliécer E Gutiérrez1,2,3,4, Kristofer M Helgen5, Molly M McDonough3,4, Franziska Bauer6, Melissa T R Hawkins3,4, Luis A Escobedo-Morales7, Bruce D Patterson8, Jesús E Maldonado4,9.   

Abstract

The taxonomy of American deer has been established almost entirely on the basis of morphological data and without the use of explicit phylogenetic methods; hence, phylogenetic analyses including data for all of the currently recognized species, even if based on a single gene, might improve current understanding of their taxonomy. We tested the monophyly of the morphology-defined genera and species of New World deer (Odocoileini) with phylogenetic analyses of mitochondrial DNA sequences. This is the first such test conducted using extensive geographic and taxonomic sampling. Our results do not support the monophyly of Mazama, Odocoileus, Pudu, M. americana, M. nemorivaga, Od. hemionus, and Od. virginianus. Mazama contains species that belong to other genera. We found a novel sister-taxon relationship between "Mazama" pandora and a clade formed by Od. hemionus columbianus and Od. h. sitkensis, and transfer pandora to Odocoileus. The clade formed by Od. h. columbianus and Od. h. sitkensis may represent a valid species, whereas the remaining subspecies of Od. hemionus appear closer to Od. virginianus. Pudu (Pudu) puda was not found sister to Pudu (Pudella) mephistophiles. If confirmed, this result will prompt the recognition of the monotypic Pudella as a distinct genus. We provide evidence for the existence of an undescribed species now confused with Mazama americana, and identify other instances of cryptic, taxonomically unrecognized species-level diversity among populations here regarded as Mazama temama, "Mazama" nemorivaga, and Hippocamelus antisensis. Noteworthy records that substantially extend the known distributions of M. temama and "M." gouazoubira are provided, and we unveil a surprising ambiguity regarding the distribution of "M." nemorivaga, as it is described in the literature. The study of deer of the tribe Odocoileini has been hampered by the paucity of information regarding voucher specimens and the provenance of sequences deposited in GenBank. We pinpoint priorities for future systematic research on the tribe Odocoileini.

Entities:  

Keywords:  Americas; CYTB; Cervidae; Deer; Hippocamelus; Mazama; Neotropics; Odocoileus; Pudu; Taxonomy; mDNA; phylogenetics

Year:  2017        PMID: 29134018      PMCID: PMC5673856          DOI: 10.3897/zookeys.697.15124

Source DB:  PubMed          Journal:  Zookeys        ISSN: 1313-2970            Impact factor:   1.546


Introduction

The tribe (: ) represents a monophyletic group encompassing all modern deer native to the New World (Americas) with the exception of the Holarctic taxa (), (), and () (Price et al. 2005, Gilbert et al. 2006, Hughes et al. 2006, Agnarsson and May-Collado 2008, Decker et al. 2009, Hassanin et al. 2012, Heckeberg et al. 2016)—see Heckeberg et al. (2016) for current suprageneric taxonomy. Living deer have been traditionally classified in six genera (, , , , , and ) and 16 species (Merino and Rossi 2010, Mattioli 2011; see also Gutiérrez et al. 2015), but alternative taxonomic propositions have suggested that the alpha-level diversity of the tribe might be higher (Molina and Molinari 1999, Molinari 2007, Groves and Grubb 2011). Some authors have also included as a member of (e.g., Groves and Grubb 2011). The native distribution of ranges from northern North America (Alaska, Canada) to southern South America (Patagonia), including some islands of the Caribbean Sea and the Atlantic and Pacific oceans. Collectively, members of the tribe occupy a wide variety of habitats, including desert scrub, savannas, swamps, lowland rain forests, humid-montane forests, páramo, and alpine tundra at elevations from sea level to about 4800 meters (Allen 1915, Hershkovitz 1982, Baker 1984, Méndez 1984, Brokx 1984, Medellín et al. 1998, González et al. 2002, Cronin et al. 2006, Meier and Merino 2007, Molinari 2007, Rumiz et al. 2007, Latch et al. 2009, Miranda et al. 2009, Piovezan et al. 2010, Groves and Grubb 2011, Mendes-Oliveira et al. , Barrio 2013, Gutiérrez et al. 2015). By virtue of this wide ecogeographic range, is of great biogeographic interest. Despite being heavily hunted animals in the Western Hemisphere and also of great public health interest (Bennett and Robinson 2000, Hurtado-Gonzales and Bodmer 2004, Angers et al. 2006, Campbell and VerCauteren 2011, Martinsen et al. 2016, Uehlinger et al. 2016), relatively little progress has been achieved in recent decades with regard to the systematics of deer. To date, only the genera (Allen 1915) and (Hershkovitz 1982) have been subjects of specimen-based revisionary taxonomic work, but these studies did not employ explicit phylogenetic methods. In general, the scientific community has largely followed the taxonomic arrangements recognized by 20th century authorities, predominantly E. R. Hall for North America (Hall 1981) and A. Cabrera for South America (Cabrera 1961). The uncritical acceptance of these taxonomic arrangements for decades is indefensible because the criteria, data, and methods used to construct them are largely unknown, unclear, or even incorrect (see example pointed out by Molinari [2007, p. 31]). Several recent taxonomic studies have demonstrated that the traditional taxonomy of deer needs to be revisited. For instance, morphometric analyses and differences in the frequency of qualitative skeletal traits in of northern South America and North America led Molina and Molinari (1999) to propose that populations from North and South America are not conspecific. These authors also demonstrated a remarkable degree of morphological variability among Venezuelan populations of , whose taxonomy remains disputed (Moscarella et al. 2003, 2007, Molinari 2007). Another example comes from phylogenetic analyses of molecular data demonstrating that the genus , as traditionally understood (Allen 1915, Cabrera 1961), is polyphyletic (Gilbert et al. 2006, Duarte et al. 2008, Hassanin et al. 2012, Escobedo-Morales et al. 2016, Heckeberg et al. 2016). Unfortunately, phylogenetic studies of published to date have been based on limited taxonomic and/or geographic sampling—i.e., lacking taxa or using exemplars for widely distributed and highly variable taxa (e.g., species of ). Nevertheless, these and other taxonomic studies, some based on karyology (e.g., Jorge and Benirschke 1977, Duarte and Jorge 2003, Cursino et al. 2014), have documented the need to revise the systematics of deer. Biologically meaningful species-level taxonomies are essential for study design in evolutionary biology, and inadequate species-level classifications, such as uncritically lumping or splitting taxa in absence of appropriate evidence, can detrimentally impact species conservation (George and Mayden 2005, Gutiérrez and Helgen 2013, Heller et al. 2013, Kaiser et al. 2013, Zachos et al. 2013, Voigt et al. 2015, Gippoliti et al. 2017). Accordingly, our long-term goal is to improve all aspects related to the systematics of odocoileines. A first step is to test whether phylogenetic analyses of mtDNA sequence data support the monophyly of recognized genera and species. These analyses have the potential to identify or indicate (1) distant phylogenetic relationships and deep divergences in species or populations currently lumped into a single genus or species, respectively; and (2) close phylogenetic relationships and PageBreakshallow divergences in species or populations currently split into different genera or species, respectively. Discovering any of these conditions can help target taxa requiring closer attention by taxonomists. Such a test also affords the first assessment of phylogenetic relationships among odocoileines that is simultaneously based on data for all traditionally recognized species, relatively dense geographic sampling within their ranges, and informed by our morphological examination of relevant voucher material in most cases. Nevertheless, because phylogenetic relationships can only be convincingly inferred based on sequence data from multiple, independently inherited loci—e.g., mtDNA, nuclear introns and exons located on different chromosomes—we understand the need to avoid overinterpretations of the gene tree that resulted from our analyses. As interpreted here, our results represent a set of explicit hypotheses that will serve to guide further research.

Methods

Sources of material, and taxonomic and geographic sampling

Our analyses were based on 192 sequences of the mitochondrial cytochrome-b (CYTB) gene. We drew on this marker for three reasons. First, CYTB sequences can be obtained relatively easily from degraded DNA that is extracted from museum specimens, which is important for our study since no freshly-preserved samples were available for several targeted species or populations. Second, previous studies have shown that analyses of CYTB sequence data can substantially clarify the taxonomic status of mammals whose taxonomy had been predominantly studied based only on morphological and/or karyological data (Duarte et al. 2008, Helgen et al. 2009, Gutiérrez et al. 2010, 2014, 2015, Voss et al. 2013). This coding gene evolves relatively rapidly yet is stable enough to offer insights at suprageneric levels (Agnarsson and May-Collado 2008, Ge et al. 2014), and many studies employing CYTB alongside unlinked nuclear sequences have found compatible patterns of variation among them, indicating that CYTB can be useful as a first-order estimator of phylogenetic history (Velazco and Patterson 2013, Voss et al. 2014, Upham and Patterson 2015). Third, a large number of CYTB sequences are available from GenBank and include most of our focal species. We obtained 171 sequences from GenBank and generated the remaining 21 sequences. All but two (KY928656, KY928667) of the latter sequences were obtained from degraded DNA extracted from museum specimens, from residual soft tissue attached to skeletons, or from maxilloturbinate bones (Wisely et al. 2004) (Table 1). Use of museum specimens allowed us to obtain sequence data for (1) species for which molecular data were previously lacking (i.e., and ; but see Heckeberg et al. 2016), and (2) populations from regions never included in any phylogeographic or phylogenetic study—e.g., from southern Central America and the Andes of Ecuador and Peru for . A study just published by Heckeberg et al. (2016) included CYTB data obtained from European museum specimens for PageBreakand that we could not access during the development of the present study. We independently generated and analyzed data for these species. We analyzed our sequences employing a more comprehensive geographic sampling for most taxa; hence, we take the opportunity to compare results from both studies and discuss the effect of geographic sampling on the resolution of the gene-trees and its impact on associated taxonomic interpretations. We deposited all sequences that we generated in GenBank, along with the museum catalogue numbers of their respective voucher specimens, tissue numbers, or both (Table 1). The geographic provenance and the names of the institutions that house voucher specimens are provided in the supplementary file 1 (see also Figures 1a, 1b, 1c for abbreviated provenance locality information and GenBank accession numbers of all analyzed sequences).
Table 1.

Sequenced specimens. GB: GenBank accession number. Catalogue#: museum catalogue number. Provenance: geographic origin (name of country, larger administrative entity, and a numeric identifier that corresponds to detailed locality information presented in the Gazetteer; supplementary file 1). DNA: number assigned to DNA extracted. Year: year in which the specimen was collected. M: Sequencing method (I: Illumina; S: Sanger; see Methods).

SpeciesGBCatalogue#ProvenanceDNAYearM
B. dichotomus KY928652 FMNH 52329Brazil: São Paulo (3)EEG 3431941I
M. americana KY928653 AMNH 67109Peru: Cajamarca (10)EEG 4371924I
M. americana KY928654 USNM 443588Venezuela: Yaracuy (21)EEG 6361967I
M. chunyi KY928655 FMNH 79912Peru: Puno: Sandia (12)EEG 297 [MTRH 293]1951S
M. gouazoubira KY928656 KU 155307Guyana: Potaro-Siparuni (8)EEG 5681997I
M. nemorivaga KY928657 AMNH 96171Brazil: Para (2)EEG 4701931I
M. nemorivaga KY928658 USNM 374916Venezuela: Bolívar (20)EEG 6281966I
Od. pandora KY928659 KU 93857Mexico: Campeche (13)EEG 5701963I
M. rufina KY928660 1FMNH 70563 2Colombia: Cundinamarca (5)EEG 3261952I
M. temama KY928661 KU 82215Guatemala: Petén (7)EEG 5721960I
Od. virginianus KY928662 AMNH 62872Ecuador: Los Ríos (6)EEG 3741922S
Od. virginianus KY928663 AMNH 29453Nicaragua: Jinotega (16)EEG 3981909S
Od. hemious KY928664 USNM 99455USA: Arizona (18)EEG 6721900I
Od. hemious KY928665 USNM 249424USA: Alaska (17)EEG 6661930I
Od. virginianus KY928666 USNM 99351Mexico: Chihuahua (14)EEG 0391899I
Od. virginianus 3 KY928667 USA: Washington DC (19)WTD00282010S
Od. virginianus KY928668 FMNH 78421Peru: Puno (11)EEG 2271950I
Od. virginianus KY928669 KU 149129Honduras: Cortes (9)EEG 5591955I
Od. virginianus KY928670 KU 93852Mexico: Yucatán (15)EEG 5621963S
Oz. bezoarticus KY928671 FMNH 28297Brazil: Mato Grosso (1)EEG 3541927I
P. mephistophiles KY928672 AMNH 181505Colombia: Cauca (4)EEG 3621958S

1 A previous study (Gutiérrez et al. 2015) generated a CYTB sequence (GenBank accession number is KR107038) for this specimen employing Sanger sequencing procedures.

2 The museum abbreviation for this specimen has been mistakenly reported as “USNM” (see Supporting information in Gutiérrez et al. 2015). 3 Hybrid, cross between and (see Discussion).

Figure 1a.

Phylogenetic tree of cytochrome-b sequences of . This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal.

Figure 1b.

Phylogenetic tree of cytochrome-b sequences of (continuation). This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal.

Figure 1c.

Phylogenetic tree of cytochrome-b sequences of (continuation). This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal.

Sequenced specimens. GB: GenBank accession number. Catalogue#: museum catalogue number. Provenance: geographic origin (name of country, larger administrative entity, and a numeric identifier that corresponds to detailed locality information presented in the Gazetteer; supplementary file 1). DNA: number assigned to DNA extracted. Year: year in which the specimen was collected. M: Sequencing method (I: Illumina; S: Sanger; see Methods). 1 A previous study (Gutiérrez et al. 2015) generated a CYTB sequence (GenBank accession number is KR107038) for this specimen employing Sanger sequencing procedures. 2 The museum abbreviation for this specimen has been mistakenly reported as “USNM” (see Supporting information in Gutiérrez et al. 2015). 3 Hybrid, cross between and (see Discussion). Phylogenetic tree of cytochrome-b sequences of . This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal. Phylogenetic tree of cytochrome-b sequences of (continuation). This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal. Phylogenetic tree of cytochrome-b sequences of (continuation). This is a strict consensus topology resulting from the Bayesian inference analysis. Nodal support is indicated at each node, except where the relationship received negligible support. Posterior probabilities (from the Bayesian inference analysis) and bootstrap values (from the maximum-likelihood analysis) are indicated before and after the slashes (“/”) at branches of interest (i.e., nodal support for fairly shallow relationships within intraspecific haplogroups are omitted). The scale represents substitutions per site. For each terminal, country of origin and next-largest administrative unit (state, department, province, etc.) are provided (when reported by the team that generated them; see detailed voucher and locality information in supplementary file 1 for sequences that we generated). GenBank accession numbers are indicated for each terminal.

Laboratory methods

We employed both Sanger (following Gutiérrez et al. 2015) and massively parallel (following Hawkins et al. 2016) sequencing technologies to generate part of the analyzed sequences. In order to minimize the risk of contamination with exogenous DNA, all pre-amplification procedures—i.e., DNA extractions, and either settings of conventional PCR reactions or library preparations—based on material obtained from museum specimens were conducted in an isolated facility dedicated exclusively to work with degraded DNA (i.e., where no PCR products have ever been present). We conducted phenol/chloroform DNA extractions following Wisely et al. (2004). Samples were concentrated with Amicon (Millipore, Darmstadt, Germany) filters via centrifugation and stored in siliconized tubes with an additional 20–50 μl of 1 X TE plus 0.5% Tween 20 (Sigma) and stored at -20°C. The DNA of the single freshly preserved tissue sample was extracted in a standard DNA extraction laboratory with a DNeasy Blood and Tissue kit (Qiagen, Valencia, CA, USA) following the manufacturer’s instructions. We employed various combinations of primers to amplify and to sequence short CYTB fragments (supplementary file 2). These reactions were conducted in a six-stage touchdown protocol using a thermal cycler (MJ Research). After an incubation at 95°C for 10 min, the first stage consisted of 2 cycles of the following steps: denaturing at 95°C for 15 seconds, annealing at 60°C for 30 seconds, and extension at 72°C for 1 min. The subsequent stages were identical to the first stage except for lowered annealing temperatures, which were 58°C, 56°C, 54°C, and 52°C for the second, third, fourth, and fifth stages, respectively. The sixth (final) stage consisted of 40 cycles with an annealing temperature of 50°C. All PCR reactions were set in 25 μl volumes containing 0.5 U AmpliTaq Polymerase (Applied Biosystems, Foster City, CA), 1X PCR AmpliTaq Buffer, 0.2 μM each dNTP, 0.4 μM of forward and 0.4 μM of reverse primers, 1.5 μM MgCl2, 10X BSA (New England Biolabs, Ipswich, MA), and 50–250 ng of genomic DNA template. Successful amplifications were purified using ExoSAP (USB Corporation, Cleveland, OH) incubated at 37°C for 15 min followed by 80°C for 15 min. Both strands of each PCR product were cycle sequenced by subjecting them to a second amplification using a total of 10 μL sequencing reaction mixture, including 50–200 ng of PCR product, 10 pM of corresponding forward or reverse primer, 5X Big Dye Buffer (Applied Biosystems), 1/8 reaction of Big Dye version 3 (Applied Biosystems). The following conditions were used for the Dye Terminator Cycle Sequencing: 25 cycles consisting of denaturing at 96°C for 10 s, annealing at 50°C for 10 s and extension at 60°C for 4 min. The final products were cleaned using Sephadex filtration and then both the 3’ and 5’ strands were sequenced on a 50 cm array using the ABI PRISM 3130 Genetic Analyzer (Applied Biosystems). To compile and edit the sequences that were generated via Sanger sequencing, we employed Geneious v.7.1.5. (Biomatters; http://www.geneious.com/). Some of the analyzed CYTB sequences were trimmed from 31 mitochondrial genomes (mitogenomes), 16 obtained from GenBank (generated by Hassanin et al. 2012) and 15 generated by us (following Hawkins et al. 2016). To generate these miPageBreaktogenomes, we prepared samples for Illumina sequencing using commercially available library preparation kits (Kapa Biosystems Illumina Library Preparation Kit #KK8232, Wilmington, MA, USA). Single indexed TruSeq-style adapters were used (Faircloth and Glenn 2012) employing 50 μl of DNA extract. Minor modifications to the manufacturer’s protocol (see Supplementary Materials Hawkins et al. [2016]) were made, including additional PCR cycles on degraded samples (18 cycles for degraded DNA from museum samples, and 10 for the freshly-preserved DNA sample). The success of library preparation was determined by visualization on an agarose gel. Then, the samples were purified with MagNA magnetic beads (Rohland and Reich 2012) in place of AMPure XP beads to bind DNA and remove the primer and adapter dimer. A ratio of 2.4:1 of MagNA beads to DNA was added to remove adapter dimer. DNA concentration was determined using the Nanodrop v.2.0. We multiplexed samples in order to decrease the costs associated with library enrichments. Individual samples were multiplexed in equimolar ratios for enrichment based on Nanodrop values in conjunction with the appearance and size distribution from the agarose gel. Each multiplexed pool contained 4–10 uniquely indexed samples for a total concentration of 500 ng concentrated to 3.4 μl volume. The pools also included non-cervid samples from other projects (see Hawkins et al. 2016). We enriched each pool of samples using a probe set that was diluted 1:5, giving each multiplexed pool approximately 100 ng of probes per enrichment. The probes employed corresponded to the same array described by Hawkins et al. (2016). Each pool of libraries was incubated with the RNA probes and buffers as described in the MYcroarray protocol for 48 hours at 65°C. Following incubation, DNA was separated from the probes via magnetic beads and purified with QiaQuick PCR Purification Kits (Qiagen) following MYcroarray’s enrichment protocol (version 1.3.8). Detailed protocols for MYbaits kits have been published online (http://ultraconserved.org/#protocols; http://www.mycroarray.com/pdf/MYbaits-manual.pdf). Post-enrichment pools were amplified for 25 cycles to produce a high enough concentration for gel extraction. QiaQuick Gel Extraction Kits (Qiagen) were used to size select the enriched pools for ~200–500 bp fragments and to remove residual adapter and primer dimer. Quantitative PCR was performed on enriched pools using an Illumina Library Quantification Kit (Kapa Biosystems) with two replicates of 1:1000, 1:2000, and 1:4000 dilutions for each pool. Pools were combined in equimolar ratios based on the number of samples in each pool. The samples were sequenced with paired-end chemistry and with a read length of 143 bp on a single lane of an Illumina HiSeq2500 at the Semel Institute UCLA Neurosciences Genomics Core; reads were demultiplexed at the core facility. To assemble the mitogenomes, we first merged the forward and reverse paired reads with the program PEAR v0.9.4. (Zhang et al. 2014). Using the default settings of PEAR, we merged forward and reverse reads when they had a 10 bp or greater overlap. All sequences were screened for the presence of adapter sequences, which were removed with cutadapt v.1.4.2 (Martin 2011). We then employed PRINSEQ-lite v.0.20.4 (Schmieder and Edwards 2011) for quality filtering, trimming reads with average quality scores below 20 and exact PCR replicates (more than three identical copPageBreakies). The filtered reads were then mapped to a reference sequence of the most closely related species using bwa v.7.10 (Li and Durbin 2009). The ‘bwa aln’ and ‘samse’ as well as the ‘bwa mem’ algorithms were tested on the degraded samples, with ‘bwa aln’ conducted as specified in Kircher et al. (2012). The reads corresponding to the freshly preserved tissue sample were mapped using the ‘bwa mem’ algorithm.

Sequence alignment, matrix properties, and selection of partition scheme and models of nucleotide substitution

We aligned sequences using default options of MAFFT v.7.017 (Katoh and Standley 2013) as implemented in Geneious v.7.1.5. Multiple substitutions in a DNA site (i.e., saturation) compromise historical information from it; therefore, we evaluated whether our CYTB matrix suffered from this undesirable condition. Thus, we employed the software DAMBE version 5.3 (Xia 2013) to generate saturation plots based on the GTR-corrected genetic distances. Subsequently, we used the Bayesian Information Criterion (BIC) as implemented in PartitionFinder ver. 1.0.1. (Lanfear et al. 2012) to determine the most suitable partition scheme and best-fit models of nucleotide substitution. This analysis considered models of nucleotide substitution applicable in MrBayes and evaluated five partition schemes.

Phylogenetic analyses

We conducted phylogenetic analyses using maximum likelihood (ML) and Bayesian inference (BI) as optimality criteria. For all analyses, we employed one sequence of each of the closest related taxa to the —, , and (Gilbert et al. 2006, Hassanin et al. 2012)—as outgroup taxa. However, we included (tribe ) as part of the ingroup to test whether it was recovered sister to the clade formed by undisputed (as found in more limited previous studies). Because has also been treated as a member of by some authors (Groves and Grubb 2011; but see Heckeberg et al. 2016), we take the opportunity to perform the same set of analyses that we are conducting for also for . For inferring the best topology in the ML analysis, we conducted 50 independent searches in the Genetic Algorithm for Rapid Likelihood Inference (GARLI 2.0) (Zwickl 2006) applying the best-fitting model (see Results) and the default settings. The Bayesian analysis was conducted in MrBayes v. 3. 2 (Ronquist et al. 2012). The search started with a random tree, and the Markov chains were run for 100,000,000 generations; trees were sampled every 1,000 generations. Default values were kept for the ‘‘relburnin’’ and ‘‘burninfrac’’ options in MrBayes (i.e., we used the commands relburnin = yes; burninfrac = 0.25); therefore, the first 25,000,000 generations (25,000 trees) were discarded as burn-in, and posterior probability estimates of all model parameters were based on the remaining (75,000) trees. Convergence PageBreakand stationarity were assessed in the Bayesian analyses by plotting likelihood values in Tracer 1.5 (Drummond and Rambaut 2007). To assess nodal support, we used nonparametric bootstrapping (Felsenstein 1985) for the ML analysis, and posterior probabilities for the BI analysis (Ronquist et al. 2012). The ML bootstrap analysis was performed in GARLI 2.0 using 100 pseudoreplicated data matrices, with 10 searches performed on each. Bayesian posterior probabilities were calculated simultaneously with the search for the best Bayesian topology, conducted as described earlier. Throughout the text, we refer to different degrees of nodal support for the ML bootstrap analysis using the following categories: strong support, for bootstrap values ≥ 75%; moderate support, for bootstrap values > 50% and < 75%; negligible support for values ≤ 50%. For the BI analysis, we refer to degrees of nodal support with two categories, significant or strong in cases in which a node’s posterior probability was ≥ 0.95, and insignificant or negligible posterior probability values < 0.95. We assessed the strength of phylogenetic evidence for species boundaries in the CYTB tree employing various statistics calculated via the Species Delimitation plugin (Masters et al. 2011) of Geneious v.7.1.5. This plugin allows users to assign terminals of a phylogenetic tree to putative species, which we did using traditional taxonomy of (see Introduction). Based on these designations and the recovered tree, Geneious’ Species Delimitation plugin calculates various statistics relating to the phylogenetic exclusivity of each putative species, the probabilities of such exclusivity having arisen by chance in a random coalescent process, and the degree to which the species can be diagnosed (Masters et al. 2011). The calculated metrics are abbreviated and defined as follows (from Masters et al. 2011): Intra, the average pairwise tree distance among members of the focal haplogroup; Inter, the average pairwise tree distance between the focal haplogroup and the members of the closest haplogroup; Intra/Inter, the ratio of Intra to Inter; P ID (strict), the mean (95% confidence interval) probability of correctly identifying an unknown member of the focal haplogroup using the criterion that it must fall within, but not sister to, the species clade in a tree; P ID (liberal), the mean (95% confidence interval) probability of correctly identifying an unknown member of the putative species using the criterion that it falls within, or sister to, the species clade in a tree; Av (MRCA-tips), the mean distance between the most recent common ancestor of a haplogroup and its members. We computed these statistics twice, once based on the ML tree and another based on the BI tree. A high degree of sequence divergence is neither necessary nor sufficient for species recognition (Ferguson 2002, Dávalos and Russell 2014); however, as pointed out by Gutiérrez et al. (2010), values of sequence divergence do provide a heuristically useful basis for comparing genetic variation within and among lineages and can help identify taxa in need of closer taxonomic attention. Therefore, we report average uncorrected (p) distance and average Kimura 2-parameter-corrected (K2P) distance within and among haplogroups. Whether justified or not, the latter metric has become widely used in mammals, and therefore we report it to facilitate comparisons with values reported for other groups and by other researchers. Genetic distances were calculated using MEGA version 5.2.1 (Tamura et al. 2011).

Results

Alignment properties, partition schemes, and models of nucleotide substitution

The saturation plot demonstrated that the sequence data used in this study do not suffer from saturation; the number of transversions is substantially lower than the number of transitions, even at the highest values of genetic distances (supplementary file 3). The alignment contained 11% missing data. The most suitable partitioning scheme was that in which the three codon positions were analyzed together (i.e., without using subsets). The best-fit model of nucleotide substitution was the generalized time-reversible model with gamma-distributed rate heterogeneity and a proportion of invariant sites (GTR + Г + I).

Monophyly of traditionally recognized genera

The topologies of the two phylogenetic analyses were similar; we show only the tree resulting from the Bayesian inference analysis (BI) (Figures 1a, 1b, 1c), with nodal support for both the BI and maximum-likelihood (ML) analyses. We comment on the few instances in which results from the two analyses differ. In both analyses, the genera , , , and were recovered as monophyletic with strong support, whereas the genera , , and were not (Figures 1a, 1b, 1c; see also column “Focal haplogroup support” on Tables 2 and 3). was recovered as polyphyletic, with (type species of the genus ), , , , , and showing a closer relationship to than to the other three species of , namely , , and , which were recovered elsewhere in the tree. These latter three species showed closer relationships to the genera , , , and than to . With regard to the genus , it was recovered as paraphyletic with respect to (Figures 1a, 1b), which was recovered sister to a haplogroup containing, almost exclusively, sequences of and (hereafter referred as the group; Figure 1b). However, the relationship between and the group received negligible support in both analyses. Lastly, neither analysis supports the monophyly of the genus as currently recognized (Figure 1c). In the BI analysis, our only sequence of P. was part of a polytomy that included also a haplogroup formed by and a clade formed by haplogroups of , , , , and . In the ML analysis, this latter multi-genus clade and P. were recovered as sister groups with negligible support.
Table 2.

Summary statistics from the Species Delimitation plugin of Geneious for haplogroups of and deer recovered in the maximum-likelihood tree. Focal haplogroup support: bootstrap values; Intra: The average pairwise tree distance among members of the focal haplogroup; Inter: the average pairwise tree distance between the focal haplogroup and the members of the closest haplogroup; Intra/Inter: the ratio of Intra to Inter; P ID (strict): the mean (95% confidence interval) probability of correctly identifying an unknown member of the focal haplogroup using the criterion that it must fall within, but not sister to, the species clade in a tree; P ID (liberal): the mean (95% confidence interval) probability of correctly identifying an unknown member of the putative species using the criterion that it falls within, or sister to, the species clade in a tree; Av (MRCA-tips): the mean distance between the most recent common ancestor of a haplogroup and its members.

Focal HaplogroupClosest HaplogroupSupport Intra Inter Intra/InterP ID (strict)P ID (liberal)Av (MRCA-tips)
B. dichotomus M. gouazoubira 1000.0030.1560.020.92 (0.80, 1.0)0.98 (0.87, 1.0)0.0025
H. antisensis H. bisulcus NANA0.069NANA0.96 (0.83, 1.0)NA
H. bisulcus H. antisensis 1000.0020.0690.030.57 (0.43, 0.72)0.96 (0.81, 1.0)0.0011
americana group 1 M. temama <500.0500.0900.560.83 (0.77, 0.88)0.96 (0.93, 0.98)0.0341
americana group 2 hemionus group<500.0360.0930.390.75 (0.65, 0.86)0.91 (0.85, 0.97)0.0247
M. chunyi M. gouazoubira NANA0.046NANA0.96 (0.83, 1.0)NA
M. gouazoubira M. chunyi 610.0150.0460.320.87 (0.80, 0.94)0.96 (0.92, 1.0)0.0107
M. nemorivaga americana group 21000.0690.1770.390.78 (0.70, 0.87)0.93 (0.88, 0.98)0.0749
M. pandora columbianus group1000.0020.1110.020.78 (0.61, 0.96)1.00 (0.85, 1.0)0.0013
M. rufina americana group 2930.0410.1300.320.79 (0.69, 0.90)0.92 (0.86, 0.99)0.0449
M. temama americana group 1990.0160.0900.180.88 (0.80, 0.97)0.96 (0.91, 1.0)0.0270
hemionus group americana group 2<500.0160.0930.170.94 (0.88, 0.99)0.98 (0.95, 1.0)0.0246
columbianus group hemionus group1000.0060.0970.060.97 (0.92, 1.0)0.99 (0.97, 1.0)0.0040
Oz. bezoarticus M. gouazoubira 1000.0110.1380.080.96 (0.89, 1.0)0.99 (0.95, 1.0)0.0111
P. mephistophiles Oz. bezoarticus NANA0.160NANA0.96 (0.83, 1.0)NA
P. puda Oz. bezoarticus 1000.0040.1730.020.97 (0.89, 1.0)1.00 (0.95, 1.0)0.0044
R. tarandus americana group 21000.0100.2130.050.98 (0.93, 1.0)1.00 (0.97, 1.0)0.0071
Table 3.

Summary statistics from the Species Delimitation plugin of Geneious for haplogroups of and deer recovered in the Bayesian tree. Focal haplogroup support: posterior probability values; Intra: The average pairwise tree distance among members of the focal haplogroup; Inter: the average pairwise tree distance between the focal haplogroup and the members of the closest haplogroup; Intra/Inter: the ratio of Intra to Inter; P ID (strict): the mean (95% confidence interval) probability of correctly identifying an unknown member of the focal haplogroup using the criterion that it must fall within, but not sister to, the species clade in a tree; P ID (liberal): the mean (95% confidence interval) probability of correctly identifying an unknown member of the putative species using the criterion that it falls within, or sister to, the species clade in a tree; Av (MRCA-tips): the mean distance between the most recent common ancestor of a haplogroup and its members.

Focal Haplogroup Closest Haplogroup Support Intra Inter Intra/Inter P ID (strict) P ID (liberal) Av (MRCA-tips)
B. dichotomus M. gouazoubira 1.000.0652.0140.030.91 (0.79, 1.0)0.98 (0.87, 1.0)0.0352
H. antisensis H. bisulcus NANA0.906NANA0.96 (0.83, 1.0)NA
H. bisulcus H. antisensis 1.000.0470.9060.050.56 (0.41, 0.71)0.95 (0.80, 1.0)0.0236
americana group 1 M. temama 0.950.6881.2480.550.83 (0.78, 0.88)0.96 (0.93, 0.98)0.4722
americana group 2 M. temama 0.890.5091.3340.380.76 (0.65, 0.86)0.91 (0.85, 0.98)0.3445
M. chunyi M. gouazoubira NANA0.639NANA0.96 (0.83, 1.0)NA
M. gouazoubira M. chunyi 0.950.2500.6390.390.85 (0.78, 0.91)0.95 (0.91, 1.00)0.1888
M. nemorivaga P. mephistophiles 0.920.9392.1980.430.77 (0.68, 0.85)0.93 (0.87, 0.98)0.9906
M. pandora columbianus group1.000.0501.4370.030.77 (0.59, 0.94)0.99 (0.84, 1.0)0.0305
M. rufina americana group 21.000.5851.7940.330.79 (0.68, 0.89)0.92 (0.86, 0.98)0.6342
M. temama americana group 11.000.2391.2480.190.88 (0.79, 0.96)0.96 (0.91, 1.0)0.3774
hemionus group americana group 20.920.2701.3910.190.93 (0.87, 0.98)0.98 (0.95, 1.0)0.4257
columbianus group hemionus group1.000.1171.4160.080.96 (0.91, 1.0)0.99 (0.96, 1.0)0.0617
Oz. bezoarticus M. gouazoubira 1.000.1901.885NA0.95 (0.88, 1.0)0.98 (0.94, 1.0)0.1755
P. mephistophiles americana group 2NANA1.9210.00NA0.96 (0.83, 1.0)NA
P. puda Oz. bezoarticus 1.000.0842.0630.040.96 (0.88, 1.0)1.00 (0.94, 1.0)0.0454
R. tarandus americana group 21.000.1792.6580.070.97 (0.92, 1.0)0.99 (0.96, 1.0)0.1416
Summary statistics from the Species Delimitation plugin of Geneious for haplogroups of and deer recovered in the maximum-likelihood tree. Focal haplogroup support: bootstrap values; Intra: The average pairwise tree distance among members of the focal haplogroup; Inter: the average pairwise tree distance between the focal haplogroup and the members of the closest haplogroup; Intra/Inter: the ratio of Intra to Inter; P ID (strict): the mean (95% confidence interval) probability of correctly identifying an unknown member of the focal haplogroup using the criterion that it must fall within, but not sister to, the species clade in a tree; P ID (liberal): the mean (95% confidence interval) probability of correctly identifying an unknown member of the putative species using the criterion that it falls within, or sister to, the species clade in a tree; Av (MRCA-tips): the mean distance between the most recent common ancestor of a haplogroup and its members.

Monophyly of traditionally recognized species

Taxa traditionally regarded as valid species for which we included multiple samples were all recovered as monophyletic with strong support in both analyses (ML, BI), with four exceptions: , , , and (Figures 1a, 1c). Two clades were identified for , and these clades PageBreakwere not sister to each other. One of these clades was formed by haplotypes from Bolivia, Brazil, French Guiana, Paraguay, Peru, and Venezuela, and a strongly supported subclade of and ; hereafter we refer to that clade as the group 1. The monophyly of the group 1 (including as members and ) received negligible and strong support in the ML and BI analyses, respectively. The PageBreaksecond clade of included haplotypes from Amazonas, Pará, and southern states of Brazil; hereafter we refer to this clade as the group 2. Monophyly of the group 2 received negligible and moderate support in the ML and BI, respectively. group 2 was recovered as sister to a large clade containing , , , and the group 1. In the case of , all but one sequence were recovered in a fully supported haplogroup that was sister to a single sequence of that species, but this relationship received negligible support (Figure 1c). Neither of the traditionally recognized species of the genus were recovered as monophyletic in any of our analyses. Both analyses recovered most sequences of in a large, strongly supported haplogroup, which also included three sequences from North American (Figure 1a); hereafter we refer to this clade as the group. As mentioned earlier, both analyses also recovered most of the samples attributed to and all of the samples attributed to in another fully supported haplogroup (Figure 1b). This haplogroup also included a sequence attributed to , though this sample is from Alaska. This haplogroup, as previously mentioned, was found sister to , albeit with negligible support. Lastly, was not recovered as monophyletic; a few sequences of nested within the group. The remaining sequences of were recovered as closely related to the group, but they did not form supported haplogroups or show clear geographic patterns of relatedness.

Gene tree-based species delimitation statistics and genetic distances

Species delimitation statistics and genetic distances aided in identifying taxa or haplogroups of taxonomic interest. A low degree of within-haplogroup tree distance suggests that the implicated haplogroup might comprise a single species. The average within-haplogroup tree distances were 0.007 and 0.132 as calculated with the ML and BI trees, respectively. The smallest within-haplogroup tree distances corresponded to , , , and , whereas the highest within-haplogroup tree distances corresponded to the group 2, , group 1, and (see “Intra” in Tables 2 and 3). Conversely, high tree distances between closely related haplogroups suggest that the haplogroups might not be conspecific. The average between-haplogroup tree distances were 0.115 and 1.512 as calculated with the ML and BI trees, respectively. The smallest between-haplogroup tree distances were those between the two species of , and between and , whereas the highest between-haplogroup tree distances were those between the and groups of the genus , and between with respect to the group (see “Inter” in Tables 2 and 3). Two other metrics, “P ID (strict)” and “P ID (liberal)”, show probabilities of correctly identifying an unknown member of the focal haplogroup using the criteria that it must fall either within or sister to the focal haplogroup, respectively. The lower these probabilities, the less likely that the focal haplogroup represents a valid species. The mean P ID (strict) were 0.856 and 0.849 as calculated with the ML and BI trees, respectively; in both cases only four species had probabilities equal or above 0.95—, P. , the group, and (Tables 2 and 3). The mean values of P ID (liberal) were 0.966 and 0.963 as calculated with the ML and BI trees, respectively; in both analyses all species had probabilities equal or above 0.95, with exception of group 2, , and (Tables 2 and 3). Another statistic calculated was the average distance between the most recent common ancestor of PageBreaka focal haplogroup and the tips of its members, Av (MRCA-tips). The smaller the value of this metric, the more likely members of the focal haplogroup are conspecific. The mean Av (MRCA-tips) were 0.019 and 0.282 as calculated with the ML and BI trees, respectively; in both analyses , , and showed the smallest Av (MRCA-tips) and and the largest (Tables 2 and 3). Mean uncorrected sequence divergence within species-level haplogroups—provisionally treating the group, the group, the group 1, and the group 2 as if each represented an individual species-level haplogroup—ranges from 0.0 to 3.6% (Table 4). However, sequence divergences across the basal split within some species are considerably higher than these average within-group values. In particular, Central American sequences of differ from the PageBreakPageBreaksingle available Colombian sample of that species by 5.0%, and the lone sequence of from the state of Pará, in northern Brazil, differs from all other sequences of that species by 8.3%. Although not a basal split within , it is noteworthy that group 1 (from the Guiana Shield) and group 2 (from Brazil and Peru) differ from one another by 5.9%. Average interspecific divergences within three consistently recovered sister-species pairs ( + , + , and + group) range from 1.8% to 6.2% (Table 4). The sister-species pair formed by and was embedded within the diversity of the group 1; the level of divergence between these two species ( and ) was only 1.3%.
Table 4.

Matrix of genetic distances (percent sequence divergence) within and among recovered haplogroups of deer. Average uncorrected (p) distances among conspecific sequences are arrayed along the diagonal, interspecific p distances are below the diagonal, and Kimura two-parameter (K2P) distances are above the diagonal. No genetic distances were calculated within species for which we only had a single sequence available; however, we duplicated each of these sequences to allow for calculations of interspecific p-distances. The following names apply to haplogroups (as recovered in our phylogenetic analyses) rather than to species: 1, 2, group, and the group.

1234567891011121314151617
1. Blastocerus dichotomus 0.3 4.47.69.810.37.17.26.910.98.010.08.49.08.25.214.69.5
2. Hippocamelus antisensis 4.23.08.89.32.52.65.89.97.38.87.48.08.34.212.56.6
3. Hippocamelus bisulcus 7.12.9 0.8 8.39.93.85.85.911.45.68.310.19.57.87.410.25.6
4. Mazama americana 1 9.18.27.7 2.8 3.77.29.18.94.34.83.36.64.810.37.810.16.9
5. Mazama americana 2 9.58.59.13.8 3.2 7.28.89.04.35.23.96.75.810.68.311.18.6
6. Mazama chunyi 6.72.53.76.76.81.86.37.04.76.07.57.19.37.011.77.6
7. Mazama gouazoubira 6.82.65.48.48.11.8 0.5 7.19.06.58.07.79.111.47.113.49.6
8. Mazama nemorivaga 6.55.55.68.28.35.96.6 3.6 9.65.38.07.89.48.16.612.29.2
9. Mazama pandora 10.09.010.24.14.16.68.38.8 0.0 6.35.27.56.213.38.812.99.7
10. Mazama rufina 7.56.85.34.65.04.56.15.15.9 1.5 3.76.76.48.36.38.46.7
11. Mazama temama 9.38.27.83.23.85.77.57.55.03.6 0.7 5.64.68.78.09.96.8
12. columbianus group 7.97.09.26.26.37.07.27.26.96.35.4 2.2 6.612.36.311.99.1
13. hemionus group 8.47.58.74.55.56.78.48.75.86.04.46.2 0.2 11.37.011.07.9
14. Ozotoceros bezoarticus 7.77.77.29.59.78.510.37.511.87.78.011.110.3 0.7 9.215.27.8
15. Pudu mephistophiles 5.14.17.07.37.76.66.76.28.25.97.56.06.78.510.45.8
16. Pudu puda 13.111.39.49.410.210.712.011.111.77.99.210.810.113.59.7 0.4 9.8
17. Rangifer tarandus 8.76.15.36.57.97.18.78.48.96.26.48.37.47.35.59.1 0.8
Summary statistics from the Species Delimitation plugin of Geneious for haplogroups of and deer recovered in the Bayesian tree. Focal haplogroup support: posterior probability values; Intra: The average pairwise tree distance among members of the focal haplogroup; Inter: the average pairwise tree distance between the focal haplogroup and the members of the closest haplogroup; Intra/Inter: the ratio of Intra to Inter; P ID (strict): the mean (95% confidence interval) probability of correctly identifying an unknown member of the focal haplogroup using the criterion that it must fall within, but not sister to, the species clade in a tree; P ID (liberal): the mean (95% confidence interval) probability of correctly identifying an unknown member of the putative species using the criterion that it falls within, or sister to, the species clade in a tree; Av (MRCA-tips): the mean distance between the most recent common ancestor of a haplogroup and its members. Matrix of genetic distances (percent sequence divergence) within and among recovered haplogroups of deer. Average uncorrected (p) distances among conspecific sequences are arrayed along the diagonal, interspecific p distances are below the diagonal, and Kimura two-parameter (K2P) distances are above the diagonal. No genetic distances were calculated within species for which we only had a single sequence available; however, we duplicated each of these sequences to allow for calculations of interspecific p-distances. The following names apply to haplogroups (as recovered in our phylogenetic analyses) rather than to species: 1, 2, group, and the group.

Discussion

Polyphyly and phylogenetics of the genus

Based on data from all nine currently recognized species of (Gutiérrez et al. 2015), we confirm the findings by previous authors (Gilbert et al. 2006, Duarte et al. 2008, Hassanin et al. 2012, Escobedo-Morales et al. 2016, Heckeberg et al. 2016) that the genus, as traditionally understood (Allen 1915, Cabrera 1961), is polyphyletic. In the only comprehensive revisionary work published for , Allen (1915) stated that the main characteristics that distinguish the genus from other deer genera are: short, unbranching (spike-like) antlers in males (but note that males of the genus also possesses spike-like antlers); small, slightly expanded bullae in comparison with those of and ; flat and usually nearly straight upper borders of the orbits; slight over-hang of the frontals over the postorbital fossa; overall small size and the red coloration of most of its species; Allen also acknowledged the existence of a group of with brown coloration (Allen 1915). Clearly, our results and those from previous studies, one of them based on multi-locus data, demonstrate that this morphological characterization of does not diagnose a natural group (Gilbert et al. 2006, Escobedo-Morales et al. 2016). Logically, either some of these morphological characteristics resulted from convergent evolution, or they represent plesiomorphies inherited from an ancestor shared by many of these deer. Ancient hybridization, incomplete lineage sorting, or both, often explain lack of monophyly in recently originated clades when limited sequence data are analyzed (particularly mitochondrial DNA data); however, species traditionally classified into the genus are so widely distributed throughout the recovered tree that it seems unlikely that these phenomena explain the observed, rampant polyphyly. The tribe is divided into two major clades for which subtribe-level names have recently been proposed (Heckeberg et al. 2016). The subtribe contains taxa from temperate, subtropical, and tropical regions of the Americas, whereas the subtribe contains taxa exclusively from subtropical and tropical regions of South America (see Figures 1a, 1b, 1c). In our analyses, both subtribes were recovered with poor nodal support, but their monophyly has been supported by previous studies PageBreak(e.g., Gilbert et al. 2006, Hassanin et al. 2012). , as traditionally understood, is represented by species in both subtribes. In our analyses, the included all of the species of with red (reddish) pelage, i.e., , , , , ; one species with brown (brownish/grayish) pelage coloration, ; and the genus . The remaining three species of with brown (i.e., brownish or grayish) pelage coloration (i.e., , , ) were recovered in , which also includes the genera , , , and . These findings confirm, with more comprehensive sampling, those from two recent mtDNA-based studies (Escobedo-Morales et al. 2016, Heckeberg et al. 2016). Results from these studies clearly call into question the validity and usefulness of the terms “red clade”, “red brocket species group”, “gray clade”, “gray brocket species group”, “brown group”, all of which have been previously applied to groups (e.g., by Allen 1915, Duarte et al. 2008, Escobedo-Morales et al. 2016) whose respective monophyly has never been supported. These terms based on pelage coloration are highly misleading. For example, the term “gray clade” erroneously implies that all of the species now allocated within the subtribe possess predominantly gray pelage coloration, but almost half of the species in this subtribe lack such coloration (, , , ; Hershkovitz 1959, 1982, Jackson 1987, Rumiz et al. 2007, Miranda et al. 2009), and, more importantly, species of “” with gray pelage were not recovered as a monophyletic group in either our analyses or those of previous studies (Gilbert et al. 2006, Duarte et al. 2008, Escobedo-Morales et al. 2016).

Phylogenetic relationships and taxonomy of species traditionally classified as

Our results have implications for the alpha-level taxonomy of . Phylogenetic analyses based on CYTB data by Duarte et al. (2008) recovered in two distinct haplogroups, one of which also included terminal branches that they identified as and . In that study, however, these haplogroups formed part of a polytomy together with and a sequence of “ sp.” Subsequently, based on partial sequences of both the CYTB gene and the mitochondrial control-region (D-LOOP), Abril et al. (2010) recovered two monophyletic haplogroups within . Despite the lack of resolution in the results obtained by Duarte et al. (2008), Abril et al. (2010) assumed the monophyly of by the composition of their ingroup (i.e., not including other odocoileines), and, therefore, the topology they obtained could not evaluate whether represents a single species. However, more recent studies employing CYTB sequence data from multiple species of have shown to be polyphyletic (Escobedo-Morales et al. 2016, Heckeberg et al. 2016). Based on more comprehensive sampling, our results confirm the polyphyly of (as currently understood) and provide novel insights regarding the possible taxonomic identity and geographic distribution of at PageBreakleast two species currently lumped within . Before discussing this topic, we clarify that comparisons of the CYTB sequences generated by Abril et al. (2010) with respect to those analyzed by us indicated that the two haplogroups obtained by the former group of researchers match our groups 1 and 2. Because the name is based on the type locality of Cayenne, French Guiana (see Allen 1915), our group 1, which included a sequence (accession number NC020719; Figure 1b) from Barrage de Petite-Saut (Alexandre Hassanin in litt.), northern French Guiana, located at only ca. 80 km E Cayenne, likely corresponds to sensu stricto. Further work is necessary to determine whether group 1 truly corresponds to sensu stricto. If confirmed, then the sequence data herein analyzed, and that produced by Abril et al. (2010), would document the presence of sensu stricto in French Guiana, Bolivia, Brazil (states of Acre, Pará, Paraná, Rondônia, and São Paulo), Paraguay (department of Alto Paraná), Peru (Region of Cajamarca), and Venezuela (state of Yaracuy). The provenance localities of other analyzed samples of group 1 are unknown (see ). Further taxonomic work is also necessary to confirm that group 2 is not conspecific with sensu stricto and, if so, assign to it a species name. Our analysis documents this lineage (provisionally referred to as “ group 2” or “ 2”) in the states of Amazonas and Pará in northern Brazil. In addition, a sequence that matches our group 2 was generated by Abril et al. (2010) from a karyotyped individual born in captivity (in “Criadouro Santarém”) in the Brazilian state of Pará, but of unknown geographic origin. Previous research focused on Brazilian populations of sensu lato has shown the existence of at least six distinct karyotypes in different regions of that country, and inter-cytotype crosses in captivity demonstrated reproductive isolation among the most geographically-distant cytotypes (Cursino et al. 2014). The results from our phylogenetic analyses are congruent with these karyological and reproductive observations, and confirm that more than a single species is currently lumped under sensu lato. To date, the only study that has examined the morphological variation of sensu lato in a large portion of its distribution is the unpublished master thesis of Dr. Rogério V. Rossi (Rossi 2000). Based on morphometric analyses of Brazilian samples, Rossi found that specimens from littoral areas of southeastern Brazil (from Santa Catarina to São Paulo states) are slightly differentiated from those obtained from populations to the interior of that country. Whether a correspondence exists between these two morphologically distinguished groups and the clades identified in the present study remains to be addressed. Reconciling current phylogenetic information for and with their taxonomic status as valid species presents a conundrum. The existence of two species of small brockets in southern South America has been noted in the scientific literature since the first half of the 19th century (Lesson 1842, Goeldi 1893, Lydekker 1898, 1915, Miranda-Ribeiro 1919). These deer are currently referred to as , known from remnants of Atlantic Forest in southeastern São Paulo and eastern Paraná and Santa Catarina, Brazil (Duarte and Jorge 2003, Vogliotti and Duarte 2009, PageBreakDuarte et al. 2016), and as , known from Atlantic Forest habitat in southern São Paulo, Paraná, Santa Catarina, and northern Río Grande do Sul, Brazil (Rossi 2000, Vivo et al. 2011, Duarte et al. 2012). Records of also exist from the Alto Paraná and Itapúa departments of Paraguay (Gamarra de Fox and Martin 1996) and the Misiones province of Argentina (Di Bitetti et al. 2008). No agreement about their taxonomic status was reached until recently, when they were recognized as valid species on the basis of chromosomal differences between them and with sensu lato (Duarte and Jorge 2003, Abril and Duarte 2008). Reported karyotypes for these species include the following diploid and fundamental numbers (2n/FN): : 32–34/46 (Duarte and Jorge 2003); : 36/56, 37/59, 38/60 (Duarte and Jorge 2003), 36–39/58 (Abril and Duarte 2008); M group 1: 50/54; M group 2: 42/49, 43/48, 49/56, 51/56 (Duarte and Jorge 2003). Additional karyotypes reported for sensu lato lacking CYTB sequences are available—and hence not assigned to group 1 or 2—include the following 2n/FNs: 42/46, 43/46, 44/46, 44/48, 45/48, 50/54, 52/56, 53/56 (Abril et al. 2010). These data and a study that involved crosses in captivity to assess hybrids’ fertility have demonstrated that: (1) is not a hybrid between and , and is unable to produce fertile hybrids with either of these species (Duarte and Jorge 2003); and (2) groups 1 and 2 are reproductively isolated (Cursino et al. 2014). Based on these findings, phylogenetic analyses based on a relatively fast-evolving gene would be expected to recover , , and as independent lineages; however, the former two species were recovered nested within group 1. For species in this complex, future systematic efforts should concentrate in three areas. First, to investigate the phylogeographic structure of populations in the group 1, which implicitly requires assessing the phylogenetic position of and , based on sequence data from multiple unlinked loci, including nuclear DNA segments with faster mutation rates than the CYTB gene to resolve finer-scale relationships. This approach would concomitantly enable assessment of the potential role of hybridization, incomplete lineage sorting, or both as causal explanations for the topology obtained in our analyses (see above). Second, the specific mechanisms responsible for the remarkable karyological variation observed in this group need further investigation, as do their implications for speciation. Although important advances have been made unveiling the chromosomal variation in this group (e.g., Duarte and Jorge 1996, 2003, Abril and Duarte 2008, Cursino et al. 2014), much remains to be done, including investigating the possible role of B chromosomes—which are able to create even intra-individual karyological variation (Abril and Duarte 2008, Abril et al. 2010)—on speciation (if any). The mechanisms that have been postulated to explain the chromosomal variability of sensu lato need to be revisited because sensu lato is not monophyletic, as previously (and implicitly) assumed (by Abril et al. 2010, Cursino et al. 2014). Third, a morphological assessment of differences among natural groups (identifiable by molecular and karyological criteria) should be conducted in search of diagnostic traits. Preliminary analyses of linear measurements taken on craniodental and external traits allow unambiguous PageBreakdiscrimination between sensu lato and , but not between the former and (Rossi 2000). This is likely an artifact created by the fact that such comparisons were conducted assuming that populations of sensu lato comprised a single species, inflating its apparent variability. Similar comparisons between and permitted unambiguous discrimination between both of these species (Rossi 2000; but see Duarte et al. 2008). Our results offer novel phylogenetic information with respect to , a species endemic to the Península de Yucatán. A recent study based on mtDNA (Escobedo-Morales et al. 2016) recovered as a monophyletic haplogroup sister to , the only species of analyzed in that study. Another study reanalyzed these and additional data and found sister to a clade composed by a handful of sequences of and of unspecified geographic origin; the two species of were found intermixed with each other within a poorly supported monophyletic group (Heckeberg et al. 2016). Our more comprehensive sampling identified a novel sister-taxon relationship between and the group—the latter is a clade formed by most samples and all samples of , and a sample of , whose inclusion in this clade might be a consequence of hybridization. Given the traditional assignment of to the genus (Allen 1915, Medellín et al. 1998), its nested position within was unexpected. However, the overall morphological appearance of somewhat resembles that of the genus (Figure 2); the species has grayish pelage, and divergent antlers larger than other species classified in . It is expected that future work will unveil morphological synapomorphies between species of and . The sister relationship between and the group also suggests that the biogeographic history of these deer is complex, but this topic requires robust phylogenetic inference, enabling ancestral area reconstructions and proper molecular dating. However, discussing the nomenclatural implications of the close relationship between and the genus is necessary, especially after Escobedo-Morales et al. (2016) advocated allocating species of into the genus . Such an action, which has been contemplated by a few modern authors (Haltenorth 1963, Grubb 2000, Groves and Grubb 2011), would increase congruence between available phylogenetic information and the taxonomic nomenclature of but diminish efficiency in communication of scientific information. Allocating species currently treated as within would unnecessarily (see below) disrupt the association between the name and at least two—and perhaps more (Molina and Molinari 1999, Molinari 2007)—species epithets and the names of numerous subspecies (between 48 and 71) (Baker 1984, Brokx 1984, Méndez 1984, Smith 1991). This action would pose difficulties for retrieval of data and bibliography from repositories, such as GenBank and the Global Biodiversity Information Facility, and search engines, such as Google Scholar and the Web of Science, respectively. This is not a trivial matter because, given the importance of in aspects raging from public health to landscape ecology, massive amounts of data are associated with the name , whose North American members are among the most studied ungulates worldwide. A more suitPageBreakPageBreakable solution to the current incongruence between the phylogenetic information available and the nomenclature of these deer would be to restrict the use of the name to the clade containing and the group 1; to allocate to the genus , and to recognize and the group 2 as belonging to two separate genera, other than . Disrupting association between the genus and species epithets for “” , “” and taxa within the group 2 is both unavoidable—because of the polyphyletic nature of (as currently understood)—and less problematic for scientific communication because these species are far less studied than those of . This solution would reconcile the available phylogenetic information with the taxonomy of the group while minimizing nomenclatural instability. Similar considerations and actions have been recently employed to preserve binomial stability in various mammalian groups, including opossums (Giarla et al. 2010, Voss et al. 2014, Díaz-Nieto and Voss 2016, Pavan and Voss 2016), rodents (Teta et al. 2016), and primates (Garbino 2015, Gutiérrez and Marinho-Filnho 2017). A third alternative would be to retain in until data from independently inherited loci become available. However, no analytical evidence, of any sort, supports a close relationship between and , the type species of the genus. Although analyses of data from a single gene offer incomplete bases for taxonomic revisions, they represent an improvement when the traditional taxonomy in question is based on no evidence whatsoever. In those cases, dogmatically preserving the traditional taxonomy would essentially translate into imposing beliefs while ignoring data. The transferral of to an already-described genus, , seems a sensible and justifiable provisional action, considering not only the phylogenetic evidence here presented but also resemblance in external morphology between and species of (Figure 2). By contrast, allocating presumed clades currently regarded as sensu lato into different genera would involve either the description of new genera or the recognition of available generic names which are currently treated as junior synonymies, without sufficient consideration of morphological traits that might support such actions. These nomenclatural improvements should be carried out once a robust multi-locus phylogeny becomes available and should be coupled with morphological diagnoses of the genera to be proposed.
Figure 2.

Overall morphological appearance of “” (panels A–C) and that of the genus (panels D–F). Notice the grayish pelage and divergent antlers larger than in other species currently classified in . “” , panels A and C individuals kept in captivity at the Parque Zoológico del Bicentenario Animaya, Mérida, Yucatán, Mexico (photographs by Luis A. Escobedo-Morales)—provenance unknown; panel B individual kept in captivity in Tekax, Yucatán, Mexico (photograph by Rosa María González Marín)—provenance unknown. (see proposals by Molina and Molinari 1999 and Molinari 2007); panels D and E Monteredondo, Parque Nacional Chingaza, ca. 47 km (by road) E Bogota, Cundinamarca, Colombia (photographs by Aideé Vargas-Espinoza and Irene Aconcha, respectively); panel F Laguna de Mucubají, Parque Nacional Sierra Nevada, Mérida, Venezuela (photograph by Rodrigo Díaz Lupanow).

Besides confirming the monophyly of (Escobedo-Morales et al. 2016), we provide evidence that this species occurs in South America, or that populations in Colombia perhaps represent a currently unrecognized species. Previously, had been regarded as a Central American endemic, ranging from southeastern Mexico to Panama (Allen 1915). However, some authors speculated that the species could also range into northern Colombia, but provided no evidence or explanation (Bello-Gutiérrez et al. 2010). In our analyses, a sequence (GenBank accession number JN632673) from Parque Nacional Chingaza, near Bogotá, Cundinamarca, Colombia (Manuel Ruiz-García in litt.), previously assigned to (Hassanin et al. 2012), was recovered as sister to a haplogroup containing sequences of (Figure 1b). Because this latter haplogroup comprised sequences obtained from samples that were correctly identified via examination of voucher specimens (see Escobe), we herein re-identify this Colombian sample as . Our finding of the species in Colombia is congruent with unpublished morphometric data obtained by EEG, KMH, and JEM. In their recent study, Escobedo-Morales et al. (2016) retained the identity of sequence JN632673 as (a procedure also followed by Heckeberg et al. 2016) but noted that it could have resulted from misidentification, contamination, or hybridization with other species of , or that it might represent an unnamed species. Our results cannot reject that this sequence belongs to an currently unrecognized species because the sequence divergence existing between sequence JN632673 (from Colombia) and the Central American haplogroup of is (ca. 5.0%) substantially higher than divergences known between sister species pairs of deer (all below 3%; see Results). Hence, our assignment of sequence JN632673 to should be regarded as provisional; further work should explore the possibility that two species might be currently lumped within . Three species traditionally regarded as members of the genus were recovered within , the subtribe endemic to South America. One of them, , has only been incorporated twice in phylogenetic assessments (herein and in the just-published study by Heckeberg et al. 2016), and in each case based on a single CYTB sequence (obtained from different specimens). was found sister to , which was recovered in a monophyletic haplogroup (with strong and moderate support in the BI and ML analyses, respectively). Thus, pending confirmation via analyses of additional molecular data, it is likely that and represent a sister-species pair: one member is restricted to montane habitats of the Bolivian and Peruvian Andes () and the other is widely distributed in lowland habitats of South America (). If this result is corroborated, then both species should be assigned to a genus other than (which is based on and likely applies to group 1, see above). Even if further analyses do not confirm their sister-taxon relationship, both species need to be transferred to a genus other than because they share a most recent common ancestor with members of the subtribe , not with the type species of , which belongs to the subtribe . We note that the genus-group name Fitzinger, 1873, with type species Fitzinger (= ), may be available for this clade (Lydekker 1898, Allen 1915). We recovered two principal reciprocally monophyletic haplogroups within : one ( 1) formed exclusively by samples from the northern portion of the species’ range—i.e., from the Venezuelan state of Bolivar, the Guyanean region of Potaro-Siparuni, an unknown locality from French Guiana, and the Brazilian state of Rondônia—and the other ( 2) formed by samples from two unknown localities (one from Brazil and another from Peru) and from the Brazilian states of Pará, Paraná, and Rondônia. The monophyly of these haplogroups received either moderate or strong support. was recovered in our analyses as an isolated lineage divergent from other South American lineages of , including the - clade, with which it has been taxonomically associated for most of its past taxonomic history (e.g., Miranda-Ribeiro 1919, Cabrera 1961; but see Allen , Rossi 2000). Further research is needed to confirm its relationships and distinctness, but our results suggest it may require genus-level recognition within the . We note that the generic-level name Gloger, 1841, with type species Cuvier, 1817 (= ), is available for this clade (Palmer 1904). We found evidence that suggests that habitat association in and might have impacted their phylogeographic structure in contrasting ways. Despite the wide distribution of , which apparently ranges from Colombia (see below) to Argentina, we found shallow phylogeographic relationships among analyzed populations of this species (Figures 1a, 1b, 1c). This pattern might be explained by the tolerance of this species to a wide range of environmental conditions, as suggested by its occurrence across dry, wet, forested and open habitat types (Black and Vogliotti 2008, Black-Décima et al. 2010, Duarte et al. 2012). Wide environmental tolerance might have enabled historical connectivity among populations and gene flow. Conversely, in , a species that seems to be predominantly associated with tropical and subtropical broadleaf moist forest habitats (as described by Olson et al. 2001; Rossi and Duarte 2016), we found substantially deeper phylogeographic pattering. This pattern might be a consequence of past expansion and contractions of wet forest habitats isolating populations. Such expansions and contractions of forest habitats are thought to have triggered vicariance events that shaped the phylogeographic structure observed in species closely associated to either wet forest- or dry forest habitat types (Gutiérrez et al. 2014). Our analyses also yielded new insights regarding the distribution of “” . Given that a Colombian sample of “” (GenBank accession number JN632658 [curated version number NC_020720]; Hassanin et al. 2012), obtained from an live individual from northern Bolívar department (Manuel Ruiz-Garcia, in litt.), was recovered nested within a haplogroup containing all other samples of that species, our results demonstrate that the northern limit of the species’ distribution is not the southern margin of the Amazon basin, as recently argued (Black and Vogliotti 2008, Black-Décima et al. 2010, Duarte et al. 2012). The Colombian sample extends the distribution of at least ca. 1000 km to the north of literature records of the species in northwestern Bolivia (Black and Vogliotti 2008, Black-Décima et al. 2010, Duarte et al. 2012)—this distance is a rough estimate as we were not able to obtain detailed locality information for this sample (see Hassanin et al. 2012). We take the opportunity to comment on ambiguities that have prevailed in the literature with regard to the distribution of . Important discrepancies exist among published distribution maps for this species. For example, Duarte et al. (2012) depicted a distributional range for the species that includes the Amazonian region and the Guianas, the eastern slopes of the Ecuadorian and Peruvian Andes, the southern half of the Andean cordilleras of Colombia, the Sierra de Santa Marta and lowlands in northern Colombia, and the Lago de Maracaibo basin and the Península de Paraguaná in northwestern Venezuela. However, Rossi and Duarte (2016) omitted the Colombian Andes from their range map for this species, but included the entire VenPageBreakezuelan mainland with exception of the Andean cordilleras, the Península de Paraguaná, and the northern half of the La Guajira department of Colombia. These differences seem to have resulted from attempts to combine records of from Amazonia and the Guianas with alleged records of that species from other regions. A modern revisionary work evaluating the taxonomy of brockets in northern South America is indispensable to achieve reliable knowledge on the distribution of and determine which of the populations in northwestern South America, if any, correspond to , whose type locality is Cayenne, French Guiana (Allen 1915). Overall morphological appearance of “” (panels A–C) and that of the genus (panels D–F). Notice the grayish pelage and divergent antlers larger than in other species currently classified in . “” , panels A and C individuals kept in captivity at the Parque Zoológico del Bicentenario Animaya, Mérida, Yucatán, Mexico (photographs by Luis A. Escobedo-Morales)—provenance unknown; panel B individual kept in captivity in Tekax, Yucatán, Mexico (photograph by Rosa María González Marín)—provenance unknown. (see proposals by Molina and Molinari 1999 and Molinari 2007); panels D and E Monteredondo, Parque Nacional Chingaza, ca. 47 km (by road) E Bogota, Cundinamarca, Colombia (photographs by Aideé Vargas-Espinoza and Irene Aconcha, respectively); panel F Laguna de Mucubají, Parque Nacional Sierra Nevada, Mérida, Venezuela (photograph by Rodrigo Díaz Lupanow).

Monophyly and phylogenetics of the genus

Our results do not support the monophyly of the genus as traditionally understood because the node shared by all samples of received negligible support in both analyses and, more importantly, because was found embedded within (as previously discussed). Because of the apparent recent origin of , it is likely that recovering the genus and its species as monophyletic groups would require examination of DNA segments with higher mutations rates than that of the CYTB gene. In fact, we conducted preliminary analyses (not shown) of sequence data from the mitochondrial control region (D-loop) and CYTB generated for a previous study on the phylogeography of (Latch et al. 2009) and found that, when analyzed alone, the CYTB data failed to provide an adequately supported topology. By contrast, D-loop sequences analyzed in combination with the CYTB data yielded a more structured tree and with better nodal support (similar to that shown in figure 2 of Latch 2009).

Phylogenetic relationships and taxonomy of species of

Our results do not support the monophyly of either of the species traditionally recognized within the genus , i.e., and . Two explanations are likely. First, as mentioned above, the substitution rate of CYTB appears too low to allow adequate resolution of relationships as recent as these. In other words, incomplete lineage sorting might be responsible for the observed lack of monophyly in these taxa. Second, the observed lack of monophyly in these species is a partial consequence of hybridization between them, a phenomenon that has been widely documented (Carr et al. 1986, Stubblefield et al. 1986, Cronin et al. 1988, Key and Boe 1992, Cathey et al. 1998, Hornbeck and Mahoney 2000, Bradley et al. 2003). Hybridization between and , or among their respective subspecies (e.g., Hopken et al. 2015), seems to occur not only along contact zones of their native ranges, but also in areas to which they have been translocated for commercial purposes. For instance, a free-ranging deer in natural areas within Washington PageBreakDC (the National Zoo, Smithsonian Institution), with external characteristics matching , had a CYTB haplotype (KY928667) that places it within the group in our gene tree. This is a sign of hybridization between both species in the state of Virginia, where individuals of were translocated decades ago (Linzey 1998). Hybridization can also explain other instances in which nominal taxa were not recovered in monophyletic groups. For example, although most samples of black-tailed deer ( and ) form a clade, two sequences attributed to from Alaska were not recovered within this clade. These two sequences were recovered within the group which can be attributed to hybridization between and other subspecies of (Latch et al. 2009, 2011 and references therein). Similarly, hybridization may also explain why a sequence attributed to from Alaska was recovered within the group. The traditional classification of species of is incongruent with the phylogenetic information currently available for them. Our results suggest (1) that the and lineages, currently treated as subspecies of , form a clade that is more closely related to than to ; and that (2) appears more closely related to (even to from South America!) than to its putative subspecies columbianus or . In agreement with this possibility, the level of uncorrected genetic divergence, calculated with CYTB sequence data, between the and the groups (6.2%) greatly exceeds mean levels of divergences within species (and species-like lineages) of and (all below 3.6%, Table 4). Surprisingly in view of their importance to North American hunters, no phylogenetic study using nuclear sequence data from mule deer, white-tailed deer, and black-tailed deer have been conducted to date. If further analyses based on sequence data obtained from independently inherited loci confirm the topology obtained from mtDNA, then reconciling taxonomy with phylogenetics would require elevating and to species rank (see Future Directions). However, such further analyses based on multiple loci are likely to produce an alternative topology, for example by recovering all lineages of mule deer, white-tailed deer, and black-tailed deer as a monophyletic group and with sister to it. Under this plausible scenario, and for the sake of binomial stability, which has important implications for scientific communication (see discussion on this topic by Gutiérrez and Marinho-Filho 2017), we transfer to the genus , in congruence with the close relationship and overall similarity it shares with other members of . Regardless of which of these alternative topologies will be favored by additional analyses, dense geographic sampling is necessary to produce a suitable taxonomic classification with respect to lineages currently treated as members of and . This is particularly important due to the tremendous morphological variation documented among (even geographically close) populations of Neotropical white-tailed deer and the possibility that they might not be conspecific (as proposed by Molina and Molinari 1999, and Molinari 2007).

Monophyly and phylogenetic relationships of the remaining

According to the traditional taxonomy of deer, the recently described subtribe contains four species-poor genera, and containing two species each, and the monotypic and . Our analyses supported the monophyly of and . In addition, none of our tree- or genetic-distance metrics suggests the existence of additional unrecognized species within this genus. The single analyzed sequence of did not nest within the haplogroup of any other species. Nevertheless, our sampling for this genus was poor; additional studies might reveal higher diversity within the two traditionally recognized species of . In fact, the recent study by Heckeberg et al. (2016) analyzed the same sequences that we analyzed and two additional sequences of (of unknown geographic precedence). These authors recovered these additional sequences (hereafter referred to as lineage 2) as sister to . A third sequence of that species analyzed by these authors (which we analyzed; hereafter referred to as lineage 1) was recovered as sister to . Therefore, their results challenge the monophyly of both the genus and (Heckeberg et al. 2016), and suggest that an unrecognized species related to might exist among populations currently assigned to . Nevertheless, ancient hybridization and incomplete lineage sorting remain as alternative causal explanations for these results; these possibilities need to be tested with data from unlinked loci. Our results support the monophyly of both and , and none of our tree- or genetic-distance metrics suggest the possible existence of currently unrecognized species within sampled populations currently referred to as or . These results agree with results from previous studies (González et al. 1998, 2002, Márquez et al. 2006, Duarte et al. 2008). Both of our phylogenetic analyses (BI, ML) recovered sister to a clade containing “” , “” , and the genus ( + lineage 1); however, this relationship received insignificant support in the BI analysis and modest support in the ML analysis. That phylogenetic position for agrees with that recovered by Heckeberg et al. (2016) from CYTB data, but disagrees with the topology obtained by Hassanin et al. (2012) from complete mitochondrial genomes, who recovered sister to “” . Duarte et al. (2008) found sister to a clade formed by and “” . A likely explanation for this difference is that these authors used different optimality criteria than the ones that we used. The tree presented by Duarte et al. (2008) seems to have been produced by a neighbor-joining analysis (a phenetic technique) (showing bootstrap values from that analysis and from a Maximum-Parsimony analysis), whereas our analyses were based on Bayesian and Maximum-Likelihood optimality criteria. Duarte et al. (2008) also mentioned that an unreported Bayesian inference analysis they conducted yielded a similar topology to those of their other two analyses. Differences in the taxon sampling used by Duarte et al. (2008) and Hassanin et al. (2012) PageBreakwith respect to our taxon sampling might also help explain the differences between their topologies and ours. Similar factors could also explain disagreement between our results and those from previous studies with regard to the phylogenetic position of . Albeit with negligible support, our analyses recovered as sister to a clade formed by , , “” , and “” . Both Duarte et al. (2008) and Heckeberg et al. (2016) found sister to a sequence representing lineage 2, whereas Hassanin et al. (2012) recovered sister to a clade formed by “” and lineage 1. A case deserving close attention concerns the monophyly (or lack thereof) of the genus . According to the traditional taxonomy, contains two species, P. ( and P. ( (Hershkovitz 1982). The former occurs in Argentina and Chile, at elevations from sea level up to 1700 meters, whereas the latter occurs in the Andes of Colombia, Ecuador, and Peru at elevations between 1700 and 4000 meters (Hershkovitz 1982, Cronin et al. 2006, Meier et al. 2007, Escamilo et al. 2010, Jiménez 2010). Our results do not support the monophyly of the genus as traditionally recognized. , which is the type species of the genus, was recovered sister to a clade including “” , “” , ( + lineage 1), , and —this position was recovered in the best tree resulting from the ML analysis and in the consensus tree resulting from the BI analyses, but in both cases with negligible support. This large putative clade (including all the taxa just mentioned) was recovered sister to P. in the ML analysis, but with negligible nodal support. The BI analysis recovered P. in a polytomy at the base of the subtribe . This polytomy contained two additional branches, one leading to “” and another containing all other members of . The recent study by Heckeberg et al. (2016) analyzed multiple partial CYTB sequences of P. ; these authors conducted various analyses, but recovered the species in various positions, including: P. as sister to all other (as in our ML analysis); as sister to and ; and in an unresolved position with other clades and . However, Heckeberg et al. (2016) also analyzed a sequence labeled as P. (by Hassanin et al. 2012), overlooking the observation already made by Gutiérrez et al. (2015), who demonstrated that this sequence actually corresponds to “” . Despite the ambiguity regarding the position of P. , P. was consistently recovered in our analyses and in those by Heckeberg et al. (2016) as being more closely related to other than P. . This fact suggests the possibility that the genus , as traditionally defined, is not monophyletic. If confirmed by future studies, the monotypic (Thomas 1913), which is currently treated as a subgenus of , would warrant genus rank. According to Hershkovitz (Hershkovitz 1982; see also Brooke 1874, 1878), the union of the cuboideonavicular and external and middle cuneiform tarsal bones into a single bone (Figure 3) is the only osteological characteristic shared by P. and P. that consistently separates them from PageBreakall other living deer, except from the distantly related Asiatic genera and (Gilbert et al. 2006, Hassanin et al. 2012). Could this anatomical similarity between P. and P. be the result of evolutionary convergence rather than a trait inherited from a recent common ancestor shared between these two species? Convergence could also explain other similarities between these species, like their small sizes and spike-like antlers, among others (see Hershkovitz 1982). Evolutionary convergence in morphological appearance has misguided supraspecific classifications of deer before, most spectacularly in the case of the genus “” PageBreaksensu lato (as traditionally understood) (see findings of molecular studies based on data from either mDNA, nDNA, or both: Gilbert et al. 2006, Duarte et al. 2008, Hassanin et al. 2012, Escobedo-Morales et al. 2016, Heckeberg et al. 2016, the present study). Regardless of these issues concerning supraspecific classification, our results and those by Heckeberg et al. (2016), support the species-level monophyly of both P. and P. . Both of our phylogenetic analyses recovered P. in a single strongly supported haplogroup, and none of our analyses recovered the single analyzed sequence of P. embedded within another species’ haplogroups. None of our tree- or genetic-distance metrics suggest the existence of species-level diversity currently unrecognized among their populations.
Figure 3.

Hind foot bones of (A) and (B) sensu Hershkovitz (1982). According to Hershkovitz (1982; see also Brooke 1874, 1878), the union of the cuboideonavicular and external and middle cuneiform tarsal bones into a single bone in is the only osteological characteristic shared by P. and P. that consistently separates them from all other living deer, with exception of the genera and .

Hind foot bones of (A) and (B) sensu Hershkovitz (1982). According to Hershkovitz (1982; see also Brooke 1874, 1878), the union of the cuboideonavicular and external and middle cuneiform tarsal bones into a single bone in is the only osteological characteristic shared by P. and P. that consistently separates them from all other living deer, with exception of the genera and .

A word on the genus

Because we employed dense taxonomic and geographic sampling for deer, we sought to test if our approach confirmed the monophyly of this tribe and therefore included as part of our ingroup. , which is currently placed within the subtribe (Heckeberg et al. 2016), has been recovered sister to the in previous studies that were based on limited sampling for both and (Gilbert et al. 2006, Hassanin et al. 2012). We were also able to test, for the first time, the monophyly of with dense sampling of both and various . Our results were not controversial, as both of our phylogenetic analyses provided strong support to the monophyly of the genus and it was found sister to a clade formed by all —this clade was recovered in both analyses, albeit with negligible support in both cases.

Three main caveats affect the present study and, more generally, have hampered progress towards a suitable taxonomy for deer. First, the scarcity of freshly preserved tissue samples for Neotropical deer has restricted many studies to Sanger sequencing technologies and mitochondrial DNA, and in most cases only partial sequences of one or two genes are used. At present, CYTB is the only gene sampled broadly enough to support the geographic and taxonomic scope of the present study. Our new CYTB sequences filled some geographic and taxonomic gaps pre-existing on GenBank, but not all of them, and particularly for widely distributed taxa (e.g., and ), data are still missing from large and biogeographically interesting portions of their ranges. Secondly, the use of sequence data from a single locus is an obvious limitation. Because the mode of inheritance of mitochondrial DNA is matrilineal, our use of CYTB sequences allows inference only of matrilineal relationships among sampled populations, which might be contradicted when sequence data from additional loci PageBreakbecome available. Nevertheless, because female philopatry is rampant in mammals, matrilineal relationships are useful to identify priority regions and taxa in phylogenetic comparison. Moreover, previous studies based on CYTB sequence data have regularly improved the classification of tropical mammalian groups (e.g., Patterson and Velazco 2008, Solari et al. 2009, Gutiérrez et al. 2010, 2015, Voss et al. 2013, Moratelli et al. 2016, 2017, Molinari et al. 2017, the present study) whose decades-old classifications had been based on assessments of morphological similarity. Many of these classifications, including that of deer, were proposed at times predating the implementation of phylogenetic, or even statistical, analyses in taxonomic research. Third, many of the sequences available from GenBank are not associated with voucher specimens, lack geographic data, or both. This is likely due to the fact that many colleagues that generated these data are not taxonomists—but ecologists, wildlife managers, conservation biologists, and researchers working on public health issues—and they did not need to report such data for their particular research goals. Unfortunately, in many instances, it has not been reported whether voucher specimens are available and, if so, basic information associated with these specimens (e.g., institution in which they are housed, catalogue numbers, criteria used to assign taxonomic identifications) have not been provided. Similarly, geographic provenances of samples used to generate sequence data are rarely reported and, when reported, often limited to names of country and large administrative entities (e.g., state, department, etc.). Moreover, some Neotropical members of the tribe are rare, subject to intense pressure by humans (e.g., due to hunting and habitat loss), or both, which has hindered, in some countries, obtaining permits to collect specimens for research. To circumvent this difficulty, researchers have sometimes resorted to using samples obtained from animals kept in captivity. Often, zoos do not maintain detailed records of the provenance of animals they keep. The ambiguities resulting from all the aforementioned factors compromise the use of such samples (and derived sequences) from certain types of analyses (e.g., ancestral area reconstructions); even when they can be used, these issues often limit the interpretations that could otherwise be made. Examples of the latter type of problem are some of the sequences that we analyzed and that represent new and noteworthy distributional records—e.g., the apparent first record of for South America and Colombia; the apparent first record of “” for northwestern South America and Colombia—unfortunately, no detailed information about their provenance were published by the research teams that generated these sequences (see discussion above).

Future directions

Our results suggest that future systematic studies on deer should prioritize assessments of the taxonomic status of populations historically assigned to widely distributed taxa—e.g., species of and . PageBreak shows great morphological variability. Regional patterns of this high morphological variability have led authors to propose that multiple species exist among populations traditionally referred to (Molina and Molinari 1999, Molinari 2007). A study based on phylogenetic information and adequate sampling from North, Central, and South America has yet to be conducted to evaluate these proposals. Similarly, efforts based on mtDNA sequences (including the present report) and karyology have advanced our understanding of the variability of and documented the existence of an undescribed species among populations traditionally referred to this taxon. This species needs to be described, a process that necessarily requires both testing our hypothesis that group 1 likely corresponds to sensu stricto and solving the current incongruence between phylogenetics and the taxonomy of and . Other cases in which available phylogenetic information identified the likely existence of undescribed species are those of , whose populations have been recovered in two lineages that are not sister to each other (Heckeberg et al. 2016), and South American populations provisionally assigned to “” (the present study). A single sequence of “” is known from this region, but it is from an unknown locality in Colombia. This sequence is highly divergent from a clade formed by Central American populations of “” . Future fieldwork in northwestern South America and study of specimens housed at museums, particularly those in Colombia and Venezuela, might provide additional samples of this likely undescribed taxon. Clearly, substantial species-level taxonomic work is yet to be done. As the scientific community advances tackling the many taxonomic issues of cervid species, researchers should keep in mind that, despite the conservation status of some of these deer and the implicit difficulty to obtaining collecting permits for research, especially in the Neotropics, new species and subspecies should only be described when preserved museum specimens are available to document new names (see Ceríaco et al. 2016, Gutiérrez and Pine 2017, Dubois 2017 and references therein, Pine and Gutiérrez [in press]; contra Donegan 2008, Marshall and Evenhuis 2015). In addition, and also to avoid obstructing scientific progress, upcoming studies should provide sufficient information regarding voucher specimen availability and detailed information regarding the provenance of samples from which they have obtained data; unfortunately, this is not customary. The current supraspecific taxonomy of deer does not closely align with the information currently available regarding their phylogenetic relationships (Gilbert et al. 2006, Duarte et al. 2008, Hassanin et al. 2012, Escobedo-Morales et al. 2016, Heckeberg et al. 2016, the present study). Further phylogenetic analyses and morphology-based revisionary work is required. The use of massively-parallel sequencing technologies and the unprecedented potential to generate large amounts of DNA data from museum specimens offers the most promising approach to solve this incongruence; however, museum work should also be conducted to enable proper characterization and diagnoses of generic names to be assigned to clades. Efforts to generate a more robust phylogeny will also provide a basis for biogeographic studies on deer. Such studies will be of great interest for understanding aspects of the Great American Biotic PageBreakInterchange and other major events in the deep history of the American continents. Results presented in this study suggest that some long-lived notions about areas of origin and number and direction of dispersal events of deer are erroneous, but correcting them will require meaningful estimates of times since divergences and ancestral area reconstructions.
  45 in total

1.  In-solution hybridization for mammalian mitogenome enrichment: pros, cons and challenges associated with multiplexing degraded DNA.

Authors:  Melissa T R Hawkins; Courtney A Hofman; Taylor Callicrate; Molly M McDonough; Mirian T N Tsuchiya; Eliécer E Gutiérrez; Kristofer M Helgen; Jesus E Maldonado
Journal:  Mol Ecol Resour       Date:  2015-08-24       Impact factor: 7.090

2.  Species Delimitation--a Geneious plugin for the exploration of species boundaries.

Authors:  Bradley C Masters; Vicky Fan; Howard A Ross
Journal:  Mol Ecol Resour       Date:  2011-01       Impact factor: 7.090

3.  Specimen collection crucial to taxonomy.

Authors:  Eliécer E Gutiérrez; Ronald H Pine
Journal:  Science       Date:  2017-03-24       Impact factor: 47.728

4.  Photography-based taxonomy is inadequate, unnecessary, and potentially harmful for biological sciences.

Authors:  Luis M P Ceríaco; Eliécer E Gutiérrez; Alain Dubois
Journal:  Zootaxa       Date:  2016-11-23       Impact factor: 1.091

5.  Mitochondrial DNA and microsatellite DNA variation in domestic reindeer (Rangifer tarandus tarandus) and relationships with wild caribou (Rangifer tarandus granti, Rangifer tarandus groenlandicus, and Rangifer tarandus caribou).

Authors:  Matthew A Cronin; Michael D Macneil; John C Patton
Journal:  J Hered       Date:  2006-07-12       Impact factor: 2.645

6.  Species-wide phylogeography of North American mule deer (Odocoileus hemionus): cryptic glacial refugia and postglacial recolonization.

Authors:  Emily K Latch; James R Heffelfinger; Jennifer A Fike; Olin E Rhodes
Journal:  Mol Ecol       Date:  2009-03-19       Impact factor: 6.185

7.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

8.  New species without dead bodies: a case for photo-based descriptions, illustrated by a striking new species of Marleyimyia Hesse (Diptera, Bombyliidae) from South Africa.

Authors:  Stephen A Marshall; Neal L Evenhuis
Journal:  Zookeys       Date:  2015-10-05       Impact factor: 1.546

9.  The role of chromosome variation in the speciation of the red brocket deer complex: the study of reproductive isolation in females.

Authors:  Marina Suzuki Cursino; Maurício Barbosa Salviano; Vanessa Veltrini Abril; Eveline dos Santos Zanetti; José Maurício Barbanti Duarte
Journal:  BMC Evol Biol       Date:  2014-03-04       Impact factor: 3.260

10.  PEAR: a fast and accurate Illumina Paired-End reAd mergeR.

Authors:  Jiajie Zhang; Kassian Kobert; Tomáš Flouri; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2013-10-18       Impact factor: 6.937

View more
  6 in total

1.  The systematics of the Cervidae: a total evidence approach.

Authors:  Nicola S Heckeberg
Journal:  PeerJ       Date:  2020-02-18       Impact factor: 2.984

2.  Museomics of tree squirrels: a dense taxon sampling of mitogenomes reveals hidden diversity, phenotypic convergence, and the need of a taxonomic overhaul.

Authors:  Edson Fiedler de Abreu-Jr; Silvia E Pavan; Mirian T N Tsuchiya; Don E Wilson; Alexandre R Percequillo; Jesús E Maldonado
Journal:  BMC Evol Biol       Date:  2020-06-26       Impact factor: 3.260

3.  Satellite DNA in Neotropical Deer Species.

Authors:  Miluse Vozdova; Svatava Kubickova; Natália Martínková; David Javier Galindo; Agda Maria Bernegossi; Halina Cernohorska; Dita Kadlcikova; Petra Musilová; Jose Mauricio Duarte; Jiri Rubes
Journal:  Genes (Basel)       Date:  2021-01-19       Impact factor: 4.096

4.  Revalidation of Mazama rufa (Illiger 1815) (Artiodactyla: Cervidae) as a Distinct Species out of the Complex Mazama americana (Erxleben 1777).

Authors:  Pedro H F Peres; Douglas J Luduvério; Agda Maria Bernegossi; David J Galindo; Guilherme B Nascimento; Márcio L Oliveira; Eluzai Dinai Pinto Sandoval; Miluse Vozdova; Svatava Kubickova; Halina Cernohorska; José Maurício Barbanti Duarte
Journal:  Front Genet       Date:  2021-12-14       Impact factor: 4.599

5.  Cytochrome b sequence of the Mazama americana jucunda Thomas, 1913 holotype reveals Mazama bororo Duarte, 1996 as its junior synonym.

Authors:  Aline Meira Bonfim Mantellatto; Susana González; José Maurício Barbanti Duarte
Journal:  Genet Mol Biol       Date:  2021-12-13       Impact factor: 1.771

Review 6.  Species delimitation based on diagnosis and monophyly, and its importance for advancing mammalian taxonomy.

Authors:  Eliécer E Gutiérrez; Guilherme S T Garbino
Journal:  Zool Res       Date:  2018-03-08
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.