Literature DB >> 32336263

Pattern and timing of diversification in the African freshwater fish genus Distichodus (Characiformes: Distichodontidae).

Jairo Arroyave1,2, John S S Denton3,4, Melanie L J Stiassny3.   

Abstract

BACKGROUND: Distichodus is a clade of tropical freshwater fishes currently comprising 25 named species distributed continent-wide throughout the Nilo-Sudan and most Sub-Saharan drainages. This study investigates the phylogenetic relationships, timing of diversification, and biogeographic history of the genus from a taxonomically comprehensive mutilocus dataset analyzed using Maximum Likelihood and Bayesian methods of phylogenetic inference, coalescence-based species-tree estimation, divergence time estimation, and inference of geographic range evolution.
RESULTS: Analyses of comparative DNA sequence data in a phylogenetic context reveal the existence of two major clades of similar species-level diversity and provide support for the monophyletic status of most sampled species. Biogeographic reconstruction on a time-scaled phylogeny suggest that the origins of the genus date back to the late Oligocene and that current geographic distributions are the result of a Congo Basin origin followed by dispersal and range expansion into adjacent ichthyofaunal provinces at different times during the evolutionary history of the group.
CONCLUSIONS: We present the most comprehensive phylogenetic, chronological, and biogeographic treatment yet conducted for the genus. The few instances of species paraphyly (D. teugelsi, D. fasciolatus) revealed by the resulting phylogenies are likely a consequence of post-divergence introgressive hybridization and/or incomplete lineage sorting due to recent speciation. Historical biogeographic findings are both in agreement and conflict with previous studies of other continent-wide African freshwater fish genera, suggesting a complex scenario for the assemblage of Africa's continental ichthyofaunal communities.

Entities:  

Keywords:  African fishes; Congo Basin; Distichodontidae; Distichodus; Geographic range evolution; Molecular dating; Molecular phylogeny

Mesh:

Year:  2020        PMID: 32336263      PMCID: PMC7184684          DOI: 10.1186/s12862-020-01615-6

Source DB:  PubMed          Journal:  BMC Evol Biol        ISSN: 1471-2148            Impact factor:   3.260


Background

Distichodus, the type genus of the endemic African characiform family Distichodontidae, is a morphologically distinctive and moderately speciose lineage of endemic African freshwater fishes. Distichodus species are distributed across the continent, occurring throughout the freshwaters of most of sub-Saharan Africa and the river basins of the Nilo-Sudan, with representation in six of the nine ichthyofaunal provinces of continental Africa (Fig. 1). Although general aspects of the biology of the genus are poorly documented, a few studies indicate that most species are typically diurnally active and found primarily in lentic habitats shoaling in and around grasses along vegetated river banks and swamps [4]. Most species are primarily herbivorous, feeding almost entirely on periphyton, macrophytes, and detritus [5-7] [pers. obs.], although some, such as D. lusosso, have been characterized as dietary generalists feeding on a range of both plant and animal materials [6]. Besides playing an important role as a major constituent of the ecologically important herbivore/detrivore guilds in African freshwaters [8], Distichodus is also of considerable socio-economic importance, as many species constitute a highly valued, but increasingly over-exploited, component of artisanal and commercial fisheries across the continent [9], and due to their high fecundity and herbivorous diet are increasingly being cultured in fish farms and lentic water bodies, particularly in western Africa [4].
Fig. 1

Geographic distribution and variation in external morphology of Distichodus species diversity. Map of Africa divided into ichthyofaunal provinces (originally defined by Roberts [1], modified by Lévêque [2], and redrawn according to new hydrological basin mapping published by FAO [3]): Congo Basin (CB), East Africa (EA), Nilo-Sudan (NS), Lower Guinea (LG), South Africa (SA), and West Africa (WA). Shaded area represents Distichodus extent of occurrence. Inset bar charts indicate number of Distichodus species present in each ichthyofaunal province: endemic (red) and total (blue) (when more than endemics). Inset frame fish photographs illustrate the extent of variation in body shape, size, and coloration in Distichodus species (from top to bottom: D. hypostomatus, D. sexfasciatus, D. lussoso, D. antonii, D. affinis, D. shenga, D. decemmaculatus)

Geographic distribution and variation in external morphology of Distichodus species diversity. Map of Africa divided into ichthyofaunal provinces (originally defined by Roberts [1], modified by Lévêque [2], and redrawn according to new hydrological basin mapping published by FAO [3]): Congo Basin (CB), East Africa (EA), Nilo-Sudan (NS), Lower Guinea (LG), South Africa (SA), and West Africa (WA). Shaded area represents Distichodus extent of occurrence. Inset bar charts indicate number of Distichodus species present in each ichthyofaunal province: endemic (red) and total (blue) (when more than endemics). Inset frame fish photographs illustrate the extent of variation in body shape, size, and coloration in Distichodus species (from top to bottom: D. hypostomatus, D. sexfasciatus, D. lussoso, D. antonii, D. affinis, D. shenga, D. decemmaculatus) Currently, the genus contains 25 valid species [10-12], most of which are found in the Congo River basin with species diversity decreasing with distance from that central African center of diversity (Fig. 1). Although no morphological synapomorphies have yet been identified for Distichodus, the genus can be distinguished from all other distichodontid genera by the combination of: an upper jaw only slightly mobile with respect to the cranium; an edentulous maxilla not tightly applied posteriorly to the premaxilla; two rows (generally) of gracile, long stalked, bicuspid teeth in each jaw; a highly mobile joint between the angulo-articular and dentary (i.e., a Distichodus-type lower jaw [13]); a reduced dentary portion of the mandibular sensory canal; and a completely pored lateral line [13, 14]. Morphological variation within the genus includes notable differences in overall body size, spanning two orders of magnitude and ranging from over ~ 1 m in the largest species (D. nefasch, D. langi) to ~ 5 cm in the smallest (D. decemmaculatus, D. teugelsi), lateral line scale counts (large- vs. small-scaled), the position of the mouth (terminal vs. inferior), coloration (including presence and number of dark vertical bands and spots), tooth number in the oral jaws, and fin ray counts, among others [14-16] (Fig. 1). The genus Distichodus was erected in the mid-nineteenth century [17] and much of the currently recognized taxonomic diversity had been described by the early twentieth century. As is typical of the taxonomic literature prior to the mid-twentieth century, these older descriptions are highly abbreviated, usually lacking anatomical or ecological detail, and often based on examination of little or no comparative material. In one of the earliest attempts at providing a classification scheme for Distichodus, Boulenger [15] divided the genus in two major groups based on the number of lateral line scales. Boulenger’s classification scheme and the monophyletic status of the genus, however, were not tested until the cladistic study of Vari [13], in which the phylogenetic relationships of the Distichodontidae were investigated using comparative anatomical data. Although only five species of Distichodus were included in his study, Vari’s findings failed to support the hypothesis of Distichodus monophyly, resolving some species more closely related to a clade formed by the diminutive distichodontid genera Nannocharax and Hemigrammocharax. Contrary to Vari’s work [13], the first molecular phylogenetic study focused on the Distichodontidae [18] found strong support for the monophyly of Distichodus, and while this study did not focus on the genus and sampling of Distichodus species was not exhaustive, it provided the first picture of Distichodus relationships. Despite this recent contribution to understanding of distichodontid relationships, taxonomic problems within Distichodus persist, and ongoing morphometric and morphological studies (Vreven, pers. comm.) indicate that considerable cryptic diversity remains unrecognized by current taxonomy [14, 16, 19]. Because the taxonomy of Distichodus has only been incidentally examined since the work of Boulenger [12, 13, 16, 18, 20], a comprehensive and focused phylogenetic treatment of the genus (including sampling of multiple individuals per species from a broad geographic range) is needed to test the current classification and to lay essential foundations for future investigations of this socio-economically important genus. Therefore, to advance our understanding of the systematics and evolutionary history of Distichodus, in addition to providing insights into the processes generating fish diversity in freshwater environments of continental Africa, this study investigates the phylogenetic, biogeographic, and chronological framework for the diversification of the genus based on multi-locus comparative DNA sequence data. The study provides a robust phylogenetic framework for testing the adequacy of the current Distichodus taxonomy, informing future revisionary studies and conservation actions, as well as addressing an array of questions about the evolutionary history of the genus. Furthermore, given its pan-African distribution, knowledge on the temporal and geographic context for the diversification of Distichodus holds considerable promise for shedding light on the very poorly understood biogeographic history of the continent’s riverine networks.

Results

Sequence data summary statistics, partitioning scheme and substitution models

The concatenated alignment of eight genes consisted of 6824 sites, of which 1581 were variable and 1339 parsimony-informative. The few instances of failed DNA amplification and/or sequencing resulted in < 2% of missing data. The best partitioning scheme according to the PartitionFinder analysis comprise four partitions: 1) the entire mtDNA control region (cr), 2) 3rd codon positions of the protein-coding mitochondrial genes [co1, cytb, and nd], 3) 1st and 2nd codon positions of the nuclear genes [enc1, glyt, myh6, shx3px3] plus 2nd codon positions of the mitochondrial protein-coding genes, and 4) 3rd codon positions of nuclear genes plus 1st codon positions of mitochondrial protein-coding genes. The best-fit substitution models for these partitions were HKY + G + X, TrN + G + X, TrN + I + X, and TrNef+I + G, respectively. Models that include +X are those in which base frequencies are estimated using maximum likelihood rather than using the empirical frequency distributions. For the BEAST2 analyses, all model parameter ESS values were greater than 200 and effective topological approximate ESS was always > 570. All best-fit codon models for individual gene trees input to ASTRAL-III were Muse and Gaut’s [21] (MG94) + M0 + F3x4 codon frequency models, with the exception of myh6, for which an MG94 + M3 + F3x4 model was inferred. Terminology for the number of omega (ω) classes follows Yang et al. [22].

Distichodus phylogeny

The phylogeny derived from ML analysis (RAxML tree) of the concatenated alignment of all eight markers is presented in Fig. 2. A summarized version of this phylogeny, highlighting interspecific relationships, is illustrated in Fig. 3. Single-locus phylogenies (enc1, glyt, myh6, sh3px3, mtDNA) are presented in Figs. S1, S2, S3, S4 and S5, respectively. As expected, partially because of variation in substitution rates, single-locus phylogenies differed in the level of resolution and nodal support, with ncDNA markers resulting in less resolved and supported phylogenies when compared to the mtDNA locus.
Fig. 2

Total-evidence Distichodus phylogeny as inferred by likelihood in RAxML. Colored circles on nodes indicate degree of clade support as determined by bootstrap values (BS). Nodes labeled A and B represent the two main infrageneric clades. Outgroup taxon (Paradistichodus dimiatus) not shown

Fig. 3

Summary tree of the RAxML Distichodus phylogeny highlighting interspecific relationships

Total-evidence Distichodus phylogeny as inferred by likelihood in RAxML. Colored circles on nodes indicate degree of clade support as determined by bootstrap values (BS). Nodes labeled A and B represent the two main infrageneric clades. Outgroup taxon (Paradistichodus dimiatus) not shown Summary tree of the RAxML Distichodus phylogeny highlighting interspecific relationships Species-tree analyses (SVDquartets and ASTRAL-III) results are presented in Figs. 4 and 5, respectively. BEAST2 analyses yielded very similar topologies (Figs. 6 and S6, S7, S8, S9, S10, S11, S12, S13, S14, S15, S16, S17, S18 and S19), only differing slightly in resolution within one of the two main clades discovered. The RAxML, SVDquartets, and BEAST2 phylogenies exhibit largely congruent topologies with comparable nodal support, resolving the genus into two strongly supported major clades of roughly equivalent species diversity and with the same limits and composition (clades A and B in Figs. 2, 3, 4, 6, and S6, S7, S8, S9, S10, S11, S12, S13, S14, S15, S16, S17, S18 and S19). While these three different analytical methods revealed the same general pattern of relationships in clade A, with disagreement inside clade B (notably among D. engycephalus, D. kasaiensis, D. lusosso, and D. atroventralis), the ASTRAL-III analysis produced a considerably different topology (Fig. 5). The source of this disagreement with the other methods is unclear.
Fig. 4

Distichodus species tree generated using the coalescence-based method SVDquartets

Fig. 5

Distichodus species tree generated using the coalescence-based method ASTRAL-III

Fig. 6

A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 8, intermediate in terms of calibration node (D, crown) and P SMB (30 Ma). Divergence-time estimates are represented by the mean ages of clades. Light red bars correspond to 95% highest posterior density (HPD) intervals of mean node ages. Calibration (fossil-based) node indicated by a dagger (†). Colored circles on nodes indicate degree of clade support as determined by posterior probabilities: black > 0.95, 0.95 ≥ blue ≥0.75, red < 0.75. Outgroup taxon (Nannocharax ansorgii) not shown

Distichodus species tree generated using the coalescence-based method SVDquartets Distichodus species tree generated using the coalescence-based method ASTRAL-III A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 8, intermediate in terms of calibration node (D, crown) and P SMB (30 Ma). Divergence-time estimates are represented by the mean ages of clades. Light red bars correspond to 95% highest posterior density (HPD) intervals of mean node ages. Calibration (fossil-based) node indicated by a dagger (†). Colored circles on nodes indicate degree of clade support as determined by posterior probabilities: black > 0.95, 0.95 ≥ blue ≥0.75, red < 0.75. Outgroup taxon (Nannocharax ansorgii) not shown Regardless of inference method, and conforming to expectation, nodal support was greater at deeper divergences, while weaker (BS < 75; PP < 0.75) at nodes corresponding to more recent divergences, likely reflecting intraspecific population-level structuring (when sampling multiple individuals per species). Nonetheless, for the most part, interspecific relationships are well supported, with the exception of a subclade of clade B.

Monophyly of Distichodus species

Sampling of multiple individuals per species allowed testing of the monophyletic status of most morphologically diagnosed Distichodus species, and the resulting total evidence phylogeny (Figs. 2 and 3) strongly supports the monophyly of most of the species for which multiple individuals were sampled. However, there are two notable exceptions: the species pairs D. teugelsi / D. decemmaculatus, and D. fasciolatus / D. schenga, each of whose members were resolved as paraphyletic with respect to the other. Specifically, the phylogenetic placement of all sampled individuals of morphologically determined D. teugelsi renders D. decemmaculatus paraphyletic, and similarly, the phylogenetic placement of the two sampled individuals of D. schenga renders D. fasciolatus paraphyletic (Fig. 2). Although based on considerably fewer comparative data, the mtDNA phylogeny agreed, for the most part, with the concatenated phylogeny in the monophyly of most sampled species. Most ncDNA single-locus phylogenies, on the contrary, exhibited lower degrees of resolution and support than the total evidence and mtDNA trees, failing to support the monophyletic status of several of the species evaluated.

Timescale of Distichodus diversification

The resultant chronograms from the BEAST2 analyses are presented in Figs. 6 and S6, S7, S8, S9, S10, S11, S12, S13, S14, S15, S16, S17, S18 and S19, and a summary of the results including age estimates and associated HPD intervals of select nodes in Table 1. A number of findings are apparent regardless of calibration strategy, and therefore, of absolute times of divergence. Notable among these are that Distichodus (crown group) originated shortly after its divergence from Paradistichodus, and that the two major components of the Distichodus radiation (clades A and B) started diversifying roughly concurrently. However, despite this initial chronological correspondence, a large subclade of clade B consisting of seven species (the MRCA of D. kasaiensis and D. atroventralis and all of its descendants) is, for the most part, of comparatively more recent origin.
Table 1

Results from alternative BEAST2 analyses. Estimated mean ages (in Ma) and associated 95% HPD intervals of select nodes: D + P = MRCA of Distichodus & Paradistichodus; D = MRCA of Distichodus species; Dne + Dro = MRCA of D. nefasch & D. rostratus; DA = Distichodus subclade A; DB = Distichodus subclade B. P SMB = 95th percentile soft maximum bound (in Ma), as a proxy for the maximum node age constraint

AnalysisCalibration node, P95 SMBD + PDDADBDne + Dro
1D + P (stem), 2012.42 [8.71, 16.55]11.21 [7.92, 15.14]9.82 [6.80, 13.26]9.14 [6.24, 12.53]3.47 [1.98, 5.24]
2D + P (stem), 3015.38 [9.76, 21.86]13.76 [8.67, 19.65]12.01 [7.48, 17.21]11.22 [6.94, 16.22]4.37 [2.35, 6.81]
3D + P (stem), 4019.44 [11.41, 29.34]17.47 [10.31, 26.48]15.30 [8.87, 23.25]14.23 [8.16, 21.66]5.41 [2.61, 8.78]
4D + P (crown), 2019.03 [18.24, 20.15]17.29 [14.75, 19.51]15.16 [12.64, 17.63]14.13 [11.58, 16.77]5.38 [3.44, 7.47]
5D + P (crown), 3024.01 [19.43, 30.15]21.53 [15.95, 28.01]18.81 [13.84, 24.81]17.60 [12.55, 23.30]6.89 [4.04, 10.10]
6D + P (crown), 4028.95 [20.81, 39.58]26.09 [17.75, 36.45]22.89 [15.19, 32.04]21,32 [14.00, 30.08]8.10 [4.40, 12.38]
7D (crown), 2021.50 [18.54, 25.08]19.01 [18.23, 20.07]16.61 [14.56, 18.39]15.54 [13.26, 17.65]6.09 [3.98, 8.30]
8D (crown), 3027.12 [20.64, 35.18]24.10 [19.44, 30.20]21.15 [16.39, 27.14]19.69 [14.84, 25.69]7.49 [4.46, 10.82]
9D (crown), 4032.08 [22.31, 44.49]28.60 [20.79, 38.73]25.09 [17.90, 34.80]23.34 [16.12, 32.41]8.83 [4.75, 13.34]
10Dne + Dro (stem), 2041.39 [30.72, 52.63]36.97 [28.92, 46.28]32.44 [24.74, 41.13]30.08 [23.66, 37.06]11.32 [8.03, 14.69]
11Dne + Dro (stem), 3050.21 [34.81, 67.85]44.86 [31.82, 59.25]39.28 [27.43, 52.69]36.53 [26.67, 48.17]13.88 [8.93, 19.16]
12Dne + Dro (stem), 4057.55 [37.68, 80.13]51.46 [34.88, 70.56]45.09 [29.98, 62.81]41.98 [28.82, 57.31]15.90 [9.53, 22.72]
13Dne + Dro (crown), 2066.14 [43.85, 90.50]59.09 [40.85, 79.59]51.71 [34.59, 69.69]48.32 [34.01, 65.26]18.96 [18.24, 19.94]
14Dne + Dro (crown), 3078.23 [50.07, 109.73]69.92 [45.95, 96.62]61.16 [39.54, 85.63]57.24 [38.17, 79.02]22.85 [19.30, 27.43]
15Dne + Dro (crown), 4087.68 [53.18, 125.70]78.53 [49.63, 111.69]68.74 [43.03, 99.17]64.28 [40.60, 91.16]25.88 [20.21, 32.85]
Results from alternative BEAST2 analyses. Estimated mean ages (in Ma) and associated 95% HPD intervals of select nodes: D + P = MRCA of Distichodus & Paradistichodus; D = MRCA of Distichodus species; Dne + Dro = MRCA of D. nefasch & D. rostratus; DA = Distichodus subclade A; DB = Distichodus subclade B. P SMB = 95th percentile soft maximum bound (in Ma), as a proxy for the maximum node age constraint Of the main variables defining calibration strategy (i.e., calibration node and P SMB), selection of calibration node appears to have the strongest effect on estimates of divergence times, with node Dne + Dro resulting in the oldest node age estimates (substantially older than those based on any of the other calibration nodes used), irrespective of P SMBs. However, node age estimates based on calibration node P + D did not differ considerably from those based on calibration node D, especially under equivalent P SMBs. This trend can be explained by the fact that the age difference between these nodes is relatively small, as previously mentioned. Unsurprisingly, older P SMBs resulted in older node age estimates, although perhaps not as much as anticipated. According to the results of analysis 8 (Fig. 6), under what could be considered a “midway” calibration strategy, intermediate in terms of calibration node (D, crown) and P SMB (30 Ma), the origins of the Distichodus crown group date to the late Oligocene (24.1 Ma; 95% HPD = 19.44–30.20). Conforming to expectation, this estimate is older (~ 7 Ma) than the only previously published estimate, inferred in the context of a time-scaled phylogeny of the Distichodontidae (17.22 Ma; 95% HPD = 12–23) [18]. The results from analysis 8 also indicate that by the late Miocene/early Pliocene (~ 5 Ma) the bulk of species diversity in the genus was already present. Furthermore, this chronogram implies that stem lineages leading to the modern species D. hypostomatus, D. maculatus, D. engycephalus appeared around 21–18 Ma, while most remaining modern diversity likely originated during the late Miocene. Notably, the most recent divergences (~ 1 Ma) correspond to the seemingly paraphyletic species pairs fasciolatus/shenga and teugelsi/decemmaculatus mentioned above, an observation that supports the notion that each of these pairs may correspond to lineages at the early stages of differentiation and speciation.

Geographic range evolution on the Distichodus phylogeny

Model comparison using AIC and AIC weights (Table 2) indicate support for the M1 model (CB-as-source) over the M2 model (CB-as-sink), while the unconstrained (M0) model received negligible support, regardless of absolute times of divergence (input chronogram). Likewise, the pattern of range shifts out of and expansions from the Congo Basin (the ancestral area) implied by the preferred model (M1) was equivalent across analyses, irrespective of absolute node ages and despite minor topological differences between input chronograms (particularly with respect to the relative placement of D. engycephalus). Specifically, the M1 model inferred six range shifts for Distichodus out of the Congo Basin (the ancestral area) and three different range expansions from the Congo Basin to include adjacent ichthyofaunal provinces (Figs. 7, S20, and S21). Support/signal for model M1, however, appears to be stronger when based on older times of divergence (Table 2).
Table 2

Results from DEC* analysis of geographic range evolution on the Distichodus phylogeny. Results are presented for each of the three analyses based on different BEAST2 input chronograms (derived from analyses 5, 8, and 14). Comparison of alternative models (biogeographic hypotheses) and their support as assessed via Akaike weights. M0 (unconstrained, dispersal to and from the Congo Basin); M1 (allowing only dispersal out of the Congo Basin); M2 (allowing only dispersal into the Congo Basin); dispersal (d); extinction (e); number of parameters (k); Akaike information criterion (AIC); Akaike Weights (AW)

Input chronogramHypotheses (constraints)lnLParameter estimatesAIC analysis
kdeAICAW
Analysis 5M0−52.0172420.019113670.10630582108.034500.08844
M1−49.8677720.024130220.10000542103.735500.75880
M2−51.4706120.017727280.02239203106.941200.15277
Analysis 8M0−52.1915620.017343940.0977318108.383100.08778
M1−49.8785320.021698150.09103859103.757100.88703
M2−53.4401320.016395950.02140229110.880300.02519
Analysis 14M0−52.1764120.0060070770.033905864108.352800.08558
M1−49.8342220.0074665230.03069103.668400.89043
M2−53.4483320.0056634810.007312191110.896700.02399
Fig. 7

A spatiotemporal reconstruction of Distichodus range evolution. Based on the optimal DEC* model (M1; CB-as-source) and input chronogram resultant from BEAST2 analysis 8 (Fig. 7). Ichthyofaunal provinces color-coded and abbreviated as in Fig. 1. Probabilities of ancestral areas at each node are presented in Table S1

Results from DEC* analysis of geographic range evolution on the Distichodus phylogeny. Results are presented for each of the three analyses based on different BEAST2 input chronograms (derived from analyses 5, 8, and 14). Comparison of alternative models (biogeographic hypotheses) and their support as assessed via Akaike weights. M0 (unconstrained, dispersal to and from the Congo Basin); M1 (allowing only dispersal out of the Congo Basin); M2 (allowing only dispersal into the Congo Basin); dispersal (d); extinction (e); number of parameters (k); Akaike information criterion (AIC); Akaike Weights (AW) A spatiotemporal reconstruction of Distichodus range evolution. Based on the optimal DEC* model (M1; CB-as-source) and input chronogram resultant from BEAST2 analysis 8 (Fig. 7). Ichthyofaunal provinces color-coded and abbreviated as in Fig. 1. Probabilities of ancestral areas at each node are presented in Table S1

Discussion

Distichodus relationships and taxonomic implications

Here we present the first comprehensive phylogenetic, temporal, and biogeographic framework for examination of the current taxonomy and evolutionary history of Distichodus and for future evolutionary studies of the genus. Regardless of analytical method (except for ASTRAL-III, but see below), our results, based on a dataset with considerably more inclusive taxon, character, and geographic sampling for Distichodus than previous works, strongly support the existence of two roughly equal-sized, and reciprocally monophyletic lineages within the genus, while corroborating the monophyletic status of most currently recognized species. RAxML, SVDquartets, and BEAST2 topologies are largely congruent, with some swapping of taxa inside the two-clade Distichodus structure that is also supported by morphology [13]. Only the ASTRAL-III analysis did not conform to this general picture of Distichodus relationships, but there are two confounding issues then at play. First, ASTRAL-III may be sensitive to gene tree estimation error. The codon model approach to gene tree inference used here should, in principle, be the most accurate method for gene tree inference (of those currently available reversible Markov models), given its hierarchical modeling structure [23]. However, it is still not immune to the requirements of large amounts of data [24]. Some close-to-zero-length branches in the individual gene trees may either be (1) a true artifact of ILS, or (2) a consequence of insufficient data (in terms of gene length, or in terms of evolutionary rate distribution). However, distinguishing differences between real ILS and insufficient data is not possible from the current analysis. We therefore focused our attention and present our conclusions based on the results with the largest overall congruence. Conforming to expectation, nodal support tends to be higher at more basal nodes (deeper divergences), whereas more recent divergences are, on average, less strongly supported. Low nodal support and instances of conflict between analytical methods of phylogenetic inference are particularly evident for the clade consisting of the predominantly large-bodied species (D. atroventralis, D. lusosso, D. kasaiensis, D. antonii, and D. sexfasciatus), suggesting that additional data will be necessary to resolve interspecific relationships for this particular section of the tree. Disagreement between methods in this part of the Distichodus tree is an aspect worth revisiting in the future with genome-wide NGS-generated data, as larger amounts of DNA sequence data might be capable of better resolving and supporting these divergences. Besides the obvious reasons for wanting to unambiguously resolve this part of the Distichodus tree, such an endeavor is of special interest because the highly disparate trophic-related morphologies displayed by members of this clade are undoubtedly an interesting character system from both evolutionary ecological and functional morphological perspectives. Despite some of the disagreements between inference methods, our results offer a general working hypothesis of Distichodus relationships and, with few exceptions, are consistent with the current species-level morphology-based taxonomy of the group. Instances of questionable species monophyly and therefore in conflict with the current classification are discussed below.

Paraphyly of D. teugelsi with respect to D. decemmaculatus

Problems with the species recognition of the two dwarf species, D. decemmaculatus and D. teugelsi, have been noted by Verheyen et al. [19], and are confirmed here. Species identification has previously been based on the presence (decemmaculatus) or absence (teugelsi) of a series of dark spots or bars along the flanks, and 20 (decemmaculatus) versus 16 (teugelsi) scales around the caudal peduncle [11]. While our study finds strong support for a teugelsi/decemmaculatus clade, samples tentatively identified as D. teugelsi from the Kwilu River in the Kasai basin (with 16–17 scales around the caudal peduncle and variously marked spots or bars along the flanks), form a well-supported sister clade to the remaining samples. While samples of D. teugelsi from the type locality, the Lefini River, a right bank tributary of the Congo River upstream of Pool Malebo (lacking spots or bars on the flanks and with 16 scales around the caudal peduncle), form a clade sister to the D. decemmaculatus samples, all tentatively identified here as D. decemmaculatus or D. cf. decemmaculatus. Among these we record caudal peduncle scales counts ranging from 18 to 20, and flank pigmentation ranging from virtually absent to clearly marked and strongly spotted. While no taxonomic solution is proposed here, based on the molecular analysis presented and the observation of high variability in both pigmentation and scale counts in geographically disparate samples of both “species”, further study of the teugelsi/decemmaculatus clade, including representatives of populations across the range of each putative taxon, is needed. We note further that, as for the fasciolatus/shenga species pair discussed below, our estimation of the divergence time for the teugelsi/decemmaculatus pair (based on analysis 8; Fig. 6) is among the most recent (~ 1 Ma). The fact that this is a very recent divergence, might explain the resulting paraphyletic pattern. It is well known that genetic variation shared between closely related species can be due to retention of ancestral genetic polymorphisms resulting from incomplete lineage sorting (ILS) [25], a process that can confound phylogenetic inference and hinder robust tests of monophyly in recently diverged species pairs. Whereas mtDNA introgressive hybridization has been also recognized as one cause of misleading inferences of paraphyly, the overall congruence between the nc- and mtDNA signal involving the teugelsi/decemmaculatus pair supports ILS instead of introgression as a probable explanation for the observed pattern of paraphyly [26].

Paraphyly of D. fasciolatus with respect to D. schenga

Representatives of the widespread species D. fasciolatus are rendered paraphyletic by the placement of the two sampled individuals of the southern African species, D. schenga, a middle and lower Zambezi endemic, which are placed well nested within a strongly supported D. fasciolatus clade (Fig. 2). Distichodus schenga (type locality Tete, Zambesi River) was described by Peters in 1852 and D. fasciolatus by Boulenger in 1898 (type localities in the lower Congo River region), and the descriptions of both are minimal, not allowing for morphological species discrimination. Possibly because of this, Boulenger [15] did not include D. schenga in his key to Distichodus, and by implication did not recognize it as distinct from D. fasciolatus. Our molecular data clearly suggest that the synonomy of D. fasciolatus with D. schenga is in order, however ongoing morphometric and morphological study of the entire “fasciolatus-complex” is currently underway (Vreven, pers. comm.), and pending the results of that study we defer proposing a formal taxonomic synonomy based solely on our molecular data and minimal sampling of putative D. schenga from across the Zambezi basin. We do note however, that the phylogenetic and chronological pattern revealed by our study (Figs. 2, 4, and 6), coupled with the allopatric distribution of these two taxa, suggest that populations currently recognized as D. schenga could have diverged from a lineage/population of D. fasciolatus that colonized the Upper Zambezi headwaters from the Kasai during the Pleistocene, when the two river systems shared a past connection [27-29]. This chronological and geographic dispersal scenario out of the Congo Basin is consistent with our estimated divergence time for this species pair (~ 1 Ma) (Fig. 6) and the inferred range shift involving D. schenga (Fig. 7), and has been hypothesized for various other fish taxa across the headwaters of the Congo-Zambezi watersheds [30-32]. In any case, a phylogenetic pattern of short, poorly supported branches is an indicator of recent species divergence that precluded mtDNA lineages from sorting to reciprocal monophyly [33]. Therefore, as for the teugelsi/decemmaculatus pair, we cannot rule out the possibility that the inferred paraphyly of D. fasciolatus with respect to D. schenga is an artifact of ILS issues. However, signal discordance between mtDNA and nuclear markers involving the fasciolatus/schenga pair strongly suggest that post-divergence introgressive hybridization could also explain the inferred paraphyly of D. fasciolatus with respect to D. schenga.

A spatiotemporal framework for Distichodus diversification

In the context of their time-scaled phylogeny of the suborder Citharinoidei, Arroyave et al. [18] were among the first authors to estimate an age for the origin of Distichodus and the timing of diversification within the genus. Their chronogram suggested that the Distichodus crown group appeared in the Miocene (~ 17 Ma), but that most of the species diversity likely originated during the past 5 Ma. These inferences were based on a molecular clock calibrated using ~ 7.5 Ma Distichodus fossilized dentition [34], which at the time was the oldest known fossil assignable to the genus. The recent discovery of a considerably older (18–19 Ma) Distichodus fossil [35], however, prompted our reexamination of the timescale of Distichodus diversification in the context of a larger dataset, both in terms of molecular markers (8 vs. 7 loci) and taxon sampling (20 vs. 16 spp.). This older fossil, however, presented us with the challenge of accurately assigning it to a node for the purpose of calibrating the molecular clock and estimating absolute times of divergence in the phylogeny of Distichodus. Whereas the approach devised herein to address the uncertainties associated with the fossil-based calibration of the molecular clock resulted in multiple alternative chronograms, from our knowledge of the study subject we believe that some of these alternative calibration scenarios might be either overly conservative (e.g., D + P stem) or too liberal (e.g., Dne + Dro crown), therefore possibly resulting in under- or overestimation of node ages, respectively. Nonetheless, because we have no means to empirically falsify any of these alternative calibration scenarios, we consider it important and valuable to offer the reader the possibility of choosing among alternative scenarios (including those we think too extreme) based on their own knowledge of the study subject and their personal beliefs regarding best practices for justifying fossil calibrations [36]. For the most part, our findings imply a temporal framework for the diversification of Distichodus older than previously reported [18], irrespective of calibration strategy. Only analyses 1 and 2, based on calibration node D + P (stem), resulted in younger divergence time estimates (Table 1). While at present we feel more comfortable grounding our discussion of the temporal and geographic context for the diversification of Distichodus in the results from analysis 8 (intermediate in terms of calibration node [D, crown] and P SMB [30 Ma]), we acknowledge that, should this calibration fossil be eventually confirmed as D. nefasch, D. rostratus, or their MRCA (a possibility due to fossil tooth shape, size, and geographic distribution), a reinterpretation of the biogeographic history will be necessary to reconcile the inferred patterns of geographic range evolution with a chronological framework more than twice as old as the one discussed below (Fig. S21). Despite the high ichthyofaunal diversity of Afrotropical continental waters, few studies have investigated the chronological and biogeographic context for the diversification of African freshwater fish clades, among which only a handful have focused on Pan-African riverine genus-level radiations, namely Hydrocynus [30], Mastacembelus [37, 38] and the species-rich Synodontis [32, 39]. Notably, a Miocene diversification for Distichodus, as implied by the chronogram resulting from analysis 8 (Fig. 6), broadly concurs with previous findings for both Mastacembelus and Synodontis [32, 38, 39]. Similarly, a concurrence of Miocene diversification among various lineages of fishes, frogs, and crabs has been pointed out by Daniels et al. [40], who suggest this likely reflects a shared response to mesic climatic shifts resulting in marked allopatric differentiation among each of these freshwater lineages during the mid- to late Miocene. While to our knowledge there are no empirical studies proving a causal relationship between particular paleohydrological events and diversification patterns in African freshwater fishes, some authors have suggested that Miocene tectonic and climatic upheaval may have influenced or even triggered diversification [31, 38]. The Miocene geological epoch was the setting for widespread epeirogenic uplift in Africa and global climate change that profoundly contributed to shaping the modern African hydrological landscape [41-43], which in turn, it is believed, promoted diversification in freshwater fishes as a consequence of river discharge shifts (due to climate change) and drainage disruption and modification (due to rifting) [44]. Our findings about the timing of Distichodus diversification add to instances of Miocene continent-wide freshwater radiations, and therefore to a growing body of evidence in support for a “hydrogeological” hypothesis, that paleohydrological and paleoclimatic changes promote landscape evolution which in turn promotes cladogenesis in freshwater organisms [45, 46]. Further research, however, ideally in a multi-taxon comparative framework, is certainly needed to robustly test hypotheses of concerted responses to paleogeologic and paleoclimatic scenarios. Analysis of geographic range evolution on the phylogeny of Distichodus favored a biogeographic model in which the Congo Basin (CB) is the center of origin (ancestral area) and source of the geographic diversity of the genus, irrespective of absolute times of divergence. In particular, the biogeographic reconstruction based on the chronogram resulting from analysis 8 (Fig. 7), implies that most cladogenetic events occurred in lineages still confined to the CB throughout most of the Miocene, but also multiple lineage range shifts out of and expansions from the CB into adjacent ichthyofaunal provinces at different times during the evolutionary history of the group. Only in the late Miocene (~ 9–7 Ma) are the first recorded instances of range shifts out of the CB and of cladogenesis occurring in other ichthyofaunal provinces, namely the Nilo-Sudan (NS) and Lower Guinea (LG). The remaining instances of range shifts and expansions are more recent, dating back to the Pliocene. While most ichthyofaunal provinces appear to have been colonized only once (or twice in the case of UG), our results indicate that LG was independently colonized by five different lineages, mostly during the Pliocene. Our reconstruction of the biogeographic history of Distichodus suggests a central role of the CB in the distribution of the continent’s freshwater ichthyofauna during the late Cenozoic, offering support to the hypothesis that the CB is the source of the ichthyofauna of less diverse river basins throughout continental Africa [44]. While a CB origin has also been postulated for the African tigerfish Hydrocynus [30], other continent-wide African freshwater fish genera such as Synodontis [32] and Mastacembelus [38] do not conform to this pattern and suggest repeated independent colonization into the CB. Considering the vast geographic area under study, and that complex evolutionary histories of dispersal and vicariance are likely to exist among the different fish lineages, these conflicting biogeographic histories certainly suggest a complex scenario for the assemblage of the continent’s ichthyofaunal communities.

Conclusions

The spatiotemporal framework for the diversification of African freshwater fish genus Distichodus presented herein provides a significant advance in our knowledge of the evolutionary history of this ecologically and socio-economically important group of fishes. With few exceptions, the resulting phylogeny is consistent with the current species-level taxonomy of the group, offering a working hypothesis of Distichodus relationships that will serve as phylogenetic framework for future evolutionary studies involving phenotypic and genomic systems. The few instances of species paraphyly (D. teugelsi, D. fasciolatus) revealed in our favored phylogeny are likely a consequence of introgression and/or incomplete lineage sorting due to recent speciation. Therefore, we refrain from making taxonomic/nomenclatural changes pending further morphological assessment based on a larger sample of comparative material. While analysis of geographic range evolution favored a biogeographic scenario in which the Congo Basin is the source of geographic diversity of the genus, this finding is both in agreement and conflict with previous studies of other continent-wide African freshwater fish genera, suggesting a complex scenario for the assemblage of Africa’s continental ichthyofaunal communities.

Methods

Taxon sampling

Ingroup sampling consisted of 133 specimens representing 20 of the 25 valid Distichodus species, thereby encompassing 80% of Distichodus currently recognized diversity (Table 3). Distichodus brevipinnis, D. langi, D. mossambicus, D. rufigiensis, and the newly described D. ingae [12], were not included in analyses due to unavailability of tissues. With the exception of D. altus, D. nefasch, D. rostratus, and D. petersii, for which only a single tissue sample was available, multiple individuals per species were sequenced to sample as large a portion of each species’ range as possible (Table 3). In addition to increasing geographic coverage, inclusion of multiple individuals per species allowed for testing the monophyletic status—and therefore species limits under the phylogenetic species concept [47, 48]––of nominal species from which more than one individual was available for sequencing. Sampling of multiple individuals per species, however, was not aimed at making inferences about tokogenetic (intraspecific) relationships and/or phylogeographic patterns. Paradistichodus dimidiatus was included as outgroup based on the findings from a relatively recent molecular phylogenetic study that investigated relationships of the Distichodontidae [18], which resolved the monotypic genus Paradistichodus as the sister group of Distichodus. Similarly, Nannocharax ansorgii was included as additional and outermost outgroup for molecular dating and inference of geographic range evolution analyses.
Table 3

Taxa, voucher specimens (catalog and tissue numbers), and GenBank accession numbers for the gene sequences included in the analyses. Institutional abbreviations: AMCC (Ambrose Monell CryoCollection, AMNH), AMNH (American Museum of Natural History), CU (Cornell University Museum of Vertebrates), SAIAB (South African Institute for Aquatic Biodiversity), MRAC (Royal Museum for Central Africa)

TaxonCatalog #Voucher/Tissue #CountryIchthyofaunal province, drainageGenBank Accession Number
co1crcytbenc1glytmyh6nd2sh3px3
Nannocharax ansorgiiAMNH 257013t-111-11,018GuineaUGKF541815n/aKF541951KF542063KF542157KF542230KF542408KF542486
Paradistichodus dimidiatusAMNH 257747t-110-10,981GuineaUGKF541830n/aKF541914KF542040KF542146KF542266KF542390KF542497
Distichodus affinisAMNH 263347AMCC 227456Democratic Republic of CongoCB, Congo R., Boma.MT300571MT301534MT300757MT300808MT300942MT301157MT301230MT301317
Distichodus affinisAMNH 263306AMCC 227412Democratic Republic of CongoCB, Congo R., Boma.MT300572MT301535MT300758MT300801MT300940MT301158MT301231MT301316
Distichodus affinisAMNH 252431AMCC 197380Democratic Republic of CongoCB, Kwilu R.MT300569MT301548MT300759MT300798MT300932MT301152MT301228MT301349
Distichodus affinisAMNH 246626t-41-4050Democratic Republic of CongoCB, Congo R., Luozi.MT300577MT301542MT300767MT300804MT300936MT301160MT301234MT301356
Distichodus affinisAMNH 250971t-69-6870Democratic Republic of CongoCB, N’Sele R.MT300578MT301543MT300768MT300805MT300937MT301161MT301237MT301360
Distichodus affinisAMNH 250736t-68-6796Democratic Republic of CongoCB, N’Sele R.MT300581MT301536MT300769MT300806MT300948MT301162MT301242MT301357
Distichodus affinisAMNH 250973t-69-6875Democratic Republic of CongoCB, N’Sele R.MT300582MT301544MT300770MT300795MT300931MT301163MT301238MT301358
Distichodus affinisAMNH 250966t-69-6866Democratic Republic of CongoCB, N’Sele R.MT300583MT301545MT300771MT300807MT300944MT301164MT301239MT301359
Distichodus affinisCU 92981t-78-7729GabonLG, Lekoli R.MT300584MT301547MT300772MT300796MT300945MT301165MT301240MT301361
Distichodus affinisAMNH 252468AMCC 197361Democratic Republic of CongoCB, Kwilu R.MT300570MT301549MT300760MT300799MT300935MT301153MT301229MT301350
Distichodus affinisAMNH 252633t-81-8098Democratic Republic of CongoCB, Lulua R.MT300574MT301538MT300762MT300800MT300933MT301159MT301235MT301351
Distichodus affinisCU 95831t-77-7681GabonLG, Lekoli R.MT300579MT301540MT300763MT300797MT300938MT301154MT301232MT301352
Distichodus affinisAMNH 242260t-33-3214Democratic Republic of CongoCB, Congo R., Luozi.MT300575MT301539MT300764MT300794MT300934MT301155MT301236MT301353
Distichodus affinisAMNH 247062t-52-5125Democratic Republic of CongoCB, Congo R., Luozi.MT300580MT301546MT300765MT300802MT300947MT301075MT301241MT301354
Distichodus affinisAMNH 246626t-41-4049Democratic Republic of CongoCB, Congo R., Luozi.MT300576MT301541MT300766MT300803MT300943MT301156MT301233MT301355
Distichodus affinisCU 93141t-77-7692GabonLG, Lekoli R.MT300573MT301537MT300761MT300809MT300946MT301166MT301243MT301345
Distichodus altusAMNH 269875AMCC 253489Democratic Republic of CongoCB, Lake MaiNdombeMT300568MT301533MT300756MT300824MT300941MT301171MT301227n/a
Distichodus antoniiAMNH 255413t-104-10,352Democratic Republic of CongoCB, Congo R., Ngombe.MT300643MT301465MT300693MT300867MT300998MT301058MT301275MT301395
Distichodus antoniiCU 95832t-77-7682Democratic Republic of CongoCB, Congo R., Wanie-Rukula.MT300641MT301466MT300689MT300870MT301028MT301055MT301277MT301396
Distichodus antoniiAMNH 240031t-22-2155Republic of CongoCB, Congo R., Bela.MT300639MT301469MT300687MT300884MT301029MT301052MT301271MT301397
Distichodus antoniiAMNH 246450t-39-3830Democratic Republic of CongoCB, N’Djili R.MT300647MT301467MT300690MT300878MT301006MT301056MT301278MT301398
Distichodus antoniiAMNH 240030t-21-2055Republic of CongoCB, Congo R., Foulakari.MT300642MT301470MT300691MT300879MT300997MT301053MT301272MT301399
Distichodus antoniiAMNH 250972t-69-6872Democratic Republic of CongoCB, N’Sele R.MT300640MT301471MT300688MT300885MT301001MT301057MT301273MT301400
Distichodus antoniiAMNH 255209t-98-9773Democratic Republic of CongoCB, Congo R., Nkana.MT300644MT301468MT300694MT300868MT300999MT301059MT301279MT301401
Distichodus antoniiAMNH 240029t-20-1955Democratic Republic of CongoCB, Pool Malebo.MT300645MT301463MT300692MT300880MT301000MT301054MT301276MT301402
Distichodus antoniiCU 91526t-78-7730Central African RepublicCB, Oubangui R.MT300646MT301464MT300695MT300869MT301005MT301060MT301274MT301403
Distichodus atroventralisAMNH 263670AMCC 230733Democratic Republic of CongoCB, Congo R., Boma.MT300625MT301485MT300717MT300894n/aMT301095n/aMT301414
Distichodus atroventralisAMNH 246956t-53-5231Democratic Republic of CongoCB, Congo R., Luozi.MT300615MT301477MT300708MT300886MT301018MT301086MT301283MT301404
Distichodus atroventralisAMNH 249787t-65-6425Democratic Republic of CongoCB, Lomako R.MT300617MT301475MT300709MT300887MT301023MT301087MT301281MT301405
Distichodus atroventralisAMNH 250964t-69-6863Democratic Republic of CongoCB, N’Sele R.MT300621MT301483MT300710MT300888MT301015MT301088MT301286MT301406
Distichodus atroventralisAMNH 250133t-70-6907Democratic Republic of CongoCB, Congo R., Luozi.MT300618MT301478MT300711MT300881MT301024MT301089MT301287MT301407
Distichodus atroventralisAMNH 255281t-69-6858Democratic Republic of CongoCB, N’Sele R.MT300622MT301481MT300712MT300889MT301019MT301090MT301280MT301408
Distichodus atroventralisAMNH 257156t-69-6861Democratic Republic of CongoCB, N’Sele R.MT300619MT301479MT300713MT300890MT301020MT301091MT301282MT301409
Distichodus atroventralisAMNH 241044t-31-3077Democratic Republic of CongoCB, Congo R., Bulu.MT300616MT301476MT300718MT300891MT301021MT301092MT301284MT301410
Distichodus atroventralisAMNH 241139t-33-3230Democratic Republic of CongoCB, Congo R., Luozi.MT300623MT301482MT300714MT300882MT301016MT301093MT301285MT301411
Distichodus atroventralisAMNH 254914t-103-10,235Democratic Republic of CongoCB, Mai-Ndombe R.MT300624MT301484MT300715MT300892MT301017MT301062MT301288MT301412
Distichodus atroventralisAMNH 257894t-115-11,490Democratic Republic of CongoCB, Congo R., Ngombe.MT300620MT301480MT300716MT300893MT301025MT301094MT301289MT301413
Distichodus decemmaculatusAMNH 247931t-25-2492Democratic Republic of CongoCB, Luilaka R., Monkoto.MT300549MT301512MT300748MT300844MT300956MT301126MT301216MT301333
Distichodus decemmaculatusAMNH 255150t-97-9699Democratic Republic of CongoCB, Mai-Ndombe R.MT300561MT301519MT300742MT300840MT300953MT301132MT301220MT301338
Distichodus decemmaculatusAMNH 252263AMCC 209903Democratic Republic of CongoCB, Luilaka R., Bosombangwa.MT300556MT301514MT300740MT300835MT300952MT301127MT301219MT301348
Distichodus decemmaculatusCU 92911t-78-7732GabonLG, Lekoli R.MT300544MT301520MT300738MT300838MT300965MT301128MT301209MT301334
Distichodus decemmaculatusAMNH 241858t-25-2461Democratic Republic of CongoCB, Lofombo R.MT300550MT301513MT300749MT300845MT300957MT301129MT301217MT301335
Distichodus decemmaculatusAMNH 246318t-39-3811Democratic Republic of CongoCB, Lengoue R., Louesso.MT300545MT301521MT300739MT300839MT300939MT301130MT301210MT301336
Distichodus cf. decemmaculatusAMNH 255150t-97-9698Democratic Republic of CongoCB, Mai-Ndombe R.MT300559MT301517MT300741MT300836MT300954MT301131MT301214MT301337
Distichodus decemmaculatusAMNH 255183t-98-9710Democratic Republic of CongoCB, Mai-Ndombe R.MT300551MT301510MT300750MT300837MT300966MT301133MT301223MT301339
Distichodus decemmaculatusAMNH 257178t-38-3774Democratic Republic of CongoCB, Yenge R., Boyenga.MT300552MT301507MT300744MT300846MT300967MT301123MT301218MT301340
Distichodus decemmaculatusAMNH 246318t-39-3810Republic of CongoCB, Lengoue R., Louesso.MT300553MT301508MT300753MT300847MT300955MT301134MT301226MT301341
Distichodus cf. decemmaculatusAMNH 255150t-97-9696Democratic Republic of CongoCB, Mai-Ndombe R.MT300557MT301515MT300754MT300849MT300960MT301124MT301221MT301330
Distichodus cf. decemmaculatusAMNH 255006t-96-9533Democratic Republic of CongoCB, Mai-Ndombe R.MT300554MT301509MT300751MT300841MT300961MT301135MT301224MT301331
Distichodus cf. decemmaculatusAMNH 255006t-96-9534Democratic Republic of CongoCB, Mai-Ndombe R.MT300558MT301516MT300755MT300842MT300962MT301136MT301222MT301332
Distichodus cf. decemmaculatusAMNH 255150t-97-9697Democratic Republic of CongoCB, Mai-Ndombe R.MT300560MT301518MT300743MT300834MT300949MT301125MT301215MT301343
Distichodus cf. decemmaculatusAMNH 255006t-96-9535Democratic Republic of CongoCB, Mai-Ndombe R.MT300555MT301511MT300752MT300843MT300964MT301137MT301225MT301344
Distichodus engycephalusAMNH 257169t-74-7334GuineaNS, Niger R, Diaragbela.MT300591MT301443MT300659MT300784MT300984MT301112MT301246MT301372
Distichodus engycephalusAMNH 257169t-74-7335GuineaNS, Niger R, Diaragbela.MT300592MT301445MT300660MT300785MT300985MT301051MT301248MT301373
Distichodus engycephalusAMNH 257168t-74-7331GuineaUG, Dion R.MT300593MT301444MT300661MT300786MT300986MT301113MT301249MT301374
Distichodus engycephalusAMNH 257704t-111-11,048GuineaUGMT300594MT301446MT300662MT300787MT300987MT301114MT301247MT301375
Distichodus engycephalusCU 94663t-78-7736EthiopiaNS, Alwero R.MT300590MT301442n/an/an/aMT301061n/aMT301415
Distichodus fasciolatusAMNH 240040t-20-1916Democratic Republic of CongoCB, Pool Malebo, Kintele, RCMT300604MT301451MT300668MT300876MT301034MT301097MT301296MT301381
Distichodus fasciolatusAMNH 246445t-39-3834Democratic Republic of CongoCB, N’Djili R.MT300603MT301458MT300666MT300875MT301014MT301104MT301291MT301291
Distichodus fasciolatusAMNH 250308t-71-7093Democratic Republic of CongoCB, Congo R., Luozi.MT300607n/aMT300675MT300866MT301027MT301105MT301298MT301388
Distichodus fasciolatusAMNH 247837t-55-5478Democratic Republic of CongoCB, Lulua R.MT300600MT301461MT300665MT300898MT301010MT301106MT301304MT301389
Distichodus fasciolatusAMNH 251201t-75-7499Democratic Republic of CongoCB, Lulua R., DRCMT300601MT301462MT300707MT300871MT301011MT301107MT301292MT301390
Distichodus fasciolatusAMNH 253393t-81-8018Democratic Republic of CongoCB, Kasai R., DRCMT300602MT301460MT300663MT300877MT301012MT301108MT301293MT301391
Distichodus fasciolatusAMNH 253304t-80-7905Democratic Republic of CongoCB, Lulua R.MT300605MT301450MT300671MT300895MT301022MT301098MT301306MT301382
Distichodus fasciolatusAMNH 253082t-81-8030Democratic Republic of CongoCB, Kasai R.MT300599MT301459MT300664MT300874MT301008MT301099MT301294MT301383
Distichodus fasciolatusAMNH 252430AMCC 197329Democratic Republic of CongoCB, Kwilu R., Kikwit.MT300609MT301453MT300673MT300862MT301031MT301100MT301301MT301384
Distichodus fasciolatusAMNH 252538AMCC 197367Democratic Republic of CongoCB, Kwilu R.MT300610MT301454MT300674MT300863MT301032MT301101MT301302MT301385
Distichodus fasciolatusCU 95835t-77-7683Democratic Republic of CongoCB, Tshopo/Lindi R.MT300606MT301448MT300669MT300896MT301009MT301073MT301300MT301386
Distichodus fasciolatusCU 95835t-77-7684Democratic Republic of CongoCB, Tshopo/Lindi R.MT300611MT301452MT300672MT300864MT301033MT301074MT301305MT301378
Distichodus fasciolatusCU 92983t-78-7740Republic of CongoCB, Congo R., Bela.MT300612MT301455MT300676MT300865MT301026MT301103MT301303MT301347
Distichodus fasciolatusAMNH 240041t-22-2157GabonLG, Lekoli R.MT300608MT301456MT300667MT300861MT301030MT301096MT301299MT301380
Distichodus fasciolatusCU 92982t-78-7739GabonLG, Lekoli R.MT300598MT301449MT300670MT300897MT301035MT301102MT301297MT301346
Distichodus hypostomatusCU 95143t-78-7741GabonLG, Ngounie R.MT300531n/an/aMT300788MT300976MT301045MT301183MT301416
Distichodus hypostomatusCU 95143t-77-7685GabonLG, Ngounie R.MT300532n/an/aMT300789MT300977MT301046MT301185MT301417
Distichodus hypostomatusAMNH 249522t-63-6209CameroonLG, Bitande R.MT300529MT301557MT300725MT300790MT300978MT301047MT301184MT301309
Distichodus hypostomatusAMNH 249522t-63-6210CameroonLG, Bitande R.MT300530MT301558MT300726MT300791MT300979MT301048MT301186MT301418
Distichodus hypostomatusAMNH 253909t-88-8738Republic of CongoLG, Niari R.MT300527MT301555MT300723MT300792MT300981MT301049MT301181MT301310
Distichodus hypostomatusAMNH 253936t-88-8787Republic of CongoLG, Kouilou R.MT300528MT301556MT300724MT300793MT300980MT301050MT301182MT301311
Distichodus kasaiensisAMNH 251295t-76-7515Democratic Republic of CongoCB, Lulua R.MT300636MT301474MT300696MT300906MT301002MT301109MT301251MT301392
Distichodus kasaiensisAMNH 243646t-38-3731Democratic Republic of CongoCB, Lulua R.MT300637MT301472MT300697MT300907MT301003MT301110MT301252MT301393
Distichodus kasaiensisAMNH 252780t-80-7914Democratic Republic of CongoCB, Lulua R.MT300638MT301473MT300698MT300908MT301004MT301111MT301250MT301419
Distichodus kolleriAMNH 249814t-62-6191CameroonLG, EbebdaMT300562n/aMT300774MT300814MT300915MT301146MT301187MT301321
Distichodus kolleriAMNH 249824t-63-6292CameroonLGMT300563n/aMT300777MT300825MT300916MT301147MT301188MT301318
Distichodus kolleriAMNH 249824t-63-6293CameroonLGMT300564n/aMT300778MT300826MT300917MT301148MT301189MT301312
Distichodus kolleriCU 93515t-77-7686CameroonLG, Djerem R.MT300565n/aMT300775MT300827MT300918MT301149MT301190MT301319
Distichodus kolleriAMNH 236538t-58-5703CameroonLG, Sanaga R.MT300566n/aMT300776MT300828MT300919MT301150MT301191MT301320
Distichodus kolleriAMNH 236521t-58-5718CameroonLG, Sanaga R.MT300567n/aMT300773MT300829MT300920MT301151MT301192MT301322
Distichodus lusossoAMNH 256953t-112-11,160Democratic Republic of CongoCB, Pool Malebo.MT300650MT301502MT300702MT300905MT301036MT301083MT301256MT301437
Distichodus lusossoAMNH 252809t-80-7939Democratic Republic of CongoCB, Lulua R.MT300651MT301499MT300703MT300899MT301038MT301076MT301257MT301431
Distichodus lusossoCU 95830t-77-7688Democratic Republic of CongoCB, Congo R., Wanie-Rukula.MT300652MT301497MT300704MT300900MT301039MT301078MT301258MT301433
Distichodus lusossoAMNH 247230t-50-4901Democratic Republic of CongoCB, Lufula R.MT300654MT301500MT300699MT300902MT301041MT301080MT301254MT301435
Distichodus lusossoAMNH 250310t-67-6648Democratic Republic of CongoCB, Congo R., Luozi.MT300655MT301503MT300701MT300903MT301042MT301081MT301255MT301436
Distichodus lusossoAMNH 250310t-71-7096Democratic Republic of CongoCB, Congo R., Luozi.MT300649MT301501MT300706MT300904MT301043MT301082MT301260MT301430
Distichodus lusossoAMNH 240047t-26-2599Republic of CongoCB, Congo River, Mbelo.MT300653MT301498MT300705MT300901MT301040MT301079MT301259MT301434
Distichodus lusossoCU 91878t-77-7687Central African RepublicCB, Baidou R.MT300648MT301496MT300700MT300883MT301037MT301077MT301253MT301432
Distichodus maculatusAMNH 252806t-80-7944Democratic Republic of CongoCB, Lulua R.MT300524MT301439MT300720MT300911MT300973MT301173MT301178MT301368
Distichodus maculatusCU 91523t-77-7690Central African RepublicOubangui R., Mobaye.MT300523MT301438MT300719MT300914MT300972MT301174MT301177MT301371
Distichodus maculatusCU 95265t-77-7691TanzaniaCB, Malagarasi R.MT300526MT301440MT300721MT300912MT300975MT301175MT301179MT301369
Distichodus maculatusCU 91120t-77-7689ZambiaCB, Luapula R.MT300525MT301441MT300722MT300913MT300974MT301176MT301180MT301370
Distichodus nefaschAMNH 264420AMCC 236881EthiopiaNS, Omo R.MT300595MT301560MT300657MT300910n/aMT301115n/an/a
Distichodus noboliAMNH 257170t-34-3384Democratic Republic of CongoCB, Lac Ilungu.MT300537MT301526MT300733MT300816MT300926MT301138MT301204MT301376
Distichodus noboliAMNH 247930t-25-2491Democratic Republic of CongoCB, Luilaka R.MT300533MT301528MT300732MT300817MT300929MT301139MT301198MT301367
Distichodus noboliAMNH 242501t-34-3316Democratic Republic of CongoCB, Lac Ikenge.MT300539MT301529MT300728MT300818MT300924MT301140MT301199MT301364
Distichodus noboliAMNH 241865t-25-2456Democratic Republic of CongoCB, Luilaka R.MT300536MT301530MT300729MT300819MT300925MT301141MT301200MT301366
Distichodus noboliAMNH 242502t-34-3327Democratic Republic of CongoCB, Lac Ikenge.MT300538MT301527MT300734MT300820MT300927MT301142MT301205MT301377
Distichodus noboliAMNH 249786t-65-6424Democratic Republic of CongoCB, Lomako R.MT300534MT301531MT300730MT300821MT300923MT301143MT301201MT301363
Distichodus noboliAMNH 249775t-65-6412Democratic Republic of CongoCB, Maringa R.MT300535MT301532MT300731MT300822MT300922MT301144MT301202MT301362
Distichodus noboliAMNH 255007t-96-9536Democratic Republic of CongoCB, Mai-Ndombe R.MT300540MT301525MT300727MT300823MT300928MT301145MT301203MT301365
Distichodus notospilusCU 95853t-78-7743GabonLG, Lekoli R, GabonMT300586MT301554MT300780MT300813MT300968MT301172MT301197MT301324
Distichodus notospilusAMNH 249523t-63-6213CameroonLG, Bitande R., CameroonMT300587MT301551MT300781MT300810MT300969MT301167MT301193MT301313
Distichodus notospilusAMNH 249537t-63-6237CameroonLG, Coastal stream.MT300585MT301553MT300779MT300815MT300921MT301168MT301196MT301323
Distichodus notospilusAMNH 249523t-63-6211CameroonLG, Bitande R., CameroonMT300588MT301550MT300782MT300811MT300970MT301169MT301194MT301314
Distichodus notospilusAMNH 249523t-63-6212CameroonLG, Bitande R., CameroonMT300589MT301552MT300783MT300812MT300971MT301170MT301195MT301315
Distichodus petersiiCU 93783t-77-7693TanzaniaEC, Kilimbero R.MT300597MT301559MT300656MT300851MT300982MT301044MT301244MT301307
Distichodus rostratusAMNH photo vouchern/an/aNS, aquarium tradeMT300596MT301561MT300658n/aMT300983MT301116MT301245MT301308
Distichodus schengaSAIAB 97189RC10C047MozambiqueZ, Zambezi R.MT300613MT301457n/aMT300909MT301013MT301084MT301290MT301379
Distichodus schengaSAIAB 97065RC10C077MozambiqueZ, Zambezi R.MT300614MT301447n/aMT300872MT301007MT301085MT301295MT301394
Distichodus sexfasciatusAMNH 240874t-27-2982Democratic Republic of CongoCB, Congo R., Bulu.MT300631MT301486MT300679MT300859MT300991MT301063MT301268MT301424
Distichodus sexfasciatusAMNH 247254t-47-4695Democratic Republic of CongoCB, Congo R., Bulu.MT300632MT301493MT300681MT300860MT300992MT301070MT301269MT301425
Distichodus sexfasciatusAMNH 250133t-70-6909Democratic Republic of CongoCB, Congo R., Luozi.MT300627MT301495MT300686MT300855MT300989MT301066MT301270MT301426
Distichodus sexfasciatusAMNH 251085t-75-7437Democratic Republic of CongoCB, Lulua R.MT300633MT301488MT300682MT300858MT300995MT301071MT301265MT301427
Distichodus sexfasciatusAMNH 255283t-100-9953Democratic Republic of CongoCB, N’Sele R.MT300634MT301487MT300677MT300856MT300990MT301064MT301262MT301428
Distichodus sexfasciatusAMNH 253488t-83-8276Democratic Republic of CongoCB, Congo R., Kinsuka.MT300635MT301489MT300683MT300857MT300996MT301072MT301263MT301429
Distichodus sexfasciatusAMNH 251120t-75-7414Democratic Republic of CongoCB, Lulua R.MT300628MT301490MT300684MT300873n/aMT301065MT301264MT301420
Distichodus sexfasciatusAMNH 251317t-76-7552Democratic Republic of CongoCB, Lulua R.MT300629MT301491MT300685MT300852MT300993MT301068MT301266MT301421
Distichodus sexfasciatusAMNH 240051t-27-2625Republic of CongoCB, Congo R., Mbelo.MT300626MT301492MT300680MT300854MT300988MT301067MT301261MT301423
Distichodus sexfasciatusCU 91519t-77-7694Central African RepublicCB, Oubangui R.MT300630MT301494MT300678MT300853MT300994MT301069MT301267MT301422
Distichodus teugelsiMRAC A7–31-P-348A8–20#965Republic of CongoCB, Lefini R., RCMT300546MT301523MT300745MT300830MT300950MT301117MT301211MT301325
Distichodus teugelsiMRAC A7–31-P-349A7–31#507Republic of CongoCB, Lefini R., RCMT300547MT301524MT300746MT300831MT300951MT301118MT301212MT301326
Distichodus teugelsiMRAC A8–20-P-210MRAC 3Republic of CongoCB, Lefini R., RCMT300548MT301522MT300747MT300832MT300930MT301119MT301213MT301327
Distichodus teugelsiAMNH 253625t-86-8583Democratic Republic of CongoCB, Kwilu R.MT300543MT301504MT300736MT300848MT300958MT301120MT301206MT301328
Distichodus teugelsiAMNH 253758t-87-8607Democratic Republic of CongoCB, Kwilu R.MT300541MT301505MT300737MT300833MT300959MT301121MT301207MT301329
Distichodus teugelsiAMNH 256221t-107-10,607Democratic Republic of CongoCB, Kwilu R.MT300542MT301506MT300735MT300850MT300963MT301122MT301208MT301342
Taxa, voucher specimens (catalog and tissue numbers), and GenBank accession numbers for the gene sequences included in the analyses. Institutional abbreviations: AMCC (Ambrose Monell CryoCollection, AMNH), AMNH (American Museum of Natural History), CU (Cornell University Museum of Vertebrates), SAIAB (South African Institute for Aquatic Biodiversity), MRAC (Royal Museum for Central Africa) Most tissue samples were obtained from specimens collected during recent expeditions in West and West-Central Africa by a research team from the American Museum of Natural History (AMNH) (led by co-author MLJS). Specimens were handled and euthanized prior to preservation in accordance with recommended guidelines for the use of fishes in research [49] and stress was ameliorated by minimizing handling and through the use of the anesthetic Tricaine mesylate (MS-222) for euthanasia. Tissue samples were taken in the field and immediately preserved in 95% ethanol. Voucher specimens were fixed in formalin and subsequently transferred to 70% ethanol for long-term storage. Data for specimens cataloged and stored in the ichthyology collection of the AMNH, are available online at http://sci-web-001.amnh.org/db/emuwebamnh/index.php. Specimen collection was made in accordance with ethical and legal guidelines for international animal research approved by the AMNH Institutional Animal Care and Use Committee (IACUC) (approval #36/06). The AMNH IACUC has guidelines relating to studies involving its members in different countries, and this study conforms to those guidelines. Specimen collection and exportation of samples used in this study follow institutional and national ethical and legal guidelines of the Ministry of Fishery and Aquaculture, Republic of Guinea, No. 65/MPA/DGAGSP/11; the Ministry of Scientific Research and Technical Innovation, Republic of Congo, No. 031/MRSIT/DGRST/GERBID.06.13; and the Ministry of Agriculture and Fisheries, Democratic Republic of Congo, No. 037/DP/SG/AGRIPEL/16. Additional samples were obtained from colleagues at the Cornell University Museum of Vertebrates (CUMV), the Royal Museum for Central Africa (MRAC), and the South African Institute for Aquatic Biodiversity (SAIAB). Voucher specimens are deposited in the ichthyology collections of the AMNH, CUMV, MRAC, and SAIAB. Species identity of non-AMNH vouchers was confirmed either by direct examination of loaned specimens, photographs provided, or on taxonomic authority of the loaning institution. Voucher catalog numbers and GenBank accession numbers for the gene sequences generated and included in this study are listed in Table 3.

Gene sampling and nucleotide data collection

Eight gene fragments, including the seven protein-coding loci sampled by Arroyave et al. [18] to address distichodontid interrelationships (co1, cytb, enc1, glyt, myh6, nd2, and sh3px3) were sequenced. Additionally, a faster-evolving mitochondrial non-coding marker, control region (cr), was added to address more recent divergences within the genus. DNA sequence data was generated from a total of 133 Distichodus individuals. General procedures for DNA extraction, amplification, and purification, along with primers and thermal profiles for sequencing the protein-coding genes used in this study follow Arroyave and Stiassny [50] and Arroyave et al. [18]. Distichodus-specific primers for cr (cr_Dist_f: 5′-AGCGCCGGTCTTGTAATCCG-3′; cr_Dist_r: 5′-TGCTTGTGGAACTTTCTAGGGTCCAT-3′) were designed using the software Primer3 [51] from conserved flanking regions of aligned mtDNA control region sequences extracted from the two distichodontid complete mitochondrial genomes available in GenBank (Distichodus sexfasciatus AB070242 and Ichthyborus sp. AP011993). Amplification of cr via PCR was carried out using the following thermal profile: 5-min initial denaturation at 95 °C, followed by 35 cycles of denaturation at 95 °C for 60 s, annealing at 58 °C for 60 s, and extension at 72 °C for 120 s, followed by a 10-min final extension at 72 °C.

Sequence editing and partitioning scheme/substitution model selection

Contig assembly and sequence editing was performed using Geneious v.11.0.2 [52]. IUPAC nucleotide ambiguity codes were used to represent heterozygous sites. The resulting sequences were trimmed to exclude primer regions and examined for appropriateness/homology using BLASTx [53]. Each gene was aligned using MUSCLE [54] under default parameters as implemented in Geneious, followed by concatenation of individual alignments. All sequences were checked for stop codons and for miscalled amino acids by examining translation alignments. Best-fit partitioning schemes and models of molecular evolution for the nucleotide data were determined using PartitionFinder2 [55] based on 22 pre-defined data blocks: the non-coding mtDNA control region (1 block) plus the 1st, 2nd, and 3rd codon positions of the seven protein-coding genes (3 positions × 7 genes). The PartitionFinder2 greedy algorithm was employed to search for an optimal scheme under the assumption of independent model parameters and branch lengths for each partition. Selection of the partitioning scheme and models over the set of schemes and models produced during greedy search was accomplished using the Schwarz/Bayesian Information Criterion (BIC) [56].

Phylogenetic, biogeographic, and chronological analyses

Various analytical approaches were employed to infer phylogenetic relationships in Distichodus from the multilocus dataset generated in this study, one of which also simultaneously estimates absolute times of divergence in the resultant phylogeny. The results from the latter approach were subsequently used in analyses for testing historical biogeographic hypotheses of geographic range evolution in Distichodus.

Maximum likelihood (ML) estimation of phylogeny

Phylogenetic analysis of the concatenated alignment of the eight sampled genes under a Total Evidence/Simultaneous Analysis [57, 58] approach was performed using the ML optimality criterion. Furthermore, to examine the degree of variation in topology, resolution, and clade support among the individual sampled loci, and to complement the inferences made from the simultaneous analysis of all markers, each of the nuclear genes (enc1, glyt, myh6, sh3px3) and a concatenated alignment of the mitochondrial genes (co1, cr, cytb, nd2; effectively inherited as a single locus), were independently analyzed, also using the ML optimality criterion. ML phylogenetic analyses were conducted with RAxML v.8 [59] through the CIPRES Science Gateway v.3.3 [60] as a single partition under the GTRGAMMA model with four rate classes using full ML optimization for the tree search and 1000 rapid bootstrap (BS) searches to assess nodal support [61].

Species-tree approaches

Although concatenation methods have been suggested to often perform well when incomplete lineage sorting (ILS) levels are low [24], the degree of ILS in Distochodus is unknown. To explore the outcomes of ILS-aware species-tree analyses relative to concatenation, both SVDquartets [62] and ASTRAL-III [63] were employed. SVDquartets has been suggested to perform well with low ILS and small numbers of sites per gene, and ASTRAL methods have been suggested to perform well under high ILS conditions, but may be sensitive to small numbers of sites per gene [24]. SVDquartets analysis was conducted in PAUP* v4.0a164 [64] sampling all ~ 8.6 million quartets under the multispecies coalescent on the full dataset, using the default QFM quartet assembly method. Bootstrap support values were assembled onto the SVDquartets tree using the sumtrees command in the DendroPy package [65]. Gene trees input to ASTRAL-III were estimated from best-fit codon models inferred in codonPhyML [66] under default search intensity, using custom R scripts written by the authors. Because the mitochondrial genome does not undergo recombination and is inherited as a single locus, the three protein-coding mitochondrial genes were fit with a single codon model and inferred gene tree. Gene trees for each autosomal locus were inferred separately.

Bayesian co-estimation of phylogeny and divergence times

Prior to co-estimation of phylogeny and divergence times, a new data matrix was created from the original multi-individual, multi-locus matrix, by including DNA sequence data from only a single individual per species, from or near the type locality whenever possible (for each sampled species, the first individual listed in Table 3). The resulting reduced matrix was analyzed in BEAST v.2.5.0 [67] under the optimal partitioning scheme and substitution models suggested by the PartitionFinder2 analysis. Node ages were estimated using a Bayesian relaxed-clock method [68] under the uncorrelated lognormal (UCLN) rate variation model, and assuming a birth-death process prior for topology and divergence times. By default, the prior on the mean parameter of the UCLN clock model (ucldMean.c) is a uniform distribution on the interval (0,∞), which is an uninformative and improper prior (it does not integrate to 1). Although improper priors can sometimes lead to proper posterior distributions, they may also have undesired effects and cause problems with mixing and convergence [69]. Based on previous findings regarding substitution rates in Distichodus [18], we assumed a log-normally distributed prior for the clock rate (ucldMean.c) with hyperparameters μ = 0.003 and σ = 0.5. On the other hand, the standard deviation parameter of the UCLN clock model (ucldStdev.c) is by default assigned a gamma distribution prior. Variation in substitution rates among branches in Distichodus, however, appears to be low in general [18]. Accordingly, we assumed an exponential prior distribution with 95% of the probability density on values < 1 for the standard deviation of the UCLN (ucldStdev.c). The molecular clock was calibrated based on early Miocene (ca. 18 Ma) fossilized dentition attributable to Distichodus recovered from deposits of the Maradah Formation in Jabal Zaltan, Libya, by far the oldest fossil unambiguously assignable to the genus [35]. In fact, this fossil pushes back the first known appearance of Distichodus in the fossil record by 10 Ma with respect to the Distichodus calibration fossil used by Arroyave et al. [18] to infer a time-scaled phylogeny of citharinoid fishes. Although the Maradah fossil is unquestionably diagnostic of Distichodus (tall, slender necked tooth with a bifid apex bearing characteristically short and rounded lobes) and could potentially be ascribed to either Distichodus nefasch or D. rostratus on the basis of size and geographic distribution, its exact phylogenetic placement is unknown. The absence of relevant comparative morphological data in a phylogenetic context to which to integrate the fossil taxon, coupled with its fragmentary nature, renders it difficult to confidently assign it to a particular node and to determine whether it should be used to constrain the age of the stem or the crown group of the calibration node. Because of this phylogenetic uncertainty, along with the challenge of objectively establishing a maximum age constraint to the calibration node, we conducted a series of analyses (Table 4) to assess the robustness of node ages to analytical ambiguity and to offer alternative output scenarios based on a variety of reasonable input parameters, particularly with respect to the phylogenetic placement of the calibration node and its maximum age constraint. Specifically, we used three alternative calibration nodes: 1) MRCA of Distichodus and Paradistichodus (D + P), 2) MRCA of Distichodus (D), and 3) MRCA of D. nefasch and D. rostratus (Dne + Dro). The rationale behind this proposal is that, at the very least, the calibration fossil could be used to constrain the age of divergence between Distichodus and its sister group, Paradistichodus, but under more liberal phylogenetic designations, it could also be used to constrain the age of the entire genus or even the divergence between the species D. nefasch and D. rostratus. Furthermore, each calibration node was constrained both as stem and as crown group. Additionally, the temporal uncertainty of calibration nodes was modeled using log-normally distributed priors with a hard minimum bound set by the age of the fossil (18 Ma) and one of three alternative 95th percentile soft maximum bounds (P SMBs): 20, 30, and 40 Ma (Fig. 8; Table 4). The combinatorial exercise of choosing one of three alternative calibration nodes, constrained as stem or crown, and modeled by a log-normally distributed prior characterized by one of three alternative P SMBs, resulted in 18 different analyses (although effectively 15 since the node representing the MRCA of Distichodus as stem is equivalent to the node representing the MRCA of Distichodus and Paradistichodus as crown (Table 4). In each analysis, root age was indirectly constrained (as an implied prior) by the combined effects of the calibration prior on other internal node and the prior for topology and divergence times (birth-death process).
Table 4

Alternative BEAST2 analyses (1–15) for co-estimating phylogeny and divergence times in Distichodus resulting from variable calibration strategies (calibration node, stem vs. crown group, and 95th percentile [P] soft maximum bound [SMB] of calibration prior)

Calibration nodeLognormal PDF P95 SMB
20 Ma30 Ma40 Ma
MRCA of Distichodus & ParadistichodusStem123
Crown456
MRCA of DistichodusCrown789
MRCA of D. nefasch & D. rostratusStem101112
Crown131415
Fig. 8

Alternative log-normally distributed priors used to account for temporal uncertainty of calibration nodes. Each prior probability density function (PDF) is characterized by a hard minimum bound of 18 Ma (set by the age of the calibration fossil), a standard deviation (σ) of 0.5, and a variable mean (μ) (in real space) that probabilistically models the extent to which the node age spreads into the past: μ = 19 (black), μ = 24 (blue), and μ = 29 (red). The lower limit of the x-axis interval defining the area shaded under each curve corresponds to its 95th percentile soft maximum bound (P SMB): 20 Ma (black), 30 Ma (blue), and 40 Ma (red)

Alternative BEAST2 analyses (1–15) for co-estimating phylogeny and divergence times in Distichodus resulting from variable calibration strategies (calibration node, stem vs. crown group, and 95th percentile [P] soft maximum bound [SMB] of calibration prior) Alternative log-normally distributed priors used to account for temporal uncertainty of calibration nodes. Each prior probability density function (PDF) is characterized by a hard minimum bound of 18 Ma (set by the age of the calibration fossil), a standard deviation (σ) of 0.5, and a variable mean (μ) (in real space) that probabilistically models the extent to which the node age spreads into the past: μ = 19 (black), μ = 24 (blue), and μ = 29 (red). The lower limit of the x-axis interval defining the area shaded under each curve corresponds to its 95th percentile soft maximum bound (P SMB): 20 Ma (black), 30 Ma (blue), and 40 Ma (red) BEAST2 analyses were implemented using the Markov Chain Monte Carlo algorithm (MCMC) run for 50 million generations sampled every 1000 generations, under default proposal mechanisms and default priors for the parameters of the birth-death branching process used to provide the prior distribution for the non-calibration nodes (speciation and extinction rates) and the model of molecular evolution for each gene (substitution rates, base frequencies, gamma shape, and proportion of invariant sites). Convergence model parameter estimates were assessed via ESS values over 200, using Tracer v.1.7 [70]. Sufficient sampling of the estimate of the tree topology (ESS > 200) was determined by dividing the topological approximate ESS by the generation number of the approximate earliest stationary value in the topological autocorrelation plot, generated in the R package rwty [71]. Further assessment of MCMC convergence was undertaken by examination of the average standard deviation of split frequencies, with values << 0.01 taken as indicative of stationarity. All analyses used a 10% burn-in. A maximum clade credibility (MCC) topology was inferred using TreeAnnotator v.2.5 [67], resulting in a chronogram indicating posterior probabilities (PP) and mean ages of all nodes with their associated 95% highest posterior density (HPD) intervals.

Inference of geographic range evolution

The evolution of geographic ranges in Distichodus was investigated using the null-range-excluded dispersal-extinction-cladogenesis model (DEC*) [72], a modified version of the original likelihood-based dispersal-extinction-cladogenesis (DEC) model [73, 74]. The set of discrete geographic areas for the DEC* analysis consisted of the six Afrotropical ichthyofaunal provinces of Roberts [1] (modified by Lévêque [2]) with presence of Distichodus species: Congo Basin (CB), Zambezi (Z), Nilo-Sudan (NS), Upper Guinea (UG), Lower Guinea (LG), and East Coast (EC) (Fig. 1). African ichthyofaunal provinces were delimited on the basis of current and historical patterns of drainage connectivity and the composition of the fish fauna, and therefore represent regions with a distinctive evolutionary history and a more or less characteristic biota at the species and higher taxonomic levels [1, 2]. To assess the relative fits of alternative models of faunal assemblage in the Congo Basin, three variants of the DEC* model were fit to the data in the BioGeoBEARS R package [75], following the parameterization of dispersal multipliers from Day et al. [38]: M0, an unconstrained multiplier matrix allowing for dispersal to and from the Congo Basin; M1, an asymmetric multiplier matrix allowing only dispersal out of the Congo Basin (CB-as-source); M2, an asymmetric multiplier matrix allowing only dispersal into the Congo Basin (CB-as-sink). Tip-state ranges were assigned based on the presence of species in different ichthyofaunal provinces. In several cases, species spanned multiple provinces. The maximum range size was set to widespread (all six ichthyofaunal provinces). Given the high dimensionality of the transition matrix resulting from the combination of different provinces (areas) into ranges of sizes up to six, relative to the size of the dataset, 14 disjunct ranges of differing sizes were pruned from analysis, reducing the dimensionality of the matrix from 64 × 64 to 50 × 50. To assess the stability of numerical optimization, analysis was run five times from fresh R sessions. Model fits of the M0, M1, and M2 variants were compared using the Akaike information criterion [76] and supports were assessed using Akaike weights [77]. In an effort to take account of chronological uncertainty due to alternative molecular clock calibration scenarios, inference of geographic range evolution in Distichodus was conducted on three of the 15 time-scaled phylogenies previously inferred with BEAST2, namely the chronograms resulting from analyses based on each alternative calibration node constrained as crown and by a relatively moderate soft maximum bound (P SMB = 30 Ma) (analyses 5, 8, and 14 in Table 4). Additional file 1: Figure S1.enc1 Distichodus phylogeny as inferred by likelihood in RAxML. Colored circles on nodes indicate degree of clade support as determined by bootstrap values (BS). The identity of leaves (terminals) not printed on the tree is specified by the species name (in bold) at the base of the most recent labeled ancestral node from which the sample descends. Names in bold black correspond to those species resolved as monophyletic (when multiple individuals were available), whereas those in bold green indicate that, while most of the sampled specimens fall into the clade subtended by that node, some samples fall outside the clade, and therefore the species is not resolved as monophyletic. Outgroup taxon (Paradistichodus dimiatus) not shown. Additional file 2: Figure S2.glyt Distichodus phylogeny as inferred by likelihood in RAxML. Same contextual information as in Fig. S1. Additional file 3: >Figure S3.. myh6 Distichodus phylogeny as inferred by likelihood in RAxML. Same contextual information as in Fig. S1. Additional file 4: Figure S4.sh3px3 Distichodus phylogeny as inferred by likelihood in RAxML. Same contextual information as in Fig. S1. Additional file 5: Figure S5. mtDNA (co1, cr, cytb, nd2) Distichodus phylogeny as inferred by likelihood in RAxML. Same contextual information as in Fig. S1. Additional file 6: Figure S6. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 1. Same contextual information as in Fig. 6. Additional file 7: Figure S7. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 2. Same contextual information as in Fig. 6. Additional file 8: Figure S8. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 3. Same contextual information as in Fig. 6. Additional file 9: Figure S9.  A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 4. Same contextual information as in Fig. 6. Additional file 10: Figure S10. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 5. Same contextual information as in Fig. 6. Additional file 11: Figure S11.  A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 6. Same contextual information as in Fig. 6. Additional file 12: Figure S12. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 7. Same contextual information as in Fig. 6. Additional file 13: Figure S13. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 9. Same contextual information as in Fig. 6. Additional file 14: Figure S14. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 10. Same contextual information as in Fig. 6. Additional file 15: Figure S15. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 11. Same contextual information as in Fig. 6. Additional file 16: Figure S16. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 12. Same contextual information as in Fig. 6. Additional file 17: Figure S17. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 13. Same contextual information as in Fig. 6. Additional file 18: Figure S18. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 14. Same contextual information as in Fig. 6. Additional file 19: Fig. S19. A time-scaled phylogeny of Distichodus. Chronogram resulting from BEAST2 analysis 15. Same contextual information as in Fig. 6. Additional file 20: Figure S20. A spatiotemporal reconstruction of Distichodus range evolution. Based on the optimal DEC* model (M1; CB-as-source) and input chronogram resultant from BEAST2 analysis 5. Ichthyofaunal provinces color-coded and abbreviated as in Fig. 1. Probabilities of ancestral areas at each node are presented in Table S2. Additional file 21: Figure S21. A spatiotemporal reconstruction of Distichodus range evolution. Based on the optimal DEC* model (M1; CB-as-source) and input chronogram resultant from BEAST2 analysis 14. Ichthyofaunal provinces color-coded and abbreviated as in Fig. 1. Probabilities of ancestral areas at each node are presented in Table S3. Additional file 22: Table S1. Probabilities of ancestral states/ranges at each node of the spatiotemporal reconstruction of Distichodus range evolution presented in Fig. 7. Columns indicate ancestral areas, represented by all unique combinations for all possible group sizes for the six ichthyofaunal provinces. Rows indicate nodes, with numbering following the typical R phylo format, i.e., 1 is the first tip taxon/area, beginning at the bottom. After the last tip value, the numbering begins at the root, and moves tipward. Ichthyofaunal provinces abbreviated as in Fig. 1. Additional file 23: Table S2. Probabilities of ancestral states/ranges at each node of the spatiotemporal reconstruction of Distichodus range evolution presented in Fig. S20. Same contextual information as in Table S1. Additional file 24: Table S3. Probabilities of ancestral states/ranges at each node of the spatiotemporal reconstruction of Distichodus range evolution presented in Fig. S21. Same contextual information as in Table S1.
  34 in total

1.  Historical biogeography of the catfish genus Hypostomus (Siluriformes: Loricariidae), with implications on the diversification of Neotropical ichthyofauna.

Authors:  J I Montoya-Burgos
Journal:  Mol Ecol       Date:  2003-07       Impact factor: 6.185

2.  An empirical examination of the utility of codon-substitution models in phylogeny reconstruction.

Authors:  Fengrong Ren; Hiroshi Tanaka; Ziheng Yang
Journal:  Syst Biol       Date:  2005-10       Impact factor: 15.683

3.  Introgression as a likely cause of mtDNA paraphyly in two allopatric skippers (Lepidoptera: Hesperiidae).

Authors:  E V Zakharov; N F Lobo; C Nowak; J J Hellmann
Journal:  Heredity (Edinb)       Date:  2009-03-18       Impact factor: 3.821

4.  Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7.

Authors:  Andrew Rambaut; Alexei J Drummond; Dong Xie; Guy Baele; Marc A Suchard
Journal:  Syst Biol       Date:  2018-09-01       Impact factor: 15.683

5.  CodonPhyML: fast maximum likelihood phylogeny estimation under codon substitution models.

Authors:  Manuel Gil; Marcelo Serrano Zanetti; Stefan Zoller; Maria Anisimova
Journal:  Mol Biol Evol       Date:  2013-02-23       Impact factor: 16.240

6.  Best practices for justifying fossil calibrations.

Authors:  James F Parham; Philip C J Donoghue; Christopher J Bell; Tyler D Calway; Jason J Head; Patricia A Holroyd; Jun G Inoue; Randall B Irmis; Walter G Joyce; Daniel T Ksepka; José S L Patané; Nathan D Smith; James E Tarver; Marcel van Tuinen; Ziheng Yang; Kenneth D Angielczyk; Jenny M Greenwood; Christy A Hipsley; Louis Jacobs; Peter J Makovicky; Johannes Müller; Krister T Smith; Jessica M Theodor; Rachel C M Warnock; Michael J Benton
Journal:  Syst Biol       Date:  2011-11-21       Impact factor: 15.683

7.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

8.  Relaxed phylogenetics and dating with confidence.

Authors:  Alexei J Drummond; Simon Y W Ho; Matthew J Phillips; Andrew Rambaut
Journal:  PLoS Biol       Date:  2006-03-14       Impact factor: 8.029

9.  Importance of incomplete lineage sorting and introgression in the origin of shared genetic variation between two closely related pines with overlapping distributions.

Authors:  Y Zhou; L Duvaux; G Ren; L Zhang; O Savolainen; J Liu
Journal:  Heredity (Edinb)       Date:  2016-09-21       Impact factor: 3.821

10.  BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis.

Authors:  Remco Bouckaert; Timothy G Vaughan; Joëlle Barido-Sottani; Sebastián Duchêne; Mathieu Fourment; Alexandra Gavryushkina; Joseph Heled; Graham Jones; Denise Kühnert; Nicola De Maio; Michael Matschiner; Fábio K Mendes; Nicola F Müller; Huw A Ogilvie; Louis du Plessis; Alex Popinga; Andrew Rambaut; David Rasmussen; Igor Siveroni; Marc A Suchard; Chieh-Hsi Wu; Dong Xie; Chi Zhang; Tanja Stadler; Alexei J Drummond
Journal:  PLoS Comput Biol       Date:  2019-04-08       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.