| Literature DB >> 22373076 |
Lyndall L Pereira-da-Conceicoa1, Benjamin W Price, Helen M Barber-James, Nigel P Barker, Ferdy C de Moor, Martin H Villet.
Abstract
BACKGROUND: Baetis harrisoni Barnard is a mayfly frequently encountered in river studies across Africa, but the external morphological features used for identifying nymphs have been observed to vary subtly between different geographic locations. It has been associated with a wide range of ecological conditions, including pH extremes of pH 2.9-10.0 in polluted waters. We present a molecular study of the genetic variation within B. harrisoni across 21 rivers in its distribution range in southern Africa.Entities:
Mesh:
Year: 2012 PMID: 22373076 PMCID: PMC3523013 DOI: 10.1186/1471-2148-12-26
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Map of recorded and sampled localities. Recorded localities of B. harrisoni and the location of sampled sites corresponding to the major lineages of B. harrisoni in Southern Africa. Data obtained from the Albany Museum database.
Data characteristics and analysis summaries
| | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 52 | 357 | 58 | 33 | 9.2 | 10000 | 49 | 0.776 | 0.969 | K80 + G | −1063.23 (1P) | |
| 21 | 458 | 33 | 14 | 3.1 | 5 | 18 | 0.833 | 0.957 | GTR + I | −913.51 (1P) | |
| 50 | 502 | 71 | 65 | 13.0 | 10000 | 104 | 0.779 | 0.965 | GTR + I | −1393.33 (1P) | |
| 50 | 618 | 224 | 191 | 31.0 | 15 | 424 | 0.642 | 0.936 | GTR + I + G | −3016.22 (1P) | |
| 67 | 1477 | 353 | 289 | 20.0 | 10000 | 586 | 0.667 | 0.943 | GTR + I + G | −5724.33* (1P) | |
The number of specimens with sequence data (ntax), number of variable (# Var), parsimony informative (# Pi), percent parsimony informative (% Pi) and total number of base pairs (bp) is reported. The parsimony search summary comprises the number of trees retained (# trees), tree length (score), Consistence Index (CI) and Retention Index (RI). The Bayesian analysis summary comprises the model selected by MrModeltest, the harmonic mean of the estimated marginal likelihoods and the partitioning strategy indicated in brackets; an asterisk indicates the value is the re-estimated marginal likelihood using the generalized stepping stone model.
Pairwise Incongruence Length Difference (ILD)
| - | | | |
| - | | ||
| 0.61 | - | ||
| 0.20 | 0.74 |
ILD test probability values for the four data partitions generated using 1000 replicates. Statistically significant values are emphasised in bold.
Figure 2Combined Bayesian Inference with GMYC. Bayesian Inference phylogram of combined molecular data (excluding EF1α data) with support for major nodes shown above (Bayesian posterior probability / Parsimony bootstrap / Bremer support) and below (partitioned Bremer support: PEPCK / 16S / COI) each branch. The columns to the right of the tree indicate species boundaries identified by the GMYC likelihood analyses for the COI (clades: C1–C6), 16S (clades: S1–S5) and PEPCK (clades: P1–P7) partitions respectively. The fourth (dark grey) column shows five well supported monophyletic consensus groups identified by the GMYC likelihood analyses of the combined data (PEPCK + 16S + COI), representing putative phylogenetic species further discussed in this study. Node numbers (circled) correspond to date estimates reported in Tables 3 &4 and Additional file 2: Figure S1.
Isolation by Distance (IBD)
| INGROUP | 0.084 | |
| SA Ced | 0.338 | 0.077 |
| SA West | 0.064 | |
| SA East | 0.132 | |
| SA East + MAL + ZAM | 0.717 |
The contribution of Isolation by Distance (R2) and corresponding probability values (p) for each clade analysed using the COI gene. Statistically significant values are emphasised in bold.
TMRCA with substitution rate at 1.5% per My
| | | ||||||
|---|---|---|---|---|---|---|---|
| MAL | 8 | 1.27 | (0.19–2.86) | 1.63 | (0.29–3.36) | 1.14 | (0.15–2.58) |
| ZAM | 6 | 1.55 | (0.19–3.63) | 1.88 | (0.24–4.16) | 1.45 | (0.19–3.38) |
| SA Ced | 2 | 2.19 | (0.65–4.23) | 2.11 | (0.65–3.94) | 1.99 | (0.60–3.79) |
| SA West | 4 | 3.04 | (1.04–5.65) | 3.54 | (1.35–6.19) | 2.71 | (0.93–5.12) |
| SA East | 9 | 3.92 | (1.97–6.30) | 3.45 | (1.66–5.73) | 3.84 | (1.80–6.29) |
| (SA East + MAL) | 7 | 8.19 | (3.91–13.09) | 5.75 | (2.81–9.60) | 8.85 | (3.92–14.72) |
| (SA East + MAL + ZAM) | 5 | 12.74 | (5.96–20.13) | 7.74 | (3.68–12.71) | 13.92 | (6.30–22.81) |
| (SA East + MAL + ZAM + SA West) | 3 | 18.80 | (9.41–29.73) | - | - | 20.09 | (9.41–32.88) |
| (SA East + MAL + ZAM + SA West + SA Ced) | 1 | 24.23 | (12.68–38.46) | 11.49 | (5.64–18.63) | 24.63 | (11.41–39.94) |
Mean and 95% HPD estimates of the time to most recent common ancestor (TMRCA) estimated using a COI substitution rate of 1.5% per My in BEAST. Node numbers correspond to those in Figure 2; “-” represents node present in Figure 2 but absent in BEAST analysis. Clades shown in ascending order of age of MRCA.
Figure 3Bayesian Inference Phylograms. Bayesian Inference phylograms of each of the four different datasets corresponding to each individual gene: (a) 16S (b) PEPCK (c) COI (d) EF1α. Posterior probability support (p > 0.95) is indicated using an asterisk above the corresponding branch.
TMRCA with substitution rate at 3.5% per My
| | | ||||||
|---|---|---|---|---|---|---|---|
| MAL | 8 | 0.59 | (0.08–1.30) | 0.70 | (0.13–1.55) | 0.54 | (0.09–1.20) |
| ZAM | 6 | 0.71 | (0.08–1.68) | 0.85 | (0.10–1.92) | 0.64 | (0.06–1.52) |
| SA Ced | 2 | 0.96 | (0.25–1.87) | 0.94 | (0.28–1.77) | 0.89 | (0.26–1.76) |
| SA West | 4 | 1.41 | (0.48–2.64) | 1.60 | (0.62–2.86) | 1.29 | (0.46–2.41) |
| SA East | 9 | 1.73 | (0.80–2.92) | 1.54 | (0.72–2.59) | 1.71 | (0.81–2.83) |
| (SA East + MAL) | 7 | 3.64 | (1.71–6.08) | 2.40 | (1.25–4.29) | 3.96 | (1.69–6.65) |
| (SA East + MAL + ZAM) | 5 | 5.67 | (2.50–9.48) | 3.46 | (1.66–5.75) | 6.24 | (2.71–10.52) |
| (SA East + MAL + ZAM + SA West) | 3 | 8.16 | (3.85–13.51) | 4.33 | (2.20–6.95) | - | - |
| (SA East + MAL + ZAM + SA West + SA Ced) | 1 | 10.42 | (4.90–17.33) | 5.15 | (2.66–8.51) | 13.23 | (5.45–22.47) |
Mean and 95% HPD estimates of the time to most recent common ancestor (TMRCA) estimated using a COI substitution rate of 3.5% per My in BEAST. Node numbers correspond to those in Figure 2; “-” represents node present in Figure 2 but absent in BEAST analysis. Clades shown in ascending order of age of MRCA.
Summary of GMYC
| | | | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 48 | 161.10 | 163.11 | 4.03 | 0.25 | 6 | 4–18 | 163.72 | 1.50 | 0.68 | 9 | 5–13 | |
| 48 | 106.58 | 108.46 | 3.75 | 0.29 | 5 | 1–15 | 163.58 | 0.00 | 1.00 | 5 | 1–11 | |
| 54 | 166.48 | 169.30 | 5.62 | 0.13 | 7 | 4–12 | 169.49 | 0.38 | 0.94 | 6 | 4–15 | |
| 64 | 169.65 | 173.60 | 7.90 | 6 | 4–17 | 173.84 | 0.48 | 0.92 | 7 | 6–32 | ||
Summary of the single and multiple threshold GMYC analyses, including the number of unique haplotypes/genotypes (H/G) in each dataset; the likelihood values estimated under the null (Coalescent) and GMYC (mixed Yule Coalescent) priors; the value used for the likelihood ratio test (LR) and its associated probability (p), corresponding to a comparison between the single threshold and the null model, and the comparison between the multiple threshold and the single threshold models. The estimated number of Maximum Likelihood entities or “phylogenetic species” and the confidence interval (CI) around these values are reported. Statistically significant values are emphasised in bold. ML entities correspond to clades labelled in Figure 2.