| Literature DB >> 23628960 |
Johannes Bergsten1, Anders N Nilsson, Fredrik Ronquist.
Abstract
We review Bayesian approaches to model testing in general and to the assessment of topological hypotheses in particular. We show that the standard way of setting up Bayes factor tests of the monophyly of a group, or the placement of a sample sequence in a known reference tree, can be misleading. The reason for this is related to the well-known dependency of Bayes factors on model-specific priors. Specifically, when testing tree hypotheses it is important that each hypothesis is associated with an appropriate tree space in the prior. This can be achieved by using appropriately constrained searches or by filtering trees in the posterior sample, but in a more elaborate way than typically implemented. If it is difficult to find the appropriate tree sets to be contrasted, then the posterior model odds may be more informative than the Bayes factor. We illustrate the recommended techniques using an empirical test case addressing the issue of whether two genera of diving beetles (Coleoptera: Dytiscidae), Suphrodytes and Hydroporus, should be synonymized. Our refined Bayes factor tests, in contrast to standard analyses, show that there is strong support for Suphrodytes nesting inside Hydroporus, and the genera are therefore synonymized.Entities:
Mesh:
Year: 2013 PMID: 23628960 PMCID: PMC3739882 DOI: 10.1093/sysbio/syt029
Source DB: PubMed Journal: Syst Biol ISSN: 1063-5157 Impact factor: 15.683
Interpretation of the Bayes factor for hypothesis testing after Kass and Raftery (1995)
| Bayes factor | 2×log | Interpretation |
|---|---|---|
| 1–3 | 0–2 | Not worth more than a bare mention |
| 3–20 | 2–6 | Positive |
| 20–150 | 6–10 | Strong |
| > 150 | >10 | Very strong |
Figure 1Schematic illustration of the Bayes factor test of the hypothesis that A is a monophyletic group (H1) against the hypothesis that it is not (H0). We assume that A consists of two strongly supported subclades, A1 and A2, but that the rest of the tree is unresolved such that there is no evidence that A1 and A2 together form a monophyletic group. The Bayes factor compares the average height of the posterior over the prior tree space of each hypothesis. When using a standard H0, the signal is spread over a large tree space, and the test suggests that H1 is strongly supported. If we use an informed prior for H0 by restricting the tree space to those trees that have A1 and A2 monophyletic, then the Bayes factor correctly identifies that there is no support for or against H1 over H0, as the average height is the same. A posterior odds test compares the total probability mass under each hypothesis (the area under the posterior distribution), and is therefore not as strongly influenced by the size of the tree space as the Bayes factor.
FIGURE 3Phylogeny based on the combined four-gene matrix from the unconstrained Bayesian analysis and rooted between outgroups and ingroups as defined a priori. Numbers refer to posterior probability clade support values.
Figure 2Comparison of standard (a–b) and preferred (c–d) Bayesian hypothesis testing for the empirical example. Standard: When the H0 hypothesis a) of a paraphyletic Hydroporus is tested as unconstrained against the alternative H1 hypothesis b) constrained to comply with a monophyletic Hydroporus, the latter is strongly preferred by the Bayes factor (2×log BF = 88.96 in favor of H1). However, this is largely due to strong signal for the two subclades within Hydroporus. Preferred: When the H0 hypothesis c) of a paraphyletic Hydroporus is associated with an informed topology prior comparable to that of H1 d), the Bayes factor instead prefers H0 (2 × log BF = 7.82 in favor of H0). This is also the hypothesis with the largest posterior probability.
Posterior probability of hypotheses based on an unconstrained analysis and a backbone-constrained analysis where Hydroporus is constrained monophyletic in relation to the outgroups but Suphrodytes is allowed to “float”
| Constraints | H0 | H1 | H2 | H3 |
|---|---|---|---|---|
| Unconstrained | 0.03339 | 0.86825 | 0.08666 | 0.9469 |
| (30,004) | (1018) | (26,050) | (2601) | (28,411) |
| Backbone | 0.03376 | 0.88025 | 0.08582 | 0.9662 |
| (30,004) | (1013) | (26,410) | (2576) | (28,991) |
Notes: H0 = Hydroporus monophyletic to the exclusion of Suphrodytes; H1 = Suphrodytes+Hydroporus Clade II monophyletic; H2 =Suphrodytes +Hydroporus Clade I monophyletic; H3 = Suphrodytes nested within Hydroporus. In parenthesis the total number of trees filtered out from the total (30,004) that supports the hypothesis.
Estimations of the marginal likelihood using four independent stepping-stone sampling runs (SS) of three constrained hypotheses with an equally informed prior on topology, and of an unconstrained analysis (UC)
| SS runs | Marg. Log Lik. H0 | Marg. Log Lik. H1 | Marg. Log Lik. H2 | Marg. Log Lik. UC | 2×log | 2×log |
|---|---|---|---|---|---|---|
| 1 | −13,366.60 | −13,364.61 | −13,364.61 | −13,411.57 | ||
| 2 | −13,368.18 | −13,362.88 | −13,364.75 | −13,410.21 | ||
| 3 | −13,366.39 | −13,361.74 | −13,365.65 | −13,410.75 | ||
| 4 | −13,365.59 | −13,362.19 | −13,365.36 | −13,411.32 | ||
| Mean | −13,366.34 | −13,362.43 | −13,365.00 | −13,410.82 | −7.82 | 88.96 |
Mean and standard deviation from four independent stepping-stone sampling estimations of the marginal likelihood under different constraints and unconstrained (UC)
| Constraints | Marg. Log Lik. | St. Dev. (4 runs) |
|---|---|---|
| UC | −13,410.82 | 0.6078 |
| H | −13,391.51 | 0.3093 |
| HS | −13,389.30 | 1.9914 |
| HS, H1, H2, S | −13,363.21 | 1.0238 |
| H, HS, H1, H2, S | −13,366.34 | 1.0845 |
| H2S, HS, H1, H2, S | −13,362.43 | 1.2604 |
| H1S, HS, H1, H2, S | −13,365.00 | 0.4941 |
| H, H1, H2 | −13,370.77 | 1.8202 |
Note: For explanation of constraints see Figure 2.
| ID Cat. No. | Genus | Species | Species gr. | 16S | CO1 | 18S | H3 |
|---|---|---|---|---|---|---|---|
| Ingroup | |||||||
| BMNH 744009 | JX221577 | JX221617 | JX221591 | JX221693 | |||
| BMNH 744010 | JX221578 | JX221618 | JX221592 | JX221694 | |||
| BMNH 722881 | longulus gr. | ||||||
| BMNH 725985 | rufifrons gr. | ||||||
| BMNH 725987 | tristis gr. | ||||||
| BMNH 800416 | niger gr. | ||||||
| BMNH 800432 | axillaris gr. | ||||||
| BMNH 800437 | subpubescens gr. | ||||||
| BMNH 800458 | notabilis gr. | ||||||
| BMNH 800761 | transpunctatus gr. | ||||||
| BMNH 800780 | appalachius gr. | ||||||
| BMNH 800785 | puberulus gr. | ||||||
| BMNH 800806 | lapponum gr. | ||||||
| BMNH 800846 | neglectus gr. | ||||||
| BMNH 801288 | nigrita gr. | ||||||
| BMNH 801293 | tesselatus gr. | ||||||
| BMNH 801298 | memnonius gr. | ||||||
| BMNH 801303 | obscurus gr. | ||||||
| BMNH 824791 | columbianus gr. | ||||||
| BMNH 824798 | sinuatipes gr. | ||||||
| BMNH 824800 | erythrocephalus gr. | ||||||
| NHRS-JLKB000000528 | lapponum gr. | ||||||
| NHRS-JLKB000000678 | nigellus gr. | ||||||
| BMNH 681719 | nigellus gr. | AY365277 | AY365311 | AJ850515 | EF670195 | ||
| BMNH 681248 | fuscipennis gr. | AF518274 | AF518305 | AJ318733 | EF670196 | ||
| BMNH 681250 | fuscipennis gr. | EF419327 | AF309300 | AJ318734 | EF056566 | ||
| BMNH 681249 | angustatus gr. | AF518278 | AF518309 | AJ850516 | EF670197 | ||
| BMNH 681239 | striola gr. | AF518281 | AF518312 | AJ850517 | EF670198 | ||
| Outgroups | |||||||
| BMNH 693521 | AJ850379 | AJ850629 | AJ850518 | EF670199 | |||
| BMNH 681287 | AF518252 | AF518282 | AJ318732 | EF670194 | |||
| BMNH 681544 | AJ850380 | AJ850630 | AJ850519 | EF670193 | |||
| BMNH 681288 | AJ850381 | AJ850631 | AJ318741 | EF670200 | |||
| BMNH 681625 | AJ850426 | AJ850673 | AJ850552 | EF670202 | |||
| MNCN AI9 | EF056665 | EF059805 | EF056633 | EF056550 | |||
| MNCN-AI126 | EF056677 | EF056606 | EF056642 | EF056563 | |||
| BMNH 681225 | EF056688 | EF056618 | EF056651 | EF056576 | |||
| BMNH 681839 | AJ850333 | AJ850585 | AJ850457 | EF670118 | |||
| BMNH 681331 | AY334131 | AY334247 | AJ318738 | EF056578 | |||
| ID Cat. No. | Legit | Country | Locality |
|---|---|---|---|
| Ingroup | |||
| BMNH 744009 | J. Geijer | Sweden | Öland, Kalmar Län, Mörbylånga, Algustrum, Jordtorp grustag. 27 August 2005 |
| BMNH 744010 | J. Geijer | Sweden | Öland, Kalmar Län, Borgolm, Högsrum, Vitlerkärret. 19 July 2005 |
| BMNH 722881 | G. N. Foster | France | Isère, Ruisseau des Fontinettes. 17 July 2005 |
| BMNH 725985 | AN. Nilsson | Sweden | Ångermanland, Torrböle, 23 May 2005 |
| BMNH 725987 | AN. Nilsson | Sweden | Ångermanland, Torrböle, 23 May 2005 |
| BMNH 800416 | J. Bergsten | United States | New York, Richford, SE of Ithaca. 11 September 2002 |
| BMNH 800432 | J. Bergsten | United States | California, Eldorado Co., American river, campground by Silver lake. 20 September 2002 |
| BMNH 800437 | J. Bergsten | United States | California, Alpine Co., Hope Valley, Blue Lakes road, West Fork, Carson River. 21 September 2002 |
| BMNH 800458 | J. Bergsten | Canada | Alberta, Meanook biological station, W4mer. Twp65 Rge23 Sec.12NW. 3 September 2002 |
| BMNH 800761 | J. Bergsten | Canada | Alberta, Hwy.11 approx. 5 km E. of border to Banff National Park. 7 September 2002 |
| BMNH 800780 | J. Bergsten | Canada | Alberta, W4mer. Twp67 Rge24 Sec.34SE, 2 September 2002 |
| BMNH 800785 | J. Bergsten | Canada | Alberta, W4mer. Twp65 Rge25 Sec.1SE. 3 September 2002 |
| BMNH 800806 | J. Bergsten | Canada | Alberta, Hwy. 11, just E. of Big Horn. 7 September 2002 |
| BMNH 800846 | J. Bergsten | Sweden | Västerbotten, Hörnefors. 31 July 2007 |
| BMNH 801288 | D. Bilton | United Kingdom | Cornwall, The Lizard Pool 1 at Kynanace Cove. 1 July 2005 |
| BMNH 801293 | D. Bilton | United Kingdom | Cornwall, stream beside B3300 above bridge, 20 June 2005 |
| BMNH 801298 | D. Bilton | United Kingdom | Cornwall, The Lizard Pool 2 at Kynanace Cove. 1 July 2005 |
| BMNH 801303 | J. Bergsten | Sweden | Västerbotten, Umeå, Sörfors, Umeälven. 26 August 2007 |
| BMNH 824791 | J. Bergsten | United States | California Sierra Co., Sierra Valley, Hwy 89 & 49. 21 September 2002 |
| BMNH 824798 | J. Bergsten | United States | California, Modoc Co., Hwy 299 approx. 5 km E. of Cedarville. 22 September 2002 |
| BMNH 824800 | D. Bilton | United Kingdom | Cornwall, The Lizard, Hayle Kimbro pool. 1 July 2005 |
| NHRS-JLKB000000528 | AN. Nilsson | Sweden | Torne Lappmark, Abisko, myre E. of B Njakajaure, 28 June 2010 |
| NHRS-JLKB000000678 | AN. Nilsson | Sweden | Västerbotten, Vindeln, Strycksele, 29 May 2010 |
| BMNH 681719 | B. Andren | Sweden | S. Ha. Onsala, Kustgöl, 1100 m VSV Rorvik, 3 November 2000 |
| BMNH 681248 | D.T. Bilton | Tenerife | Anaga, Roque Chinobre, December 1997 |
| BMNH 681250 | I. Ribera | Spain | Burgos |
| BMNH 681249 | I. Ribera | England | Dorset, Wareham, Morden bog, 5 July 1998 |
| BMNH 681239 | I. Ribera | Portugal | Sa. Da Estrela, Torre, lagoon 25 July 1998 |
| BMNH 693521 | A.N. Nilsson | Sweden | Prov. Västerbotten, Åmsele, 3 August 1999 |
| BMNH 681287 | Y. Alarie | Canada | Ontario |
| BMNH 681544 | Y. Alarie | United States | New Mexico, September 2000 |
| BMNH 681288 | Y. Alarie | Canada | Ontario |
| BMNH 681625 | I. Ribera & A. Cieslak | United States | 16 US California Mendocino co. / Rd. 1 Manchester / pond S City / 23 June 2000 |
| Outgroups | |||
| MNCN AI9 | G. Challet | South Africa | North Cape, Stream at top of Studer Pass, 22 August 2004 |
| MNCN-AI126 | M. Balke | Madagascar | Andasibe, Station Forestiere, orchid garden, 979 m, xi./xii.2004, MD 037 |
| BMNH 681225 | I. Ribera | United Kingdom | Sommerset Levels, Catcott Heath, 4 July 1998 |
| BMNH 681839 | I. Ribera & A. Cieslak | South Africa | 17, W Cape, Limiet Berge, Tributary r. Wit, rd. R301 24 km NE Wellington. 25 March 2001 |
| BMNH 681331 | I. Ribera | Chile | 16, X Reg. 6 km W La Unión, Pond in rd. to Hueicolla, 29 January 1999 |