| Literature DB >> 31225900 |
Daijiang Li1, Lauren Trotta1, Hannah E Marx2, Julie M Allen3, Miao Sun4, Douglas E Soltis4, Pamela S Soltis4, Robert P Guralnick4, Benjamin Baiser1.
Abstract
Should we build our own phylogenetic trees based on gene sequence data, or can we simply use available synthesis phylogenies? This is a fundamental question that any study involving a phylogenetic framework must face at the beginning of the project. Building a phylogeny from gene sequence data (purpose-built phylogeny) requires more effort, expertise, and cost than subsetting an already available phylogeny (synthesis-based phylogeny). However, we still lack a comparison of how these two approaches to building phylogenetic trees influence common community phylogenetic analyses such as comparing community phylogenetic diversity and estimating trait phylogenetic signal. Here, we generated three purpose-built phylogenies and their corresponding synthesis-based trees (two from Phylomatic and one from the Open Tree of Life, OTL). We simulated 1,000 communities and 12,000 continuous traits along each purpose-built phylogeny. We then compared the effects of different trees on estimates of phylogenetic diversity (alpha and beta) and phylogenetic signal (Pagel's λ and Blomberg's K). Synthesis-based phylogenies generally yielded higher estimates of phylogenetic diversity when compared to purpose-built phylogenies. However, resulting measures of phylogenetic diversity from both types of phylogenies were highly correlated (Spearman's ρ > 0.8 in most cases). Mean pairwise distance (both alpha and beta) is the index that is most robust to the differences in tree construction that we tested. Measures of phylogenetic diversity based on the OTL showed the highest correlation with measures based on the purpose-built phylogenies. Trait phylogenetic signal estimated with synthesis-based phylogenies, especially from the OTL, was also highly correlated with estimates of Blomberg's K or close to Pagel's λ from purpose-built phylogenies when traits were simulated under Brownian motion. For commonly employed community phylogenetic analyses, our results justify taking advantage of recently developed and continuously improving synthesis trees, especially the Open Tree of Life.Entities:
Keywords: alpha diversity; beta diversity; community phylogenetic structure; open tree of life; phylogenetic diversity; phylogenetic signal; trait
Mesh:
Year: 2019 PMID: 31225900 PMCID: PMC7079099 DOI: 10.1002/ecy.2788
Source DB: PubMed Journal: Ecology ISSN: 0012-9658 Impact factor: 5.499
Figure 1Workflow to assess effects of commonly used synthesis phylogenies on community phylogenetic diversity and trait phylogenetic signal estimations. Boxes with light yellow background are related to community phylogenetic diversity; boxes with light blue background are related to trait phylogenetic signal. APG, Angiosperm Phylogeny Group; OTL, Open Tree of Life; PD, Faith's phylogenetic diversity; MPD, mean pairwise distance; MNTD, mean nearest taxon distance; Unif, Unifraction; PCD, phylogenetic community dissimilarity; λ, Pagel's lambda; K, Blomberg's K.
Figure 2Median correlations of phylogenetic alpha diversity values based on different phylogenies.
Slopes based on linear mixed models (LMMs)
| Index | Data set | tree_apg | tree_zanne | tree_otl | tree_random |
|---|---|---|---|---|---|
| PD | Pine (540 sp) | 0.843 (0.837, 0.849) | 0.917 (0.913, 0.922) | 0.971 (0.969, 0.974) | −0.001 (−0.013, 0.01) |
| PD | Alpine (1064 sp) | 0.854 (0.848, 0.86) | 0.915 (0.91, 0.919) | 0.937 (0.933, 0.941) | −0.022 (−0.034, −0.01) |
| PD | FL (1548 sp) | 0.92 (0.916, 0.924) | 0.891 (0.886, 0.896) | 0.871 (0.865, 0.876) | 0.006 (−0.005, 0.018) |
| MPD | Pine (540 sp) | 0.891 (0.885, 0.896) | 0.972 (0.969, 0.974) | 0.996 (0.995, 0.997) | 0.047 (0.036, 0.059) |
| MPD | Alpine (1064 sp) | 0.957 (0.954, 0.96) | 0.997 (0.997, 0.998) | 0.941 (0.937, 0.945) | 0.004 (−0.008, 0.015) |
| MPD | FL (1548 sp) | 0.962 (0.958, 0.965) | 0.95 (0.946, 0.953) | 0.895 (0.889, 0.9) | −0.002 (−0.014, 0.009) |
| MNTD | Pine (540 sp) | 0.78 (0.773, 0.788) | 0.787 (0.78, 0.794) | 0.897 (0.892, 0.902) | 0.006 (−0.006, 0.017) |
| MNTD | Alpine (1064 sp) | 0.713 (0.705, 0.721) | 0.794 (0.787, 0.801) | 0.874 (0.869, 0.88) | −0.016 (−0.028, −0.004) |
| MNTD | FL (1548 sp) | 0.856 (0.85, 0.862) | 0.797 (0.79, 0.804) | 0.831 (0.824, 0.837) | 0.03 (0.018, 0.041) |
PD = Faith's PD, MPD = mean pairwise distance, MNTD = mean pairwise distance between the closest relatives. Within the model, the response variable is the phylogenetic alpha diversity values based on the purpose‐built phylogeny; the predictor is the phylogenetic alpha diversity values based on one of the synthesis‐based phylogenies (tree_apg, tree_zanne, tree_otl, and tree_random). Therefore, slopes less than one indicate that diversity values based on synthesis‐based phylogenies were higher than those based on the purpose‐built phylogenies. Numbers within parentheses are the 95% confidence intervals for the slopes.
Figure 3Median correlations of phylogenetic beta diversity values based on different phylogenies.
Slopes based on linear mixed models (LMMs)
| Index | Data set | tree_apg | tree_zanne | tree_otl | tree_random |
|---|---|---|---|---|---|
| Unif | Pine (540 sp) | 0.824 (0.822, 0.826) | 0.791 (0.789, 0.793) | 0.87 (0.869, 0.872) | 0.063 (0.058, 0.067) |
| Unif | Alpine (1064 sp) | 0.811 (0.808, 0.813) | 0.871 (0.869, 0.873) | 0.896 (0.894, 0.897) | 0.056 (0.053, 0.06) |
| Unif | FL (1548 sp) | 0.871 (0.869, 0.873) | 0.791 (0.788, 0.793) | 0.814 (0.812, 0.816) | 0.071 (0.066, 0.075) |
| MPD_beta | Pine (540 sp) | 0.34 (0.337, 0.342) | 0.972 (0.969, 0.975) | 1.23 (1.225, 1.234) | 0.009 (0.007, 0.011) |
| MPD_beta | Alpine (1064 sp) | 0.797 (0.794, 0.799) | 0.976 (0.976, 0.977) | 1.122 (1.117, 1.127) | 0.002 (0.001, 0.004) |
| MPD_beta | FL (1548 sp) | 0.778 (0.776, 0.781) | 1.343 (1.339, 1.347) | 1.805 (1.797, 1.813) | 0.001 (‐0.001, 0.002) |
| MNTD_beta | Pine (540 sp) | 0.856 (0.853, 0.859) | 0.857 (0.854, 0.86) | 0.928 (0.926, 0.93) | 0.054 (0.05, 0.058) |
| MNTD_beta | Alpine (1064 sp) | 0.896 (0.894, 0.899) | 0.952 (0.95, 0.954) | 0.942 (0.94, 0.943) | 0.046 (0.043, 0.05) |
| MNTD_beta | FL (1548 sp) | 0.787 (0.785, 0.789) | 0.762 (0.76, 0.764) | 0.75 (0.748, 0.752) | 0.039 (0.036, 0.043) |
| PCD | Pine (540 sp) | 0.857 (0.854, 0.86) | 0.828 (0.825, 0.831) | 0.872 (0.87, 0.875) | 0.089 (0.085, 0.093) |
| PCD | Alpine (1064 sp) | 0.827 (0.825, 0.83) | 0.912 (0.909, 0.915) | 0.907 (0.905, 0.909) | 0.059 (0.055, 0.063) |
| PCD | FL (1548 sp) | 0.802 (0.799, 0.804) | 0.744 (0.741, 0.746) | 0.719 (0.716, 0.722) | 0.054 (0.05, 0.059) |
Unif = UniFrac, MPD_beta = interassemblage mean pairwise distance, MNTD_beta = interassemblage mean pairwise distance between the closest relatives, and PCD = phylogenetic community dissimilarity. Within the model, the response variable is the phylogenetic beta diversity values based on the purpose‐built phylogeny; the predictor is the phylogenetic beta diversity values based on one of the synthesis phylogenies (tree_apg, tree_zanne, tree_otl, and tree_random). Therefore, slopes less than one indicate that diversity values based on synthesis‐based phylogenies were higher than those based on the purpose‐built phylogenies. Numbers within parentheses are the 95% confidence intervals for the slopes.
Figure 4Estimated Pagel's λ for traits simulated with divergence rate of 0.2. When traits were simulated under BM and weak OU models, estimated Pagel's λ values based on tree_otl were the closest to those estimated based on tree_purpose in most cases and had smaller variances than other synthesis‐based phylogenies. Note that we allow λ to be larger than 1 in all estimates.
Figure 5Estimated Blomberg's K for traits simulated with divergence rate of 0.2. Because for Blomberg's K, it is the mean, not the median, value that has the expected value of 1, we did not use boxplots as in Fig. 4. Instead, we added the average values (red points) on top of jittered raw estimated values.
Figure 6Spearman's rank correlations of estimated Blomberg's K values between tree_purpose and the three synthesis‐based phylogenies.