Stephen A Smith [1], Joseph W Brown [1]
[1] Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA.
Abstract
PREMISE OF THE STUDY: Large phylogenies can help shed light on macroevolutionary patterns that inform our understanding of fundamental processes that shape the tree of life. These phylogenies also serve as tools that facilitate other systematic, evolutionary, and ecological analyses. Here we combine genetic data from public repositories (GenBank) with phylogenetic data (Open Tree of Life project) to construct a dated phylogeny for seed plants.
METHODS: We conducted a hierarchical clustering analysis of publicly available molecular data for major clades within the Spermatophyta. We constructed phylogenies of major clades, estimated divergence times, and incorporated data from the Open Tree of Life project, resulting in a seed plant phylogeny. We estimated diversification rates, excluding those taxa without molecular data. We also summarized topological uncertainty and data overlap for each major clade.
KEY RESULTS: The trees constructed for Spermatophyta consisted of 79,881 and 353,185 terminal taxa; the latter included the Open Tree of Life taxa for which we could not include molecular data from GenBank. The diversification analyses demonstrated nested patterns of rate shifts throughout the phylogeny. Data overlap and inference uncertainty vary significantly across the phylogeny and demonstrate the continued need for data collection across seed plants.
CONCLUSIONS: This study demonstrates a means for combining available resources to construct a dated phylogeny for plants. However, this approach is an early step, and further developments are needed to add data, better incorporate underlying uncertainty, and improve resolution. The methods discussed here can also be applied to other major clades in the tree of life.
Keywords:
GenBank; Open Tree of Life; clustering; divergence-time estimation; diversification; phylogenetic methods; phylogenetics; plant tree of life; seed plants