| Literature DB >> 24278335 |
Zhenxiang Xi1, Joshua S Rest, Charles C Davis.
Abstract
The extant seed plants include more than 260,000 species that belong to five main lineages: angiosperms, conifers, cycads, Ginkgo, and gnetophytes. Despite tremendous effort using molecular data, phylogenetic relationships among these five lineages remain uncertain. Here, we provide the first broad coalescent-based species tree estimation of seed plants using genome-scale nuclear and plastid data By incorporating 305 nuclear genes and 47 plastid genes from 14 species, we identify that i) extant gymnosperms (i.e., conifers, cycads, Ginkgo, and gnetophytes) are monophyletic, ii) gnetophytes exhibit discordant placements within conifers between their nuclear and plastid genomes, and iii) cycads plus Ginkgo form a clade that is sister to all remaining extant gymnosperms. We additionally observe that the placement of Ginkgo inferred from coalescent analyses is congruent across different nucleotide rate partitions. In contrast, the standard concatenation method produces strongly supported, but incongruent placements of Ginkgo between slow- and fast-evolving sites. Specifically, fast-evolving sites yield relationships in conflict with coalescent analyses. We hypothesize that this incongruence may be related to the way in which concatenation methods treat sites with elevated nucleotide substitution rates. More empirical and simulation investigations are needed to understand this potential weakness of concatenation methods.Entities:
Mesh:
Year: 2013 PMID: 24278335 PMCID: PMC3836751 DOI: 10.1371/journal.pone.0080870
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Conflicting phylogenetic relationships among extant gymnosperms.
(A) The four main lineages of extant gymnosperms: (1) conifers (Pinus resinosa), (2) cycads (Cycas sp.), (3) Ginkgo biloba, and (4) gnetophytes (Ephedra chilensis). (B) Two main hypotheses for phylogenetic relationships of gymnosperms. (C) Three main hypotheses for the phylogenetic placement of gnetophytes. (D) Two main hypotheses for the phylogenetic placement of Ginkgo.
Data sources of nuclear gene sequences included in our phylogenetic analyses.
|
|
|
|
|
|
|---|---|---|---|---|
|
| [ | 5,724 | 107 | 47.1% |
|
| [ | 32,987 | 251 | 45.1% |
|
| [ | 8,224 | 184 | 44.0% |
|
| [ | 4,211 | 118 | 45.1% |
|
| [ | 3,739 | 88 | 44.7% |
|
| [ | 2,016 | 44 | 44.8% |
|
| [ | 68,266 | 266 | 48.1% |
|
| [ | 23,693 | 288 | 44.7% |
|
| [ | 13,298 | 283 | 44.9% |
|
| [ | 7,844 | 260 | 44.5% |
|
| [ | 28,670 | 271 | 44.8% |
|
| [ | 21,094 | 305 | 54.3% |
|
| [ | 3,170 | 80 | 43.9% |
|
| [ | 11,104 | 214 | 45.0% |
Species with sequenced genome is highlighted in bold.
Data sources of plastid gene sequences included in our phylogenetic analyses.
|
|
|
|
| |
|---|---|---|---|---|
|
| NC_004766 | 46 | 42.8% | |
|
| NC_005086 | 44 | 40.1% | |
|
| NC_010548 | 46 | 38.0% | |
|
| NC_020319 | 47 | 40.3% | |
|
| NC_016986 | 47 | 40.4% | |
|
| NC_011942 | 33 | 38.6% | |
|
| NC_008788 | 44 | 40.6% | |
|
| NC_021456 | 36 | 40.7% | |
|
| NC_016069 | 35 | 40.7% | |
|
| NC_004677 | 36 | 40.5% | |
|
| NC_021440 | 36 | 40.4% | |
|
| NC_013086 | 47 | 50.8% | |
|
| NC_010654 | 32 | 37.2% | |
|
| JQ770198-JQ770303 | 32 | 41.4% |
Figure 2Species trees inferred from (A) 305 nuclear genes and (B) 47 plastid genes using the coalescent method (STAR).
Bootstrap percentages (BPs) from STAR/RAxML are indicated above each branch; an asterisk indicates that the clade is supported by 100 BPs from both STAR and RAxML. Branch lengths were estimated by fitting the concatenated matrices to the inferred topology from STAR.
Figure 3Summary of bootstrap percentages (BPs) from coalescent and concatenation analyses using different gene subsampling and rate partitions.
(A) BPs from coalescent and concatenation analyses using different gene subsampling. The 305 nuclear genes were subsampled for four different gene size categories (i.e., 25, 47, 100, or 200 genes; 10 replicates each), and the 47 plastid genes were subsampled for 25 genes (10 replicates). Cells with hatching indicate that support for the placement of Ginkgo biloba from all replicates is below 80 BP; colored cells indicate relationships that received bootstrap support ≥80 BP from at least one replicate (pink = cycads plus Ginkgo as sister to all remaining extant gymnosperms, yellow = Ginkgo alone as sister to conifers and gnetophytes within extant gymnosperms; see also Figure 1D). (B) BPs from coalescent and concatenation analyses across different nucleotide rate partitions. Parsimony informative sites in concatenated matrices were sorted based on estimated evolutionary rates, and subsequently divided into two equal partitions. The index of substitution saturation (I SS) was used to measure nucleotide substitution saturation for sites within each rate partition. The two critical I SS values, i.e., I SS.C1 and I SS.C2, were estimated using an asymmetrical and symmetrical topology, respectively (for data including more than 32 species, only values estimated from 32 terminals are shown here).