| Literature DB >> 28912480 |
Xueqin Wang1,2, Xiaying Ye1,3,4, Lei Zhao1,3,4, Dezhu Li5,6, Zhenhua Guo7, Huifu Zhuang8.
Abstract
The temperate bamboos (tribe Arundinarieae, Poaceae) are strongly supported as monophyly in recent molecular studies, but taxonomic delineation and phylogenetic relationships within the tribe lack resolution. Here, we sampled 39 species (36 temperate bamboos and 3 outgroups) for restriction-site associated DNA sequencing (RAD-seq) with an emphasis on Phyllostachys clade and related clades. Using the largest data matrix for the bamboos to date, we were able to infer phylogenetic relationships with unparalleled resolution. The Phyllostachys, Shibataea, and Arundinaria clades defined from plastid phylogeny, were not supported as monophyletic group. However, the RAD-seq phylogeny largely agreed with the morphology-based taxonomy, with two clades having leptomorph rhizomes strongly supported as monophyletic group. We also explored two approaches, BWA-GATK (a mapping system) and Stacks (a grouping system), for differences in SNP calling and phylogeny inference. For the same level of missing data, the BWA-GATK pipeline produced much more SNPs in comparison with Stacks. Phylogenetic analyses of the largest data matrices from both pipelines, using concatenation and coalescent methods provided similar tree topologies, despite the presence of missing data. Our study demonstrates the utility of RAD-seq data for elucidating phylogenetic relationships between genera and higher taxonomic levels in this important but phylogenetically challenging group.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28912480 PMCID: PMC5599554 DOI: 10.1038/s41598-017-11367-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of data matrices produced through BWA-GATK pipeline.
| Matrix* | missing data | base pairs | SNPs | informative SNPs |
|---|---|---|---|---|
| s56 | 0 | 2357 | 1390 | 489 |
|
| 1.34% | 9416 | 5759 | 2076 |
|
| 7.64% | 147524 | 97588 | 35056 |
|
| 13.08% | 400796 | 272254 | 97104 |
*Minimum number of samples needed to retain a locus (total number of samples is 56). The data matrices used for phylogenetic analyses are shown in bold.
Summary of data matrices produced through Stacks pipeline.
| Matrix* | missing data | SNPs | informative SNPs | total loci | consensus loci | polymorphic loci | |
|---|---|---|---|---|---|---|---|
| m = 5 | s55 | 0 | 0 | 0 | 2 | 2 | 0 |
| s45 | 13.59% | 97 | 46 | 119 | 68 | 51 | |
| s35 | 29.29% | 911 | 241 | 456 | 149 | 307 | |
| s25 | 50.14% | 20855 | 6485 | 4651 | 319 | 4332 | |
|
| 57.86% | 67263 | 21357 | 13889 | 538 | 13351 | |
|
| 65.22% | 167674 | 53927 | 33678 | 1082 | 32596 | |
|
| 72.64% | 362823 | 119331 | 74448 | 2856 | 71592 | |
| m = 10 | s55 | 1.79% | 9 | 6 | 10 | 6 | 4 |
| s45 | 10.78% | 208 | 90 | 245 | 132 | 113 | |
| s35 | 24.31% | 537 | 182 | 433 | 192 | 241 | |
| s25 | 44.50% | 2589 | 777 | 1365 | 381 | 984 | |
|
| 54.52% | 6356 | 1957 | 2780 | 639 | 2141 | |
|
| 63.65% | 15202 | 4942 | 5665 | 1069 | 4596 | |
|
| 73.87% | 42788 | 14194 | 14097 | 2198 | 11899 | |
p = minimum numbers of populations required to process a locus. The data matrices used for phylogenetic analyses are shown in bold.
Figure 1Tree resolution increases as data are added to the phylogenetic analysis. The different colors represent eight major clades (see Fig. 2 for key). Values on branches are bootstrap support from 200 bootstrap replicates using RAxML’s rapid bootstrap algorithm. Topologies shown are the best tree from a full ML search. Bootstrap support values within major clades are not shown.
Figure 2Phylogeny of the temperate bamboos based on an ML analysis (matrix s45: 400,796 base pairs, and 272,254 variable sites). Nodes with posterior probalilities ≥95 are unlabeled. Rhizome type is indicated by branch thickness: thick lines = pachymorph rhizomes and thin lines = leptomorph rhizomes.
Bootstrap support for relationships within Bambusoideae.
| Clade | Genus | s55 | s50 | s45 | p20, m = 5 | p15, m = 5 | p10, m = 5 | p20, m = 10 | p15, m = 10 | p10, m = 10 |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 100/97 | 100/98 | 100/99 | 100/98 | 100/100 | 100/94 | 100/98 | 100/100 | 100/96 |
|
|
| 59/59 | 79/94 | 87/97 | 9/84 | 80/86 | 88/75 | 24/25 | 29/− | 48− |
|
|
| 100/97 | 100/97 | 100/95 | 100/− | 100/− | 100/50 | 92/− | 100/− | 100/− |
|
|
| 100/98 | 100/96 | 100/95 | 100/100 | 100/100 | 100/100 | 99/61 | 100/61 | 100/74 |
| Sino–Japanese |
| −/49 | 100/92 | 100/90 | 100/100 | 100/100 | 100/98 | 100/− | 100/15 | 100/29 |
|
|
| 100/73 | 100/88 | 100/91 | 100/77 | 100/98 | 100/83 | 87/44 | 100/− | 100/− |
| Alpine |
| 100/93 | 100/96 | 100/99 | 100/67 | 100/92 | 100/77 | −/− | −/− | 60/− |
|
|
| 100/100 | 100/99 | 100/100 | 100/100 | 100/100 | 100/100 | −/− | 45/− | 91/− |
Bipartitions that were absent are represented by a “−” Results are shown for two approaches: concatenated/species tree.
Figure 3The resolution of phylogenetic trees increases as a function of data matrix size. The colors indicate the proportion of branches with a bootstrap support above a given threshold.
Data characteristics with bootstrap value and tree length for each data matrix.
| Matrix | Mean ML bootstrap | tree length |
|---|---|---|
| s55 | 77.52 | 0.78 |
| s50 | 96.15 | 1.06 |
| s45 | 97.80 | 1.23 |
| p20, m = 5 | 95.33 | 3.26 |
| p15, m = 5 | 96.26 | 3.89 |
| p10, m = 5 | 95.76 | 4.73 |
| p20, m = 10 | 66.06 | 4.60 |
| p15, m = 10 | 76.33 | 4.97 |
| p10, m = 10 | 87.56 | 5.32 |
Figure 4Species trees from temperate bamboos estimated using SVDquartets for data matrices s55, s45, p20 (m = 5), p10 (m = 5). The different colors represent eight major clades (see Fig. 2 for key). Bootstrap values (from 100 replicates) are shown on nodes.