| Literature DB >> 26448557 |
Yan Hou1, Michael D Nowak2, Virginia Mirré1, Charlotte S Bjorå1, Christian Brochmann1, Magnus Popp1.
Abstract
Restriction-site associated DNA sequencing (RAD-seq) has recently become an important method to generate genome-wide molecular data for species delimitation, phylogeography, and population genetic studies. However, very few empirical studies have so far tested its applicability in phylogenetic reconstruction. The alpine-arctic genus Diapensia was selected to study the origin of the disjunction between the Arctic and the Himalayan-Hengduan Mountains (HHM). However, a previous phylogenetic analysis based on one nuclear and four plastid DNA regions failed to resolve the oldest divergences in Diapensia as well as the relationship between the two HHM species. Here we reconstruct a fully resolved phylogeny of Diapensia and address the conflict between the currently accepted taxonomy and the gene trees in the HHM species using RAD-seq. Based on a data set containing 2,650 loci selected to maximize the number of parsimony informative sites and allowing for a high level of missing data (51%), the phylogeny of Diapensia was fully resolved and each of the four species was reciprocally monophyletic. Whereas the arctic D. lapponica was inferred as sister to the HHM clade in the previous study, the RAD-seq data resolved the two arctic species as sisters to the HHM clade. Similar relationships were inferred from a differently filtered data set with far fewer loci (114) and less missing data (21%), but with lower support and with one of the two HHM species as non-monophyletic. Bayesian concordance analysis and Patterson's D-statistic tests suggested that admixture has occurred between the two HHM species.Entities:
Mesh:
Year: 2015 PMID: 26448557 PMCID: PMC4598014 DOI: 10.1371/journal.pone.0140175
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Total geographical ranges (lines) and sampling sites (dots) of Diapensia.
HHM: Himalayan-Hengduan Mountains. Sample IDs refer to Table 1. The ranges of D. lapponica and D. obovata are redrawn after Hultén & Fries [44]. The photo of D. lapponica was attributed by Alinja (https://commons.wikimedia.org/wiki/File:Diapensia_lapponica_Kilpisj%C3%A4rvi_2012-07.jpg#/media/File:Diapensia_lapponica_Kilpisj%C3%A4rvi_2012-07.jpg)
Results after filtering and clustering RAD-seq data from 14 samples of Diapensia and 4 samples of the outgroup using pyRAD.
| Species | Sample ID | No. reads (× 106) | Clusters at 90% | Mean depth | No. consensus loci | No. loci | No. of pis | ||
|---|---|---|---|---|---|---|---|---|---|
| m4 | m14 | m4 | m14 | ||||||
|
| 55962 | 0.85 | 16075 | 46.91 | 15579 | 1853 | 110 | 4265 | 251 |
| 55969 | 1.14 | 7954 | 123.14 | 6717 | 1292 | 103 | 3035 | 222 | |
| 55976 | 0.75 | 7964 | 84.27 | 7691 | 1826 | 109 | 4254 | 244 | |
|
| 43266 | 0.54 | 5704 | 77.77 | 5529 | 1647 | 110 | 3860 | 248 |
| 47250 | 0.63 | 9966 | 53.88 | 9635 | 1650 | 109 | 3864 | 246 | |
| 47410 | 0.57 | 11688 | 41.72 | 10870 | 1456 | 106 | 3477 | 242 | |
| 55956 | 1.03 | 17934 | 53.68 | 17392 | 1936 | 113 | 4529 | 257 | |
|
| 53658 | 0.77 | 7262 | 92.17 | 6913 | 1787 | 113 | 4104 | 257 |
| 55953 | 1.23 | 7345 | 132.56 | 6568 | 1496 | 106 | 3512 | 234 | |
| 55954 | 0.38 | 3589 | 93.37 | 3311 | 650 | 72 | 1536 | 150 | |
| 6378 | 0.61 | 4957 | 103.87 | 4854 | 1571 | 109 | 3601 | 242 | |
|
| 56014 | 1.94 | 10328 | 142.37 | 8632 | 1699 | 112 | 3989 | 254 |
| 56025 | 3.11 | 15991 | 139.85 | 13594 | 2021 | 114 | 4645 | 258 | |
| 56031 | 1.12 | 11446 | 88.98 | 11176 | 1884 | 113 | 4239 | 254 | |
|
| 56079 | 0.20 | 4400 | 35.46 | 4124 | 82 | 13 | 199 | 30 |
| 56080 | 0.42 | 4820 | 80.66 | 4653 | 107 | 12 | 264 | 28 | |
|
| 56090 | 0.36 | 2865 | 96.75 | 2711 | 257 | 54 | 638 | 136 |
| 56091 | 0.18 | 1921 | 84.10 | 1787 | 261 | 53 | 626 | 141 | |
m4/m14: data matrix clustering RAD-seq reads at 90% similarity threshold and consisting of loci that shared by at least 4 (‘m4’) or 14 (‘m14’) samples.
aNumber of reads after quality filtering.
bClusters that passed filtering for 2× minimum coverage.
cAfter descarding all loci without parsimony informative sites (pis), blast filtering and trimming the last 5 bases from all loci.
Fig 2Correlation between three important statistics in RAD-seq data sets and clustering similarity thresholds.
Square: number of parsimony informative sites; triangle: total number of loci; and dots: number of loci that contain at least one parsimony informative site.
Fig 3Phylogeny of Diapensia inferred from two RAD-seq data sets (a to d).
(a) and (b) are based on the m4 data set, in which all loci were shared by at least 4 samples; and (c) and (d) are based on the m14 data set, in which all loci were shared by at least 14 samples. Sample IDs refer to Table 1. Maximum likelihood trees were estimated using RAxML; numbers above branches are bootstrap (BS) values generated from 1,000 replicates. Bayesian consensus trees were inferred using MrBayes; numbers above branches are posterior probabilities (PP). Asterisks on branches indicate BS = 100 or PP = 1.
Fig 4Primary concordance trees inferred at the α of 100 (a) and 0.1 (b) using BUCKy.
The α parameter represents the a priori level of discordance expected among loci, where a high α assumes a high level of discordance among the gene trees and a low α assumes a low level of discordance. 246 loci with at least two parsimony informative sites covering eight Diapensia samples were used in the BUCKy analyses. Sample IDs refer to Table 1. The concordance factors (CFs) and their 95% confidence intervals are shown on the branches; those in bold did not overlap with any conflicting CF, and those in bold and with an asterisk had no conflicting splits.
Four-taxon D-statistic test for introgression between Diapensia purpurea and D. himalaica.
| No. test | P1 | P2 | P3 | O | D | Std(D) | Z | ABBA | BABA | No. loci | pdisc |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | him1 | him2 | pur1 | A | -0.58 | 0.10 | 5.60 | 18.50 | 69.75 | 670.00 | 0.14 |
| 2 | him1 | him3 | pur1 | A | -0.72 | 0.08 | 9.39 | 18.75 | 113.50 | 980.00 | 0.15 |
| 3 | him2 | him3 | pur1 | A | 0.17 | 0.21 | 0.84 | 16.00 | 11.25 | 690.00 | 0.04 |
| 4 | him1 | him2 | pur2 | A | -0.80 | 0.07 | 12.25 | 11.25 | 99.25 | 784.00 | 0.15 |
| 5 | him1 | him3 | pur2 | A | -0.76 | 0.06 | 13.27 | 19.75 | 142.00 | 1176.00 | 0.15 |
| 6 | him2 | him3 | pur2 | A | 0.52 | 0.19 | 2.80 | 20.50 | 6.50 | 814.00 | 0.03 |
| 7 | him1 | him2 | pur3 | A | -0.89 | 0.04 | 24.38 | 7.25 | 126.00 | 790.00 | 0.19 |
| 8 | him1 | him3 | pur3 | A | -0.88 | 0.04 | 24.11 | 12.25 | 187.25 | 1182.00 | 0.19 |
| 9 | him2 | him3 | pur3 | A | 0.38 | 0.22 | 1.69 | 13.75 | 6.25 | 806.00 | 0.03 |
| 10 | pur1 | pur2 | him1 | A | 0.01 | 0.21 | 0.04 | 15.50 | 15.25 | 1096.00 | 0.04 |
| 11 | pur1 | pur3 | him1 | A | 0.50 | 0.08 | 6.08 | 75.00 | 24.75 | 1074.00 | 0.12 |
| 12 | pur2 | pur3 | him1 | A | 0.50 | 0.08 | 6.23 | 84.50 | 28.00 | 1283.00 | 0.11 |
| 13 | pur1 | pur2 | him2 | A | -0.16 | 0.31 | 0.50 | 10.25 | 14.00 | 771.00 | 0.03 |
| 14 | pur1 | pur3 | him2 | A | -0.27 | 0.24 | 1.14 | 9.50 | 16.50 | 709.00 | 0.04 |
| 15 | pur2 | pur3 | him2 | A | -0.03 | 0.21 | 0.16 | 14.00 | 15.00 | 832.00 | 0.04 |
| 16 | pur1 | pur2 | him3 | A | -0.14 | 0.16 | 0.85 | 10.25 | 13.50 | 1075.00 | 0.03 |
| 17 | pur1 | pur3 | him3 | A | 0.07 | 0.21 | 0.34 | 17.25 | 15.00 | 1034.00 | 0.03 |
| 18 | pur2 | pur3 | him3 | A | -0.01 | 0.17 | 0.04 | 20.75 | 21.00 | 1242.00 | 0.04 |
P1, P2 and P3: him1: D. himalaica 55962, him2: D. himalaica 55969, him3: D. himalaica 55976, pur1: D. purpurea 56014, pur2: D. purpurea 56025, pur3: D. purpurea 56031 (sample IDs refer to Table 1); O: outgroup ‘A’ consists of all individuals from the two arctic species D. lapponica and D. obovata. D-statistic values (D) and their standard deviation (Std(D)) are given for each test. ABBA, BABA: the number of alleles that support each pattern (the fractions are due to heterozygosity). No. loci: the number of loci analyzed in each test; pdisc: the percentage of discordance.
*Z-scores that are statistically significant after conversion to a two-tailed P-value and using α = 0.01 as a conservative cutoff for significance.