| Literature DB >> 22731878 |
Matthew Parks1, Richard Cronn, Aaron Liston.
Abstract
BACKGROUND: Through next-generation sequencing, the amount of sequence data potentially available for phylogenetic analyses has increased exponentially in recent years. Simultaneously, the risk of incorporating 'noisy' data with misleading phylogenetic signal has also increased, and may disproportionately influence the topology of weakly supported nodes and lineages featuring rapid radiations and/or elevated rates of evolution.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22731878 PMCID: PMC3475122 DOI: 10.1186/1471-2148-12-100
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Phylogenetic hypotheses for genus . Alternate placements (indicated by dashed lines) of subsections Contortae and Krempfianae, as well as the clade consisting of Pinus merkusii and P. latteri are shown. The most common plastid-based resolution of these groups is indicated by gray shading. Tree topology and relative branch lengths reflective of data from Gernandt et al. [42], Parks et al. [31] and this study.
Average per site OV values of plastome regions for full plastome alignment
| average OV | 0.04546a (0.12833) | 0.03153b (0.11227) | 0.02110c (0.08880) | 0.00443e (0.03725) | 0.00462e (0.04255) |
| average OV without | | 0.01907c (0.08184) | | | |
| without | 0.01478d (0.06997) |
Values given are for alignment of all 113 Pinus and Pinaceae accessions. Standard deviations are given in parentheses. Mean values with different superscript letters are significantly different at α < 0.05 in Tukey’s HSD test, following one-way ANOVA supporting different means at p < 0.0001.
Figure 2Distribution of OV for variable plastome alignment positions. Schematic of the Pinus chloroplast genome with annotated protein-coding exons (blue), rRNA loci (yellow), tRNA loci (orange) and noncoding regions (green). The coding loci ycf1 and ycf2 are highlighted in light blue. The distribution of OV values > 0 is indicated by the internal histogram, as follows: red – most variable 4.6 kbp (A142165 to A136665 ); yellow – most variable sites from 4.6 to 8.3 kbp (A136565 to A133065 ); green – remaining sites with OV > 0.
Figure 3Trends in bootstrap support values and topologies for likelihood analyses of alignment partitions. For OV-based analyses, the following are shown: a) Distributions of bootstrap support values for all nodes. Circles represent median bootstrap support for each An partition size. b) Distribution of branch score metric (triangles) and partition metric (circles) values for tests of topological congruence between An and corresponding Bn data partitions. Filled data points correspond to An partitions sizes falling between final decrease of branch score metric values and start of decreases in overall bootstrap support values for An partitions. Partition metric values shown are 0.1× actual value in order to fit on same scale with branch score metric values.
Figure 4Distribution of bootstrap support values for phylogenetic position of three clades in genus . a) Bootstrap support values for placement of subsection Krempfianae . Circles correspond to placement of P. krempfii sister to subsection Gerardianae . b) Bootstrap support values for placement of Pinus merkusii / P. latteri . Circles correspond to placement of P. merkusii/P. latteri as sister to subsection Pinaster and triangles as sister to subsection Pinus. c) Bootstrap support values for placement of subsection Contortae. Circles correspond to placement of subsection Contortae as sister to subsection Australes and triangles as basal to both subsections Australes and Contortae. For all charts, filled data points correspond to An partition sizes falling between final decrease of branch score metric values and start of decrease in overall bootstrap support values for An partitions, as shown in Figure 3. Squares represent variable phylogenetic placements not including those represented by circles or triangles. Arrows in b) and c) indicate partition size at which bootstrap support for monophyly of clade falls below 100%.
Impact of long-branch exclusion on full alignment, Aand Adata partitions
| Full Alignment | (K + G) + S | (K + G) + S | (K + G) + S | (K + G) + S | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | C + (A + P) | C + (A + P) | C + (A + P) | C + (A + P) |
| (79) | (61) | (79) | (79) | (53) | (53) | (54) | (78) | (100) | (100) | (100) | (100) | |
| A136665 | (K + G) + S | (K + G) + S | (K + G) + S | (K + G) + S | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | C + (A + P) | C + (A + P) | C + (A + P) | C + (A + P) |
| (71) | (70) | (71) | (54) | (43) | (46) | (47) | (68) | (100) | (100) | (100) | (100) | |
| A133065 | (K + G) + S | (K + G) + S | (K + G) + S | (K + G) + S | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | (M/L + Pina.) + Pinus | P + (C + A) | P + (C + A) | P + (C + A) | C + (A + P) |
| (97) | (97) | (97) | (98) | (99) | (100) | (100) | (99) | (37/42) | (31/36) | (35/42) | (84) | |
For each combination, supported topology is given with maximum likelihood bootstrap support underneath in parentheses. Subsection and species abbreviations are as follows: A = Australes , C = Contortae , G = Gerardianae , K = P. krempfii , M/L = P. merkusii /P. latteri , P = Ponderosae, Pinast. = Pinaster , S = Strobus . Single outgroups used in most exclusive groups are described Methods and Materials.
Slopes of regression lines for plots of corrected versus uncorrected pairwise distances
| Full alignment | 0.9574 | 0.9682 | 0.9898 | 0.9893 | 0.9905 | 0.9925 | 0.9928 |
| (0.9571-0.9577) | (0.9681-0.9683) | (0.9895-0.9898) | (0.9888-0.9898) | (0.9893-0.9918) | (0.9911-0.9939) | (0.9921-0.9935) | |
| A136665 | 0.9604 | 0.9851 | 0.9935 | 0.9903 | 0.9916 | 0.9941 | 0.9948 |
| | (0.9601-0.9607) | (0.9850-0.9853) | (0.9929-0.9941) | (0.9897-0.9908) | (0.9903-0.9928) | (0.9923-0.9958) | (0.9939-0.9958) |
| A133065 | 0.9655 | 0.9954 | 0.9959 | 0.9950 | 0.9952 | 0.9955 | 0.9961 |
| (0.9653-0.9656) | (0.9950-0.9958) | (0.9953-0.9964) | (0.9938-0.9962) | (0.9932-0.9972) | (0.9935-0.9976) | (0.9953-0.9969) |
95% confidence intervals for slopes are shown in parentheses. Slopes with values below 1.0 represent increased levels of saturation in the alignment tested. Intercepts of all lines were significantly different than zero at p < ≤0, with the exception of section Pinus + P. ponderosa for treatment A133065, which was not significantly different than zero. 95% confidence intervals for intercepts significantly different than zero were all 1.8 × 10-6 ≤ y ≤ 0.00040.