| Literature DB >> 20444286 |
Hyun Jung Park1, Guohua Jin, Luay Nakhleh.
Abstract
BACKGROUND: Maximum parsimony is one of the most commonly used criteria for reconstructing phylogenetic trees. Recently, Nakhleh and co-workers extended this criterion to enable reconstruction of phylogenetic networks, and demonstrated its application to detecting reticulate evolutionary relationships. However, one of the major problems with this extension has been that it favors more complex evolutionary relationships over simpler ones, thus having the potential for overestimating the amount of reticulation in the data. An ad hoc solution to this problem that has been used entails inspecting the improvement in the parsimony length as more reticulation events are added to the model, and stopping when the improvement is below a certain threshold.Entities:
Mesh:
Year: 2010 PMID: 20444286 PMCID: PMC2874802 DOI: 10.1186/1471-2148-10-131
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1A phylogenetic tree (a) and a phylogenetic network obtained from it by adding a horizontal edge .
Figure 2(a) A scenario where none of four HGT edges identified individually in 100 bootstrap samples has good support (the recipient of each of the four edges is the same node . (b) When combined, thus allowing for ambiguity in pinpointing the exact source, a well-supported hypothesis of an HGT emerges.
Mitochondrial gene data sets and HGTs postulated by Bergthorsson et al. and those computed by the MP analysis (NEPAL). 'donor' denotes the group from which the gene was transferred (in all cases, the recipient is Amborella).
'SH' denotes support of the HGT events as computed by the Shimodaira-Hasegawa (SH) test and reported by Bergthorsson et al. (values lower than 0.05 indicate high support, and NS indicates support is not significant). The 'b1', 'b2', 'b3', and 'b4' columns correspond to the support values from Formula 3 for adding the first, second, third, and fourth HGT edges inferred by the MP analysis. Since adding HGT edges stops once a weakly supported edge is encountered, a '-' entry under these columns indicates that adding HGT edges was stopped before. B = Bryophyte, M = Moss, E = Eudicot, and A = Angiosperm.
Figure 3HGT edges (in red) inferred by the MP criterion, with support values, in parentheses, computed based on Formula (3). Ambiguity in the source is denoted by a circle (when drawing a circle was possible) or a multi-source edge. Amborella genes are colored in red, core eudicot genes and moss genes are colored in blue and green. Branch refinements are performed for nad5, ccmFN1, nad5intron, and nad7intron at the places marked with solid circles.
Results of the MP analysis on 20 simulated data sets.
The rows are sorted by the number of true HGTs simulated for each gene data set. The 'd1' and 'd2' columns denote the distance, in terms of the number of branches on the species tree, between the source and recipient of the first and second HGT events simulated, respectively. The 'b1', 'b2', and 'b3' columns correspond to the support values from Formula 3 for adding the first, second, and third HGT edges inferred by the MP analysis. Since adding HGT edges stops once a weakly supported edge is encountered, a '-' entry under these columns indicates that adding HGT edges was stopped before.