| Literature DB >> 21226965 |
Jialiang Yang1, Jun Li, Liuhuan Dong, Stefan Grünewald.
Abstract
BACKGROUND: As one of the most widely used parsimony methods for ancestral reconstruction, the Fitch method minimizes the total number of hypothetical substitutions along all branches of a tree to explain the evolution of a character. Due to the extensive usage of this method, it has become a scientific endeavor in recent years to study the reconstruction accuracies of the Fitch method. However, most studies are restricted to 2-state evolutionary models and a study for higher-state models is needed since DNA sequences take the format of 4-state series and protein sequences even have 20 states.Entities:
Mesh:
Year: 2011 PMID: 21226965 PMCID: PMC3030536 DOI: 10.1186/1471-2105-12-18
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The reconstruction accuracies from all the leaves as well as from a root-to-leaf path in an equal-branch complete binary tree . UA and AA denote the unambiguous and ambiguous accuracy respectively by using all the leaves. Figure 1 (a)-(d) show the trend of reconstruction accuracies with the increase of conservation probability for the number of states 2, 4, 5 and 20, respectively. The dashed line denotes the unambiguous reconstruction accuracy by using all the leaves, the solid line denotes the ambiguous reconstruction accuracy by using all the leaves, and the diamonds the unambiguous and ambiguous accuracy by using any root-to-leaf path. Note that the unambiguous and ambiguous accuracies by any root-to-leaf path are the same.
The estimated values of b for the number of states N = 2, ..., 25.
| Estimated values of | ||||||||
|---|---|---|---|---|---|---|---|---|
| b | 0.875 | 0.839 | 0.821 | 0.809 | 0.784 | 0.774 | 0.768 | 0.763 |
Table 1 lists the estimated value of b by using Matlab. For convenience, we only show the values for N = 2, 3, 4, 5, 10, 15, 20 and 25.
Figure 2Two kinds of comb-shaped trees. Figure 2 shows two kinds of comb-shaped trees: (a) an equal-branch comb-shaped tree, (b) a Hennigian comb-shaped tree with n leaves.
Figure 3The reconstruction accuracies from all the leaves as well as from a root-to-leaf path in an equal-branch comb-shaped tree . Figure 3 (a)-(d) show the trend of reconstruction accuracies with the increase of conservation probability for the number of states 2, 4, 5 and 20 respectively. The dashed line denotes the unambiguous reconstruction accuracy by using all the leaves, the solid line denotes the ambiguous reconstruction accuracy by using all the leaves, and the diamonds the unambiguous and ambiguous accuracy by using the nearest root-to-leaf path. Note that the unambiguous and ambiguous accuracies by the nearest root-to-leaf path are the same.
The estimated values for UA of Hennigian comb-shaped trees when n is large for the number of states N = 2, ..., 50.
| Estimated values of the limiting UA for Hennigian trees | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| UA | 0.3333 | 0.1765 | 0.1126 | 0.0797 | 0.0274 | 0.0145 | 0.0098 | 0.0053 | 0.0034 | 0.0022 |
Table 2 lists the estimated value of UA of Hennigian trees when n is large by using Matlab. For convenience, we only show the values for N = 2, 3, 4, 5, 10, 15, 20, 30, 40 and 50.
Figure 4The comparison of unambiguous and ambiguous reconstruction accuracy between 1000 randomly generated Yule trees and the equal-branch complete binary tree with 1024 leaves. Figure 4 (a)-(b) show the trend of ambiguous and unambiguous reconstruction accuracies with the increase of conservation probability for 1000 randomly generated Yule trees and the equal-branch complete binary tree with 1024 leaves. The dashed line denotes the trend of reconstruction accuracy on the equal-branch complete binary tree, the solid lines denote the trend of reconstruction accuracy on the randomly generated Yule trees.