| Literature DB >> 24371824 |
Alexander Bolshoy1, Valery M Kirzhner2.
Abstract
Ancestral sequence reconstruction is a well-known problem in molecular evolution. The problem presented in this study is inspired by sequence reconstruction, but instead of leaf-associated sequences we consider only their lengths. We call this problem ancestral gene length reconstruction. It is a problem of finding an optimal labeling which minimizes the total length's sum of the edges, where both a tree and nonnegative integers associated with corresponding leaves of the tree are the input. In this paper we give a linear algorithm to solve the problem on binary trees for the Manhattan cost function s(v, w) = |π(v) - π(w)|.Entities:
Mesh:
Year: 2013 PMID: 24371824 PMCID: PMC3858891 DOI: 10.1155/2013/472163
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Assignment of bottom-up stage values (left, right, Z, and X) in 2-leaf trees. The “gain” penalty C 1 = 50; the “loss” penalty C 2 = 30. Optimal labels are in red.
Figure 2Assignment of bottom-up stage values (left, right, Z, and X) in 3-leaf trees. The “gain” penalty C 1 = 50; the “loss” penalty C 2 = 30. Optimal labels are in red.
Figure 3Assignment of bottom-up stage values (left, right, Z, and X) in a 4-leaf tree with all four leaves labeled by positive integers. The “gain” penalty C 1 = 50; the “loss” penalty C 2 = 30. Optimal labels are in red.
Figure 4Labeling of a “peculiar” tree. The left subtree has three zero and one nonzero leaf, while the right subtree has three nonzero leaves. The “gain” penalty C 1 = 50; the "loss" penalty C 2 = 30. Optimal labels are in red.
Figure 5Archaeal part of Figure 4(b) from [7] labeled accordingly to COG0835.

Pseudocode 1
List of archaeal genomes for Figure 4.
| No. | Name | Kingdom | Group |
|---|---|---|---|
| 0 |
| A | C |
| 1 |
| A | E |
| 8 |
| A | C |
| 29 |
| A | E |
| 30 |
| A | E |
| 31 |
| A | E |
| 32 |
| A | E |
| 35 |
| A | C |
| 36 |
| A | C |
| 37 |
| A | C |
| 38 |
| A | E |
| 39 |
| A | E |
| 40 |
| A | E |
| 41 |
| A | E |
| 42 |
| A | E |
| 43 |
| A | E |
| 44 |
| A | E |
| 45 |
| A | E |
| 46 |
| A | E |
| 47 |
| A | E |
| 48 |
| A | E |
| 49 |
| A | E |
| 50 |
| A | E |
Notations of the groups: E: Euryarchaeota, C: Crenarchaeota.
Protein lengths of the chemotaxis signal transduction proteins. Archaeal part of COG0835.
| Number | COG | Length | Genome name |
|---|---|---|---|
| 1 | 835 | 160 |
|
| 29 | 835 | 144 |
|
| 29 | 835 | 328 |
|
| 30 | 835 | 132 |
|
| 30 | 835 | 178 |
|
| 31 | 835 | 132 |
|
| 31 | 835 | 178 |
|
| 39 | 835 | 159 |
|
| 41 | 835 | 146 |
|
| 42 | 835 | 146 |
|
| 43 | 835 | 146 |
|
| 44 | 835 | 147 |
|
| 46 | 835 | 182 |
|
| 46 | 835 | 184 |
|
| 47 | 835 | 173 |
|
| 48 | 835 | 159 |
|
| 48 | 835 | 189 |
|
| 50 | 835 | 124 |
|
| 50 | 835 | 167 |
|
| 50 | 835 | 169 |
|
| 50 | 835 | 169 |
|
| 50 | 835 | 174 |
|
| 50 | 835 | 176 |
|
| 50 | 835 | 183 |
|
| 50 | 835 | 187 |
|
| 50 | 835 | 189 |
|
| 50 | 835 | 190 |
|
| 50 | 835 | 198 |
|
| 50 | 835 | 200 |
|
| 50 | 835 | 344 |
|
| 50 | 835 | 779 |
|