| Literature DB >> 26696141 |
Cedric Chauve, Yann Ponty, João Zanetti.
Abstract
CONTEXT: The reconstruction of evolutionary scenarios for whole genomes in terms of genome rearrangements is a fundamental problem in evolutionary and comparative genomics. The DeCo algorithm, recently introduced by Bérard et al., computes parsimonious evolutionary scenarios for gene adjacencies, from pairs of reconciled gene trees. However, as for many combinatorial optimization algorithms, there can exist many co-optimal, or slightly sub-optimal, evolutionary scenarios that deserve to be considered. CONTRIBUTION: We extend the DeCo algorithm to sample evolutionary scenarios from the whole solution space under the Boltzmann distribution, and also to compute Boltzmann probabilities for specific ancestral adjacencies.Entities:
Mesh:
Year: 2015 PMID: 26696141 PMCID: PMC4686788 DOI: 10.1186/1471-2105-16-S19-S6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Example of an adjacency forest predicted from two reconciled gene trees. (Left) Species tree S, with two extant species A and B and an ancestral species C. (Middle) Two reconciled gene trees G1 and G2, with four extant genes in genome A, four extant genes in genome B and three ancestral genes in genome C. The set of extant adjacencies is (A1A3, B1B3, A2A4, B2B4) (Right) Parsimonious adjacency forest F composed of two adjacency trees. Blue dots are speciation nodes. Leaves are extant (species, genes, adjacencies), except when labelled with a red cross (gene loss). Green squares are (gene or adjacency) duplication nodes. Gene labels refer to the species of nodes. Every node of the adjacency tree is labelled by a couple of nodes from gene trees. Figure adapted from [5].
Figure 2Temperature dependency of the adjacency matrix. Each value of the (pseudo) temperature parameter kT induces different Boltzmann probabilities for ancestral and extant adjacencies, allowing to probe the continuum between a parsimonious model (kT → 0) and the uniform distribution kt → ∞.
Figure 3Partition function version of the DeCo dynamic programming scheme. This system of recurrences computes the α-rescaled partition function for two reconciled gene trees g1 and g2, using penalties AG and AB respectively for adjacency gains and breaks.
Characteristics of ancestral genes and adjacencies from observed ancestral adjacencies filtered by Boltzmann probability (leftmost column), with different kT values.
| Ancestral genes | Ancestral adjacencies | Syntenic conflicts | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ≥ 0.1 | 122,593 | 120,495 | 120,531 | 137,133 | 116,671 | 116,873 | 31,415 | 14,655 | 14,863 |
| ≥ 0.2 | 121,896 | 119,792 | 119,864 | 130,928 | 113,835 | 114,109 | 26,415 | 12,631 | 12,871 |
| ≥ 0.3 | 120,869 | 118,871 | 118,913 | 122,073 | 110,472 | 110,676 | 19,729 | 10,259 | 10,471 |
| ≥ 0.4 | 118,240 | 117,883 | 117,990 | 111,224 | 107,492 | 107,845 | 11,569 | 8,439 | 8,738 |
| ≥ 0.5 | 113,023 | 116,476 | 116,810 | 100,265 | 103,963 | 105,015 | 5,658 | 6,578 | 7,373 |
| ≥ 0.6 | 104,970 | 114,699 | 114,902 | 88,943 | 100,064 | 100,511 | 3,044 | 5,081 | 5,387 |
| ≥ 0.7 | 92,647 | 112,834 | 112,924 | 75,063 | 96,546 | 96,789 | 1,363 | 4,104 | 4,301 |
| ≥ 0.8 | 75,408 | 110,272 | 110,503 | 58,665 | 92,181 | 92,621 | 492 | 3,290 | 3,496 |
| ≥ 0.9 | 45,753 | 107,769 | 107,863 | 34,005 | 88,132 | 88,373 | 65 | 2,717 | 2,886 |
| = 1 | 13 | 16,470 | 106,903 | 7 | 10,798 | 87,048 | 0 | 0 | 2,690 |