| Literature DB >> 23282410 |
Sergey Aganezov1, Max A Alekseyev.
Abstract
In comparative genomics, the rearrangement distance between two genomes (equal the minimal number of genome rearrangements required to transform them into a single genome) is often used for measuring their evolutionary remoteness. Generalization of this measure to three genomes is known as the median score (while a resulting genome is called median genome). In contrast to the rearrangement distance between two genomes which can be computed in linear time, computing the median score for three genomes is NP-hard. This inspires a quest for simpler and faster approximations for the median score, the most natural of which appears to be the halved sum of pairwise distances which in fact represents a lower bound for the median score.In this work, we study relationship and interplay of pairwise distances between three genomes and their median score under the model of Double-Cut-and-Join (DCJ) rearrangements. Most remarkably we show that while a rearrangement may change the sum of pairwise distances by at most 2 (and thus change the lower bound by at most 1), even the most "powerful" rearrangements in this respect that increase the lower bound by 1 (by moving one genome farther away from each of the other two genomes), which we call strong, do not necessarily affect the median score. This observation implies that the two measures are not as well-correlated as one's intuition may suggest.We further prove that the median score attains the lower bound exactly on the triples of genomes that can be obtained from a single genome with strong rearrangements. While the sum of pairwise distances with the factor 2/3 represents an upper bound for the median score, its tightness remains unclear. Nonetheless, we show that the difference of the median score and its lower bound is not bounded by a constant.Entities:
Mesh:
Year: 2012 PMID: 23282410 PMCID: PMC3526427 DOI: 10.1186/1471-2105-13-S19-S1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Breakpoint graph BG(.
Figure 2. A DCJ on these A-edges merges these AB-cycles and AC-cycles into a single AB-cycle and a single AC-cycle, and thus increases ts(A, B, C) by 2 (right panel).
Figure 3Breakpoint graph BG(.
Figure 4. The pairwise DCJ distances are d(A, B) = d(A, C) = d(C, B) = 8, d(A, M) = 3, d(B, M) = 5, and d(C, M) = 7. Right panel: Breakpoint graph of the same genomes A (red edges), C (green edges), and genome B' = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11) (blue edges) obtained from B by a single fusion. The genomes A, B', C have a different median genome M' = (1, -3, -4, -5, -2, -6, 7, -10, -11, 8, 9) (dashed edges) with the same median score ms(A, B', C) = 15 and larger triangle score ts(A, B', C) = 26. The pairwise DCJ distances are d(A, B')= d(C, B') = 9, d(A, C) = 8, d(A, M') = 4, d(B', M') = 6, and d(C, M') = 5. The median genomes M and M' were computed with GASTS [14].