| Literature DB >> 15601531 |
William J Murphy1, Guillaume Bourque, Glenn Tesler, Pavel Pevzner, Stephen J O'Brien.
Abstract
Rapidly developing comparative gene maps in selected mammal species are providing an opportunity to reconstruct the genomic architecture of mammalian ancestors and study rearrangements that transformed this ancestral genome into existing mammalian genomes. Here, the recently developed Multiple Genome Rearrangement (MGR) algorithm is applied to human, mouse, cat and cattle comparative maps (with 311-470 shared markers) to impute the ancestral mammalian genome. Reconstructed ancestors consist of 70-100 conserved segments shared across the genomes that have been exchanged by rearrangement events along the ordinal lineages leading to modern species genomes. Genomic distances between species, dominated by inversions (reversals) and translocations, are presented in a first multispecies attempt using ordered mapping data to reconstruct the evolutionary exchanges that preceded modern placental mammal genomes.Entities:
Mesh:
Substances:
Year: 2003 PMID: 15601531 PMCID: PMC3525001 DOI: 10.1186/1479-7364-1-1-30
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Figure 1Illustrating the effect of the distance threshold, . Suppose genome A has marker order 1,2,3,4,5,6; genome B has 1,2,3,6,5,4; and genome C has 3,1,2,4,5,6. The strips are [1,2], [3], [4-6]. The clusters at G = 4 (a) are [1,2] and [4-6] (the singleton [3] is deleted). At G = 5 (not shown), some of these are combined together. Specifically, d(2, 3) = 1 + 1 + 2 = 4 < 5, so an edge is added between markers 2 and 3, joining their clusters together. The clusters at G = 5 are [1-3] and [4-6] and the order within the clusters varies by genome, giving micro-rearrangements. At G = 6 and 7 (b), edges are added within clusters, but not between clusters, so clusters do not change. At G = 8 (not shown), two edges are added that would join the clusters into [1-6] Specifically, d(2, 4) = 2 + 4 + 1 = 7 < 8 and d(3, 4) = 1 + 3 + 3 = 7 < 8.
Conserved markers, clusters and reversal distances computed with GRIMM-synteny and Multiple Genome Rearrangement Algorithm analysis of comparative gene maps of 470 Type I gene homologues aligned between human (H), mouse (M) and cat (C) genomes.
| Distance threshold, | 4 | 5 | 6 | 8 | 20 |
|---|---|---|---|---|---|
| No. of markers retained | 276 | 345 | 379 | 409 | 432 |
| % of markers used | 59 | 73 | 81 | 87 | 92 |
| No. of clusters | 112 | 114 | 106 | 94 | 76 |
| d(H, M, C) | 222 | 234 | 216 | 201 | 160 |
| d(A, H*) + d(H*, H) | 19 + 10 = 29 | 18 + 9 = 27 | 19 + 10 = 29 | 15 + 9 = 24 | 11 + 6 = 17 |
| d(A, C*) + d(C*, C) | 13 + 15 = 28 | 13 + 11 = 24 | 13 + 8 = 21 | 10 + 12 = 22 | 13 + 5 = 18 |
| d(A, M*) + d(M*, M) | 25 + 41 = 66 | 31 + 49 = 80 | 32 + 40 = 72 | 21 + 43 = 64 | 24 + 32 = 56 |
| Tree score | 123 | 131 | 122 | 110 | 91 |
The common ancestor of all three genomes is denoted A, while preancestors for human, mouse and cat are denoted H*, M* and C*, respectively. The total distance between the three genomes, d(H, M, C), is defined as d(H, M) + d(M, C) + d(C, H). The tree score is defined as d(A,H) + d(A,M) + d(A,C)
Comparison of the Multiple Genome Rearrangement (MGR) algorithm-derived syntenies found in the common ancestors of the human-cat-mouse (HCM) and the human-cat-cow (HCC) datasets, with predicted syntenies based on comparative cytogenetic analyses (left-hand column8).
| HCM | HCM | HCM | HCC | HCC | HCC | |
|---|---|---|---|---|---|---|
| 3 & 21 | +,f | + | + | + | + | + |
| 4 & 8p | + | + | + | n.c.a. | - | - |
| 7a/16p | n.c.a. | n.c.a. | + | + | + | + |
| 12 & 22a | +,f | +,f | +,f | - | - | - |
| 12 & 22b | - | - | n.c.a. | n.c.a. | n.c.a. | n.c.a. |
| 14 & 15 | +,f | + | - | + | - | - |
| 16q/19q | - | + | + | - | + | + |
| 1p | +,f | + | +,f | + | + | + |
| 1q | - | - | - | - | - | - |
| 2p | +,f | +,f | +,f | +,f | +,f | +,f |
| 2q | + | + | + | + | + | +,f |
| 5 | +,f | +,f | +,f | +,f | + | - |
| 6 | + | + | + | + | + | + |
| 7b | +,f | -,f | +,f | +,f | +,f | +,f |
| 8q | + | + | + | + | + | + |
| 9 | + | + | + | + | + | + |
| 10p | +,f | +,f | +,f | +,f | +,f | +,f |
| 10q | +,f | - | +,f | +,f | +,f | +,f |
| 11 | - | - | +,f | + | + | + |
| 13 | +,f | +,f | +,f | + | + | +,f |
| 17 | + | + | + | + | + | + |
| 18 | + | +,f | +,f | + | + | + |
| 19p | +,f | +,f | +,f | +,f | + | + |
| 20 | +,f | +,f | +,f | +,f | +,f | + |
| X/f | + | + | + | + | + | + |
MGR analyses were performed using the indicated distance threshold, G
'+' means synteny is intact in the ancestor, '-' means synteny is disrupted in the ancestor, ' + ,f ' means synteny is intact and fused to another chromosome in the ancestor. n.c.a. = no chromosome available, due to lack of shared markers defining that conserved segment.
Figure 2Chromosome syntenic organisation imputed by the Multiple Genome Rearrangement Algorithm for preancestors (denoted with asterisks) of human, of cat and of mouse, and for the reconstructed common ancestor (A) of all three starting genomes (human, cat and mouse). The data consisted of 379 common markers grouped into 106 clusters shared across the three starting genomes (human, cat and mouse) derived using a distance threshold of G = 6. The length of each chromosome is proportional to its number of cluster segments. Each human chromosome (and its component cluster segment [boxed]) is assigned a unique colour. Each cluster segment is traversed by a diagonal line (top left to bottom right) to indicate relative order and orientation within the blocks. The number above each coloured block refers to the corresponding human chromosome homologue. Species chromosome designations are shown to the left. At the top of the figure, the phylogram indicates the number of rearrangements required to convert one genome into the other.
Conserved markers, clusters and reversal distances computed with GRIMM-synteny and the Multiple Genome Rearrangement Algorithm analysis of comparative gene maps of 311 Type I gene homologues aligned between human (H), cat (Ct) and cow (Cw) genomes.
| Distance threshold, | 4 | 5 | 6 | 8 | 20 |
|---|---|---|---|---|---|
| No. of markers used | 248 | 262 | 276 | 286 | 298 |
| % of markers used | 80 | 84 | 89 | 92 | 96 |
| No. of clusters | 81 | 74 | 70 | 60 | 44 |
| d(H, Ct, Cw) | 129 | 126 | 119 | 98 | 63 |
| d(A, H*) + d(H*, H) | 4 + 10 = 14 | 3 + 11 = 14 | 4 + 12 = 16 | 1 + 12 = 13 | 2 + 6 = 8 |
| d(A, Ct*) + d(Ct*, Ct) | 8 + 14 = 22 | 8 + 17 = 25 | 6 + 15 = 21 | 9 + 11 = 20 | 3 +7 = 10 |
| d(A, Cw*) + d(Cw*,Cw) | 12 + 22 = 34 | 11 + 18 = 29 | 10 + 17 = 27 | 7 + 13 = 20 | 5 + 11 = 16 |
| Tree score | 70 | 68 | 64 | 53 | 34 |
The common ancestor of all three genomes is denoted A, while preancestors for human, cat and cow genomes are denoted H*, Ct* and Cw*, respectively. The total distance between the three genomes, d(H, Ct, Cw), is defined as d(H, Ct) + d(Ct, Cw) + d(Cw, H). The tree score is defined as d(A, H) + d(A, Ct) + d(A, Cw)
Figure 3Chromosome syntenic organisation imputed by the Multiple Genome Rearrangement Algorithm for preancestors (denoted with asterisks) of human, of cat and of cow, and for the reconstructed common ancestor (A) of all three starting genomes (human, cat and cow). The data consisted of 276 common markers grouped into 70 clusters shared across the three starting genomes (human, cat and cow) derived using a distance threshold of G = 6. The length of each chromosome is proportional to its number of cluster segments. Each human chromosome (and its component cluster segments [boxed]) is assigned a unique colour. Each cluster segment is traversed by a diagonal line (top left to bottom right) to indicate relative order and orientation within the blocks. The number above each coloured block refers to the corresponding human chromosome homologue. Species chromosome designations are shown to the left. On the top of the figure, the phylogram indicates the number of rearrangements required to convert one genome into the other.
Proportion of different types of rearrangements for the human-cat-mouse and the human-cat-cow datasets
| Distance threshold, | 4 | 5 | 6 | 8 | 20 | |
|---|---|---|---|---|---|---|
| Human-cat-mouse | % reversals | 38.2 | 38.2 | 36.9 | 23.6 | 7.7 |
| % translocations | 57.7 | 58.8 | 59.0 | 70.9 | 87.9 | |
| % fusions | 1.6 | 2.3 | 2.5 | 0.9 | 3.3 | |
| % fissions | 2.4 | 0.8 | 1.6 | 4.5 | 1.1 | |
| Human-cat-cow | % reversals | 45.7 | 42.6 | 34.4 | 26.4 | 5.9 |
| % translocations | 40.0 | 38.2 | 45.3 | 49.1 | 55.9 | |
| % fusions | 8.6 | 7.4 | 7.8 | 9.4 | 14.7 | |
| % fissions | 5.7 | 11.8 | 12.5 | 15.1 | 23.5 |