| Literature DB >> 19393055 |
Claus Kemkemer1, Matthias Kohn, David N Cooper, Lutz Froenicke, Josef Högel, Horst Hameister, Hildegard Kehrer-Sawatzki.
Abstract
BACKGROUND: Genome comparisons have made possible the reconstruction of the eutherian ancestral karyotype but also have the potential to provide new insights into the evolutionary inter-relationship of the different eutherian orders within the mammalian phylogenetic tree. Such comparisons can additionally reveal (i) the nature of the DNA sequences present within the evolutionary breakpoint regions and (ii) whether or not the evolutionary breakpoints occur randomly across the genome. Gene synteny analysis (E-painting) not only greatly reduces the complexity of comparative genome sequence analysis but also extends its evolutionary reach.Entities:
Mesh:
Year: 2009 PMID: 19393055 PMCID: PMC2681463 DOI: 10.1186/1471-2148-9-84
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Number of genes in different species for which unambiguous orthologies to a total of 28197 annotated human genes were identified using the BioMart program.
| 17733 | |
| 17796 | |
| 16263 | |
| 16689 | |
| 15257 | |
| 12591 | |
a: Orthologous genes were determined using the program BioMart
(; ENSEMBL release 46).
Figure 1Ideogram of human chromosome 1 (. The human chromosome coordinates of the breakpoint intervals are given to the right of the human ideogram in Mb. The chromosome number of the orthologous segments in the analyzed species is indicated to the right of each conserved segment. Chromosomal breakpoints have been evenly spaced in order to optimize visualization of the conserved syntenic segments. The resulting ideograms of the chromosomes and conserved segments are therefore not drawn to scale. The centromeric region is indicated by a black horizontal bar on the human ideogram. The stippled red lines indicate breaks present in all analyzed non-human genomes and which may thus be attributable to rearrangements specific to the primate lineage (see Table 3). Black lines within the ideograms indicate breaks within the contiguous sequence that probably resulted from intrachromosomal rearrangements caused by inversions. Stippled green lines indicate the positions of 'reused breakpoints', defined as locations in which breakpoints were found to map to the same genomic intervals in at least three species from two different clades. The complete set of E-painting results for chromosomes 1–22 is given in Additional file 1. un: undetermined.
Figure 2The colour code for chromosomal regions 1–38, X and Z chromosomes was employed to indicate regions of conserved synteny in Figure 1 and Additional file 1. The same colour code was also used to depict the ancestral boreoeutherian karyotype indicated in Figure 3.
Figure 3The reconstructed ancestral boreoeutherian karyotype, derived from synteny analyses of human, mouse, rat, cow, dog, opossum and chicken genome sequences, and based on the identified orthology blocks, is depicted in Additional file 1. The ideograms represent the 22 autosomal syntenic groups of the ancestral genome as well as the ancestral X chromosome. The orthologies to the human genome are given for entire chromosomes below each chromosomal ideogram and to the right of the ideograms for the individual conserved segments. For conserved segments representing portions of human chromosomes, the positions of the boundaries of the orthologous segments in the human genome are listed above the ideograms in Mb. Boundaries in agreement with previous findings, and based on comparative cytogenetics, are given in black whereas the boundaries refined in this study are indicated in blue. The sizes of the chromosomal ideograms reflect the approximate size ratios of the euchromatic orthologous segments in the human genome. The association of the segment orthologous to HSA10p with segments orthologous to HSA12 and HSA22 is based on comparative chromosome painting data from carnivores [61], hedgehog, several afrotherian [10,60] and xenarthran [55,56] species as well as the opossum genome sequence [30]. The comparative chromosome painting data for afrotherian and xenarthran species further indicate that the syntenic groups of the ancestral boreoeutherian karyotype are identical with those of the eutherian karyotype.
Number of evolutionary breakpoint intervals per chromosome and their characteristics.
| 47 | 11.89 (4.84) | 0.11 | 0.25 | 8.67 | |
| 46 | 17.87 (7.34) | 0.15 | 0.40 | 5.66 | |
| 45 | 13.60 (6.82) | 0.14 | 0.30 | 5.58 | |
| 21 | 8.78 (4.57) | 0.12 | 0.42 | 4.35 | |
| 42 | 18.24 (10.09) | 0.17 | 0.43 | 5.11 | |
| 28 | 6.24 (3.67) | 0.12 | 0.23 | 6.48 | |
| 37 | 8.21 (5.18) | 0.11 | 0.23 | 6.25 | |
| 30 | 9.94 (6.80) | 0.21 | 0.34 | 5.06 | |
| 20 | 6.31 (4.56) | 0.23 | 0.33 | 6.54 | |
| 20 | 7.39 (5.44) | 0.11 | 0.39 | 6.05 | |
| 31 | 8.58 (6.37) | 0.16 | 0.28 | 10.36 | |
| 17 | 3.06 (2.32) | 0.13 | 0.18 | 8.34 | |
| 9 | 2.32 (2.03) | 0.14 | 0.26 | 3.14 | |
| 11 | 2.87 (2.69) | 0.11 | 0.26 | 6.23 | |
| 16 | 2.12 (2.10) | 0.06 | 0.13 | 6.57 | |
| 12 | 4.21 (4.73) | 0.03 | 0.38 | 10.33 | |
| 31 | 5.63 (7.13) | 0.09 | 0.19 | 15.70 | |
| 11 | 7.71 (10.16) | 0.59 | 0.77 | 3.86 | |
| 17 | 1.49 (2.34) | 0.05 | 0.09 | 22.47 | |
| 20 | 3.21 (5.14) | 0.07 | 0.17 | 9.72 | |
| 6 | 0.46 (0.98) | 0.04 | 0.08 | 5.79 | |
| 9 | 1.60 (3.23) | 0.12 | 0.18 | 10.28 | |
Summary of the evolutionary breakpoint intervals specific to the primate lineage.
| 1 | 1 | 142.78 | 142.90 | |
| 2 | 1 | 223.81 | 224.23 | 41 |
| 3 | 1 | 224.99 | 225.18 | |
| 4 | 1 | 243.42 | 243.79 | 21 |
| 5 | 2 | 50.49 | 53.81 | |
| 6 | 2 | cent | cent | 72 |
| 7 | 2 | 110.02 | 110.20 | 62 |
| 8 | 2 | 113.72 | 114.19 | 93 |
| 9 | 2 | 132.07 | 133.00 | 83 |
| 10a | 3 | ptel | ptel | 114 |
| 11 | 3 | 12.86 | 13.00 | 104,5,6,144,5,6,164,5,6 |
| 12 | 3 | 15.12 | 15.22 | |
| 13 | 3 | 74.62 | 77.17 | |
| 14 | 3 | 127.18 | 127.28 | 115,7,175,7 |
| 15 | 3 | 130.52 | 130.58 | |
| 16 | 3 | 131.18 | 131.64 | 116 |
| 17a | 3 | qtel | 199.33 | 147 |
| 18 | 5 | 147.57 | 147.8 | |
| 19 | 7 | 6.64 | 7.05 | 228 |
| 20 | 7 | 55.53 | 55.64 | |
| 21 | 7 | 75.90 | 76.40 | 239 |
| 22 | 7 | 97.15 | 97.26 | 198 |
| 23 | 7 | 101.85 | 101.98 | 219 |
| 24 | 8 | 7.87 | 8.21 | 2610,11,2910,11 |
| 25 | 8 | 9.68 | 9.95 | 2812 |
| 26 | 8 | 11.76 | 12.03 | 2410 |
| 27 | 8 | 17.99 | 18.11 | |
| 28 | 8 | 29.18 | 29.25 | 2512 |
| 29 | 8 | cent | cent | 2411 |
| 30 | 9 | 38.41 | 39.06 | |
| 31 | 9 | 91.79 | 92.05 | |
| 32 | 9 | 94.17 | 94.4 | |
| 33 | 9 | 96.88 | 97.08 | |
| 34 | 10 | 27.57 | 27.83 | |
| 35 | 10 | 35.97 | 37.45 | |
| 36 | 10 | 51.56 | 51.62 | 3713 |
| 37 | 10 | 88.94 | 89.25 | 3613 |
| 38 | 11 | 3.20 | 3.62 | 3914 |
| 39 | 11 | 70.94 | 71.32 | 3814 |
| 40 | 12 | 106.67 | 107.03 | |
| 41 | 13 | 40.14 | 40.40 | 4215 |
| 42 | 13 | 52.06 | 52.12 | 4115 |
| 43 | 15 | 26.24 | 27.00 | |
| 44 | 15 | 30.57 | 30.69 | 4616 |
| 45 | 15 | 76.02 | 76.07 | |
| 46 | 15 | 100.08 | 100.16 | 4416 |
| 47 | 16 | 31.45 | 31.98 | |
| 48 | 16 | cent | cent | |
| 49a | 17 | ptel | ptel | 5417 |
| 50 | 17 | 6.68 | 6.84 | 5118 |
| 51 | 17 | 15.58 | 15.74 | 5018 |
| 52 | 17 | 16.65 | 16.78 | 5319 |
| 53 | 17 | 20.31 | 20.84 | 5219 |
| 54 | 17 | cent | cent | 4917,20,5617,20 |
| 55 | 17 | 25.88 | 26.08 | |
| 56 | 17 | 26.31 | 26.45 | 5420 |
| 57 | 17 | 33.18 | 33.71 | 5821 |
| 58 | 17 | 57.50 | 57.81 | 5721 |
| 59 | 18 | 39.11 | 40.54 | |
| 60 | 19 | 6.94 | 6.97 | 6122 |
| 61 | 19 | 8.70 | 8.79 | 6022 |
| 62 | 19 | 15.97 | 16.04 | |
| 63 | 19 | cent | cent | |
| 64 | 22 | 17.03 | 17.14 | |
| 65 | 22 | 23.31 | 23.44 | |
| 66 | 22 | 30.83 | 31.11 | |
a: These 3 inversion breakpoint regions were deduced, from the human genome sequence, to harbour the second breakpoint of the respective primate-specific inversions. They were, however, not evident as such in the sequences of the 6 species under investigation (mouse, rat, dog, cow, chicken and opossum) and hence are not indicated in Additional files 1 and 2.
b: The superscript numbers in the last column indicate putative inversions. Thus, chromosomes orthologous to HSA 4, 6, 14, 20 and 21 were not rearranged in the primate lineage.
Autosomal common fragile sites, whose locations on the human genome sequence have been demarcated by flanking markers, and their overlap with evolutionary breakpoint intervals.
| FRA2G [ | 169 | 170.2 | -- | ||
| FRA3B [ | SHGC86352 | 59.7 | RH41625 | 60.5 | -- |
| FRA4F [ | 90.8 | 96.7 | 95.62–95.73 | ||
| FRA6E [ | D6S1581 | 160.2 | D6S1719 | 165.9 | -- |
| FRA6F [ | SHGC144121 | 111.7 | SHGC82095 | 112.7 | -- |
| FRA7E [ | D7S1934 | 80.3 | SHGC104456 | 84.7 | 84.40–85.92 |
| FRA7G [ | SHGC143971 | 115.6 | RH44861 | 116.2 | -- |
| FRA7H [ | D7S614 | 130.2 | STSG33535FS | 130.4 | -- |
| FRA7I [ | SHGC153624 | 144.3 | SWSS2627 | 145.7 | -- |
| FRA7K [ | 109.9 | 110.7 | -- | ||
| FRA9E [ | D9S1866 | 106.9 | D9S154 | 118.4 | -- |
| FRA16D [ | SHGC150973 | 76.7 | WJ2755 | 77.8 | -- |
Evolutionary breakpoint intervals 'co-localizing' with known cancer-associated genes.
| 1.85 | 2.28 | |||
| 230.76 | 231.4 | |||
| 184.47 | 184.69 | |||
| 112.29 | 112.88 | |||
| 74.03 | 74.19 | |||
| 75.60 | 76.05 | |||
| 93.55 | 93.92 | |||
| 101.61 | 101.77 | |||
| 110.09 | 110.84 | |||
| 56.26 | 56.62 | |||
| 58.70 | 59.41 | |||
a: Evolutionary breakpoint intervals are indicated with respect to their position on the orthologous human chromosome.
b: Data on cancer-associated genes were extracted from the Atlas of Genetics and Cytogenetics in Oncology and Haematology .
Numbers of evolutionary breakpoint intervals on chromosomes 1 – 22, length of the respective chromosomes and gene density.
| 1 | 47 | 247,249,719 | 2189 | 8.85 | 0.19 |
| 2 | 46 | 242,951,149 | 1328 | 5.47 | 0.19 |
| 3 | 45 | 199,501,827 | 1112 | 5.57 | 0.23 |
| 4 | 21 | 191,273,063 | 797 | 4.17 | 0.11 |
| 5 | 42 | 180,857,866 | 903 | 4.99 | 0.23 |
| 6 | 28 | 170,899,992 | 1133 | 6.62 | 0.16 |
| 7 | 37 | 158,821,424 | 1023 | 6.44 | 0.23 |
| 8 | 30 | 146,274,826 | 747 | 5.11 | 0.21 |
| 9 | 20 | 140,273,252 | 929 | 6.62 | 0.14 |
| 10 | 20 | 135,374,737 | 834 | 6.16 | 0.15 |
| 11 | 31 | 134,452,384 | 1385 | 10.30 | 0.23 |
| 12 | 18 | 132,349,534 | 1080 | 8.16 | 0.14 |
| 13 | 9 | 114,142,980 | 361 | 3.16 | 0.08 |
| 14 | 11 | 106,368,585 | 669 | 6.29 | 0.10 |
| 15 | 16 | 100,338,915 | 641 | 6.39 | 0.16 |
| 16 | 12 | 88,827,254 | 925 | 10.41 | 0.13 |
| 17 | 31 | 78,774,742 | 1236 | 15.69 | 0.39 |
| 18 | 11 | 76,117,153 | 295 | 3.88 | 0.14 |
| 19 | 17 | 63,811,651 | 1443 | 22.61 | 0.27 |
| 20 | 20 | 62,435,964 | 617 | 9.88 | 0.32 |
| 21 | 6 | 46,944,323 | 284 | 6.05 | 0.13 |
| 22 | 9 | 49,691,432 | 519 | 10.44 | 0.18 |
a: according to the Ensembl database, version 47.36i. The gene density per chromosome was found to be correlated with the number of breakpoint intervals (r = 0.60; p = 0.0031).
Average transcript density of lineage-specific breakpoints observed for regions of 125 kb around the arithmetic centre of the evolutionary breakpoint interval.
| 83 | 25 | 166 | 120 | 56 | |
| 0.16 | 0.24 | 0.27 | 0.35 | 0.39 | |
| 29.76 | 27.76 | 17.52 | 16.20 | 20.32 | |
| 3.72-fold | 3.72-fold | 2.19-fold | 2.03-fold | 2.54-fold | |
NB. Estimates of both the average size of the breakpoint interval and the transcript density are sensitive to the relative-fold sequence coverage of the respective genomes. However, calculating values for the increase above the genome average transcript density allows valid inter-genome comparisons to be made.
Figure 4E-painting results for chromosomes . The stippled red lines indicate regions of primate-specific breakpoints. Black lines within the ideograms represent the positions of breaks in synteny which were probably caused by inversions. Unique colour codes link the HSA12q distal segment (Mb 107.03–132.00) and the central 22q segment (Mb 17.14–30.83), representing the smallest eutherian chromosome [10,12] (12b-22b in Figure 2), as well as the segments 12pter-12q (Mb 0–106.67), 22q proximal (Mb 14.4–17.03), and 22q distal (Mb 31.11–49.60) representing a medium-sized eutherian chromosome (12a-22a in Figure 2). In dog and cow, the HSA10p orthologous segment (Mb 0–37.45) bears a colour code that is different from the HSA12 and HSA22 orthologues and hence does not provide any evidence for an evolutionary association. However, the shared synteny on opossum chromosome 8 confirms previously performed chromosome painting data [11,56,60], strongly suggesting common ancestral HSA10p/12pq/22q orthology. The E-painting data from the murids are not informative in this regard.