| Literature DB >> 19749981 |
Valery A Shepelev1, Alexander A Alexandrov, Yuri B Yurov, Ivan A Alexandrov.
Abstract
Alpha satellite domains that currently function as centromeres of human chromosomes are flanked by layers of older alpha satellite, thought to contain dead centromeres of primate progenitors, which lost their function and the ability to homogenize satellite repeats, upon appearance of a new centromere. Using cladistic analysis of alpha satellite monomers, we elucidated complete layer patterns on chromosomes 8, 17, and X and related them to each other and to primate alpha satellites. We show that discrete and chronologically ordered alpha satellite layers are partially symmetrical around an active centromere and their succession is partially shared in non-homologous chromosomes. The layer structure forms a visual representation of the human evolutionary lineage with layers corresponding to ancestors of living primates and to entirely fossil taxa. Surprisingly, phylogenetic comparisons suggest that alpha satellite arrays went through periods of unusual hypermutability after they became "dead" centromeres. The layer structure supports a model of centromere evolution where new variants of a satellite repeat expanded periodically in the genome by rounds of inter-chromosomal transfer/amplification. Each wave of expansion covered all or many chromosomes and corresponded to a new primate taxon. Complete elucidation of the alpha satellite phylogenetic record would give a unique opportunity to number and locate the positions of major extinct taxa in relation to human ancestors shared with extant primates. If applicable to other satellites in non-primate taxa, analysis of centromeric layers could become an invaluable tool for phylogenetic studies.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19749981 PMCID: PMC2729386 DOI: 10.1371/journal.pgen.1000641
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Phylogenetic trees of AS monomers in human chromosomes 8, 17, and X.
Each terminal branch represents an AS monomer. The major branches marked by colored arches correspond to colored AS arrays in Figure 2 and monomeric types in Table 1. The positions of runaway monomers that appear in a “wrong” cluster are indicated by dots (1–3 monomers) or squares (4–15 monomers). (A) 1,434 monomers of Xp pericentromeric region plus W1–W5 consensus monomers representing the current SF3 centromere (indicated by arrows, cluster with R1R2). “Old” clades are highlighted by a grey box. All other clades belong to the “ancient” group. The 171 bp AS clade is indicated by an open arch. Branches formed by R1 and R2 monomers are indicated. (B) Monomers of Xq pericentromeric region were added to those shown in (A) (2,516 monomers total). Xp monomers are shown in black and Xq monomers in khaki. No major new clades appear. Grey, olive, and green clades are Xp-specific, except for a few runaway monomers. In blue, yellow, and yellow-striped branches, the monomers from both arms of X chromosome are well-mixed. (C) Phylogenetic tree of the red, yellow-striped, and yellow monomers from chromosomes 8, 17, and X (2,588 monomers). Due to a large number of red monomers in chromosome 8, every 5th 8p and every 4th 8q red monomer were taken into analysis. Monomers from different chromosomes (black, blue, and purple for 8, 17, and X, respectively) mix well on the tree. Subclades in the red branch are indicated by letters B, C, J, and G above the red arch.
Structural features in dead AS layers.
| Layer | Monomer length (bp) | Monomer types | Arrangement | Age group |
| Blue | 171 | R1 (B), R2 (A) | irregular | old |
| Yellow | 171 | M1 (A) | monomeric | old |
| Yellow-striped | 172 | V1 (A) | monomeric | old |
| Olive-green | 172 | H1 (A), H2 (A) | dimeric | ancient |
| Red | 172 | H3 (A) | monomeric | ancient |
| Grey | 172 | H4 (A) | monomeric | ancient |
*: The monomer types for blue and yellow layers were assigned previously [25],[26], and for the rest of the layers, in this paper. Division of AS into pure A type and AB type was introduced previously [1],[26].
**: Division of the old AS into “old” and “ancient” groups is introduced in this paper as a result of primate and cladistic analysis.
Figure 2Alpha satellite layers in human chromosomes 8, 17, and X.
Each colored domain represents an AS array composed of monomers that belong to the same branch on phylogenetic trees shown in Figure 1. Chromosome domains and the arches marking different branches are in the same colors in Figures 1 and 2. Colored layers are partially symmetrical around the centromere on one chromosome and partially shared between different chromosomes. The p and q arms of the chromosomes are indicated. The diagonally crossed white and light blue central boxes represent the new AS HOR domains, which form current centromeres. They are shown not to scale. For chromosome 17, we show the presumed organization of the HOR domain. The central D17Z1 16-mer HOR array is flanked by two homogenous 14-mer HOR arrays, D17Z1-B on the p arm [24], and a distinct one termed D17Z1-C on the q arm (see Text S1 for details).
Search for human AS layers in various primates.
| Primate | Grey H4 (PA7), 40 myr | Red H3 (PA5), 26 myr | Olive-green H1H2 (PA4/PA5), 23–26 myr | Yellow-striped V1 (PA4), 23 myr | Yellow M1 (PA3/PA4), 16–23 myr | Blue R1R2 (PA3), 16 myr | NEW |
| Tarsier (PA8), 58 myr | |||||||
|
| + | ||||||
|
| + | + | + | ||||
| Gibbon, apes (PA4), 18 myr | + | + | + | + | + | ||
| Orangutan, great apes (PA3), 14 myr | + | + | + | + | + | + | |
| Gorilla, African apes (PA2), 7 myr | + | + | + | + | + | + | + |
In the “Primate” column after the name of the primate and/or primate group, the youngest major L1 repeat shared with humans (in parentheses; our data) and the proposed age of the taxon [32] in million years (myr) are indicated. In other columns, the AS layer, the oldest L1 repeat found in the layer (in parentheses; our data), and the age of this L1 repeat estimated using mutation rate of 0.216% per myr [31] are indicated. In the new HOR arrays, L1 repeats are not present in significant numbers [2]. The presence of AS layers in certain species is indicated by “+”.
Average identity of monomers in AS layers.
| Layer/chromosome | Grey H4 | Red H3 | Olive H1 | Green H2 | Yellow-striped V1 | Yellow M1 |
| Xp | 0.711 | 0.733 | 0.785 | 0.790 | 0.818 | 0.845 |
| 0.043 | 0.046 | 0.038 | 0.038 | 0.033 | 0.038 | |
| 292 | 194 | 301 | 282 | 288 | 28 | |
| Xq | 0.737 | 0.812 | 0.833 | |||
| 0.040 | 0.038 | 0.024 | ||||
| 468 | 85 | 88 | ||||
| 0.837 | ||||||
| 0.023 | ||||||
| 136 | ||||||
| 8p | 0.732 | |||||
| 0.039 | ||||||
| 1566* | ||||||
| 8q | 0.725 | 0.812 | ||||
| 0.040 | 0.035 | |||||
| 1238* | 76 | |||||
| 17p | 0.730 | 0.803 | ||||
| 0.042 | 0.039 | |||||
| 168 | 101 | |||||
| 0.723 | ||||||
| 0.045 | ||||||
| 143 | ||||||
| 17q | 0.723 | 0.850 | ||||
| 0.045 | 0.046 | |||||
| 167 | 23 | |||||
| Mean within a monomer type (%) | 71 | 73 | 79 | 79 | 81 | 84 |
| Mean between monomer types (%) | 66 | 67 | 70 | 70 | 72 | 69 |
The mean identity of monomers in pairwise comparisons within each array (upper line), standard deviation (middle line), and the number of monomers are indicated for each AS layer, as depicted in Figure 2. In cases where two domains of the same color are present on one chromosome arm, figures for both of them are presented separately; multiple red arrays in 8q are presented summarily. In cases marked by an asterisk, only 1,000 monomers were used in comparisons. In the two bottom lines, the mean of each column representing the identity across a whole layer and the mean identity obtained in “between layers” comparisons (this layer to all the others) are presented. It can be seen that the intra-array divergence and hence the age of the arrays decrease towards the centromere.
Age of AS layers calculated by different methods.
| Layer | “Calculated age” by array divergence (myr) | Age by primate & L1 dating (myr) |
| Grey (H4) | 92 | 40–58 |
| Red (H3) | 84 | 26–40 |
| Olive-green (H1H2) | 62 | 23–26 |
| Yellow-striped (V1) | 55 | 18–23 |
| Yellow (M1) | 45 | 16–18 |
The “mock” age estimates calculated from intra-array divergence on the basis of 0.2% mutation rate are compared to “valid” estimates obtained from L1 and primate dating. The age calculated from the divergence figures is about twice as old as provided by other estimates.