| Literature DB >> 21767380 |
Abstract
BACKGROUND: Although their role in cell division is essential, centromeres evolve rapidly in animals, plants and yeasts. Unlike the complex centromeres of plants and aminals, the point centromeres of Saccharomcyes yeasts can be readily sequenced to distinguish amongst the possible explanations for fast centromere evolution.Entities:
Mesh:
Year: 2011 PMID: 21767380 PMCID: PMC3155921 DOI: 10.1186/1471-2148-11-211
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Alignment of CEN4 for all strains analysed. Centromeres are made up of two binding sites (CDEI and CDEIII) that are separated by an 87 bp CDEII. This CDEII region shows levels of polymorphism (θW = 0.04) that are typical of CDEII in general, and many more point substitutions (N = 16) than would be expected for a transposable element fragment of the same length (N = 3). This alignment is unambiguous and so shows that the high rates of mutation at centromeres are not the result of alignment error or slippage mutations.
Levels of polymorphism and divergence in S. paradoxus are lower for transposable elements than for CDEII
| CDEII loci | Median π or d | TE loci | Median π or d | CDEII:TE | |||
|---|---|---|---|---|---|---|---|
| Europe π | 15 | 0.0066 | 0.0021-0.013 | 396 | 0.0017 | 0.0013-0.002 | 3.9 |
| Europe- Far East d | 15 | 0.084 | 0.059-0.10 | 339 | 0.028 | 0.026-0.030 | 3 |
| Europe-American d | 13 | 0.22 | 0.2-0.27 | 196 | 0.084 | 0.078-0.092 | 2.6 |
Table showing nucleotide diversity (π) within the European population or pairwise distance (d) between populations. 95% confidence intervals (C.I.) of each median are based on 10,000 bootstrap replicates. DNA sequence data are those published in Liti et al [14] and only 13-15 loci were available for S. paradoxus because PALAS alignment data were missing for all strains of S. paradoxus for CEN8 and because of low coverage of the American strains for CEN7, CEN8 and CEN12.
Mean nucleotide diversity (π) is higher at S. cerevisiae centromeres (in CDEII) than in selectively unconstrained sequences (TEs)
| Population | Strains | CDEII S | CDEII π | TE S | TE π | P value | CDEII:TE |
|---|---|---|---|---|---|---|---|
| Global | 34 | 222 | 0.04 | 1715 | 0.01 | 9 × 10-11 | 3.6 |
| European | 11 | 29 | 0.006 | 133 | 0.001 | 2 × 10-10 | 4.8 |
| Sake | 3 | 27 | 0.01 | 32 | 0.005 | 6 × 10-6 | 2.6 |
| Malaysian | 3 | 1 | 0.0005 | 0 | 0 | 0.02 | n/a |
| Oak | 2 | 0 | 0 | 0 | 0 | n/a | n/a |
| West African | 2 | 0 | 0 | 0 | 0 | n/a | n/a |
Nucleotide diversity of CDEII (π) was estimated across the CDEII component of all 16 centromeres and π for TEs was estimated for the 210 loci at which TEs appeared fixed. P values are the result of Wilcoxon tests comparing 16 estimates of π of CDEII to 210 estimates of π of TEs for each population.
Summary of possible recombination events in or near the centromere
| Locus | Rmin | No. of Sites | Distance from centromere (bp) |
|---|---|---|---|
| CEN1 | 2 | 2 | 0 (within CDEII) |
| CEN2 | 4 | 7 | < 10, 10-13, 229-281, 281-353 |
| CEN11 | 2 | 5 | < 51, 157-281 |
| CEN14 | 1 | 3 | 14-1208 |
The recombination events listed here show statistically significant evidence (P < 0.05) for recombination using 1 or more test implemented in LDhat, as well as with the 4-gamete test followed by simulation to test the likelihood of multiple mutations. Rmin is the minimum number of recombination events estimated at each locus [22], and No. of Sites is the number of sites showing evidence for recombination.
Mutation spectrum at centromeres is unlike that of gene conversion
| Study | Ts: Tv | Ts:Tv freq | 1 bp deletions | Del:BPS freq | 1 bp insertions | Other |
|---|---|---|---|---|---|---|
| CDEII | 46: 68 | 0.68 | 4 | 0.047 | 0 | 18a |
| Genome-wide (WT) | 12: 19 | 0.63 | 1 | 0.032 | 0 | 1 |
| URA3 Lang (WT) | 46: 121 * | 0.38 | 22 | 0.017 | 3 | 15 |
| CAN1 Lang (WT) | 65: 85 | 0.76 | 56 | 0.034 | 8 | 13 |
| URA3 Hicks (WT) | 19: 27 | 0.70 | 5 | 0.014 | 1 | 9 |
| URA3 Hicks (GC) | 32: 24 * | 1.33 | 32 | 0.074 | 1 | 14 |
The mutation spectrum of CDEII is compared to the wild type spontaneous mutation (WT) inferred from various studies (Genome-wide, [15]; Lang, [23]; Hicks, [13]), and the mutation spectrum expected under gene conversion (GC, [13]). "Other" indicates indels that are -> 1 bp long, and in the case of CDEII also includes indels occurring in homopolymer runs that were 5 bp or longer (N = 14), other mutations described as "Other" for the WT and GC datasets are described in [13]. The deletion: base pair subsitution frequency (Del:BPS) corrects for the difference in target length at which deletions could be observed compared to the target length for base pair substitutions ([23], e.g. for URA3, indel target size = 804 bp, and BPS target size = 104 bp). For CDEII the number of deletions (N = 4) was compared to the number of base pair substitutions (N = 84) outside homopolymer runs. * Significantly different from CDEII (Fisher's exact test, P < 0.05). Poisson tests showed that none of the Del:BPS frequencies were significantly different from that of CDEII.
Figure 2The point substitution spectrum in CDEII. Relative rates of each point substitution (ri-> j) are counts of substitutions from i to j (ni-> j) given the total number of substitutions (∑ ni-> j), normalised for the base composition (pi, pAT = 0.95 in CDEII) and then rescaled so that totals add up to one: ri-> j = ((ni-> j/∑ ni-> j)/pi)/∑((ni-> j/∑ ni-> j)/pi). Error bars are 95% confidence intervals estimated from 1000 bootstrap replicates in which samples of 16 centromeres were sampled at random with replacement for each replicate using R. Open circles show the relative rates of spontaneous mutations observed genome-wide [15]. The genome-wide estimates are from a total of 31 point substitutions, and the errors associated with these are therefore too large to show here. Total counts of each point substitution type (ni-> j) observed in CDEII are shown in the lower margin.