| Literature DB >> 29305387 |
Paul B Talbert1,2, Sivakanthan Kasinathan1,3, Steven Henikoff4,2.
Abstract
Centromeres are the chromosomal sites of assembly for kinetochores, the protein complexes that attach to spindle fibers and mediate separation of chromosomes to daughter cells in mitosis and meiosis. In most multicellular organisms, centromeres comprise a single specific family of tandem repeats-often 100-400 bp in length-found on every chromosome, typically in one location within heterochromatin. Drosophila melanogaster is unusual in that the heterochromatin contains many families of mostly short (5-12 bp) tandem repeats, none of which appear to be present at all centromeres, and none of which are found only at centromeres. Although centromere sequences from a minichromosome have been identified and candidate centromere sequences have been proposed, the DNA sequences at native Drosophila centromeres remain unknown. Here we use native chromatin immunoprecipitation to identify the centromeric sequences bound by the foundational kinetochore protein cenH3, known in vertebrates as CENP-A. In D. melanogaster, these sequences include a few families of 5- and 10-bp repeats; but in closely related D. simulans, the centromeres comprise more complex repeats. The results suggest that a recent expansion of short repeats has replaced more complex centromeric repeats in D. melanogaster.Entities:
Keywords: ChIP; Cid; prod; rotational phasing
Mesh:
Substances:
Year: 2018 PMID: 29305387 PMCID: PMC5844345 DOI: 10.1534/genetics.117.300620
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Selected candidate centromeric satellite repeats in D. melanogaster
| Repeat | Prior localization | Localization confirmed by FISH in this report | Anti-Cid ChIP enrichment in this report |
|---|---|---|---|
| X, 4; centromeric ( | Yes | ||
| 2; noncentromeric | Yes | ||
| Noncentromeric ( | No | ||
| — | Yes | ||
| — | No | ||
| 359-bp repeat | — | No |
Lohe and Jagannathan .
Tolchkov .
Sun .
Torok , 2000), Blower and Karpen (2001) and Garavís .
Garavís .
Figure 1AAGAG is not necessary for centromere function on the D. melanogaster X chromosome. (A) Translocation T(1;3) e breaks on the short right arm of the X chromosome and at 93 on chromosome 3. (B) The wild-type X in the stock T(1;3) e(3R)C lacks detectable AAGAG and CENP-A signal overlaps with AATAT signal. Hybridization signal is not found under all of the anti-CidH signal, suggesting that the kinetochore interferes with hybridization.
Enrichment of sequences in CENP-A ChIP experiments in S2 cells
| Repeat unit | IP count | IP/IN 1 | IP/IN 2 | IP/IN 3 | IP/IN 4 | Median | Minimum | Maximum |
|---|---|---|---|---|---|---|---|---|
| 15,020,471 | 3.06 | 14.78 | 6.76 | 8.96 | 7.9 | 3.1 | 14.8 | |
| 2,162,779 | 13.47 | 11.69 | 14.61 | 15.42 | 14 | 11.7 | 15.4 | |
| 1,147,564 | 60.48 | 1.76 | 3.25 | 6.88 | 5.1 | 1.8 | 60.5 | |
| 527,674 | 2.04 | 0.98 | 2.03 | 3.03 | 2.0 | 0.98 | 3.0 | |
| 90,458 | 4.74 | 0.91 | 9.1 | 2.67 | 3.7 | 0.9 | 9.1 | |
| 68,335 | 11.24 | 36.35 | 61.09 | 89.7 | 48.7 | 11.2 | 89.7 | |
| 65,591 | 19.28 | 4.97 | 0.61 | 7.15 | 6.1 | 0.6 | 19.3 |
Candidate sequences enriched at least twofold in at least three experiments. Enrichment is the ratio of normalized IP counts to normalized input counts. IN, input.
Counts of sequences not in bold are <5% of those of Prodsat.
15 min MNase, anti-CidM, 250-bp single-end reads.
5 min MNase, anti-CidM, 25- × 25-bp reads.
5 min 2 × MNase, anti-CidH, 25- × 25-bp reads.
15 min MNase, anti-CidM, 25- × 25-bp reads.
Enrichment of sequences in GFP ChIP from embryos
| Repeat unit | IP count | Cid-GFP IP/IN − OR IP/IN |
|---|---|---|
| 371,367 | 6.75 | |
| 215,558 | 4.44 | |
| 53,939 | 13.05 | |
| 45,962 | 24.51 | |
| 5,061 | 3.18 |
Counts of AATAGAATTG (not in bold) are <5% of those of Prodsat. IN, input; OR, Oregon R.
Counts from Cid-GFP IP.
5 min MNase, anti-GFP, 25- × 25-bp reads. Enrichment of sequences in the IP was calculated separately for Cid-GFP and Oregon R embryos, and then subtracted.
Figure 2AATAG repeats in D. melanogaster. (A) A fluorescent (AATAG)10 probe detects repeats left of the centromere on chromosome 2 in larval brain squashes. (B) Enrichment of varying numbers of (AATAG) repeats in 250-bp reads from anti-CidM ChIP of S2 cells. (C) An example of a reference sequence containing (AATAG)2 repeats.
Enrichment of sequences in CENP-A ChIP experiments in ML82-19a cells
| 10- to 14-mer | IP counts | IP/IN 1 | IP/IN 2 | IP/IN 3 | Repeat family |
|---|---|---|---|---|---|
| 285,765 | 23.28 | 27.68 | 56.12 | ||
| 165,104 | 26.41 | 22.00 | 50.96 | ||
| 149,863 | 20.52 | 20.66 | 39.06 | ||
| 114,881 | 3.12 | 2.13 | 3.52 | ||
| 105,305 | 3.31 | 3.46 | 5.48 | ||
| 87,881 | 15.39 | 16.23 | 19.70 | ||
| 68,580 | 47.69 | 36.80 | 53.00 | ||
| 51,479 | 3.66 | 2.50 | 3.34 | ||
| 50,478 | 3.74 | 4.07 | 4.88 | ||
| 48,229 | 3.56 | 2.35 | 4.16 | ||
| 38,353 | 35.92 | 30.76 | 38.84 | ||
| 34,493 | 2.49 | 2.09 | 5.64 | ||
| 25,443 | 5.24 | 4.90 | 9.77 | ||
| 22,660 | 2.01 | 0.92 | 2.19 | 10-mer | |
| 22,372 | 8.87 | 7.55 | 16.25 | ||
| 4,247 | 5.40 | 4.11 | 5.72 | ||
| 1,278 | 5.56 | 2.75 | 6.36 | 7-mer | |
| 238 | 19.70 | 2.11 | 2.15 | 7-mer | |
| 24 | 9.32 | 41.75 | 7-mer |
IN, input.
Candidate sequences enriched at least twofold in at least two experiments. Enrichment is the ratio of normalized IP counts to normalized input counts. Counts of sequences not in bold are <5% of those of ACAATCGTTT, the most abundant sequence counted in the IP.
15 min MNase, anti-CidM, 25- × 25-bp reads.
5 min MNase, anti-CidM, 25- × 25-bp reads.
Enrichment undefined due to absence in input.
Figure 3Periodicity of simcent1 repeats. Dot matrix plots of similarity for four clones with homology to simcent1 compared against themselves reveal repeats with a periodicity of ∼500 bp.
Figure 4FISH of the simcent1 probe in D. simulans. (A) Hybridization of simcent1 to D. simulans larval neuroblasts using formaldehyde fixation. (B) simcent1 hybridization together with CENP-A detection. (C and D) Hybridization of simcent1 using formaldehyde fixation with 45% acetic acid. Arrow points to weak signal at the expected position of the X centromere.
Figure 5Abundance of simple and complex repeats in nine Drosophila species. All species are in the melanogaster subgroup, except D. ananassae, which is in the melanogaster group, and D. pseudoobscura, which serves as an outgroup. The cladograms at the bottom depict the relationships of the species. The y-axis represents the fraction of the reads that contain the repeat, presented on log10 scale. In the box and whisker plots, the box ends represent the second and third quartiles of the data separated by the median, while the whiskers represent the first and fourth quartiles. The mean is marked by X. For each centromere-enriched repeat, [mel] and [sim] indicate that in the corresponding species, the abundance of the repeat in anti-Cid IP was at least 5% of that of the most abundant centromere repeat in that species.