| Literature DB >> 28910411 |
Juan M Calvo-Martín1, Montserrat Papaceit1, Carmen Segarra1.
Abstract
Polycomb group (PcG) proteins are important regulatory factors that modulate the chromatin state. They form protein complexes that repress gene expression by the introduction of posttranslational histone modifications. The study of PcG proteins divergence in Drosophila revealed signals of coevolution among them and an acceleration of the nonsynonymous evolutionary rate in the lineage ancestral to the obscura group species, mainly in subunits of the Pcl-PRC2 complex. Herein, we have studied the nucleotide polymorphism of PcG genes in a natural population of D. subobscura to detect whether natural selection has also modulated the evolution of these important regulatory genes in a more recent time scale. Results show that most genes are under the action of purifying selection and present a level and pattern of polymorphism consistent with predictions of the neutral model, the exceptions being Su(z)12 and Pho. MK tests indicate an accumulation of adaptive changes in the SU(Z)12 protein during the divergence of D. subobscura and D. guanche. In contrast, the HKA test shows a deficit of polymorphism at Pho. The most likely explanation for this reduced variation is the location of this gene in the dot-like chromosome and would indicate that this chromosome also has null or very low recombination in D. subobscura, as reported in D. melanogaster.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28910411 PMCID: PMC5599051 DOI: 10.1371/journal.pone.0185005
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Location of the PcG genes on the D. subobscura chromosomes.
The location of the sixteen studied PcG genes is indicated on the polytene chromosomes of D. subobscura (ch cu strain) by arrowheads. All chromosomes have the standard arrangement shown in the cytological map of the species [27] except the O chromosome that has the O3+4 arrangement that includes the overlapping inversions O3 and O4. Chromosomes are identified by capital letters (A, J, U, E and O) at the distal end. The dot-like chromosome is also indicated (Dot).
Summary estimates of the level and pattern of polymorphism.
| Complex | n° lines | n° sites | Tajima's | ||||||
|---|---|---|---|---|---|---|---|---|---|
| PhoRC | |||||||||
| | 17 | 2842 | 11 | 10 | 0.90 | 0.0009 | 0.0263 | -0.8763 | 0 |
| | 15 | 5617 | 76 | 14 | 0.99 | 0.0033 | 0.1055 | -0.9055 | 9 |
| | 16 | 2293 | 34 | 15 | 0.99 | 0.0030 | 0.1130 | -1.3772 | 5 |
| Pcl-PRC2 | |||||||||
| | 14 | 2621 | 85 | 13 | 0.99 | 0.0083 | 0.1857 | -0.9447 | 9 |
| | 16 | 3652 | 143 | 15 | 0.99 | 0.0113 | 0.2162 | -0.3766 | 16 |
| | 15 | 2105 | 78 | 15 | 1.00 | 0.0098 | 0.3617 | -0.7133 | 10 |
| | 16 | 3763 | 58 | 15 | 0.99 | 0.0039 | 0.1788 | -0.7093 | 15 |
| | 16 | 3491 | 39 | 16 | 1.00 | 0.0027 | 0.0833 | -0.8365 | 5 |
| | 15 | 1756 | 66 | 14 | 0.99 | 0.0117 | 0.3400 | -0.1027 | 10 |
| PRC1 | |||||||||
| | 15 | 11839 | 325 | 15 | 1.00 | 0.0060 | 0.1799 | -1.3784 | 47 |
| | 9 | 2714 | 50 | 9 | 1.00 | 0.0064 | 0.1455 | -0.4042 | 6 |
| | 17 | 3749 | 161 | 16 | 0.99 | 0.0085 | 0.2312 | -1.4846 | 20 |
| | 17 | 9203 | 176 | 16 | 0.99 | 0.0038 | 0.1348 | -1.4433 | 20 |
| | 16 | 4161 | 123 | 15 | 0.99 | 0.0060 | 0.1716 | -1.4928 | 18 |
| | 16 | 5752 | 63 | 15 | 0.99 | 0.0022 | 0.1007 | -1.4728 | 6 |
| dRAF | |||||||||
| | 16 | 8507 | 186 | 15 | 0.99 | 0.0046 | 0.1403 | -1.3780 | 20 |
S: number of segregating sites, h: number of haplotypes, Hd: haplotype diversity, π: nucleotide diversity per site, π/K: nucleotide diversity per site normalized by nucleotide divergence with D. guanche, R: minimum number of recombination events.
*dRAF complex also contains PSC and SCE.
Summary of synonymous and nonsynonymous polymorphism and divergence with D. guanche in the PcG genes.
| Complex | ||||||
|---|---|---|---|---|---|---|
| PhoRC | ||||||
| | 0.0000 | 0.0006 | - | 0.0663 | 0.0181 | 0.2732 |
| | 0.0115 | 0.0009 | 0.0742 | 0.0890 | 0.0070 | 0.0786 |
| | 0.0094 | 0.0012 | 0.1261 | 0.1051 | 0.0045 | 0.0428 |
| Pcl-PRC2 | ||||||
| | 0.0211 | 0.0000 | 0.0000 | 0.1417 | 0.0010 | 0.0070 |
| | 0.0242 | 0.0004 | 0.0145 | 0.1743 | 0.0036 | 0.0206 |
| | 0.0234 | 0.0004 | 0.0176 | 0.0876 | 0.0022 | 0.0256 |
| | 0.0149 | 0.0006 | 0.0382 | 0.0621 | 0.0070 | 0.1133 |
| | 0.0071 | 0.0012 | 0.1704 | 0.0881 | 0.0129 | 0.1464 |
| | 0.0334 | 0.0014 | 0.0407 | 0.1292 | 0.0034 | 0.0266 |
| PRC1 | ||||||
| | 0.0120 | 0.0019 | 0.1586 | 0.0890 | 0.0097 | 0.1093 |
| | 0.0149 | 0.0007 | 0.0443 | 0.0470 | 0.0033 | 0.0704 |
| | 0.0128 | 0.0006 | 0.0494 | 0.0757 | 0.0061 | 0.0811 |
| | 0.0070 | 0.0010 | 0.1355 | 0.0425 | 0.0070 | 0.1642 |
| | 0.0137 | 0.0010 | 0.0736 | 0.0774 | 0.0063 | 0.0820 |
| | 0.0056 | 0.0006 | 0.1097 | 0.0478 | 0.0092 | 0.1923 |
| dRAF | ||||||
| | 0.0120 | 0.0006 | 0.0507 | 0.0901 | 0.0030 | 0.0334 |
π: synonymous nucleotide diversity, π: nonsynonymous nucleotide diversity, π/π: nonsynonymous/synonymous diversity ratio, K: synonymous divergence, K: nonsynonymous divergence, K/K: nonsynonymous/synonymous divergence ratio.
*dRAF complex also contains PSC and SCE.
Fig 2Polymorphism and divergence in the coding region of the PcG genes.
Ratio of nonsynonymous to synonymous polymorphism in D. subobscura (π/π) and of nonsynonymous to synonymous divergence between D. subobscura and D. guanche (K/K) for each of the Polycomb genes studied. The x-axis indicates gene identity and the y-axis indicates π/π and K/K values. Pho lacks synonymous polymorphism and Caf1-55 nonsynonymous polymorphism. Genes on the x-axis are grouped according to Polycomb complex. dRAF also contains PSC and SCE.
Fig 3Nucleotide diversity and divergence along Pho and Su(z)12.
Sliding window plots of the distribution of nucleotide diversity in D. subobscura (π, black line) and of nucleotide divergence between D. subobscura and D. guanche (K, gray line). Windows include 50 sites with successive displacements of 5 sites. The x-axis indicates nucleotide sites across the gene region and the y-axis indicates nucleotide diversity or divergence. Solid boxes in the lower part of the figure indicate the coding exons and thin lines show flanking regions and introns. The protein domains of the encoded proteins are indicated below the gene structure with empty boxes.