| Literature DB >> 22737079 |
Mia T Levine1, Connor McCoy, Danielle Vermaak, Yuh Chwen G Lee, Mary Alice Hiatt, Frederick A Matsen, Harmit S Malik.
Abstract
Heterochromatin is the gene-poor, satellite-rich eukaryotic genome compartment that supports many essential cellular processes. The functional diversity of proteins that bind and often epigenetically define heterochromatic DNA sequence reflects the diverse functions supported by this enigmatic genome compartment. Moreover, heterogeneous signatures of selection at chromosomal proteins often mirror the heterogeneity of evolutionary forces that act on heterochromatic DNA. To identify new such surrogates for dissecting heterochromatin function and evolution, we conducted a comprehensive phylogenomic analysis of the Heterochromatin Protein 1 gene family across 40 million years of Drosophila evolution. Our study expands this gene family from 5 genes to at least 26 genes, including several uncharacterized genes in Drosophila melanogaster. The 21 newly defined HP1s introduce unprecedented structural diversity, lineage-restriction, and germline-biased expression patterns into the HP1 family. We find little evidence of positive selection at these HP1 genes in both population genetic and molecular evolution analyses. Instead, we find that dynamic evolution occurs via prolific gene gains and losses. Despite this dynamic gene turnover, the number of HP1 genes is relatively constant across species. We propose that karyotype evolution drives at least some HP1 gene turnover. For example, the loss of the male germline-restricted HP1E in the obscura group coincides with one episode of dramatic karyotypic evolution, including the gain of a neo-Y in this lineage. This expanded compendium of ovary- and testis-restricted HP1 genes revealed by our study, together with correlated gain/loss dynamics and chromosome fission/fusion events, will guide functional analyses of novel roles supported by germline chromatin.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22737079 PMCID: PMC3380853 DOI: 10.1371/journal.pgen.1002729
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1HP1 diversity in Drosophila genomes.
A. Phylogeny of 12 Drosophila species, which were each queried for HP1-like genes in this study [2], [42]. Scale bar refers to the approximate divergence time between these species [2], [42]. B. Schematics of proteins encoded by the various HP1 genes in Drosophila genomes are presented alongside the HP1 gene name. Highlighted in boxes are the canonical chromo (green) and shadow (blue) domains that typify HP1 genes. Note that in some instances, we were unable to confirm the exact gene model and therefore the lengths of the N-terminal tails (these are indicated with dashed lines). We also report the D. melanogaster cytolocation of the gene or if the gene is absent in D. melanogaster, the sytenic location in the D. melanogaster genome based on neighboring genes. The final column reports the species in which the gene is found. Genes shaded gray represent founding HP1 gene family members that were reported in the original D. melanogaster genome sequencing study [42]. “*” refers to an allele that harbors a premature stop codon but conserved C-terminal sequence (Table S1, Figure S4) and predicted CD and CSD domains, consistent with a polymorphic full length gene or an incorrect base call.
Figure 2Phylogenetic relationships among the Drosophila HP1 genes.
We constructed phylogenetic trees generated in BEAST (see Materials and Methods) using codon-based alignments of the Chromodomain (A) or Chromoshadow domain (B) based on a log-normal relaxed molecular clock [58]. For clarity, we only present posterior probabilities for major clade relationships rather than between orthologs of the same gene (complete trees with all posterior probability support values indicated can be found in supplemental data (Figure S5A, S5B)). In most instances orthologs grouped together with a high degree of support (exceptions, including the HP1Lcsd genes are discussed in the main text). Genes that are shaded gray refer to partial HP1s that encode either a chromodomain (in A) or shadow domains (in B) only. Scale bar refers to the expected number of substitutions per site.
Figure 3Expression patterns of Drosophila HP1 genes.
RT-PCR analysis on several adult tissues from male and female Drosophila from each of 5 species. RP49 represents a control locus. “UMB:” umbrea, “OXP”: oxpecker, “-”: no DNA/RNA control; “g”: genomic DNA, “M”: whole male, “F”: whole female; “H”:head; “T”: testis, “C”: carcass (gonadectomized, headless individuals); “O” ovaries. Gray lines refer to the absence of the gene in the particular species. We present the analyses for full-length HP1 genes in (A), for partial CD-only HP1s in (B) and for partial CSD-only HP1s in (C).
Figure 4HP1 gene compendia in the 12 Drosophila species.
We present the summary of all HP1-like genes that were identified in our evolutionary screen. These include the five previously known HP1A through HP1E but also include 21 additional HP1 genes identified in this study. Unlike HP1A through HP1D, which are present throughout Drosophila phylogeny, many HP1 genes are present or lost in a lineage- or even species-specific fashion. Summarizing the expression patterns in five Drosophila species (Figure 3 above), we report either ubiquitous expression (i.e., not germline biased) or ovary- or testis-biased expression. Most of the genes we have identified have a germline- and specifically testis-biased expression. HP1F (*) appears to be exclusively expressed in D. pseudoobscura heads. Open circles refer to genes where we did not find evidence for adult-specific expression.
Results of the population genetic analyses of HP1 genes that occur in D. melanogaster.
| Codon Usage Bias | Syn+Nonsyn Sequence Diversity | McDonald-Kreitman Test | |||||||||||||
| gene | ENC mel/sim/yak | %ile | ave n | #codons | NSπ | Sπ | π ratio | %ile | #codons | NSfix | Sfix | NSpoly | Spoly | NI | FETpval |
|
| 47/48/45 | 43.1 | 38.5 | 206 | 0.000 | 0.004 | 0.08 | 50.7 | 206 | 6 | 13 | 6 | 9 | 1.44 | 0.72 |
|
| 40/40/36 | 9.8 | 37.6 | 240 | 0.001 | 0.013 | 0.11 | 60.5 | 239 | 1 | 11 | 2 | 17 | 1.29 | 1.00 |
|
| 39/39/40 | 8.0 | 36.5 | 237 | 0.000 | 0.011 | 0.02 | 22.0 | 237 | 7 | 9 | 6 | 25 | 0.31 | 0.10 |
|
| 59/59/56 | 93.6 | 40.8 | 418 | 0.004 | 0.009 | 0.41 | 93.2 | 413 | 101 | 44 | 34 | 20 | 0.74 | 0.40 |
|
| 46/61/61 | 31.1 | 35.5 | 174 | 0.004 | 0.004 | 0.95 | 98.2 | 172 | 14 | 23 | 15 | 12 | 2.05 | 0.21 |
|
| 61/58/59 | 99.9 | 37.3 | 133 | 0.007 | 0.018 | 0.39 | 92.4 | 133 | 13 | 8 | 11 | 10 | 0.68 | 0.76 |
|
| 56/49/58 | 86.5 | 38.5 | 106 | 0.003 | 0.003 | 0.84 | 97.9 | 102 | 28 | 8 | 4 | 3 | 0.38 | 0.35 |
|
| 61/49/56 | 99.9 | 34.6 | 83 | 0.001 | 0.001 | 1.13 | 98.7 | 81 | 23 | 11 | 6 | 3 | 0.96 | 1.00 |
|
| 59/45/53 | 96.1 | 40.4 | 84 | 0.001 | 0.005 | 0.14 | 70.3 | 84 | 7 | 15 | 6 | 5 | 2.57 | 0.27 |
|
| 55/-/52** | 81.7 | 37.0 | 173 | 0.003 | 0.007 | 0.38 | 92.0 | 88 | 36 | 12 | 16 | 7 | 0.76 | 0.78 |
ENC = effective number of codons, Syn = synonymous, Nonsyn = nonsynonymous, %ile = percentile, ave n = average # alleles, NS = nonsynonymous, S = synonymous, π ratio = NSπ/Sπ, poly = #polymorphisms, fix = # fixations, NI = neutrality index, FETpval = Fisher's Exact Test probability value.
Results from the molecular evolution analysis of genes that occur in D. melanogaster.
| gene | dN/dS | dN | dS | %ile |
|
|
| 0.010 | 0.06 | 75.4 |
|
|
| 0.002 | 0.03 | 49.5 |
|
|
| 0.006 | 0.10 | 45.0 |
|
|
| 0.084 | 0.065 | 99.6 |
|
|
| 0.022 | 0.20 | 64.5 |
|
|
| 0.025 | 0.01 | 99.7 |
|
|
| 0.079 | 0.11 | 98.0 |
|
|
| 0.080 | 0.15 | 96.2 |
|
|
| 0.013 | 0.17 | 54.0 |
|
|
| n/a | n/a | n/a |
The dN/dS refers to a D. melanogaster-D. simulans pairwise calculation.
PAML analysis results of genes that occur in D. melanogaster.
| gene | species | #codons | log ratio | p-val |
|
|
| 206 | 0.00 | 1.00 |
|
|
| 660 | 5.56 | 0.06 |
|
|
| 356 | 0.19 | 0.91 |
|
|
| 257 | 12.36 |
|
|
|
| 144 | 0.17 | 0.92 |
|
|
| 125 | 1.14 | 0.56 |
|
|
| 216 | 13.44 |
|
|
|
| 66 | 2.70 | 0.26 |
|
|
| 60 | 0.00 | 1.00 |
|
|
| 173 | n/a | n/a |
mel = D. melanogaster, sim = D. simulans, sec = D. sechellia, yak = D. yakuba, ere = D. erecta, tak = D. takahashii, bia = D. biarmipes, ele = D. elegans, fic = D. ficusphilia.
Figure 5Delineating HP1E loss in the obscura group.
We amplified the syntenic region of HP1E in the obscura group and successfully identified intact HP1E genes from D. guanche and D. bifasciata. We found highly pseudogenized versions of HP1E in D. azteca, D. affinis (Figure S3). These latter four species also share dramatic karyotypic changes specific to this lineage including an X:3L fusion, a Y:4 fusion and a neo-Y (indicated as Y′ in figure, note that 3L and 4 = elements “D” and “F”, respectively). Thus, to the level of resolution possible from the available species, HP1E loss coincided with the karyotypic changes in the obscura group. The HP1E cytolocation on chromosome 3R (element “E”), post-karyotype evolution, is apparently undisrupted.