Masafumi Nozawa1,2, Kazuho Ikeo3,4, Takashi Gojobori5. 1. Department of Biological Sciences, Tokyo Metropolitan University, Hachioji, Japan. 2. Research Center for Genomics and Bioinformatics, Tokyo Metropolitan University, Hachioji, Japan. 3. Center for Information Biology, National Institute of Genetics, Mishima, Japan. 4. Department of Genetics, SOKENDAI, Mishima, Japan. 5. King Abdullah University of Science and Technology (KAUST), Biological and Environmental Science and Engineering, Computational Bioscience Research Center, Thuwal, Kingdom of Saudi Arabia.
Abstract
Many organisms have a global mechanism for dosage compensation (DC) operating along the entire male X chromosome, which equalizes gene expression on the male X with that on the two Xs in females and/or on autosomes. At the initial stage of sex chromosome evolution, however, gene-by-gene (or localized) DC may also be necessary because the degeneration of Y-linked genes occurs independently at different times. We therefore tested whether the up-regulation of X-linked genes depends on the status of their Y-linked homologs, using the young sex chromosomes, neo-X and neo-Y, in Drosophila miranda. In support of the presence of gene-by-gene DC, the extent of up-regulation in males was indeed higher for neo-X-linked genes with pseudogenized neo-Y-linked homologs than for neo-X-linked genes with functional neo-Y-linked homologs. Further molecular evolutionary analysis also supports the idea that many individual neo-X-linked genes first acquired the potential for up-regulation, which then enabled the pseudogenization of neo-Y-linked homologs, without serious deleterious effects on male fitness.
Many organisms have a global mechanism for dosage compensation (DC) operating along the entire male X chromosome, which equalizes gene expression on the male X with that on the two Xs in females and/or on autosomes. At the initial stage of sex chromosome evolution, however, gene-by-gene (or localized) DC may also be necessary because the degeneration of Y-linked genes occurs independently at different times. We therefore tested whether the up-regulation of X-linked genes depends on the status of their Y-linked homologs, using the young sex chromosomes, neo-X and neo-Y, in Drosophila miranda. In support of the presence of gene-by-gene DC, the extent of up-regulation in males was indeed higher for neo-X-linked genes with pseudogenized neo-Y-linked homologs than for neo-X-linked genes with functional neo-Y-linked homologs. Further molecular evolutionary analysis also supports the idea that many individual neo-X-linked genes first acquired the potential for up-regulation, which then enabled the pseudogenization of neo-Y-linked homologs, without serious deleterious effects on male fitness.
Y chromosomes often exhibit substantial degeneration (Lander et al. 2001; Zhou and Bachtrog 2012; Papadopulos et al. 2015). This potential disadvantage of having sex chromosomes is believed to be overcome by dosage compensation (DC), which equalizes gene dosage between one X chromosome in males and two Xs in females as well as between the X and autosomes in males (Ohno 1967; Disteche 2012). Indeed, many organisms have DC mechanisms (Deng et al. 2011; Conrad and Akhtar 2012; Julien et al. 2012; Papadopulos et al. 2015).In Drosophila, a single X chromosome in males is globally up-regulated to equalize the expression level to that of the two Xs in females as well as that of autosomes (Gelbart and Kuroda 2009; Conrad and Akhtar 2012; Disteche 2012). This global up-regulation of the male X is initiated by the DC complex (DCC). The binding of the DCC to a sequence motif, that is, chromatin entry sites (CESs), which are scattered across the male X, promotes histone H4 acetylation, which in turn stimulates the hyper-transcription of the many genes on the X chromosome by opening the chromatin structure (Alekseyenko et al. 2012).During the early evolution of sex chromosomes, however, genes on the Y chromosome are expected to become individually pseudogenized at different times, rather than to become simultaneously nonfunctionalized. In this case, the global up-regulation of genes on the entire male X may cause the overexpression of X-linked genes whose Y-linked homologs are still functional. This overexpression of X-linked genes may also be deleterious for males. Therefore, a gene-by-gene (or localized) DC mechanism may be necessary at the initial stage of sex chromosome evolution. Indeed, Nozawa et al. (2014) found that DC varies considerably among genes depending on their functional constraints by analyzing the right arm of the X chromosome (or XR), which emerged ∼15 Ma in the lineage leading to Drosophila pseudoobscura. However, the XR was too old to study the relationship between the pseudogenization of Y-linked genes and the up-regulation of X-linked homologs; the Y chromosome in D. pseudoobscura was almost completely degenerated, and the interplay between the X and Y chromosomes could not be investigated.In Drosophila, even younger sex chromosomes than those of D. pseudoobscura emerged independently in different species (Chang et al. 2008). One of such species is D. miranda with the so-called neo-sex chromosomes. Drosophila miranda is a close relative of D. pseudoobscura. After splitting from D. pseudoobscura ∼2 Ma, an autosome of D. miranda became the neo-Y via a fusion with the original Y chromosome ∼1 Ma, and consequently, the remaining homologous autosome became the neo-X (see Nozawa et al. 2016 for more detailed processes of karyotype evolution of D. miranda). The neo-Y in D. miranda is so young that degeneration is ongoing, providing an excellent target to study the interplay between the sex chromosomes. Indeed, many studies have already been conducted by using this species (e.g., Marin et al. 1996; Zhou and Bachtrog 2012; Ellison and Bachtrog 2013; Zhou et al. 2013; Nozawa et al. 2016). In particular, Zhou et al. (2013) extensively studied the relationship between the functionality of neo-Y-linked genes and the chromatin states of the neo-X-linked homologs. In this study, we therefore focused more on the interplay between the functionality of neo-Y-linked genes and the gene expression level of the neo-X-linked homologs.
Gene-by-Gene DC Operates on the Neo-X Chromosome
To measure the DC level for each neo-X-linked gene in males, we used RLin, developed by Lin et al. (2012). In short, RLin computes the ratio of the expression level of an X-linked gene in one species to that of its autosomal ortholog in another species by using the median expression level of genes located on autosomes in both species as a control (see Materials and Methods for more details). A value of 1 indicates perfect DC, whereas a value of 0.5 indicates a lack of DC. If gene-by-gene DC operates on the neo-X and the interplay between the neo-X and neo-Y is fine-tuned at the gene expression level, RLin is expected to be greater for the neo-X-linked genes whose neo-Y-linked homologs are already pseudogenized than for the neo-X-linked genes whose neo-Y-linked homologs are still functional.We found that RLin values are indeed significantly greater for the neo-X-linked genes with pseudogenized neo-Y-linked homologs (XF–YP) than for the neo-X-linked genes with functional neo-Y-linked homologs (XF–YF) (fig. 1 and supplementary fig. S1, Supplementary Material online). The level of DC for XF–YP genes was high in the thorax. In the testis, by contrast, DC for XF–YP genes was less obvious, possibly reflecting meiotic sex chromosome inactivation, a mechanism for repressing gene expression on sex chromosomes during male meiosis (Vibranovski 2014 for review), or less efficient DC in the testis (Mikhaylova and Nurminsky 2011; Landeen et al. 2016). Consistent with this observation, the abdomen without gonad tissues showed a larger RLin than the abdomen including gonad tissues (median values for XF–YP genes were 0.84 and 0.73, respectively; fig. 1). For XF–YF genes, when the expression of both functional neo-X-linked and neo-Y-linked homologs was considered to compute RLin, the expression levels of the genes in D. miranda became similar to those in D. pseudoobscura, one of the most closely related species to D. miranda, used as a proxy of the ancestral state before neo-sex chromosomes were acquired (i.e., RLin for XF+YF was ∼1; supplementary fig. S1, Supplementary Material online). Essentially the same pattern of RLin was observed for all other tissues examined. These results are consistent with the patterns expected under a model of gene-by-gene DC. Note that the XF–YF genes also showed RLin values significantly >0.5 (fig. 1 and supplementary fig. S1, Supplementary Material online), consistent with global up-regulation on the male neo-X, as previously reported (Ellison and Bachtrog 2013; Zhou et al. 2013). Therefore, both gene-by-gene and global DC seem to have operated on the male neo-X in D. miranda, but gene-by-gene DC may have been particularly important to overcome massive pseudogenization on the neo-Y during the initial stage of sex chromosome evolution.
. 1.
—Relationship between the functionality of neo-Y-linked genes and the extent of DC on neo-X-linked genes in the thorax, testis, abdomen without gonad, and abdomen (from left to right). XF–YF, a group of genes for which both neo-X-linked and neo-Y-linked homologs are functional; XF–YP, a group of genes for which neo-X-linked homologs are functional, but neo-Y-linked homologs are pseudogenized. The numbers of genes analyzed are 461 and 757 for the XF–YF and XF–YP genes, respectively. The lines in the boxes represent medians, 50% of values are included in the boxes, and 80% of values are included within the bars. Differences in median values between groups were tested based on a Monte-Carlo simulation with 1,000 bootstrap replicates: ***, P < 0.001. A solid line indicates the RLin value of 1, indicating perfect DC, whereas a broken line corresponds to a value of 0.5, indicating no DC. + below each bar means that the median RLin value is significantly >0.5 at the 5% significance level.
—Relationship between the functionality of neo-Y-linked genes and the extent of DC on neo-X-linked genes in the thorax, testis, abdomen without gonad, and abdomen (from left to right). XF–YF, a group of genes for which both neo-X-linked and neo-Y-linked homologs are functional; XF–YP, a group of genes for which neo-X-linked homologs are functional, but neo-Y-linked homologs are pseudogenized. The numbers of genes analyzed are 461 and 757 for the XF–YF and XF–YP genes, respectively. The lines in the boxes represent medians, 50% of values are included in the boxes, and 80% of values are included within the bars. Differences in median values between groups were tested based on a Monte-Carlo simulation with 1,000 bootstrap replicates: ***, P < 0.001. A solid line indicates the RLin value of 1, indicating perfect DC, whereas a broken line corresponds to a value of 0.5, indicating no DC. + below each bar means that the median RLin value is significantly >0.5 at the 5% significance level.It should be mentioned that RLin ranges from zero to infinite, because the expression level of a gene in a species can change even without acquiring sex chromosomes (i.e., without the effect of DC). This is why individual genes often show the RLin values <0.5 or >1. Therefore, considering a representative value (e.g., median) of RLin values for many genes in each category would be more meaningful to grasp a general pattern of up-regulation than focusing on the RLin values for individual genes.It should also be mentioned that RLin can also be computed for females (see Materials and Methods). In females, however, RLin values for XF–YP genes are not necessarily greater than the values for XF–YF genes (although depending on the tissues examined, supplementary fig. S2, Supplementary Material online). This observation suggests that the gene-by-gene up-regulation is primarily male-specific and operates as a mechanism of DC rather than gene-specific up-regulation operating on both sexes.
Individual Neo-X-Linked Genes Seem to Have First Acquired the Potential for Up-regulation, Which Have Promoted the Pseudogenization of Neo-Y-Linked Homologs
If the up-regulation of neo-X-linked genes in males has occurred in response to the pseudogenization of their neo-Y-linked homologs, DC is expected to be incomplete at the initial stage and gradually fine-tuned over time. In this case, RLin for the neo-X-linked genes would initially be close to 0.5 and become greater with time after the pseudogenization of the neo-Y-linked homologs. To test this possibility, we estimated the relative time elapsed since the pseudogenization of neo-Y-linked genes (supplementary fig. S3, Supplementary Material online; Oda et al. 2002). We then classified all XF–YP genes into four bins based on this relative time. Another group for which neo-Y-linked genes were already deleted from the chromosome was also evaluated. The results showed that the DC level is essentially the same, irrespective of the time since the pseudogenization of neo-Y-linked genes (fig. 2). This observation suggests a general trend that individual neo-X-linked genes first acquired the potential for up-regulation to maintain a stable expression level. After the establishment of such a mechanism for the neo-X-linked genes, the neo-Y-linked homologs could have become easily pseudogenized, without serious deleterious effects on fitness. Yet, our estimates of the relative pseudogenization time contain large sampling errors due to small sequence divergence of neo-X and neo-Y and we cannot still completely rule out other possibilities. For example, it is known that effects of chromosomal aneuploidies on gene expression are often counteracted by a gene-expression buffering mechanism (Stenberg et al. 2009). Therefore, dosage reduction due to pseudogenization of neo-Y-linked genes might have also been buffered to some extent by an intrinsic mechanism that up-regulates their neo-X-linked homologs. However, note that in our definition, this mechanism can also be classified as a gene-by-gene or a localized DC but not a global DC, because in this case up-regulation occurs only on the neo-X-linked genes whose neo-Y-linked homologs are pseudogenized but not on an entire chromosome. In any case, if a genome-editing technique in D. miranda is developed, we may be able to evaluate the possibility of this buffering mechanism by making a mutant in which a functional neo-Y-linked gene is knocked-out (i.e., artificially pseudogenized) and by testing whether the expression level of the neo-X-linked homolog is up-regulated in a male.
. 2.
—Relationship between the relative time after pseudogenization of neo-Y-linked genes and the extent of dosage compensation (DC) on the neo-X-linked homologs. The RLin values in a tissue where each gene is expressed most highly among the tissues examined were used. The lines in the boxes represent medians, 50% of values are included in the boxes, and 80% of values are included within the bars. Solid and broken lines correspond to the RLin value of 1 and 0.5, respectively. Differences in median values between groups were tested based on a Monte-Carlo simulation with 1,000 bootstrap replicates; P ≥ 0.05 for all pairwise comparisons. The number of genes analyzed for each category is shown in parentheses.
—Relationship between the relative time after pseudogenization of neo-Y-linked genes and the extent of dosage compensation (DC) on the neo-X-linked homologs. The RLin values in a tissue where each gene is expressed most highly among the tissues examined were used. The lines in the boxes represent medians, 50% of values are included in the boxes, and 80% of values are included within the bars. Solid and broken lines correspond to the RLin value of 1 and 0.5, respectively. Differences in median values between groups were tested based on a Monte-Carlo simulation with 1,000 bootstrap replicates; P ≥ 0.05 for all pairwise comparisons. The number of genes analyzed for each category is shown in parentheses.
Genes Highly Expressed in Male Gonads Tend to Remain Functional on the Neo-Y Chromosome
We next examined whether pseudogenization of neo-Y-linked genes depends on the spatiotemporal gene expression pattern. For this analysis, all genes on the neo-sex chromosomes were classified into groups depending on the tissue where the genes are expressed most highly in D. pseudoobscura, as a proxy for the ancestor without neo-sex chromosomes. In this way, the kinds of genes in the ancestor that tend to remain functional after becoming neo-sex-linked, if any, were examined. Using a similar approach, Kaiser et al. (2011) reported that male-biased genes have remained functional at a higher rate compared with female-biased genes on the neo-Y. We also observed that the neo-Y-linked genes expressed most highly in the testis or accessory gland had a greater tendency to remain functional than the genes expressed most highly in other somatic tissues (fig. 3). Consistent with this observation, the category of genes that are functional only on the neo-Y (XP–YF) contained a significantly greater proportion of genes with high expression in the testis or accessory gland than other categories of genes (fig. 3). Qualitatively similar results were obtained when more stringent criteria were applied for defining the tissue with the maximal expression for each gene (supplementary fig. S4, Supplementary Material online). However, we also observed an unexpected trend in which the proportion of genes that are functional was significantly greater for the genes expressed most highly in the ovary than for the genes expressed most highly in other somatic tissues (fig. 3). To further inspect this observation, we classified these “ovary” genes into groups based on the tissues in which they exhibit the second-highest expression. The proportion of genes that are functional was significantly greater for neo-Y-linked genes with the second-highest expression in male tissues than for the genes with the second-highest expression in female tissues (0.48 vs. 0.35, respectively; P = 0.02 by a Monte-Carlo simulation with 1,000 bootstrap replicates). Therefore, some of these “ovary” genes would have been expressed and function in male tissues as well and have remained functional, even after becoming neo-Y-linked.
. 3.
—Expression patterns and pseudogenization of neo-Y-linked genes. (A) Relationship between genes showing maximum expression in various tissues and the proportion of functional genes. Gene expression based on cFPKM (see Materials and Methods) in Drosophila pseudoobscura was used as a proxy for the ancestor without neo-sex chromosomes. (B) Relationship between the functionality of neo-sex-linked genes and the proportion of genes with the highest expression in the testis or accessory gland. XF–YF, for genes that are functional on both neo-X and neo-Y; XF–YP, for genes that are functional on the neo-X, but pseudogenized on the neo-Y; XP–YF, for genes that are pseudogenized on the neo-X, but functional on the neo-Y; and XP–YP, for genes that are pseudogenized on both the neo-X and neo-Y. Error bars showing the 95% confidence interval and statistical significance were calculated based on a Monte-Carlo simulation with 1,000 bootstrap replicates: ***, P < 0.001; *, P < 0.05; NS, P ≥ 0.05. The number of genes analyzed for each category is shown in parentheses.
—Expression patterns and pseudogenization of neo-Y-linked genes. (A) Relationship between genes showing maximum expression in various tissues and the proportion of functional genes. Gene expression based on cFPKM (see Materials and Methods) in Drosophila pseudoobscura was used as a proxy for the ancestor without neo-sex chromosomes. (B) Relationship between the functionality of neo-sex-linked genes and the proportion of genes with the highest expression in the testis or accessory gland. XF–YF, for genes that are functional on both neo-X and neo-Y; XF–YP, for genes that are functional on the neo-X, but pseudogenized on the neo-Y; XP–YF, for genes that are pseudogenized on the neo-X, but functional on the neo-Y; and XP–YP, for genes that are pseudogenized on both the neo-X and neo-Y. Error bars showing the 95% confidence interval and statistical significance were calculated based on a Monte-Carlo simulation with 1,000 bootstrap replicates: ***, P < 0.001; *, P < 0.05; NS, P ≥ 0.05. The number of genes analyzed for each category is shown in parentheses.An additional possibility is that the neo-X and neo-Y are still recombining or experienced recent recombination during male meiosis at a certain frequency, although it is believed that males of many Drosophila species do not exhibit recombination during meiosis (John et al. 2016; Charlesworth 2017). If male recombination indeed occurred in the D. miranda lineage, the functional genes on the neo-X that are expressed in the ovary may have moved onto the neo-Y during meiosis. Providing indirect support for this hypothesis, the observed number of shared polymorphisms between the neo-sex chromosomes was much greater than expected under a random mutation model on each chromosome (supplementary table S1, Supplementary Material online). In addition, nucleotide diversity on the neo-Y (π = 6.4 × 10−4) was more than half compared with that on the neo-X (π = 10.7 × 10−4). This Y/X ratio of nucleotide diversity was greater than one-third, an expected ratio at equilibrium derived from the number of the neo-X and neo-Y in a pair of parents. Although Bachtrog (2004) reported that nucleotide diversity on the neo-Y is ∼30 times less than that on the neo-X in D. miranda, this conclusion was largely derived from an analysis of nonhomologous genes from neo-X and neo-Y. In contrast, our estimates are based on the homologous genes, which would be more appropriate for the analysis.Note that meiotic male recombination also seems to have resulted in comparable levels of nucleotide diversity and a large number of shared polymorphism between the neo-X and neo-Y, which emerged independently in D. albomicans (Satomura and Tamura 2016). With our knowledge, however, male recombination in D. pseudoobscura, a closely related species of D. miranda, has not been reported in the long research history since 1930s initiated by Dobzhansky. In addition, we used only two D. miranda strains to estimate nucleotide diversity, which is much smaller than the number of strains used in Bachtrog (2004). Therefore, the proposal of this hypothesis should be taken with cautions. Crossing experiments in D. miranda are apparently necessary to further evaluate this hypothesis.
Global DC Machinery Unlikely Contributes to Gene-by-Gene DC
If the DCC that initiated global DC also contributed to gene-by-gene DC, the XF–YP genes would have most likely been recognized by male-specific lethal 3 (MSL3), a component of the DCC, and subjected to histone acetylation at H4K16 (H4K16ac hereafter) at a higher frequency than the XF–YF genes. However, Zhou et al. (2013) reported that no significant difference was observed between the XF–YF and XF–YP genes in the proportion of neo-X-linked genes bound by MSL3 or H4K16ac. Nevertheless, they also reported that the neo-X-linked genes bound by MSL3 or H4K16ac tended to be up-regulated to a greater extent than the neo-X-linked genes unbound by these proteins (Zhou et al. 2013). Using our RNA-seq data with the ChIP-seq data from Zhou et al. (2013), we also observed the same tendency (supplementary fig. S5, Supplementary Material online). In addition, the variance of RLin for the neo-X-linked genes bound by MSL3 or H4K16ac was significantly smaller than that for unbound neo-X-linked genes (P < 10−180 for each by the F-test). These observations suggest that the molecular mechanism underlying global DC does not contribute substantially to gene-by-gene DC, but broadly increases the expression of neo-X-linked genes and stabilizes the expression levels within a certain range.In this study, we were unable to clarify the molecular basis of gene-by-gene (or localized) DC. Nevertheless, if an auto-regulation or a gene-expression buffering mechanism operates on individual neo-X-linked genes, gene-by-gene DC may be quickly established in the early stage of sex chromosome evolution. Rapidly evolving elements such as satellite DNAs may also contribute to gene-by-gene DC. Indeed, siRNA-directed modification of chromatin structure at the 1.688X satellite loci seems to be an identifier of the male X chromosome, independent from CESs for DC in D. melanogaster (Menon and Meller 2015; Deshpande and Meller 2018). However, whether this type of repetitive elements can function for DC in a gene-by-gene manner is another issue to be clarified.The interplay between young sex chromosomes at the gene expression level has also been reported in a plant species, Silene latifolia (Muyle et al. 2012). According to this study, the expression of X-linked genes increases as the expression of Y-linked homologs decreases in males. Therefore, gene-by-gene or localized DC mechanisms may be present in many organisms with young sex chromosome systems. However, once most of the Y-linked genes are degenerated, global DC would be more effective than gene-by-gene DC. In this way, gene-by-gene DC may disappear as a global DC mechanism is developed. In this sense, the neo-sex chromosome system in D. miranda might be a transient stage from gene-by-gene DC to global DC.In conclusion, our findings support the Ohno’s (1967) statement that “During the course of evolution of this insect (Drosophila), the dosage compensation mechanism must have developed one by one for each individual X-linked gene”, which is crucial to counteract the deleterious effects due to the massive losses of Y-linked genes at the initial stage of sex chromosome evolution.
Materials and Methods
Flies and Sequencing Data
To estimate the nucleotide polymorphism in Drosophila miranda, two strains (14011–0101.16 and 14011–0101.17) were obtained from Drosophila Species Stock Center at UC San Diego (moved to Cornell University, http://blogs.cornell.edu/drosophila/, last accessed July 16, 2018). To remove intrastrain polymorphism as much as possible, full-sib mating was repeated for eight generations for each strain before starting experiments. Genome and transcriptome sequences as well as gene expression data for D. miranda (strain 14011–0101.17), D. pseudoobscura [strain k-s12 (or 14011–0121.94)], and D. obscura (strain 14011–0151.01) were obtained from Nozawa et al. (2016).
DNA Extraction, PCR, and Sequencing
To estimate the nucleotide polymorphism in D. miranda, we extracted total DNA from ∼10 adult females and males of the two strains, separately, using the Boom et al.’s (1990) method with slight modifications. PCR was conducted in a 25 µl reaction mixture in which 50 mM KCl, 10 mM Tris–HCl (pH8.5), 2 mM MgCl2, 0.001% gelatin, 0.2 mM each dNTP, 0.5 µM each primer (forward and reverse), ∼5 ng total DNA, and 625 mU Taq polymerase were included. Primers used are listed in supplementary table S2, Supplementary Material online. PCR amplification was performed as follows: 30 cycles of 94 °C for 30 s, 60 °C for 30 s, and 72 °C for 60 s. The nucleotide sequences of the amplified fragments were determined by using BigDye Terminator v3.1 Cycle Sequencing Kit and 3130xl Genetic Analyzer (Thermo Fisher Scientific, Waltham, USA). Neo-X-derived sequences were determined from female sequences, whereas neo-Y-derived sequences were determined by subtracting neo-X-derived sequences from male sequences by using MEGA7 (Kumar et al. 2016). It should be mentioned that all primers used were designed to amplify both neo-X- and neo-Y-derived sequences. All sequence data generated in this study are deposited to DDBJ under the accession numbers LC363594–LC363749.
Definition of Functional Genes and Pseudogenes
We obtained the information for the classification of functional genes and pseudogenes from Nozawa et al. (2016). In Nozawa et al. (2016) all genes are classified as follows. 1) If a gene was expressed, contained an initiation as well as a stop codon, and its CDS length was equal to or greater than 80% of the average of other members of an orthologous group, the gene was regarded as “functional”. 2) If a gene was expressed, contained an initiation and a stop codon, and its CDS length was <80% of the average of other orthologous members, the gene was regarded as being “disrupted”. 3) If a gene was not expressed and its CDS length was equal to or greater than 80% of the average of other orthologous members, the gene was being “silenced”. 4) If a gene was not expressed and its CDS length was <80% of the average of other orthologous members, the gene was regarded as being “silenced and disrupted”. 5) If a gene was expressed, lacked either one of an initiation or a stop codon, and its flanking 100 nucleotides did not contain any ambiguous site, the gene was being “disrupted”. 6) If a gene was expressed in at least one of the tissues, lacked either one of an initiation or a stop codon, and flanking 100 nucleotides contained at least one ambiguous site (i.e., N), the gene was regarded as being “unclassified”.
Estimation of DC Level
To evaluate the extent of DC, we used RLin developed by Lin et al. (2012). The equation can be written aswhere cFPKM is the corrected number of fragments per kilobase of exon per million mapped fragments, which linearly adjusted an individual FPKM to make the median expression identical to 1. Subscripts tar, com, X, Au(proto-X), and Au denote target species (D. miranda in this study), comparing species (D. pseudoobscura in this study), the X chromosome (the neo-X in this study), an autosome orthologous to the X in comparing species (the chromosome 3 in this study), and autosomes, respectively. The essence of this equation is that we can detect changes in male expression between species due to the emergence of sex chromosomes using expression change of autosomal genes as a control. If there is no DC, the RLin value will be 0.5 because D. miranda males have only one copy of the Muller element C (i.e., the neo-X), whereas males in D. pseudoobscura carry two copies (i.e., the chromosome 3). In contrast, the value is expected to be 1 if DC is perfect. For this analysis, only the orthologs that are expressed in D. pseudoobscura, are present in D. obscura, and have not experienced any interchromosomal translocation in the lineage leading to D. miranda and D. pseudoobscura after splitting from D. obscura were used. It should be mentioned that RLin can be calculated for females as well by using female expression data. All orthologous genes with genomic location, functionality, and expression level are listed in supplementary table S3, Supplementary Material online.
ChIP-Seq Analysis
The lists of genes that are bound by MSL3 or H4K16ac in the male third instar larvae of D. miranda (strain MSH22) were kindly provided by Zhou et al. (2013).
Supplementary Material
Supplementary data are available at Genome Biology and Evolution online.Click here for additional data file.
Authors: R Boom; C J Sol; M M Salimans; C L Jansen; P M Wertheim-van Dillen; J van der Noordaa Journal: J Clin Microbiol Date: 1990-03 Impact factor: 5.948
Authors: Xinxian Deng; Joseph B Hiatt; Di Kim Nguyen; Sevinc Ercan; David Sturgill; LaDeana W Hillier; Felix Schlesinger; Carrie A Davis; Valerie J Reinke; Thomas R Gingeras; Jay Shendure; Robert H Waterston; Brian Oliver; Jason D Lieb; Christine M Disteche Journal: Nat Genet Date: 2011-10-23 Impact factor: 38.330
Authors: Artyom A Alekseyenko; Joshua W K Ho; Shouyong Peng; Marnie Gelbart; Michael Y Tolstorukov; Annette Plachetka; Peter V Kharchenko; Youngsook L Jung; Andrey A Gorchakov; Erica Larschan; Tingting Gu; Aki Minoda; Nicole C Riddle; Yuri B Schwartz; Sarah C R Elgin; Gary H Karpen; Vincenzo Pirrotta; Mitzi I Kuroda; Peter J Park Journal: PLoS Genet Date: 2012-04-26 Impact factor: 5.917