Literature DB >> 22139016

X-chromosome hyperactivation in mammals via nonlinear relationships between chromatin states and transcription.

Eda Yildirim1, Ruslan I Sadreyev, Stefan F Pinter, Jeannie T Lee.   

Abstract

Dosage compensation in mammals occurs at two levels. In addition to balancing X-chromosome dosage between males and females via X inactivation, mammals also balance dosage of Xs and autosomes. It has been proposed that X-autosome equalization occurs by upregulation of Xa (active X). To investigate mechanism, we perform allele-specific ChIP-seq for chromatin epitopes and analyze RNA-seq data. The hypertranscribed Xa demonstrates enrichment of active chromatin marks relative to autosomes. We derive predictive models for relationships among Pol II occupancy, active mark densities and gene expression, and we suggest that Xa upregulation involves increased transcription initiation and elongation. Enrichment of active marks on Xa does not scale proportionally with transcription output, a disparity explained by nonlinear quantitative dependencies among active histone marks, Pol II occupancy and transcription. Notably, the trend of nonlinear upregulation also occurs on autosomes. Thus, Xa upregulation involves combined increases of active histone marks and Pol II occupancy, without invoking X-specific dependencies between chromatin states and transcription.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22139016      PMCID: PMC3732781          DOI: 10.1038/nsmb.2195

Source DB:  PubMed          Journal:  Nat Struct Mol Biol        ISSN: 1545-9985            Impact factor:   15.369


INTRODUCTION

In many organisms, sex is determined genetically by dimorphic sex chromosomes. In the XY-based system, females are homogametic (XX) and males are heterogametic (XY)[1,2] Current evolutionary theories suggest that sex chromosomes evolved from a pair of autosomal homologues, and acquisition of favorable male genes on the Y led to a suppression of recombination, making gradual loss of Y-chromosome material inevitable. Degeneration of the Y would have resulted in a continual series of sudden changes in gene dosage balance not only between male and female Xs, but also between X and autosomes [1]. Ohno predicted that two types of dosage compensation schemes must therefore exist [2-4]. For mammals, the existence of X-chromosome inactivation (XCI) to silence one of the two X chromosomes in females has been known since 1961 [5,6]. This mechanism equalizes X-chromosome dosage between the sexes and depends on expression of Xist RNA [7-9] coupled with recruitment of PRC2 complex[10-12] . But because XCI creates another level of dosage imbalance, this one between X’s and autosomes of both sexes, a secondary compensatory mechanism must target the active X chromosome (Xa) and double its transcription to restore genome-wide balance. Several recent studies support the idea of X hyperactivation in mammals. Microarray-based gene expression profiling of mammalian tissues showed that X-linked genes are expressed not at half the average autosomal dose (as would be expected if expression came from a single X) but at nearly the same dose as autosomal genes in both sexes, implying that the Xa is upregulated in both males and females [13,14]. These conclusions have been challenged by analysis of RNA-Seq data, which showed that the expression average of X-linked genes was approximately half that of the autosomal average[15] A more recent study, however, indicates that this interpretation was confounded by inclusion of silent genes on the X [16]. Here, we take an alternate approach to address whether and how dosage compensation occurs between X and autosomes by investigating chromatin signatures on a genome-wide scale. We carry out allele-specific chromatin immunoprecipitation with deep sequencing (ChIP-seq) for RNA polymerase II (POL-II) and activate chromatin marks and, through a combined analysis with RNA-seq data, we find that Xa upregulation indeed occurs. The data suggest that Xa upregulation occurs at the level of both transcription initiation and elongation and point to nonlinear quantitative dependencies among active histone marks, POL-II occupancies, and transcription output which are not X-specific and are part of a genome-wide mechanism for quantitative control of gene expression.

RESULTS

Confirmation of Xa upregulation

To address how X-linked transcription compares to autosomal transcription in the female soma and whether the differences, if any, could be explained by chromatin mechanisms, we first compared average gene expression of all X-linked and autosomal genes using previously published RNA-seq data from a mouse female fibroblast cell line [17]. We calculated gene expression levels as FPKM values (fragments per kilobase per million) for non-overlapping RefSeq mouse genes using TopHat and Cufflinks methods, and found that the total FPKM averages of haploid X and autosomal genes differed only by 22%. This conclusion is consistent with the argument that Xa-hyperactivation does not occur [15]. However, the X-chromosome may harbor more silent genes than autosomes. Reasoning that this difference could confound measurements of average transcriptional output, we categorized genes with respect to their expression status (active vs inactive) and CpG content (high vs low) at the promoters (Supplementary Fig. 1a). A natural FPKM cutoff of ~1.0 for actively expressed genes was suggested by the analyses of dependency between gene expression and POL-II densities across the gene body (Supplementary Fig. 2, see Methods). We observed that almost 57% of X-linked genes were of low CpG content at promoters (LCP) with very little to no transcription (inactive). Indeed, when analysis of the RNA-seq data was performed using only expressed genes, the X-chromosome showed 85% higher mean expression than the average haploid autosome set, with 57% difference in median values (Supplementary Table I). X-hypertranscription was apparent among both active HCP and active LCP genes (Supplementary Fig. 1b). Differences between autosomal and X-linked gene populations were highly significant (Supplementary Table I, Mann-Whitney P-values; Supplementary Fig. 1c). Thus, in agreement with previous reports [13,14,16], the mammalian X is dosage compensated with respect to autosomes.

Allelic ChIP-seq reveals enrichment of active histone marks and POL-II on Xa

Next, we asked whether X:A dosage compensation has a chromatin basis. Over the years, genome-wide profiles of histone marks have been generated by ChIP-on-chip or ChIP-seq. In every case, the analysis was performed either in male cell lines [18,19] or in female lines without the allele-specificity and sequencing depth necessary to distinguish Xa from Xi with sufficient resolution[20-22]. Here, we performed high-resolution, allele-specific ChIP-seq in a clonal F1 hybrid female mouse fibroblast line that carries one haploid chromosome set of Mus musculus (mus) origin and a second haploid set of Mus castaneous (cast) origins. We examined active chromatin signatures that are associated with the transcription start site (TSS), including phospho-serine-5 RNA polymerase II (POL-II-S5P) and trimethylated histone H3 at lysine 4 (H3K4me3), both of which are associated with transcription initiation; we also examined signatures associated with transcription elongation through the gene body, including phospho-serine-2 POL-II (POL-II-S2P) and trimethylated H3 at lysine 36 (H3K36me3) [23,24]. By using paired-end sequencing, we could uniquely align 83-93% of the 17-28 million read-pairs to the genome, yielding 16.5-21.9 million uniquely aligned reads per epitope (Supplementary Table II). With ~22 million SNP and ~1 million insertion/deletion differences between mus and cast genomes[25], ~35% of all read-paired could be assigned to specific alleles. Analysis of monoallelically expressed loci, including the Zim1/Peg3 imprinted domain (Fig. 1a and data not shown), confirmed the allele-specific output of the ChIP-seq. As expected, active marks were found on the paternally expressed Peg3 allele (cast) and the maternally expressed Zim1 allele (mus). Elsewhere in the genome, active marks were evenly distributed between autosomal homologues, as expected (Fig. 1b). By contrast, on the X-chromosome, active marks were predominantly on Xcast (Xa), consistent with occurrence of XCI.
Figure 1

Allele-specific ChIP-seq

(a) Profiles for POL-II-S5P, H3K4me3, and H3K36me3 are mapped to M. castaneous (cast) or M. musculus (mus) alleles for two imprinted loci, Zim1 [52] and Peg3 [53] on Chromosome 7. Composite tracks (comp) represent combination of cast, mus and neutral reads. Coverage values are normalized by input and are indicated on the y-axis. (b) X chromosome shows a strong allelic skew in the occupancy of active histone marks and POL-II at the TSS and across the gene body. Barplots show mean composite densities of H3K4me3, H3K36me3, POL-II-S2P and POL-II-S5P on autosomes (A) and X chromosome (X), with proportion of allelic coverage indicated by red (cast), and blue (mus) fractions. Inactive X (Xmus), and active X (Xcast).

To determine whether Xa-hypertranscription is reflected in enrichment of active chromatin marks, we compared coverages on Xcast (Xa) with those on autosomes of M. castaneous origin. When we calculated medians for coverage densities on active genes, Xa showed 31% more POL-II-S5P and 24% more H3K4me3 coverage around the TSS; Xa also showed 20% more POL-II–S2P and 9% H3K36me3 coverage on the gene bodies (Fig. 2; Supplementary Table I). The enrichments were statistically significant in each category (Supplementary Table I), except for POL-II-S2P and H3K36me3 on LCP active genes, presumably due to smaller sample of LCP active genes on the Xa (n=72). Moreover, distributions of density values for all marks followed a similar shape (Fig. 2), suggesting that enrichment of Xa marks was global and could not be attributed to a distinct subset with exceptionally high coverage.
Figure 2

Distributions of coverage densities for POL-II and active histone modifications on X chromosome and autosomes

Coverage density values are shown for H3K4me3 and POL-II-S5P at the TSS and for H3K36me3 and POL-II-S2P across the gene bodies as indicated. Distributions are plotted for actively transcribed (HCP+LCP, HCP and LCP) genes. Black line, autosomal genes; red line, X-linked genes.

Nonlinear dependencies between active chromatin marks and transcription output

Interestingly, the enrichment of POL-II and active histone marks on Xa relative to autosomes was not proportional to the nearly 2-fold transcriptional upregulation on Xa (Fig. 2; Supplementary Table I). Indeed, the degree of epitope enrichment was much less than 2-fold. To explain this disparity, we compared POL-II and histone mark densities against transcription output genome-wide. Plotting POL-II-S2P against FPKM (transcription) values revealed a good correlation between gene expression and the elongating form of POL-II, as expected. It also demonstrated a natural separation of active genes from inactive ones (Supplementary Fig. 2: dotted horizontal line, FPKM cutoff of ~1.0) regardless of gene category (HCP, LCP, all genes). Therefore, in subsequent analyses, we used active genes of all three categories to investigate the relationship between chromatin epitope and transcription output. The relationship between transcription output and chromatin epitopes is shown by scatterplots and point densities for Xa (black line contours) and autosomal (color contours) genes in pairwise comparisons between expression and various chromatin epitopes (Fig. 3 and Supplementary Fig. 3-4). For active HCP M. castaneus alleles, we found pronounced dependencies between gene expression and POL-II-S5P, POL-II-S2P and active histone marks, regardless of whether they were X-linked or autosomal (Fig. 3a-d). Similar strong dependencies were observed between POL-II-S5P and H3K4me3 densities, and between POL-II-S2P and H3K36me3 densities (Fig. 3e-h). Similar trends were also found in the populations of all active (HCP+LCP) and active LCP genes (Supplementary Fig. 3-4). These data showed that Xa and autosomal genes are subject to similar quantitative relationships between active chromatin marks/POL-II and transcription output.
Figure 3

Relationships between levels of gene expression, POL-II, and active histone modifications

M. castaneous alleles of actively expressed autosomal HCP genes are represented as points, with point density shown by colored contour. Black line contour represents active HCP X-linked M. castaneous alleles (Xa). Expression, POL-II, and histone modification levels are positively correlated, the relationships are non-linear, and X-linked genes follow autosomal trends of dependency, albeit with a shift to higher values. (a,b) POL-II at the TSS (a) and across the gene body (b) vs expression (log-log scale). (c) H3K4me3 at the TSS vs expression (log-log scale). (d) H3K36me3 across the gene body vs expression (linear-log scale). (e) H3K4me3 vs POL-II at the TSS (log-log scale). (f) H3K36me3 across the gene body vs POL-II at the TSS (linear-log scale). (g) H3K4me3 at the TSS vs POL-II across the gene body (log-log scale). (h) H3K36me3 vs POL-II across the gene body (linear-log scale).

These dependencies have several universal features that might provide general insight into the mechanism of Xa upregulation. First, the relationships showed monotonic trends: Increases in POL-II or active histone marks densities were accompanied by corresponding average increases in gene expression. Second, the relationship was nonlinear, as an increase in the input variable (e.g., POL-II-S5P or H3K4me3) resulted in a much higher increase in the readout (e.g., expression). The relationship could be linearized by using log-log scale transformation, which suggested a power-law relationship. This type of dependency is consistent with previously published analysis of correlations between expression and histone mark densities at promoters[26]. The only exceptions were H3K36me3 comparisons against all other epitopes. For both Xa and autosomal genes, a log scale transformation better linearized the trend for H3K36me3 (Fig. 3f,h and Supplementary Fig. 3-4), implying that relatively smaller changes in H3K36me3 can produce large changes in transcription output. The overall non-linearity suggests potential collective effects in the regulation of POL-II occupancy, changes in chromatin state, and transcriptional activity. Importantly, the non-linear nature of these dependencies allows for signal amplification: For example, a 20-30% increase in POL-II density corresponds to a larger increase in gene expression (Fig. 3a,b). Third, although Xa and autosomal genes obeyed a common trend of dependency between gene expression and active chromatin/POL-II marks, the Xa scatterplot demonstrated consistent, significant shifts along the autosomal trendline to higher positions on both x- and y-axes (Fig. 3 and Supplementary Fig. 3-4), reflecting the generally higher levels of gene expression, POL-II occupancy, and H3K4me3 and H3K36me3 deposition on Xa relative to autosomes. In comparison to autosomes, Xa genes generally showed better correlations between chromatin epitopes (Fig. 4a), possibly reflecting the generally greater Xa values. Although ranges for these variables on Xa overlapped with autosomes, there was a clear distinction between the two gene populations, consistent with the significant differences in the means/medians of epitope density coverages (Supplementary Table I, Supplementary Fig. 1c and Fig. 1b).
Figure 4

Autosomal relationships between active histone modifications, POL-II, and expression are predictive of X-linked gene expression

(a) Actively expressed X-linked and autosomal genes show similar patterns of correlation between the levels of active marks and expression. Pearson correlation coefficients between the levels of all marks and expression (FPKM) are shown as heatmaps for actively expressed (HCP+LCP, HCP, and LCP) genes. In each plot, autosomal and X chromosome correlations are shown above and below diagonal, respectively. (b) Active X chromosome loci (X) and the corresponding set of autosomal loci (A) show similar non-linear relationship between active marks and expression (blue curve), which produces a large average expression change in response to smaller changes in the mark occupancy (schematic). (c) Scatterplot of X-linked gene expression values predicted from autosome-based full linear model vs observed X-linked expression (log-log scale). Shades of blue indicate point density. Identity line y=x is shown in red.

Taken together, these data enable us to draw several conclusions regarding Xa upregulation. The similar nonlinear trends of relationships for Xa and autosomal genes suggest that Xa upregulation might be explained by chromatin-based mechanisms that are generally used throughout the genome. On both Xa and autosomes, nonlinearity of the dependencies provide a mechanism for signal amplification. For example, Xa upregulation might be explained simply by increased density of active epigenetic marks associated with increased POL-II occupancy, rather than by Xa-specific rules governing the relationship between histone marks, POL-II, and transcription. Our data also indicate that, unlike in fruitflies where enhanced transcription elongation has been proposed to be the primary mode of Xa upregulation[27] the mammalian mechanism displays a strong influence of enhanced transcription initiation, as both POL-II-S5P and H3K4me3 have ~30% increased coverage at the TSS of active Xa genes. At the same time, however, mammalian Xa upregulation is also associated with ~20% increase in POL-II-S2P and ~9% increase in H3K36me3 coverages across gene bodies, implying that elongation is also facilitated on Xa. We therefore propose that Xa upregulation involves both enhanced transcription initiation and elongation that depend on increased trimethylation of H3K4 and H3K36, which amplify gene expression in a nonlinear fashion (Fig. 4b).

Xa hyperactivation is predictable without necessitating Xa-specific principles

We tested this hypothesis by deriving models of gene expression based on autosomal active mark occupancy and applying these models to Xa. The resulting predictions of X-linked gene expression showed accordance with the observed values (Pearson r=0.71)(Fig. 4c). Moreover, autosomal models based on any combination of marks, when applied to an individual autosome, produced the correlation between predicted and observed values similar to that of Xa (Supplementary Table III; See Methods for details). These results suggested that, firstly, quantitative relationships between active histone mark densities, POL-II occupancy, and gene expression had predictive power, consistent with the results of others[26]. Secondly, trends observed in autosomes were remarkably predictive of X-linked gene expression. Together, these conclusions argue that Xa upregulation is governed by principles that are not unique to X but are applicable throughout the genome.

DISCUSSION

We have tested the hypothesis of Xa upregulation using a novel, unbiased approach and asked whether this arm of dosage compensation is based on chromatin mechanisms similar to those observed on autosomes. By allele-specific ChIP-seq and RNA-seq analysis, we conclude that dosage compensation of Xa does occur, in agreement with previously published studies [13,14,16]. The inability of a separate study[15] to detect Xa upregulation stems from inclusion of inactive genes. When we considered all genes regardless of expression status, the average difference was masked by unusually high fraction of silent genes (genes with low-CpG content) on the X. Earlier studies [13,14] also included both active and inactive genes; however, microarrays have a lower dynamic range and it is likely that the analysis heavily favored highly transcribed genes. We then focused on potential mechanisms of Xa upregulation. By analyzing the genome-wide allelic distribution of POL-II and active chromatin marks in the context of gene expression[17] we found that Xa was enriched for POL-II-S5P, POL-II-S2P, H3K4me3, and H3K36me3 relative to the haploid autosome set. Scatterplots (Fig. 3 and Supplementary Fig. 3-4) showed a clear rightward and upward shift of Xa contours relative to autosomal contours, along the autosomal trendline to the higher levels of active marks and expression. However, the degree of POL-II enrichment on Xa and the coverage of active histone marks did not scale proportionally with increased transcription output from Xa. This nonlinearity enables signal amplification with small changes in chromatin structure and POL-II density. An unexpected observation was that the nonlinear dependencies applied to autosomes and X alike, as suggested by all pairwise plots (Fig. 3 and Supplementary Fig. 3-4) and by the quality of predicting Xa genes based on the autosome-based models (Fig. 4c and Supplementary Table III). The similar quantitative relationships suggest that chromatin-based processes governing Xa upregulation are used genome-wide, and that dosage compensation of Xa and autosomes do not require X-specific principles to bring about the ~2-fold upregulation. Importantly, while the non-linear power laws apply to both Xa and autosomes, we do not exclude the possibility that Xa-specific factors are required to initiate the chromatin-based enhancement. Such factors could target Xa to a special compartment or lead to association with nuclear pore factors, as has been suggested for Drosophila melanogaster [28]. Our data imply similarities and differences with Xa upregulation in the fruitfly. In D. melanogaster, the X-chromosome dosage between XX and XY individuals is equalized by hypertranscription of the single male X-chromosome[29-31]. With this mechanism, the fly compensates for differences between X and autosomal gene dosage at the same time it achieves male-to-female X-chromosome balance. This process requires cooperation between MOF histone acetyltransferase[32], MSL complexes[33-35], and the long noncoding RNA, roX1 and roX2 [36,37]. Together, they bring about precise chromatin change leading to ~2-fold upregulation of the male X [27,38-45]. An alternative mechanism proposes less dependency on MSL-driven hyperactivation and more on the genome’s inherent ability to correct for dosage imbalances – via the so-called “inverse effect” [32,46]. This mechanism may also operate in mammalian Xa upregulation. X hyperactivation in the fruitfly has been proposed to be achieved primarily through enhanced transcription elongation [27,38,43-45]. In our system, differences in distributions of POL-II-S5P, POL-II-S2P, H3K4me3, and H3K6me3 argue that Xa upregulation is controlled through transcription initiation, but enhancement of elongation is also a strong possibility. While POL-II-S5P and H3K4me3 coverages increase ~30% at the TSS of Xa genes, POL-II-S2P and H3K36me3 coverages also increase ~20% and ~9%, respectively, across gene bodies. In conclusion, we favor a model in which Xa upregulation is effected by both enhanced transcription initiation and elongation via nonlinear dynamics. Future studies will focus on whether and how Xa-specific factors might be involved in initiating hypertranscription through the chromatin-based mechanisms identified herein.

METHODS

Cell line

To generate EY.T4 clonal hybrid cell line, female mice of Mus musculus (129S1) and male mice of Mus castaneus (CAST/EiJ) origins were crossed, F1 embryos were collected at day 13.5 and mouse embryonic fibroblasts (MEFs) were prepared using female embryos. MEFs were later immortalized by SV-40 T-antigen[47] subcloned by limiting dilution, and the chromosome content of each subclone was screened by DNA FISH using chromosome paints and RNA FISH using Xist RNA probe.. In this clone, Xi was of Mus musculus and Xa was of Mus castaneus origins as determined by allele-specific RT-PCR using primers against Xist locus.

Chromatin immunoprecipitation (ChIP)

ChIP was performed as described[48] using 2×106 cells and 5-10μg of antibodies per reaction. Antibodies used were as follows: H3K36me3 (ab9050, Abcam), H3K4me3 (ab8580, Abcam), POL-II-S5P (ab5131, Abcam) and POL-II-S2P (ab5095, Abcam) and IgG rabbit serum (I8140, Sigma). ChIP [DNA] was quantitated using Quant-iT Picogreen dsDNA Assay kit (Invitrogen). ChIP products for H3K4me3 and H3K36me3 were verified by PCR using primers against c-fos and c-jun genes.

ChIP library preparation and sequencing

Paired-end Solexa ChIP libraries were prepared as described in Illumina ChIP sequencing manual with minor modifications and using NEB Next DNA sample prep reagent Set 1 (E6000S) (NEB). Input DNA was used as a control and for normalization. Modifications to the Illumina protocol were: 1-30 ng of ChIP products were used as template, 2-paired-end adapters were ligated to end-repaired and A-tailed DNA using T4 DNA ligase (NEB) for 2 hours at 16°C, and 2-Phusion polymerase in GC buffer (Finzyme) was used instead of Phusion polymerase in HF buffer. DNA products were purified with QIAquick spin columns (Qiagen). Concentration, size distribution (400-500bp), and purity of libraries were assessed on DNA1000 Bioanalyzer chip (Agilent). Genome Analyzer II (Illumina) was used to perform 2×36 cycles of paired end sequencing.

Calculation of allele-specific ChIP-seq coverage

Paired-end reads (17-28 million per sample, Supplementary Table II) were aligned to two variant strain genomes (CAST/EiJ and 129S1/SvImJ) of the hybrid EY.T4 cell line. The strain genomes were reconstructed from mm9 reference using catalogued SNPs and indels[25]. Resulting allelic differences include ~22 million SNPs and ~1 million indels, approximately one modification per 120 bp. Alignment was performed with Novoalign 2.06 (www.novocraft.com) using default parameters with modifications: -i 300 100 –t 180 –R 10 –h 180 180 –v 180., Uniquely aligned pairs with significant score difference between the two strain alignments (>10) were classified as allele-specific and assigned to the higher-scoring allele variant; otherwise they were classified as neutral. Coverage was calculated separately for allelic tracks (cast, mus) and composite track (allelic and neutral combined) based on fragments defined by paired reads (~400 bp on average), discarding duplicate fragments. Positional coverage was normalized input coverage with a pseudocount of 1 and balanced for potential inequality of total input and experiment coverages: nnorm = [(n+1)/(ni+1)] * [Ni/N], where n, ni, are positional coverages and N, Ni are total genome coverages in experiment and input, respectively. For each promoter region and gene body, allelic skew was estimated as the proportion of cumulative coverages at allelic cast/mus tracks, and corresponding fractions of composite coverage were used in further analyses.

Gene set selection

To ensure unique representation of each gene and exclude alternative transcripts, mm9 RefSeq genes were grouped into single-linkage clusters by gene body overlap and the longest gene in each cluster was chosen as a representative. Promoter regions were defined as segments including +/− 3kb from annotated TSS with high- and low-CpG promoters defined by the presence of CpG islands within 1 kb from the promoter. CpG islands were identified by using EMBOSS 6.3.1[49] on unmasked mm9 genome with default parameters.

Estimation of gene expression levels

Approximately 45 million of 36-bp RNA-seq reads by Yang et al. [17] (NCBI SRA accession SRA010053) were aligned to mm9 genome using TopHat[50] with default parameters, except for using arguments --no-novel-juncs and --min-isoform-fraction 0. The resulting alignments were used to calculate FPKM (fragments per kilobase per million) values using Cufflinks[51] with default parameters, except for setting --min-isoform-fraction 0. To estimate expression from a single allele, autosomal expression values were divided by 2, whereas X-linked expression values were used in full, given the well-documented inactivation of one X chromosome. As a cutoff for active expression, we chose FPKM of 1.0 corresponding to the boundary between two regimes of FPKM correlation to POL-II density on a gene (Fig. S2): the absence of dependency at low values, consistent with readout fluctuations for silent genes, and positive correlation at higher values, consistent with active transcription. Choosing FPKM cutoff of 0 produced similar values.

Linear models of gene expression

Log-transformed mark densities on autosomes (POL-II on promoters and gene bodies, H3K4me3 on promoters) and untransformed H3K36me3 on gene bodies were used for training a linear regression model that predicts logarithm of gene expression. This model (full model) was applied to the coverage densities on X chromosome, and the resulting expression predictions for X-linked genes were compared to the observed values. To compare prediction accuracy for X chromosome and for an individual autosome, we trained and tested 19 models in a chromosome-based cross-validation setting. Specifically, a model (partial model) was trained on the autosomal gene set with one autosome removed and applied separately to the removed autosome and X chromosome. Pearson correlation coefficients between predicted and observed logarithms of expression for X chromosome were compared to the distribution of Pearson R for 19 autosomes. This comparison was made for the models based on all possible combinations of marks as predictors. Logarithmic transformation of mark densities and expression is not defined at zero. To alleviate this problem, we used two approaches. First, we added a pseudocount of 1.0 to each value x so that log (x+1) was calculated. Second, since the fraction of zero values among actively expressed genes was relatively low we tested the removal of genes with zero values from the dataset. Both approaches produced similar results.
  52 in total

Review 1.  Drosophila dosage compensation: a complex voyage to the X chromosome.

Authors:  Marnie E Gelbart; Mitzi I Kuroda
Journal:  Development       Date:  2009-05       Impact factor: 6.868

Review 2.  X chromosome dosage compensation: how mammals keep the balance.

Authors:  Bernhard Payer; Jeannie T Lee
Journal:  Annu Rev Genet       Date:  2008       Impact factor: 16.830

3.  Dissecting direct reprogramming through integrative genomic analysis.

Authors:  Tarjei S Mikkelsen; Jacob Hanna; Xiaolan Zhang; Manching Ku; Marius Wernig; Patrick Schorderet; Bradley E Bernstein; Rudolf Jaenisch; Eric S Lander; Alexander Meissner
Journal:  Nature       Date:  2008-05-28       Impact factor: 49.962

4.  Histone modification levels are predictive for gene expression.

Authors:  Rosa Karlić; Ho-Ryun Chung; Julia Lasserre; Kristian Vlahovicek; Martin Vingron
Journal:  Proc Natl Acad Sci U S A       Date:  2010-02-01       Impact factor: 11.205

5.  Genome-scale DNA methylation maps of pluripotent and differentiated cells.

Authors:  Alexander Meissner; Tarjei S Mikkelsen; Hongcang Gu; Marius Wernig; Jacob Hanna; Andrey Sivachenko; Xiaolan Zhang; Bradley E Bernstein; Chad Nusbaum; David B Jaffe; Andreas Gnirke; Rudolf Jaenisch; Eric S Lander
Journal:  Nature       Date:  2008-07-06       Impact factor: 49.962

6.  X-chromosome-wide profiling of MSL-1 distribution and dosage compensation in Drosophila.

Authors:  Gaëlle Legube; Shannon K McWeeney; Martin J Lercher; Asifa Akhtar
Journal:  Genes Dev       Date:  2006-03-17       Impact factor: 11.361

7.  High-resolution analysis of epigenetic changes associated with X inactivation.

Authors:  Hendrik Marks; Jennifer C Chow; Sergei Denissov; Kees-Jan Françoijs; Neil Brockdorff; Edith Heard; Hendrik G Stunnenberg
Journal:  Genome Res       Date:  2009-07-06       Impact factor: 9.043

8.  Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome.

Authors:  Jing Zhao; Bryan K Sun; Jennifer A Erwin; Ji-Joon Song; Jeannie T Lee
Journal:  Science       Date:  2008-10-31       Impact factor: 47.728

9.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

10.  Dosage compensation in the mouse balances up-regulation and silencing of X-linked genes.

Authors:  Hong Lin; Vibhor Gupta; Matthew D Vermilyea; Francesco Falciani; Jeannie T Lee; Laura P O'Neill; Bryan M Turner
Journal:  PLoS Biol       Date:  2007-12       Impact factor: 8.029

View more
  52 in total

1.  Claims and counterclaims of X-chromosome compensation.

Authors:  James A Birchler
Journal:  Nat Struct Mol Biol       Date:  2012-01-05       Impact factor: 15.369

2.  Expression reduction in mammalian X chromosome evolution refutes Ohno's hypothesis of dosage compensation.

Authors:  Fangqin Lin; Ke Xing; Jianzhi Zhang; Xionglei He
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-02       Impact factor: 11.205

Review 3.  Evolution of vertebrate sex chromosomes and dosage compensation.

Authors:  Jennifer A Marshall Graves
Journal:  Nat Rev Genet       Date:  2015-11-30       Impact factor: 53.242

Review 4.  The evolution of X chromosome inactivation in mammals: the demise of Ohno's hypothesis?

Authors:  Eugénie Pessia; Jan Engelstädter; Gabriel A B Marais
Journal:  Cell Mol Life Sci       Date:  2013-10-31       Impact factor: 9.261

5.  ATRX directs binding of PRC2 to Xist RNA and Polycomb targets.

Authors:  Kavitha Sarma; Catherine Cifuentes-Rojas; Ayla Ergun; Amanda Del Rosario; Yesu Jeon; Forest White; Ruslan Sadreyev; Jeannie T Lee
Journal:  Cell       Date:  2014-11-06       Impact factor: 41.582

6.  SMCHD1 Merges Chromosome Compartments and Assists Formation of Super-Structures on the Inactive X.

Authors:  Chen-Yu Wang; Teddy Jégu; Hsueh-Ping Chu; Hyun Jung Oh; Jeannie T Lee
Journal:  Cell       Date:  2018-06-07       Impact factor: 41.582

Review 7.  New and Xisting regulatory mechanisms of X chromosome inactivation.

Authors:  Yesu Jeon; Kavitha Sarma; Jeannie T Lee
Journal:  Curr Opin Genet Dev       Date:  2012-03-16       Impact factor: 5.578

Review 8.  Insight into lncRNA biology using hybridization capture analyses.

Authors:  Matthew D Simon
Journal:  Biochim Biophys Acta       Date:  2015-09-14

Review 9.  ChIP-seq and beyond: new and improved methodologies to detect and characterize protein-DNA interactions.

Authors:  Terrence S Furey
Journal:  Nat Rev Genet       Date:  2012-10-23       Impact factor: 53.242

10.  Mammalian X upregulation is associated with enhanced transcription initiation, RNA half-life, and MOF-mediated H4K16 acetylation.

Authors:  Xinxian Deng; Joel B Berletch; Wenxiu Ma; Di Kim Nguyen; Joseph B Hiatt; William S Noble; Jay Shendure; Christine M Disteche
Journal:  Dev Cell       Date:  2013-03-21       Impact factor: 12.270

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.