| Literature DB >> 23315381 |
Timothy B Sackton1, Daniel L Hartl.
Abstract
The Drosophila Y chromosome is a degenerated, heterochromatic chromosome with few functional genes. Despite this, natural variation on the Y chromosome in D. melanogaster has substantial trans-acting effects on the regulation of X-linked and autosomal genes. It is not clear, however, whether these genes simply represent a random subset of the genome or whether specific functional properties are associated with susceptibility to regulation by Y-linked variation. Here, we present a meta-analysis of four previously published microarray studies of Y-linked regulatory variation (YRV) in D. melanogaster. We show that YRV genes are far from a random subset of the genome: They are more likely to be in repressive chromatin contexts, be expressed tissue specifically, and vary in expression within and between species than non-YRV genes. Furthermore, YRV genes are more likely to be associated with the nuclear lamina than non-YRV genes and are generally more likely to be close to each other in the nucleus (although not along chromosomes). Taken together, these results suggest that variation on the Y chromosome plays a role in modifying how the genome is distributed across chromatin compartments, either via changes in the distribution of DNA-binding proteins or via changes in the spatial arrangement of the genome in the nucleus.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23315381 PMCID: PMC3595022 DOI: 10.1093/gbe/evt005
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Studies Included in Meta-Analysis, with GEO Information and Other Characteristics
| Study Name | GEO | Description | Reference |
|---|---|---|---|
| BL08 | GSE9457 | Original study reporting YRV: compared five geographically disparate Y chromosomes | |
| BL10 | GSE23612 | YRV in XXY females | |
| SP11 | GSE27695 | YRV in rDNA deletion lines | |
| JZ12 | GSE37068 | YRV in mutation accumulation lines (Harwich) |
FProcedure for defining common and specific classes of YRV genes, based on the combined q value across studies plus the q values of individual studies. No genes with q ≤ 0.05 in more than one study fail to achieve significance in the combined statistic.
Variables Used for Model Selection
| Variable Name | Description | Source | Reference | Final Model? |
|---|---|---|---|---|
| expdivtot.m | Expression divergence across seven | Custom Nimblegen arrays produced and hybridized by Brian Oliver's laboratory | Yes | |
| h2m | Mutational variance in expression in | Yes | ||
| postmei.ratio | Ratio of postmeiotic to max(meiotic, mitotic) expression in spermatogenesis | Yes | ||
| mf.pc1 | Principle component representing male/female expression bias | Calculated from modENCODE and SEBIDA data | This article | Yes |
| tau.avg | Measure of tissue specificity (tau) averaged between adult and larval values | FlyAtlas | This article | Yes |
| FbtrLen | Transcript length | Calculated from Flybase GFF files | Yes | |
| het | Binary variable coding presence in repressive chromatin (black, green, or blue states) | Calculated from the data of | Yes | |
| NumInt | Number of introns | Calculated from FlyBase GFF files | Yes | |
| AveIntLen | Average intron length | Calculated from FlyBase data | No | |
| window11.length | Length of a window of five upstream and five downstream genes surrounding focal gene | Calculated from FlyBase GFF files | This article | No |
| FirstIntLen | Length of first intron | Calculated from FlyBase GFF files | No | |
| exp.pc1 | Principle component representing overall, sex-averaged, whole fly transcription level | Calculated from modENCODE male and female expression, FlyAtlas whole fly expression, and codon bias | This article | No |
| dmsp | Binary variable coding presence in the | Proteomic studies conducted by Tim Karr's laboratory | No | |
| transcript.num | Total number of different transcripts for each gene | Calculated from FlyBase GFF files | This article | No |
| m0w | Omega calculated from PAML model M0 | Twelve genomes’ annotation | No | |
| intergenic.dist | Average distance to nearest gene (upstream or downstream) | Calculated from FlyBase GFF files | This article | No |
| RecomRP | Recombination rate calculated by the RP method | Calculated from FlyBase data | No | |
| gpmb | Genes per megabase | Calculated from FlyBase GFF files | This article | No |
| cellular_loc | A simplified gene ontology cellular location | Calculated from FlyBase ontology files | This article | No |
FThe proportion of genes in the common class that are individually significant in 0, 1, 2, 3, or 4 of the underlying studies. Light bars are for a nominal (not multiple-test corrected) P value ≤ 0.05. Dark gray bars are for a FDR-corrected q value ≤ 0.05.
F(A) Boxplot showing the average Cohen's d statistic across studies for each significance class, where Cohen's d represents the standardized effect size of YRV. Common vs. none, P value < 2 × 10−16 (Mann–Whitney U); specific vs. none, P value < 2 × 10−16 (Mann–Whitney U). (B) Boxplot of the coefficient of variation (CV) of Cohen's d across studies for each significance class. Specific vs. common, P value < 2 × 10−16 (Mann–Whitney U); specific vs. none, P value < 2 × 10−16 (Mann–Whitney U); and common vs. none, P value = 8.31 × 10−14 (Mann–Whitney U).
FRanked AIC support for the 200 best models. Red line is placed at the AIC of the 10th best model.
FModel-averaged importance of each term in the model (table 2), which is defined as the proportion of the 200 best models in which a given term appears. Red line indicates 80% support. Terms with an importance above the red line are included in our final model.
Model Parameters from the Final Model
| Term | β Coefficient |
|---|---|
| expdivtot.m | 0.50137 |
| h2m | 0.90826 |
| postmei.ratio | 2.28173 |
| mf.pc1 | −0.62495 |
| tau.avg | 2.92341 |
| FbtrLen | −0.96739 |
| het | 0.58295 |
| NumInt | −0.13822 |
FBoxplot of Cohen's d averaged across studies for each of the five chromatin states defined by Filion et al. (2010). Black, blue, and green are repressive chromatin; red and yellow are active chromatin. Boxplots with the same letter above them are not significantly different (α = 0.05; italics indicate difference at 0.05 < α < 0.10), based on Tukey HSD P values, which indicate that: BLACK is greater than GREEN (P = 0.005), RED (P = 0.000016), and YELLOW (P < 0.000001); BLUE is greater than GREEN (marginally so, P = 0.07), RED (P = 0.024), and YELLOW (P < 0.000001); GREEN is not significantly different from RED or YELLOW; RED is greater than YELLOW (P = 0.0013).
FLamin-binding ratio on a log2 scale for YRV and non-YRV genes sorted into active and repressive chromatin domains. Box widths are proportional to the share of each YRV class that is in active vs. repressive chromatin: Approximately two-thirds of YRV genes are in repressive chromatin, whereas the opposite is true of non-YRV genes. Although within repressive chromatin, YRV and non-YRV genes are not significantly different (Mann–Whitney U, P = 0.9689), in active chromatin, YRV genes have significantly higher lamin-binding ratios (Mann–Whitney U, P = 0.0002).
Counts of YRV and Non-YRV Genes that Are in Early- or Late-Replicating Regions of the Genome, by Chromatin State
| Active Chromatin | Repressive Chromatin | |||
|---|---|---|---|---|
| Non-YRV | YRV | Non-YRV | YRV | |
| Early replicating (geneRT > 0) | 1,996 | 204 | 521 | 207 |
| Late replicating (geneRT < 0) | 269 | 40 | 623 | 196 |
| Fisher's exact test (active) | Fisher's exact test (repressive) | |||
| Odds ratio = 1.45 | Odds ratio = 0.792 | |||
FBoxplot of AT content of upstream regions adjacent to YRV and non-YRV genes. Mann–Whitney U, P value = 8.44 × 10−5.