| Literature DB >> 23042552 |
Virginia Sahagun, José M Ranz.
Abstract
In both vertebrates and insects, the conservation of local gene order among distantly related species (microsynteny) is higher than expected in the presence of highly conserved noncoding elements (HCNEs). Dense clusters of HCNEs, or HCNE peaks, have been proposed to mediate the regulation of sometimes distantly located genes, which are central for the developmental program of the organism. Thus, the regions encompassing HCNE peaks and their targets in different species would form genomic regulatory domains (GRDs), which should presumably enjoy an enhanced stability over evolutionary time. By leveraging genome rearrangement information from nine Drosophila species and using gene functional and phenotypic information, we performed a comprehensive characterization of the organization of microsynteny blocks harboring HCNE peaks and provide a functional portrait of the putative HCNE targets that reside therein. We found that Drosophila HCNE peaks tend to colocalize more often than expected and to be evenly distributed across chromosomal elements. Putative HCNE peak targets are characterized by a tight association with particular promoter motifs, higher incidence of severe mutant phenotypes, and evidence of a more precise regulation of gene expression during important developmental transitions. As for their physical organization, ~65% of these putative targets are separated by a median of two genes from their nearest HCNE peaks. These observations represent the first functional portrait of this euchromatic fraction of the Drosophila genome with distinctive evolutionary dynamics, which will facilitate future experimental studies on the interactions between HCNE peaks and their targets in a genetically tractable system such as Drosophila melanogaster.Entities:
Mesh:
Year: 2012 PMID: 23042552 PMCID: PMC3490413 DOI: 10.1093/gbe/evs089
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Average Size of the Microsynteny Blocks Encompassing HCNE Peaks
| Size Unit | |||
|---|---|---|---|
| IGAs | Genes | kb | |
| All 123 MB | |||
| Observed | 10.24 | 11.33 | 208.74 |
| Expected | 5.74–8.30 | 6.76–9.58 | 63.89–106.22 |
| Excluding UCRs | |||
| Observed | 8.18 | 9.15 | 186.33 |
| Expected | 4.82–6.74 | 5.76–7.94 | 51.54–89.32 |
Note.—MB, microsynteny block; IGA, independent gene anchor. IGA refers to physically related genes, for example, overlapping or nested, which are counted only once.
aCalculated based on data from von Grotthuss et al. (2010).
bP < 0.01.
cUsing Monte Carlo simulations, 0.5th–99.5th percentiles of the distribution were obtained.
Relationship between Different Functional Features and Protein-Coding Genes Predicted to Have a Particular Type of Core Promoter
| Number of Genes Associated/Not Associated | |||||
|---|---|---|---|---|---|
| Genomic Features | Inr Only and Inr/DPE | TATA/Inr | Motif 1/6 | DRE | |
| Diagnostic GO terms (GO:0006355, GO:0007275) | 91 | 32 | 18 | 35 | 6.0 × 10−5 |
| Severe detrimental mutant phenotype | 81 | 29 | 26 | 43 | 1.6 × 10−2 |
| Expression profiles | |||||
| >80th percentile (E, P) | 172 | 129 | 96 | 125 | 6.9 × 10−1 |
| >95th percentile (E, P) | 165 | 104 | 78 | 99 | 5.7 × 10−3 |
| >80th percentile (E, P) and <40th percentile (L, AM, AF) | 148 | 95 | 75 | 97 | 2.4 × 10−1 |
| >95th percentile (E, P) and <40th percentile (L, AM, AF) | 145 | 86 | 66 | 83 | 8.4 × 10−3 |
Note.—Inr/DPE, initiator followed by downstream promoter element; TATA/Inr, TATA box followed by initiator; Motif 1/6, Motif 1 followed by Motif 6; DRE, DNA replication element binding factor.
aA randomization test of goodness-of-fit was performed for each genomic feature; P ≥ χ2 with 3 degrees of freedom (100,000 simulations). Subsequently, the Benjamini–Hochberg (Benjamini and Hochberg 1995) correction for multiple testing was applied.
bHigher than a given percentile in at least one timepoint during embryogenesis (E), pupa stage (P), or both and, if it is the case, lower than a given percentile in at least one timepoint in at least two stages among larva (L), adult male (AM), and adult female (AF). Genes examined: with Inr only or Inr/DPE promoter types, 185; with TATA/Inr promoter type, 135; with Motif 1/6 promoter type, 101; and with DRE promoter type, 135.
FAssociation between the type of predicted core promoter and diagnostic GO terms that denote involvement in developmental tasks. Unlike protein-coding genes predicted to have core promoters TATA/Inr, Motif 1/6, or DRE, those with core promoters Inr/DPE and Inr alone show a significant association with diagnostic GO terms (table 2 and supplementary table S2, Supplementary Material online).
Association between Genes with and without Diagnostic GO Terms and Functional Features for Protein-Coding Genes Predicted to Have Different Types of Core Promoters
| Genes with/without Diagnostic GO Terms ( | ||||
|---|---|---|---|---|
| Alternative Proxy | Inr Only and Inr/DPE | TATA/Inr | Motif 1/6 | DRE |
| Severe detrimental mutant phenotype | 70 | 22 | 13 | 26 |
| Expression during development | ||||
| >80th percentile (E, P) | 88 | 31 | 18 | 35 |
| >95th percentile (E, P) | 87 | 28 | 18 | 31 |
| >80th percentile (E, P) and <40th percentile (L, AM, AF) | 87 | 28 | 17 | 31 |
| >95th percentile (E, P) and <40th percentile (L, AM, AF) | 86 | 26 | 17 | 29 |
Note.—Inr/DPE, initiator followed by downstream promoter element; TATA/Inr, TATA box followed by initiator; Motif 1/6, Motif 1 followed by Motif 6; DRE, DNA replication element binding factor.
aTwo-tailed Fisher's exact test; the Benjamini–Hochberg correction (Benjamini and Hochberg 1995) was applied to correct for multiple testing. For mutant phenotypes, the total number of genes for the two categories (severe vs. nonsevere) is 81/34 (Inr only and Inr/DPE), 29/18 (TATA/Inr), 43/20 (Motif 1/6), and 26/21 (DRE). For expression data, the total number of genes for the two categories is 91/94 (Inr only and Inr/DPE), 32/103 (TATA/Inr), 18/83 (Motif 1/6), and 35/100 (DRE).
bHigher than a given percentile in at least one timepoint during embryogenesis (E), pupa stage (P), or both and, if it is the case, lower than a given percentile in at least one timepoint in at least two stages among larva (L), adult male (AM), and adult female (AF).
FTwo-way hierarchical clustering of the levels of expression for genes with core promoter type Inr alone or Inr/DPE, and associated with either one or both diagnostic GO terms, across 30 timepoints of the life cycle of D. melanogaster. Red, overexpession; green, underexpression. High levels of expression are common at some point during embryogenesis, and intermediate to high levels of expression are often observed during the larval–pupal transition. A more reduced number of genes show additionally substantial expression in adult males and females.
FCanonical view of the internal organization of microsynteny blocks that harbor GRDs in the genus Drosophila. Below, median distance between the HCNE peak and the closest outermost marker expressed as a fraction of the total length of the microsynteny block. The distance between the HCNE peak and a putative target gene is indicated as the median number of intervening, or bystander, genes.