| Literature DB >> 21853143 |
Monica T Cooper1, James A Kennison.
Abstract
We have investigated a region of ∼310 kb of genomic DNA within polytene chromosome subdivisions 72A to 72D of Drosophila melanogaster. This region includes 57 predicted protein-coding genes. Seventeen of these genes are in six clusters that appear to have arisen by tandem duplication. Within this region we found 23 complementation groups that are essential for zygotic viability, and we have identified the transcription units for 18 of the 23. We also found a 55 kb region in 72D that is nonessential. Flies deficient for this region are viable and fertile. Within this nonessential region are 48 DNA sequences of 12 to 33 base pairs that are completely conserved among 12 distantly related Drosophila species. These sequences do not have the evolutionary signature of conserved protein-coding DNA sequences, nor do they appear to encode microRNAs, however, the strong selection suggests functions in wild populations that are not apparent in laboratory cultures. This region resembles dispensable gene deserts previously characterized in the mouse genome.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21853143 PMCID: PMC3154481 DOI: 10.1371/journal.pone.0023509
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Complementation groups represented by the mutations within the Df(3L)th102 region of the genome.
| Complementation Group | Number of Alleles | Identified Transcription Unit | Number of Core Amino Acids | Number of Evolutionarily Conserved Amino Acids | Proportion of Conserved Core Amino Acids |
|
| 19 |
| 1633 | 1231 | 75% |
|
| 17 |
| 2142 | 2021 | 94% |
|
| 13 |
| 1092 | 871 | 80% |
|
| 2 |
| 637 | 561 | 88% |
|
| 3 |
| 180 | 177 | 98% |
|
| 8 |
| 671 | 349 | 52% |
|
| 37 |
| 1226 | 948 | 77% |
|
| 11 |
| 438 | 164 | 37% |
|
| 11 |
| 795 | 486 | 61% |
|
| 13 | ||||
|
| 9 | ||||
|
| 10 | ||||
|
| 2 |
| 139 | 121 | 87% |
|
| 3 | ||||
|
| 4 |
| 851 | 467 | 55% |
|
| 2 |
| 629 | 431 | 69% |
|
| 11 |
| 365 | 279 | 76% |
|
| 4 |
| 696 | 206 | 30% |
|
| 3 |
| 240 | 70 | 29% |
|
| 4 |
| 373 | 110 | 29% |
|
| 2 |
| 345 | 35 | 10% |
|
| 1 |
| 133 | 120 | 90% |
|
| 0 |
Core amino acids are those present in all protein isoforms for that gene.
Evolutionarily conserved amino acids are those core amino acids conserved among nine Drosophila species. Because of sequence gaps, we did not include in the analysis the D. yakuba CG5931 gene or the D. persimilis CG5931, DNApol-delta, th, and Mbs genes. The D. mojavensis CG32154 gene was not found by Blat, but was examined by BLAST to identify conserved aa.
DNApol-delta and Arf72A were recovered on the same mutagenized chromosome.
Figure 1Complementation of essential genes with a set of deletions that overlap Df(3L)th102.
The essential genes are listed at the top of the map in blue, with a horizontal bar for each deletion indicating which genes failed to complement that deletion. Deletions with molecularly characterized breakpoints are indicated by red bars. Deletions with no molecular information are indicated by black bars.
Transposon insertion mutants identified by the Drosophila Gene Disruption Project and maintained by the Bloomington Drosophila Stock Center.
| Transposon Insertion Mutants | Complementation Group Affected |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| viable, but males and females poorly fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
|
| viable and fertile |
Figure 2Molecular map of the genomic region deleted in Df(3L)th102.
The approximately 320 kb of genomic DNA (from 3L: 15918k to 16240k, Release 5.23) is broken into three parts (A, B, and C), and is represented by the horizontal black arrows at the top of each part. The annotated transcription units are represented by colored thick horizontal arrows. The essential transcription units are red and orange. The clusters of transcription units encoding related proteins are brown (the cluster in 2A), grey (the cluster in 2B), purple (the cluster in 2B), dark green (the cluster in 2C), light green (the cluster in 2C), and orange (the essential genes CG32155 and CG32154 in 2C). All other transcription units are blue. The two regions that include the five essential genes (72CDc, 72De, 72Df, 72Dg, and 72Di) for which the transcription units have not been identified are indicated by red horizontal brackets below the candidate transcription units. The DNA missing in flies trans-heterozygous for the overlapping deletions Df(3L)Exel6128 and Df(3L)BSC559 is indicated by the horizontal black bar at the bottom of 2C.
Figure 3Molecular map of the ∼80 kb genomic region between CG5151 and CG5018.
The approximately 80 kb of genomic DNA (from 3L: 16140k to 16220k, Release 5.23) is represented by the horizontal black arrow at the top. The predicted transcription units are represented by colored thick horizontal arrows. The essential transcription unit CG5018 is red. The clusters of transcription units predicted to encode related proteins are dark green and light green. All other transcription units are blue. The transposable X-element (shown by the blue bar) is present in the iso-1 strain, but not in Canton S. The highly conserved DNA sequences are represented by the purple bars in the middle. Below each cluster of conserved sequences are the numbers corresponding to Table S1. The DNA sequences missing in flies trans-heterozygous for the overlapping deletions Df(3L)Exel6128 and Df(3L)BSC559, or Df(3L)Exel6128 and Df(3L)st4 are indicated by the horizontal black bars at the bottom. The dotted line at the left of the Df(3L)Exel6128/Df(3L)st4 black bar indicates the 3kb region to which the distal breakpoint of Df(3L)st4 was mapped.