| Literature DB >> 16872532 |
Stuart J Macdonald1, Anthony D Long.
Abstract
BACKGROUND: A primary objective of comparative genomics is to identify genomic elements of functional significance that contribute to phenotypic diversity. Complex changes in genome structure (insertions, duplications, rearrangements, translocations) may be widespread, and have important effects on organismal diversity. Any survey of genomic variation is incomplete without an assessment of structural changes.Entities:
Mesh:
Year: 2006 PMID: 16872532 PMCID: PMC1779558 DOI: 10.1186/gb-2006-7-7-R67
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Distribution of fine-scale structural features across chromosome arms
| Number of orthologous pairs | |||
| Chromosome arm* | Tested† | Harboring microinversions‡ | Harboring intragenic indels‡ |
| 880 | 24 (0.111) | 12 (0.889) | |
| 997 | 34 (0.003) | 19 (0.294) | |
| 1,120 | 7 (<0.001) | 15 (0.805) | |
| 1,177 | 21 (0.752) | 22 (0.279) | |
| 1,560 | 26 (0.464) | 18 (0.297) | |
| All 5 major arms | 5,734 | 112 | 86 |
*The chromosome arms are given the D. melanogaster designations, X, 2L, 2R, 3L, and 3R. These arms are known to be orthologous to D. pseudoobscura arms, XL, 4, 3, XR, and 2, respectively [58]. †The number of conserved orthologs residing on each arm. ‡Values in parentheses are P values from a two-sided Binomial test of whether the number of event-harboring orthologs per arm differs from expectation. For each test, the number of trials equals the number of conserved orthologs per arm, the number of successes equals the number of event-harboring genes per arm, and the probability of success is equal to the total number of event-harboring genes detected divided by the total number of conserved orthologs tested (5,738).
Figure 1Sequence similarity between Drosophila melanogaster (D. mel) and D. pseudoobscura (D. pse) for the Sox21b (CG32139) gene. Top panel: sliding-window BLAST profile. We stepped through D. melanogaster Sox21b gene in 15 bp increments, and at each position BLASTed a 31 bp segment against the D. pseudoobscura ortholog. Each line represents a BLAST hit with a score above 45, the endpoints show the position of the hit in each genome, and the color of the line represents the orientation of the hit (black = same sequence orientation in each genome, red = different orientations in each genome). Central panel: structure of the Sox21b gene in D. melanogaster. Filled boxes represent exons, and open boxes represent untranslated regions (UTRs). Bottom panel: VISTA plot. The appropriate region of the D. melanogaster-D. pseudoobscura genome alignment was downloaded from the VISTA Browser [44]. We stepped through the alignment in 5 bp increments, and for each 501 bp window calculated the percentage of identical nucleotides between the sequences. The plot is shown relative to the D. melanogaster sequence, and represents a smoothed curve through the data using the ksmooth function in the statistical programming language R [49]. Areas under the curve are painted if they show >70% nucleotide conservation (dark blue = within an exon, light blue = within a UTR, pink = intronic and >100 bp in size).
Figure 2Positions of the 112 microinversion-harboring genes in the D. melanogaster genome. Using data from release 4.2.1 of the D. melanogaster genome assembly, the physical position of each of the 112 microinversion-harboring genes is mapped onto the D. melanogaster chromosomes. The midpoint of each gene is used to map to chromosome. The centromeres for chromosomes 2 and 3 are represented by filled black circles, and the positions of microinversion-harboring genes are indicated by vertical blue lines.
Figure 3GC content across microinversion breakpoints and conserved sequence blocks. Top panel: 143 Drosophila melanogaster-D. pseudoobscura microinversions. For each microinversion breakpoint we extracted 200 bp flanking the breakpoint and 20 bp internal to the inversion as a contiguous section (we examined just 20 bp internal to each inversion breakpoint as the minimum inversion size was 40 bp). For each species, across all sequences for a given inversion breakpoint, we calculated GC content for all overlapping 5 bp windows. Each point in the plot represents the mean GC content for a single window. Vertical dashed lines indicate the inversion breakpoints. Note that the distance between these lines is variable across inversion events. Bottom panel: 774 Drosophila melanogaster-D. pseudoobscura conserved non-coding blocks. Using sliding-window BLAST data we identified all blocks of conserved non-coding sequence from the 93 genes harboring intronic microinversions (see Materials and methods for details). Sequence data were extracted from in/around the conserved blocks and analyzed as described for the microinversion data.
Figure 4Phylogenetic distribution of fifteen microinversion events. For 15 microinversions distinguishing the genomes of D. melanogaster and D. pseudoobscura, we examined orthologous regions from 10 other Drosophila species to determine whether they harbor the standard (St; D. melanogaster-like) or inverted (In; D. pseudoobscura-like) arrangement. Some species could not be reliably shown to have either arrangement (shown with a dash). A consensus phylogeny of the 12 species is provided. The microinversion events are grouped according to phylogenetic position, and mapped onto the consensus phylogeny.