| Literature DB >> 19639236 |
Ryan D Bickel1, Wendy S Schackwitz, Len A Pennacchio, Sergey V Nuzhdin, Artyom Kopp.
Abstract
Genes with overlapping expression and function may gradually diverge despite retaining some common functions. To test whether such genes show distinct patterns of molecular evolution within species, we examined sequence variation at the bric à brac (bab) locus of Drosophila melanogaster. This locus is composed of two anciently duplicated paralogs, bab1 and bab2, which are involved in patterning the adult abdomen, legs, and ovaries. We have sequenced the 148 kb genomic region spanning the bab1 and bab2 genes from 94 inbred lines of D. melanogaster sampled from a single location. Two non-coding regions, one in each paralog, appear to be under selection. The strongest evidence of directional selection is found in a region of bab2 that has no known functional role. The other region is located in the bab1 paralog and is known to contain a cis-regulatory element that controls sex-specific abdominal pigmentation. The coding region of bab1 appears to be under stronger functional constraint than the bab2 coding sequences. Thus, the two paralogs are evolving under different selective regimes in the same natural population, illuminating the different evolutionary trajectories of partially redundant duplicate genes.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19639236 PMCID: PMC2722720 DOI: 10.1007/s00239-009-9265-y
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 2.395
Fig. 1The bab genomic region. The bab genes are shown in black, CG32334, CG9205, and the 5′ region of trio in gray. Repeat regions identified using RepeatMasker are marked below the ruler. The sites of two transposable element insertions in the D. melanogaster reference genome are marked with triangles
Fig. 2Linkage disequilibrium in the bab region. For each graph, the mean value of r2 (a) and D′ (b) was calculated for all polymorphisms separated by a given distance and combined into 10 bp bins
Summary statistics of sequence variation in the bab region
| bpa | Singletonsc | θWd | πe | Tajima’s | ||
|---|---|---|---|---|---|---|
| Entire region | 147959 | 6655 | 1382 | 0.90 | 0.87 | −0.13 |
| SNPs | 5535 | 1189 | 0.75 | 0.71 | −0.18 | |
| Indels | 1120 | 193 | 0.15 | 0.16 | 0.15 | |
| Coding | 8692 | 292 | 37 | 0.67 | 0.71 | 0.23 |
| UTRs | 4572 | 156 | 29 | 0.68 | 0.64 | −0.22 |
| 5′-UTRs | 857 | 26 | 5 | 0.61 | 0.58 | −0.17 |
| 3′-UTRs | 3715 | 130 | 24 | 0.70 | 0.65 | −0.23 |
| Introns | 75870 | 3366 | 746 | 0.89 | 0.84 | −0.19 |
| Intergenic | 58825 | 2859 | 573 | 0.97 | 0.95 | −0.07 |
aNumber of base pairs represented in the sample
bNumber of polymorphisms identified
cThe number of polymorphisms that contain a single observation of the alternative allele
dWaterson’s Theta per 100 bp
eThe average pairwise diversity per 100 bp
Fig. 3Sliding window analysis of sequence variation in the bab region. All analyses are in 10 kb windows offset by 2 kb. a Three estimates of θ including π (solid line), θW (dashed line), and θH (dotted line). b Tajima’s D. c Fay and Wu’s H statistic. d Fu and Li’s D (dashed line) and F (solid line) statistics
Summary statistics comparing the W1 and W3 sequence samples across the bab region
| bpa | Singletonsc | θWd | πe | Tajima’s | ||
|---|---|---|---|---|---|---|
| W1 | 147959 | 5323 | 1347 | 0.90 | 0.89 | −0.04 |
| W3 | 147959 | 5845 | 1384 | 0.88 | 0.87 | −0.06 |
aNumber of base pairs represented in the sample
bNumber of polymorphisms identified
cThe number of polymorphisms that contain a single observation of the alternative allele
dWaterson’s Theta per 100 bp
eThe average pairwise diversity per 100 bp
Fig. 4Comparison of the W1 and W3 collections. All analyses are in 10 kb windows offset by 2 kb. a Tajima’s D statistic showing pooled collections (solid line), W1 alone (dashed line) and W3 alone (dotted line). b Population differentiation (Fst) between the W1 and W3 samples
McDonald–Kreitman test for genes in the bab region
| Gene | Codons | Replacement (NS) | Silent ( | |||
|---|---|---|---|---|---|---|
| Polymorphic | Fixed | Polymorphic | Fixed | |||
| CG9205 | 199 | 0 | 0 | 10 | 7 | 1 |
| bab1 | 791 | 15 (7) | 1 | 56 (3) | 28 | 0.034* |
| CG32334 | 135 | 5 | 1 | 6 | 0 | 1 |
| bab2 | 828 | 13 (4) | 6 | 79 (9) | 30 | 0.784 |
aP values are the results of a two-tailed Fisher’s exact test. Values below P = 0.05 are indicated