| Literature DB >> 20663190 |
Leanna M Birge1, Marie L Pitts, Richard H Baker, Gerald S Wilkinson.
Abstract
BACKGROUND: Polymorphisms of single amino acid repeats (SARPs) are a potential source of genetic variation for rapidly evolving morphological traits. Here, we characterize variation in and test for an association between SARPs and head shape, a trait under strong sexual selection, in the stalk-eyed fly, Teleopsis dalmanni. Using an annotated expressed sequence tag database developed from eye-antennal imaginal disc tissues in T. dalmanni we identified 98 genes containing nine or more consecutive copies of a single amino acid. We then quantify variation in length and allelic diversity for 32 codon and 15 noncodon repeat regions in a large outbred population. We also assessed the frequency with which amino acid repeats are either gained or lost by identifying sequence similarities between T. dalmanni SARP loci and their orthologs in Drosophila melanogaster. Finally, to identify SARP containing genes that may influence head development we conducted a two-generation association study after assortatively mating for extreme relative eyespan.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20663190 PMCID: PMC3055267 DOI: 10.1186/1471-2148-10-227
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Distribution of single amino-acid repeats containing more than 8 consecutive residues (filled bars) plotted with the relative abundance of each amino acid (open bars) for two fly species. Panel A: Proportion of 98 unique open reading frames containing SARs identified in the Teleopsis dalmanni EST database. Panel B: Proportion of 343 genes containing SARs in regions of Drosophila melanogaster genes homologous to the T. dalmanni EST database.
Heterozygosity and allelic diversity of glutamine repeat loci in T. dalmanni
| Locus (chromosome*) | χ2 | P | N | |||
|---|---|---|---|---|---|---|
| Band4.1 inhibitor LRP (2) | 0.59 | 0.59 | 0.00 | ns | 4 | 163 |
| Bifocal (1) | 0.73 | 0.65 | 2.75 | ns | 4 | 91 |
| Bunched (X) | 0.70 | 0.66 | 0.62 | ns | 3 | 92 |
| Cap-n-collar (2) | 0.48 | 0.50 | 0.08 | ns | 2 | 91 |
| CG10082 (2) | 0.67 | 0.53 | 7.04 | 0.0080 | 4 | 90 |
| CG10321 (2) | 0.63 | 0.60 | 0.38 | ns | 3 | 90 |
| CG10435 (2) | 0.54 | 0.41 | 0.08 | ns | 2 | 91 |
| CG12104 (1) | 0.31 | 0.45 | 7.31 | 0.0069 | 2 | 91 |
| CG17265 | - | - | - | ns | 1 | 94 |
| CG31064 (2) | 0.51 | 0.50 | 0.07 | ns | 5 | 165 |
| CG31224 (2) | 0.30 | 0.65 | 38.03 | < 0.0001 | 4 | 71 |
| CG33692 (1) | 0.61 | 0.60 | 0.16 | ns | 4 | 166 |
| CG34347 (2) | 0.60 | 0.60 | 0.00 | ns | 6 | 91 |
| CG42389 (X) | 0.56 | 0.61 | 1.87 | ns | 4 | 165 |
| CG4409 (2) | 0.35 | 0.41 | 1.13 | ns | 2 | 94 |
| CG8668 (X) | 0.68 | 0.70 | 0.28 | ns | 5 | 159 |
| Corto (2) | 0.74 | 0.67 | 4.13 | 0.042 | 5 | 155 |
| Cryptocephal (X) | 0.57 | 0.59 | 0.22 | ns | 6 | 167 |
| Cyclin-dependent kinase 8 | - | - | - | ns | 1 | 94 |
| Dachshund | - | - | - | ns | 1 | 94 |
| Dorsal switch protein 1 | - | - | - | ns | 1 | 94 |
| E5 (2) | 0.47 | 0.42 | 0.65 | ns | 2 | 86 |
| Ecdysone-induced protein 75B (1) | 0.12 | 0.47 | 43.53 | < 0.0001 | 2 | 92 |
| M-spondin (2) | 0.15 | 0.32 | 12.38 | 0.0054 | 4 | 89 |
| Mastermind (2) | 0.26 | 0.25 | 0.00 | ns | 3 | 90 |
| Mediator complex subunit 26 | - | - | - | ns | 1 | 94 |
| Ptip (1) | 0.50 | 0.53 | 0.40 | ns | 5 | 90 |
| Sine oculis-binding protein | - | - | - | ns | 1 | 94 |
| SRPK (2) | 0.60 | 0.60 | 0.00 | ns | 3 | 75 |
| Tenascin major (1) | 0.28 | 0.30 | 0.15 | ns | 2 | 92 |
| Toutatis (2) | 0.46 | 0.61 | 8.50 | 0.0063 | 7 | 167 |
| 3531953:1 (X) | 0.64 | 0.59 | 1.55 | ns | 5 | 163 |
*chromosome identity corresponds to Johns et al. (2005)
Amino acid sequence variants in the T. dalmanni EST database with length variants obtained by PCR.
| Gene | EST sequences | Repeat length | Repeat sequence | PCR product length (bp) |
|---|---|---|---|---|
| CG12104 | 1 | 14 | 192 | |
| 4 | 13 | 189 | ||
| CG32133 | 2 | 14 | 214 | |
| 1 | 10 | 202 | ||
| CG4409 | 3 | 19 | 214 | |
| 6 | 16 | 205 | ||
| Corto | 1 | 19 | 496 | |
| 1 | 18 | 493 | ||
| Cryptocephal | 2 | 27 | 227 | |
| 2 | 25 | 221 | ||
| 1 | 24 | 218 | ||
| 1 | 23 | 215 | ||
| 4 | 20 | 206 | ||
| 4 | 16 | 194 | ||
| Dorsal switch protein 1 | 1 | 50 | 181 | |
| 1 | 48 | - | ||
| Mastermind | 2 | 26 | 523 | |
| 1 | 25 | 520 | ||
| SRPK | 1 | 30 | 172 | |
| 1 | 26 | 160 | ||
| Tenascin major | 2 | 15 | 206 | |
| 1 | 13 | 200 |
Glutamine content for aligned gene regions in D. melanogaster and T. dalmanni
| Gene name | Glutamine # | Gene name | Glutamine # | ||
|---|---|---|---|---|---|
| Band4.1 inhibitor LRP interactor | 7 | 9 | dikar | 16 | 2 |
| big brain | 13 | 2 | domino | 4 | 16 |
| bunched | 4 | 18 | Dorsal switch protein 1 | 22 | 36 |
| cap-n-collar | 9 | 12 | E2F transcription factor | 9 | 5 |
| CG10082 | 1 | 10* | E5 | 3 | 10 |
| CG10082 | 2 | 19** | E5 | 7 | 10 |
| CG10321 | 3 | 16 | E5 | 0 | 9 |
| CG10321 | 1 | 9 | Ecdysone-induced protein 75B | 9 | 2 |
| CG12104 | 1 | 14 | grainy head | 9 | 0 |
| CG12488 | 9 | 3 | grainy head | 9 | 1 |
| CG14023 | 16 | 1 | GUK-holder | 1 | 9 |
| CG14023 | 12 | 1 | GUK-holder | 1 | 12 |
| CG14213 | 12 | 1 | hairy | 6 | 10 |
| CG14440 | 9 | 2 | headcase | 10 | 0 |
| CG14441 | 16 | 12* | headcase | 20** | 5 |
| CG14441 | 10 | 2 | jim | 17 | 0 |
| CG14650 | 17 | 14 | La related protein | 4 | 9 |
| CG17265 | 1 | 14 | mastermind | 14 | 0 |
| CG17271 | 10 | 10 | mastermind | 17 | 7 |
| CG17446 | 21 | 9* | mastermind | 12* | 21 |
| CG17446 | 12 | 4 | mastermind | 5 | 10 |
| CG2083 | 8 | 9 | mastermind | 12* | 13 |
| CG31064 | 7 | 11 | mastermind | 14 | 14 |
| CG31738 | 0 | 15 | Mediator complex subunit 26 | 2 | 14 |
| CG32772 | 9 | 3 | milton | 0 | 9 |
| CG34114 | 9 | 1 | M-spondin | 0 | 11 |
| CG34114 | 8 | 10 | M-spondin | 6 | 9 |
| CG34347 | 0 | 11 | pipsqueak | 12 | 9* |
| CG4068 | 2 | 9 | Protein associated with topo II related - 1 | 5 | 9 |
| CG4702 | 1 | 9 | ptip | 35 | 1 |
| CG5053 | 12 | 5 | ptip | 7 | 10 |
| CG6619 | 23 | 12* | pumilio | 13 | 11 |
| CG8668 | 2 | 9 | pumilio | 12 | 15 |
| Cirl | 10 | 6 | Regena | 9 | 2 |
| corto | 17 | 0 | reversed polarity | 9 | 0 |
| corto | 8 | 10 | reversed polarity | 9 | 0 |
| corto | 11 | 10 | scribbler | 21 | 22 |
| cryptocephal | 0 | 27 | scribbler | 10 | 5 |
| C-terminal Src kinase | 9 | 4 | Sine oculis-binding protein | 0 | 12 |
| Cyclin-dependent kinase 8 | 27 | 27 | SRPK | 2 | 9 |
| dachshund | 11 | 11 | Tenascin major | 1 | 13 |
| dachshund | 15 | 4 | wallenda | 2 | 9 |
*Region does not contain a run of 9 consecutive glutamines.
**Region contains two polyglutamine repeat regions separated by a single non-glutamine amino acid.
ANOVA on progeny eyespan by parent genotype for autosomal polyglutamine loci
| Female eyespan | Male eyespan | ||||
|---|---|---|---|---|---|
| Locus | F | P | F | P | N |
| Band4.1 inhibitor LRP interactor | 6.01 | 0.0002 | 4.72 | 0.0016 | 98 |
| Cap-n-collar | 0.55 | 0.58 | 0.57 | 0.57 | 89 |
| CG10082 | 1.57 | 0.18 | 1.77 | 0.13 | 88 |
| CG10321 | 0.8 | 0.49 | 1.82 | 0.13 | 88 |
| CG10435 | 0.04 | 0.85 | 0.04 | 0.84 | 89 |
| CG12104 | 3.19 | 0.046 | 2.09 | 0.13 | 89 |
| CG31064 | 0.35 | 0.93 | 0.69 | 0.68 | 98 |
| CG31224 | 1.11 | 0.37 | 1.76 | 0.10 | 73 |
| CG33692 | 2.98 | 0.011 | 3.16 | 0.0074 | 98 |
| CG34347 | 0.66 | 0.78 | 1.01 | 0.45 | 89 |
| CG4409 | 1.43 | 0.24 | 1.38 | 0.26 | 92 |
| Corto | 2.25 | 0.022 | 2.59 | 0.0087 | 96 |
| E5 | 0.85 | 0.43 | 0.90 | 0.41 | 84 |
| Ecdysone-induced protein 75B | 2.71 | 0.07 | 6.13 | 0.0032 | 91 |
| M-spondin | 0.37 | 0.83 | 0.75 | 0.56 | 87 |
| Mastermind | 1.52 | 0.21 | 2.19 | 0.08 | 88 |
| Ptip | 1.25 | 0.28 | 2.84 | 0.0079 | 88 |
| SRPK | 1.95 | 0.06 | 2.54 | 0.015 | 99 |
| Tenascin major | 0.56 | 0.57 | 1.91 | 0.15 | 90 |
| Toutatis | 1.07 | 0.40 | 1.23 | 0.28 | 89 |
ANOVA on progeny eyespan by parent genotype for X-linked polyglutamine loci
| Female eyespan | Male eyespan | |||||
|---|---|---|---|---|---|---|
| Locus | Parent | F | P | F | P | N |
| Bunched | Male | 0.22 | 0.80 | 0.42 | 0.66 | 45 |
| Female | 0.71 | 0.62 | 0.88 | 0.50 | 45 | |
| CG8668 | Male | 0.65 | 0.69 | 0.60 | 0.73 | 48 |
| Female | 0.64 | 0.77 | 1.18 | 0.34 | 46 | |
| CG42389 | Male | 2.04 | 0.12 | 1.87 | 0.15 | 49 |
| Female | 0.81 | 0.55 | 1.83 | 0.13 | 50 | |
| Cryptocephal | Male | 0.82 | 0.54 | 1.28 | 0.29 | 50 |
| Female | 0.33 | 0.92 | 0.71 | 0.65 | 49 | |
| 3531953:1 | Male | 4.18 | 0.011 | 3.03 | 0.039 | 48 |
| Female | 0.99 | 0.46 | 0.97 | 0.47 | 49 | |
Mixed model ANOVA on progeny eye span by progeny polyglutamine genotype and family
| Females | Males | |||||||
|---|---|---|---|---|---|---|---|---|
| Source of variation | df | Var Comp% | F | P | df | Var Comp% | F | P |
| Band4.1 inhibitor LRP interactor (2) | ||||||||
| Family* | 5 | 36.9 | 14.5 | < 0.0001 | 5 | 53.6 | 31.7 | < 0.0001 |
| Genotype* | 4 | 2.0 | 1.7 | 0.16 | 4 | 1.1 | 1.6 | 0.17 |
| Error | 177 | 202 | ||||||
| CG33692 (1) | ||||||||
| Family* | 5 | 37.3 | 13.2 | < 0.0001 | 5 | 38.6 | 18.2 | < 0.0001 |
| Genotype* | 7 | 5.6 | 2.7 | 0.011 | 7 | 6.8 | 3.5 | 0.0013 |
| Error | 168 | 200 | ||||||
| Corto (2) | ||||||||
| Family* | 5 | 33.7 | 10.5 | < 0.0001 | 5 | 46.1 | 18.4 | < 0.0001 |
| Genotype* | 9 | 4.9 | 2.1 | 0.035 | 9 | 1.1 | 1.3 | 0.23 |
| Error | 175 | 200 | ||||||
| Ecdysone-induced protein 75B (1) | ||||||||
| Family* | 4 | 32.9 | 14.3 | < 0.0001 | 4 | 43.3 | 19.2 | < 0.0001 |
| Genotype* | 2 | 5.4 | 4.6 | 0.012 | 2 | 1.2 | 1.8 | 0.18 |
| Error | 134 | 119 | ||||||
| 3531953:1 (X) | ||||||||
| Family* | 4 | 47.7 | 16.2 | < 0.0001 | 4 | 50.6 | 22.6 | < 0.0001 |
| Genotype* | 5 | -1.0 | 0.7 | 0.59 | 5 | 6.5 | 4.5 | 0.0049 |
| Error | 143 | 144 |
*Family and genotype are random effects and body length is a significant (not shown) covariate in all models.
Figure 2Least square adjusted mean eyespan for male (solid) and female (dashed) plotted against genotype for progeny from six families that segregated for allelic variants at CG33692.