| Literature DB >> 16102176 |
Arnar Palsson1, James Dodgson, Ian Dworkin, Greg Gibson.
Abstract
BACKGROUND: Quantitative differences between individuals stem from a combination of genetic and environmental factors, with the heritable variation being shaped by evolutionary forces. Drosophila wing shape has emerged as an attractive system for genetic dissection of multi-dimensional traits. We utilize several experimental genetic methods to validation of the contribution of several polymorphisms in the Epidermal growth factor receptor (Egfr) gene to wing shape and size, that were previously mapped in populations of Drosophila melanogaster from North Carolina (NC) and California (CA). This re-evaluation utilized different genetic testcrosses to generate heterozygous individuals with a variety of genetic backgrounds as well as sampling of new alleles from Kenyan stocks.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16102176 PMCID: PMC1208880 DOI: 10.1186/1471-2156-6-44
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1Schema of the three experimental crosses. (A) In the round robin (RR), each of 71 inbred lines from NC was crossed to six other lines to produce heterozygous offspring. Six loops of the type shown were used. (B) In the Backcross (BC), each of 76 NC lines were crossed to two phenotypically divergent backgrounds, NC025 and NC144, again resulting in heterozygous offspring. (C) Each of 26 Kenyan second chromosomes extracted into the Samarkand background were crossed to regular Samarkand or Samarkand lines carrying blisteredor Egfrmutations.
Figure 2Shape of the D. melanogaster wing was captured from 9 landmarks (A) and analyzed with TPS software [68]. Veins are labeled both by the Comstock and Needham [69] nomenclature and using developmental genetic terminology (L1–L5). Individual inter-vein regions are designated by the letters A-E. (B – I): Shape differences derived from the four different datasets, namely the inbred (B, F), backcross (C, G), round robin (D, H), and Kenyan (E, I) panels. B-E shows the first principal component for landmarks of the central region of the wing (C1). The region is specified by longitudinal veins L3 and L4 and the placement of the cross veins and the margin. F-I show the shape differences for the seventh principal component for the whole wing data (W7). The whole wing and the central portion are not drawn to scale. Dark lines represent negative values, and gray positive. For whole wing PC 7 (W7) the extremes are at +/- 0.01 units and for central region PC 1 (C1) the values are +/- 0.2.
Figure 3Eigenvalue distribution for the PC's, derived from each of the five datasets individually. (A) Values for the 9 PC's capturing variation in the whole wing, and (B) for the 3 PC's capturing variation for the five landmarks that define the central region of the wing. The decomposition of eigenvalues is comparable for all datasets, with the exception of the Backcross (BC) dataset which deviates qualitatively for the first component in both panels. To illustrate the consistency of shape capture, the principal components for the NC and CA populations were extracted individually for this Figure. All other analyses were conducted on PC's estimated for the CA and NC populations jointly.
Retesting the effects of Egfr SNP's on wing shape and size
| Separately derived a | Jointly derived a | ||||||||||
| SNP | Termb | Trait | Typec | INB | RR | BC | KI | INB | RR | BC | KI |
| C31656T | Gtyp × Sex | Area | N | **** | . | . | . | **** | . | . | . |
| C30200Td | Gtyp | C1 | N | **** | ** | **** | *** | **** | ** | **** | *** |
| T31634C | Gtyp × Sex × Pop | C2 | N | **** | . | . | . | **** | . | . | . |
| T39389C | Gtyp | D1 | S | **** | . | . | . | **** | . | . | . |
| T40722C | Gtyp × Pop | Area | S | **** | . | . | na | **** | . | . | na |
| C30505A | Gtyp | W7 | N | *** | *** | * | ** | *** | *** | ** | * |
a. As described in the Materials and Methods, PC's were calculated for the datasets individually (separately) or for all the data concatenated (jointly).
b. Term denotes the genetic term most significant in the original study. The same term is reported for the repeats, except for the RR where we could only test genotype effects and sites T31634C and T40722C where the genotype terms are reported as population terms are not available.
c. Type indicates the nature of the polymorphism, N: non-coding and S: synonymous.
d. The significance of the T30200C to C1 association is here reported for the data from Palsson and Gibson 2004. After re-genotyping the p-values reduce to 0.062 and 0.061 for the separately and jointly derived data respectively, when analyzed over the NC and CA populations. Note however that the results for RR and BC are for the re-genotyped data. Significance of terms: "." non-significant, "*": 0.05 > p > 0.01, "**":p > 0.001, "***":p > 0.0001, "****":p > 0.00001. P-values are not adjusted to correct for the seventeen new independent tests conducted.
Figure 4Effects of the T30200C polymorphism on C1 in females across experiments and genetic configurations (designated on the X-axis). INB refers to the inbred populations CA and NC, while INB_RR and INB_BC denote the subsets of inbred lines corresponding to the lines used for the recrossing experiments. Likewise, C144 and C25 indicate the estimated effects of the site in backcrosses to line NC144 and NC025 respectively. The last three points show the effects estimates for the three test crosses involving the Kenyan introgressions (KI), namely wildtype Samarkand chromosomes, the blisteredmutant, and the Egfrallele. Each point represents the least square estimate plus or minus one standard error unit for the indicated homozygous genotypes. See Additional Table 2 for corresponding ANOVA's and values for tests with the older genotype data. The PC's were extracted from all datasets jointly to ensure that the axis of variation and units are comparable.
Figure 5Association plots for tests of association between Egfr and shape parameter C1. Each plot shows the negative logarithm of the p-value for the test statistic at each polymorphic SNP from 5' to 3 along the Egfr locus. (A) Association profile from the whole NC inbred panel (N = 121, solid line) and the Kenyan chromosomes (N = 26, broken line). (B) The association profile for the RR experiment (outcrossed, N = 71, broken line) and the subset of NC lines (in inbred condition, N = 71, solid line) used for the outcrossing scheme. (C) Similarly the profiles for the BC panel (backcrossed to NC025 and NC144, N = 79, broken line) and the corresponding NC set (N = 79, solid line). The X scale is broken to indicate gaps between contigs with the gene structure represented below (exons as boxes in three contigs of non-coding sequence). Site T30200C (indicated) is located at the 5' most end of contig 2. Lines corresponding to single site significance levels α = 0.05 (negative log p: 1.30) and more conservative gene wide α = 0.0001 (negative log p: 4.0). Trait values for all lines and crosses come from data that were processed jointly in TPSrelw [68].
Significance of Egfr polymorphisms on central region shape in NC, CA and Kenyan samples
| Effect and significancec | Frequency of rare allele | |||||||
| Site | Significance in NC | Locationa | Typeb | CA | KI | NC | CA | K |
| C6085G | 7.98 × 107 | E1 | R | (OK) ns | (Rev) ns | 15/125 | 13/75 | 4/27 |
| T30200C | 6.11 × 1010 | P2 | N | (Rev) ns | (OK) *** | 26/121 | 21/76 | 8/17 |
| A31442T | 5.49 × 108 | I2 | N | (OK) ns | (OK) ns | 13/92 | 3/24 | 2/20 |
| A36644T | 9.60 × 107 | I2 | N | (Rev) ns | (OK) * | 26/116 | 3/20 | 9/24 |
| A36761C | 5.17 × 106 | I2 | N | na | na | 35/84 | na | na |
| Del37192d | 3.44 × 105 | I2 | N | (OK) ns | (OK) ns | 8/106 | 10/77 | 7/24 |
| A37282G | 9.95 × 106 | I2 | N | nd | nd | 13/106 | 1/77 | 0/23 |
| T39160C | 2.10 × 105 | E4 | S | (OK) ns | (OK) # | 41/128 | 26/76 | 16/35 |
| In39534d | 8.81 × 105 | I5 | N | (OK) ns | (OK) ns | 27/110 | 6/27 | 3/33 |
| C40620Te | 7.37 × 107 | E6 | S | (OK) ns | (OK) # | 46/123 | 19/79 | 7/36 |
| G42242A | 8.88 × 105 | 3' UTR | N | nd | nd | 5/105 | 1/12 | 1/25 |
a. Location within the Egfr locus, where P2 refers to the second promoter and the other indicators to the respective introns (I), exons (E) and the 3'UTR.
b. Type indicates the nature of the polymorphism; R: replacement, N: non-coding and S: synonymous.
c. Effects of the polymorphisms in the same direction as in NC are indicated by "OK" and those in the reverse direction are designated by "Rev", with the significance of the genotypic term indicated. "ns": not significant, "#:" 0.1 > p > 0.05. "*": p > 0.01, "**": p > 0.001, "***": p > 0.0001. P-values are not adjusted to correct for the number of tests conducted. "na": genotypes not available, "nd": not computed because of allele rarity.
d. Del37192 is a one base pair deletion and In39534 a four base insert (AACC repeated).
e. Site C40620T is the same as site 8697 described by Dworkin et al. [21].